Write a script that would set the depth based on the distance from the camera to the target, and you solve the problem neatly.
You can get 2.7 like behaviour back, but then you have to manage Z for everything, and you have to have multiple panels -- one per speech bubble. So your drawcall saving won't work as each panel is a new set of draw calls.