Hi Yuancheng,
They do. Listeners are defined in the sound engine API by three vectors expressed in Cartesian coordinates: position, front direction and top direction. The azimuth and elevation of an emitter-listener pair are derived from these three vectors, plus the vector (also in Cartesian coordinates) describing the game object's position.
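To make the derivation concrete, here is a minimal sketch in Python of how the two angles could be computed from those four vectors. It is not the sound engine's actual code: the function name `azimuth_elevation`, the sign conventions (positive azimuth to the listener's right, positive elevation upward) and the choice of `right = top x front` are assumptions made for illustration, and the front and top vectors are assumed to be unit length and perpendicular to each other.

```python
import math

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def azimuth_elevation(listener_pos, front, top, emitter_pos):
    """Return (azimuth, elevation) in degrees of the emitter as seen by the listener.

    Hypothetical helper: assumes `front` and `top` are orthonormal and that
    the listener's right axis is top x front.
    """
    right = _cross(top, front)                # assumed right-hand axis of the listener
    d = _sub(emitter_pos, listener_pos)       # listener-to-emitter vector
    # Express that vector in the listener's own frame.
    x = _dot(d, right)
    y = _dot(d, top)
    z = _dot(d, front)
    azimuth = math.atan2(x, z)                    # angle in the horizontal plane
    elevation = math.atan2(y, math.hypot(x, z))   # angle above that plane
    return math.degrees(azimuth), math.degrees(elevation)
```

For example, with the listener at the origin, front along +Z and top along +Y, an emitter at (1, 0, 0) sits directly to the right under these conventions: `azimuth_elevation((0, 0, 0), (0, 0, 1), (0, 1, 0), (1, 0, 0))` gives an azimuth of 90 degrees and an elevation of 0.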
Regards,
Xavier.