Wwise will create two Audio Objects. You will still get a single decoding (e.g. Vorbis) and a single set of A-M effects, but two Audio Objects will be spawned at the point of being mixed into busses. The same applies for Spatial Audio features like diffraction, where each virtual emitter will become a separate Audio Object.