A spatial audio Immersive Voice and Audio Services (IVAS) codec for automatic language translation receives a primary audio track (eg. a speech + spatial metadata/ambisonics transport signal 302) and a secondary audio track 301 based on the primary track (eg. speech in a second language from a Real Time Language Translator RTLT) and renders both tracks using spatial audio decoding 307. The RTLT may be local, external or network-based.
展开▼