Name: Perceptual Evaluation of a Mix Presentation for Immersive Audio with IAMF
Start: 2025-05-23T11:00:00+0200
End: 2025-05-23T11:20:00+0200

Friday May 23, 2025 11:00am - 11:20am CEST

Immersive audio mix presentations involve transmitting and rendering several audio elements simultaneously. This enables next-generation applications, such as personalized playback. Using immersive loudspeaker and headphone MUSHRA tests, we investigate rate vs. quality for a typical mix presentation use case of a foreground stereo element, plus a background Ambisonics scene. For coding, we use Immersive Audio Model and Formats, a recently proposed system for Next-Generation Audio. Excellent quality is achieved at 384 kbit/s, even with reasonable amount of personalization. We also propose a framework for content-aware analysis that can significantly reduce the bitrate even when using underlying legacy audio coding instances.

Speakers

Carlos Tejeda Ocampo

Samsung Research Tijuana

Toni Hirvonen

Ema Souza Blanes

Mahmoud Namazi

Jan Skoglund

Google

Jan Skoglund leads a team at Google in San Francisco, CA, developing speech and audio signal processing components for capture, real-time communication, storage, and rendering. These components have been deployed in Google software products such as Meet and hardware products such... Read More →

Friday May 23, 2025 11:00am - 11:20am CEST
C1 ATM Studio Warsaw, Poland

Perception & Listening Tests

Presentation Type Paper Presentation

AES Europe 2025

Carlos Tejeda Ocampo

Toni Hirvonen

Ema Souza Blanes

Mahmoud Namazi

Jan Skoglund

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!