Loading…
AES Show 2024 NY has ended
Exhibits+ badges provide access to the ADAM Audio Immersive Room, the Genelec Immersive Room, Tech Tours, and the presentations on the Main Stage.

All Access badges provide access to all content in the Program (Tech Tours still require registration)

View the Exhibit Floor Plan.
Wednesday October 9, 2024 9:00am - 10:30am EDT
The aim of this workshop is to provide participants with hands-on expertise in utilizing machine learning for audio quality modeling.

This workshop is a follow-up from the initial edition presented at the 156th AES Convention. The workshop expands on its first edition by including the participation of new panelists working at cutting-edge technology with multimedia industry leaders. Particularly, very recent advancements on deep learning for auditory perception will be presented, as well as a new open audio quality dataset for informing auditory models in audio quality metrics.

Machine learning techniques have been instrumental in understanding audio quality perception, revealing hidden relationships in subject response data to enhance device and algorithm development. Moreover, accurate quality models can be used in AI systems to predict audio quality -- a critical part of customer experience -- in situations where using human subjects is costly (e.g., in day-to-day product development) or impractical such as in audio transmission network monitoring and informing deep learning audio algorithms.

The complex nature of quality perception requires users and developers of these models to possess specific domain knowledge that extends beyond the general machine learning set of skills. This expertise includes experimental design in the domain of subjective audio quality assessment, data collection, data augmentation and filtering, in addition to model design and cross-validation.

The workshop aims to shed light on historical approaches to addressing these challenges by marrying aspects of machine learning and auditory perception knowledge. Furthermore, it will provide insights into state-of-the-art techniques and current related research topics in data-driven quality perception modelling.

The main goal of the workshop is to offer its attendees —- whether users or developers —- the necessary tools to assess the suitability of existing ML-based quality models for their use case.
Speakers
avatar for Pablo Delgado

Pablo Delgado

Fraunhofer IIS
Pablo Delgado is part of the scientific staff of the Advanced Audio Research Group at the Fraunhofer Institute for Integrated Circuits (IIS) in Erlangen, Germany. He specializes in psychoacoustics applied to audio and speech coding, as well as machine learning applications in audio... Read More →
avatar for Jan Skoglund

Jan Skoglund

Google
Jan Skoglund leads a team at Google in San Francisco, CA, developing speech and audio signal processing components for capture, real-time communication, storage, and rendering. These components have been deployed in Google software products such as Meet and hardware products such... Read More →
avatar for Phill Williams

Phill Williams

Audio Algorithms, Netflix
Phill is member of Netflix's Audio Algorithms team, working on all aspects of the audio delivery toolchain to get the best possible sound to everybody who watches Netflix, all over the world, all of the time.Prior to working at Netflix, Phill worked at Dolby Laboratories, as a contributor... Read More →
avatar for Sascha Dick

Sascha Dick

Sascha Dick received his Dipl.-Ing. degree in Information and Communication Technologies from the Friedrich Alexander University (FAU) of Erlangen-Nuremberg, Germany in 2011 with a thesis on an improved psychoacoustic model for spatial audio coding, and joined the Fraunhofer Institute... Read More →
Wednesday October 9, 2024 9:00am - 10:30am EDT
1E16

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link