AES Show 2024 NY: Full Schedule

Exhibits+ badges provide access to the ADAM Audio Immersive Room , the Genelec Immersive Room, Tech Tours, and the presentations on the Main Stage .

All Access badges provide access to all content in the Program (Tech Tours still require registration)

View the Exhibit Floor Plan.

arrow_back View All Dates

9:30am EDT

Physiological measurement of the arousing effect of bass amplification in music

Tuesday October 8, 2024 9:30am - 10:00am EDT

1E04

Music's amazing ability to evoke emotions has been the focus of various scientific studies, with researchers testing how different musical structures or interpretations impacted the emotions induced in the listener. However, in the context of amplified music, little is known about the influence of the sound reinforcement system. In this study, we investigate whether the amount of low-frequency amplification produced by a sound system impacts the listener's arousal. We organized two listening experiments whereby we measured the skin conductance of the participants while they were listening to music excerpts with different levels of low-frequency amplification. Our results indicate that an increase in the level of bass is correlated with a small but measurable rise in electrodermal activity, which is correlated with arousal. In addition this effect seems to depend on the nature of the music.

Moderators

Brett Leonard

Director of Music Technology Programs, University of Indianapolis

Speakers

Nicolas Epain

Application Research Engineer, L-Acoustics

Authors

Nicolas Epain

Application Research Engineer, L-Acoustics

Etienne Corteel

Luc Arnal

Mérové Wallerich

Thomas Mouterde

Field application research engineer, L-Acoustics

Thomas Mouterde is a field application research engineer at L-Acoustics, a French manufacturer of loudspeakers, amplifiers, and signal processing devices. He is a member of the “Education and Scientific Outreach” department that aims at developing the education program of the... Read More →

Tuesday October 8, 2024 9:30am - 10:00am EDT
1E04

Recording and Production, Paper Lecture

10:00am EDT

Exploring trends in audio mixes and masters: Insights from a dataset analysis

Tuesday October 8, 2024 10:00am - 10:30am EDT

1E04

We present an analysis of a dataset of audio metrics and aesthetic considerations about mixes and masters provided by the web platform MixCheck studio.The platform is designed for educational purposes, primarily targeting amateur music producers, and aimed at analysing their recordings prior to them being released. The analysis focuses on the following data points: integrated loudness, mono compatibility, presence of clipping and phase issues, compression and tonal profile across 30 user-specified genres. Both mixed (mixes) and mastered audio (masters)are included in the analysis, where mixes refer to the initial combination and balance of individual tracks, and masters refer to the final refined version optimized for distribution. Results show that loudness-related issues along with dynamics issues are the most prevalent, particularly in mastered audio. However mastered audio presents better results in compression than just mixed audio. Additionally, results show that mastered audio has a lower percentage of stereo field and phase issues.

Moderators

Brett Leonard

Director of Music Technology Programs, University of Indianapolis

Speakers

David Ronan

CEO, RoEx

CEO

Authors

David Ronan

CEO, RoEx

CEO

Angeliki Mourgela

Elio Quinton

Joshua Reiss

Spyridon Bissas

Tuesday October 8, 2024 10:00am - 10:30am EDT
1E04

Recording and Production, Paper Lecture

10:30am EDT

Audience Effect in the Low-Frequency Range, Part 2: Impact on Time Alignment of Loudspeaker Systems

Tuesday October 8, 2024 10:30am - 11:00am EDT

1E04

A sound reinforcement system typically combines a full-range system with a subwoofer system to deliver a consistent frequency bandwidth. The two systems must be time-aligned, which is usually done without an audience. This paper investigates the impact of the audience on the time alignment of loudspeaker systems at low frequencies. The study demonstrates, through on-site measurements and simulations, that the audience significantly affects sound propagation. The research highlights the greater phase shift observed with ground-stacked subwoofers compared to flown systems due to the audience’s presence, requiring adjustments of the system time alignment with the audience when flown and ground-stacked sources are used together. Moreover, in this case, the results demonstrate the lower quality of the summation with the audience even with the alignment adjustment. Lastly, recommendations for system design and calibration are proposed.

Moderators

Brett Leonard

Director of Music Technology Programs, University of Indianapolis

Speakers

Thomas Mouterde

Field application research engineer, L-Acoustics

Authors

Thomas Mouterde

Field application research engineer, L-Acoustics

Etienne Corteel

Nicolas Epain

Application Research Engineer, L-Acoustics

Tuesday October 8, 2024 10:30am - 11:00am EDT
1E04

Recording and Production, Paper Lecture

11:00am EDT

Part of the Band: Virtual Acoustic Space as a Participant in Musical Performance

Tuesday October 8, 2024 11:00am - 11:30am EDT

1E04

We detail a real-time application of active acoustics used to create a shared virtual environment over a closed audio network as a research-creation project exploring the concept of room participation in musical performance. As part of a concert given in the Immersive Media Lab at McGill University, musicians and audience members were located in a virtual acoustic environment while a second audience was located in an adjacent but acoustically isolated space on the same audio network. Overall, the blending of computer-generated and acoustic sources created a specific use case for virtual acoustics while the immersive capture and distribution method examined an avenue for producing a real-time shared experience. Future work in this area includes audio networks with multiple virtual acoustic environments.

Moderators

Brett Leonard

Director of Music Technology Programs, University of Indianapolis

Speakers

Kathleen Ying-Ying Zhang

PhD Candidate, McGill University

YIng-Ying Zhang is a music technology researcher and sound engineer. She is currently a PhD candidate at McGill University in the Sound Recording program where her research focuses on musician-centered virtual acoustic applications in recording environments. She received her Masters... Read More →

Authors

Kathleen Ying-Ying Zhang

PhD Candidate, McGill University

Michail Oikonomidis

Mihai-Vlad Baran

McGill University

Richard King

Professor, McGill University

Richard King is an Educator, Researcher, and a Grammy Award winning recording engineer. Richard has garnered Grammy Awards in various fields including Best Engineered Album in both the Classical and Non-Classical categories. Richard is an Associate Professor at the Schulich School... Read More →

Wieslaw Woszczyk

Tuesday October 8, 2024 11:00am - 11:30am EDT
1E04

Recording and Production, Paper Lecture

11:30am EDT

INFLUENCE OF RECORDING TECHNIQUE AND ENSEMBLE SIZE ON APPARENT SOURCE WIDTH

Tuesday October 8, 2024 11:30am - 12:00pm EDT

1E04

The impression of listeners to aurally “see” the size of a performing entity is crucial to the success of both a concert hall and a reproduced sound field. Previous studies have looked at how different concert halls with different lateral reflections affect apparent source width. Yet, the perceptual effects of different source distributions with different recording techniques on apparent source width are not well understood. This study explores how listeners perceive the width of an orchestra by using four stereo and one binaural recording techniques and three wave field synthesis ensemble settings. Subjective experiments were conducted using stereo loudspeakers and headphone to play back the recording clips asking the listeners to rate the perceived wideness of the sound source. Results show that recording techniques greatly influence how wide an orchestra is perceived. The primary mechanism used to judge auditory spatial impression differs between stereo loudspeaker and headphone listening. When western classical symphony is recorded and reproduced by two-channel stereophony, the changes in instrument positions in terms of increasing or reducing the physical source width do not lead to an obvious increase or reduction on the spatial impression of the performing entity.

Moderators

Brett Leonard

Director of Music Technology Programs, University of Indianapolis

Speakers

Jonas Braasch

Authors

Jonas Braasch

Renzhi Guo

Tuesday October 8, 2024 11:30am - 12:00pm EDT
1E04

Recording and Production, Paper Lecture

2:00pm EDT

Fourier Paradoxes

Tuesday October 8, 2024 2:00pm - 2:30pm EDT

1E04

Fourier theory is quite ubiquitous in modern audio signal processing. However, this framework is often at odds with our intuitions behind audio signals. Strictly speaking, Fourier theory is ideal to analyze periodic behaviors but when periodicities change across time it is easy to misinterpret its results. Of course, we have developed strategies around it like the Short Time Fourier Transform, yet again many times our interpretations of it falls beyond what the theory really says. This paper pushes the exact theoretical description showing examples where our interpretation of the data is incorrect. Furthermore, it shows specific instances where we incorrectly take decisions based on such paradoxical framework.

Moderators

Rob Maher

Professor, Montana State University

Audio digital signal processing, audio forensics, music analysis and synthesis.

Speakers

Juan Sierra

NYU

Currently, I am a PhD Candidate in Music Technology at NYU and am currently based in NYUAD as part of the Global Fellowship program. As a professional musician, my expertise lies in Audio Engineering, and I hold a master's degree in Music, Science, and Technology from the prestigious... Read More →

Authors

Juan Sierra

NYU

Tuesday October 8, 2024 2:00pm - 2:30pm EDT
1E04

Signal Processing, Paper Lecture

2:30pm EDT

Nonlinear distortion in analog modeled DSP plugins in consequence of recording levels

Tuesday October 8, 2024 2:30pm - 3:00pm EDT

1E04

The nominal audio level is where developers of professional analog equipment design their units to have an optimal performance. Audio levels above the nominal level will at some point lead to increased harmonic distortion and eventually clipping. DSP plugins emulating such nonlinear behavior must – in the same manner as analog equipment – align to a nominal level that is simulated within the digital environment. A listening test was tailored to investigate if, or to which extent, misalignments in the audio levels compared to the simulated nominal level in analog-modelled DSP plugins are audible, thus affecting the outcome, depending on which level you choose to record at. The results of this study indicate that harmonic distortion in analog-modeled DSP plugins may become audible as the recording level increases. However, for the plugins included in this study, the immediate consequence of the harmonics added is not critical and, in most cases, not noticed by the listener.

Moderators

Rob Maher

Professor, Montana State University

Audio digital signal processing, audio forensics, music analysis and synthesis.

Speakers

Tore Teigland

Professor, Kristiania University College

Authors

Tore Teigland

Professor, Kristiania University College

Tuesday October 8, 2024 2:30pm - 3:00pm EDT
1E04

Signal Processing, Paper Lecture

3:00pm EDT

A Survey of Methods for the Discretization of Phonograph Record Playback Filters

Tuesday October 8, 2024 3:00pm - 3:30pm EDT

1E04

Since the inception of electrical recording for phonograph records in 1924, records have been intentionally cut with a non-uniform frequency response to maximize the information density on a disc and to improve the signal-to-noise ratio. To reproduce a nominally flat signal within the available bandwidth, the effects of this cutting curve must be undone by applying an inverse curve on playback. Until 1953, with the introduction of what has become known as the RIAA curve, the playback curve required for any particular disc could vary by record company and over time. As a consequence, anyone seeking to hear or restore the information on a disc must have access to equipment that is capable of implementing multiple playback equalizations. This correction may be accomplished with either analog hardware or digital processing. The digital approach has the advantages of reduced cost and expanded versatility, but requires a transformation from continuous time, where the original curves are defined, to discrete time. This transformation inevitably comes with some deviations from the continuous-time response near the Nyquist frequency. There are many established methods for discretizing continuous-time filters, and these vary in performance, computational cost, and inherent latency. In this work, several methods for performing this transformation are explored in the context of phonograph playback equalization, and the performance of each approach is quantified. This work is intended as a resource for anyone developing systems for digital playback equalization or similar applications that require approximating the response of a continuous-time filter digitally.

Moderators

Rob Maher

Professor, Montana State University

Audio digital signal processing, audio forensics, music analysis and synthesis.

Speakers

Benjamin Thompson

PhD Student, University of Rochester

Authors

Benjamin Thompson

PhD Student, University of Rochester

Jenna Rutowski

Michael Heilemann

Tre DiPassio

Tuesday October 8, 2024 3:00pm - 3:30pm EDT
1E04

Signal Processing, Paper Lecture

3:30pm EDT

Leveraging TSN Protocols to Support AES67: Achieving AVB Quality with Layer 3 Benefits

Tuesday October 8, 2024 3:30pm - 3:50pm EDT

1E04

This paper investigates using Time-Sensitive Networking (TSN) protocols, particularly from Audio Video Bridging (AVB), to support AES67 audio transport. By leveraging the IEEE 1588 Level 3 Precision Time Protocol (PTP) Media Profile, packet scheduling, and bandwidth reservation, we demonstrate that AES67 can be transported with AVB-equivalent quality guarantees while benefiting from Layer 3 networking advantages. The evolution of professional audio networking has increased the demand for high-quality, interoperable, and efficiently managed networks. AVB provides robust Layer 2 delivery guarantees but is limited by Layer 2 constraints. AES67 offers Layer 3 interoperability but lacks strict quality of service (QoS) guarantees. This paper proposes combining the strengths of both approaches by using TSN protocols to support AES67, ensuring precise audio transmission with Layer 3 flexibility. TSN extends AVB standards for time synchronization, traffic shaping, and resource reservation, ensuring low latency, low jitter, and minimal packet loss. AES67, a standard for high-performance audio over IP, leverages ubiquitous IP infrastructure for scalability and flexibility but lacks the QoS needed for professional audio. Integrating TSN protocols with AES67 achieves AVB's QoS guarantees in a Layer 3 environment. IEEE 1588 Level 3 PTP Media Profile ensures precise synchronization, packet scheduling reduces latency and jitter, and bandwidth reservation prevents congestion. Experiments show that TSN protocols enable AES67 to achieve latency, jitter, and packet loss performance on par with AVB, providing reliable audio transmission suitable for professional applications in modern, scalable networks.

Moderators

Rob Maher

Professor, Montana State University

Audio digital signal processing, audio forensics, music analysis and synthesis.

Speakers

Nicolas Sturmel

Directout GmbH

Authors

Nicolas Sturmel

Directout GmbH

Claudio Becker-Foss

Tuesday October 8, 2024 3:30pm - 3:50pm EDT
1E04

Signal Processing, Paper Lecture

3:50pm EDT

Harnessing Diffuse Signal Processing (DiSP) to Mitigate Coherent Interference

Tuesday October 8, 2024 3:50pm - 4:10pm EDT

1E04

Coherent sound wave interference is a persistent challenge in live sound reinforcement, where phase differences between multiple loudspeakers lead to destructive interference, resulting in inconsistent audio coverage. This review paper presents a modern solution: Diffuse Signal Processing (DiSP), which utilizes Temporally Diffuse Impulses (TDIs) to mitigate phase cancellation. Unlike traditional methods focused on phase alignment, DiSP manipulates the temporal and spectral characteristics of sound, effectively diffusing coherent wavefronts. TDIs, designed to spread acoustic energy over time, are synthesized and convolved with audio signals to reduce the likelihood of interference. This process maintains the original sound’s perceptual integrity while enhancing spatial consistency, particularly in large-scale sound reinforcement systems. Practical implementation methods are demonstrated, including a MATLAB-based workflow for generating TDIs and optimizing them for specific frequency ranges or acoustic environments. Furthermore, dynamic DiSP is introduced as a method for addressing interference caused by early reflections in small-to-medium sized rooms. This technique adapts TDIs in real-time, ensuring ongoing decorrelation in complex environments. The potential for future developments, such as integrating DiSP with immersive audio systems or creating dedicated hardware for real-time signal processing, is also discussed.

Moderators

Rob Maher

Professor, Montana State University

Audio digital signal processing, audio forensics, music analysis and synthesis.

Speakers

Tommy Spurgeon

Physics Student & Undergraduate Researcher, University of South Carolina

Authors

Tommy Spurgeon

Physics Student & Undergraduate Researcher, University of South Carolina

Tuesday October 8, 2024 3:50pm - 4:10pm EDT
1E04

Signal Processing, Paper Lecture