Enhancing loudspeaker-based 3D audio with room modeling

Myung Suk Song, Cha Zhang, Dinei Florencio, Hong Goo Kang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

For many years, spatial (3D) sound using headphones has been widely used in a number of applications. A rich spatial sensation is obtained by using head related transfer functions (HRTF) and playing the appropriate sound through headphones. In theory, loudspeaker audio systems would be capable of rendering 3D sound fields almost as rich as headphones, as long as the room impulse responses (RIRs) between the loudspeakers and the ears are known. In practice, however, obtaining these RIRs is hard, and the performance of loudspeaker based systems is far from perfect. New hope has been recently raised by a system that tracks the user's head position and orientation, and incorporates them into the RIRs estimates in real time. That system made two simplifying assumptions: it used generic HRTFs, and it ignored room reverberation. In this paper we tackle the second problem: we incorporate a room reverberation estimate into the RIRs. Note that this is a non-trivial task: RIRs vary significantly with the listener's positions, and even if one could measure them at a few points, they are notoriously hard to interpolate. Instead, we take an indirect approach: we model the room, and from that model we obtain an estimate of the main reflections. Position and characteristics of walls do not vary with the users' movement, yet they allow to quickly compute an estimate of the RIR for each new user position. Of course the key question is whether the estimates are good enough. We show an improvement in localization perception of up to 32% (i.e., reducing average error from 23.5° to 15.9°).

Original languageEnglish
Title of host publication2010 IEEE International Workshop on Multimedia Signal Processing, MMSP2010
Pages34-39
Number of pages6
DOIs
Publication statusPublished - 2010 Dec 1
Event2010 IEEE International Workshop on Multimedia Signal Processing, MMSP2010 - Saint Malo, France
Duration: 2010 Oct 42010 Oct 6

Publication series

Name2010 IEEE International Workshop on Multimedia Signal Processing, MMSP2010

Other

Other2010 IEEE International Workshop on Multimedia Signal Processing, MMSP2010
CountryFrance
CitySaint Malo
Period10/10/410/10/6

All Science Journal Classification (ASJC) codes

  • Computer Graphics and Computer-Aided Design
  • Human-Computer Interaction
  • Signal Processing

Fingerprint Dive into the research topics of 'Enhancing loudspeaker-based 3D audio with room modeling'. Together they form a unique fingerprint.

  • Cite this

    Song, M. S., Zhang, C., Florencio, D., & Kang, H. G. (2010). Enhancing loudspeaker-based 3D audio with room modeling. In 2010 IEEE International Workshop on Multimedia Signal Processing, MMSP2010 (pp. 34-39). [5661990] (2010 IEEE International Workshop on Multimedia Signal Processing, MMSP2010). https://doi.org/10.1109/MMSP.2010.5661990