Efficient video coding in H.264/AVC by using audio-visual information

Jong Seok Lee, Touradj Ebrahimi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

This paper proposes an efficient video coding method which utilizes audio-visual information, based on the observation that sound-emitting regions in a video sequence attract observer's attention. The regions responsible for the sound are identified by an audio-visual source localization algorithm. Then, the result is used for encoding different regions in the scene with different quality in such a way that a region far from the sound source is coded with a lesser quality than the sound-emitting regions. This is implemented by assigning different quantization parameter values for different regions in H.264/AVC. Experimental results demonstrate the effectiveness of the proposed approach.

Original languageEnglish
Title of host publication2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09
DOIs
Publication statusPublished - 2009
Event2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09 - Rio De Janeiro, Brazil
Duration: 2009 Oct 52009 Oct 7

Publication series

Name2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09

Other

Other2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09
CountryBrazil
CityRio De Janeiro
Period09/10/509/10/7

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Signal Processing

Fingerprint Dive into the research topics of 'Efficient video coding in H.264/AVC by using audio-visual information'. Together they form a unique fingerprint.

  • Cite this

    Lee, J. S., & Ebrahimi, T. (2009). Efficient video coding in H.264/AVC by using audio-visual information. In 2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09 [5293288] (2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09). https://doi.org/10.1109/MMSP.2009.5293288