Video coding based on audio-visual attention

Jong Seok Lee, Francesca De Simone, Touradj Ebrahimi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Citations (Scopus)

Abstract

This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans' perception of multimedia content. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to video frames in order to reduce redundant high-frequency information and achieve coding efficiency. We demonstrate the effectiveness of the proposed method for H.264/AVC coding along with the results of a subjective evaluation.

Original languageEnglish
Title of host publicationProceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009
Pages57-60
Number of pages4
DOIs
Publication statusPublished - 2009 Nov 20
Event2009 IEEE International Conference on Multimedia and Expo, ICME 2009 - New York, NY, United States
Duration: 2009 Jun 282009 Jul 3

Publication series

NameProceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009

Other

Other2009 IEEE International Conference on Multimedia and Expo, ICME 2009
CountryUnited States
CityNew York, NY
Period09/6/2809/7/3

Fingerprint

Image coding
Acoustic waves

All Science Journal Classification (ASJC) codes

  • Computer Graphics and Computer-Aided Design
  • Computer Networks and Communications
  • Hardware and Architecture
  • Software

Cite this

Lee, J. S., De Simone, F., & Ebrahimi, T. (2009). Video coding based on audio-visual attention. In Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009 (pp. 57-60). [5202435] (Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009). https://doi.org/10.1109/ICME.2009.5202435
Lee, Jong Seok ; De Simone, Francesca ; Ebrahimi, Touradj. / Video coding based on audio-visual attention. Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009. 2009. pp. 57-60 (Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009).
@inproceedings{c9a0bb1907a94202a29bcd9d8c182fde,
title = "Video coding based on audio-visual attention",
abstract = "This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans' perception of multimedia content. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to video frames in order to reduce redundant high-frequency information and achieve coding efficiency. We demonstrate the effectiveness of the proposed method for H.264/AVC coding along with the results of a subjective evaluation.",
author = "Lee, {Jong Seok} and {De Simone}, Francesca and Touradj Ebrahimi",
year = "2009",
month = "11",
day = "20",
doi = "10.1109/ICME.2009.5202435",
language = "English",
isbn = "9781424442911",
series = "Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009",
pages = "57--60",
booktitle = "Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009",

}

Lee, JS, De Simone, F & Ebrahimi, T 2009, Video coding based on audio-visual attention. in Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009., 5202435, Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009, pp. 57-60, 2009 IEEE International Conference on Multimedia and Expo, ICME 2009, New York, NY, United States, 09/6/28. https://doi.org/10.1109/ICME.2009.5202435

Video coding based on audio-visual attention. / Lee, Jong Seok; De Simone, Francesca; Ebrahimi, Touradj.

Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009. 2009. p. 57-60 5202435 (Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Video coding based on audio-visual attention

AU - Lee, Jong Seok

AU - De Simone, Francesca

AU - Ebrahimi, Touradj

PY - 2009/11/20

Y1 - 2009/11/20

N2 - This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans' perception of multimedia content. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to video frames in order to reduce redundant high-frequency information and achieve coding efficiency. We demonstrate the effectiveness of the proposed method for H.264/AVC coding along with the results of a subjective evaluation.

AB - This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans' perception of multimedia content. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to video frames in order to reduce redundant high-frequency information and achieve coding efficiency. We demonstrate the effectiveness of the proposed method for H.264/AVC coding along with the results of a subjective evaluation.

UR - http://www.scopus.com/inward/record.url?scp=70449553985&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70449553985&partnerID=8YFLogxK

U2 - 10.1109/ICME.2009.5202435

DO - 10.1109/ICME.2009.5202435

M3 - Conference contribution

AN - SCOPUS:70449553985

SN - 9781424442911

T3 - Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009

SP - 57

EP - 60

BT - Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009

ER -

Lee JS, De Simone F, Ebrahimi T. Video coding based on audio-visual attention. In Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009. 2009. p. 57-60. 5202435. (Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009). https://doi.org/10.1109/ICME.2009.5202435