Visual-speech-pass filtering for robust automatic lip-reading

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

This paper proposes a temporal filtering technique used in extraction of visual features for improved robustness of automatic lip-reading, called visual-speech-pass filtering. A band-pass filter is applied to the pixel value sequence of the images containing the speaker's lip region to remove unwanted variations that are not relevant to the speech information. The filter is carefully designed based on psychological, spectral, and experimental analyses. Experimental results on two speaker-independent and one speaker-dependent recognition tasks demonstrate that the proposed technique significantly improves recognition performance in both clean and visually noisy conditions.

Original languageEnglish
Pages (from-to)611-621
Number of pages11
JournalPattern Analysis and Applications
Volume17
Issue number3
DOIs
Publication statusPublished - 2014 Aug

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'Visual-speech-pass filtering for robust automatic lip-reading'. Together they form a unique fingerprint.

  • Cite this