Visual-speech-pass filtering for robust automatic lip-reading

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

This paper proposes a temporal filtering technique used in extraction of visual features for improved robustness of automatic lip-reading, called visual-speech-pass filtering. A band-pass filter is applied to the pixel value sequence of the images containing the speaker's lip region to remove unwanted variations that are not relevant to the speech information. The filter is carefully designed based on psychological, spectral, and experimental analyses. Experimental results on two speaker-independent and one speaker-dependent recognition tasks demonstrate that the proposed technique significantly improves recognition performance in both clean and visually noisy conditions.

Original languageEnglish
Pages (from-to)611-621
Number of pages11
JournalPattern Analysis and Applications
Volume17
Issue number3
DOIs
Publication statusPublished - 2014 Aug

Fingerprint

Bandpass filters
Pixels

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Cite this

@article{c585ec3b21904421ab8479c3c919e2de,
title = "Visual-speech-pass filtering for robust automatic lip-reading",
abstract = "This paper proposes a temporal filtering technique used in extraction of visual features for improved robustness of automatic lip-reading, called visual-speech-pass filtering. A band-pass filter is applied to the pixel value sequence of the images containing the speaker's lip region to remove unwanted variations that are not relevant to the speech information. The filter is carefully designed based on psychological, spectral, and experimental analyses. Experimental results on two speaker-independent and one speaker-dependent recognition tasks demonstrate that the proposed technique significantly improves recognition performance in both clean and visually noisy conditions.",
author = "Lee, {Jong Seok}",
year = "2014",
month = "8",
doi = "10.1007/s10044-013-0350-x",
language = "English",
volume = "17",
pages = "611--621",
journal = "Pattern Analysis and Applications",
issn = "1433-7541",
publisher = "Springer London",
number = "3",

}

Visual-speech-pass filtering for robust automatic lip-reading. / Lee, Jong Seok.

In: Pattern Analysis and Applications, Vol. 17, No. 3, 08.2014, p. 611-621.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Visual-speech-pass filtering for robust automatic lip-reading

AU - Lee, Jong Seok

PY - 2014/8

Y1 - 2014/8

N2 - This paper proposes a temporal filtering technique used in extraction of visual features for improved robustness of automatic lip-reading, called visual-speech-pass filtering. A band-pass filter is applied to the pixel value sequence of the images containing the speaker's lip region to remove unwanted variations that are not relevant to the speech information. The filter is carefully designed based on psychological, spectral, and experimental analyses. Experimental results on two speaker-independent and one speaker-dependent recognition tasks demonstrate that the proposed technique significantly improves recognition performance in both clean and visually noisy conditions.

AB - This paper proposes a temporal filtering technique used in extraction of visual features for improved robustness of automatic lip-reading, called visual-speech-pass filtering. A band-pass filter is applied to the pixel value sequence of the images containing the speaker's lip region to remove unwanted variations that are not relevant to the speech information. The filter is carefully designed based on psychological, spectral, and experimental analyses. Experimental results on two speaker-independent and one speaker-dependent recognition tasks demonstrate that the proposed technique significantly improves recognition performance in both clean and visually noisy conditions.

UR - http://www.scopus.com/inward/record.url?scp=84903955792&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84903955792&partnerID=8YFLogxK

U2 - 10.1007/s10044-013-0350-x

DO - 10.1007/s10044-013-0350-x

M3 - Article

AN - SCOPUS:84903955792

VL - 17

SP - 611

EP - 621

JO - Pattern Analysis and Applications

JF - Pattern Analysis and Applications

SN - 1433-7541

IS - 3

ER -