An improved estimation of a priori speech absence probability for speech enhancement: In perspective of speech perception

Min Seok Choi, Hong Goo Kang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Citations (Scopus)

Abstract

The purpose of this paper is to improve the perceptual quality of a single-channel speech enhancement algorithm based on the minimum mean-square error log-spectral amplitude (MMSE LSA) estimator. The proposed algorithm uses a non-linear decision rule and an adaptive recursive averaging factor to track the a priori speech absence probability (SAP) quickly. We also smooth the a priori SAP and the final gain term over one-third of an approximated critical bandwidth, which eliminates musical noise without much signal distortion. Performance is evaluated with subjective A/B listening tests and spectral distance measurements. Simulation results verify the effectiveness of the proposed algorithm compared with conventional algorithms.
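The abstract does not give the decision rule, the averaging factor, or the critical-bandwidth approximation, so the following is only a rough sketch of the two ideas it names, not the authors' method. Everything specific is an assumption made for illustration: the hard threshold on the a posteriori SNR (a stand-in for the paper's non-linear rule), the constants `threshold`, `alpha_min`, and `alpha_max`, the way the averaging factor adapts, and the use of the Zwicker-Terhardt formula as the "approximated critical bandwidth".

```python
def update_sap(q_prev, post_snr, threshold=0.8, alpha_min=0.5, alpha_max=0.95):
    """One recursive-averaging step of the a priori SAP for a single frequency bin.

    All constants are illustrative assumptions, not values from the paper.
    """
    # Hard decision (assumed stand-in for the paper's non-linear rule):
    # the bin counts as speech-absent when the a posteriori SNR is low.
    absent = 1.0 if post_snr < threshold else 0.0
    # Assumed adaptation: the further the decision is from the current
    # estimate, the smaller the averaging factor, so tracking is faster.
    alpha = alpha_max - (alpha_max - alpha_min) * abs(absent - q_prev)
    return alpha * q_prev + (1.0 - alpha) * absent


def critical_bandwidth_hz(f_hz):
    """Zwicker-Terhardt critical-bandwidth approximation (assumed choice)."""
    return 25.0 + 75.0 * (1.0 + 1.4 * (f_hz / 1000.0) ** 2) ** 0.69


def smooth_over_third_cb(values, sample_rate, fft_size):
    """Moving average across bins, window about one-third of a critical band."""
    bin_hz = sample_rate / fft_size
    smoothed = []
    for k in range(len(values)):
        width_hz = critical_bandwidth_hz(k * bin_hz) / 3.0
        # Number of neighbouring bins on each side that fit in the window.
        half = int(width_hz / bin_hz / 2.0)
        lo = max(0, k - half)
        hi = min(len(values), k + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed
```

In this sketch, `update_sap` would be called once per bin and frame, and `smooth_over_third_cb` would then be applied to the per-bin SAP values (and, separately, to the gain values) of each frame. With a narrow window at low frequencies and a wider one at high frequencies, the smoothing roughly follows the ear's frequency resolution.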

Original language: English
Title of host publication: 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: I1117-I1120
ISBN (Print): 0780388747, 9780780388741
DOIs: https://doi.org/10.1109/ICASSP.2005.1415314
Publication status: Published - 2005 Jan 1
Event: 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
Duration: 2005 Mar 18 to 2005 Mar 23

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: I
ISSN (Print): 1520-6149

Other

Other: 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
Country: United States
City: Philadelphia, PA
Period: 2005 Mar 18 to 2005 Mar 23

Fingerprint

Speech enhancement
Bandwidth

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Choi, M. S., & Kang, H. G. (2005). An improved estimation of a priori speech absence probability for speech enhancement: In perspective of speech perception. In 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing (pp. I1117-I1120). [1415314] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. I). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP.2005.1415314
@inproceedings{1cd59f9b5cc8496f9d3fd815327fd188,
title = "An improved estimation of a priori speech absence probability for speech enhancement: In perspective of speech perception",
author = "Choi, {Min Seok} and Kang, {Hong Goo}",
year = "2005",
month = "1",
day = "1",
doi = "10.1109/ICASSP.2005.1415314",
language = "English",
isbn = "0780388747",
series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "I1117--I1120",
booktitle = "2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing",
address = "United States",

}

TY - GEN

T1 - An improved estimation of a priori speech absence probability for speech enhancement

T2 - In perspective of speech perception

AU - Choi, Min Seok

AU - Kang, Hong Goo

PY - 2005/1/1

Y1 - 2005/1/1

UR - http://www.scopus.com/inward/record.url?scp=33646793341&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33646793341&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2005.1415314

DO - 10.1109/ICASSP.2005.1415314

M3 - Conference contribution

AN - SCOPUS:33646793341

SN - 0780388747

SN - 9780780388741

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - I1117-I1120

BT - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing

PB - Institute of Electrical and Electronics Engineers Inc.

ER -
