Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement

Haemin Yang, Soyeon Choe, Keulbit Kim, Hong Goo Kang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a function of noise power spectral density (PSD) in time-frequency domain, and the state-of-the-art algorithm also introduces additional processes to estimate speech presence probability (SPP) to further enhance the estimation. Due to many tuning parameters, however, it is not easy to implement an algorithm that reliably estimates SPP in noise varying environment. We proposed a combination of deep learning network and an effective training method to enhance the performance of the SPP estimation module. The proposed approach is regarded as a hybrid approach, with the noise reduction factor still estimated by the conventional statistic-based single channel enhancement algorithms. The advantages and disadvantages of the proposed approach compared to deep learning approach of single channel speech enhancement are also investigated.

Original languageEnglish
Title of host publication2018 International Conference on Signals and Systems, ICSigSys 2018 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages267-270
Number of pages4
ISBN (Electronic)9781538656891
DOIs
Publication statusPublished - 2018 Jun 4
Event2nd International Conference on Signals and Systems, ICSigSys 2018 - Bali, Indonesia
Duration: 2018 May 12018 May 3

Publication series

Name2018 International Conference on Signals and Systems, ICSigSys 2018 - Proceedings

Conference

Conference2nd International Conference on Signals and Systems, ICSigSys 2018
CountryIndonesia
CityBali
Period18/5/118/5/3

All Science Journal Classification (ASJC) codes

  • Signal Processing
  • Radiology Nuclear Medicine and imaging
  • Instrumentation

Fingerprint Dive into the research topics of 'Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement'. Together they form a unique fingerprint.

  • Cite this

    Yang, H., Choe, S., Kim, K., & Kang, H. G. (2018). Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement. In 2018 International Conference on Signals and Systems, ICSigSys 2018 - Proceedings (pp. 267-270). (2018 International Conference on Signals and Systems, ICSigSys 2018 - Proceedings). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICSIGSYS.2018.8372770