Parametric-based non-intrusive speech quality assessment by deep neural network

Haemin Yang, Kyungguen Byun, Hong Goo Kang, Youngsu Kwak

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

This paper proposes a deep neural network (DNN) based non-intrusive speech quality estimation method in real-Time voice communication systems. Since the proposed method only utilizes real-Time control protocol (RTCP) information in the receiver side and does not need a reference signal, it is possible to continuously monitor the quality of service (QoS). Unlike the conventional non-intrusive E-model system that predicts QoS by utilizing delay, jitter, and type of codec with a rule-based method, the proposed method actively estimates the non-linear relationship between multi-dimensional parameters of RTCP and subjectively motivated reference scores using a DNN structure. In order to select efficient features, the relationship between each parameter of RTCP and perceptual objective listening quality assessment (POLQA) is thoroughly investigated, then we train the DNN model by changing the number of layers and nodes. The proposed algorithm achieved 0.8693 correlation with 21,206 reference POLQA scores that are sampled from real environment.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE International Conference on Digital Signal Processing, DSP 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages99-103
Number of pages5
ISBN (Electronic)9781509041657
DOIs
Publication statusPublished - 2016 Jul 2
Event2016 IEEE International Conference on Digital Signal Processing, DSP 2016 - Beijing, China
Duration: 2016 Oct 162016 Oct 18

Publication series

NameInternational Conference on Digital Signal Processing, DSP
Volume0

Other

Other2016 IEEE International Conference on Digital Signal Processing, DSP 2016
Country/TerritoryChina
CityBeijing
Period16/10/1616/10/18

All Science Journal Classification (ASJC) codes

  • Signal Processing

Fingerprint

Dive into the research topics of 'Parametric-based non-intrusive speech quality assessment by deep neural network'. Together they form a unique fingerprint.

Cite this