Predicting speech discrimination scores from pure-tone thresholds—A machine learning-based approach using data from 12,697 subjects

Hantai Kim, Jae Yeon Park, Yun Hoon Choung, Jeong Hun Jang, Jeong Gil Ko

Research output: Contribution to journalArticlepeer-review


Diagnostic tests for hearing impairment not only determines the presence (or absence) of hearing loss, but also evaluates its degree and type, and provides physicians with essential data for future treatment and rehabilitation. Therefore, accurately measuring hearing loss conditions is very important for proper patient understanding and treatment. In current-day practice, to quantify the level of hearing loss, physicians exploit specialized test scores such as the pure-tone audiometry (PTA) thresholds and speech discrimination scores (SDS) as quantitative metrics in examining a patient’s auditory function. However, given that these metrics can be easily affected by various human factors, which includes intentional (or accidental) patient intervention, there are needs to cross validate the accuracy of each metric. By understanding a “normal” relationship between the SDS and PTA, physicians can reveal the need for re-testing, additional testing in different dimensions, and also potential malingering cases. For this purpose, in this work, we propose a prediction model for estimating the SDS of a patient by using PTA thresholds via a Random Forest-based machine learning approach to overcome the limitations of the conventional statistical (or even manual) methods. For designing and evaluating the Random Forest-based prediction model, we collected a large-scale dataset from 12,697 subjects, and report a SDS level prediction accuracy of 95.05% and 96.64% for the left and right ears, respectively. We also present comparisons with other widely-used machine learning algorithms (e.g., Support Vector Machine, Multilayer Perceptron) to show the effectiveness of our proposed Random Forest-based approach. Results obtained from this study provides implications and potential feasibility in providing a practically-applicable screening tool for identifying patient-intended malingering in hearing loss-related tests.

Original languageEnglish
Article numbere0261433
JournalPloS one
Issue number12 December
Publication statusPublished - 2021 Dec

Bibliographical note

Funding Information:
This study was supported by the National Research Foundation of Korea (NRF) Grant funded by the Korean Government (MSIT) in the form of funds to JGK [Basic Science Research Program: 2021R1A2C4002380; ERC: 2015R1A5A1037668] and HK, YHC, and JHJ [KMDFPR202105270007202102]. This study was also supported by the Institute for Information and Communications Technology Planning and Evaluation to JGK [IITP-2020-2020-0-01461].

Publisher Copyright:
© 2021 Kim et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

All Science Journal Classification (ASJC) codes

  • General


Dive into the research topics of 'Predicting speech discrimination scores from pure-tone thresholds—A machine learning-based approach using data from 12,697 subjects'. Together they form a unique fingerprint.

Cite this