TY - JOUR
T1 - A context-based approach to predict speech intelligibility in interrupted noise
T2 - Model design
AU - van Schoonhoven, Jelmer
AU - Rhebergen, Koenraad S.
AU - Dreschler, Wouter A.
N1 - Funding Information: This work was financially supported by the Heinsius-Houbolt foundation. The authors thank the two reviewers and the associate editor for their valuable comments. Publisher Copyright: © 2022 Acoustical Society of America.
PY - 2022/2/1
Y1 - 2022/2/1
N2 - The Extended Speech Transmission Index (ESTI) by van Schoonhoven et al. [(2019). J. Acoust. Soc. Am. 145, 1178-1194] was used successfully to predict intelligibility of sentences in fluctuating background noise. However, prediction accuracy was poor when the modulation frequency of the masker was low (<8 Hz). In the current paper, the ESTI was calculated per phoneme to estimate phoneme intelligibility. In the next step, the ESTI model was combined with one of two context models {Boothroyd and Nittrouer, [(1988). J. Acoust. Soc. Am. 84, 101-114]; Bronkhorst et al., [(1993). J. Acoust. Soc. Am. 93, 499-509} in order to improve model predictions. This approach was validated using interrupted speech data, after which it was used to predict speech intelligibility of words in interrupted noise. Model predictions improved using this new method, especially for maskers with interruption rates below 5 Hz. Calculating the ESTI at phoneme level combined with a context model is therefore a viable option to improve prediction accuracy.
AB - The Extended Speech Transmission Index (ESTI) by van Schoonhoven et al. [(2019). J. Acoust. Soc. Am. 145, 1178-1194] was used successfully to predict intelligibility of sentences in fluctuating background noise. However, prediction accuracy was poor when the modulation frequency of the masker was low (<8 Hz). In the current paper, the ESTI was calculated per phoneme to estimate phoneme intelligibility. In the next step, the ESTI model was combined with one of two context models {Boothroyd and Nittrouer, [(1988). J. Acoust. Soc. Am. 84, 101-114]; Bronkhorst et al., [(1993). J. Acoust. Soc. Am. 93, 499-509} in order to improve model predictions. This approach was validated using interrupted speech data, after which it was used to predict speech intelligibility of words in interrupted noise. Model predictions improved using this new method, especially for maskers with interruption rates below 5 Hz. Calculating the ESTI at phoneme level combined with a context model is therefore a viable option to improve prediction accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85125572173&partnerID=8YFLogxK
U2 - https://doi.org/10.1121/10.0009617
DO - https://doi.org/10.1121/10.0009617
M3 - Article
C2 - 35232064
SN - 0001-4966
VL - 151
SP - 1404
EP - 1415
JO - Journal of the Acoustical Society of America
JF - Journal of the Acoustical Society of America
IS - 2
ER -