A context-based approach to predict speech intelligibility in interrupted noise: Model design

Jelmer van Schoonhoven; Koenraad S. Rhebergen; Wouter A. Dreschler

doi:https://doi.org/10.1121/10.0009617

A context-based approach to predict speech intelligibility in interrupted noise: Model design

Jelmer van Schoonhoven, Koenraad S. Rhebergen, Wouter A. Dreschler

Research output: Contribution to journal › Article › Academic › peer-review

1 Citation (Scopus)

Abstract

The Extended Speech Transmission Index (ESTI) by van Schoonhoven et al. [(2019). J. Acoust. Soc. Am. 145, 1178-1194] was used successfully to predict intelligibility of sentences in fluctuating background noise. However, prediction accuracy was poor when the modulation frequency of the masker was low (<8 Hz). In the current paper, the ESTI was calculated per phoneme to estimate phoneme intelligibility. In the next step, the ESTI model was combined with one of two context models {Boothroyd and Nittrouer, [(1988). J. Acoust. Soc. Am. 84, 101-114]; Bronkhorst et al., [(1993). J. Acoust. Soc. Am. 93, 499-509} in order to improve model predictions. This approach was validated using interrupted speech data, after which it was used to predict speech intelligibility of words in interrupted noise. Model predictions improved using this new method, especially for maskers with interruption rates below 5 Hz. Calculating the ESTI at phoneme level combined with a context model is therefore a viable option to improve prediction accuracy.

Original language	English
Pages (from-to)	1404-1415
Number of pages	12
Journal	Journal of the Acoustical Society of America
Volume	151
Issue number	2
DOIs	https://doi.org/10.1121/10.0009617
Publication status	Published - 1 Feb 2022

Access to Document

https://doi.org/10.1121/10.0009617

Cite this

@article{15ea057e19184f1eaf5e1ec4f085b763,

title = "A context-based approach to predict speech intelligibility in interrupted noise: Model design",

abstract = "The Extended Speech Transmission Index (ESTI) by van Schoonhoven et al. [(2019). J. Acoust. Soc. Am. 145, 1178-1194] was used successfully to predict intelligibility of sentences in fluctuating background noise. However, prediction accuracy was poor when the modulation frequency of the masker was low (<8 Hz). In the current paper, the ESTI was calculated per phoneme to estimate phoneme intelligibility. In the next step, the ESTI model was combined with one of two context models {Boothroyd and Nittrouer, [(1988). J. Acoust. Soc. Am. 84, 101-114]; Bronkhorst et al., [(1993). J. Acoust. Soc. Am. 93, 499-509} in order to improve model predictions. This approach was validated using interrupted speech data, after which it was used to predict speech intelligibility of words in interrupted noise. Model predictions improved using this new method, especially for maskers with interruption rates below 5 Hz. Calculating the ESTI at phoneme level combined with a context model is therefore a viable option to improve prediction accuracy.",

author = "{van Schoonhoven}, Jelmer and Rhebergen, {Koenraad S.} and Dreschler, {Wouter A.}",

note = "Funding Information: This work was financially supported by the Heinsius-Houbolt foundation. The authors thank the two reviewers and the associate editor for their valuable comments. Publisher Copyright: {\textcopyright} 2022 Acoustical Society of America.",

year = "2022",

month = feb,

day = "1",

doi = "https://doi.org/10.1121/10.0009617",

language = "English",

volume = "151",

pages = "1404--1415",

journal = "Journal of the Acoustical Society of America",

issn = "0001-4966",

publisher = "Acoustical Society of America",

number = "2",

}

TY - JOUR

T1 - A context-based approach to predict speech intelligibility in interrupted noise

T2 - Model design

AU - van Schoonhoven, Jelmer

AU - Rhebergen, Koenraad S.

AU - Dreschler, Wouter A.

N1 - Funding Information: This work was financially supported by the Heinsius-Houbolt foundation. The authors thank the two reviewers and the associate editor for their valuable comments. Publisher Copyright: © 2022 Acoustical Society of America.

PY - 2022/2/1

Y1 - 2022/2/1

N2 - The Extended Speech Transmission Index (ESTI) by van Schoonhoven et al. [(2019). J. Acoust. Soc. Am. 145, 1178-1194] was used successfully to predict intelligibility of sentences in fluctuating background noise. However, prediction accuracy was poor when the modulation frequency of the masker was low (<8 Hz). In the current paper, the ESTI was calculated per phoneme to estimate phoneme intelligibility. In the next step, the ESTI model was combined with one of two context models {Boothroyd and Nittrouer, [(1988). J. Acoust. Soc. Am. 84, 101-114]; Bronkhorst et al., [(1993). J. Acoust. Soc. Am. 93, 499-509} in order to improve model predictions. This approach was validated using interrupted speech data, after which it was used to predict speech intelligibility of words in interrupted noise. Model predictions improved using this new method, especially for maskers with interruption rates below 5 Hz. Calculating the ESTI at phoneme level combined with a context model is therefore a viable option to improve prediction accuracy.

AB - The Extended Speech Transmission Index (ESTI) by van Schoonhoven et al. [(2019). J. Acoust. Soc. Am. 145, 1178-1194] was used successfully to predict intelligibility of sentences in fluctuating background noise. However, prediction accuracy was poor when the modulation frequency of the masker was low (<8 Hz). In the current paper, the ESTI was calculated per phoneme to estimate phoneme intelligibility. In the next step, the ESTI model was combined with one of two context models {Boothroyd and Nittrouer, [(1988). J. Acoust. Soc. Am. 84, 101-114]; Bronkhorst et al., [(1993). J. Acoust. Soc. Am. 93, 499-509} in order to improve model predictions. This approach was validated using interrupted speech data, after which it was used to predict speech intelligibility of words in interrupted noise. Model predictions improved using this new method, especially for maskers with interruption rates below 5 Hz. Calculating the ESTI at phoneme level combined with a context model is therefore a viable option to improve prediction accuracy.

UR - http://www.scopus.com/inward/record.url?scp=85125572173&partnerID=8YFLogxK

U2 - https://doi.org/10.1121/10.0009617

DO - https://doi.org/10.1121/10.0009617

M3 - Article

C2 - 35232064

SN - 0001-4966

VL - 151

SP - 1404

EP - 1415

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

IS - 2

ER -

A context-based approach to predict speech intelligibility in interrupted noise: Model design

Abstract

Access to Document

Other files and links

Cite this