当前位置: X-MOL 学术J. Acoust. Soc. Am. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Auditory and auditory-visual frequency-band importance functions for consonant recognition.
The Journal of the Acoustical Society of America ( IF 2.4 ) Pub Date : 2020-05-29 , DOI: 10.1121/10.0001301
Joshua G W Bernstein 1 , Jonathan H Venezia 2 , Ken W Grant 1
Affiliation  

The relative importance of individual frequency regions for speech intelligibility has been firmly established for broadband auditory-only (AO) conditions. Yet, speech communication often takes place face-to-face. This study tested the hypothesis that under auditory-visual (AV) conditions, where visual information is redundant with high-frequency auditory cues, lower frequency regions will increase in relative importance compared to AO conditions. Frequency band-importance functions for consonants were measured for eight hearing-impaired and four normal-hearing listeners. Speech was filtered into four 1/3-octave bands each separated by an octave to minimize energetic masking. On each trial, the signal-to-noise ratio (SNR) in each band was selected randomly from a 10-dB range. AO and AV band-importance functions were estimated using three logistic-regression analyses: a primary model relating performance to the four independent SNRs; a control model that also included band-interaction terms; and a different set of four control models, each examining one band at a time. For both listener groups, the relative importance of the low-frequency bands increased under AV conditions, consistent with earlier studies using isolated speech bands. All three analyses showed similar results, indicating the absence of cross-band interactions. These results suggest that accurate prediction of AV speech intelligibility may require different frequency-importance functions than for AO conditions.

中文翻译:

用于辅音识别的听觉和听觉视觉频段的重要功能。

对于纯语音听觉(AO)条件,已经牢固地确定了各个频率区域对于语音清晰度的相对重要性。但是,语音交流通常是面对面的。这项研究检验了以下假设:在听觉(AV)条件下,视觉信息在高频听觉提示下是多余的,与AO条件相比,低频区域的相对重要性会增加。对八名听力受损和四名正常听力的听众测量了辅音的频段重要性函数。语音被过滤成四个1/3倍频程带,每个频带被一个八度音程隔开,以最大程度地减少能量掩盖。在每个试验中,从10 dB范围内随机选择每个频带中的信噪比(SNR)。AO和AV频段重要性函数是使用三个logistic回归分析估算的:与性能相关的四个独立SNR的主要模型;一个控制模型,其中还包括带相互作用项;以及一组不同的四个控制模型,每个模型一次检查一个频段。对于两个听众组,在AV条件下,低频频段的相对重要性都增加了,这与早期使用隔离语音频段的研究一致。所有三个分析均显示相似的结果,表明不存在跨带相互作用。这些结果表明,AV语音清晰度的准确预测可能需要与AO条件不同的频率重要性函数。以及一组不同的四个控制模型,每个模型一次检查一个频段。对于两个听众组,在AV条件下,低频频段的相对重要性都增加了,这与早期使用隔离语音频段的研究一致。所有三个分析均显示相似的结果,表明不存在跨带相互作用。这些结果表明,AV语音清晰度的准确预测可能需要与AO条件不同的频率重要性函数。以及一组不同的四个控制模型,每个模型一次检查一个频段。对于两个听众组,在AV条件下,低频频段的相对重要性都增加了,这与早期使用隔离语音频段的研究一致。所有三个分析均显示相似的结果,表明不存在跨带相互作用。这些结果表明,AV语音清晰度的准确预测可能需要与AO条件不同的频率重要性函数。
更新日期:2020-05-29
down
wechat
bug