The effects of lexical content, acoustic and linguistic variability, and vocoding on voice cue perception,The Journal of the Acoustical Society of America

当前位置： X-MOL 学术 › J. Acoust. Soc. Am. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

The effects of lexical content, acoustic and linguistic variability, and vocoding on voice cue perception
The Journal of the Acoustical Society of America ( IF 2.4 ) Pub Date : 2021-09-07 , DOI: 10.1121/10.0005938
Thomas Koelewijn ₁ , Etienne Gaudrain ₂ , Terrin Tamati ₃ , Deniz Başkent ₁

Affiliation

Perceptual differences in voice cues, such as fundamental frequency (F0) and vocal tract length (VTL), can facilitate speech understanding in challenging conditions. Yet, we hypothesized that in the presence of spectrotemporal signal degradations, as imposed by cochlear implants (CIs) and vocoders, acoustic cues that overlap for voice perception and phonemic categorization could be mistaken for one another, leading to a strong interaction between linguistic and indexical (talker-specific) content. Fifteen normal-hearing participants performed an odd-one-out adaptive task measuring just-noticeable differences (JNDs) in F0 and VTL. Items used were words (lexical content) or time-reversed words (no lexical content). The use of lexical content was either promoted (by using variable items across comparison intervals) or not (fixed item). Finally, stimuli were presented without or with vocoding. Results showed that JNDs for both F0 and VTL were significantly smaller (better) for non-vocoded compared with vocoded speech and for fixed compared with variable items. Lexical content (forward vs reversed) affected VTL JNDs in the variable item condition, but F0 JNDs only in the non-vocoded, fixed condition. In conclusion, lexical content had a positive top–down effect on VTL perception when acoustic and linguistic variability was present but not on F0 perception. Lexical advantage persisted in the most degraded conditions and vocoding even enhanced the effect of item variability, suggesting that linguistic content could support compensation for poor voice perception in CI users.

中文翻译：

词汇内容、声学和语言可变性以及声码对语音提示感知的影响

语音提示中的感知差异，例如基频 (F0) 和声道长度 (VTL)，可以促进具有挑战性的条件下的语音理解。然而，我们假设，在存在由人工耳蜗 (CI) 和声码器强加的光谱时间信号退化的情况下，声音感知和音素分类重叠的声学线索可能会被误认为彼此，导致语言和索引之间的强烈相互作用（特定于谈话者的）内容。15 名听力正常的参与者执行了一项奇一出适应性任务，测量 F0 和 VTL 中的刚好可察觉的差异 (JND)。使用的项目是单词（词汇内容）或时间反转单词（无词汇内容）。词汇内容的使用要么被提升（通过使用跨比较区间的变量项），要么不被（固定项）。最后，在没有或有声码的情况下呈现刺激。结果表明，与声码语音相比，非声码语音和固定语音与可变项目相比，F0 和 VTL 的 JND 显着更小（更好）。词汇内容（正向与反向）在可变项目条件下影响 VTL JND，但 F0 JND 仅在非语音编码、固定条件下。总之，当存在声学和语言变异性时，词汇内容对 VTL 感知具有积极的自上而下的影响，但对 F0 感知没有。词汇优势在最退化的条件下仍然存在，声码甚至增强了项目可变性的影响，这表明语言内容可以支持对 CI 用户语音感知不佳的补偿。结果表明，与声码语音相比，非声码语音和固定语音与可变项目相比，F0 和 VTL 的 JND 显着更小（更好）。词汇内容（正向与反向）在可变项目条件下影响 VTL JND，但 F0 JND 仅在非语音编码、固定条件下。总之，当存在声学和语言变异性时，词汇内容对 VTL 感知具有积极的自上而下的影响，但对 F0 感知没有。词汇优势在最退化的条件下仍然存在，声码甚至增强了项目可变性的影响，这表明语言内容可以支持对 CI 用户语音感知不佳的补偿。结果表明，与声码语音相比，非声码语音和固定语音与可变项目相比，F0 和 VTL 的 JND 显着更小（更好）。词汇内容（正向与反向）在可变项目条件下影响 VTL JND，但 F0 JND 仅在非语音编码、固定条件下。总之，当存在声学和语言变异性时，词汇内容对 VTL 感知具有积极的自上而下的影响，但对 F0 感知没有。词汇优势在最退化的条件下仍然存在，声码甚至增强了项目可变性的影响，这表明语言内容可以支持对 CI 用户语音感知不佳的补偿。词汇内容（正向与反向）在可变项目条件下影响 VTL JND，但 F0 JND 仅在非语音编码、固定条件下。总之，当存在声学和语言变异性时，词汇内容对 VTL 感知有积极的自上而下的影响，但对 F0 感知没有。词汇优势在最退化的条件下仍然存在，声码甚至增强了项目可变性的影响，这表明语言内容可以支持对 CI 用户语音感知不佳的补偿。词汇内容（正向与反向）在可变项目条件下影响 VTL JND，但 F0 JND 仅在非语音编码、固定条件下。总之，当存在声学和语言变异性时，词汇内容对 VTL 感知具有积极的自上而下的影响，但对 F0 感知没有。词汇优势在最退化的条件下仍然存在，声码甚至增强了项目可变性的影响，这表明语言内容可以支持对 CI 用户语音感知不佳的补偿。

更新日期：2021-09-07

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文

全部期刊列表>>