Sculpting speech from noise, music, and other sources. The Journal of the Acoustical Society of America

Sculpting speech from noise, music, and other sources.
The Journal of the Acoustical Society of America (IF 2.4), Pub Date: 2020-07-06, DOI: 10.1121/10.0001474
Martin Cooke, María Luisa García Lecumberri

Intelligible speech can be generated by passing a signal through a time-frequency mask that selects which information to retain, even when the signal is speech-shaped noise, suggesting an important role for the mask pattern itself. The current study examined the relationship between the signal and the mask by varying the availability of target speech cues in the signal while holding the mask constant. Keyword identification rates in everyday sentences varied from near-ceiling to near-floor levels as the signal was varied, indicating that the interaction between the signal and mask, rather than the mask alone, determines intelligibility.
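As a rough illustration of what passing a signal through a time-frequency mask can mean, the sketch below builds a binary mask and applies it to an unrelated carrier signal. The abstract does not say how the study's mask was computed, so the local signal-to-noise criterion used here (keep a cell when the target exceeds a masker by more than a threshold) is an assumption borrowed from the common "ideal binary mask" idea, and the names `binary_mask`, `apply_mask`, and `threshold_db` are hypothetical. Energies are given as small hand-written grids (rows are frequency channels, columns are time frames) rather than computed from audio.

```python
import math

def binary_mask(target_energy, masker_energy, threshold_db=0.0):
    """Return a 0/1 grid: 1 where the target exceeds the masker by more than threshold_db.

    Assumption: this local-SNR rule is one common way to define a mask;
    it is not taken from the abstract.
    """
    mask = []
    for t_row, m_row in zip(target_energy, masker_energy):
        row = []
        for t, m in zip(t_row, m_row):
            # Local SNR in dB; the small floor avoids log10(0).
            snr_db = 10.0 * math.log10((t + 1e-12) / (m + 1e-12))
            row.append(1 if snr_db > threshold_db else 0)
        mask.append(row)
    return mask

def apply_mask(mask, signal_tf):
    """Keep the time-frequency cells of signal_tf where mask is 1, zero the rest."""
    return [
        [m * s for m, s in zip(m_row, s_row)]
        for m_row, s_row in zip(mask, signal_tf)
    ]

# Toy data: 2 frequency channels x 3 time frames (illustrative values only).
target_energy = [[4.0, 0.1, 3.0],
                 [0.2, 5.0, 0.1]]
masker_energy = [[1.0, 1.0, 1.0],
                 [1.0, 1.0, 1.0]]

# The carrier stands in for the signal being sculpted (noise, music, ...).
carrier_tf = [[0.5, 0.6, 0.7],
              [0.8, 0.9, 1.0]]

mask = binary_mask(target_energy, masker_energy)
sculpted = apply_mask(mask, carrier_tf)
print(mask)
print(sculpted)
```

In this sketch the mask is fixed by the target and masker alone, while the output still depends on what the carrier contains in the retained cells, which mirrors the abstract's point that the signal and the mask jointly determine the result.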

Updated: 2020-07-06
