当前位置: X-MOL 学术arXiv.cs.SD › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Glottal Closure and Opening Instant Detection from Speech Signals
arXiv - CS - Sound Pub Date : 2019-12-28 , DOI: arxiv-2001.00841
Thomas Drugman, Thierry Dutoit

This paper proposes a new procedure to detect Glottal Closure and Opening Instants (GCIs and GOIs) directly from speech waveforms. The procedure is divided into two successive steps. First a mean-based signal is computed, and intervals where speech events are expected to occur are extracted from it. Secondly, at each interval a precise position of the speech event is assigned by locating a discontinuity in the Linear Prediction residual. The proposed method is compared to the DYPSA algorithm on the CMU ARCTIC database. A significant improvement as well as a better noise robustness are reported. Besides, results of GOI identification accuracy are promising for the glottal source characterization.

中文翻译:

从语音信号中即时检测声门闭合和打开

本文提出了一种直接从语音波形中检测声门闭合和打开瞬间(GCI 和 GOI)的新程序。该过程分为两个连续的步骤。首先计算一个基于均值的信号,并从中提取预期发生语音事件的间隔。其次,通过定位线性预测残差中的不连续点,在每个间隔分配语音事件的精确位置。将所提出的方法与 CMU ARCTIC 数据库上的 DYPSA 算法进行了比较。报告了显着的改进以及更好的噪声鲁棒性。此外,GOI 识别精度的结果对于声门声源表征很有希望。
更新日期:2020-01-06
down
wechat
bug