当前位置: X-MOL 学术Comput. Speech Lang › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Detection of vowel transition regions from Hindi language
Computer Speech & Language ( IF 4.3 ) Pub Date : 2021-04-26 , DOI: 10.1016/j.csl.2021.101231
Jainath Yadav

The vowel transition regions are the crucial landmarks in the speech signal. These vital regions are present at both ends of the vowel. They lie in the junction between a consonant and a vowel (CV) regions. This region plays an important role in numerous speech applications like speaker recognition, emotion conversion, speech rate modification, and CV unit recognition. The performance of these applications crucially depends on the accuracy of the estimation of vowel transition regions. In this paper, we have proposed a method for determining the transition regions based on the rate of change of formant frequencies using zero-time windowing and numerator of the group-delay function. Zero-time windowing derives the instantaneous formant frequencies accurately at every sample location due to the contribution of that sample itself. The numerator of the group-delay function enhances the formant frequencies. The proposed transition region detection method is evaluated on CV, and continuous speech databases recorded in the Hindi language. The proposed method has shown around 12% improvement in accuracy compared to the existing method.



中文翻译:

用印地语检测元音过渡区

元音过渡区域是语音信号中的关键标志。这些重要区域存在于元音的两端。它们位于辅音和元音(CV)区域之间的交界处。该区域在众多语音应用中扮演重要角色,例如说话者识别,情感转换,语速修改和CV单元识别。这些应用程序的性能关键取决于元音过渡区域的估计精度。在本文中,我们提出了一种使用零时窗和群延迟函数分子基于共振峰频率变化率确定过渡区域的方法。由于该样本本身的贡献,零时加窗精确地导出了每个样本位置处的瞬时共振峰频率。群延迟函数的分子提高了共振峰频率。提出的过渡区域检测方法在CV上进行评估,并以印地语记录连续语音数据库。与现有方法相比,该方法的准确性提高了约12%。

更新日期:2021-05-09
down
wechat
bug