Novel model for pitch estimation using hybrid DWT-DCT HPS,International Journal of Information Technology

当前位置： X-MOL 学术 › Int. J. Inf. Technol. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Novel model for pitch estimation using hybrid DWT-DCT HPS
International Journal of Information Technology Pub Date : 2021-03-13 , DOI: 10.1007/s41870-021-00618-w
Dipti Kalra , Rashmi Gupta

Pitch is an important feature of speech. Therefore, extraction of pitch becomes a vital task for processes like speaker coding, speaker recognition, speech synthesis, speech recognition and many such applications. The few available algorithms such as Discrete Cosine Transform (DCT) based pitch extraction, harmonic product spectrum (HPS) which is obtained from DCT are useful for extraction of pitch. In this paper, we propose a hybrid Discrete Wavelet Transform-Discrete Cosine Transform (DWT- DCT HPS) based pitch extraction. A voice sample is taken and de-segmented into 36 bands in the frequency domain. Then on those bands spatial domain transformation is performed to get the most prominent features. The Gross pitch error (GPE) and Fine pitch error (FPE) criteria is used as a measure to find the accuracy of the novel method. The result depicts that the novel proposed Hybrid method is better as compared to DCT-HPS in terms of Pitch error.

中文翻译：

使用混合DWT-DCT HPS进行音高估计的新模型

音调是语音的重要特征。因此，对于诸如说话者编码，说话者识别，语音合成，语音识别以及许多此类应用的处理，音调的提取成为至关重要的任务。很少有可用的算法，例如基于离散余弦变换（DCT）的基音提取，从DCT获得的谐波乘积谱（HPS）可用于基音提取。在本文中，我们提出了一种基于离散小波变换-离散余弦变换（DWT-DCT HPS）的基音提取方法。采集语音样本并将其在频域中细分为36个频段。然后，在这些频段上执行空间域变换，以获取最突出的特征。粗节距误差（GPE）和细节距误差（FPE）准则被用作一种度量，以找到该新方法的准确性。

更新日期：2021-03-15

点击分享查看原文

点击收藏

阅读更多本刊最新论文