当前位置: X-MOL 学术Trans. Emerg. Telecommun. Technol. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A novel double pole transfer function‐single frequency filtering approach for speech enhancement
Transactions on Emerging Telecommunications Technologies ( IF 3.6 ) Pub Date : 2020-08-07 , DOI: 10.1002/ett.4038
V. Srinivasarao 1 , Umesh Ghanekar 1
Affiliation  

Speech intelligibility improvement is a major field of recent research with in audiology and hearing domain. Speech signal usually gets affected due to various environmental and background noise. Several approaches for speech intelligibility improvement were presented in the literature. In this paper, a method of speech enhancement based on Double Pole Transfer Function‐Single Frequency Filter (DPTF‐SFF) has been proposed. In the SFF approach the temporal and spectral resolutions are controlled with a one particular value of the filter that corresponds to the pole position in the complex z‐plane. The proposed method uses double pole transfer function SFF that improves the magnitude and phase for various frequencies which are used in the synthesis of the original signal. In the analysis of this method clean speech signals are used from GRID corpus database and noise signals are taken from NOIZEUS and NOISEX‐92 databases. The performance evaluation of this method is carried out in terms of Perceptual Evaluation of Speech Quality, Short‐Time Objective Intelligibility, and Segmental Signal to Noise Ratio metrics. The implementation results proved that the proposed method is an efficient one as it gives higher intelligibility scores compared to the existing methods at different SNR levels (dB) and for different noises employed.

中文翻译:

一种新颖的双极转移函数-单频滤波方法,用于语音增强

语音清晰度的提高是听力学和听力领域最近研究的一个主要领域。语音信号通常会受到各种环境和背景噪声的影响。文献中提出了几种提高语音清晰度的方法。本文提出了一种基于双极传递函数-单频滤波器(DPTF-SFF)的语音增强方法。在SFF方法中,时间和频谱分辨率由滤波器的一个特定值控制,该值对应于复z平面中的极点位置。所提出的方法使用双极传递函数SFF,该函数改善了原始信号合成中使用的各种频率的幅度和相位。在此方法的分析中,从GRID语料库数据库中使用干净的语音信号,并从NOIZEUS和NOISEX-92数据库中获取噪声信号。该方法的性能评估是根据语音质量的感知评估,短期目标清晰度和分段信噪比指标进行的。实施结果证明,该方法是一种有效的方法,因为与现有方法相比,在不同的SNR级别(dB)和采用的不同噪声下,该方法具有更高的清晰度。
更新日期:2020-08-07
down
wechat
bug