Real-time Performance Evaluation of Modified Cascaded Median-based Noise Estimation for Speech Enhancement System,Fluctuation and Noise Letters

当前位置： X-MOL 学术 › Fluct. Noise Lett. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Real-time Performance Evaluation of Modified Cascaded Median-based Noise Estimation for Speech Enhancement System
Fluctuation and Noise Letters ( IF 1.8 ) Pub Date : 2019-04-08 , DOI: 10.1142/s0219477519500202
Bittu Kumar ₁

Affiliation

In this paper, the performance evaluation of Modified Cascaded Median (MCM)-based noise estimation method for speech enhancement system has been carried out. The MCM-based method, though reported earlier, was not extensively evaluated; particularly, its real-time performance had not been considered. In the present study, the performance of the MCM-based noise estimation method has been compared with those based on Dynamic Quantile Tracking (DQT) and Cascaded Median (CM), through simulation as well as real-time implementation using TMS320C6416T DSK. All comparisons were made for speech quality (subjectively — mean opinion score and objectively — PESQ score, log-likelihood ratio, weighted spectral slope distance, segmented signal-to-noise ratio and composite measures for signal distortion [Formula: see text], background intrusiveness [Formula: see text] and overall distortion [Formula: see text]) at the 95% level of confidence. The real-time parameters such as memory consumption and execution time have been used for real-time implementation and compared for the three methods. The results, for different SNR-based degraded speech signals, show that the modified cascaded median-based noise estimation is the best in terms of PESQ score, [Formula: see text], [Formula: see text], [Formula: see text] and mean opinion score. On the other hand, for different noise corrupted-based speech signals, it performs well as compared to the original CM. Memory consumption and average execution time for the MCM-based noise estimation lie in-between those for DQT and CM-based methods.

中文翻译：

语音增强系统改进级联中值噪声估计的实时性能评估

本文对基于改进级联中值（MCM）的语音增强系统噪声估计方法进行了性能评估。基于 MCM 的方法虽然较早报道，但并未得到广泛评估；特别是，它的实时性能没有被考虑在内。在本研究中，通过仿真以及使用 TMS320C6416T DSK 的实时实现，将基于 MCM 的噪声估计方法的性能与基于动态分位数跟踪 (DQT) 和级联中位数 (CM) 的方法进行了比较。所有的比较都是针对语音质量进行的（主观上——平均意见得分，客观上——PESQ得分、对数似然比、加权频谱斜率距离、分段信噪比和信号失真的综合测量[公式：见文本]、背景侵入性 [公式：见文本]和整体失真[公式：见文本]）在 95% 的置信水平。内存消耗和执行时间等实时参数已用于实时实现，并对三种方法进行了比较。结果表明，对于不同的基于 SNR 的退化语音信号，改进的基于中值的级联噪声估计在 PESQ 得分方面是最好的，[公式：见文本]，[公式：见文本]，[公式：见文本] 和平均意见得分。另一方面，对于不同的基于噪声破坏的语音信号，它与原始 CM 相比表现良好。基于 MCM 的噪声估计的内存消耗和平均执行时间介于 DQT 和基于 CM 的方法之间。内存消耗和执行时间等实时参数已用于实时实现，并对三种方法进行了比较。结果表明，对于不同的基于 SNR 的退化语音信号，改进的基于中值的级联噪声估计在 PESQ 得分方面是最好的，[公式：见文本]，[公式：见文本]，[公式：见文本] 和平均意见得分。另一方面，对于不同的基于噪声破坏的语音信号，它与原始 CM 相比表现良好。基于 MCM 的噪声估计的内存消耗和平均执行时间介于 DQT 和基于 CM 的方法之间。内存消耗和执行时间等实时参数已用于实时实现，并对三种方法进行了比较。结果表明，对于不同的基于 SNR 的退化语音信号，改进的基于中值的级联噪声估计在 PESQ 得分方面是最好的，[公式：见文本]，[公式：见文本]，[公式：见文本] 和平均意见得分。另一方面，对于不同的基于噪声破坏的语音信号，它与原始 CM 相比表现良好。基于 MCM 的噪声估计的内存消耗和平均执行时间介于 DQT 和基于 CM 的方法之间。表明改进的基于中值的级联噪声估计在 PESQ 得分、[公式：见文本]、[公式：见文本]、[公式：见文本]和平均意见得分方面是最好的。另一方面，对于不同的基于噪声破坏的语音信号，它与原始 CM 相比表现良好。基于 MCM 的噪声估计的内存消耗和平均执行时间介于 DQT 和基于 CM 的方法之间。表明改进的基于中值的级联噪声估计在 PESQ 得分、[公式：见文本]、[公式：见文本]、[公式：见文本]和平均意见得分方面是最好的。另一方面，对于不同的基于噪声破坏的语音信号，它与原始 CM 相比表现良好。基于 MCM 的噪声估计的内存消耗和平均执行时间介于 DQT 和基于 CM 的方法之间。

更新日期：2019-04-08

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>