当前位置: X-MOL 学术Radioelectron. Commun. Syst. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Minimum of Information Divergence Criterion for Signals with Tuning to Speaker Voice in Automatic Speech Recognition
Radioelectronics and Communications Systems Pub Date : 2020-01-01 , DOI: 10.3103/s0735272720010045
V. V. Savchenko

It is considered a problem of automatic speech recognition at basic, phonetic level of speech signal processing. It is researched a problem of noise-immunity increase. For its solution it is proposed a criterion of minimum information divergence of the signals with tuning to a speaker voice and automatic scaling of speech template to thin structure of observed (current) speech frame. An example of its practical realization is considered, efficiency characteristics are researched. Applying the author’s software we carry out an experiment and obtain qualitative estimations of threshold signals gain in case of application of proposed criterion. It is shown than this gain can be 10 dB and greater under certain conditions. Obtained results and drawn conclusions are intended it to their application for development and modernization of existent systems and techniques of automatic processing and recognition of speech intended it to operation in conditions of intensive noise effect.

中文翻译:

自动语音识别中调到说话人语音信号的最小信息发散准则

它被认为是语音信号处理的基本语音级别的自动语音识别问题。研究了噪声抗扰度增加的问题。对于它的解决方案,它提出了一个信号最小信息发散的标准,调谐到说话者的声音和语音模板的自动缩放到观察到的(当前)语音帧的细结构。考虑了其实际实现的一个例子,研究了效率特性。应用作者的软件,我们进行了一项实验,并在应用建议标准的情况下获得阈值信号增益的定性估计。结果表明,在某些条件下,该增益可以达到 10 dB 或更大。
更新日期:2020-01-01
down
wechat
bug