当前位置:
X-MOL 学术
›
Opt. Mem. Neural Networks
›
论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Performance Optimization of Speech Recognition System with Deep Neural Network Model
Optical Memory and Neural Networks Pub Date : 2019-02-01 , DOI: 10.3103/s1060992x18040094 Wei Guan
中文翻译:
深度神经网络模型的语音识别系统性能优化
更新日期:2019-02-01
Optical Memory and Neural Networks Pub Date : 2019-02-01 , DOI: 10.3103/s1060992x18040094 Wei Guan
Abstract
With the development of internet, man-machine interaction has tended to be more important. Precise speech recognition has become an important means to achieve man-machine interaction. In this study, deep neural network model was used to enhance speech recognition performance. Feedforward fully connected deep neural network, time-delay neural network, convolutional neural network and feedforward sequence memory neural network were studied, and their speech recognition performance was studied by comparing their acoustic models. Moreover, the recognition performance of the model after adding different dimension human voice features was tested. The results showed that the performance of the speech recognition system could be improved effectively by using the deep neural network model, and the performance of feedforward sequence memory neural network was the best, followed by deep neural network, time-delay neural network and convolutional neural network. Different extraction features had different improvement effects on model performance. The performance of the model which was added with Fbank extraction features was superior to that added with Mel-frequency cepstrum coefficient (MFCC) extraction feature. The model performance improved after the addition of vocal characteristics. Different models had different vocal characteristic dimensions.中文翻译:
深度神经网络模型的语音识别系统性能优化