Decision tree SVM model with Fisher feature selection for speech emotion recognition,EURASIP Journal on Audio, Speech, and Music Processing

当前位置： X-MOL 学术 › EURASIP J. Audio Speech Music Proc. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Decision tree SVM model with Fisher feature selection for speech emotion recognition
EURASIP Journal on Audio, Speech, and Music Processing ( IF 1.7 ) Pub Date : 2019-01-07 , DOI: 10.1186/s13636-018-0145-5
Linhui Sun , Sheng Fu , Fu Wang

The overall recognition rate will reduce due to the increase of emotional confusion in multiple speech emotion recognition. To solve the problem, we propose a speech emotion recognition method based on the decision tree support vector machine (SVM) model with Fisher feature selection. At the stage of feature selection, Fisher criterion is used to filter out the feature parameters of higher distinguish ability. At the emotion classification stage, an algorithm is proposed to determine the structure of decision tree. The decision tree SVM can realize the two-step classification of the first rough classification and the fine classification. Thus the redundant parameters are eliminated and the performance of emotion recognition is improved. In this method, the decision tree SVM framework is firstly established by calculating the confusion degree of emotion, and then the features with higher distinguish ability are selected for each SVM of the decision tree according to Fisher criterion. Finally, speech emotion recognition is realized based on this model. The decision tree SVM with Fisher feature selection on CASIA Chinese emotion speech corpus and Berlin speech corpus are constructed to validate the effectiveness of our framework. The experimental results show that the average emotion recognition rate based on the proposed method is 9% higher than traditional SVM classification method on CASIA, and 8.26% higher on Berlin speech corpus. It is verified that the proposed method can effectively reduce the emotional confusion and improve the emotion recognition rate.

中文翻译：

带有Fisher特征选择的决策树SVM模型用于语音情感识别

由于多语音情感识别中情感混淆的增加，整体识别率会降低。为了解决这个问题，我们提出了一种基于决策树支持向量机（SVM）模型和Fisher特征选择的语音情感识别方法。在特征选择阶段，利用Fisher准则过滤掉区分能力较高的特征参数。在情感分类阶段，提出了一种确定决策树结构的算法。决策树支持向量机可以实现第一次粗分类和细分类两步分类。从而消除了冗余参数，提高了情感识别的性能。在这种方法中，首先通过计算情感的混淆度建立决策树SVM框架，然后根据Fisher准则为决策树的每个SVM选择区分能力较高的特征。最后，基于该模型实现了语音情感识别。在CASIA中文情感语料库和柏林语料库上构建了具有Fisher特征选择的决策树SVM，以验证我们框架的有效性。实验结果表明，基于该方法的平均情感识别率在CASIA上比传统SVM分类方法高9%，在柏林语音语料库上高8.26%。验证了所提出的方法能够有效减少情感混淆，提高情感识别率。然后根据Fisher准则为决策树的每个SVM选择区分能力较高的特征。最后，基于该模型实现了语音情感识别。在CASIA中文情感语料库和柏林语料库上构建了具有Fisher特征选择的决策树SVM，以验证我们框架的有效性。实验结果表明，基于该方法的平均情感识别率在CASIA上比传统SVM分类方法高9%，在柏林语音语料库上高8.26%。验证了所提出的方法能够有效减少情感混淆，提高情感识别率。然后根据Fisher准则为决策树的每个SVM选择区分能力较高的特征。最后，基于该模型实现了语音情感识别。在CASIA中文情感语料库和柏林语料库上构建了具有Fisher特征选择的决策树SVM，以验证我们框架的有效性。实验结果表明，基于该方法的平均情感识别率在CASIA上比传统SVM分类方法高9%，在柏林语音语料库上高8.26%。验证了所提出的方法能够有效减少情感混淆，提高情感识别率。基于该模型实现语音情感识别。在CASIA中文情感语料库和柏林语料库上构建了具有Fisher特征选择的决策树SVM，以验证我们框架的有效性。实验结果表明，基于该方法的平均情感识别率在CASIA上比传统SVM分类方法高9%，在柏林语音语料库上高8.26%。验证了所提出的方法能够有效减少情感混淆，提高情感识别率。基于该模型实现语音情感识别。在CASIA中文情感语料库和柏林语料库上构建了具有Fisher特征选择的决策树SVM，以验证我们框架的有效性。实验结果表明，基于该方法的平均情感识别率在CASIA上比传统SVM分类方法高9%，在柏林语音语料库上高8.26%。验证了所提出的方法能够有效减少情感混淆，提高情感识别率。实验结果表明，基于该方法的平均情感识别率在CASIA上比传统SVM分类方法高9%，在柏林语音语料库上高8.26%。验证了所提出的方法能够有效减少情感混淆，提高情感识别率。实验结果表明，基于该方法的平均情感识别率在CASIA上比传统SVM分类方法高9%，在柏林语音语料库上高8.26%。验证了所提出的方法能够有效减少情感混淆，提高情感识别率。

更新日期：2019-01-07

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文