当前位置: X-MOL 学术Speech Commun. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers
Speech Communication ( IF 3.2 ) Pub Date : 2019-12-13 , DOI: 10.1016/j.specom.2019.12.001
Mehmet Berkehan Akçay , Kaya Oğuz

Speech is the most natural way of expressing ourselves as humans. It is only natural then to extend this communication medium to computer applications. We define speech emotion recognition (SER) systems as a collection of methodologies that process and classify speech signals to detect the embedded emotions. SER is not a new field, it has been around for over two decades, and has regained attention thanks to the recent advancements. These novel studies make use of the advances in all fields of computing and technology, making it necessary to have an update on the current methodologies and techniques that make SER possible. We have identified and discussed distinct areas of SER, provided a detailed survey of current literature of each, and also listed the current challenges.



中文翻译:

语音情感识别:情感模型,数据库,功能,预处理方法,支持方式和分类器

言语是表达自己作为人类的最自然的方式。然后,将这种通信介质扩展到计算机应用程序是很自然的。我们将语音情感识别(SER)系统定义为处理和分类语音信号以检测嵌入的情感的方法论的集合。SER并不是一个新领域,它已经存在了二十多年,并且由于最近的进展而重新受到关注。这些新颖的研究利用了计算和技术所有领域的进步,因此有必要对使SER成为可能的当前方法和技术进行更新。我们已经确定并讨论了SER的不同领域,提供了有关每个领域的最新文献的详细调查,还列出了当前的挑战。

更新日期:2019-12-13
down
wechat
bug