To read this content please select one of the options below:

A novel speech emotion recognition model using mean update of particle swarm and whale optimization-based deep belief network

Rajasekhar B (Gudlavalleru Engineering College, Gudlavalleru, Krishna, India)
Kamaraju M (Gudlavalleru Engineering College, Gudlavalleru, Krishna, India)
Sumalatha V (Jawaharlal Nehru Technological University, Ananthapur, India)

Data Technologies and Applications

ISSN: 2514-9288

Article publication date: 17 April 2020

Issue publication date: 7 July 2020

180

Abstract

Purpose

Nowadays, the speech emotion recognition (SER) model has enhanced as the main research topic in various fields including human–computer interaction as well as speech processing. Generally, it focuses on utilizing the models of machine learning for predicting the exact emotional status from speech. The advanced SER applications go successful in affective computing and human–computer interaction, which is making as the main component of computer system's next generation. This is because the natural human machine interface could grant the automatic service provisions, which need a better appreciation of user's emotional states.

Design/methodology/approach

This paper implements a new SER model that incorporates both gender and emotion recognition. Certain features are extracted and subjected for classification of emotions. For this, this paper uses deep belief network DBN model.

Findings

Through the performance analysis, it is observed that the developed method attains high accuracy rate (for best case) when compared to other methods, and it is 1.02% superior to whale optimization algorithm (WOA), 0.32% better from firefly (FF), 23.45% superior to particle swarm optimization (PSO) and 23.41% superior to genetic algorithm (GA). In case of worst scenario, the mean update of particle swarm and whale optimization (MUPW) in terms of accuracy is 15.63, 15.98, 16.06% and 16.03% superior to WOA, FF, PSO and GA, respectively. Under the mean case, the performance of MUPW is high, and it is 16.67, 10.38, 22.30 and 22.47% better from existing methods like WOA, FF, PSO, as well as GA, respectively.

Originality/value

This paper presents a new model for SER that aids both gender and emotion recognition. For the classification purpose, DBN is used and the weight of DBN is used and this is the first work uses MUPW algorithm for finding the optimal weight of DBN model.

Keywords

Acknowledgements

The author would like to thank M. Kamaraju (guide), V. Sumalatha (co-guide) for the suggestion of framework in a novel speech emotion recognition model.Funding: None

Citation

B, R., M, K. and V, S. (2020), "A novel speech emotion recognition model using mean update of particle swarm and whale optimization-based deep belief network", Data Technologies and Applications, Vol. 54 No. 3, pp. 297-322. https://doi.org/10.1108/DTA-07-2019-0120

Publisher

:

Emerald Publishing Limited

Copyright © 2020, Emerald Publishing Limited

Related articles