Speech emotion recognition using amplitude modulation parameters and a combined feature selection procedure

doi:10.1016/J.KNOSYS.2014.03.019

Journal Article

Arianna Mencattini, +6 more

- 01 Jun 2014

- Knowledge Based Systems

- Vol. 63, Iss: 1, pp 68-81

89

TL;DR: This study proposes the use of a PLS regression model, optimized according to specific feature selection procedures and trained on the Italian speech corpus EMOVO, suggesting a way to automatically label the corpus in terms of arousal and valence.

Abstract: Speech emotion recognition (SER) is a challenging framework in demanding human-machine interaction systems. Standard approaches based on the categorical model of emotions reach low performance, probably because they model emotions as distinct and independent affective states. Starting from the recently investigated assumption of the dimensional circumplex model of emotions, SER systems are structured as the prediction of valence and arousal on a continuous scale in a two-dimensional domain. In this study, we propose the use of a PLS regression model, optimized according to specific feature selection procedures and trained on the Italian speech corpus EMOVO, suggesting a way to automatically label the corpus in terms of arousal and valence. New speech features related to speech amplitude modulation, caused by the slowly varying articulatory motion, and standard features extracted from the pitch contour have been included in the regression model. An average coefficient of determination R² of 0.72 (maximum of 0.95 for fear and minimum of 0.60 for sadness) is obtained for the female model, and an R² of 0.81 (maximum of 0.89 for anger and minimum of 0.71 for joy) is obtained for the male model, over the seven primary emotions (including the neutral state).
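
The coefficient of determination quoted in the abstract can be illustrated with a minimal sketch. It assumes the usual definition, R² = 1 − SS_res / SS_tot, where SS_res is the sum of squared prediction errors and SS_tot is the sum of squared deviations of the reference values from their mean. The function name `r_squared` and the sample arousal values are hypothetical; they are not taken from the paper or from the EMOVO corpus, and the paper's own evaluation protocol may differ.

```python
from statistics import fmean


def r_squared(y_true, y_pred):
    """Coefficient of determination, assuming R^2 = 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or len(y_true) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_true = fmean(y_true)
    # Sum of squared residuals between reference and predicted values.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares of the reference values around their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when the reference values are constant")
    return 1.0 - ss_res / ss_tot


# Hypothetical arousal labels and regression outputs (made-up numbers).
reference = [0.1, 0.4, 0.5, 0.8, 0.9]
predicted = [0.2, 0.35, 0.55, 0.7, 0.95]
score = r_squared(reference, predicted)
print(f"R^2 = {score:.3f}")
```

A value of 1 means the predictions match the reference labels exactly, while a model that always outputs the mean of the reference labels scores 0. Averaging such per-emotion scores would give a summary figure comparable in kind to the averages reported above.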

Citations

Journal Article•10.1007/S10772-018-9491-Z

Databases, features and classifiers for speech emotion recognition: a review

Monorama Swain, +2 more

- 01 Mar 2018

- International Journal of Speech Technolo...

TL;DR: In this study, the available literature on various databases, features, and classifiers has been taken into consideration for speech emotion recognition across assorted languages.

333

Journal Article•10.1007/S10489-021-02550-9

A comprehensive survey on feature selection in the various fields of machine learning

Pradip Dhal, +1 more

- 23 Jul 2021

- Applied Intelligence

TL;DR: A descriptive survey of feature selection (FS) and its associated real-world problem domains, intended to convey the main idea of FS work and how FS applies across those domains.

276

Journal Article•10.1016/J.NEUCOM.2017.07.050

Speech emotion recognition based on feature selection and extreme learning machine decision tree

Zhen-Tao Liu, +5 more

- 17 Jan 2018

- Neurocomputing

TL;DR: A feature selection method based on correlation analysis and Fisher is proposed, which can remove redundant features that are closely correlated with each other, and which would make it possible to realize the interaction between speaker-independent and computer/robot in the future.

261

Journal Article•10.1016/J.APACOUST.2018.11.028

A novel feature selection method for speech emotion recognition

Turgut Özseven

- 01 Mar 2019

- Applied Acoustics

TL;DR: A new statistical feature selection method is proposed, based on the changes that emotions cause in acoustic features; it significantly reduces the number of features while increasing classification success.

157

Journal Article•10.1016/J.CSL.2019.06.001

Preserving privacy in speaker and speech characterisation

Andreas Nautsch, +23 more

- 01 Nov 2019

- Computer Speech & Language

TL;DR: The requirements for effective privacy preservation are established; generic cryptography-based solutions are reviewed, followed by specific techniques applicable to speaker characterisation and speech characterisation (biometric and non-biometric applications); and common empirical evaluation metrics for assessing privacy-preserving technologies for speech data are outlined.

144

References

Journal Article•10.1119/1.1972842

Handbook of Mathematical Functions

Milton Abramowitz, +2 more

- 01 Feb 1966

- American Journal of Physics

51.2K

Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

- 01 Jan 1995

TL;DR: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?

46K

Book

Applied Regression Analysis

Norman R. Draper, +1 more

- 01 Jan 1966

TL;DR: In this article, the Straight Line Case is used to fit a straight line by least squares, and the Durbin-Watson Test is used for checking the straight line fit.

19K

Book