Speech Emotion Recognition (SER) is a recent field of research that aims at identifying the emotional state of a speaker through a collection of machine learning and pattern recognition techniques. Features based on linear source-filter models have so far characterized emotional content in speech. However, the presence of nonlinear and chaotic phenomena in speech generation have been widely proven in literature. In this work, recurrence properties of vowels are used to describe nonlinear dynamics of speech with different emotional contents. An automatic vowel extraction module has been developed to extract vowel segments from a set of spoken sentences of the publicly available German Berlin Emotional Speech Database (EmoDB). Recurrence Plots (RPs) and Recurrence Quantitative Analysis (RQA) have been used to explore the dynamic behavior of six basic emotions (anger, boredom, fear, happiness, neutral, sadness). Statistical tests have been performed to compare the six groups and check possible differences between them. The results are promising since some RQA measures are able to capture the key aspects of each emotion

Exploring Recurrence Properties of Vowels for Analysis of Emotions in Speech / Lombardi, Angela; Guccione, Pietro; Guaragnella, Cataldo. - In: SENSORS & TRANSDUCERS. - ISSN 2306-8515. - 204:9(2016), pp. 45-57.

Exploring Recurrence Properties of Vowels for Analysis of Emotions in Speech

LOMBARDI, Angela;GUCCIONE, Pietro;GUARAGNELLA, Cataldo
2016-01-01

Abstract

Speech Emotion Recognition (SER) is a recent field of research that aims at identifying the emotional state of a speaker through a collection of machine learning and pattern recognition techniques. Features based on linear source-filter models have so far characterized emotional content in speech. However, the presence of nonlinear and chaotic phenomena in speech generation have been widely proven in literature. In this work, recurrence properties of vowels are used to describe nonlinear dynamics of speech with different emotional contents. An automatic vowel extraction module has been developed to extract vowel segments from a set of spoken sentences of the publicly available German Berlin Emotional Speech Database (EmoDB). Recurrence Plots (RPs) and Recurrence Quantitative Analysis (RQA) have been used to explore the dynamic behavior of six basic emotions (anger, boredom, fear, happiness, neutral, sadness). Statistical tests have been performed to compare the six groups and check possible differences between them. The results are promising since some RQA measures are able to capture the key aspects of each emotion
2016
Exploring Recurrence Properties of Vowels for Analysis of Emotions in Speech / Lombardi, Angela; Guccione, Pietro; Guaragnella, Cataldo. - In: SENSORS & TRANSDUCERS. - ISSN 2306-8515. - 204:9(2016), pp. 45-57.
File in questo prodotto:
File Dimensione Formato  
P_2859(3).pdf

accesso aperto

Tipologia: Versione editoriale
Licenza: Creative commons
Dimensione 2.47 MB
Formato Adobe PDF
2.47 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/92944
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact