TY - GEN
T1 - An Entropy-Based Computational Classifier for Positive and Negative Emotions in Voice Signals
AU - Herrera-Ortiz, A. D.
AU - Yáñez-Casas, G. A.
AU - Hernández-Gómez, J. J.
AU - Orozco-del-Castillo, M. G.
AU - Mata-Rivera, M. F.
AU - de la Rosa-Rábago, R.
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - The detection, classification and analysis of emotions has been an intense research area in recent years. Most techniques applied to emotion recognition come from Artificial Intelligence, such as neural networks, machine learning and deep learning, which focus on the training and learning of models. In this work, we propose a rather different approach to the detection and classification of emotion within voice speech, regarding sound files as information sources in the context of Shannon’s information theory. By computing the entropy content of each audio recording, we find that emotion in speech can be classified into two subsets: positive and negative. To perform the entropy computation, we first apply the Fourier transform to the digital audio recordings, bearing in mind that the voice signal has a bandwidth between 100 Hz and 4 kHz. The discrete Fourier spectrum is then used to define the alphabet, and the occurrence probabilities of each symbol (frequency) are used to compute the entropy for memoryless information sources. A dataset consisting of 1,440 voice recordings performed by professional voice actors was analysed through this methodology, showing that in most cases this simple approach is capable of performing the positive/negative emotion classification.
AB - The detection, classification and analysis of emotions has been an intense research area in recent years. Most techniques applied to emotion recognition come from Artificial Intelligence, such as neural networks, machine learning and deep learning, which focus on the training and learning of models. In this work, we propose a rather different approach to the detection and classification of emotion within voice speech, regarding sound files as information sources in the context of Shannon’s information theory. By computing the entropy content of each audio recording, we find that emotion in speech can be classified into two subsets: positive and negative. To perform the entropy computation, we first apply the Fourier transform to the digital audio recordings, bearing in mind that the voice signal has a bandwidth between 100 Hz and 4 kHz. The discrete Fourier spectrum is then used to define the alphabet, and the occurrence probabilities of each symbol (frequency) are used to compute the entropy for memoryless information sources. A dataset consisting of 1,440 voice recordings performed by professional voice actors was analysed through this methodology, showing that in most cases this simple approach is capable of performing the positive/negative emotion classification.
KW - Computational entropy
KW - Emotion analysis
KW - Fourier transform
KW - Frequency alphabet
KW - Information source
KW - Information theory
KW - Pattern recognition
KW - Sound
KW - Speech
KW - Voice signals
UR - http://www.scopus.com/inward/record.url?scp=85142768169&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-18082-8_7
DO - 10.1007/978-3-031-18082-8_7
M3 - Conference contribution
AN - SCOPUS:85142768169
SN - 9783031180811
T3 - Communications in Computer and Information Science
SP - 100
EP - 121
BT - Telematics and Computing - 11th International Congress, WITCOM 2022, Proceedings
A2 - Mata-Rivera, Miguel Félix
A2 - Zagal-Flores, Roberto
A2 - Barria-Huidobro, Cristian
PB - Springer Science and Business Media Deutschland GmbH
T2 - 11th International Congress of Telematics and Computing, WITCOM 2022
Y2 - 7 November 2022 through 11 November 2022
ER -