Environmental sound recognition by measuring significant changes in the spectral entropy

Jessica Beltrán-Márquez, Edgar Chávez, Jesús Favela

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

6 Citas (Scopus)

Resumen

Automatic identification of activities can be used to provide information to caregivers of persons with dementia for identifying assistance needs. Environmental audio provides significant and representative information of the context, making microphones a choice to identify activities automatically. However, in real situations, the audio captured by microphones comes from overlapping sound sources, making its identification a challenge for audio analysis and retrieval. In this paper we propose a succinct representation of the signal by measuring the multiband spectral entropy of the signal frame by frame, followed by a cosine transform and binary codification, we call this the Cosine Multi-Band Spectral Entropy Signature (CMBSES). To test our proposal, we created a database of a mix-up of triples from a collection of nine environmental sounds in four different signal-to-noise ratios (SNR). We codified both the original sounds and the triples and then searched all the original sounds in the mix-up collection. To establish a ground truth we also tested the same database with 48 people of assorted ages. Our feature extraction outperforms the state-of-the-art Mel Frequency Cepstral Coefficients (MFCC) and it also surpass humans in the experiment.

Idioma originalInglés
Título de la publicación alojadaPattern Recognition - 4th Mexican Conference, MCPR 2012, Proceedings
Páginas334-343
Número de páginas10
DOI
EstadoPublicada - 2012
Publicado de forma externa
Evento4th Mexican Conference on Pattern Recognition, MCPR 2012 - Huatulco, México
Duración: 27 jun. 201230 jun. 2012

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen7329 LNCS
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia4th Mexican Conference on Pattern Recognition, MCPR 2012
País/TerritorioMéxico
CiudadHuatulco
Período27/06/1230/06/12

Huella

Profundice en los temas de investigación de 'Environmental sound recognition by measuring significant changes in the spectral entropy'. En conjunto forman una huella única.

Citar esto