Using adaptive filter and wavelets to increase automatic speech recognition rate in noisy environment

José Luis Oropeza Rodríguez, Sergio Suárez Guerra

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. In the experiments, we used several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm) and two different wavelets (Haar and Daubechies). With LMS we obtained an error rate lower than if it was not present and it was better than wavelets employed for this experiment of Automatic Speech Recognition. For decreasing the error rate we trained with 50% of contaminated and originals signals to the ASR system. The results showed in this paper are focused to try analyses the ASR performance in a noisy environment and to demonstrate that if we are controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and we can decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus mentioned before was employed and LMS algorithm was used. Haar wavelet level 1 reached up the most important results as an alternative to LMS algorithm, but only when the noise level was 40 dB and using original corpus.

Idioma originalInglés
Título de la publicación alojadaMICAI 2007
Subtítulo de la publicación alojadaAdvances in Artificial Intelligence - 6th Mexican International Conference on Artificial Intelligence, Proceedings
Páginas1015-1024
Número de páginas10
EstadoPublicada - 2007
Evento6th Mexican International Conference on Artificial Intelligence, MICAI 2007 - Aguascalientes, México
Duración: 4 nov. 200710 nov. 2007

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen4827 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia6th Mexican International Conference on Artificial Intelligence, MICAI 2007
País/TerritorioMéxico
CiudadAguascalientes
Período4/11/0710/11/07

Huella

Profundice en los temas de investigación de 'Using adaptive filter and wavelets to increase automatic speech recognition rate in noisy environment'. En conjunto forman una huella única.

Citar esto