Using adaptive filter to increase automatic speech recognition rate in a digit corpus

José Luis Oropeza Rodríguez; Sergio Suárez Guerra; Luis Pastor Sánchez Fernández

Using adaptive filter to increase automatic speech recognition rate in a digit corpus

José Luis Oropeza Rodríguez, Sergio Suárez Guerra, Luis Pastor Sánchez Fernández

Centro de Investigación en Computación (CIC)

Producción científica: Capítulo del libro/informe/acta de congreso › Contribución a la conferencia › revisión exhaustiva

1 Cita (Scopus)

Resumen

This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.

Idioma original	Inglés
Título de la publicación alojada	Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings
Páginas	78-87
Número de páginas	10
Estado	Publicada - 2007
Evento	12th Iberoamerican Congress on Pattern Recognition, CIARP 2007 - Vina del Mar-Valparaiso, Chile Duración: 13 nov. 2007 → 16 nov. 2007

Serie de la publicación

Nombre	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen	4756 LNCS
ISSN (versión impresa)	0302-9743
ISSN (versión digital)	1611-3349

Conferencia

Conferencia	12th Iberoamerican Congress on Pattern Recognition, CIARP 2007
País/Territorio	Chile
Ciudad	Vina del Mar-Valparaiso
Período	13/11/07 → 16/11/07

Otros archivos y enlaces

Enlace a la publicación en Scopus

Citar esto

Rodríguez, J. L. O., Guerra, S. S., & Fernández, L. P. S. (2007). Using adaptive filter to increase automatic speech recognition rate in a digit corpus. En Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings (pp. 78-87). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4756 LNCS).

Rodríguez, José Luis Oropeza ; Guerra, Sergio Suárez ; Fernández, Luis Pastor Sánchez. / Using adaptive filter to increase automatic speech recognition rate in a digit corpus. Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. 2007. pp. 78-87 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{67f1c0c7fd7c464e9a05d230ef29ab15,

title = "Using adaptive filter to increase automatic speech recognition rate in a digit corpus",

abstract = "This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.",

keywords = "Adaptative filters, Automatic speech recognition, Continuous density hidden Markov models, Gaussian mixtures and noisy speech signals",

author = "Rodr{\'i}guez, {Jos{\'e} Luis Oropeza} and Guerra, {Sergio Su{\'a}rez} and Fern{\'a}ndez, {Luis Pastor S{\'a}nchez}",

year = "2007",

language = "Ingl{\'e}s",

isbn = "9783540767244",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "78--87",

booktitle = "Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings",

note = "12th Iberoamerican Congress on Pattern Recognition, CIARP 2007 ; Conference date: 13-11-2007 Through 16-11-2007",

}

Rodríguez, JLO, Guerra, SS & Fernández, LPS 2007, Using adaptive filter to increase automatic speech recognition rate in a digit corpus. En Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4756 LNCS, pp. 78-87, 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Vina del Mar-Valparaiso, Chile, 13/11/07.

Using adaptive filter to increase automatic speech recognition rate in a digit corpus. / Rodríguez, José Luis Oropeza; Guerra, Sergio Suárez; Fernández, Luis Pastor Sánchez.
Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. 2007. p. 78-87 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4756 LNCS).

Producción científica: Capítulo del libro/informe/acta de congreso › Contribución a la conferencia › revisión exhaustiva

TY - GEN

T1 - Using adaptive filter to increase automatic speech recognition rate in a digit corpus

AU - Rodríguez, José Luis Oropeza

AU - Guerra, Sergio Suárez

AU - Fernández, Luis Pastor Sánchez

PY - 2007

Y1 - 2007

N2 - This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.

AB - This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.

KW - Adaptative filters

KW - Automatic speech recognition

KW - Continuous density hidden Markov models

KW - Gaussian mixtures and noisy speech signals

UR - http://www.scopus.com/inward/record.url?scp=38449107490&partnerID=8YFLogxK

M3 - Contribución a la conferencia

SN - 9783540767244

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 78

EP - 87

BT - Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings

T2 - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007

Y2 - 13 November 2007 through 16 November 2007

ER -

Rodríguez JLO, Guerra SS, Fernández LPS. Using adaptive filter to increase automatic speech recognition rate in a digit corpus. En Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. 2007. p. 78-87. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

Using adaptive filter to increase automatic speech recognition rate in a digit corpus

Resumen

Serie de la publicación

Conferencia

Otros archivos y enlaces

Huella

Citar esto