Using adaptive filter to increase automatic speech recognition rate in a digit corpus

José Luis Oropeza Rodríguez; Sergio Suárez Guerra; Luis Pastor Sánchez Fernández

Using adaptive filter to increase automatic speech recognition rate in a digit corpus

José Luis Oropeza Rodríguez, Sergio Suárez Guerra, Luis Pastor Sánchez Fernández

Centro de Investigación en Computación (CIC)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.

Original language	English
Title of host publication	Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings
Pages	78-87
Number of pages	10
State	Published - 2007
Event	12th Iberoamerican Congress on Pattern Recognition, CIARP 2007 - Vina del Mar-Valparaiso, Chile Duration: 13 Nov 2007 → 16 Nov 2007

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	4756 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	12th Iberoamerican Congress on Pattern Recognition, CIARP 2007
Country/Territory	Chile
City	Vina del Mar-Valparaiso
Period	13/11/07 → 16/11/07

Keywords

Adaptative filters
Automatic speech recognition
Continuous density hidden Markov models
Gaussian mixtures and noisy speech signals

Cite this

Rodríguez, J. L. O., Guerra, S. S., & Fernández, L. P. S. (2007). Using adaptive filter to increase automatic speech recognition rate in a digit corpus. In Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings (pp. 78-87). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4756 LNCS).

Rodríguez, José Luis Oropeza ; Guerra, Sergio Suárez ; Fernández, Luis Pastor Sánchez. / Using adaptive filter to increase automatic speech recognition rate in a digit corpus. Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. 2007. pp. 78-87 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{67f1c0c7fd7c464e9a05d230ef29ab15,

title = "Using adaptive filter to increase automatic speech recognition rate in a digit corpus",

abstract = "This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.",

keywords = "Adaptative filters, Automatic speech recognition, Continuous density hidden Markov models, Gaussian mixtures and noisy speech signals",

author = "Rodr{\'i}guez, {Jos{\'e} Luis Oropeza} and Guerra, {Sergio Su{\'a}rez} and Fern{\'a}ndez, {Luis Pastor S{\'a}nchez}",

year = "2007",

language = "Ingl{\'e}s",

isbn = "9783540767244",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "78--87",

booktitle = "Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings",

note = "12th Iberoamerican Congress on Pattern Recognition, CIARP 2007 ; Conference date: 13-11-2007 Through 16-11-2007",

}

Rodríguez, JLO, Guerra, SS & Fernández, LPS 2007, Using adaptive filter to increase automatic speech recognition rate in a digit corpus. in Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 4756 LNCS, pp. 78-87, 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Vina del Mar-Valparaiso, Chile, 13/11/07.

Using adaptive filter to increase automatic speech recognition rate in a digit corpus. / Rodríguez, José Luis Oropeza; Guerra, Sergio Suárez; Fernández, Luis Pastor Sánchez.
Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. 2007. p. 78-87 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4756 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Using adaptive filter to increase automatic speech recognition rate in a digit corpus

AU - Rodríguez, José Luis Oropeza

AU - Guerra, Sergio Suárez

AU - Fernández, Luis Pastor Sánchez

PY - 2007

Y1 - 2007

N2 - This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.

AB - This paper shows results obtained in the Automatic Speech Recognition (ASR) task for a corpus of digits speech files with a determinate noise level immerse. The experiments realized treated with several speech files that contained Gaussian noise. We used HTK (Hidden Markov Model Toolkit) software of Cambridge University in the experiments. The noise level added to the speech signals was varying from fifteen to forty dB increased by a step of 5 units. We used an adaptive filtering to reduce the level noise (it was based in the Least Measure Square -LMS- algorithm). With LMS we obtained an error rate lower than if it was not present. It was obtained because of we trained with 50% of contaminated and originals signals to the ASR. The results showed in this paper to analyze the ASR performance in a noisy environment and to demonstrate that if we have controlling the noise level and if we know the application where it is going to work, then we can obtain a better response in the ASR tasks. Is very interesting to count with these results because speech signal that we can find in a real experiment (extracted from an environment work, i.e.), could be treated with these technique and decrease the error rate obtained. Finally, we report a recognition rate of 99%, 97.5% 96%, 90.5%, 81% and 78.5% obtained from 15, 20, 25, 30, 35 and 40 noise levels, respectively when the corpus that we mentioned above was employed. Finally, we made experiments with a total of 2600 sentences (between noisy and filtered sentences) of speech signal.

KW - Adaptative filters

KW - Automatic speech recognition

KW - Continuous density hidden Markov models

KW - Gaussian mixtures and noisy speech signals

UR - http://www.scopus.com/inward/record.url?scp=38449107490&partnerID=8YFLogxK

M3 - Contribución a la conferencia

SN - 9783540767244

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 78

EP - 87

BT - Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings

T2 - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007

Y2 - 13 November 2007 through 16 November 2007

ER -

Rodríguez JLO, Guerra SS, Fernández LPS. Using adaptive filter to increase automatic speech recognition rate in a digit corpus. In Progress in Pattern Recognition, Image Analysis and Applications - 12th Iberoamerican Congress on Pattern Recognition, CIARP 2007, Proceedings. 2007. p. 78-87. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

Using adaptive filter to increase automatic speech recognition rate in a digit corpus

Abstract

Publication series

Conference

Keywords

Other files and links

Fingerprint

Cite this