New parameters for automatic speech recognition based on the mammalian cochlea model using resonance analysis

José Luis Oropeza Rodríguez

New parameters for automatic speech recognition based on the mammalian cochlea model using resonance analysis

Centro de Investigación en Computación (CIC)

Producción científica: Contribución a una revista › Artículo de la conferencia › revisión exhaustiva

Resumen

Undoubtedly the compact representation by a set of Mel Frequency Cepstrum Coefficients (MFCC) has been used satisfactorily for ASR [9]. The cochlea is an organ, in humans or mammalians that converts the frequency perceived by the ear in punctual stimulation to excite the nerve auditory that receives a set of stimulus that comes from speech sound pressure. A new approach is proposed that considers this phenomenon to construct the bank filter in our parametric representation. Then we substitute the distribution of the bank filter in the Mel scale function for a different distribution that depends of the inner ear response to the stimulus that it receives. The place theory is used which achieves a 99.8% performance. Finally, this paper compares the performance of different acoustic representations in Continuous Automatic Speech Recognition system (CASRs) based on words. The cochlea operation is explained that permits obtaining a model and we will show that one alternative solution to the model based on fluid mechanical proposed by Lesser and Berkley [19], can be obtained if resonance analysis weather Fourier series is used as a solution.

Idioma original	Inglés
Páginas (desde-hasta)	1732-1736
Número de páginas	5
Publicación	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Estado	Publicada - 2013
Evento	14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, Francia Duración: 25 ago. 2013 → 29 ago. 2013

Otros archivos y enlaces

Enlace a la publicación en Scopus

Citar esto

@article{abcaabf3fd5d4d21ab37c1148209841d,

title = "New parameters for automatic speech recognition based on the mammalian cochlea model using resonance analysis",

abstract = "Undoubtedly the compact representation by a set of Mel Frequency Cepstrum Coefficients (MFCC) has been used satisfactorily for ASR [9]. The cochlea is an organ, in humans or mammalians that converts the frequency perceived by the ear in punctual stimulation to excite the nerve auditory that receives a set of stimulus that comes from speech sound pressure. A new approach is proposed that considers this phenomenon to construct the bank filter in our parametric representation. Then we substitute the distribution of the bank filter in the Mel scale function for a different distribution that depends of the inner ear response to the stimulus that it receives. The place theory is used which achieves a 99.8% performance. Finally, this paper compares the performance of different acoustic representations in Continuous Automatic Speech Recognition system (CASRs) based on words. The cochlea operation is explained that permits obtaining a model and we will show that one alternative solution to the model based on fluid mechanical proposed by Lesser and Berkley [19], can be obtained if resonance analysis weather Fourier series is used as a solution.",

keywords = "Cochlea operation, Place theory and bank filter, Speech recognition",

author = "Rodr{\'i}guez, {Jos{\'e} Luis Oropeza}",

year = "2013",

language = "Ingl{\'e}s",

pages = "1732--1736",

journal = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",

issn = "2308-457X",

note = "14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 ; Conference date: 25-08-2013 Through 29-08-2013",

}

New parameters for automatic speech recognition based on the mammalian cochlea model using resonance analysis. / Rodríguez, José Luis Oropeza.
En: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2013, p. 1732-1736.

Producción científica: Contribución a una revista › Artículo de la conferencia › revisión exhaustiva

TY - JOUR

T1 - New parameters for automatic speech recognition based on the mammalian cochlea model using resonance analysis

AU - Rodríguez, José Luis Oropeza

PY - 2013

Y1 - 2013

N2 - Undoubtedly the compact representation by a set of Mel Frequency Cepstrum Coefficients (MFCC) has been used satisfactorily for ASR [9]. The cochlea is an organ, in humans or mammalians that converts the frequency perceived by the ear in punctual stimulation to excite the nerve auditory that receives a set of stimulus that comes from speech sound pressure. A new approach is proposed that considers this phenomenon to construct the bank filter in our parametric representation. Then we substitute the distribution of the bank filter in the Mel scale function for a different distribution that depends of the inner ear response to the stimulus that it receives. The place theory is used which achieves a 99.8% performance. Finally, this paper compares the performance of different acoustic representations in Continuous Automatic Speech Recognition system (CASRs) based on words. The cochlea operation is explained that permits obtaining a model and we will show that one alternative solution to the model based on fluid mechanical proposed by Lesser and Berkley [19], can be obtained if resonance analysis weather Fourier series is used as a solution.

AB - Undoubtedly the compact representation by a set of Mel Frequency Cepstrum Coefficients (MFCC) has been used satisfactorily for ASR [9]. The cochlea is an organ, in humans or mammalians that converts the frequency perceived by the ear in punctual stimulation to excite the nerve auditory that receives a set of stimulus that comes from speech sound pressure. A new approach is proposed that considers this phenomenon to construct the bank filter in our parametric representation. Then we substitute the distribution of the bank filter in the Mel scale function for a different distribution that depends of the inner ear response to the stimulus that it receives. The place theory is used which achieves a 99.8% performance. Finally, this paper compares the performance of different acoustic representations in Continuous Automatic Speech Recognition system (CASRs) based on words. The cochlea operation is explained that permits obtaining a model and we will show that one alternative solution to the model based on fluid mechanical proposed by Lesser and Berkley [19], can be obtained if resonance analysis weather Fourier series is used as a solution.

KW - Cochlea operation

KW - Place theory and bank filter

KW - Speech recognition

UR - http://www.scopus.com/inward/record.url?scp=84906274848&partnerID=8YFLogxK

M3 - Artículo de la conferencia

AN - SCOPUS:84906274848

SN - 2308-457X

SP - 1732

EP - 1736

JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

T2 - 14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013

Y2 - 25 August 2013 through 29 August 2013

ER -

New parameters for automatic speech recognition based on the mammalian cochlea model using resonance analysis

Resumen

Otros archivos y enlaces

Huella

Citar esto