Automatic phoneme border detection to improve speech recognition

Suárez Guerra Sergio, Juárez Murillo Cristian-Remington, Oropeza Rodríguez José Luis

Research output: Chapter in book/report/conference proceeding; Conference contribution; peer-reviewed

Abstract

This paper presents a comparative study of speech recognition performance between systems trained with manually labeled corpora and systems trained with semi-automatically labeled corpora. An automatic labeling system was designed to generate phoneme label files for all words in the corpus used to train an automatic speech recognition system. Speech recognition experiments were performed on the same corpus, training first with manually generated labels and then with automatically generated labels. The results show that recognition performance is better when the selected dictionary is trained with automatic label files than when it is trained with manual label files. Automatic labeling of speech corpora is not only faster than manual labeling, but also free from the subjectivity inherent in manual segmentation performed by specialists. The performance achieved in this work is greater than 96%.

Original language: English
Title of host publication: Advances in Artificial Intelligence and Soft Computing - 14th Mexican International Conference on Artificial Intelligence, MICAI 2015, Proceedings
Editors: Grigori Sidorov, Sofía N. Galicia-Haro
Publisher: Springer Verlag
Pages: 127-135
Number of pages: 9
ISBN (print): 9783319270593
DOI
Status: Published - 2015
Event: 14th Mexican International Conference on Artificial Intelligence, MICAI 2015 - Cuernavaca, Morelos, Mexico
Duration: 25 Oct 2015 to 31 Oct 2015

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9413
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: 14th Mexican International Conference on Artificial Intelligence, MICAI 2015
Country/Territory: Mexico
City: Cuernavaca, Morelos
Period: 25/10/15 to 31/10/15
