Automatic phoneme border detection to improve speech recognition

Suárez Guerra Sergio, Juárez Murillo Cristian-Remington, Oropeza Rodríguez José Luis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A comparative study of speech recognition performance among systems trained with manually labeled corpora and systems trained with semi-automatically labeled corpora is introduced. An automatic labeling system was designed to generate phoneme labels files for all words within the corpus used to train a system of automatic speech recognition. Speech recognition experiments were performed using the same corpus, first training with manually, and later with automatically generated labels. Results show that the recognition performance is better when the training of selected diccionary, is made with automatic label files than when it is made with manual label files. Not only is the automatic labeling of speech corpora faster than manual labeling, but also it is free from the subjectivity inherent in the manual segmentation performed by specialists. The performance achieved in this work is greater than 96 %.

Original languageEnglish
Title of host publicationAdvances in Artificial Intelligence and Soft Computing - 14th Mexican International Conference on Artificial Intelligence, MICAI 2015, Proceedings
EditorsGrigori Sidorov, SofÍa N. Galicia-Haro
PublisherSpringer Verlag
Pages127-135
Number of pages9
ISBN (Print)9783319270593
DOIs
StatePublished - 2015
Event14th Mexican International Conference on Artificial Intelligence, MICAI 2015 - Cuernavaca, Morelos, Mexico
Duration: 25 Oct 201531 Oct 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9413
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th Mexican International Conference on Artificial Intelligence, MICAI 2015
Country/TerritoryMexico
CityCuernavaca, Morelos
Period25/10/1531/10/15

Keywords

  • ASR applications
  • Automatic speech segmentation and labeling
  • Label detection

Fingerprint

Dive into the research topics of 'Automatic phoneme border detection to improve speech recognition'. Together they form a unique fingerprint.

Cite this