Etiquetado fonético automático al nivel palabra usando la dinámica de cambio de los vectores del libro código

Translated title of the contribution: Automatic Phonetic Labeling at Word Level Using the Dynamics of Changing Codebook Vectors

Sergio Suárez Guerra, José Luis Oropeza Rodríguez

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

An alternative solution is described regarding the phonetic labeling that compose a set of pronounced by an announcer, susceptible of being used in any language, according to the needs and characteristics associated with the proposal. The procedure is based on the monitoring of the dynamics of change of the cepstral vectors associated with the frequency of Mel (MFCCs) that make up the Book Code (LC), extracted from the word to be labeled. This dynamics of change analyzes where a transition from one vector (MFCC) of the LC occurs to another, as well as the disturbances that occur in the zone of change due to the phonetic concatenation. Metrics are established to consider coarticulation noise and define the location of the phonetic separation boundary. Two methods are used to evaluate the dynamics of vector change and deliver the most accurate labeling. The percentage of recognition and correct labeling obtained with this application is 97.9% lower by 1.06%, with respect to the percentage of recognition obtained on the same corpus of words, but using manual labeling. The more important are that, the time used in the labeling of the voice corpus automatically is significantly less than the estimate of being done manually, in addition to eliminating personal subjectivity in the labeling work.

Translated title of the contributionAutomatic Phonetic Labeling at Word Level Using the Dynamics of Changing Codebook Vectors
Original languageGerman
Pages (from-to)855-868
Number of pages14
JournalComputacion y Sistemas
Volume24
Issue number2
DOIs
StatePublished - 2020

Keywords

  • Phonetic labeling
  • Voice recognition

Fingerprint

Dive into the research topics of 'Automatic Phonetic Labeling at Word Level Using the Dynamics of Changing Codebook Vectors'. Together they form a unique fingerprint.

Cite this