The place theory as an alternative solution in Automatic Speech Recognition tasks

José Luis Oropeza-Rodríguez, Sergio Suárez-Guerra, Mario Jiménez-Hernández

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recently the parametric representation using cochlea behavior has been used in different studies related with Automatic Speech Recognition (ASR). This paper shows how using an alternative solution reported in the state of the art solves the Lesser and Berkeley’s cochlea model in ASR tasks. An approach that considers a new form to construct the bank filter in the parametric representation used to extract MFCC is proposed. Then this distribution of the bank filter to have a new representation of the speech in frequency domain is used. It is important to indicate that MFCC parameters use Mel scale to create a bank filter. The cochlea behavior based on the theory to create the central frequencies of the bank filter was used, .The Mel scale function was substituted for our purpose. A 98.5% performance was reached, for a task that uses isolated digits pronounced by 5 different speakers in the Spanish language and corpus SUSAS with neutral sound records with some advantages in comparison with MFCC was used.

Original languageEnglish
Title of host publicationProgress in Pattern Recognition Image Analysis, Computer Vision and Applications - 19th Iberoamerican Congress, CIARP 2014, Proceedings
EditorsEduardo Bayro-Corrochano, Edwin Hancock
PublisherSpringer Verlag
Pages167-174
Number of pages8
ISBN (Electronic)9783319125671
DOIs
StatePublished - 2014
Event19th Iberoamerican Congress on Pattern Recognition, CIARP 2014 - Puerto Vallarta, Mexico
Duration: 2 Nov 20145 Nov 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8827
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th Iberoamerican Congress on Pattern Recognition, CIARP 2014
Country/TerritoryMexico
CityPuerto Vallarta
Period2/11/145/11/14

Keywords

  • Automatic speech recognition
  • Cochlea operation
  • Place theory and bank filter component
  • Speech recognition

Fingerprint

Dive into the research topics of 'The place theory as an alternative solution in Automatic Speech Recognition tasks'. Together they form a unique fingerprint.

Cite this