Artificial Intelligence Methods for Automatic Music Transcription using Isolated Notes in Real-Time

Jose Luis Oropeza Oropeza; Sergio Suarez Guerra; Omar Velazquez Lopez

doi:10.1109/MICAI46078.2018.00010


Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

We present a comparative study of several audio-signal features and Artificial Intelligence methods for real-time Automatic Music Transcription (AMT), specifically of monophonic notes. The features used were Mel-Frequency Cepstrum Coefficients (MFCC), Linear Prediction Coefficients (LPC), and Cochlear Mechanics Cepstrum Coefficients (CMCC). CMCC are a set of coefficients obtained from our laboratory experiments, which in this paper proved more effective for AMT than other features such as MFCC. For the pattern classification task, we used Vector Quantization (VQ), Hidden Markov Models (HMM), Gaussian Mixture Models (GMM), and Artificial Neural Networks (ANN). The database consisted of 840 musical notes: we analyzed 5 scales with 14 samples per musical note. The results showed that Vector Quantization, HMM using CMCC_L&B_RA, and GMM were the best Artificial Intelligence methods for this task, while MFCC and CMCC_L&B_RA were the best features.
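As an illustration of the Vector Quantization approach named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes one codebook per note, trained with plain k-means, and classifies a note by the codebook that gives the lowest average distortion. The function names, the codebook size of 4, the 12-dimensional frames, and the synthetic Gaussian "features" standing in for MFCC or CMCC vectors are all hypothetical choices made for this example.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def train_codebook(frames, size=4, iters=20, seed=0):
    """Fit a VQ codebook to a list of feature vectors with Lloyd's k-means."""
    rng = random.Random(seed)
    # Initialise codewords from randomly chosen training frames.
    codebook = [list(f) for f in rng.sample(frames, size)]
    for _ in range(iters):
        cells = [[] for _ in range(size)]
        for f in frames:
            # Assign each frame to its nearest codeword.
            k = min(range(size), key=lambda i: sq_dist(f, codebook[i]))
            cells[k].append(f)
        for k, members in enumerate(cells):
            if members:  # an empty cell keeps its previous codeword
                codebook[k] = [sum(col) / len(members) for col in zip(*members)]
    return codebook

def distortion(frames, codebook):
    """Average squared distance from each frame to its nearest codeword."""
    return sum(min(sq_dist(f, c) for c in codebook) for f in frames) / len(frames)

def classify(frames, codebooks):
    """Return the label whose codebook quantizes the frames with least distortion."""
    return min(codebooks, key=lambda label: distortion(frames, codebooks[label]))

def synthetic_note(center, n_frames=40, n_dims=12, seed=0):
    """Hypothetical stand-in for the per-frame features of one recorded note."""
    rng = random.Random(seed)
    return [[center + rng.gauss(0.0, 0.1) for _ in range(n_dims)]
            for _ in range(n_frames)]

# Toy data: three "notes" whose features cluster around different centres.
centers = {"C4": 0.0, "E4": 1.0, "G4": 2.0}
codebooks = {
    label: train_codebook(synthetic_note(c, seed=i))
    for i, (label, c) in enumerate(centers.items())
}

# An unseen sample drawn near the E4 centre is matched against every codebook.
predicted = classify(synthetic_note(1.0, seed=99), codebooks)
print(predicted)
```

In a real system the synthetic frames would be replaced by feature vectors extracted from audio, and the codebook size would be tuned on held-out notes.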

Original language	English
Title of host publication	Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018
Editors	Ildar Batyrshin, Maria de Lourdes Martinez Villasenor, Hiram Eredin Ponce Espinosa
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	13-19
Number of pages	7
ISBN (electronic)	9780769565927
DOI	https://doi.org/10.1109/MICAI46078.2018.00010
Status	Published - Oct 2018
Externally published	Yes
Event	17th Mexican International Conference on Artificial Intelligence, MICAI 2018 - Guadalajara, Jalisco, Mexico. Duration: 22 Oct 2018 → 27 Oct 2018

Publication series

Name	Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018

Conference

Conference	17th Mexican International Conference on Artificial Intelligence, MICAI 2018
Country/Territory	Mexico
City	Guadalajara, Jalisco
Period	22/10/18 → 27/10/18

Access to document

10.1109/MICAI46078.2018.00010

Other files and links

Link to publication in Scopus

Cite this

Oropeza, J. L. O., Guerra, S. S., & Lopez, O. V. (2018). Artificial Intelligence Methods for Automatic Music Transcription using Isolated Notes in Real-Time. In I. Batyrshin, M. de Lourdes Martinez Villasenor, & H. E. P. Espinosa (Eds.), Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018 (pp. 13-19). Article 9046488 (Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/MICAI46078.2018.00010


@inproceedings{a5cbcb6073064185a230762387eec4bb,

title = "Artificial Intelligence Methods for Automatic Music Transcription using Isolated Notes in Real-Time",

abstract = "We present a comparative study of several audio-signal features and Artificial Intelligence methods for real-time Automatic Music Transcription (AMT), specifically of monophonic notes. The features used were Mel-Frequency Cepstrum Coefficients (MFCC), Linear Prediction Coefficients (LPC), and Cochlear Mechanics Cepstrum Coefficients (CMCC). CMCC are a set of coefficients obtained from our laboratory experiments, which in this paper proved more effective for AMT than other features such as MFCC. For the pattern classification task, we used Vector Quantization (VQ), Hidden Markov Models (HMM), Gaussian Mixture Models (GMM), and Artificial Neural Networks (ANN). The database consisted of 840 musical notes: we analyzed 5 scales with 14 samples per musical note. The results showed that Vector Quantization, HMM using CMCC_L&B_RA, and GMM were the best Artificial Intelligence methods for this task, while MFCC and CMCC_L&B_RA were the best features.",

keywords = "Artificial Neural Networks (ANN), Gaussian Mixture Models (GMM), Hidden Markov Models (HMM), Mel Frequency Cepstrum Coefficients (MFCC), Vector Quantization (VQ)",

author = "Oropeza, {Jose Luis Oropeza} and Guerra, {Sergio Suarez} and Lopez, {Omar Velazquez}",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 17th Mexican International Conference on Artificial Intelligence, MICAI 2018 ; Conference date: 22-10-2018 Through 27-10-2018",

year = "2018",

month = oct,

doi = "10.1109/MICAI46078.2018.00010",

language = "English",

series = "Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "13--19",

editor = "Ildar Batyrshin and {de Lourdes Martinez Villasenor}, Maria and Espinosa, {Hiram Eredin Ponce}",

booktitle = "Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018",

address = "United States",

}


TY - GEN

T1 - Artificial Intelligence Methods for Automatic Music Transcription using Isolated Notes in Real-Time

AU - Oropeza, Jose Luis Oropeza

AU - Guerra, Sergio Suarez

AU - Lopez, Omar Velazquez

PY - 2018/10

Y1 - 2018/10

AB - We present a comparative study of several audio-signal features and Artificial Intelligence methods for real-time Automatic Music Transcription (AMT), specifically of monophonic notes. The features used were Mel-Frequency Cepstrum Coefficients (MFCC), Linear Prediction Coefficients (LPC), and Cochlear Mechanics Cepstrum Coefficients (CMCC). CMCC are a set of coefficients obtained from our laboratory experiments, which in this paper proved more effective for AMT than other features such as MFCC. For the pattern classification task, we used Vector Quantization (VQ), Hidden Markov Models (HMM), Gaussian Mixture Models (GMM), and Artificial Neural Networks (ANN). The database consisted of 840 musical notes: we analyzed 5 scales with 14 samples per musical note. The results showed that Vector Quantization, HMM using CMCC_L&B_RA, and GMM were the best Artificial Intelligence methods for this task, while MFCC and CMCC_L&B_RA were the best features.

KW - Artificial Neural Networks (ANN)

KW - Gaussian Mixture Models (GMM)

KW - Hidden Markov Models (HMM)

KW - Mel Frequency Cepstrum Coefficients (MFCC)

KW - Vector Quantization (VQ)

UR - http://www.scopus.com/inward/record.url?scp=85092050306&partnerID=8YFLogxK

U2 - 10.1109/MICAI46078.2018.00010

DO - 10.1109/MICAI46078.2018.00010

M3 - Conference contribution

AN - SCOPUS:85092050306

T3 - Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018

SP - 13

EP - 19

BT - Proceedings of the Special Session - 2018 17th Mexican International Conference on Artificial Intelligence, MICAI 2018

A2 - Batyrshin, Ildar

A2 - de Lourdes Martinez Villasenor, Maria

A2 - Espinosa, Hiram Eredin Ponce

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 17th Mexican International Conference on Artificial Intelligence, MICAI 2018

Y2 - 22 October 2018 through 27 October 2018

ER -

