Pattern analysis in DNA microarray data through PCA-based gene selection

Ricardo Ocampo, Marco A. de Luna, Roberto Vega, Gildardo Sanchez-Ante, Luis E. Falcon-Morales, Humberto Sossa

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

DNA microarrays is a technology that can be used to diagnose cancer and other diseases. To automate the analysis of such data, pattern recognition and machine learning algorithms can be applied. However, the curse of dimensionality is unavoidable: very few samples to train, and many attributes in each sample. As the predictive accuracy of supervised classifiers decays with irrelevant and redundant features, the necessity of a dimensionality reduction process is essential. In this paper, we propose a new methodology that is based on the application of Principal Component Analysis and other statistical tools to gain insight in the identification of relevant genes. We run the approaches using two benchmark datasets: Leukemia and Lymphoma. The results show that it is possible to reduce considerably the number of genes while increasing the performance of well known classifiers.

Original languageEnglish
Title of host publicationProgress in Pattern Recognition Image Analysis, Computer Vision and Applications - 19th Iberoamerican Congress, CIARP 2014, Proceedings
EditorsEduardo Bayro-Corrochano, Edwin Hancock
PublisherSpringer Verlag
Pages532-539
Number of pages8
ISBN (Electronic)9783319125671
DOIs
StatePublished - 2014
Event19th Iberoamerican Congress on Pattern Recognition, CIARP 2014 - Puerto Vallarta, Mexico
Duration: 2 Nov 20145 Nov 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8827
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th Iberoamerican Congress on Pattern Recognition, CIARP 2014
Country/TerritoryMexico
CityPuerto Vallarta
Period2/11/145/11/14

Fingerprint

Dive into the research topics of 'Pattern analysis in DNA microarray data through PCA-based gene selection'. Together they form a unique fingerprint.

Cite this