Mixed data balancing through compact sets based instance selection

Yenny Villuendas-Rey, María Matilde García-Lorenzo

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Learning in datasets that suffer from imbalanced class distribution is an important problem in Pattern Recognition. This paper introduces a novel algorithm for data balancing, based on compact set clustering of the majority class. The proposed algorithm is able to deal with mixed, as well as incomplete data, and with arbitrarily dissimilarity functions. Numerical experiments over repository databases show the high quality performance of the method proposed in this paper according to area under the ROC curve and imbalance ratio.

Idioma originalInglés
Título de la publicación alojadaProgress in Pattern Recognition, Image Analysis, Computer Vision, and Applications - 18th Iberoamerican Congress, CIARP 2013, Proceedings
Páginas254-261
Número de páginas8
EdiciónPART 1
DOI
EstadoPublicada - 2013
Publicado de forma externa
Evento18th Iberoamerican Congress on Pattern Recognition, CIARP 2013 - Havana, Cuba
Duración: 20 nov. 201323 nov. 2013

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NúmeroPART 1
Volumen8258 LNCS
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia18th Iberoamerican Congress on Pattern Recognition, CIARP 2013
País/TerritorioCuba
CiudadHavana
Período20/11/1323/11/13

Huella

Profundice en los temas de investigación de 'Mixed data balancing through compact sets based instance selection'. En conjunto forman una huella única.

Citar esto