Efficiently finding the optimum number of clusters in a dataset with a new hybrid differential evolution algorithm: DELA

Javier Arellano-Verdejo, Enrique Alba, Salvador Godoy-Calderon

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

7 Citas (Scopus)

Resumen

Clustering algorithms, a fundamental base for data mining procedures and learning techniques, suffer from the lack of efficient methods for determining the optimal number of clusters to be found in an arbitrary dataset. The few methods existing in the literature always use some sort of evolutionary algorithm having a cluster validation index as its objective function. In this article, a new evolutionary algorithm, based on a hybrid model of global and local heuristic search, is proposed for the same task, and some experimentation is done with different datasets and indexes. Due to its design, independent of any clustering procedure, it is applicable to virtually any clustering method like the widely used (Formula presented.) -means algorithm. Moreover, the use of non-parametric statistical tests over the experimental results, clearly show the proposed algorithm to be more efficient than other evolutionary algorithms currently used for the same task.

Idioma originalInglés
Páginas (desde-hasta)895-905
Número de páginas11
PublicaciónSoft Computing
Volumen20
N.º3
DOI
EstadoPublicada - 1 mar. 2016

Huella

Profundice en los temas de investigación de 'Efficiently finding the optimum number of clusters in a dataset with a new hybrid differential evolution algorithm: DELA'. En conjunto forman una huella única.

Citar esto