Towards a classification of binary similarity measures

Ivan Ramirez Mejia, Ildar Batyrshin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Similarity measures for binary variables are used in many problems of machine learning, pattern recognition and classification. Currently, the dozens of similarity measures are introduced and the problem of comparative analysis of these measures appears. One of the methods used for such analysis is clustering of similarity measures based on correlation between data similarity values obtained by different measures. The paper proposes the method of comparative analysis of similarity measures based on the set theoretic representation of these measures and comparison of algebraic properties of these representations. The results show existing relationship between results of clustering and the classification of measures by their properties. Due to the results of clustering depend on the clustering method and on data used for measuring correlation between measures we conclude that the classification based on the proposed properties of similarity measures is more suitable for comparative analysis of similarity measures.

Original languageEnglish
Title of host publicationAdvances in Soft Computing - 16th Mexican International Conference on Artificial Intelligence, MICAI 2017, Proceedings
EditorsSabino Miranda-Jiménez, Félix Castro, Miguel González-Mendoza
PublisherSpringer Verlag
Pages325-335
Number of pages11
ISBN (Print)9783030028367
DOIs
StatePublished - 2018
Event16th Mexican International Conference on Artificial Intelligence, MICAI 2017 - Enseneda, Mexico
Duration: 23 Oct 201728 Oct 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10632 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th Mexican International Conference on Artificial Intelligence, MICAI 2017
Country/TerritoryMexico
CityEnseneda
Period23/10/1728/10/17

Keywords

  • Binary data
  • Clustering
  • Contingency table
  • Similarity measure

Fingerprint

Dive into the research topics of 'Towards a classification of binary similarity measures'. Together they form a unique fingerprint.

Cite this