Hybrid data selection with preservation rough sets

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

The nearest neighbor classifier is one of the simplest yet accurate decision-making algorithms. However, it suffers in the presence of noisy or redundant data. This article addresses the instance selection problem to improve lazy learners in hybrid and incomplete datasets. It introduces Preservation Rough Set (PRS) model, which can deal with hybrid (numeric and categorical) and incomplete decision systems. The properties of PRS are demonstrated by theorems, and its capabilities are shown by means of an original instance selection algorithm to determine which instances are relevant and which are not to improve decision-making. The numerical experiments conducted allow asseverating that the proposed algorithm is competitive and lead to highly accurate decision-making for nearest neighbor, voting algorithm, and Naïve Associative Classifier. In addition, the experiments show the ability of the proposal for dealing with noisy datasets.

Original languageEnglish
Pages (from-to)11197-11223
Number of pages27
JournalSoft Computing
Volume26
Issue number21
DOIs
StatePublished - Nov 2022

Keywords

  • Decision-making
  • Hybrid and incomplete data
  • Instance selection
  • Rough sets

Fingerprint

Dive into the research topics of 'Hybrid data selection with preservation rough sets'. Together they form a unique fingerprint.

Cite this