SMOTE-D a deterministic version of SMOTE

Fredy Rodríguez Torres, Jesús A. Carrasco-Ochoa, José Fco Martínez-Trinidad

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

© Springer International Publishing Switzerland 2016. Imbalanced data is a problem of current research interest. This problem arises when the number of objects in a class is much lower than in other classes. In order to address this problem several methods for oversampling the minority class have been proposed. Oversampling methods generate synthetic objects for the minority class in order to balance the amount of objects between classes, among them, SMOTE is one of the most successful and well-known methods. In this paper, we introduce a modification of SMOTE which deterministically generates synthetic objects for the minority class. Our proposed method eliminates the random component of SMOTE and generates different amount of synthetic objects for each object of the minority class. An experimental comparison of the proposed method against SMOTE in standard imbalanced datasets is provided. The experimental results show an improvement of our proposed method regarding SMOTE, in terms of F-measure.
Original languageAmerican English
Title of host publicationSMOTE-D a deterministic version of SMOTE
Pages177-188
Number of pages158
ISBN (Electronic)9783319393926
DOIs
StatePublished - 1 Jan 2016
Externally publishedYes
EventLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) -
Duration: 1 Jan 2018 → …

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9703
ISSN (Print)0302-9743

Conference

ConferenceLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Period1/01/18 → …

Fingerprint

Oversampling
Class
Object
Eliminate
Experimental Results
Standards

Cite this

Torres, F. R., Carrasco-Ochoa, J. A., & Martínez-Trinidad, J. F. (2016). SMOTE-D a deterministic version of SMOTE. In SMOTE-D a deterministic version of SMOTE (pp. 177-188). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9703). https://doi.org/10.1007/978-3-319-39393-3_18
Torres, Fredy Rodríguez ; Carrasco-Ochoa, Jesús A. ; Martínez-Trinidad, José Fco. / SMOTE-D a deterministic version of SMOTE. SMOTE-D a deterministic version of SMOTE. 2016. pp. 177-188 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{97a6cc950576426da58bd9700d74ed0f,
title = "SMOTE-D a deterministic version of SMOTE",
abstract = "{\circledC} Springer International Publishing Switzerland 2016. Imbalanced data is a problem of current research interest. This problem arises when the number of objects in a class is much lower than in other classes. In order to address this problem several methods for oversampling the minority class have been proposed. Oversampling methods generate synthetic objects for the minority class in order to balance the amount of objects between classes, among them, SMOTE is one of the most successful and well-known methods. In this paper, we introduce a modification of SMOTE which deterministically generates synthetic objects for the minority class. Our proposed method eliminates the random component of SMOTE and generates different amount of synthetic objects for each object of the minority class. An experimental comparison of the proposed method against SMOTE in standard imbalanced datasets is provided. The experimental results show an improvement of our proposed method regarding SMOTE, in terms of F-measure.",
author = "Torres, {Fredy Rodr{\'i}guez} and Carrasco-Ochoa, {Jes{\'u}s A.} and Mart{\'i}nez-Trinidad, {Jos{\'e} Fco}",
year = "2016",
month = "1",
day = "1",
doi = "10.1007/978-3-319-39393-3_18",
language = "American English",
isbn = "9783319393926",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "177--188",
booktitle = "SMOTE-D a deterministic version of SMOTE",

}

Torres, FR, Carrasco-Ochoa, JA & Martínez-Trinidad, JF 2016, SMOTE-D a deterministic version of SMOTE. in SMOTE-D a deterministic version of SMOTE. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9703, pp. 177-188, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 1/01/18. https://doi.org/10.1007/978-3-319-39393-3_18

SMOTE-D a deterministic version of SMOTE. / Torres, Fredy Rodríguez; Carrasco-Ochoa, Jesús A.; Martínez-Trinidad, José Fco.

SMOTE-D a deterministic version of SMOTE. 2016. p. 177-188 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9703).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - SMOTE-D a deterministic version of SMOTE

AU - Torres, Fredy Rodríguez

AU - Carrasco-Ochoa, Jesús A.

AU - Martínez-Trinidad, José Fco

PY - 2016/1/1

Y1 - 2016/1/1

N2 - © Springer International Publishing Switzerland 2016. Imbalanced data is a problem of current research interest. This problem arises when the number of objects in a class is much lower than in other classes. In order to address this problem several methods for oversampling the minority class have been proposed. Oversampling methods generate synthetic objects for the minority class in order to balance the amount of objects between classes, among them, SMOTE is one of the most successful and well-known methods. In this paper, we introduce a modification of SMOTE which deterministically generates synthetic objects for the minority class. Our proposed method eliminates the random component of SMOTE and generates different amount of synthetic objects for each object of the minority class. An experimental comparison of the proposed method against SMOTE in standard imbalanced datasets is provided. The experimental results show an improvement of our proposed method regarding SMOTE, in terms of F-measure.

AB - © Springer International Publishing Switzerland 2016. Imbalanced data is a problem of current research interest. This problem arises when the number of objects in a class is much lower than in other classes. In order to address this problem several methods for oversampling the minority class have been proposed. Oversampling methods generate synthetic objects for the minority class in order to balance the amount of objects between classes, among them, SMOTE is one of the most successful and well-known methods. In this paper, we introduce a modification of SMOTE which deterministically generates synthetic objects for the minority class. Our proposed method eliminates the random component of SMOTE and generates different amount of synthetic objects for each object of the minority class. An experimental comparison of the proposed method against SMOTE in standard imbalanced datasets is provided. The experimental results show an improvement of our proposed method regarding SMOTE, in terms of F-measure.

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84977600144&origin=inward

UR - https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=84977600144&origin=inward

U2 - 10.1007/978-3-319-39393-3_18

DO - 10.1007/978-3-319-39393-3_18

M3 - Conference contribution

SN - 9783319393926

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 177

EP - 188

BT - SMOTE-D a deterministic version of SMOTE

ER -

Torres FR, Carrasco-Ochoa JA, Martínez-Trinidad JF. SMOTE-D a deterministic version of SMOTE. In SMOTE-D a deterministic version of SMOTE. 2016. p. 177-188. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-39393-3_18