Feature selection to detect botnets using machine learning algorithms

Francisco Villegas Alejandre, Nareli Cruz Cortés, Eleazar Aguirre Anaya

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

39 Scopus citations

Abstract

In this paper, a novel method to do feature selection to detect botnets at their phase of Command and Control (C&C) is presented. A major problem is that researchers have proposed features based on their expertise, but there is no a method to evaluate these features since some of these features could get a lower detection rate than other. To this aim, we find the feature set based on connections of botnets at their phase of C&C, that maximizes the detection rate of these botnets. A Genetic Algorithm (GA) was used to select the set of features that gives the highest detection rate. We used the machine learning algorithm C4.5, this algorithm did the classification between connections belonging or not to a botnet. The datasets used in this paper were extracted from the repositories ISOT and ISCX. Some tests were done to get the best parameters in a GA and the algorithm C4.5. We also performed experiments in order to obtain the best set of features for each botnet analyzed (specific), and for each type of botnet (general) too. The results are shown at the end of the paper, in which a considerable reduction of features and a higher detection rate than the related work presented were obtained.

Original languageEnglish
Title of host publication2017 International Conference on Electronics, Communications and Computers, CONIELECOMP 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781509036219
DOIs
StatePublished - 3 Apr 2017
Event27th International Conference on Electronics, Communications and Computers, CONIELECOMP 2017 - Cholula, Mexico
Duration: 22 Feb 201724 Feb 2017

Publication series

Name2017 International Conference on Electronics, Communications and Computers, CONIELECOMP 2017

Conference

Conference27th International Conference on Electronics, Communications and Computers, CONIELECOMP 2017
Country/TerritoryMexico
CityCholula
Period22/02/1724/02/17

Keywords

  • Botnet
  • Feature selection
  • Machine learning
  • Malware detection

Fingerprint

Dive into the research topics of 'Feature selection to detect botnets using machine learning algorithms'. Together they form a unique fingerprint.

Cite this