YouTube based religious hate speech and extremism detection dataset with machine learning baselines

Noman Ashraf, Abid Rafiq, Sabur Butt, Hafiz Muhammad Faisal Shehzad, Grigori Sidorov, Alexander Gelbukh

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

On YouTube, billions of videos are watched online and millions of short messages are posted each day. YouTube along with other social networking sites are used by individuals and extremist groups for spreading hatred among users. In this paper, we consider religion as the most targeted domain for spreading hate speech among people of different religions. We present a methodology for the detection of religion-based hate videos on YouTube. Messages posted on YouTube videos generally express the opinions of users' related to that video. We provide a novel dataset for religious hate speech detection on Youtube comments. The proposed methodology applies data mining techniques on extracted comments from religious videos in order to filter religion-oriented messages and detect those videos which are used for spreading hate. The supervised learning algorithms: Support Vector Machine (SVM), Logistic Regression (LR), and k-Nearest Neighbor (k-NN) are used for baseline results.

Original languageEnglish
Pages (from-to)4769-4777
Number of pages9
JournalJournal of Intelligent and Fuzzy Systems
Volume42
Issue number5
DOIs
StatePublished - 2022
Externally publishedYes

Keywords

  • Hate speech detection
  • YouTube comment analysis
  • hate speech dataset
  • religious extremism detection

Fingerprint

Dive into the research topics of 'YouTube based religious hate speech and extremism detection dataset with machine learning baselines'. Together they form a unique fingerprint.

Cite this