YouTube based religious hate speech and extremism detection dataset with machine learning baselines

Noman Ashraf; Abid Rafiq; Sabur Butt; Hafiz Muhammad Faisal Shehzad; Grigori Sidorov; Alexander Gelbukh

doi:10.3233/JIFS-219264

Research output: Contribution to journal › Article › peer-review

9 Citations (Scopus)

Abstract

On YouTube, billions of videos are watched and millions of short messages are posted each day. YouTube, along with other social networking sites, is used by individuals and extremist groups to spread hatred among users. In this paper, we consider religion as the most targeted domain for spreading hate speech among people of different religions. We present a methodology for detecting religion-based hate videos on YouTube. Messages posted on YouTube videos generally express users' opinions about that video. We provide a novel dataset for religious hate speech detection in YouTube comments. The proposed methodology applies data mining techniques to comments extracted from religious videos in order to filter religion-oriented messages and detect videos that are used for spreading hate. Three supervised learning algorithms, Support Vector Machine (SVM), Logistic Regression (LR), and k-Nearest Neighbor (k-NN), are used to produce baseline results.
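To make the comment-to-video idea in the abstract concrete, the following is a minimal sketch of a k-NN comment classifier (one of the three baseline families named above) followed by a simple video-level aggregation. It is an illustration under stated assumptions, not the authors' implementation: the training comments, the `faith_a`/`faith_b` placeholders, the bag-of-words cosine similarity, the function names, and the 0.5 flagging threshold are all hypothetical, and the religion-oriented filtering step is omitted.

```python
import math
import re
from collections import Counter

# Toy, invented training comments (NOT taken from the paper's dataset).
# "faith_a" and "faith_b" are placeholders for real group names.
TRAIN = [
    ("i hate followers of faith_a they should be banned", "hate"),
    ("people of faith_b are evil and disgusting", "hate"),
    ("faith_a believers are disgusting i hate them", "hate"),
    ("beautiful recitation thank you for sharing", "neutral"),
    ("i respect people of faith_b and their traditions", "neutral"),
    ("thank you this lecture on faith_a was very informative", "neutral"),
]


def vectorize(text):
    """Lowercase the text and return a bag-of-words term-count vector."""
    return Counter(re.findall(r"[a-z_]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def knn_predict(text, train=TRAIN, k=3):
    """Label a comment by majority vote among its k most similar training comments."""
    query = vectorize(text)
    scored = sorted(
        ((cosine(query, vectorize(t)), label) for t, label in train),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


def flag_video(comments, threshold=0.5, k=3):
    """Flag a video when the share of comments predicted as hate reaches the threshold.

    The aggregation rule and threshold are assumptions made for this sketch.
    """
    if not comments:
        return False
    hateful = sum(knn_predict(c, k=k) == "hate" for c in comments)
    return hateful / len(comments) >= threshold
```

For example, `flag_video(["i hate faith_b people they are evil", "thank you for this beautiful lecture"])` classifies each comment first and then applies the share-based rule. A realistic baseline would use a much larger labeled corpus and a stronger text representation than raw term counts.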

Original language	English
Pages (from-to)	4769-4777
Number of pages	9
Journal	Journal of Intelligent and Fuzzy Systems
Volume	42
Issue number	5
DOI	https://doi.org/10.3233/JIFS-219264
Publication status	Published - 2022
Externally published	Yes

Cite this

@article{dc6a520310b946eb96d4fd8208b542e6,

title = "YouTube based religious hate speech and extremism detection dataset with machine learning baselines",

keywords = "Hate speech detection, YouTube comment analysis, hate speech dataset, religious extremism detection",

author = "Noman Ashraf and Abid Rafiq and Sabur Butt and Shehzad, {Hafiz Muhammad Faisal} and Grigori Sidorov and Alexander Gelbukh",

year = "2022",

doi = "10.3233/JIFS-219264",

language = "English",

volume = "42",

pages = "4769--4777",

journal = "Journal of Intelligent and Fuzzy Systems",

issn = "1064-1246",

number = "5",

}

TY - JOUR

T1 - YouTube based religious hate speech and extremism detection dataset with machine learning baselines

AU - Ashraf, Noman

AU - Rafiq, Abid

AU - Butt, Sabur

AU - Shehzad, Hafiz Muhammad Faisal

AU - Sidorov, Grigori

AU - Gelbukh, Alexander

PY - 2022

Y1 - 2022

KW - Hate speech detection

KW - YouTube comment analysis

KW - hate speech dataset

KW - religious extremism detection

UR - http://www.scopus.com/inward/record.url?scp=85128185741&partnerID=8YFLogxK

U2 - 10.3233/JIFS-219264

DO - 10.3233/JIFS-219264

M3 - Article

AN - SCOPUS:85128185741

SN - 1064-1246

VL - 42

SP - 4769

EP - 4777

JO - Journal of Intelligent and Fuzzy Systems

JF - Journal of Intelligent and Fuzzy Systems

IS - 5

ER -
