Individual vs. Group Violent Threats Classification in Online Discussions

Noman Ashraf, Rabia Mustafa, Grigori Sidorov, Alexander Gelbukh

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

26 Citas (Scopus)

Resumen

Violent threat is a serious crime affecting the targeted individuals or groups. It is essential for media providers to block the users that post such threats. In this paper, we focused on detection of violent threat language in YouTube comments. We categorized the threatening comments into those targeting an individual or a group. We started from an existing dataset with violent threat language identified, but without any categorization into comments targeting individuals or groups. We adopted a binary classification approach for the prediction of individual- vs. group-targeting threats. We compared two text representations: bag of words (BOW) and pre-trained word embedding such as GloVe and fastText. We used deep-learning classifiers such as 1D-CNN, LSTM, and bidirectional LSTM (BiLSTM). GloVe embedding showed the worst results, fastText performed much better, and BiLSTM on BOW with term frequency-inverse document frequency (TF-IDF) weighting scheme gave the best results, achieving 0.94% ROC-AUC and Macro-F1 score of 0.85%.

Idioma originalInglés
Título de la publicación alojadaThe Web Conference 2020 - Companion of the World Wide Web Conference, WWW 2020
EditorialAssociation for Computing Machinery
Páginas629-633
Número de páginas5
ISBN (versión digital)9781450370240
DOI
EstadoPublicada - 20 abr. 2020
Evento29th International World Wide Web Conference, WWW 2020 - Taipei, Taiwán
Duración: 20 abr. 202024 abr. 2020

Serie de la publicación

NombreThe Web Conference 2020 - Companion of the World Wide Web Conference, WWW 2020

Conferencia

Conferencia29th International World Wide Web Conference, WWW 2020
País/TerritorioTaiwán
CiudadTaipei
Período20/04/2024/04/20

Huella

Profundice en los temas de investigación de 'Individual vs. Group Violent Threats Classification in Online Discussions'. En conjunto forman una huella única.

Citar esto