MUCIC@TamilNLP-ACL2022: Abusive Comment Detection in Tamil Language using 1D Conv-LSTM

F. Balouchzahi, M. D. Anusha, H. L. Shashirekha, G. Sidorov

Research output: Chapter in book/report/conference proceeding; Conference contribution; peer-reviewed

4 Citations (Scopus)

Abstract

Abusive language such as hate speech, profanity, and cyberbullying is common on online platforms and creates many problems for users as well as policy makers. Hence, detecting such abusive language in user-generated online content has become increasingly important over the past few years. Online platforms strive to moderate abusive content to reduce societal harm, comply with laws, and create a more inclusive environment for their users. Despite the various methods available to automatically detect abusive language on online platforms, the problem persists. To address the automatic detection of abusive language on online platforms, this paper describes the models submitted by our team, MUCIC, to the shared task on "Abusive Comment Detection in Tamil-ACL 2022". This shared task addresses abusive comment detection in native Tamil script texts and code-mixed Tamil texts. To address this challenge, two models were submitted: i) an n-gram-Multilayer Perceptron (n-gram-MLP) model, which uses an MLP classifier fed with character n-gram features, and ii) a 1D Convolutional Long Short-Term Memory (1D Conv-LSTM) model. Of the two, the n-gram-MLP model performed better, with weighted F1-scores of 0.560 and 0.430 for code-mixed Tamil and native Tamil script texts, respectively. This work may be reproduced using the code available on GitHub.
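
As a rough illustration of two terms used in the abstract, the sketch below shows how character n-gram features can be counted and how a weighted F1-score is computed. It is not the authors' code: the function names `char_ngrams` and `weighted_f1`, the n-gram range of 1 to 3, and the toy labels are all assumptions made for this example, and the MLP classifier itself is omitted.

```python
from collections import Counter


def char_ngrams(text, n_min=1, n_max=3):
    """Count every character n-gram of length n_min..n_max in text.

    The default range is an assumption for illustration only.
    """
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def weighted_f1(y_true, y_pred):
    """Average the per-class F1, weighting each class by its support in y_true."""
    total = len(y_true)
    score = 0.0
    for label, support in Counter(y_true).items():
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = support - tp
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += f1 * support / total
    return score


# Toy usage with hypothetical labels (not data from the shared task).
features = char_ngrams("abc", 1, 2)
score = weighted_f1(["abusive", "abusive", "none"], ["abusive", "none", "none"])
```

Because the weights are the class supports, frequent classes dominate the score, which matters on imbalanced label sets such as those typical of abusive-comment data.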

Original language: English
Title of host publication: DravidianLangTech 2022 - 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop
Editors: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar Madasamy, Parameswari Krishnamurthy, Elizabeth Sherly, Sinnathamby Mahesan
Publisher: Association for Computational Linguistics (ACL)
Pages: 64-69
Number of pages: 6
ISBN (electronic): 9781955917346
Publication status: Published - 2022
Event: 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop, DravidianLangTech 2022 - Dublin, Ireland
Duration: 26 May 2022 → …

Publication series

Name: DravidianLangTech 2022 - 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop

Conference

Conference: 2nd Workshop on Speech and Language Technologies for Dravidian Languages, Proceedings of the Workshop, DravidianLangTech 2022
Country/Territory: Ireland
City: Dublin
Period: 26/05/22 → …
