Sexism identification using bert and data augmentation - Exist2021

Sabur Butt, Noman Ashraf, Grigori Sidorov, Alexander Gelbukh

Producción científica: Contribución a una revistaArtículo de la conferenciarevisión exhaustiva

13 Citas (Scopus)

Resumen

Sexism is defined as discrimination among females of all ages. We have seen a rise of sexism in social media platforms manifesting itself in many forms. The paper presents best performing machine learning and deep learning algorithms as well as BERT results on \sEXism Identi_cation in Social neTworks (EXIST 2021)" shared task. The task incorporates multilingual dataset containing both Spanish and English tweets. The multilingual nature of the dataset and inconsistencies of the social media text makes it a challenging problem. Considering these challenges the paper focuses on the pre-processing techniques and data augmentation to boost results on various machine learning and deep learning methods. We achieved an F1 score of 78.02% on the sexism identification task (task 1) and F1 score of 49.08% on the sexism categorization task (task 2).

Idioma originalInglés
Páginas (desde-hasta)381-389
Número de páginas9
PublicaciónCEUR Workshop Proceedings
Volumen2943
EstadoPublicada - 2021
Evento2021 Iberian Languages Evaluation Forum, IberLEF 2021 - Virtual, Malaga, Espana
Duración: 21 sep. 2021 → …

Huella

Profundice en los temas de investigación de 'Sexism identification using bert and data augmentation - Exist2021'. En conjunto forman una huella única.

Citar esto