Transfer Learning from Multilingual DeBERTa for Sexism Identification

Hoang Thang Ta, Abu Bakar Siddiqur Rahman, Lotfollah Najjar, Alexander Gelbukh

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

In this paper, we address the Task 1 and Task 2 of the EXIST 2022 in detecting sexism in a broad sense, from ideological inequality, sexual violence, misogyny to other expressions that involve implicit sexist behaviours in social networks. We apply transfer learning from a pre-trained multilingual DeBERTa (mDeBERTa) model and its zero classification to gain a better performance than BERT-based approaches. Lastly, we combine all 3 methods: mDeBERTa, zero classification, and BERT for majority vote. For Task 1, mDeBERTa is the best method with an accuracy of 76.09% and F1 of 76.08%. Meanwhile, an accuracy of 66.26% and F1 of 47.06% are the best results in Task2, when using majority vote. Our main contribution is to use DeBERTa and zero classification with designing only one classifier in sexism identification.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume3202
StatePublished - 2022
Event2022 Iberian Languages Evaluation Forum, IberLEF 2022 - A Coruna, Spain
Duration: 20 Sep 2022 → …

Keywords

  • DeBERTa
  • EXIST 2022
  • IberLEF
  • Offensive Language
  • Sexism Identification
  • Text Classification

Fingerprint

Dive into the research topics of 'Transfer Learning from Multilingual DeBERTa for Sexism Identification'. Together they form a unique fingerprint.

Cite this