TY - GEN
T1 - UrduFake@FIRE2020
T2 - 12th Annual Meeting of the Forum for Information Retrieval Evaluation, FIRE 2020
AU - Amjad, Maaz
AU - Sidorov, Grigori
AU - Zhila, Alisa
AU - Gelbukh, Alexander
AU - Rosso, Paolo
N1 - Publisher Copyright: © 2020 ACM.
PY - 2020/12/16
Y1 - 2020/12/16
AB - This paper gives an overview of the first shared task at FIRE 2020 on fake news detection in the Urdu language. This is a binary classification task in which the goal is to identify fake news using a dataset composed of 900 annotated news articles for training and 400 news articles for testing. The dataset contains news in five domains: (i) Health, (ii) Sports, (iii) Showbiz, (iv) Technology, and (v) Business. Forty-two teams from 6 different countries (India, China, Egypt, Germany, Pakistan, and the UK) registered for the task, and 9 teams submitted their experimental results. The participants used various machine learning methods, ranging from feature-based traditional machine learning to neural network techniques. The best-performing system achieved an F-score of 0.90, showing that the BERT-based approach outperforms other machine learning classifiers.
KW - Fake news detection
KW - Urdu language
KW - low resource languages
UR - http://www.scopus.com/inward/record.url?scp=85100401805&partnerID=8YFLogxK
U2 - 10.1145/3441501.3441541
DO - 10.1145/3441501.3441541
M3 - Conference contribution
AN - SCOPUS:85100401805
T3 - ACM International Conference Proceeding Series
SP - 37
EP - 40
BT - FIRE 2020 - Proceedings of the 12th Annual Meeting of the Forum for Information Retrieval Evaluation
A2 - Majumder, Prasenjit
A2 - Mitra, Mandar
A2 - Gangopadhyay, Surupendu
A2 - Mehta, Parth
PB - Association for Computing Machinery
Y2 - 16 December 2020 through 20 December 2020
ER -