Bots and gender profiling using character bigrams notebook for PAN at CLEF 2019

Daniel Yacob Espinosa, Helena Gómez-Adorno, Grigori Sidorov

Research output: Contribution to journalConference articlepeer-review

3 Scopus citations

Abstract

This paper describes our approach to tackle the Author Profiling task at PAN 2019. The objective is to distinguish between bot and human users and for human users it is also necessary to detect their gender. We are given only Twitter messages in two languages (Spanish and English). Our preprocessing stage includes data cleaning as well as the extraction of features using character bi-grams. We experimented with several feature representations and machine learning algorithms (Support Vector Machines (SVM) from libSVM). For both languages we use the same methods of feature extraction and classification.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume2380
StatePublished - 2019
Event20th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2019 - Lugano, Switzerland
Duration: 9 Sep 201912 Sep 2019

Fingerprint

Dive into the research topics of 'Bots and gender profiling using character bigrams notebook for PAN at CLEF 2019'. Together they form a unique fingerprint.

Cite this