Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts

Helena Gómez-Adorno, Ilia Markov, Grigori Sidorov, Juan Pablo Posadas-Durán, Miguel A. Sanchez-Perez, Liliana Chanona-Hernandez

Research output: Contribution to journalArticlepeer-review

35 Scopus citations

Abstract

We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.

Original languageEnglish
Article number1638936
JournalComputational Intelligence and Neuroscience
Volume2016
DOIs
StatePublished - 2016

Fingerprint

Dive into the research topics of 'Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts'. Together they form a unique fingerprint.

Cite this