TY - JOUR
T1 - Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts
AU - Gómez-Adorno, Helena
AU - Markov, Ilia
AU - Sidorov, Grigori
AU - Posadas-Durán, Juan Pablo
AU - Sanchez-Perez, Miguel A.
AU - Chanona-Hernandez, Liliana
N1 - Publisher Copyright:
© 2016 Helena Gómez-Adorno et al.
PY - 2016
Y1 - 2016
N2 - We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.
AB - We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.
UR - http://www.scopus.com/inward/record.url?scp=84992213775&partnerID=8YFLogxK
U2 - 10.1155/2016/1638936
DO - 10.1155/2016/1638936
M3 - Artículo
SN - 1687-5265
VL - 2016
JO - Computational Intelligence and Neuroscience
JF - Computational Intelligence and Neuroscience
M1 - 1638936
ER -