TY - GEN
T1 - Author profiling with doc2vec neural network-based document embeddings
AU - Markov, Ilia
AU - Gómez-Adorno, Helena
AU - Posadas-Durán, Juan Pablo
AU - Sidorov, Grigori
AU - Gelbukh, Alexander
N1 - Publisher Copyright:
© Springer International Publishing AG 2017.
PY - 2017
Y1 - 2017
N2 - To determine author demographics of texts in social media such as Twitter, blogs, and reviews, we use doc2vec document embeddings to train a logistic regression classifier. We experimented with age and gender identification on the PAN author profiling 2014–2016 corpora under both single- and cross-genre conditions. We show that under certain settings the neural network-based features outperform the traditional features when using the same classifier. Our method outperforms existing state of the art under some settings, though the current state-of-the-art results on those tasks have been quite weak.
AB - To determine author demographics of texts in social media such as Twitter, blogs, and reviews, we use doc2vec document embeddings to train a logistic regression classifier. We experimented with age and gender identification on the PAN author profiling 2014–2016 corpora under both single- and cross-genre conditions. We show that under certain settings the neural network-based features outperform the traditional features when using the same classifier. Our method outperforms existing state of the art under some settings, though the current state-of-the-art results on those tasks have been quite weak.
KW - Author profiling
KW - Document embeddings
KW - Machine learning
KW - Neural networks
KW - doc2vec
UR - http://www.scopus.com/inward/record.url?scp=85028474130&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-62428-0_9
DO - 10.1007/978-3-319-62428-0_9
M3 - Contribución a la conferencia
SN - 9783319624273
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 117
EP - 131
BT - Advances in Soft Computing - 15th Mexican International Conference on Artificial Intelligence, MICAI 2016, Proceedings
A2 - Pichardo-Lagunas, Obdulia
A2 - Miranda-Jimenez, Sabino
PB - Springer Verlag
T2 - 15th Mexican International Conference on Artificial Intelligence, MICAI 2016
Y2 - 23 October 2016 through 28 October 2016
ER -