Author profiling with doc2vec neural network-based document embeddings

Ilia Markov, Helena Gómez-Adorno, Juan Pablo Posadas-Durán, Grigori Sidorov, Alexander Gelbukh

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

34 Citas (Scopus)

Resumen

To determine author demographics of texts in social media such as Twitter, blogs, and reviews, we use doc2vec document embeddings to train a logistic regression classifier. We experimented with age and gender identification on the PAN author profiling 2014–2016 corpora under both single- and cross-genre conditions. We show that under certain settings the neural network-based features outperform the traditional features when using the same classifier. Our method outperforms existing state of the art under some settings, though the current state-of-the-art results on those tasks have been quite weak.

Idioma originalInglés
Título de la publicación alojadaAdvances in Soft Computing - 15th Mexican International Conference on Artificial Intelligence, MICAI 2016, Proceedings
EditoresObdulia Pichardo-Lagunas, Sabino Miranda-Jimenez
EditorialSpringer Verlag
Páginas117-131
Número de páginas15
ISBN (versión impresa)9783319624273
DOI
EstadoPublicada - 2017
Evento15th Mexican International Conference on Artificial Intelligence, MICAI 2016 - Cancun, México
Duración: 23 oct. 201628 oct. 2016

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen10062 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia15th Mexican International Conference on Artificial Intelligence, MICAI 2016
País/TerritorioMéxico
CiudadCancun
Período23/10/1628/10/16

Huella

Profundice en los temas de investigación de 'Author profiling with doc2vec neural network-based document embeddings'. En conjunto forman una huella única.

Citar esto