TY - JOUR
T1 - Authorship attribution through punctuation n-grams and averaged combination of SVM: Notebook for PAN at CLEF 2019
AU - Martín-Del-Campo-Rodríguez, Carolina
AU - Pérez Alvarez, Daniel Alejandro
AU - Maldonado Sifuentes, Christian Efraín
AU - Sidorov, Grigori
AU - Batyrshin, Ildar
AU - Gelbukh, Alexander
N1 - Publisher Copyright:
© 2019 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CLEF 2019, 9-12 September 2019, Lugano, Switzerland.
PY - 2019
Y1 - 2019
N2 - This work explores pre-processing, feature extraction, and the averaged combination of Support Vector Machine (SVM) outputs for the open-set Cross-Domain Authorship Attribution task. Punctuation n-grams are introduced as a document feature representation for Authorship Attribution, in combination with traditional character n-grams. Starting from different feature representations of a document, several SVMs are trained, each estimating the probability that the document belongs to a given author; the SVM outputs are then averaged. This approach obtained a Macro F1-score of 0.642 in the PAN 2019 open-set Cross-Domain Authorship Attribution contest.
AB - This work explores pre-processing, feature extraction, and the averaged combination of Support Vector Machine (SVM) outputs for the open-set Cross-Domain Authorship Attribution task. Punctuation n-grams are introduced as a document feature representation for Authorship Attribution, in combination with traditional character n-grams. Starting from different feature representations of a document, several SVMs are trained, each estimating the probability that the document belongs to a given author; the SVM outputs are then averaged. This approach obtained a Macro F1-score of 0.642 in the PAN 2019 open-set Cross-Domain Authorship Attribution contest.
UR - http://www.scopus.com/inward/record.url?scp=85070524753&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85070524753
SN - 1613-0073
VL - 2380
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 20th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2019
Y2 - 9 September 2019 through 12 September 2019
ER -