TY - GEN
T1 - Syntactic dependency-based n-grams
T2 - 14th Annual Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2013
AU - Sidorov, Grigori
AU - Velasquez, Francisco
AU - Stamatatos, Efstathios
AU - Gelbukh, Alexander
AU - Chanona-Hernández, Liliana
PY - 2013
Y1 - 2013
N2 - The paper introduces and discusses a concept of syntactic n-grams (sn-grams) that can be applied instead of traditional n-grams in many NLP tasks. Sn-grams are constructed by following paths in syntactic trees, so sn-grams allow bringing syntactic knowledge into machine learning methods. Still, previous parsing is necessary for their construction. We applied sn-grams in the task of authorship attribution for corpora of three and seven authors with very promising results.
AB - The paper introduces and discusses a concept of syntactic n-grams (sn-grams) that can be applied instead of traditional n-grams in many NLP tasks. Sn-grams are constructed by following paths in syntactic trees, so sn-grams allow bringing syntactic knowledge into machine learning methods. Still, previous parsing is necessary for their construction. We applied sn-grams in the task of authorship attribution for corpora of three and seven authors with very promising results.
KW - SVM classifier
KW - Syntactic n-grams
KW - authorship attribution task
KW - sn-grams
KW - syntactic paths
UR - http://www.scopus.com/inward/record.url?scp=84875511202&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-37247-6_2
DO - 10.1007/978-3-642-37247-6_2
M3 - Contribución a la conferencia
SN - 9783642372469
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 13
EP - 24
BT - Computational Linguistics and Intelligent Text Processing - 14th International Conference, CICLing 2013, Proceedings
Y2 - 24 March 2013 through 30 March 2013
ER -