Cic-fbk approach to native language identification

Ilia Markov, Lingzhen Chen, Carlo Strapparava, Grigori Sidorov

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

17 Citas (Scopus)

Resumen

We present the CIC-FBK system, which took part in the Native Language Identification (NLI) Shared Task 2017. Our approach combines features commonly used in previous NLI research, i.e., word n-grams, lemma n-grams, part-of-speech n-grams, and function words, with recently introduced character n-grams from misspelled words, and features that are novel in this task, such as typed character n-grams, and syntactic n-grams of words and of syntactic relation tags. We use log-entropy weighting scheme and perform classification using the Support Vector Machines (SVM) algorithm. Our system achieved 0.8808 macro-averaged F1-score and shared the 1st rank in the NLI Shared Task 2017 scoring.

Idioma originalInglés
Título de la publicación alojadaEMNLP 2017 - 12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017 - Proceedings of the Workshop
EditorialAssociation for Computational Linguistics (ACL)
Páginas374-381
Número de páginas8
ISBN (versión digital)9781945626852
EstadoPublicada - 2017
Evento12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017, held in conjunction with EMNLP 2017 - Copenhagen, Dinamarca
Duración: 8 sep. 2017 → …

Serie de la publicación

NombreEMNLP 2017 - 12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017 - Proceedings of the Workshop

Conferencia

Conferencia12th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2017, held in conjunction with EMNLP 2017
País/TerritorioDinamarca
CiudadCopenhagen
Período8/09/17 → …

Huella

Profundice en los temas de investigación de 'Cic-fbk approach to native language identification'. En conjunto forman una huella única.

Citar esto