Rule-based system for automatic grammar correction using syntactic n-grams for english Language Learning (L2)

Grigori Sidorov, Anubhav Gupta, Martin Tozer, Dolors Catala, Angels Catena, Sandrine Fuentes

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

16 Scopus citations

Abstract

We describe the system developed for the CoNLL-2013 shared task—automatic English L2 grammar error correction. The system is based on the rule-based approach. It uses very few additional resources: a morphological analyzer and a list of 250 common uncountable nouns, along with the training data provided by the organizers. The system uses the syntactic information available in the training data: this information is represented as syntactic n-grams, i.e. n-grams extracted by following the paths in dependency trees. The system is simple and was developed in a short period of time (1 month). Since it does not employ any additional resources or any sophisticated machine learning methods, it does not achieve high scores (specifically, it has low recall) but could be considered as a baseline system for the task. On the other hand, it shows what can be obtained using a simple rule-based approach and presents a few situations where the rule-based approach can perform better than ML approach.

Original languageEnglish
Title of host publicationCoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task
PublisherAssociation for Computational Linguistics (ACL)
Pages96-101
Number of pages6
ISBN (Electronic)9781937284718
StatePublished - 2013
Event17th Conference on Computational Natural Language Learning: Shared Task, CoNLL 2013 - Sofia, Bulgaria
Duration: 8 Aug 20139 Aug 2013

Publication series

NameCoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task

Conference

Conference17th Conference on Computational Natural Language Learning: Shared Task, CoNLL 2013
Country/TerritoryBulgaria
CitySofia
Period8/08/139/08/13

Fingerprint

Dive into the research topics of 'Rule-based system for automatic grammar correction using syntactic n-grams for english Language Learning (L2)'. Together they form a unique fingerprint.

Cite this