Rule-based system for automatic grammar correction using syntactic n-grams for English Language Learning (L2)

Grigori Sidorov; Anubhav Gupta; Martin Tozer; Dolors Catala; Angels Catena; Sandrine Fuentes

Centro de Investigación en Computación (CIC)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

16 Scopus citations

Abstract

We describe the system developed for the CoNLL-2013 shared task: automatic English L2 grammar error correction. The system follows a rule-based approach. It uses very few additional resources: a morphological analyzer and a list of 250 common uncountable nouns, along with the training data provided by the organizers. The system uses the syntactic information available in the training data, represented as syntactic n-grams, i.e., n-grams extracted by following the paths in dependency trees. The system is simple and was developed in a short period of time (1 month). Since it employs no further resources and no sophisticated machine learning methods, it does not achieve high scores (specifically, it has low recall), but it could be considered a baseline system for the task. On the other hand, it shows what can be obtained using a simple rule-based approach and presents a few situations where the rule-based approach can perform better than an ML approach.
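The idea of syntactic n-grams mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `syntactic_ngrams`, the head-index encoding of the tree, and the example parse are all assumptions made for illustration. It simply enumerates head-to-dependent paths of a fixed length in a dependency tree.

```python
from typing import Dict, List, Tuple


def syntactic_ngrams(words: List[str], heads: List[int], n: int) -> List[Tuple[str, ...]]:
    """Return every head-to-dependent path of exactly n words.

    heads[i] is the index of the head of words[i], or -1 for the root.
    (Hypothetical encoding chosen for this sketch.)
    """
    # Build the list of dependents for each token.
    children: Dict[int, List[int]] = {i: [] for i in range(len(words))}
    for child, head in enumerate(heads):
        if head >= 0:
            children[head].append(child)

    def paths_from(node: int, length: int) -> List[List[int]]:
        # A path of length 1 is the node itself; longer paths descend to dependents.
        if length == 1:
            return [[node]]
        result: List[List[int]] = []
        for child in children[node]:
            for tail in paths_from(child, length - 1):
                result.append([node] + tail)
        return result

    ngrams: List[Tuple[str, ...]] = []
    for start in range(len(words)):
        for path in paths_from(start, n):
            ngrams.append(tuple(words[i] for i in path))
    return ngrams


# Assumed parse of "She reads many books":
# "reads" is the root, "She" and "books" depend on "reads", "many" depends on "books".
words = ["She", "reads", "many", "books"]
heads = [1, -1, 3, 1]

bigrams = syntactic_ngrams(words, heads, 2)
trigrams = syntactic_ngrams(words, heads, 3)
print(bigrams)
print(trigrams)
```

Note that the pair ("reads", "books") is produced even though the two words are not adjacent in the sentence, which is what distinguishes n-grams read off the tree from ordinary surface n-grams.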

Original language	English
Title of host publication	CoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task
Publisher	Association for Computational Linguistics (ACL)
Pages	96-101
Number of pages	6
ISBN (Electronic)	9781937284718
State	Published - 2013
Event	17th Conference on Computational Natural Language Learning: Shared Task, CoNLL 2013 - Sofia, Bulgaria (8 Aug 2013 → 9 Aug 2013)

Publication series

Name	CoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task

Conference

Conference	17th Conference on Computational Natural Language Learning: Shared Task, CoNLL 2013
Country/Territory	Bulgaria
City	Sofia
Period	8/08/13 → 9/08/13

Cite this

Sidorov, G., Gupta, A., Tozer, M., Catala, D., Catena, A., & Fuentes, S. (2013). Rule-based system for automatic grammar correction using syntactic n-grams for English Language Learning (L2). In CoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task (pp. 96-101). Association for Computational Linguistics (ACL).

@inproceedings{7ff64fae3f404c94944771774fafad13,
title = "Rule-based system for automatic grammar correction using syntactic n-grams for English Language Learning (L2)",
author = "Grigori Sidorov and Anubhav Gupta and Martin Tozer and Dolors Catala and Angels Catena and Sandrine Fuentes",
note = "Publisher Copyright: {\textcopyright} 2013 Association for Computational Linguistics.; 17th Conference on Computational Natural Language Learning: Shared Task, CoNLL 2013 ; Conference date: 08-08-2013 Through 09-08-2013",
year = "2013",
language = "English",
series = "CoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task",
publisher = "Association for Computational Linguistics (ACL)",
pages = "96--101",
booktitle = "CoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task",
}

TY - GEN
T1 - Rule-based system for automatic grammar correction using syntactic n-grams for English Language Learning (L2)
AU - Sidorov, Grigori
AU - Gupta, Anubhav
AU - Tozer, Martin
AU - Catala, Dolors
AU - Catena, Angels
AU - Fuentes, Sandrine
PY - 2013
Y1 - 2013
UR - http://www.scopus.com/inward/record.url?scp=84976450377&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84976450377
T3 - CoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task
SP - 96
EP - 101
BT - CoNLL 2013 - 17th Conference on Computational Natural Language Learning, Proceedings of the Shared Task
PB - Association for Computational Linguistics (ACL)
T2 - 17th Conference on Computational Natural Language Learning: Shared Task, CoNLL 2013
Y2 - 8 August 2013 through 9 August 2013
ER -
