How Far are We from Fully Automatic High Quality Grammatical Error Correction?

Grammatical error correction (GEC) is an important application aspect of natural language processing techniques, and GEC system is a kind of very important intelligent system that has long been explored both in academic and industrial communities. The past decade has witnessed significant progress achieved in GEC for the sake of increasing popularity of machine learning and deep learning. However, there is not a survey that untangles the large amount of research works and progress in this field. We present the first survey in GEC for a comprehensive retrospective of the literature in this area. We first give the definition of GEC task and introduce the public datasets and data annotation schema. After that, we discuss six kinds of basic approaches, six commonly applied performance boosting techniques for GEC systems, and three data augmentation methods. Since GEC is typically viewed as a sister task of Machine Translation (MT), we put more emphasis on the statistical machine translation (SMT)-based approaches and neural machine translation (NMT)-based approaches for the sake of their importance. Similarly, some performance-boosting techniques are adapted from MT and are successfully combined with GEC systems for enhancement on the final performance. More importantly, after the introduction of the evaluation in GEC, we make an in-depth analysis based on empirical results in aspects of GEC approaches and GEC systems for a clearer pattern of progress in GEC, where error type analysis and system recapitulation are clearly presented. Finally, we discuss five prospective directions for future GEC researches.

Download Full-text

Phrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction

10.18653/v1/d16-1161 ◽

2016 ◽

Cited By ~ 10

Author(s):

Marcin Junczys-Dowmunt ◽

Roman Grundkiewicz

Keyword(s):

Error Correction ◽

Machine Translation ◽

State Of The Art ◽

Grammatical Error

Download Full-text

Automatic Metric Validation for Grammatical Error Correction

10.18653/v1/p18-1127 ◽

2018 ◽

Cited By ~ 1

Author(s):

Leshem Choshen ◽

Omri Abend

Keyword(s):

Error Correction ◽

Grammatical Error

Download Full-text

Do Grammatical Error Correction Models Realize Grammatical Generalization?

Journal of Natural Language Processing ◽

10.5715/jnlp.28.1331 ◽

2021 ◽

Vol 28 (4) ◽

pp. 1331-1335

Author(s):

Masato Mita

Keyword(s):

Error Correction ◽

Error Correction Models ◽

Grammatical Error

Download Full-text

GECToR – Grammatical Error Correction: Tag, Not Rewrite

10.18653/v1/2020.bea-1.16 ◽

2020 ◽

Cited By ~ 1

Author(s):

Kostiantyn Omelianchuk ◽

Vitaliy Atrasevych ◽

Artem Chernodub ◽

Oleksandr Skurzhanskyi

Keyword(s):

Error Correction ◽

Grammatical Error

Download Full-text

Efficient Grammatical Error Correction with Hierarchical Error Detections and Correction

10.1109/icws53863.2021.00073 ◽

2021 ◽

Author(s):

Fayu Pan ◽

Bin Cao

Keyword(s):

Error Correction ◽

Grammatical Error

Download Full-text

Indexing The Canadian Encyclopedia second edition

The Indexer The International Journal of Indexing ◽

10.3828/indexer.1988.16.2.6 ◽

1988 ◽

Vol 16 (2) ◽

pp. 87-91

Author(s):

Ron Gardner ◽

Eve Gardner

Keyword(s):

Personal Computer ◽

Data Entry ◽

Stanford University ◽

High Quality ◽

Fully Automatic ◽

Ibm Pc

The index to the second edition of The Canadian Encyclopedia was prepared on an ibm pc-xt personal computer using the ibm Professional Editor, custom programs written in ibm Pascal, and TEX1 a high quality typesetting program developed by Donald E. Knuth of Stanford University in California. The entries were chosen and keyed in by the indexer, and then alphabetized, formatted, and typeset by the computer. An unusual data entry format together with good typesetting software made possible the delivery to the printer of the 372 page camera ready index less than two weeks after the indexer received the final pages of text. TEX (which rhymes with ‘blecchhh’ not the letter ‘X’) provides fully automatic pagination.

Download Full-text