Youdao’s Winning Solution to the NLPCC-2018 Task 2 Challenge: A Neural Machine Translation Approach to Chinese Grammatical Error Correction

Grammatical error correction (GEC) is an important application aspect of natural language processing techniques, and GEC system is a kind of very important intelligent system that has long been explored both in academic and industrial communities. The past decade has witnessed significant progress achieved in GEC for the sake of increasing popularity of machine learning and deep learning. However, there is not a survey that untangles the large amount of research works and progress in this field. We present the first survey in GEC for a comprehensive retrospective of the literature in this area. We first give the definition of GEC task and introduce the public datasets and data annotation schema. After that, we discuss six kinds of basic approaches, six commonly applied performance boosting techniques for GEC systems, and three data augmentation methods. Since GEC is typically viewed as a sister task of Machine Translation (MT), we put more emphasis on the statistical machine translation (SMT)-based approaches and neural machine translation (NMT)-based approaches for the sake of their importance. Similarly, some performance-boosting techniques are adapted from MT and are successfully combined with GEC systems for enhancement on the final performance. More importantly, after the introduction of the evaluation in GEC, we make an in-depth analysis based on empirical results in aspects of GEC approaches and GEC systems for a clearer pattern of progress in GEC, where error type analysis and system recapitulation are clearly presented. Finally, we discuss five prospective directions for future GEC researches.

Download Full-text

Phrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction

10.18653/v1/d16-1161 ◽

2016 ◽

Cited By ~ 10

Author(s):

Marcin Junczys-Dowmunt ◽

Roman Grundkiewicz

Keyword(s):

Error Correction ◽

Machine Translation ◽

State Of The Art ◽

Grammatical Error

Download Full-text

Discriminative Reranking for Grammatical Error Correction with Statistical Machine Translation

10.18653/v1/n16-1133 ◽

2016 ◽

Cited By ~ 2

Author(s):

Tomoya Mizumoto ◽

Yuji Matsumoto

Keyword(s):

Error Correction ◽

Machine Translation ◽

Statistical Machine Translation ◽

Grammatical Error

Download Full-text

Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task

10.18653/v1/n18-1055 ◽

2018 ◽

Cited By ~ 11

Author(s):

Marcin Junczys-Dowmunt ◽

Roman Grundkiewicz ◽

Shubha Guha ◽

Kenneth Heafield

Keyword(s):

Error Correction ◽

Machine Translation ◽

Low Resource ◽

Grammatical Error

Download Full-text

Near Human-Level Performance in Grammatical Error Correction with Hybrid Machine Translation

10.18653/v1/n18-2046 ◽

2018 ◽

Cited By ~ 7

Author(s):

Roman Grundkiewicz ◽

Marcin Junczys-Dowmunt

Keyword(s):

Error Correction ◽

Machine Translation ◽

Grammatical Error ◽

Hybrid Machine ◽

Hybrid Machine Translation ◽

Level Performance

Download Full-text

Grammatical Error Correction: Machine Translation and Classifiers

10.18653/v1/p16-1208 ◽

2016 ◽

Cited By ~ 6

Author(s):

Alla Rozovskaya ◽

Dan Roth

Keyword(s):

Error Correction ◽

Machine Translation ◽

Grammatical Error

Download Full-text

An Approach to NMT Re-Ranking Using Sequence-Labeling for Grammatical Error Correction

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2020.p0557 ◽

2020 ◽

Vol 24 (4) ◽

pp. 557-567

Author(s):

Bo Wang ◽

◽

Kaoru Hirota ◽

Chang Liu ◽

Yaping Dai ◽

...

Keyword(s):

Error Correction ◽

Short Term Memory ◽

Conditional Random Field ◽

Short Term ◽

Neural Machine Translation ◽

Term Memory ◽

Sequence Labeling ◽

Grammatical Error ◽

Small Set ◽

Long Short Term Memory

An approach to N-best hypotheses re-ranking using a sequence-labeling model is applied to resolve the data deficiency problem in Grammatical Error Correction (GEC). Multiple candidate sentences are generated using a Neural Machine Translation (NMT) model; thereafter, these sentences are re-ranked via a stacked Transformer following a Bidirectional Long Short-Term Memory (BiLSTM) with Conditional Random Field (CRF). Correlations within the sentences are extracted using the sequence-labeling model based on the Transformer, which is particularly suitable for long sentences. Meanwhile, the knowledge from a large amount of unlabeled data is acquired through the pre-trained structure. Thus, completely revised sentences are adopted instead of partially modified sentences. Compared with conventional NMT, experiments on the NUCLE and FCE datasets demonstrate that the model improves the F0.5 score by 8.22% and 2.09%, respectively. As an advantage, the proposed re-ranking method has the advantage of only requires a small set of easily computed features that do not need linguistic inputs.

Download Full-text