CoNLL 2014 Shared Task: Grammatical Error Correction with a Syntactic N-gram Language Model from a Big Corpora

2020 ◽

Vol 34 (10) ◽

pp. 13859-13860

Author(s):

Yiyuan Li ◽

Antonios Anastasopoulos ◽

Alan W. Black

Keyword(s):

Error Correction ◽

Contextual Information ◽

Language Model ◽

Sequence Generation ◽

Strong Potential ◽

Grammatical Error

Current grammatical error correction (GEC) models typically consider the task as sequence generation, which requires large amounts of annotated data and limit the applications in data-limited settings. We try to incorporate contextual information from pre-trained language model to leverage annotation and benefit multilingual scenarios. Results show strong potential of Bidirectional Encoder Representations from Transformers (BERT) in grammatical error correction task.

Download Full-text

Adoption of a Neural Language Model in an Encoder for Encoder-Decoder based Korean Grammatical Error Correction

KIISE Transactions on Computing Practices ◽

10.5626/ktcp.2018.24.6.301 ◽

2018 ◽

Vol 24 (6) ◽

pp. 301-306

Author(s):

Seung Woo Cho ◽

Hong-seok Kwon ◽

Hun-young Jung ◽

Jong-Hyeok Lee

Keyword(s):

Error Correction ◽

Language Model ◽

Grammatical Error

Download Full-text

The AMU System in the CoNLL-2014 Shared Task: Grammatical Error Correction by Data-Intensive and Feature-Rich Statistical Machine Translation

10.3115/v1/w14-1703 ◽

2014 ◽

Cited By ~ 8

Author(s):

Marcin Junczys-Dowmunt ◽

Roman Grundkiewicz

Keyword(s):

Error Correction ◽

Machine Translation ◽

Statistical Machine Translation ◽

Shared Task ◽

Data Intensive ◽

Grammatical Error

Download Full-text

Language Model Based Grammatical Error Correction without Annotated Training Data

10.18653/v1/w18-0529 ◽

2018 ◽

Cited By ~ 5

Author(s):

Christopher Bryant ◽

Ted Briscoe

Keyword(s):

Error Correction ◽

Language Model ◽

Training Data ◽

Model Based ◽

Grammatical Error

Download Full-text

The BEA-2019 Shared Task on Grammatical Error Correction

10.18653/v1/w19-4406 ◽

2019 ◽

Cited By ~ 8

Author(s):

Christopher Bryant ◽

Mariano Felice ◽

Øistein E. Andersen ◽

Ted Briscoe

Keyword(s):

Error Correction ◽

Shared Task ◽

Grammatical Error

Download Full-text

Building a State-of-the-Art Grammatical Error Correction System

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00193 ◽

2014 ◽

Vol 2 ◽

pp. 419-434 ◽

Cited By ~ 3

Author(s):

Alla Rozovskaya ◽

Dan Roth

Keyword(s):

Error Correction ◽

State Of The Art ◽

Design Principles ◽

Shared Task ◽

Grammatical Error ◽

Different Types ◽

Correction System

This paper identifies and examines the key principles underlying building a state-of-the-art grammatical error correction system. We do this by analyzing the Illinois system that placed first among seventeen teams in the recent CoNLL-2013 shared task on grammatical error correction. The system focuses on five different types of errors common among non-native English writers. We describe four design principles that are relevant for correcting all of these errors, analyze the system along these dimensions, and show how each of these dimensions contributes to the performance.

Download Full-text

POSTECH Grammatical Error Correction System in the CoNLL-2014 Shared Task

10.3115/v1/w14-1709 ◽

2014 ◽

Cited By ~ 1

Author(s):

Kyusong Lee ◽

Gary Geunbae Lee

Keyword(s):

Error Correction ◽

Shared Task ◽

Grammatical Error ◽

Correction System

Download Full-text

Data Weighted Training Strategies for Grammatical Error Correction

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00336 ◽

2020 ◽

Vol 8 ◽

pp. 634-646

Author(s):

Jared Lichtarge ◽

Chris Alberti ◽

Shankar Kumar

Keyword(s):

Error Correction ◽

Recent Progress ◽

State Of The Art ◽

Shared Task ◽

Neural Machine Translation ◽

Training Schedule ◽

New Methods ◽

Grammatical Error ◽

Test Sets ◽

Shed Light

Recent progress in the task of Grammatical Error Correction (GEC) has been driven by addressing data sparsity, both through new methods for generating large and noisy pretraining data and through the publication of small and higher-quality finetuning data in the BEA-2019 shared task. Building upon recent work in Neural Machine Translation (NMT), we make use of both kinds of data by deriving example-level scores on our large pretraining data based on a smaller, higher-quality dataset. In this work, we perform an empirical study to discover how to best incorporate delta-log-perplexity, a type of example scoring, into a training schedule for GEC. In doing so, we perform experiments that shed light on the function and applicability of delta-log-perplexity. Models trained on scored data achieve state- of-the-art results on common GEC test sets.

Download Full-text