Improving Context-Aware Neural Machine Translation Using Self-Attentive Sentence Embedding

Hyeongu Yun; Yongkeun Hwang; Kyomin Jung

doi:10.1609/aaai.v34i05.6494

Improving Context-Aware Neural Machine Translation Using Self-Attentive Sentence Embedding

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6494 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9498-9506 ◽

Cited By ~ 1

Author(s):

Hyeongu Yun ◽

Yongkeun Hwang ◽

Kyomin Jung

Keyword(s):

Machine Translation ◽

Contextual Information ◽

Context Aware ◽

Pronoun Resolution ◽

Test Set ◽

Neural Machine Translation ◽

Attentional Networks ◽

Multiple Context ◽

Sentence Level ◽

Level Information

Fully Attentional Networks (FAN) like Transformer (Vaswani et al. 2017) has shown superior results in Neural Machine Translation (NMT) tasks and has become a solid baseline for translation tasks. More recent studies also have reported experimental results that additional contextual sentences improve translation qualities of NMT models (Voita et al. 2018; Müller et al. 2018; Zhang et al. 2018). However, those studies have exploited multiple context sentences as a single long concatenated sentence, that may cause the models to suffer from inefficient computational complexities and long-range dependencies. In this paper, we propose Hierarchical Context Encoder (HCE) that is able to exploit multiple context sentences separately using the hierarchical FAN structure. Our proposed encoder first abstracts sentence-level information from preceding sentences in a self-attentive way, and then hierarchically encodes context-level information. Through extensive experiments, we observe that our HCE records the best performance measured in BLEU score on English-German, English-Turkish, and English-Korean corpus. In addition, we observe that our HCE records the best performance in a crowd-sourced test set which is designed to evaluate how well an encoder can exploit contextual information. Finally, evaluation on English-Korean pronoun resolution test suite also shows that our HCE can properly exploit contextual information.

Download Full-text

Context-Aware Neural Machine Translation for Korean Honorific Expressions

Electronics ◽

10.3390/electronics10131589 ◽

2021 ◽

Vol 10 (13) ◽

pp. 1589

Author(s):

Yongkeun Hwang ◽

Yanghoon Kim ◽

Kyomin Jung

Keyword(s):

Machine Translation ◽

Deep Neural Networks ◽

Contextual Information ◽

Context Aware ◽

Neural Machine Translation ◽

Translation Quality ◽

Sentence Level ◽

Proposed Model ◽

The Given ◽

The Relationship

Neural machine translation (NMT) is one of the text generation tasks which has achieved significant improvement with the rise of deep neural networks. However, language-specific problems such as handling the translation of honorifics received little attention. In this paper, we propose a context-aware NMT to promote translation improvements of Korean honorifics. By exploiting the information such as the relationship between speakers from the surrounding sentences, our proposed model effectively manages the use of honorific expressions. Specifically, we utilize a novel encoder architecture that can represent the contextual information of the given input sentences. Furthermore, a context-aware post-editing (CAPE) technique is adopted to refine a set of inconsistent sentence-level honorific translations. To demonstrate the efficacy of the proposed method, honorific-labeled test data is required. Thus, we also design a heuristic that labels Korean sentences to distinguish between honorific and non-honorific styles. Experimental results show that our proposed method outperforms sentence-level NMT baselines both in overall translation quality and honorific translations.

Download Full-text

Efficient Context-Aware Neural Machine Translation with Layer-Wise Weighting and Input-Aware Gating

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/544 ◽

2020 ◽

Author(s):

Hongfei Xu ◽

Deyi Xiong ◽

Josef van Genabith ◽

Qiuhui Liu

Keyword(s):

Machine Translation ◽

Contextual Information ◽

Computational Cost ◽

Representation Learning ◽

Vital Role ◽

Context Aware ◽

Neural Machine Translation ◽

Gating Mechanism ◽

Sentence Level ◽

Parallel Data

Existing Neural Machine Translation (NMT) systems are generally trained on a large amount of sentence-level parallel data, and during prediction sentences are independently translated, ignoring cross-sentence contextual information. This leads to inconsistency between translated sentences. In order to address this issue, context-aware models have been proposed. However, document-level parallel data constitutes only a small part of the parallel data available, and many approaches build context-aware models based on a pre-trained frozen sentence-level translation model in a two-step training manner. The computational cost of these approaches is usually high. In this paper, we propose to make the most of layers pre-trained on sentence-level data in contextual representation learning, reusing representations from the sentence-level Transformer and significantly reducing the cost of incorporating contexts in translation. We find that representations from shallow layers of a pre-trained sentence-level encoder play a vital role in source context encoding, and propose to perform source context encoding upon weighted combinations of pre-trained encoder layers' outputs. Instead of separately performing source context and input encoding, we propose to iteratively and jointly encode the source input and its contexts and to generate input-aware context representations with a cross-attention layer and a gating mechanism, which resets irrelevant information in context encoding. Our context-aware Transformer model outperforms the recent CADec [Voita et al., 2019c] on the English-Russian subtitle data and is about twice as fast in training and decoding.

Download Full-text

Enhancing Lexical Translation Consistency for Document-Level Neural Machine Translation

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3485469 ◽

2022 ◽

Vol 21 (3) ◽

pp. 1-21

Author(s):

Xiaomian Kang ◽

Yang Zhao ◽

Jiajun Zhang ◽

Chengqing Zong

Keyword(s):

Machine Translation ◽

English Translation ◽

Test Set ◽

Neural Machine Translation ◽

Global Context ◽

Translation Quality ◽

Sentence Level ◽

Document Level

Document-level neural machine translation (DocNMT) has yielded attractive improvements. In this article, we systematically analyze the discourse phenomena in Chinese-to-English translation, and focus on the most obvious ones, namely lexical translation consistency. To alleviate the lexical inconsistency, we propose an effective approach that is aware of the words which need to be translated consistently and constrains the model to produce more consistent translations. Specifically, we first introduce a global context extractor to extract the document context and consistency context, respectively. Then, the two types of global context are integrated into a encoder enhancer and a decoder enhancer to improve the lexical translation consistency. We create a test set to evaluate the lexical consistency automatically. Experiments demonstrate that our approach can significantly alleviate the lexical translation inconsistency. In addition, our approach can also substantially improve the translation quality compared to sentence-level Transformer.

Download Full-text

The Suboptimal WMT Test Sets and Its Impact on Human Parity

10.20944/preprints202110.0199.v1 ◽

2021 ◽

Author(s):

Ahrii Kim ◽

Yunju Bak ◽

Jimin Sun ◽

Sungwon Lyu ◽

Changmin Lee

Keyword(s):

Machine Translation ◽

Web Crawling ◽

Data Set ◽

Test Set ◽

Neural Machine Translation ◽

Sentence Level ◽

Test Sets ◽

Source Test

With the advent of Neural Machine Translation, the more the achievement of human-machine parity is claimed at WMT, the more we come to ask ourselves if their evaluation environment can be trusted. In this paper, we argue that the low quality of the source test set of the news track at WMT may lead to an overrated human parity claim. First of all, we report nine types of so-called technical contaminants in the data set, originated from an absence of meticulous inspection after web-crawling. Our empirical findings show that when they are corrected, about 5% of the segments that have previously achieved a human parity claim turn out to be statistically invalid. Such a tendency gets evident when the contaminated sentences are solely concerned. To the best of our knowledge, it is the first attempt to question the “source” side of the test set as a potential cause of the overclaim of human parity. We cast evidence for such phenomenon that according to sentence-level TER scores, those trivial errors change a good part of system translations. We conclude that to overlook it would be a mistake, especially when it comes to an NMT evaluation.

Download Full-text

A Large-Scale Test Set for the Evaluation of Context-Aware Pronoun Translation in Neural Machine Translation

10.18653/v1/w18-6307 ◽

2018 ◽

Cited By ~ 4

Author(s):

Mathias Müller ◽

Annette Rios ◽

Elena Voita ◽

Rico Sennrich

Keyword(s):

Machine Translation ◽

Large Scale ◽

Context Aware ◽

Test Set ◽

Neural Machine Translation ◽

Scale Test

Download Full-text

Context-Aware Neural Machine Translation Learns Anaphora Resolution

10.18653/v1/p18-1117 ◽

2018 ◽

Cited By ~ 3

Author(s):

Elena Voita ◽

Pavel Serdyukov ◽

Rico Sennrich ◽

Ivan Titov

Keyword(s):

Machine Translation ◽

Anaphora Resolution ◽

Context Aware ◽

Neural Machine Translation

Download Full-text

Towards Mitigating Gender Bias in a decoder-based Neural Machine Translation model by Adding Contextual Information

10.18653/v1/2020.winlp-1.25 ◽

2020 ◽

Author(s):

Christine Basta ◽

Marta R. Costa-jussà ◽

José A. R. Fonollosa

Keyword(s):

Machine Translation ◽

Gender Bias ◽

Contextual Information ◽

Neural Machine Translation ◽

Translation Model

Download Full-text

Sentence-Level Agreement for Neural Machine Translation

10.18653/v1/p19-1296 ◽

2019 ◽

Cited By ~ 3

Author(s):

Mingming Yang ◽

Rui Wang ◽

Kehai Chen ◽

Masao Utiyama ◽

Eiichiro Sumita ◽

...

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Sentence Level

Download Full-text

Improving Context-Aware Neural Machine Translation with Target-Side Context

Communications in Computer and Information Science - Computational Linguistics ◽

10.1007/978-981-15-6168-9_10 ◽

2020 ◽

pp. 112-122

Author(s):

Hayahide Yamagishi ◽

Mamoru Komachi

Keyword(s):

Machine Translation ◽

Context Aware ◽

Neural Machine Translation ◽

Target Side

Download Full-text

A study of BERT for context-aware neural machine translation

Machine Learning ◽

10.1007/s10994-021-06070-y ◽

2022 ◽

Author(s):

Xueqing Wu ◽

Yingce Xia ◽

Jinhua Zhu ◽

Lijun Wu ◽

Shufang Xie ◽

...

Keyword(s):

Machine Translation ◽

Context Aware ◽

Neural Machine Translation

Download Full-text