Effect of linguistic information in neural machine translation

2017 International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA) ◽

10.1109/icaicta.2017.8090975 ◽

2017 ◽

Author(s):

Naomichi Nakamura ◽

Hitoshi Isahara

Keyword(s):

Machine Translation ◽

Linguistic Information ◽

Neural Machine Translation

Download Full-text

On the Linguistic Representational Power of Neural Machine Translation Models

Computational Linguistics ◽

10.1162/coli_a_00367 ◽

2020 ◽

Vol 46 (1) ◽

pp. 1-52

Author(s):

Yonatan Belinkov ◽

Nadir Durrani ◽

Fahim Dalvi ◽

Hassan Sajjad ◽

James Glass

Keyword(s):

Machine Translation ◽

Language Processing ◽

Lexical Semantics ◽

Linguistic Information ◽

Neural Machine Translation ◽

Recent Success ◽

Part Of Speech ◽

Morphologically Rich Languages ◽

Representational Power ◽

Semantic Dependencies

Despite the recent success of deep neural networks in natural language processing and other spheres of artificial intelligence, their interpretability remains a challenge. We analyze the representations learned by neural machine translation (NMT) models at various levels of granularity and evaluate their quality through relevant extrinsic properties. In particular, we seek answers to the following questions: (i) How accurately is word structure captured within the learned representations, which is an important aspect in translating morphologically rich languages? (ii) Do the representations capture long-range dependencies, and effectively handle syntactically divergent languages? (iii) Do the representations capture lexical semantics? We conduct a thorough investigation along several parameters: (i) Which layers in the architecture capture each of these linguistic phenomena; (ii) How does the choice of translation unit (word, character, or subword unit) impact the linguistic properties captured by the underlying representations? (iii) Do the encoder and decoder learn differently and independently? (iv) Do the representations learned by multilingual NMT models capture the same amount of linguistic information as their bilingual counterparts? Our data-driven, quantitative evaluation illuminates important aspects in NMT models and their ability to capture various linguistic phenomena. We show that deep NMT models trained in an end-to-end fashion, without being provided any direct supervision during the training process, learn a non-trivial amount of linguistic information. Notable findings include the following observations: (i) Word morphology and part-of-speech information are captured at the lower layers of the model; (ii) In contrast, lexical semantics or non-local syntactic and semantic dependencies are better represented at the higher layers of the model; (iii) Representations learned using characters are more informed about word-morphology compared to those learned using subword units; and (iv) Representations learned by multilingual models are richer compared to bilingual models.

Download Full-text

Better Neural Machine Translation by Extracting Linguistic Information from BERT

10.18653/v1/2021.eacl-main.241 ◽

2021 ◽

Author(s):

Hassan S. Shavarani ◽

Anoop Sarkar

Keyword(s):

Machine Translation ◽

Linguistic Information ◽

Neural Machine Translation

Download Full-text

An Empirical Study on Learning Bug-Fixing Patches in the Wild via Neural Machine Translation

ACM Transactions on Software Engineering and Methodology ◽

10.1145/3340544 ◽

2019 ◽

Vol 28 (4) ◽

pp. 1-29 ◽

Author(s):

Michele Tufano ◽

Cody Watson ◽

Gabriele Bavota ◽

Massimiliano Di Penta ◽

Martin White ◽

...

Keyword(s):

Empirical Study ◽

Machine Translation ◽

Neural Machine Translation ◽

Download Full-text

Neural Machine Translation for Semantic-Driven Q&A Systems in the Factory Planning

Procedia CIRP ◽

10.1016/j.procir.2021.01.044 ◽

2021 ◽

Vol 96 ◽

pp. 9-14

Author(s):

Uwe Dombrowski ◽

Alexander Reiswich ◽

Raphael Lamprecht

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Factory Planning

Download Full-text

A Neural Machine Translation Approach for Translating Malay Parliament Hansard to English Text

2020 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp51396.2020.9310470 ◽

2020 ◽

Author(s):

Yu-Zane Low ◽

Lay-Ki Soon ◽

Shageenderan Sapai

Keyword(s):

Machine Translation ◽

English Text ◽

Neural Machine Translation

Download Full-text

An Evaluation of Neural Machine Translation and Pre-trained Word Embeddings in Multilingual Neural Sentiment Analysis

2020 IEEE International Conference on Progress in Informatics and Computing (PIC) ◽

10.1109/pic50277.2020.9350849 ◽

2020 ◽

Author(s):

George Manias ◽

Argyro Mavrogiorgou ◽

Athanasios Kiourtis ◽

Dimosthenis Kyriazis

Keyword(s):

Sentiment Analysis ◽

Machine Translation ◽

Word Embeddings ◽

Neural Machine Translation

Download Full-text

Research on the Application of BERT in Mongolian-Chinese Neural Machine Translation

2021 13th International Conference on Machine Learning and Computing ◽

10.1145/3457682.3457744 ◽

2021 ◽

Author(s):

Xiu Zhi ◽

Siriguleng Wang

Keyword(s):

Machine Translation ◽

Neural Machine Translation

Download Full-text

Context- and sequence-aware convolutional recurrent encoder for neural machine translation

Proceedings of the 36th Annual ACM Symposium on Applied Computing ◽

10.1145/3412841.3442099 ◽

2021 ◽

Author(s):

Ritam Mallick ◽

Seba Susan ◽

Vaibhaw Agrawal ◽

Rizul Garg ◽

Prateek Rawal

Keyword(s):

Machine Translation ◽

Neural Machine Translation

Download Full-text

Linked Data Effectiveness in Neural Machine Translation

Proceedings of the 2020 4th International Symposium on Computer Science and Intelligent Control ◽

10.1145/3440084.3441214 ◽

2020 ◽

Author(s):

Benyamin Ahmadnia

Keyword(s):

Machine Translation ◽

Linked Data ◽

Neural Machine Translation

Download Full-text

Neural machine translation with a polysynthetic low resource language

Machine Translation ◽

10.1007/s10590-020-09255-9 ◽

2020 ◽

Vol 34 (4) ◽

pp. 325-346

Author(s):

John E. Ortega ◽

Richard Castro Mamani ◽

Kyunghyun Cho

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Download Full-text