A Method of Unknown Words Processing for Neural Machine Translation Using HowNet

Handling Unknown Words in Neural Machine Translation System

2020 International Conference on Decision Aid Sciences and Application (DASA) ◽

10.1109/dasa51403.2020.9317169 ◽

2020 ◽

Author(s):

Kamal Deep Garg ◽

Jatin Gupta ◽

Vandana Saini

Keyword(s):

Machine Translation ◽

Translation System ◽

Neural Machine Translation ◽

Machine Translation System ◽

Unknown Words

Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00105 ◽

2016 ◽

Vol 4 ◽

pp. 371-383 ◽

Cited By ~ 40

Author(s):

Jie Zhou ◽

Ying Cao ◽

Xuguang Wang ◽

Peng Li ◽

Wei Xu

Keyword(s):

Machine Translation ◽

Short Term Memory ◽

Short Term ◽

Neural Machine Translation ◽

Attention Model ◽

Linear Connections ◽

New Type ◽

Long Short Term Memory ◽

Unknown Words ◽

First Time

Neural machine translation (NMT) aims at solving machine translation (MT) problems using neural networks and has exhibited promising results in recent years. However, most of the existing NMT models are shallow and there is still a performance gap between a single NMT model and the best conventional MT system. In this work, we introduce a new type of linear connections, named fast-forward connections, based on deep Long Short-Term Memory (LSTM) networks, and an interleaved bi-directional architecture for stacking the LSTM layers. Fast-forward connections play an essential role in propagating the gradients and building a deep topology of depth 16. On the WMT’14 English-to-French task, we achieve BLEU=37.7 with a single attention model, which outperforms the corresponding single shallow model by 6.2 BLEU points. This is the first time that a single NMT model achieves state-of-the-art performance and outperforms the best conventional model by 0.7 BLEU points. We can still achieve BLEU=36.3 even without using an attention mechanism. After special handling of unknown words and model ensembling, we obtain the best score reported to date on this task with BLEU=40.4. Our models are also validated on the more difficult WMT’14 English-to-German task.

Knowledge Graphs Effectiveness in Neural Machine Translation Improvement

Computer Science ◽

10.7494/csci.2020.21.3.3701 ◽

2020 ◽

Vol 21 (3) ◽

Author(s):

Benyamin Ahmadnia ◽

Bonnie J. Dorr ◽

Parisa Kordjamshidi

Keyword(s):

Machine Translation ◽

Semantic Representation ◽

Language Translation ◽

Semantic Relations ◽

Training Data ◽

Target Language ◽

Neural Machine Translation ◽

Source Language ◽

Knowledge Graphs ◽

Unknown Words

Maintaining semantic relations between words during the translation process yields more accurate target-language output from Neural Machine Translation (NMT). Although difficult to achieve from training data alone, it is possible to leverage Knowledge Graphs (KGs) to retain source-language semantic relations in the corresponding target-language translation. The core idea is to use KG entity relations as embedding constraints to improve the mapping from source to target. This paper describes two embedding constraints, both of which employ Entity Linking (EL), which assigns a unique identity to entities, to associate words in training sentences with those in the KG: (1) a monolingual embedding constraint that supports an enhanced semantic representation of the source words through access to relations between entities in a KG; and (2) a bilingual embedding constraint that forces entity relations in the source language to be carried over to the corresponding entities in the target-language translation. The method is evaluated for English-Spanish translation, exploiting Freebase as a source of knowledge. Our experimental results show that exploiting KG information not only decreases the number of unknown words in the translation but also improves translation quality.

A Semantic Concept Based Unknown Words Processing Method in Neural Machine Translation

Natural Language Processing and Chinese Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-73618-1_20 ◽

2018 ◽

pp. 233-242 ◽

Cited By ~ 3

Author(s):

Shaotong Li ◽

Jinan Xu ◽

Guoyi Miao ◽

Yujie Zhang ◽

Yufeng Chen

Keyword(s):

Machine Translation ◽

Processing Method ◽

Semantic Concept ◽

Neural Machine Translation ◽

Unknown Words

The Solution of the Problem of Unknown Words Under Neural Machine Translation of the Kazakh Language

Communications in Computer and Information Science - Intelligent Information and Database Systems ◽

10.1007/978-981-15-3380-8_28 ◽

2020 ◽

pp. 319-328 ◽

Cited By ~ 1

Author(s):

Aliya Turganbayeva ◽

Ualsher Tukeyev

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Unknown Words

Research on Unknown Words Processing of Mongolian-Chinese Neural Machine Translation Based on Semantic Similarity

2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS) ◽

10.1109/ccoms.2019.8821725 ◽

2019 ◽

Author(s):

Hasigaowa ◽

Siriguleng Wang

Keyword(s):

Machine Translation ◽

Semantic Similarity ◽

Neural Machine Translation ◽

Unknown Words

Replacement of Unknown Words Using an Attention Model in Japanese to English Neural Machine Translation

Journal of Natural Language Processing ◽

10.5715/jnlp.25.511 ◽

2018 ◽

Vol 25 (5) ◽

pp. 511-525

Author(s):

Saki Ibe ◽

Yoshitatsu Matsuda ◽

Kazunori Yamaguchi

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Attention Model ◽

Unknown Words

The solution of the problem of unknown words under neural machine translation of the Kazakh language

Journal of Information and Telecommunication ◽

10.1080/24751839.2020.1838713 ◽

2020 ◽

pp. 1-12

Author(s):

Aliya Turganbayeva ◽

Ualsher Tukeyev

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Unknown Words

Towards Integrated Classification Lexicon for Handling Unknown Words in Chinese-Vietnamese Neural Machine Translation

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3373267 ◽

2020 ◽

Vol 19 (3) ◽

pp. 1-17

Author(s):

Wanjin Che ◽

Zhengtao Yu ◽

Zhiqiang Yu ◽

Yonghua Wen ◽

Junjun Guo

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Unknown Words

An Empirical Study on Learning Bug-Fixing Patches in the Wild via Neural Machine Translation

ACM Transactions on Software Engineering and Methodology ◽

10.1145/3340544 ◽

2019 ◽

Vol 28 (4) ◽

pp. 1-29 ◽

Cited By ~ 2

Author(s):

Michele Tufano ◽

Cody Watson ◽

Gabriele Bavota ◽

Massimiliano Di Penta ◽

Martin White ◽

...

Keyword(s):

Empirical Study ◽

Machine Translation ◽

Neural Machine Translation ◽

Bug Fixing ◽

In The Wild
