Research on Mongolian-Chinese machine translation based on the end-to-end neural network

Author(s):  
Ren Qing-Dao-Er-Ji ◽  
Yila Su ◽  
Nier Wu

With the development of natural language processing and neural machine translation, end-to-end (E2E) neural network models have gradually become the focus of research because of their high translation accuracy and strong semantic fidelity. However, problems such as limited vocabulary and low translation fidelity remain. In this paper, a discriminant method and a Conditional Random Field (CRF) model were used to segment and label Mongolian stems and affixes in the preprocessing stage of the Mongolian-Chinese bilingual corpus. To address the low-fidelity problem, a decoding model combining a Convolutional Neural Network (CNN) and a Gated Recurrent Unit (GRU) was constructed, with the GRU performing target-language decoding. A global attention model was used to obtain bilingual word alignment information during alignment processing. Finally, translation quality was evaluated with Bilingual Evaluation Understudy (BLEU) and Perplexity (PPL) values. The improved model yields a BLEU value of 25.13 and a PPL value of [Formula: see text]. The experimental results show that the E2E Mongolian-Chinese neural machine translation model improves translation quality and reduces semantic confusion compared with traditional statistical methods and machine translation models based on Recurrent Neural Networks (RNNs).
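
As a concrete illustration of the decoder described above, the following is a minimal PyTorch sketch of one GRU decoding step with global (dot-product) attention over the encoder states. It is not the authors' code; the layer sizes, names, and scoring function are illustrative assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class GlobalAttentionGRUDecoder(nn.Module):
    def __init__(self, vocab_size, emb_dim=256, hid_dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.gru = nn.GRU(emb_dim, hid_dim, batch_first=True)
        self.out = nn.Linear(hid_dim * 2, vocab_size)  # over [state; context]

    def forward(self, prev_token, hidden, encoder_states):
        # prev_token: (batch,); hidden: (1, batch, hid); encoder_states: (batch, src_len, hid)
        emb = self.embed(prev_token).unsqueeze(1)                    # (batch, 1, emb)
        output, hidden = self.gru(emb, hidden)                       # (batch, 1, hid)
        # Global attention: score every source state with a dot product.
        scores = torch.bmm(output, encoder_states.transpose(1, 2))   # (batch, 1, src_len)
        align = F.softmax(scores, dim=-1)                            # word-alignment weights
        context = torch.bmm(align, encoder_states)                   # (batch, 1, hid)
        logits = self.out(torch.cat([output, context], dim=-1).squeeze(1))
        return logits, hidden, align.squeeze(1)

The returned alignment weights are what the abstract refers to as bilingual word alignment information: at each decoding step they indicate which source positions the model attended to.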

Author(s):  
N Revathi

Abstract: Language is a primary mode of communication, and translation is a critical tool for understanding information in a foreign language. Machine translation allows users to absorb unfamiliar linguistic material without the help of human translators. The main goal of this project is to create a practical language translator from English to Hindi. Given its relevance and potential for English-Hindi translation, machine translation is an efficient way to convert content into a new language without employing people. Among the available approaches, Neural Machine Translation (NMT) is one of the most effective. We therefore employ sequence-to-sequence modeling, which combines Recurrent Neural Networks (RNNs), Long Short-Term Memory (LSTM), and encoder-decoder methods. The project also covers Deep Neural Network (DNN) fundamentals and principles of deep learning as applied to machine translation within Natural Language Processing (NLP), where DNNs play a crucial role in machine learning techniques. Keywords: Sequence to Sequence, Encoder-Decoder, Recurrent Neural Network, Long Short-Term Memory, Deep Neural Network.
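
As a minimal illustration of the encoder-decoder method the abstract names, the sketch below wires an LSTM encoder to an LSTM decoder trained with teacher forcing. Vocabulary sizes, dimensions, and names are placeholder assumptions, not the project's actual code.

import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    def __init__(self, src_vocab, tgt_vocab, emb=256, hid=512):
        super().__init__()
        self.src_embed = nn.Embedding(src_vocab, emb)
        self.tgt_embed = nn.Embedding(tgt_vocab, emb)
        self.encoder = nn.LSTM(emb, hid, batch_first=True)
        self.decoder = nn.LSTM(emb, hid, batch_first=True)
        self.project = nn.Linear(hid, tgt_vocab)

    def forward(self, src_ids, tgt_ids):
        # Encode the whole English sentence; keep only the final (h, c) state.
        _, state = self.encoder(self.src_embed(src_ids))
        # Teacher forcing: decode the Hindi tokens conditioned on that state.
        dec_out, _ = self.decoder(self.tgt_embed(tgt_ids), state)
        return self.project(dec_out)  # (batch, tgt_len, tgt_vocab) logits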


2017 ◽  
Vol 108 (1) ◽  
pp. 37-48 ◽  
Author(s):  
Praveen Dakwale ◽  
Christof Monz

Abstract: Neural machine translation is a recently proposed approach that has shown results competitive with traditional MT approaches. Standard neural MT is an end-to-end neural network in which the source sentence is encoded by a recurrent neural network (RNN) called the encoder and the target words are predicted using another RNN known as the decoder. Recently, various models have been proposed that replace the RNN encoder with a convolutional neural network (CNN). In this paper, we propose to augment the standard RNN encoder in NMT with additional convolutional layers in order to capture wider context in the encoder output. Experiments on English-to-German translation demonstrate that our approach achieves significant improvements over a standard RNN-based baseline.
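
A minimal sketch of the idea, under assumptions about the details: a bidirectional RNN encoder whose outputs pass through a 1-D convolution over time, so that each encoder state mixes in a window of its neighbors, combined residually. This is illustrative, not the authors' exact architecture.

import torch
import torch.nn as nn

class ConvAugmentedEncoder(nn.Module):
    def __init__(self, vocab_size, emb=256, hid=256, kernel=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb)
        self.rnn = nn.GRU(emb, hid, batch_first=True, bidirectional=True)
        # 1-D convolution over time widens the receptive field of each state.
        self.conv = nn.Conv1d(2 * hid, 2 * hid, kernel, padding=kernel // 2)

    def forward(self, src_ids):
        states, _ = self.rnn(self.embed(src_ids))        # (batch, len, 2*hid)
        wide = self.conv(states.transpose(1, 2)).transpose(1, 2)
        return states + wide                             # residual combination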


2018 ◽  
Vol 28 (09) ◽  
pp. 1850007
Author(s):  
Francisco Zamora-Martinez ◽  
Maria Jose Castro-Bleda

Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing tasks such as Machine Translation. In this work we introduce a Statistical Machine Translation (SMT) system which fully integrates NNLMs in the decoding stage, breaking with the traditional approach based on N-best list rescoring. The neural net models (both language models (LMs) and translation models) are fully coupled in the decoding stage, allowing them to more strongly influence translation quality. Computational issues were solved by a novel idea based on memorization and smoothing of the softmax normalization constants to avoid their computation, which introduces a trade-off between LM quality and computational cost. These ideas were studied in a machine translation task with different combinations of neural networks used both as translation models and as target LMs, comparing phrase-based and n-gram-based systems, and showing that the integrated approach seems more promising for n-gram-based systems, even with non-full-quality NNLMs.
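
A minimal sketch of the memorization-and-smoothing idea, with illustrative assumptions about cache keying and the smoothing constant: the softmax normalizer Z(h) = sum over w of exp(s(w|h)) is computed once per observed history and memorized, while a smoothed running estimate stands in for unseen histories, trading LM quality for decoding speed.

import math

class SoftmaxNormalizerCache:
    def __init__(self, alpha=0.9):
        self.cache = {}           # history -> memorized log Z(h)
        self.running_logz = 0.0   # smoothed estimate for unseen histories
        self.alpha = alpha        # smoothing constant (assumed value)

    def log_prob(self, score_fn, word, history, vocab=None):
        if history in self.cache:
            log_z = self.cache[history]
        elif vocab is not None:
            # Pay the full normalization cost once, then memorize it.
            log_z = math.log(sum(math.exp(score_fn(w, history)) for w in vocab))
            self.cache[history] = log_z
            self.running_logz = (self.alpha * self.running_logz
                                 + (1 - self.alpha) * log_z)
        else:
            # Unseen history during decoding: use the smoothed approximation.
            log_z = self.running_logz
        return score_fn(word, history) - log_z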


Author(s):  
Afrizal Zein

A machine translator is an automatic tool that converts a text from one language into a different one. It is software whose translations are produced from a linear regression model whose parameters are estimated from the statistical analysis of bilingual texts. We now introduce the next step in building a better machine translator using the Neural Machine Translation method. Neural Machine Translation translates whole sentences at a time, rather than fragment by fragment. It uses this broader context to help determine the most relevant translation, which it then rearranges and adjusts to read more like natural human speech with correct grammar. The application was built in the C# programming language together with the Google AJAX API library to translate text, retrieving translations by parsing the JSON content. The research yielded translations that are much smoother and easier to read, all of which is possible because of the end-to-end learning system built on Neural Machine Translation, which essentially means the system learns over time to produce better, more natural translations.
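
The application described above is written in C#; as a language-neutral illustration of the same call-and-parse pattern, here is a Python sketch. The endpoint, parameter names, and response shape are hypothetical placeholders (the Google AJAX Language API the article relies on is long deprecated), not a real API.

import json
import urllib.parse
import urllib.request

def translate(text, source="en", target="id",
              endpoint="https://example.com/translate"):  # placeholder URL
    # Build the request query string and fetch the response.
    query = urllib.parse.urlencode(
        {"q": text, "source": source, "target": target})
    with urllib.request.urlopen(f"{endpoint}?{query}") as resp:
        payload = json.loads(resp.read().decode("utf-8"))
    # Assumed response shape: {"data": {"translatedText": "..."}}
    return payload["data"]["translatedText"]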


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Michael Adjeisah ◽  
Guohua Liu ◽  
Douglas Omwenga Nyabuga ◽  
Richard Nuetey Nortey ◽  
Jinling Song

Scaling natural language processing (NLP) to low-resourced languages to improve machine translation (MT) performance remains enigmatic. This research contributes to the domain with a low-resource English-Twi translation study based on filtered synthetic-parallel corpora. It is often difficult to learn what a good-quality corpus looks like in low-resource conditions, mainly where the target corpus is the only sample text of the parallel language. To improve MT performance for such low-resource language pairs, we propose to expand the training data by injecting a synthetic-parallel corpus obtained by translating a monolingual corpus from the target language, based on bootstrapping with different parameter settings. Furthermore, we perform unsupervised measurements on each sentence pair using squared Mahalanobis distances, a filtering technique that predicts sentence parallelism. Additionally, we make extensive use of three different sentence-level similarity metrics after round-trip translation. Experimental results on varying amounts of available parallel corpora demonstrate that injecting a pseudo-parallel corpus and extensive filtering with sentence-level similarity metrics significantly improve the original out-of-the-box MT systems for low-resource language pairs. Compared with existing improvements on the same original framework under the same structure, our approach exhibits substantial gains in BLEU and TER scores.
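
A minimal sketch of the Mahalanobis filtering step, assuming each sentence pair is represented by a fixed-dimensional feature vector (for example, concatenated source and target sentence embeddings); the keep-fraction threshold is an illustrative choice.

import numpy as np

def mahalanobis_filter(clean_feats, synth_feats, keep_fraction=0.7):
    # clean_feats, synth_feats: (n, d) arrays of per-pair feature vectors.
    # Fit the distribution of trusted (clean) pairs.
    mu = clean_feats.mean(axis=0)
    cov = np.cov(clean_feats, rowvar=False)
    cov += 1e-6 * np.eye(cov.shape[0])         # regularize for invertibility
    inv_cov = np.linalg.inv(cov)
    # Squared Mahalanobis distance of each synthetic pair from that distribution.
    diff = synth_feats - mu
    d2 = np.einsum("nd,dk,nk->n", diff, inv_cov, diff)
    # Keep the synthetic pairs closest to the clean distribution.
    cutoff = np.quantile(d2, keep_fraction)
    return np.flatnonzero(d2 <= cutoff)        # indices of pairs to keep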


2020 ◽  
Vol 34 (05) ◽  
pp. 8830-8837
Author(s):  
Xin Sheng ◽  
Linli Xu ◽  
Junliang Guo ◽  
Jingchang Liu ◽  
Ruoyu Zhao ◽  
...  

We propose a novel introspective model for variational neural machine translation (IntroVNMT), inspired by the recent successful application of the introspective variational autoencoder (IntroVAE) to high-quality image synthesis. Unlike the vanilla variational NMT model, IntroVNMT can improve itself introspectively by evaluating the quality of generated target sentences according to the high-level latent variables of the real and generated target sentences. As a consequence of introspective training, the model learns to discriminate between generated and real target-language sentences via the latent variables produced by its encoder, and in this way IntroVNMT is able to generate more realistic target sentences in practice. At the same time, IntroVNMT inherits the advantages of variational autoencoders (VAEs), so its training process is more stable than that of generative adversarial network (GAN) based models. Experimental results on different translation tasks demonstrate that the proposed model achieves significant improvements over the vanilla variational NMT model.
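
A heavily simplified sketch of the introspective signal, under the assumption (borrowed from IntroVAE) that the encoder's posterior KL term acts as an energy that should stay low for real target sentences and be pushed above a margin for generated ones; the margin value and loss shape are illustrative, not the paper's exact objective.

import torch
import torch.nn.functional as F

def kl_energy(mu, logvar):
    # KL(q(z|x) || N(0, I)) per sentence, used as the discrimination energy.
    return 0.5 * torch.sum(mu.pow(2) + logvar.exp() - 1.0 - logvar, dim=-1)

def encoder_adversarial_loss(enc, real_tgt, fake_tgt, margin=5.0):
    mu_r, lv_r = enc(real_tgt)            # latents of real target sentences
    mu_f, lv_f = enc(fake_tgt.detach())   # latents of generated sentences
    # Keep the energy of real sentences low; push fake energy above the margin.
    return (kl_energy(mu_r, lv_r).mean()
            + F.relu(margin - kl_energy(mu_f, lv_f)).mean())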


2020 ◽  
Vol 34 (05) ◽  
pp. 8568-8575
Author(s):  
Xing Niu ◽  
Marine Carpuat

This work aims to produce translations that convey source language content at a formality level that is appropriate for a particular audience. Framing this problem as a neural sequence-to-sequence task ideally requires training triplets consisting of a bilingual sentence pair labeled with target language formality. However, in practice, available training examples are limited to English sentence pairs of different styles, and bilingual parallel sentences of unknown formality. We introduce a novel training scheme for multi-task models that automatically generates synthetic training triplets by inferring the missing element on the fly, thus enabling end-to-end training. Comprehensive automatic and human assessments show that our best model outperforms existing models by producing translations that better match desired formality levels while preserving the source meaning.
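
A schematic sketch of the on-the-fly triplet completion: bilingual pairs of unknown style receive a predicted formality label, and monolingual style pairs receive a back-translated source side. The helpers formality_clf and back_translate are assumed components, not the authors' released code.

def complete_triplet(example, formality_clf, back_translate):
    src, tgt, formality = example  # any one element may be None
    if formality is None:
        # Bilingual pair of unknown style: infer the label from the target.
        formality = formality_clf(tgt)          # e.g. "formal" / "informal"
    if src is None:
        # English style pair: synthesize the missing source by back-translation.
        src = back_translate(tgt)
    return src, tgt, formality

def make_training_batch(examples, formality_clf, back_translate):
    # Every example becomes a full (source, target, formality) triplet,
    # enabling end-to-end multi-task training.
    return [complete_triplet(ex, formality_clf, back_translate)
            for ex in examples]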


10.29007/nwj8 ◽  
2019 ◽  
Author(s):  
Sebastien Carré ◽  
Victor Dyseryn ◽  
Adrien Facon ◽  
Sylvain Guilley ◽  
Thomas Perianin

Cache timing attacks are serious security threats that exploit cache memories to steal secret information. We believe that identifying a sequence of operations from a set of cache-timing measurements is not a trivial step when building an attack. We present a recurrent neural network model able to automatically retrieve a sequence of function calls from cache timings. Inspired by natural language processing, our model is able to learn from partially labelled data. We use the model to unfold an end-to-end automated attack on OpenSSL ECDSA on the secp256k1 curve. Contrary to most research, we did not need human processing of the traces to retrieve the relevant information.
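
A minimal sketch of the backbone such a model might use: a recurrent network that maps a trace of cache-timing samples to per-step function-call logits. The paper trains on partially labelled data, which this sketch omits; the bidirectional-LSTM choice and dimensions are assumptions.

import torch
import torch.nn as nn

class TraceLabeler(nn.Module):
    def __init__(self, n_calls, hid=128):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hid,
                            batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hid, n_calls)

    def forward(self, timings):
        # timings: (batch, trace_len) raw cache-timing samples
        states, _ = self.lstm(timings.unsqueeze(-1))
        return self.head(states)  # (batch, trace_len, n_calls) call logits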


2020 ◽  
Vol 184 ◽  
pp. 01061
Author(s):  
Anusha Anugu ◽  
Gajula Ramesh

Machine translation has developed gradually since the 1940s. It has gained more and more attention because of its effective and efficient nature, as it performs translation automatically without human effort. The distinct models of machine translation, along with Neural Machine Translation (NMT), are summarized in this paper. Researchers have previously done a great deal of work on machine translation techniques and their evaluation. We therefore present an analysis of the existing techniques for machine translation, including Neural Machine Translation, their differences, and the translation tools associated with them. Nowadays, combining two machine translation systems takes full advantage of features from both systems, which makes the idea attractive in the domain of natural language processing. The paper therefore also includes a literature survey of Hybrid Machine Translation (HMT).

