Machine translation using deep learning for universal networking language based on their structure

Machine translation (MT) has recently attracted much research on various advanced techniques (i.e., statistical-based and deep learning-based) and achieved great results for popular languages. However, the research on it involving low-resource languages such as Korean often suffer from the lack of openly available bilingual language resources. In this research, we built the open extensive parallel corpora for training MT models, named Ulsan parallel corpora (UPC). Currently, UPC contains two parallel corpora consisting of Korean-English and Korean-Vietnamese datasets. The Korean-English dataset has over 969 thousand sentence pairs, and the Korean-Vietnamese parallel corpus consists of over 412 thousand sentence pairs. Furthermore, the high rate of homographs of Korean causes an ambiguous word issue in MT. To address this problem, we developed a powerful word-sense annotation system based on a combination of sub-word conditional probability and knowledge-based methods, named UTagger. We applied UTagger to UPC and used these corpora to train both statistical-based and deep learning-based neural MT systems. The experimental results demonstrated that using UPC, high-quality MT systems (in terms of the Bi-Lingual Evaluation Understudy (BLEU) and Translation Error Rate (TER) score) can be built. Both UPC and UTagger are available for free download and usage.

Download Full-text

Machine Translation Using Deep Learning: A Comparison

Proceedings of International Conference on Artificial Intelligence, Smart Grid and Smart City Applications ◽

10.1007/978-3-030-24051-6_38 ◽

2020 ◽

pp. 389-395

Author(s):

S. Swathi ◽

L. S. Jayashree

Keyword(s):

Deep Learning ◽

Machine Translation

Download Full-text

The use of machine translation algorithm based on residual and LSTM neural network in translation teaching

PLoS ONE ◽

10.1371/journal.pone.0240663 ◽

2020 ◽

Vol 15 (11) ◽

pp. e0240663

Author(s):

Beibei Ren

Keyword(s):

Neural Network ◽

Deep Learning ◽

Machine Translation ◽

Short Term Memory ◽

Short Term ◽

Term Memory ◽

Translation Model ◽

Translation Quality ◽

Long Short Term Memory ◽

Deep Learning Neural Network

With the rapid development of big data and deep learning, breakthroughs have been made in phonetic and textual research, the two fundamental attributes of language. Language is an essential medium of information exchange in teaching activity. The aim is to promote the transformation of the training mode and content of translation major and the application of the translation service industry in various fields. Based on previous research, the SCN-LSTM (Skip Convolutional Network and Long Short Term Memory) translation model of deep learning neural network is constructed by learning and training the real dataset and the public PTB (Penn Treebank Dataset). The feasibility of the model’s performance, translation quality, and adaptability in practical teaching is analyzed to provide a theoretical basis for the research and application of the SCN-LSTM translation model in English teaching. The results show that the capability of the neural network for translation teaching is nearly one times higher than that of the traditional N-tuple translation model, and the fusion model performs much better than the single model, translation quality, and teaching effect. To be specific, the accuracy of the SCN-LSTM translation model based on deep learning neural network is 95.21%, the degree of translation confusion is reduced by 39.21% compared with that of the LSTM (Long Short Term Memory) model, and the adaptability is 0.4 times that of the N-tuple model. With the highest level of satisfaction in practical teaching evaluation, the SCN-LSTM translation model has achieved a favorable effect on the translation teaching of the English major. In summary, the performance and quality of the translation model are improved significantly by learning the language characteristics in translations by teachers and students, providing ideas for applying machine translation in professional translation teaching.

Download Full-text

Deep Learning Machine Translation

Machine Translation ◽

10.7551/mitpress/11043.003.0015 ◽

2017 ◽

Cited By ~ 1

Keyword(s):

Deep Learning ◽

Machine Translation ◽

Learning Machine

Download Full-text

Neural Machine Translation for Turkish to English Using Deep Learning

Digital Interaction and Machine Intelligence - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-030-74728-2_1 ◽

2021 ◽

pp. 3-9

Author(s):

Fatih Balki ◽

Hilmi Demirhan ◽

Salih Sarp

Keyword(s):

Deep Learning ◽

Machine Translation ◽

Neural Machine Translation

Download Full-text

English-Vietnamese Machine Translation Using Deep Learning

Lecture Notes in Networks and Systems - Recent Advances in Information and Communication Technology 2021 ◽

10.1007/978-3-030-79757-7_10 ◽

2021 ◽

pp. 99-107

Author(s):

Tuan Nguyen Minh ◽

Phayung Meesad ◽

Huy Cuong Nguyen Ha

Keyword(s):

Deep Learning ◽

Machine Translation

Download Full-text

Syntactic Structure from Deep Learning

Annual Review of Linguistics ◽

10.1146/annurev-linguistics-032020-051035 ◽

2020 ◽

Vol 7 (1) ◽

Author(s):

Tal Linzen ◽

Marco Baroni

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Machine Translation ◽

Online Publication ◽

Deep Neural Networks ◽

Syntactic Structure ◽

Annual Review ◽

Publication Date ◽

Grammatical Knowledge ◽

Deep Networks

Modern deep neural networks achieve impressive performance in engineering applications that require extensive linguistic skills, such as machine translation. This success has sparked interest in probing whether these models are inducing human-like grammatical knowledge from the raw data they are exposed to and, consequently, whether they can shed new light on long-standing debates concerning the innate structure necessary for language acquisition. In this article, we survey representative studies of the syntactic abilities of deep networks and discuss the broader implications that this work has for theoretical linguistics. Expected final online publication date for the Annual Review of Linguistics, Volume 7 is January 14, 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

Download Full-text

A Position Weighted Information Based Word Embedding Model for Machine Translation

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213020400059 ◽

2020 ◽

Vol 29 (07n08) ◽

pp. 2040005

Author(s):

Zhen Li ◽

Dan Qu ◽

Yanxia Li ◽

Chaojie Xie ◽

Qi Chen

Keyword(s):

Neural Network ◽

Deep Learning ◽

Machine Translation ◽

Semantic Information ◽

Word Embedding ◽

Vector Model ◽

Learning Technology ◽

Initial Value ◽

Input Layer ◽

End To End

Deep learning technology promotes the development of neural network machine translation (NMT). End-to-End (E2E) has become the mainstream in NMT. It uses word vectors as the initial value of the input layer. The effect of word vector model directly affects the accuracy of E2E-NMT. Researchers have proposed many approaches to learn word representations and have achieved significant results. However, the drawbacks of these methods still limit the performance of E2E-NMT systems. This paper focuses on the word embedding technology and proposes the PW-CBOW word vector model which can present better semantic information. We apply these word vector models on IWSLT14 German-English, WMT14 English-German, WMT14 English-French corporas. The results evaluate the performance of the PW-CBOW model. In the latest E2E-NMT systems, the PW-CBOW word vector model can improve the performance.

Download Full-text

Ensemble and Deep Learning for Language-Independent Automatic Selection of Parallel Data

Algorithms ◽

10.3390/a12010026 ◽

2019 ◽

Vol 12 (1) ◽

pp. 26 ◽

Cited By ~ 2

Author(s):

Despoina Mouratidis ◽

Katia Kermanidis

Keyword(s):

Deep Learning ◽

Machine Translation ◽

Research Field ◽

Attribute Selection ◽

Parallel Corpora ◽

Automatic Selection ◽

Parallel Data ◽

First Time ◽

Learning Architectures ◽

Selection Of

Machine translation is used in many applications in everyday life. Due to the increase of translated documents that need to be organized as useful or not (for building a translation model), the automated categorization of texts (classification), is a popular research field of machine learning. This kind of information can be quite helpful for machine translation. Our parallel corpora (English-Greek and English-Italian) are based on educational data, which are quite difficult to translate. We apply two state of the art architectures, Random Forest (RF) and Deeplearnig4j (DL4J), to our data (which constitute three translation outputs). To our knowledge, this is the first time that deep learning architectures are applied to the automatic selection of parallel data. We also propose new string-based features that seem to be effective for the classifier, and we investigate whether an attribute selection method could be used for better classification accuracy. Experimental results indicate an increase of up to 4% (compared to our previous work) using RF and rather satisfactory results using DL4J.

Download Full-text

Biomedical Concept Recognition Using Deep Neural Sequence Models

10.1101/530337 ◽

2019 ◽

Cited By ~ 2

Author(s):

Negacy D. Hailu ◽

Michael Bada ◽

Asmelash Teka Hadgu ◽

Lawrence E. Hunter

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Machine Translation ◽

Language Processing ◽

State Of The Art ◽

Conditional Random Field ◽

Concept Recognition ◽

Performance Improvements ◽

Art Performance

AbstractBackgroundthe automated identification of mentions of ontological concepts in natural language texts is a central task in biomedical information extraction. Despite more than a decade of effort, performance in this task remains below the level necessary for many applications.Resultsrecently, applications of deep learning in natural language processing have demonstrated striking improvements over previously state-of-the-art performance in many related natural language processing tasks. Here we demonstrate similarly striking performance improvements in recognizing biomedical ontology concepts in full text journal articles using deep learning techniques originally developed for machine translation. For example, our best performing system improves the performance of the previous state-of-the-art in recognizing terms in the Gene Ontology Biological Process hierarchy, from a previous best F1 score of 0.40 to an F1 of 0.70, nearly halving the error rate. Nearly all other ontologies show similar performance improvements.ConclusionsA two-stage concept recognition system, which is a conditional random field model for span detection followed by a deep neural sequence model for normalization, improves the state-of-the-art performance for biomedical concept recognition. Treating the biomedical concept normalization task as a sequence-to-sequence mapping task similar to neural machine translation improves performance.

Download Full-text