English to Hindi Machine Translation System in the Context of Homoeopathy Literature

Over the years, research on machine translation (MT) systems has gained momentum due to their widespread applicability. A number of systems have emerged that perform the task successfully for different language pairs. However, to the best of the author's knowledge, no significant work has been done in the clinical and medical domains, especially in Homoeopathy. This paper describes a rule-based English-Hindi MT system for Homoeopathic sentences. It has been designed to translate a variety of sentences from Homoeopathic literature. To achieve this, the author developed English and Hindi Homoeopathic corpora, which presently contain 21096 and 23145 sentences respectively. For translation, the input sentences (in English) are categorised into four types: simple, complex, interrogative and ambiguous sentences. The authors tested the translation accuracy using the BLEU score. At present, the overall BLEU score of the system is 0.7808 and the accuracy percentage is 82.25%.
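For readers unfamiliar with the metric reported above, the following is a minimal sketch of sentence-level BLEU: clipped n-gram precision for n = 1 to 4, combined by a geometric mean and scaled by a brevity penalty. It assumes a single reference, whitespace tokenisation, uniform weights and no smoothing; the function names `ngrams` and `bleu` are illustrative, and this is not the evaluation setup used by the authors, which the abstract does not describe.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, reference, max_n=4):
    """Sentence-level BLEU: single reference, uniform weights, no smoothing."""
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(cand, n)
        ref_counts = ngrams(ref, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its count in the reference.
        matched = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        if total == 0 or matched == 0:
            return 0.0  # without smoothing, one empty precision zeroes the score
        log_precisions.append(math.log(matched / total))
    # Brevity penalty: penalise candidates shorter than the reference.
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * math.exp(sum(log_precisions) / max_n)


score = bleu("the cat sat on the mat", "the cat sat on a mat")
print(round(score, 4))
```

Corpus-level BLEU, which is what papers usually report, pools the n-gram counts over all sentences before taking the ratio rather than averaging per-sentence scores.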

Hybrid Translation with Classification: Revisiting Rule-Based and Neural Machine Translation

Electronics ◽ 10.3390/electronics9020201 ◽ 2020 ◽ Vol 9 (2) ◽ pp. 201

Author(s): Jin-Xia Huang ◽ Kyung-Soon Lee ◽ Young-Kil Kim

Keyword(s): Machine Translation ◽ Classification Accuracy ◽ Training Data ◽ Translation System ◽ Rule Based ◽ Neural Machine Translation ◽ Machine Translation System ◽ Text Classifiers ◽ Hybrid Machine Translation ◽ Translation Accuracy

This paper proposes a hybrid machine-translation system that combines neural machine translation with well-developed rule-based machine translation to utilize the stability of the latter to compensate for the inadequacy of neural machine translation in rare-resource domains. A classifier is introduced to predict which translation from the two systems is more reliable. We explore a set of features that reflect the reliability of translation and its process, and training data is automatically expanded with a small, human-labeled dataset to solve the insufficient-data problem. A series of experiments shows that the hybrid system’s translation accuracy is improved, especially in out-of-domain translations, and classification accuracy is greatly improved when using the proposed features and the automatically constructed training set. A comparison between feature- and text-based classification is also performed, and the results show that the feature-based model achieves better classification accuracy, even when compared to neural network text classifiers.
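The selection step described in this abstract can be illustrated with a small sketch: a binary classifier scores a feature vector and the hybrid system returns whichever translation is predicted to be more reliable. Everything specific here is an assumption for illustration only: the two features (an NMT confidence value and a length-ratio deviation), the weights, the bias and the logistic model are hypothetical and are not the features or classifier used in the paper.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Candidate:
    rbmt: str                  # output of the rule-based system
    nmt: str                   # output of the neural system
    features: Sequence[float]  # hypothetical reliability features


def prob_nmt_better(features, weights, bias):
    """Logistic score: estimated probability that the NMT output is more reliable."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


def select(candidate, weights, bias, threshold=0.5):
    """Return the NMT output if the classifier favours it, else the RBMT output."""
    if prob_nmt_better(candidate.features, weights, bias) >= threshold:
        return candidate.nmt
    return candidate.rbmt


# Hypothetical weights for [nmt_confidence, length_ratio_deviation].
WEIGHTS, BIAS = [4.0, -3.0], -2.0

in_domain = Candidate("rbmt output A", "nmt output A", [0.9, 0.0])
out_of_domain = Candidate("rbmt output B", "nmt output B", [0.2, 0.8])
print(select(in_domain, WEIGHTS, BIAS))
print(select(out_of_domain, WEIGHTS, BIAS))
```

In practice the weights would be learned from labelled examples of which system produced the better translation, which is where the paper's automatically expanded training set comes in.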

English to Sanskrit machine translation system: a rule-based approach

International Journal of Advanced Intelligence Paradigms ◽ 10.1504/ijaip.2012.048144 ◽ 2012 ◽ Vol 4 (2) ◽ pp. 168 ◽ Cited By ~ 2

Author(s): Vimal Mishra ◽ R.B. Mishra

Keyword(s): Machine Translation ◽ Translation System ◽ Rule Based ◽ System A ◽ Machine Translation System ◽ Rule Based Approach

Hva er viktig for forståelse? Om maskinoversetting fra nordsamisk [What is important for comprehension? On machine translation from Northern Sami]

Oslo Studies in Language ◽ 10.5617/osla.8514 ◽ 2021 ◽ Vol 11 (2) ◽ pp. 489-501

Author(s): Trond Trosterud ◽ Lene Antonsen

Keyword(s): Machine Translation ◽ Text Comprehension ◽ The Other ◽ Translation System ◽ Rule Based ◽ Grammatical Analysis ◽ Machine Translation System ◽ Language Quality ◽ Lexical Errors ◽ Fluent Language

The article presents a rule-based machine translation system from Northern Sami to Norwegian. The grammatical analysis is done with Giellatekno and Divvun's Northern Sami program for analysis and translation. We have written the transfer component (transfer lexicon and grammatical rules) within the framework of the open machine translation system Apertium. The article contains an evaluation of translated text for two different domains. The translated texts score better on conveying the content than on fluency. By classifying the errors into lexical, grammatical and pragmatic errors, we show that lexical errors are the most harmful for text comprehension. The other two types of errors lower the language quality, but they have little effect on comprehension. Lexical errors are also the easiest type to correct, which is a promising conclusion for the development of a machine translation system aimed at text comprehension.

Improving statistical word alignment with a rule-based machine translation system

10.3115/1220355.1220360 ◽ 2004 ◽ Cited By ~ 1

Author(s): Wu Hua ◽ Wang Haifeng

Keyword(s): Machine Translation ◽ Translation System ◽ Word Alignment ◽ Rule Based ◽ Machine Translation System

Analysing linguistic information about word combinations for a Spanish-Basque rule-based machine translation system

Multiword Units in Machine Translation and Translation Technology - Current Issues in Linguistic Theory ◽ 10.1075/cilt.341.02inu ◽ 2018 ◽ pp. 42-59

Author(s): Uxoa Iñurrieta ◽ Itziar Aduriz ◽ Arantza Díaz de Ilarraza ◽ Gorka Labaka ◽ Kepa Sarasola

Keyword(s): Machine Translation ◽ Translation System ◽ Linguistic Information ◽ Rule Based ◽ Machine Translation System

Rule-Based Machine Translation for the Italian–Sardinian Language Pair

Prague Bulletin of Mathematical Linguistics ◽ 10.1515/pralin-2017-0022 ◽ 2017 ◽ Vol 108 (1) ◽ pp. 221-232

Author(s): Francis M. Tyers ◽ Hèctor Alòs i Font ◽ Gianfranco Fronteddu ◽ Adrià Martín-Mor

Keyword(s): Machine Translation ◽ Translation System ◽ Rule Based ◽ Romance Language ◽ Machine Translation System ◽ The Mediterranean ◽ Language Pair

This paper describes the creation of the first machine translation system from Italian to Sardinian, a Romance language spoken on the island of Sardinia in the Mediterranean. The project was carried out by a team of translators and computational linguists. The article focuses on the technology used (Rule-Based Machine Translation) and on some of the rules created, as well as on the orthographic model used for Sardinian.

A rule based approach for Japanese-Uighur machine translation system

2012 IEEE 11th International Conference on Cognitive Informatics and Cognitive Computing ◽ 10.1109/icci-cc.2012.6311137 ◽ 2012 ◽ Cited By ~ 2

Author(s): Maimitili Nimaiti ◽ Yamamoto Izumi

Keyword(s): Machine Translation ◽ Translation System ◽ Rule Based ◽ Machine Translation System ◽ Rule Based Approach

Otedama: Fast Rule-Based Pre-Ordering for Machine Translation

Prague Bulletin of Mathematical Linguistics ◽ 10.1515/pralin-2016-0015 ◽ 2016 ◽ Vol 106 (1) ◽ pp. 159-168 ◽ Cited By ~ 1

Author(s): Julian Hitschler ◽ Laura Jehl ◽ Sariya Karimova ◽ Mayumi Ohta ◽ Benjamin Körner ◽ ...

Keyword(s): Open Source ◽ Machine Translation ◽ State Of The Art ◽ Statistical Machine Translation ◽ Training Data ◽ Translation System ◽ Rule Based ◽ Machine Translation System ◽ Target Languages ◽ Established Technique

We present Otedama, a fast, open-source tool for rule-based syntactic pre-ordering, a well-established technique in statistical machine translation. Otedama implements both a learner for pre-ordering rules and a component for applying these rules to parsed sentences. Our system is compatible with several external parsers and capable of accommodating many source and all target languages in any machine translation paradigm which uses parallel training data. We demonstrate improvements on a patent translation task over a state-of-the-art English-Japanese hierarchical phrase-based machine translation system. We compare Otedama with an existing syntax-based pre-ordering system, showing comparable translation performance at a runtime speedup by a factor of 4.5 to 10.
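As a rough illustration of what applying a pre-ordering rule to a parsed sentence means, the sketch below reorders the children of a toy constituency tree so that an English verb-object phrase takes the verb-final order typical of Japanese. The tree encoding, the `RULES` table and the function names are hypothetical choices made for this example; they do not reflect Otedama's actual rule format or its learner.

```python
# A tree is (label, word) for a leaf or (label, [subtrees]) for an internal node.

# Hypothetical rule table: (parent label, child labels) -> new child order.
RULES = {("VP", ("VB", "NP")): (1, 0)}  # move the verb after its object


def preorder(tree, rules):
    """Recursively reorder children wherever a rule matches the local subtree."""
    label, body = tree
    if isinstance(body, str):
        return tree  # leaf: nothing to reorder
    children = [preorder(child, rules) for child in body]
    key = (label, tuple(child[0] for child in children))
    order = rules.get(key, range(len(children)))  # identity order if no rule
    return (label, [children[i] for i in order])


def words(tree):
    """Read the sentence off the leaves, left to right."""
    label, body = tree
    if isinstance(body, str):
        return [body]
    return [w for child in body for w in words(child)]


sentence = ("S", [
    ("NP", [("PRP", "she")]),
    ("VP", [("VB", "reads"),
            ("NP", [("DT", "the"), ("NN", "book")])]),
])

print(" ".join(words(preorder(sentence, RULES))))
```

The reordered source sentence is then passed to the translation system, which has less long-distance reordering left to model.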
