Named entity translation method based on machine translation lexicon

Author(s):  
Panpan Li ◽  
Mengxiang Wang ◽  
Jian Wang

This paper describes the cross-language plagiarism detection method CLAD (Cross-Language Analog Detector) between test document and indexed documents. The main difference of this method from existing versions is the detection of plagiarism among multiple languages not only two languages. While translating terms, it used the dictionary-based machine-translation method. CLAD’s working process consists of document indexing and detection process phases. In this paper, we will describe both of these phases.


2017 ◽  
Vol 17 (2) ◽  
pp. 28-43 ◽  
Author(s):  
Vivien Macketanz ◽  
Eleftherios Avramidis ◽  
Aljoscha Burchardt ◽  
Jindrich Helcl ◽  
Ankit Srivastava

Abstract In this article we present a novel linguistically driven evaluation method and apply it to the main approaches of Machine Translation (Rule-based, Phrase-based, Neural) to gain insights into their strengths and weaknesses in much more detail than provided by current evaluation schemes. Translating between two languages requires substantial modelling of knowledge about the two languages, about translation, and about the world. Using English-German IT-domain translation as a case-study, we also enhance the Phrase-based system by exploiting parallel treebanks for syntax-aware phrase extraction and by interfacing with Linked Open Data (LOD) for extracting named entity translations in a post decoding framework.


Sign in / Sign up

Export Citation Format

Share Document