Chinese Medical Paraphrase Generation: Based on Neural Machine Translation

Author(s):  
Bo Sun ◽  
Fei Zhang ◽  
Jing Yuan ◽  
Zhao Wei ◽  
Shu Ting

Abstract Background: As people prefer to obtain medical knowledge online, medical intelligence question-answer systems based on question matching have attracted more and more attention, especially in China. However, due to the lack of paraphrase corpus of medical question, the development of this field is limited.Objective: We propose a method for paraphrase generation which suitable for the Chinese medical field and use deep learning models instead of artificial evaluation for the first time. The method is designed to be able to automatically construct high quality Chinese medical paraphrase.Methods: Validation experiments were carried out on two Chinese paraphrase data (one is general data, the other is medical data). Neural machine translation is used to generated paraphrase, that is, translate a sentence into other languages, and then reverse-translate it back to the original language to get the corresponding paraphrase. BLUE, ROUGEs, are used as quantitative evaluation metrics. Three deep text matching models are used to evaluate the generated paraphrase, instead of manual. Precision, Recall, F1 and AUC are used as qualitative evaluation metrics.Results: 49908 and 4062 paraphrases were generated on the two datasets, and the generated efficiency was 97.03% and 98.38%, respectively. For the data in the two fields, the generated and original paraphrase pairs are very similar at the quantitative and qualitative evaluation metrics, especially the medical field. Take medical data as example, BLUE of generated and original paraphrase pairs are 0.556 and 0.626, respectively; the mean difference of AUC between the two groups was 0.015. Conclusions: We first propose a paraphrase generation method based on neural machine translation and use deep text matching model instead of manual evaluation to evaluate the generated paraphrase. By analyzing the evaluation metrics, it can be concluded that:the paraphrase generated method has reached or even exceeded the level of artificial construction at the semantic level, especially in medical field; the deep text matching model can replace manual evaluation and realize automated paraphrase generation. This is of great significance to the development of Chinese medical paraphrase generation.

Author(s):  
Srikanth Mujjiga ◽  
Vamsi Krishna ◽  
Kalyan Chakravarthi ◽  
Vijayananda J

Clinical documents are vital resources for radiologists when they have to consult or refer while studying similar cases. In large healthcare facilities where millions of reports are generated, searching for relevant documents is quite challenging. With abundant interchangeable words in clinical domain, understanding the semantics of the words in the clinical documents is vital to improve the search results. This paper details an end to end semantic search application to address the large scale information retrieval problem of clinical reports. The paper specifically focuses on the challenge of identifying semantics in the clinical reports to facilitate search at semantic level. The semantic search works by mapping the documents into the concept space and the search is performed in the concept space. A unique approach of framing the concept mapping problem as a language translation problem is proposed in this paper. The concept mapper is modelled using the Neural machine translation model (NMT) based on encoder-decoder with attention architecture. The regular expression based concept mapper takes approximately 3 seconds to extract UMLS concepts from a single document, where as the trained NMT does the same in approximately 30 milliseconds. NMT based model further enables incorporation of negation detection to identify whether a concept is negated or not, facilitating search for negated queries.


Author(s):  
Pramila Arulanthu ◽  
Eswaran Perumal

: The medical data has an enormous quantity of information. This data set requires effective classification for accurate prediction. Predicting medical issues is an extremely difficult task in which Chronic Kidney Disease (CKD) is one of the major unpredictable diseases in medical field. Perhaps certain medical experts do not have identical awareness and skill to solve the issues of their patients. Most of the medical experts may have underprivileged results on disease diagnosis of their patients. Sometimes patients may lose their life in nature. As per the Global Burden of Disease (GBD-2015) study, death by CKD was ranked 17th place and GBD-2010 report 27th among the causes of death globally. Death by CKD is constituted 2·9% of all death between the year 2010 and 2013 among people from 15 to 69 age. As per World Health Organization (WHO-2005) report, 58 million people expired by CKD. Hence, this article presents the state of art review on Chronic Kidney Disease (CKD) classification and prediction. Normally, advanced data mining techniques, fuzzy and machine learning algorithms are used to classify medical data and disease diagnosis. This study reviews and summarizes many classification techniques and disease diagnosis methods presented earlier. The main intention of this review is to point out and address some of the issues and complications of the existing methods. It is also attempts to discuss the limitations and accuracy level of the existing CKD classification and disease diagnosis methods.


2019 ◽  
Vol 28 (4) ◽  
pp. 1-29 ◽  
Author(s):  
Michele Tufano ◽  
Cody Watson ◽  
Gabriele Bavota ◽  
Massimiliano Di Penta ◽  
Martin White ◽  
...  

Procedia CIRP ◽  
2021 ◽  
Vol 96 ◽  
pp. 9-14
Author(s):  
Uwe Dombrowski ◽  
Alexander Reiswich ◽  
Raphael Lamprecht

Sign in / Sign up

Export Citation Format

Share Document