Deep Learning for Text Style Transfer: A Survey

Abstract Text style transfer is an important task in natural language generation, which aims to control certain attributes in the generated text, such as politeness, emotion, humor, and many others. It has a long history in the field of natural language processing, and recently has re-gained significant attention thanks to the promising performance brought by deep neural models. In this paper, we present a systematic survey of the research on neural text style transfer, spanning over 100 representative articles since the first neural text style transfer work in 2017. We discuss the task formulation, existing datasets and subtasks, evaluation, as well as the rich methodologies in the presence of parallel and non-parallel data. We also provide discussions on a variety of important topics regarding the future development of this task.

Download Full-text

Daily estimates of individual discharge likelihood with deep learning natural language processing in general medicine: a prospective and external validation study

Internal and Emergency Medicine ◽

10.1007/s11739-021-02816-7 ◽

2021 ◽

Author(s):

Stephen Bacchi ◽

Toby Gilbert ◽

Samuel Gluck ◽

Joy Cheng ◽

Yiran Tan ◽

...

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Validation Study ◽

External Validation ◽

General Medicine ◽

External Validation Study

Download Full-text

Deep Learning on Graphs for Natural Language Processing

Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ◽

10.1145/3404835.3462809 ◽

2021 ◽

Author(s):

Lingfei Wu ◽

Yu Chen ◽

Heng Ji ◽

Bang Liu

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing

Download Full-text

Deep Learning Techniques on Text Classification Using Natural Language Processing (NLP) In Social Healthcare Network: A Comprehensive Survey

2021 3rd International Conference on Signal Processing and Communication (ICPSC) ◽

10.1109/icspc51351.2021.9451752 ◽

2021 ◽

Author(s):

PM. Lavanya ◽

E. Sasikala

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Text Classification ◽

Healthcare Network ◽

Learning Techniques ◽

Comprehensive Survey

Download Full-text

A natural language processing approach based on embedding deep learning from heterogeneous compounds for quantitative structure–activity relationship modeling

Chemical Biology & Drug Design ◽

10.1111/cbdd.13742 ◽

2020 ◽

Vol 96 (3) ◽

pp. 961-972

Author(s):

Khalid Bouhedjar ◽

Abdelbasset Boukelia ◽

Abdelmalek Khorief Nacereddine ◽

Anouar Boucheham ◽

Amine Belaidi ◽

...

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Quantitative Structure Activity Relationship ◽

Structure Activity Relationship ◽

Activity Relationship ◽

Quantitative Structure ◽

Structure Activity ◽

Processing Approach

Download Full-text

Speech Master: Natural Language Processing and Deep Learning Approach for Automated Speech Evaluation

10.1109/iemcon53756.2021.9623163 ◽

2021 ◽

Author(s):

K.G.C.M Kooragama ◽

L.R.W.D. Jayashanka ◽

J.A. Munasinghe ◽

K.W. Jayawardana ◽

Muditha Tissera ◽

...

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Learning Approach ◽

Speech Evaluation

Download Full-text

Deep Learning Approaches for Spoken and Natural Language Processing

10.1007/978-3-030-79778-2 ◽

2021 ◽

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Learning Approaches

Download Full-text

Use of Natural Language Processing and Deep Learning towards Guiding Healthy Cholesterol Free Life

10.1109/icac54203.2021.9671230 ◽

2021 ◽

Author(s):

Dilith Sasanka ◽

H. K. N Malshani ◽

Uchitha I. Wickramaratne ◽

Yashmitha Kavindi ◽

Muditha Tissera ◽

...

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing

Download Full-text

Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning

JMIR Medical Informatics ◽

10.2196/23230 ◽

2021 ◽

Vol 9 (8) ◽

pp. e23230

Author(s):

Pei-Fu Chen ◽

Ssu-Ming Wang ◽

Wei-Chih Liao ◽

Lu-Cheng Kuo ◽

Kuan-Chih Chen ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Deep Neural Network ◽

University Hospital ◽

Classification Model ◽

Icd 10 ◽

And Training

Background The International Classification of Diseases (ICD) code is widely used as the reference in medical system and billing purposes. However, classifying diseases into ICD codes still mainly relies on humans reading a large amount of written material as the basis for coding. Coding is both laborious and time-consuming. Since the conversion of ICD-9 to ICD-10, the coding task became much more complicated, and deep learning– and natural language processing–related approaches have been studied to assist disease coders. Objective This paper aims at constructing a deep learning model for ICD-10 coding, where the model is meant to automatically determine the corresponding diagnosis and procedure codes based solely on free-text medical notes to improve accuracy and reduce human effort. Methods We used diagnosis records of the National Taiwan University Hospital as resources and apply natural language processing techniques, including global vectors, word to vectors, embeddings from language models, bidirectional encoder representations from transformers, and single head attention recurrent neural network, on the deep neural network architecture to implement ICD-10 auto-coding. Besides, we introduced the attention mechanism into the classification model to extract the keywords from diagnoses and visualize the coding reference for training freshmen in ICD-10. Sixty discharge notes were randomly selected to examine the change in the F1-score and the coding time by coders before and after using our model. Results In experiments on the medical data set of National Taiwan University Hospital, our prediction results revealed F1-scores of 0.715 and 0.618 for the ICD-10 Clinical Modification code and Procedure Coding System code, respectively, with a bidirectional encoder representations from transformers embedding approach in the Gated Recurrent Unit classification model. The well-trained models were applied on the ICD-10 web service for coding and training to ICD-10 users. With this service, coders can code with the F1-score significantly increased from a median of 0.832 to 0.922 (P<.05), but not in a reduced interval. Conclusions The proposed model significantly improved the F1-score but did not decrease the time consumed in coding by disease coders.

Download Full-text

Towards a scientific workflow featuring Natural Language Processing for the digitisation of natural history collections

Research Ideas and Outcomes ◽

10.3897/rio.6.e55789 ◽

2020 ◽

Vol 6 ◽

Cited By ~ 3

Author(s):

David Owen ◽

Laurence Livermore ◽

Quentin Groom ◽

Alex Hardisty ◽

Thijs Leegwater ◽

...

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural History ◽

Natural Language ◽

Language Processing ◽

Scientific Workflow ◽

Entity Recognition ◽

Research Activities ◽

Handwritten Text ◽

Segmented Images

We describe an effective approach to automated text digitisation with respect to natural history specimen labels. These labels contain much useful data about the specimen including its collector, country of origin, and collection date. Our approach to automatically extracting these data takes the form of a pipeline. Recommendations are made for the pipeline's component parts based on some of the state-of-the-art technologies. Optical Character Recognition (OCR) can be used to digitise text on images of specimens. However, recognising text quickly and accurately from these images can be a challenge for OCR. We show that OCR performance can be improved by prior segmentation of specimen images into their component parts. This ensures that only text-bearing labels are submitted for OCR processing as opposed to whole specimen images, which inevitably contain non-textual information that may lead to false positive readings. In our testing Tesseract OCR version 4.0.0 offers promising text recognition accuracy with segmented images. Not all the text on specimen labels is printed. Handwritten text varies much more and does not conform to standard shapes and sizes of individual characters, which poses an additional challenge for OCR. Recently, deep learning has allowed for significant advances in this area. Google's Cloud Vision, which is based on deep learning, is trained on large-scale datasets, and is shown to be quite adept at this task. This may take us some way towards negating the need for humans to routinely transcribe handwritten text. Determining the countries and collectors of specimens has been the goal of previous automated text digitisation research activities. Our approach also focuses on these two pieces of information. An area of Natural Language Processing (NLP) known as Named Entity Recognition (NER) has matured enough to semi-automate this task. Our experiments demonstrated that existing approaches can accurately recognise location and person names within the text extracted from segmented images via Tesseract version 4.0.0. Potentially, NER could be used in conjunction with other online services, such as those of the Biodiversity Heritage Library to map the named entities to entities in the biodiversity literature (https://www.biodiversitylibrary.org/docs/api3.html). We have highlighted the main recommendations for potential pipeline components. The document also provides guidance on selecting appropriate software solutions. These include automatic language identification, terminology extraction, and integrating all pipeline components into a scientific workflow to automate the overall digitisation process.

Download Full-text

Deep Learning for Natural Language Processing

Handbook of Statistics - Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications ◽

10.1016/bs.host.2018.05.001 ◽

2018 ◽

pp. 317-328 ◽

Cited By ~ 8

Author(s):

Ying Xie ◽

Linh Le ◽

Yiyun Zhou ◽

Vijay V. Raghavan

Keyword(s):

Deep Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing

Download Full-text