Discovering microbe-disease associations from the literature using a hierarchical long short-term memory network and an ensemble parser model

Abstract: With recent advances in biotechnology and sequencing technology, the microbial community has been intensively studied and found to be associated with many chronic as well as acute diseases. Even though a tremendous number of studies describing the association between microbes and diseases have been published, text mining methods that focus on such associations have rarely been studied. We propose a framework that combines machine learning and natural language processing methods to analyze the association between microbes and diseases. A hierarchical long short-term memory network was used to detect sentences that describe the association. For the detected sentences, two different parse tree-based search methods were combined to find the relation-describing word. The ensemble model of constituency parsing for structural pattern matching and dependency-based relation extraction improved the prediction accuracy. By combining deep learning and parse tree-based extractions, our proposed framework could extract the microbe-disease association with higher accuracy. The evaluation results showed that our system achieved F-scores of 0.8764 and 0.8524 in binary decisions and in extracting relation words, respectively. As a case study, we performed a large-scale analysis of the association between microbes and diseases. Additionally, a set of common microbes shared by multiple diseases was identified in this study. This study could provide valuable information on the major microbes that were studied for a specific disease. The code and data are available at https://github.com/DMnBI/mdi_predictor.
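The F-scores reported above are, by the usual definition, the harmonic mean of precision and recall. The following is a minimal sketch of that computation, not code from the linked repository; the function name `f_score` and the example counts are hypothetical and chosen only for illustration.

```python
def f_score(tp: int, fp: int, fn: int) -> float:
    """Return the F1 score, the harmonic mean of precision and recall."""
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only; they are not taken from the paper.
example = f_score(tp=80, fp=10, fn=20)
```

The same value can be written as 2·TP / (2·TP + FP + FN), which is a convenient check on the implementation.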


Prediction of Drug-Disease Associations Based on Long Short-Term Memory Network and Gaussian Interaction Profile Kernel

Communications in Computer and Information Science - Bio-Inspired Computing: Theories and Applications, 2021, pp. 432-444. DOI: 10.1007/978-981-16-1354-8_31

Author(s): Han-Jing Jiang, Yan-Bin Wang, Yabing Huang

Keyword(s): Short Term Memory, Short Term, Term Memory, Disease Associations, Memory Network, Long Short Term Memory


A Customized Attention-Based Long Short-Term Memory Network for Distant Supervised Relation Extraction

Neural Computation, 2017, Vol 29 (7), pp. 1964-1985. DOI: 10.1162/neco_a_00970. Cited By ~ 1

Author(s): Dengchao He, Hongjun Zhang, Wenning Hao, Rui Zhang, Kai Cheng

Keyword(s): False Positive, Short Term Memory, Relation Extraction, Short Term, Training Corpus, Term Memory, Distant Supervision, Positive Data, Memory Network, Long Short Term Memory

Distant supervision, a widely applied approach in the field of relation extraction, can automatically generate large amounts of labeled training data with minimal manual effort. However, the labeled training corpus may contain many false-positive instances, which hurt the performance of relation extraction. Moreover, traditional feature-based distant supervised approaches rely on human-designed features produced by natural language processing tools, which may also cause poor performance. To address these two shortcomings, we propose a customized attention-based long short-term memory network. Our approach adopts word-level attention to obtain a better data representation for distantly supervised relation extraction without manually designed features, and it uses instance-level attention to tackle the problem of false-positive data. Experimental results demonstrate that our proposed approach is effective and achieves better performance than traditional methods.
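The idea of instance-level attention can be illustrated with a small sketch: each sentence (instance) that mentions an entity pair is scored against a relation query vector, the scores are normalized with a softmax, and the instances are pooled by their weights, so that likely false positives contribute less. This is a simplified illustration under assumed names (`softmax`, `attend`, and the toy vectors are hypothetical), not the customized model described in the abstract.

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(instances: Sequence[Sequence[float]],
           query: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Weight instance vectors by their dot-product match with a query vector.

    Returns the attention weights and the weighted sum of the instances.
    """
    scores = [sum(x * q for x, q in zip(inst, query)) for inst in instances]
    weights = softmax(scores)
    pooled = [sum(w * inst[d] for w, inst in zip(weights, instances))
              for d in range(len(query))]
    return weights, pooled


# Toy vectors for illustration: the first instance matches the query better,
# so it receives the larger weight.
weights, pooled = attend([[1.0, 0.0], [0.0, 1.0]], [1.0, 0.0])
```

In a trained model the instance vectors would come from the sentence encoder and the query vector would be learned per relation; here both are fixed by hand.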


Extracting entities with attributes in clinical text via joint deep learning

Journal of the American Medical Informatics Association, 2019, Vol 26 (12), pp. 1584-1591. DOI: 10.1093/jamia/ocz158. Cited By ~ 1

Author(s): Xue Shi, Yingping Yi, Ying Xiong, Buzhou Tang, Qingcai Chen, ...

Keyword(s): Deep Learning, Language Processing, Short Term Memory, Conditional Random Field, Relation Extraction, Entity Recognition, Short Term, Term Memory, Clinical Text, Long Short Term Memory

Abstract

Objective: Extracting clinical entities and their attributes is a fundamental task of natural language processing (NLP) in the medical domain. This task is typically recognized as 2 sequential subtasks in a pipeline, clinical entity or attribute recognition followed by entity-attribute relation extraction. One problem of pipeline methods is that errors from entity recognition are unavoidably passed to relation extraction. We propose a novel joint deep learning method to recognize clinical entities or attributes and extract entity-attribute relations simultaneously.

Materials and Methods: The proposed method integrates 2 state-of-the-art methods for named entity recognition and relation extraction, namely bidirectional long short-term memory with conditional random field and bidirectional long short-term memory, into a unified framework. In this method, relation constraints between clinical entities and attributes and weights of the 2 subtasks are also considered simultaneously. We compare the method with other related methods (ie, pipeline methods and other joint deep learning methods) on an existing English corpus from SemEval-2015 and a newly developed Chinese corpus.

Results: Our proposed method achieves the best F1 of 74.46% on entity recognition and the best F1 of 50.21% on relation extraction on the English corpus, and 89.32% and 88.13% on the Chinese corpora, respectively, which outperform the other methods on both tasks.

Conclusions: The joint deep learning-based method could improve both entity recognition and relation extraction from clinical text in both English and Chinese, indicating that the approach is promising.


A Multi-attention-Based Bidirectional Long Short-Term Memory Network for Relation Extraction

Neural Information Processing - Lecture Notes in Computer Science, 2017, pp. 216-227. DOI: 10.1007/978-3-319-70139-4_22. Cited By ~ 9

Author(s): Lingfeng Li, Yuanping Nie, Weihong Han, Jiuming Huang

Keyword(s): Short Term Memory, Relation Extraction, Short Term, Term Memory, Memory Network, Long Short Term Memory


HLSTM: Heterogeneous Long Short-Term Memory Network for Large-scale InSAR Surface Subsidence Prediction

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, pp. 1-1. DOI: 10.1109/jstars.2021.3106666

Author(s): Qing hao Liu, Yonghong Zhang, Jujie Wei, Hong'an Wu, Min Deng

Keyword(s): Large Scale, Short Term Memory, Surface Subsidence, Short Term, Term Memory, Subsidence Prediction, Memory Network, Long Short Term Memory


Sleep Breathing Disorders Detection with Bioradar Using a Long Short-Term Memory Network

2020 XXXIIIrd General Assembly and Scientific Symposium of the International Union of Radio Science, 2020. DOI: 10.23919/ursigass49373.2020.9232203

Author(s): Lesya Anishchenko, Ludmila Korostovtseva, Mikhail Bochkarev, Yurii Sviryaev

Keyword(s): Short Term Memory, Short Term, Term Memory, Sleep Breathing Disorders, Breathing Disorders, Memory Network, Long Short Term Memory


Application of Improved Long-short-term Memory Network in Human Morphology Detection

2019 IEEE 7th International Conference on Computer Science and Network Technology (ICCSNT), 2019. DOI: 10.1109/iccsnt47585.2019.8962454

Author(s): Ming Huang, Tao Wen, Xu Liang

Keyword(s): Short Term Memory, Short Term, Term Memory, Human Morphology, Memory Network, Long Short Term Memory


Automated detection of arrhythmia from electrocardiogram signal based on new convolutional encoded features with bidirectional long short-term memory network classifier

Physical and Engineering Sciences in Medicine, 2021. DOI: 10.1007/s13246-020-00965-1

Author(s): Saroj Kumar Pandey, Rekh Ram Janghel

Keyword(s): Short Term Memory, Automated Detection, Short Term, Term Memory, Memory Network, Electrocardiogram Signal, Long Short Term Memory


Synthetic State of Charge Estimation for Lithium-ion Batteries Based on Long Short-Term Memory Network Modeling and Adaptive H-Infinity Filter

Energy, 2021, pp. 120630. DOI: 10.1016/j.energy.2021.120630

Author(s): Zheng Chen, Hongqian Zhao, Xing Shu, Yuanjian Zhang, Jiangwei Shen, ...

Keyword(s): Lithium Ion Batteries, Short Term Memory, Network Modeling, Lithium Ion, Short Term, H Infinity, Term Memory, Memory Network, Long Short Term Memory


Small Floating Target Detection Method Based on Chaotic Long Short-Term Memory Network

Journal of Marine Science and Engineering, 2021, Vol 9 (6), pp. 651. DOI: 10.3390/jmse9060651

Author(s): Yan Yan, Hongyan Xing

Keyword(s): High Frequency, Short Term Memory, Detection Method, Low Frequency, Sea Clutter, Short Term, Term Memory, Memory Network, Long Short Term Memory, Small Targets

To improve the detection of small floating targets in sea clutter, the signal is first decomposed with the complete ensemble empirical mode decomposition (CEEMD) algorithm, and the intrinsic mode functions (IMFs) are split into high-frequency and low-frequency parts according to their energy proportion. The high-frequency IMFs are denoised by wavelet packet transform (WPT), and the denoised high-frequency IMFs are then combined with the low-frequency IMFs to reconstruct the pure sea clutter signal. Based on the chaotic characteristics of sea clutter, we propose an adaptive training-timestep strategy in which the number of training timesteps of the network is determined by the width of the embedded window, and we design a chaotic long short-term memory (LSTM) network detector on this basis. The denoised sea clutter signals are predicted by the chaotic LSTM network, and small target signals are detected from the prediction errors. The experimental results showed that the CEEMD-WPT algorithm was consistent with the target distribution characteristics of sea clutter and improved the denoising performance by 33.6% on average. The proposed chaotic LSTM network, which determines the training step length according to the width of the embedded window, is a new detection method that can accurately detect small targets submerged in the background of sea clutter.
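Detecting targets from prediction errors can be sketched as follows: a predictor trained on pure sea clutter fits clutter well, so samples where the observed signal departs strongly from the prediction are flagged as possible targets. The abstract does not state the decision rule, so the mean-plus-k-standard-deviations threshold, the function name `detect_targets`, and the toy numbers below are assumptions for illustration; the predictions stand in for the output of the LSTM network.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def detect_targets(observed: Sequence[float],
                   predicted: Sequence[float],
                   reference_errors: Sequence[float],
                   k: float = 3.0) -> List[int]:
    """Return indices whose absolute prediction error exceeds a threshold.

    The threshold is mean + k * standard deviation of errors measured on a
    target-free reference segment (an assumed rule, not from the paper).
    """
    threshold = mean(reference_errors) + k * pstdev(reference_errors)
    errors = [abs(o - p) for o, p in zip(observed, predicted)]
    return [i for i, e in enumerate(errors) if e > threshold]


# Toy data for illustration only: one sample deviates sharply from the prediction.
flagged = detect_targets(
    observed=[1.0, 1.1, 3.0, 0.9],
    predicted=[1.0, 1.0, 1.0, 1.0],
    reference_errors=[0.1, 0.2, 0.1, 0.2],
)
```

A larger `k` lowers the false-alarm rate at the cost of missing weaker targets, which is the usual trade-off for this kind of thresholding.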
