Dual-dropout graph convolutional network for predicting synthetic lethality in human cancers

Ruichu Cai; Xuexin Chen; Yuan Fang; Min Wu; Yuexing Hao

doi:10.1093/bioinformatics/btaa211

Dual-dropout graph convolutional network for predicting synthetic lethality in human cancers

Bioinformatics ◽

10.1093/bioinformatics/btaa211 ◽

2020 ◽

Vol 36 (16) ◽

pp. 4458-4465 ◽

Cited By ~ 4

Author(s):

Ruichu Cai ◽

Xuexin Chen ◽

Yuan Fang ◽

Min Wu ◽

Yuexing Hao

Keyword(s):

Link Prediction ◽

Synthetic Lethality ◽

Gene Interaction ◽

Superior Performance ◽

Coarse Grained ◽

Supplementary Information ◽

Convolutional Network ◽

Sparse Graphs ◽

Fine Grained ◽

Convolutional Networks

Abstract Motivation Synthetic lethality (SL) is a promising form of gene interaction for cancer therapy, as it is able to identify specific genes to target at cancer cells without disrupting normal cells. As high-throughput wet-lab settings are often costly and face various challenges, computational approaches have become a practical complement. In particular, predicting SLs can be formulated as a link prediction task on a graph of interacting genes. Although matrix factorization techniques have been widely adopted in link prediction, they focus on mapping genes to latent representations in isolation, without aggregating information from neighboring genes. Graph convolutional networks (GCN) can capture such neighborhood dependency in a graph. However, it is still challenging to apply GCN for SL prediction as SL interactions are extremely sparse, which is more likely to cause overfitting. Results In this article, we propose a novel dual-dropout GCN (DDGCN) for learning more robust gene representations for SL prediction. We employ both coarse-grained node dropout and fine-grained edge dropout to address the issue that standard dropout in vanilla GCN is often inadequate in reducing overfitting on sparse graphs. In particular, coarse-grained node dropout can efficiently and systematically enforce dropout at the node (gene) level, while fine-grained edge dropout can further fine-tune the dropout at the interaction (edge) level. We further present a theoretical framework to justify our model architecture. Finally, we conduct extensive experiments on human SL datasets and the results demonstrate the superior performance of our model in comparison with state-of-the-art methods. Availability and implementation DDGCN is implemented in Python 3.7, open-source and freely available at https://github.com/CXX1113/Dual-DropoutGCN. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Knowledge Enhanced LSTM for Coreference Resolution on Biomedical Texts

Bioinformatics ◽

10.1093/bioinformatics/btab153 ◽

2021 ◽

Author(s):

Yufei Li ◽

Xiaoyong Ma ◽

Xiangyu Zhou ◽

Pengzhen Cheng ◽

Kai He ◽

...

Keyword(s):

Information Integration ◽

Short Term Memory ◽

Superior Performance ◽

Supplementary Information ◽

Specific Information ◽

Coreference Resolution ◽

Fine Grained ◽

Domain Specific ◽

Memory Network ◽

Biomedical Texts

Abstract Motivation Bio-entity Coreference Resolution focuses on identifying the coreferential links in biomedical texts, which is crucial to complete bio-events’ attributes and interconnect events into bio-networks. Previously, as one of the most powerful tools, deep neural network-based general domain systems are applied to the biomedical domain with domain-specific information integration. However, such methods may raise much noise due to its insufficiency of combining context and complex domain-specific information. Results In this paper, we explore how to leverage the external knowledge base in a fine-grained way to better resolve coreference by introducing a knowledge-enhanced Long Short Term Memory network (LSTM), which is more flexible to encode the knowledge information inside the LSTM. Moreover, we further propose a knowledge attention module to extract informative knowledge effectively based on contexts. The experimental results on the BioNLP and CRAFT datasets achieve state-of-the-art performance, with a gain of 7.5 F1 on BioNLP and 10.6 F1 on CRAFT. Additional experiments also demonstrate superior performance on the cross-sentence coreferences. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Revisiting Graph Based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5330 ◽

2020 ◽

Vol 34 (01) ◽

pp. 27-34 ◽

Cited By ~ 5

Author(s):

Lei Chen ◽

Le Wu ◽

Richang Hong ◽

Kun Zhang ◽

Meng Wang

Keyword(s):

Collaborative Filtering ◽

Representation Learning ◽

Superior Performance ◽

Convolutional Network ◽

Convolutional Networks ◽

Proposed Model ◽

Non Linear ◽

Efficiency And Effectiveness ◽

Residual Graph ◽

Interaction Modeling

Graph Convolutional Networks~(GCNs) are state-of-the-art graph based representation learning models by iteratively stacking multiple layers of convolution aggregation operations and non-linear activation operations. Recently, in Collaborative Filtering~(CF) based Recommender Systems~(RS), by treating the user-item interaction behavior as a bipartite graph, some researchers model higher-layer collaborative signals with GCNs. These GCN based recommender models show superior performance compared to traditional works. However, these models suffer from training difficulty with non-linear activations for large user-item graphs. Besides, most GCN based models could not model deeper layers due to the over smoothing effect with the graph convolution operation. In this paper, we revisit GCN based CF models from two aspects. First, we empirically show that removing non-linearities would enhance recommendation performance, which is consistent with the theories in simple graph convolutional networks. Second, we propose a residual network structure that is specifically designed for CF with user-item interaction modeling, which alleviates the over smoothing problem in graph convolution aggregation operation with sparse user-item interaction data. The proposed model is a linear model and it is easy to train, scale to large datasets, and yield better efficiency and effectiveness on two real datasets. We publish the source code at https://github.com/newlei/LR-GCCF.

Download Full-text

PARALLELIZATION OF THE Α‐STABLE MODELLING ALGORITHMS

Mathematical Modelling and Analysis ◽

10.3846/1392-6292.2007.12.409-418 ◽

2007 ◽

Vol 12 (4) ◽

pp. 409-418

Author(s):

Igoris Belovas ◽

Vadimas Starikovičius

Keyword(s):

Maximum Likelihood ◽

Stable Distributions ◽

Parameters Estimation ◽

Superior Performance ◽

Coarse Grained ◽

Likelihood Method ◽

Practical Implementation ◽

Computer Clusters ◽

Fine Grained ◽

Classical Models

Stable distributions have a wide sphere of application: probability theory, physics, electronics, economics, sociology. Particularly important role they play in financial mathematics, since the classical models of financial market, which are based on the hypothesis of the normality, often become inadequate. However, the practical implementation of stable models is a nontrivial task, because the probability density functions of α‐stable distributions have no analytical representations (with a few exceptions). In this work we exploit the parallel computing technologies for acceleration of numerical solution of stable modelling problems. Specifically, we are solving the stable law parameters estimation problem by the maximum likelihood method. If we need to deal with a big number of long financial series, only the means of parallel technologies can allow us to get results in a adequate time. We have distinguished and defined several hierarchical levels of parallelism. We show that coarse‐grained Multi‐Sets parallelization is very efficient on computer clusters. Fine‐grained Maximum Likelihood level is very efficient on shared memory machines with Symmetric multiprocessing and Hyper‐threading technologies. Hybrid application, which is utilizing both of those levels, has shown superior performance compared to single level (MS) parallel application on cluster of Pentium 4 HT nodes.

Download Full-text

Multi-task temporal convolutional networks for joint recognition of surgical phases and steps in gastric bypass procedures

International Journal of Computer Assisted Radiology and Surgery ◽

10.1007/s11548-021-02388-z ◽

2021 ◽

Author(s):

Sanat Ramesh ◽

Diego Dall’Alba ◽

Cristians Gonzalez ◽

Tong Yu ◽

Pietro Mascagni ◽

...

Keyword(s):

Gastric Bypass ◽

Automatic Segmentation ◽

Joint Modeling ◽

Superior Performance ◽

Computer Assisted ◽

Convolutional Network ◽

Single Task ◽

Convolutional Networks ◽

Multi Stage ◽

Gastric Bypass Procedure

Abstract Purpose Automatic segmentation and classification of surgical activity is crucial for providing advanced support in computer-assisted interventions and autonomous functionalities in robot-assisted surgeries. Prior works have focused on recognizing either coarse activities, such as phases, or fine-grained activities, such as gestures. This work aims at jointly recognizing two complementary levels of granularity directly from videos, namely phases and steps. Methods We introduce two correlated surgical activities, phases and steps, for the laparoscopic gastric bypass procedure. We propose a multi-task multi-stage temporal convolutional network (MTMS-TCN) along with a multi-task convolutional neural network (CNN) training setup to jointly predict the phases and steps and benefit from their complementarity to better evaluate the execution of the procedure. We evaluate the proposed method on a large video dataset consisting of 40 surgical procedures (Bypass40). Results We present experimental results from several baseline models for both phase and step recognition on the Bypass40. The proposed MTMS-TCN method outperforms single-task methods in both phase and step recognition by 1-2% in accuracy, precision and recall. Furthermore, for step recognition, MTMS-TCN achieves a superior performance of 3-6% compared to LSTM-based models on all metrics. Conclusion In this work, we present a multi-task multi-stage temporal convolutional network for surgical activity recognition, which shows improved results compared to single-task models on a gastric bypass dataset with multi-level annotations. The proposed method shows that the joint modeling of phases and steps is beneficial to improve the overall recognition of each type of activity.

Download Full-text

Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labeled Nodes

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6048 ◽

2020 ◽

Vol 34 (04) ◽

pp. 5892-5899

Author(s):

Ke Sun ◽

Zhouchen Lin ◽

Zhanxing Zhu

Keyword(s):

Supervised Learning ◽

State Of The Art ◽

Difficult Problem ◽

Superior Performance ◽

Learning Approach ◽

Training Algorithm ◽

Convolutional Network ◽

Convolutional Networks ◽

Learning Tasks ◽

Multi Stage

Graph Convolutional Networks (GCNs) play a crucial role in graph learning tasks, however, learning graph embedding with few supervised signals is still a difficult problem. In this paper, we propose a novel training algorithm for Graph Convolutional Network, called Multi-Stage Self-Supervised (M3S) Training Algorithm, combined with self-supervised learning approach, focusing on improving the generalization performance of GCNs on graphs with few labeled nodes. Firstly, a Multi-Stage Training Framework is provided as the basis of M3S training method. Then we leverage DeepCluster technique, a popular form of self-supervised learning, and design corresponding aligning mechanism on the embedding space to refine the Multi-Stage Training Framework, resulting in M3S Training Algorithm. Finally, extensive experimental results verify the superior performance of our algorithm on graphs with few labeled nodes under different label rates compared with other state-of-the-art approaches.

Download Full-text

A directed link prediction method using graph convolutional network based on social ranking theory

Intelligent Data Analysis ◽

10.3233/ida-195006 ◽

2021 ◽

Vol 25 (3) ◽

pp. 739-757

Author(s):

Zheng Wu ◽

Hongchang Chen ◽

Jianpeng Zhang ◽

Shuxin Liu ◽

Ruiyang Huang ◽

...

Keyword(s):

Link Prediction ◽

Real Life ◽

Prediction Method ◽

Prediction Problem ◽

Social Ranking ◽

Convolutional Network ◽

Directional Information ◽

Convolutional Networks ◽

Ranking Theory ◽

Real World Datasets

Graph convolutional networks (GCN) have recently emerged as powerful node embedding methods in network analysis tasks. Particularly, GCNs have been successfully leveraged to tackle the challenging link prediction problem, aiming at predicting missing links that exist yet were not found. However, most of these models are oriented to undirected graphs, which are limited to certain real-life applications. Therefore, based on the social ranking theory, we extend the GCN to address the directed link prediction problem. Firstly, motivated by the reciprocated and unreciprocated nature of social ties, we separate nodes in the neighbor subgraph of the missing link into the same, a higher-ranked and a lower-ranked set. Then, based on the three kinds of node sets, we propose a method to correctly aggregate and propagate the directional information across layers of a GCN model. Empirical study on 8 real-world datasets shows that our proposed method is capable of reserving rich information related to directed link direction and consistently performs well on graphs from numerous domains.

Download Full-text

An Efficient Residual-Based Method for Railway Image Dehazing

Sensors ◽

10.3390/s20216204 ◽

2020 ◽

Vol 20 (21) ◽

pp. 6204

Author(s):

Qinghong Liu ◽

Yong Qin ◽

Zhengyu Xie ◽

Zhiwei Cao ◽

Limin Jia

Keyword(s):

Superior Performance ◽

Coarse Grained ◽

Image Dehazing ◽

Fine Grained ◽

Full Reference ◽

Surrounding Environment ◽

Train Operation ◽

Block Based ◽

High Level ◽

Residual Block

Trains shuttle in semiopen environments, and the surrounding environment plays an important role in the safety of train operation. The weather is one of the factors that affect the surrounding environment of railways. Under haze conditions, railway monitoring and staff vision could be blurred, threatening railway safety. This paper tackles image dehazing for railways. The contributions of this paper for railway video image dehazing are as follows: (1) this paper proposes an end-to-end residual block-based haze removal method that consists of two subnetworks, namely fine-grained and coarse-grained network can directly generate the clean image from input hazy image, called RID-Net (Railway Image Dehazing Network). (2) The combined loss function (per-pixel loss and perceptual loss functions) is proposed to achieve both low-level features and high-level features so to generate the high-quality restored images. (3) We take the full-reference criterion (PSNR&SSIM), object detection, running time, and sensory vision to evaluate the proposed dehazing method. Experimental results on railway synthesized dataset, benchmark indoor dataset, and real-world dataset demonstrate our method has superior performance compared to the state-of-the-art methods.

Download Full-text

Fine-grained Event Categorization with Heterogeneous Graph Convolutional Networks

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/449 ◽

2019 ◽

Cited By ~ 10

Author(s):

Hao Peng ◽

Jianxin Li ◽

Qiran Gong ◽

Yangqiu Song ◽

Yuanxin Ning ◽

...

Keyword(s):

Semantic Relatedness ◽

Real Data ◽

Text Messages ◽

Social Event ◽

Convolutional Network ◽

Fine Grained ◽

Convolutional Networks ◽

Social Events ◽

Social Media Platforms ◽

Data Collections

Events are happening in real-world and real-time, which can be planned and organized occasions involving multiple people and objects. Social media platforms publish a lot of text messages containing public events with comprehensive topics. However, mining social events is challenging due to the heterogeneous event elements in texts and explicit and implicit social network structures. In this paper, we design an event meta-schema to characterize the semantic relatedness of social events and build an event-based heterogeneous information network (HIN) integrating information from external knowledge base, and propose a novel Pairwise Popularity Graph Convolutional Network (PP-GCN) based fine-grained social event categorization model. We propose a Knowledgeable meta-paths Instances based social Event Similarity (KIES) between events and build a weighted adjacent matrix as input to the PP-GCN model. Comprehensive experiments on real data collections are conducted to compare various social event detection and clustering tasks. Experimental results demonstrate that our proposed framework outperforms other alternative social event categorization techniques.

Download Full-text

Graph convolutional networks for epigenetic state prediction using both sequence and 3D genome data

Bioinformatics ◽

10.1093/bioinformatics/btaa793 ◽

2020 ◽

Vol 36 (Supplement_2) ◽

pp. i659-i667

Author(s):

Jack Lanchantin ◽

Yanjun Qi

Keyword(s):

Long Range ◽

Dna Sequences ◽

State Of The Art ◽

Supplementary Information ◽

Convolutional Network ◽

3D Genome ◽

Convolutional Networks ◽

Genome Data ◽

Epigenetic State ◽

Local Sequence

Abstract Motivation Predictive models of DNA chromatin profile (i.e. epigenetic state), such as transcription factor binding, are essential for understanding regulatory processes and developing gene therapies. It is known that the 3D genome, or spatial structure of DNA, is highly influential in the chromatin profile. Deep neural networks have achieved state of the art performance on chromatin profile prediction by using short windows of DNA sequences independently. These methods, however, ignore the long-range dependencies when predicting the chromatin profiles because modeling the 3D genome is challenging. Results In this work, we introduce ChromeGCN, a graph convolutional network for chromatin profile prediction by fusing both local sequence and long-range 3D genome information. By incorporating the 3D genome, we relax the independent and identically distributed assumption of local windows for a better representation of DNA. ChromeGCN explicitly incorporates known long-range interactions into the modeling, allowing us to identify and interpret those important long-range dependencies in influencing chromatin profiles. We show experimentally that by fusing sequential and 3D genome data using ChromeGCN, we get a significant improvement over the state-of-the-art deep learning methods as indicated by three metrics. Importantly, we show that ChromeGCN is particularly useful for identifying epigenetic effects in those DNA windows that have a high degree of interactions with other DNA windows. Availability and implementation https://github.com/QData/ChromeGCN. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction

Bioinformatics ◽

10.1093/bioinformatics/btz965 ◽

2020 ◽

Vol 36 (8) ◽

pp. 2538-2546 ◽

Cited By ~ 9

Author(s):

Jin Li ◽

Sai Zhang ◽

Tao Liu ◽

Chenxi Ning ◽

Zhuoxuan Zhang ◽

...

Keyword(s):

Characteristic Curve ◽

Matrix Completion ◽

Disease Association ◽

Supplementary Information ◽

Convolutional Network ◽

Convolutional Networks ◽

Feature Representations ◽

Disease Similarity ◽

Disease Associations ◽

Association Data

Abstract Motivation Predicting the association between microRNAs (miRNAs) and diseases plays an import role in identifying human disease-related miRNAs. As identification of miRNA-disease associations via biological experiments is time-consuming and expensive, computational methods are currently used as effective complements to determine the potential associations between disease and miRNA. Results We present a novel method of neural inductive matrix completion with graph convolutional network (NIMCGCN) for predicting miRNA-disease association. NIMCGCN first uses graph convolutional networks to learn miRNA and disease latent feature representations from the miRNA and disease similarity networks. Then, learned features were input into a novel neural inductive matrix completion (NIMC) model to generate an association matrix completion. The parameters of NIMCGCN were learned based on the known miRNA-disease association data in a supervised end-to-end way. We compared the proposed method with other state-of-the-art methods. The area under the receiver operating characteristic curve results showed that our method is significantly superior to existing methods. Furthermore, 50, 47 and 48 of the top 50 predicted miRNAs for three high-risk human diseases, namely, colon cancer, lymphoma and kidney cancer, were verified using experimental literature. Finally, 100% prediction accuracy was achieved when breast cancer was used as a case study to evaluate the ability of NIMCGCN for predicting a new disease without any known related miRNAs. Availability and implementation https://github.com/ljatynu/NIMCGCN/ Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text