Directed graph attention neural network utilizing 3D coordinates for molecular property prediction

Predicting molecular properties (e.g., atomization energy) is an essential problem in quantum chemistry, and solving it could speed up research in areas such as drug design and substance discovery. Traditional approaches based on density functional theory (DFT) have proved too time-consuming for predicting the properties of large numbers of molecules. Recently, machine learning methods that incorporate rule-based information have also shown potential for this problem. However, the complex inherent quantum interactions of molecules remain largely underexplored by existing solutions. In this paper, we propose a generalizable and transferable Multilevel Graph Convolutional neural Network (MGCN) for molecular property prediction. Specifically, we represent each molecule as a graph to preserve its internal structure. A hierarchical graph neural network then extracts features directly from the conformation and spatial information and models the multilevel interactions. The resulting multilevel overall representations are used to make the prediction. Extensive experiments on datasets of both equilibrium and off-equilibrium molecules demonstrate the effectiveness of our model. Furthermore, the detailed results show that MGCN is generalizable and transferable.
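
To make the "molecule as a graph" idea concrete, the following is a minimal sketch of one common way to build such a graph from atom types and 3D coordinates. It is not the MGCN implementation: the function name `build_molecular_graph`, the optional `cutoff` parameter, and the water geometry are assumptions introduced here for illustration only.

```python
import math
from itertools import combinations


def build_molecular_graph(atoms, coords, cutoff=None):
    """Build a simple molecular graph (hypothetical helper, not from the paper).

    atoms  -- list of element symbols, one per atom (the graph nodes)
    coords -- list of (x, y, z) tuples in the same order as `atoms`
    cutoff -- optional distance threshold; pairs farther apart get no edge

    Returns (nodes, edges), where edges maps (i, j) with i < j to the
    Euclidean distance between atom i and atom j.
    """
    if len(atoms) != len(coords):
        raise ValueError("atoms and coords must have the same length")

    edges = {}
    for i, j in combinations(range(len(atoms)), 2):
        distance = math.dist(coords[i], coords[j])
        # With no cutoff the graph is complete: every atom pair is connected.
        if cutoff is None or distance <= cutoff:
            edges[(i, j)] = distance
    return list(atoms), edges


# Water with approximate, illustrative coordinates in angstroms (assumed values).
atoms = ["O", "H", "H"]
coords = [(0.0, 0.0, 0.0), (0.96, 0.0, 0.0), (-0.24, 0.93, 0.0)]

nodes, edges = build_molecular_graph(atoms, coords)
bonded_only = build_molecular_graph(atoms, coords, cutoff=1.2)[1]
```

Without a cutoff every atom pair carries a distance, which keeps the spatial information available to a downstream model. A cutoff restricts the graph to nearby atoms, which in this example leaves only the two O-H pairs.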

Scale-Aware Graph-Based Machine Learning for Accurate Molecular Property Prediction

2020 IEEE International Conference on Big Data (Big Data), 2020
DOI: 10.1109/bigdata50022.2020.9377905

Author(s): Gyoung S. Na, Hyun Woo Kim, Hyunju Chang

Keyword(s): Machine Learning, Molecular Property, Property Prediction

A systematic comparison study on hyperparameter optimisation of graph neural networks for molecular property prediction

Proceedings of the Genetic and Evolutionary Computation Conference, 2021
DOI: 10.1145/3449639.3459370

Author(s): Yingfang Yuan, Wenjun Wang, Wei Pang

Keyword(s): Neural Networks, Molecular Property, Comparison Study, Property Prediction, Systematic Comparison, Graph Neural Networks

Soft computing for qualitative and quantitative seismic object and reservoir property prediction. Part 1: Neural network applications

First Break, 2004, Vol 22 (3)
DOI: 10.3997/1365-2397.22.3.25812

Author(s): F. Aminzadeh, P. de Groot

Keyword(s): Neural Network, Soft Computing, Property Prediction, Qualitative And Quantitative, Network Applications, Reservoir Property, Neural Network Applications

Mol-BERT: An Effective Molecular Representation with BERT for Molecular Property Prediction

Wireless Communications and Mobile Computing, 2021, Vol 2021, pp. 1-7
DOI: 10.1155/2021/7181815

Author(s): Juncai Li, Xiaofei Jiang

Keyword(s): Deep Learning, Language Processing, Large Scale, Molecular Data, Molecular Property, Property Prediction, Learning Framework, Learning Techniques, Potential Benefits, Current Sequence

Molecular property prediction is an essential task in drug discovery. Most deep learning approaches focus either on designing novel molecular representations or on combining existing representations with more advanced models. However, researchers have paid less attention to the potential benefits of massive unlabeled molecular data (e.g., ZINC). The task is becoming increasingly challenging because of the limited scale of labeled data. Given the recent advances of pretrained models in natural language processing, and because a drug molecule can to some extent be viewed as language, we investigate how to adapt the pretrained model BERT to extract useful molecular substructure information for molecular property prediction. We present a novel end-to-end deep learning framework, named Mol-BERT, that combines an effective molecular representation with a pretrained BERT model tailored for molecular property prediction. Specifically, a large-scale BERT model is pretrained to generate embeddings of molecular substructures using four million unlabeled drug SMILES (i.e., ZINC 15 and ChEMBL 27). The pretrained BERT model can then be fine-tuned on various molecular property prediction tasks. To examine the performance of Mol-BERT, we conduct several experiments on 4 widely used molecular datasets. Compared with traditional and state-of-the-art baselines, the results show that Mol-BERT outperforms the current sequence-based methods and achieves at least a 2% improvement in ROC-AUC score on the Tox21, SIDER, and ClinTox datasets.
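
As an illustration of treating SMILES strings "as language", the sketch below tokenizes a SMILES string and masks random tokens in the style of BERT pretraining. It is not the Mol-BERT pipeline: the abstract does not describe its tokenization or masking scheme, so the regex tokenizer, the `mask_tokens` helper, the `[MASK]` placeholder, and the 15% masking rate are all assumptions made here for illustration.

```python
import random
import re

# Simplified tokenizer (an assumption): bracket atoms such as [NH3+] and the
# two-letter halogens Br and Cl are kept whole; everything else is split
# into single characters.
SMILES_TOKEN = re.compile(r"\[[^\]]+\]|Br|Cl|.")


def tokenize(smiles):
    """Split a SMILES string into a list of tokens."""
    return SMILES_TOKEN.findall(smiles)


def mask_tokens(tokens, mask_prob=0.15, seed=None):
    """Replace each token with "[MASK]" with probability `mask_prob`.

    Returns (masked, labels): `masked` is the model input, and `labels`
    holds the original token at masked positions and None elsewhere, so a
    model would be trained to predict only the hidden tokens.
    """
    rng = random.Random(seed)
    masked, labels = [], []
    for token in tokens:
        if rng.random() < mask_prob:
            masked.append("[MASK]")
            labels.append(token)
        else:
            masked.append(token)
            labels.append(None)
    return masked, labels


tokens = tokenize("CC(=O)Oc1ccccc1C(=O)O")  # aspirin
masked, labels = mask_tokens(tokens, seed=0)
```

Because only unlabeled strings are needed to build these input and label pairs, this kind of objective can use large collections such as the ZINC and ChEMBL sets mentioned above before any fine-tuning on labeled property data.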
