Gradient-Norm Based Attentive Loss for Molecular Property Prediction

Author(s):  
Hehuan Ma ◽  
Yu Rong ◽  
Boyang Liu ◽  
Yuzhi Guo ◽  
Chaochao Yan ◽  
...  
2021 ◽  
Vol 2021 ◽  
pp. 1-7
Author(s):  
Juncai Li ◽  
Xiaofei Jiang

Molecular property prediction is an essential task in drug discovery. Most deep learning approaches focus either on designing novel molecular representations or on combining existing representations with more advanced models. However, researchers have paid less attention to the potential benefits of massive unlabeled molecular data (e.g., ZINC). The task becomes increasingly challenging because labeled data are limited in scale. Motivated by recent advances in pretrained models for natural language processing, and given that a drug molecule can to some extent be viewed as a language, we investigate how to adapt the pretrained model BERT to extract useful molecular substructure information for molecular property prediction. We present a novel end-to-end deep learning framework, named Mol-BERT, that combines an effective molecular representation with a pretrained BERT model tailored for molecular property prediction. Specifically, a large-scale BERT model is pretrained on four million unlabeled drug SMILES (from ZINC 15 and ChEMBL 27) to generate embeddings of molecular substructures. The pretrained BERT model can then be fine-tuned on various molecular property prediction tasks. To evaluate Mol-BERT, we conduct several experiments on four widely used molecular datasets. Compared with traditional and state-of-the-art baselines, Mol-BERT outperforms current sequence-based methods and achieves at least a 2% improvement in ROC-AUC score on the Tox21, SIDER, and ClinTox datasets.


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 127968-127968
Author(s):  
Shuang Wang ◽  
Zhen Li ◽  
Shugang Zhang ◽  
Mingjian Jiang ◽  
Xiaofeng Wang ◽  
...  

2020 ◽  
Vol 60 (6) ◽  
pp. 2697-2717 ◽  
Author(s):  
Gabriele Scalia ◽  
Colin A. Grambow ◽  
Barbara Pernici ◽  
Yi-Pei Li ◽  
William H. Green

IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 18601-18614 ◽  
Author(s):  
Shuang Wang ◽  
Zhen Li ◽  
Shugang Zhang ◽  
Mingjian Jiang ◽  
Xiaofeng Wang ◽  
...  

2020 ◽  
Vol 60 (8) ◽  
pp. 3770-3780 ◽  
Author(s):  
Lior Hirschfeld ◽  
Kyle Swanson ◽  
Kevin Yang ◽  
Regina Barzilay ◽  
Connor W. Coley
