Research on Property Prediction of Materials Based on Machine Learning

Abstract

Neural Message Passing for graphs is a promising and relatively recent approach for applying Machine Learning to networked data. As molecules can be described intrinsically as a molecular graph, it makes sense to apply these techniques to improve molecular property prediction in the field of cheminformatics. We introduce Attention and Edge Memory schemes to the existing message passing neural network framework, and benchmark our approaches against eight different physical–chemical and bioactivity datasets from the literature. We remove the need to introduce a priori knowledge of the task and chemical descriptor calculation by using only fundamental graph-derived properties. Our results consistently perform on-par with other state-of-the-art machine learning approaches, and set a new standard on sparse multi-task virtual screening targets. We also investigate model performance as a function of dataset preprocessing, and make some suggestions regarding hyperparameter selection.
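The idea of attention-weighted message passing on a molecular graph can be illustrated with a small sketch. This is not the authors' Attention or Edge Memory scheme: the function names (`softmax`, `attention_message_pass`, `readout`), the parameter-free dot-product scoring, the additive update, and the one-hot atom features are all assumptions chosen to keep the example short, and nothing here is learned from data.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_message_pass(node_feats, edges):
    """One round of attention-weighted message passing (illustrative only).

    node_feats: dict mapping node id -> list of floats (all the same length)
    edges: list of undirected (u, v) bonds
    Returns a new dict with updated node features.
    """
    neighbours = {node: [] for node in node_feats}
    for u, v in edges:
        neighbours[u].append(v)
        neighbours[v].append(u)

    updated = {}
    for node, feat in node_feats.items():
        nbrs = neighbours[node]
        if not nbrs:
            # Isolated atom: nothing to aggregate.
            updated[node] = list(feat)
            continue
        # Assumed scoring rule: dot product between the node and each neighbour.
        scores = [sum(a * b for a, b in zip(feat, node_feats[n])) for n in nbrs]
        weights = softmax(scores)
        # The message is the attention-weighted sum of neighbour features.
        message = [
            sum(w * node_feats[n][i] for w, n in zip(weights, nbrs))
            for i in range(len(feat))
        ]
        # Assumed update rule: add the message to the current features.
        updated[node] = [f + m for f, m in zip(feat, message)]
    return updated


def readout(node_feats):
    """Sum node features into a single graph-level vector."""
    dim = len(next(iter(node_feats.values())))
    return [sum(f[i] for f in node_feats.values()) for i in range(dim)]


if __name__ == "__main__":
    # Heavy atoms of ethanol (C-C-O) with hypothetical one-hot features [is_C, is_O].
    feats = {0: [1.0, 0.0], 1: [1.0, 0.0], 2: [0.0, 1.0]}
    bonds = [(0, 1), (1, 2)]
    print(readout(attention_message_pass(feats, bonds)))
```

In a trained model the scoring and update steps would be parameterised and the graph-level vector would feed a regression or classification head; the sketch only shows how attention weights decide how much each neighbouring atom contributes to a message.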

Application of Machine Learning in Material Property Prediction

Advances in Condensed Matter Physics ◽ 10.12677/cmp.2020.92002 ◽ 2020 ◽ Vol 09 (02) ◽ pp. 11-19

Author(s): 磊提

Keyword(s): Machine Learning ◽ Material Property ◽ Property Prediction ◽ Material Property Prediction

Materials Representation and Transfer Learning for Multi-Property Prediction

10.26434/chemrxiv.14612307.v1 ◽ 2021

Author(s): Shufeng Kong ◽ Dan Guevarra ◽ Carla P. Gomes ◽ John Gregoire

Keyword(s): Machine Learning ◽ Optical Absorption ◽ Transfer Learning ◽ Materials Science ◽ Training Data ◽ Target Domain ◽ Generative Adversarial Network ◽ Property Prediction ◽ Adversarial Network ◽ Correlation Learning

The adoption of machine learning in materials science has rapidly transformed materials property prediction. Hurdles limiting full capitalization of recent advancements in machine learning include the limited development of methods to learn the underlying interactions of multiple elements, as well as the relationships among multiple properties, to facilitate property prediction in new composition spaces. To address these issues, we introduce the Hierarchical Correlation Learning for Multi-property Prediction (H-CLMP) framework that seamlessly integrates (i) prediction using only a material’s composition, (ii) learning and exploitation of correlations among target properties in multitarget regression, and (iii) leveraging training data from tangential domains via generative transfer learning. The model is demonstrated for prediction of spectral optical absorption of complex metal oxides spanning 69 3-cation metal oxide composition spaces. H-CLMP accurately predicts non-linear composition-property relationships in composition spaces for which no training data is available, which broadens the purview of machine learning to the discovery of materials with exceptional properties. This achievement results from the principled integration of latent embedding learning, property correlation learning, generative transfer learning, and attention models. The best performance is obtained using H-CLMP with Transfer learning (H-CLMP(T)) wherein a generative adversarial network is trained on computational density of states data and deployed in the target domain to augment prediction of optical absorption from composition. H-CLMP(T) aggregates multiple knowledge sources with a framework that is well-suited for multi-target regression across the physical sciences.

Property prediction and properties-to-microstructure inverse analysis of steels by a machine-learning approach

Materials Science and Engineering A ◽ 10.1016/j.msea.2018.12.049 ◽ 2019 ◽ Vol 744 ◽ pp. 661-670 ◽ Cited By ~ 19

Author(s): Zhi-Lei Wang ◽ Yoshitaka Adachi

Keyword(s): Machine Learning ◽ Inverse Analysis ◽ Learning Approach ◽ Property Prediction ◽ Machine Learning Approach

Machine Learning for Organic Cage Property Prediction

10.26434/chemrxiv.6995018.v2 ◽ 2018

Author(s): Lukas Turcani ◽ Rebecca L. Greenaway ◽ Kim Jelfs

Keyword(s): Machine Learning ◽ Open Source ◽ Data Sets ◽ Cavity Size ◽ Learning Models ◽ Property Prediction ◽ Online Tool ◽ Machine Learning Models

We use machine learning to predict shape persistence and cavity size in porous organic cages. The majority of hypothetical organic cages suffer from a lack of shape persistence and as a result lack intrinsic porosity, rendering them unsuitable for many applications. We have created the largest computational database of these molecules to date, numbering 63,472 cages, formed through a range of reaction chemistries and in multiple topologies. We study our database and identify features which lead to the formation of shape persistent cages. We find that the imine condensation of trialdehydes and diamines in a [4+6] reaction is the most likely to result in shape persistent cages, whereas thiol reactions are most likely to give collapsed cages. Using this database, we develop machine learning models capable of predicting shape persistence with an accuracy of up to 93%, reducing the time taken to predict this property to milliseconds, and removing the need for specialist software. In addition, we develop machine learning models for two other key properties of these molecules, cavity size and symmetry. We provide open-source implementations of our models, together with the accompanying data sets, and an online tool giving users access to our models to easily obtain predictions for a hypothetical cage prior to a synthesis attempt.

Machine Learning for Organic Cage Property Prediction

10.26434/chemrxiv.6995018.v3 ◽ 2018

Author(s): Lukas Turcani ◽ Rebecca L. Greenaway ◽ Kim Jelfs

Keyword(s): Machine Learning ◽ Open Source ◽ Data Sets ◽ Cavity Size ◽ Learning Models ◽ Property Prediction ◽ Online Tool ◽ Machine Learning Models

Machine Learning for Organic Cage Property Prediction

10.26434/chemrxiv.6995018 ◽ 2018

Author(s): Lukas Turcani ◽ Rebecca L. Greenaway ◽ Kim Jelfs

Keyword(s): Machine Learning ◽ Open Source ◽ Data Sets ◽ Cavity Size ◽ Learning Models ◽ Property Prediction ◽ Online Tool ◽ Machine Learning Models
