Synthesis of covalent organic frameworks using sustainable solvents and machine learning

Green Chemistry ◽

10.1039/d1gc02796d ◽

2021 ◽

Author(s):

SUSHIL KUMAR ◽

Gergo Ignacz ◽

Gyorgy Szekely

Keyword(s):

Machine Learning ◽

Long Range ◽

Covalent Organic Frameworks ◽

Sustainable Solvents

Covalent organic frameworks (COFs) have attracted considerable interest owing to their structural predesign ability, con-trollable chemistry, long-range periodicity, and pore interior functionalization ability. The most widely adopted sol-vothermal synthesis of...

Download Full-text

Precipitation Modeling for Extreme Weather Based on Sparse Hybrid Machine Learning and Markov Chain Random Field in a Multi-Scale Subspace

Water ◽

10.3390/w13091241 ◽

2021 ◽

Vol 13 (9) ◽

pp. 1241

Author(s):

Ming-Hsi Lee ◽

Yenming J. Chen

Keyword(s):

Machine Learning ◽

Markov Chain ◽

Random Field ◽

Long Range ◽

Weather Conditions ◽

Extreme Weather ◽

Prediction Algorithm ◽

Learning Method ◽

Multi Scale ◽

Hybrid Machine

This paper proposes to apply a Markov chain random field conditioning method with a hybrid machine learning method to provide long-range precipitation predictions under increasingly extreme weather conditions. Existing precipitation models are limited in time-span, and long-range simulations cannot predict rainfall distribution for a specific year. This paper proposes a hybrid (ensemble) learning method to perform forecasting on a multi-scaled, conditioned functional time series over a sparse l1 space. Therefore, on the basis of this method, a long-range prediction algorithm is developed for applications, such as agriculture or construction works. Our findings show that the conditioning method and multi-scale decomposition in the parse space l1 are proved useful in resisting statistical variation due to increasingly extreme weather conditions. Because the predictions are year-specific, we verify our prediction accuracy for the year we are interested in, but not for other years.

Download Full-text

A Comparative Study of Supervised Machine Learning Algorithms for the Prediction of Long-Range Chromatin Interactions

Genes ◽

10.3390/genes11090985 ◽

2020 ◽

Vol 11 (9) ◽

pp. 985 ◽

Cited By ~ 2

Author(s):

Thomas Vanhaeren ◽

Federico Divina ◽

Miguel García-Torres ◽

Francisco Gómez-Vela ◽

Wim Vanhoof ◽

...

Keyword(s):

Machine Learning ◽

Transcription Factors ◽

Long Range ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

The Other ◽

Supervised Machine Learning ◽

Chromatin Interaction ◽

Gradient Boosting ◽

Chromatin Interactions

The role of three-dimensional genome organization as a critical regulator of gene expression has become increasingly clear over the last decade. Most of our understanding of this association comes from the study of long range chromatin interaction maps provided by Chromatin Conformation Capture-based techniques, which have greatly improved in recent years. Since these procedures are experimentally laborious and expensive, in silico prediction has emerged as an alternative strategy to generate virtual maps in cell types and conditions for which experimental data of chromatin interactions is not available. Several methods have been based on predictive models trained on one-dimensional (1D) sequencing features, yielding promising results. However, different approaches vary both in the way they model chromatin interactions and in the machine learning-based strategy they rely on, making it challenging to carry out performance comparison of existing methods. In this study, we use publicly available 1D sequencing signals to model cohesin-mediated chromatin interactions in two human cell lines and evaluate the prediction performance of six popular machine learning algorithms: decision trees, random forests, gradient boosting, support vector machines, multi-layer perceptron and deep learning. Our approach accurately predicts long-range interactions and reveals that gradient boosting significantly outperforms the other five methods, yielding accuracies of about 95%. We show that chromatin features in close genomic proximity to the anchors cover most of the predictive information, as has been previously reported. Moreover, we demonstrate that gradient boosting models trained with different subsets of chromatin features, unlike the other methods tested, are able to produce accurate predictions. In this regard, and besides architectural proteins, transcription factors are shown to be highly informative. Our study provides a framework for the systematic prediction of long-range chromatin interactions, identifies gradient boosting as the best suited algorithm for this task and highlights cell-type specific binding of transcription factors at the anchors as important determinants of chromatin wiring mediated by cohesin.

Download Full-text

Incorporating long-range physics in atomic-scale machine learning

The Journal of Chemical Physics ◽

10.1063/1.5128375 ◽

2019 ◽

Vol 151 (20) ◽

pp. 204105 ◽

Cited By ~ 18

Author(s):

Andrea Grisafi ◽

Michele Ceriotti

Keyword(s):

Machine Learning ◽

Long Range ◽

Atomic Scale

Download Full-text

Long-range order imposed by short-range interactions in methylammonium lead iodide: Comparing point-dipole models to machine-learning force fields

Physical Review B ◽

10.1103/physrevb.100.094106 ◽

2019 ◽

Vol 100 (9) ◽

Cited By ~ 4

Author(s):

Jonathan Lahnsteiner ◽

Ryosuke Jinnouchi ◽

Menno Bokdam

Keyword(s):

Machine Learning ◽

Long Range ◽

Short Range ◽

Force Fields ◽

Range Order ◽

Point Dipole ◽

Long Range Order ◽

Lead Iodide ◽

Methylammonium Lead Iodide

Download Full-text

On the challenges of predicting microscopic dynamics of online conversations

Applied Network Science ◽

10.1007/s41109-021-00357-8 ◽

2021 ◽

Vol 6 (1) ◽

Author(s):

John Bollenbacher ◽

Diogo Pacheco ◽

Pik-Mai Hui ◽

Yong-Yeol Ahn ◽

Alessandro Flammini ◽

...

Keyword(s):

Machine Learning ◽

Social Media ◽

Long Range ◽

Cyber Security ◽

Learning Algorithms ◽

Generative Models ◽

Machine Learning Algorithms ◽

Macroscopic Structure ◽

Online Conversation ◽

Near Future

AbstractTo what extent can we predict the structure of online conversation trees? We present a generative model to predict the size and evolution of threaded conversations on social media by combining machine learning algorithms. The model is evaluated using datasets that span two topical domains (cryptocurrency and cyber-security) and two platforms (Reddit and Twitter). We show that it is able to predict both macroscopic features of the final trees and near-future microscopic events with moderate accuracy. However, predicting the macroscopic structure of conversations does not guarantee an accurate reconstruction of their microscopic evolution. Our model’s limited performance in long-range predictions highlights the challenges faced by generative models due to the accumulation of errors.

Download Full-text

A comparative study of supervised machine learning algorithms for the prediction of long-range chromatin interactions

10.1101/2020.06.09.141473 ◽

2020 ◽

Author(s):

Thomas Vanhaeren ◽

Federico Divina ◽

Miguel García-Torres ◽

Francisco Gómez-Vela ◽

Wim Vanhoof ◽

...

Keyword(s):

Machine Learning ◽

Transcription Factors ◽

Long Range ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

The Other ◽

Supervised Machine Learning ◽

Chromatin Interaction ◽

Gradient Boosting ◽

Chromatin Interactions

AbstractThe role of three-dimensional genome organization as a critical regulator of gene expression has become increasingly clear over the last decade. Most of our understanding of this association comes from the study of long range chromatin interaction maps provided by Chromatin Conformation Capture-based techniques, which have greatly improved in recent years. Since these procedures are experimentally laborious and expensive, in silico prediction has emerged as an alternative strategy to generate virtual maps in cell types and conditions for which experimental data of chromatin interactions is not available. Several methods have been based on predictive models trained on one-dimensional (1D) sequencing features, yielding promising results. However, different approaches vary both in the way they model chromatin interactions and in the machine learning-based strategy they rely on, making it challenging to carry out performance comparison of existing methods. In this study, we use publicly available 1D sequencing signals to model chromatin interactions in two human cell lines and evaluate the prediction performance of 5 popular machine learning algorithms: decision trees, random forests, gradient boosting, support vector machines and multi-layer perceptron. Our approach accurately predicts long-range interactions and reveals that gradient boosting significantly outperforms the other four algorithms, yielding accuracies of ~ 95%. We show that chromatin features in close genomic proximity to the anchors cover most of the predictive information. Moreover, we demonstrate that gradient boosting models trained with different subsets of chromatin features, unlike the other methods tested, are able to produce accurate predictions. In this regard, and besides architectural proteins, transcription factors are shown to be highly informative. Our study provides a framework for the systematic prediction of long-range chromatin interactions, identifies gradient boosting as the best suited algorithm for this task and highlights cell-type specific binding of transcription factors at the anchors as important determinants of chromatin wiring.

Download Full-text

Using Sound to Synthesize Covalent Organic Frameworks in Water

10.26434/chemrxiv.14781546.v1 ◽

2021 ◽

Author(s):

Wei Zhao ◽

Peiyao Yan ◽

Haofan Yang ◽

Mounib Bahri ◽

Alex James ◽

...

Keyword(s):

Acetic Acid ◽

Catalytic Performance ◽

Aqueous Acetic Acid ◽

Sonochemical Method ◽

Covalent Organic Frameworks ◽

Food Grade ◽

Sustainable Solvents ◽

Solvothermal Methods ◽

Solvothermal Conditions ◽

Better Than

Most covalent organic frameworks (COFs) are synthesized using solvothermal conditions (>120 °C, >72 h) in harmful organic solvents. We report a strategy for rapidly synthesizing imine-linked COFs (< 60 min) in aqueous acetic acid using sonochemistry, avoiding most of the downsides of solvothermal methods. We first synthesized seven known COFs using this method and obtained crystallinity and porosity comparable to or better than materials from previously reported solvothermal routes. This sonochemical method even works in highly sustainable solvents, such as food-grade vinegar. The generality of the method was demonstrated by preparing two unreported COFs. Finally, we showed that one sonochemical COF is an excellent photocatalyst for sacrificial hydrogen evolution from water with a more sustained catalytic performance than its solvothermal analog. The speed, ease and generality of this sonochemical method with no sacrifice in material quality makes it an enabling methodology for rapid discovery of new functional COF materials.

Download Full-text

DROUGHT FORECASTING BASED ON MACHINE LEARNING OF REMOTE SENSING AND LONG-RANGE FORECAST DATA

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xli-b8-157-2016 ◽

2016 ◽

Vol XLI-B8 ◽

pp. 157-158

Author(s):

J. Rhee ◽

J. Im ◽

S. Park

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Long Range ◽

Initial Conditions ◽

Interpolation Method ◽

Spline Interpolation ◽

Standardized Precipitation Index ◽

Learning Models ◽

Drought Forecasting ◽

Machine Learning Models

The reduction of drought impacts may be achieved through sustainable drought management and proactive measures against drought disaster. Accurate and timely provision of drought information is essential. In this study, drought forecasting models to provide high-resolution drought information based on drought indicators for ungauged areas were developed. The developed models predict drought indices of the 6-month Standardized Precipitation Index (SPI6) and the 6-month Standardized Precipitation Evapotranspiration Index (SPEI6). An interpolation method based on multiquadric spline interpolation method as well as three machine learning models were tested. Three machine learning models of Decision Tree, Random Forest, and Extremely Randomized Trees were tested to enhance the provision of drought initial conditions based on remote sensing data, since initial conditions is one of the most important factors for drought forecasting. Machine learning-based methods performed better than interpolation methods for both classification and regression, and the methods using climatology data outperformed the methods using long-range forecast. The model based on climatological data and the machine learning method outperformed overall.

Download Full-text

Predicting of Covalent Organic Frameworks for Membrane-based Isobutene/1,3-Butadiene Separation: Combining Molecular Simulation and Machine Learning

Chemical Research in Chinese Universities ◽

10.1007/s40242-022-1452-z ◽

2022 ◽

Author(s):

Xiaohao Cao ◽

Yanjing He ◽

Zhengqing Zhang ◽

Yuxiu Sun ◽

Qi Han ◽

...

Keyword(s):

Machine Learning ◽

Molecular Simulation ◽

Covalent Organic Frameworks

Download Full-text

Machine-Learning of Long-Range Sound Propagation Through Simulated Atmospheric Turbulence

10.21079/11681/41182 ◽

2021 ◽

Author(s):

Carl R. Hart ◽

D. Keith Wilson ◽

Chris L. Pettit ◽

Edward T. Nykaza

Keyword(s):

Machine Learning ◽

Long Range ◽

Sound Propagation ◽

Transmission Loss ◽

Absolute Error ◽

Surrogate Data ◽

Machine Learning Algorithms ◽

Single Realization ◽

Time Requirements ◽

Propagation Experiment

Conventional numerical methods can capture the inherent variability of long-range outdoor sound propagation. However, computational memory and time requirements are high. In contrast, machine-learning models provide very fast predictions. This comes by learning from experimental observations or surrogate data. Yet, it is unknown what type of surrogate data is most suitable for machine-learning. This study used a Crank-Nicholson parabolic equation (CNPE) for generating the surrogate data. The CNPE input data were sampled by the Latin hypercube technique. Two separate datasets comprised 5000 samples of model input. The ﬁrst dataset consisted of transmission loss (TL) ﬁelds for single realizations of turbulence. The second dataset consisted of average TL ﬁelds for 64 realizations of turbulence. Three machine-learning algorithms were applied to each dataset, namely, ensemble decision trees, neural networks, and cluster-weighted models. Observational data come from a long-range (out to 8 km) sound propagation experiment. In comparison to the experimental observations, regression predictions have 5–7 dB in median absolute error. Surrogate data quality depends on an accurate characterization of refractive and scattering conditions. Predictions obtained through a single realization of turbulence agree better with the experimental observations.

Download Full-text