efficient exploration Latest Research Papers

MagCluster: a Tool for Identification, Annotation, and Visualization of Magnetosome Gene Clusters

Microbiology Resource Announcements ◽

10.1128/mra.01031-21 ◽

2022 ◽

Author(s):

Runjia Ji ◽

Wensi Zhang ◽

Yongxin Pan ◽

Wei Lin

Keyword(s):

Large Scale ◽

Genomic Data ◽

Gene Clusters ◽

Magnetotactic Bacteria ◽

Evolutionary Origin ◽

Organelle Biogenesis ◽

Efficient Exploration

Magnetosome gene clusters (MGCs), which are responsible for magnetosome biosynthesis and organization in magnetotactic bacteria (MTB), are the key to deciphering the mechanisms and evolutionary origin of magnetoreception, organelle biogenesis, and intracellular biomineralization in bacteria. Here, we report the development of MagCluster, a Python stand-alone tool for efficient exploration of MGCs from large-scale (meta)genomic data.

End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation

Computational Intelligence and Neuroscience ◽

10.1155/2021/9945044 ◽

2021 ◽

Vol 2021 ◽

pp. 1-15

Author(s):

Xiaogang Ruan ◽

Peng Li ◽

Xiaoqing Zhu ◽

Hejie Yu ◽

Naigong Yu

Keyword(s):

Reinforcement Learning ◽

Intrinsic Motivation ◽

Driving Forces ◽

Temporal Distance ◽

Training Methods ◽

Complex Environments ◽

Learning Problem ◽

Autonomous Exploration ◽

Exploration Behavior ◽

Efficient Exploration

Developing artificial intelligence (AI) agents that explore efficiently in visually rich and complex environments is challenging. In this study, we formulate the exploration question as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. Such intrinsic motivation is driven by curiosity and is calculated based on episode memory. To distribute the intrinsic motivation, we use a count-based method and temporal distance to generate it synchronously. We tested our approach in 3D maze-like environments and validated its performance in exploration tasks through extensive experiments. The experimental results show that our agent can learn exploration ability from raw sensory input and accomplish autonomous exploration across different mazes. In addition, the learned policy is not biased by stochastic objects. We also analyze the effects of different training methods and driving forces on exploration policy.
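As a rough illustration of the count-based idea mentioned in this abstract, the sketch below pays a novelty bonus that shrinks as a state is revisited. It is a generic textbook-style bonus, not the authors' method: the class name `CountBonus`, the `beta` scale, and the `beta / sqrt(N(s))` form are assumptions made for illustration, and the temporal-distance and episodic-memory components described in the paper are not modeled.

```python
import math
from collections import Counter


class CountBonus:
    """Hypothetical count-based novelty bonus: beta / sqrt(N(s))."""

    def __init__(self, beta=1.0):
        self.beta = beta          # assumed scale of the intrinsic reward
        self.counts = Counter()   # visit counts per (hashable) state

    def reward(self, state):
        # Record the visit first, then pay a bonus that decays with the count.
        self.counts[state] += 1
        return self.beta / math.sqrt(self.counts[state])


bonus = CountBonus(beta=1.0)
first = bonus.reward((0, 0))    # first visit to this grid cell
second = bonus.reward((0, 0))   # revisit: smaller bonus
novel = bonus.reward((0, 1))    # a new cell earns the full bonus again
```

In a reinforcement learning loop, such a bonus would typically be added to the environment reward so that rarely visited states look more attractive to the agent.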

Non-equilibrium criticality and efficient exploration of glassy landscapes with memory dynamics

Physica A Statistical Mechanics and its Applications ◽

10.1016/j.physa.2021.126727 ◽

2021 ◽

pp. 126727

Author(s):

Yan Ru Pei ◽

Massimiliano Di Ventra

Keyword(s):

Efficient Exploration ◽

With Memory ◽

Non Equilibrium

Bayesian optimization with adaptive surrogate models for automated experimental design

npj Computational Materials ◽

10.1038/s41524-021-00662-x ◽

2021 ◽

Vol 7 (1) ◽

Author(s):

Bowen Lei ◽

Tanner Quinn Kirk ◽

Anirban Bhattacharya ◽

Debdeep Pati ◽

Xiaoning Qian ◽

...

Keyword(s):

Experimental Design ◽

Design Space ◽

Materials Science ◽

Scientific Discovery ◽

Surrogate Models ◽

Multivariate Adaptive Regression Splines ◽

Bayesian Optimization ◽

Objective Functions ◽

Additive Regression ◽

Efficient Exploration

Bayesian optimization (BO) is an indispensable tool to optimize objective functions that either do not have known functional forms or are expensive to evaluate. Currently, optimal experimental design is always conducted within the workflow of BO leading to more efficient exploration of the design space compared to traditional strategies. This can have a significant impact on modern scientific discovery, in particular autonomous materials discovery, which can be viewed as an optimization problem aimed at looking for the maximum (or minimum) point for the desired materials properties. The performance of BO-based experimental design depends not only on the adopted acquisition function but also on the surrogate models that help to approximate underlying objective functions. In this paper, we propose a fully autonomous experimental design framework that uses more adaptive and flexible Bayesian surrogate models in a BO procedure, namely Bayesian multivariate adaptive regression splines and Bayesian additive regression trees. They can overcome the weaknesses of widely used Gaussian process-based methods when faced with relatively high-dimensional design space or non-smooth patterns of objective functions. Both simulation studies and real-world materials science case studies demonstrate their enhanced search efficiency and robustness.
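The abstract does not say which acquisition function the authors adopt. As one common example, the sketch below computes expected improvement (EI) for a maximization problem from a surrogate's predictive mean and standard deviation. The function name `expected_improvement` and its arguments are illustrative assumptions; any surrogate that supplies a Gaussian predictive mean and standard deviation could feed it.

```python
import math


def expected_improvement(mu, sigma, best):
    """EI for maximization under a Gaussian predictive distribution.

    mu, sigma: surrogate predictive mean and standard deviation at a candidate.
    best: best objective value observed so far.
    """
    if sigma <= 0.0:
        # No predictive uncertainty: improvement is deterministic.
        return max(mu - best, 0.0)
    z = (mu - best) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)  # standard normal density
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))         # standard normal CDF
    return (mu - best) * cdf + sigma * pdf


# A candidate predicted to equal the incumbent still has positive EI
# because of its uncertainty.
ei_uncertain = expected_improvement(mu=0.0, sigma=1.0, best=0.0)
ei_certain = expected_improvement(mu=0.0, sigma=0.0, best=0.0)
```

The next experiment would usually be the candidate with the largest EI, which balances a high predicted value against high uncertainty.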

Good Vibrations: Calculating Excited State Frequencies Using Ground State Self-Consistent Field Models

10.26434/chemrxiv-2021-cch5r ◽

2021 ◽

Author(s):

Ali Abou Taka ◽

Hector Corzo ◽

Aurora Pribram-Jones ◽

Hrant Hratchian

Keyword(s):

Excited States ◽

Density Functional ◽

Photoelectron Spectra ◽

Configuration Interaction Method ◽

Functional Theory ◽

Spin Contamination ◽

Purification Technique ◽

Self Consistent Field ◽

Efficient Exploration ◽

Scf Calculations

ΔSCF methods have proven to be reliable computational tools for the assignment and interpretation of photoelectron spectra of isolated molecules. These results have increased the interest in ΔSCF techniques for electronic excited states based on improved algorithms that prevent convergence to ground states. In this work, one of these ΔSCF improved algorithms is studied to demonstrate its ability to explore the molecular properties for excited states. Results from ΔSCF calculations for a set of representative molecules are compared with results obtained using time-dependent density functional theory and single substitution configuration interaction method. For the ΔSCF calculations, the efficacy of a spin-purification technique is explored to remedy some of the spin-contamination presented in some of the SCF solutions. The obtained results suggest that the proposed projection-based SCF scheme, in many cases, alleviates the spin-contamination present in the SCF single determinants, and provides a computational alternative for the efficient exploration of the vibrational properties of excited-state molecules.

Replacing Chemical Intuition by Machine Learning: a Mixed Design of Experiments - Reinforcement Learning Approach to the Construction of Training Sets for Model Hamiltonians

10.26434/chemrxiv-2021-6v4n0 ◽

2021 ◽

Author(s):

Ruben Staub ◽

Stephan Steinmann

Keyword(s):

Machine Learning ◽

Design Of Experiments ◽

Computational Cost ◽

Geometry Optimization ◽

Training Set ◽

Geometric Patterns ◽

Co Oxidation Reaction ◽

Model Hamiltonian ◽

Efficient Exploration ◽

Model Hamiltonians

Model Hamiltonians based on the so-called cluster expansion (CE), which consist of a linear fit of parameters corresponding to geometric patterns, provide an efficient and rigorous means to quickly evaluate the energy of diverse arrangements of adsorbate mixtures on reactive surfaces as typically relevant for heterogeneous catalysis. However, establishing the model Hamiltonian is a tedious task, requiring the construction and optimization of many geometries. Today, most of these geometries are constructed by hand, based on chemical intuition or random choices. Hence, the quality of the training set is unlikely to be optimal and its construction is not reproducible. Herein, we propose a reformulation of the construction of the training set as a strategy-based game, aiming at an efficient exploration of the relevant patterns constituting the model Hamiltonian. Based on this reformulation, we exploit a typical active learning solution for machine-learning such a strategy game: an upper confidence tree (UCT) based framework. However, in contrast to standard games, evaluating the true score is computationally expensive, as it requires a costly geometry optimization. Hence, we augment the UCT with a pre-exploration step inspired by the variance-based Design of Experiments (DoE) methods. This novel mixed UCT+DoE framework allows the automatic construction of a well-adapted training set, minimizing computational cost and user intervention. As a proof of principle, we apply our UCT+DoE approach to the CO oxidation reaction on Pd(111), for which a relevant model Hamiltonian has been established previously. The results demonstrate the effectiveness of the custom built UCT and its significant benefits on a DoE-based approach.
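For readers unfamiliar with upper confidence trees, the sketch below shows the standard UCB1 selection rule that UCT applies at each tree node. It is a generic illustration, not the authors' custom UCT or their DoE pre-exploration step; the function names, the `(total_value, visits)` tuple layout, and the default exploration constant are assumptions.

```python
import math


def ucb1(total_value, visits, parent_visits, c=math.sqrt(2)):
    """Standard UCB1 score: mean value plus an exploration term."""
    if visits == 0:
        # Unvisited children are tried first.
        return math.inf
    mean = total_value / visits
    return mean + c * math.sqrt(math.log(parent_visits) / visits)


def select_child(children, parent_visits):
    """Pick the child key with the highest UCB1 score.

    children maps a (hypothetical) child label to (total_value, visits).
    """
    return max(children, key=lambda k: ucb1(*children[k], parent_visits))


# A rarely visited child can outrank a well-explored one with a lower mean.
explored = {"a": (5.0, 10), "b": (1.0, 1)}
choice = select_child(explored, parent_visits=11)

# Any unvisited child takes priority.
with_new = {"a": (5.0, 10), "b": (1.0, 1), "c": (0.0, 0)}
choice_new = select_child(with_new, parent_visits=11)
```

The exploration term is what lets the search keep sampling under-visited branches while an expensive score evaluation, such as a geometry optimization, limits how many nodes can be scored.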

Efficient Exploration System to Discover the Next Generation of Massively Connecting Internet of Things Devices

10.23919/icmu50196.2021.9638797 ◽

2021 ◽

Author(s):

Kentaro Tanaka ◽

Hidekazu Suzuki

Keyword(s):

Internet Of Things ◽

Next Generation ◽

Efficient Exploration

Custom ML Module of AIDrugApp for Molecular Identification, Descriptor Calculation, and Building ML/DL QSAR Models

10.33774/chemrxiv-2021-3f1f9 ◽

2021 ◽

Author(s):

Divya Karade

Keyword(s):

Molecular Identification ◽

Chemical Space ◽

Biological Properties ◽

Free Access ◽

The Novel ◽

Discovery Research ◽

Drug Discovery Research ◽

Qsar Models ◽

Efficient Exploration ◽

Tools And Techniques

Computer-aided drug design (CADD) techniques continue to struggle to provide a useful advance in the area of drug development due to the difficulties in an efficient exploration of the vast drug-like chemical space to uncover new chemical compounds with desired biological properties. Other challenges that users must overcome in order to fully use the potential of CADD tools and techniques include a lack of completely autonomous methods, the necessity for retraining even after deployment, and their lack of interpretability. To solve this issue, we created the ‘Custom ML Tools’ integrated within the framework of ‘AIDrugAPP’. ‘Custom ML Tools’ includes four modules: ‘Mol Identifier’, ‘DesCal’, ‘AutoDL’, and ‘Auto-Multi-ML’ which give users free access to molecular identification using SMILES and compound names, similarity search, descriptor calculation, the building of ML/DL QSAR models, and their usage in predicting new data. The study demonstrates the potential of the novel tool for computational investigations in drug discovery research. The WebApp with its modules has therefore been made available for public use at: https://sars-covid-app.herokuapp.com/

FrAnTK: A Frequency-based Analysis ToolKit for efficient exploration of allele sharing patterns in present-day and ancient genomic datasets

G3 Genes|Genome|Genetics ◽

10.1093/g3journal/jkab357 ◽

2021 ◽

Author(s):

J Víctor Moreno-Mayar

Keyword(s):

Large Scale ◽

Sequencing Data ◽

Scale Population ◽

Genomic Studies ◽

Minimal Data ◽

Efficient Exploration ◽

Sharing Patterns ◽

User Friendly ◽

Fast Memory ◽

Memory Efficient

Present-day and ancient population genomic studies from different study organisms have rapidly become accessible to diverse research groups worldwide. Unfortunately, as datasets and analyses become more complex, researchers with less computational experience often miss their chance to analyse their own data. We introduce FrAnTK, a user-friendly toolkit for computation and visualisation of allele frequency-based statistics in ancient and present-day genome variation datasets. We provide fast, memory-efficient tools that allow the user to go from sequencing data to complex exploratory analyses and visual representations with minimal data manipulation. Its simple usage and low computational requirements make FrAnTK ideal for users that are less familiar with computer programming carrying out large-scale population studies.

DIPEND: An Open-Source Pipeline to Generate Ensembles of Disordered Segments Using Neighbor-Dependent Backbone Preferences

Biomolecules ◽

10.3390/biom11101505 ◽

2021 ◽

Vol 11 (10) ◽

pp. 1505

Author(s):

Zita Harmat ◽

Dániel Dudola ◽

Zoltán Gáspári

Keyword(s):

Experimental Data ◽

Open Source ◽

A Priori ◽

Conformational Space ◽

Basic Premise ◽

Intrinsically Disordered ◽

Intrinsically Disordered Regions ◽

Conformational Preferences ◽

Flexible Protein ◽

Efficient Exploration

Ensemble-based structural modeling of flexible protein segments such as intrinsically disordered regions is a complex task often solved by selection of conformers from an initial pool based on their conformity to experimental data. However, the properties of the conformational pool are crucial, as the sampling of the conformational space should be sufficient and, in the optimal case, relatively uniform. In other words, the ideal sampling is both efficient and exhaustive. To achieve this, specialized tools are usually necessary, which might not be maintained in the long term, available on all platforms or flexible enough to be tweaked to individual needs. Here, we present an open-source and extendable pipeline to generate initial protein structure pools for use with selection-based tools to obtain ensemble models of flexible protein segments. Our method is implemented in Python and uses ChimeraX, Scwrl4, Gromacs and neighbor-dependent backbone distributions compiled and published previously by the Dunbrack lab. All these tools and data are publicly available and maintained. Our basic premise is that by using residue-specific, neighbor-dependent Ramachandran distributions, we can enhance the efficient exploration of the relevant region of the conformational space. We have also provided a straightforward way to bias the sampling towards specific conformations for selected residues by combining different conformational distributions. This allows the consideration of a priori known conformational preferences such as in the case of preformed structural elements. The open-source and modular nature of the pipeline allows easy adaptation for specific problems. We tested the pipeline on an intrinsically disordered segment of the protein Cd3ϵ and also a single-alpha helical (SAH) region by generating conformational pools and selecting ensembles matching experimental data using the CoNSEnsX+ server.
