scholarly journals Bioactivity descriptors for uncharacterized chemical compounds

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Martino Bertoni ◽  
Miquel Duran-Frigola ◽  
Pau Badia-i-Mompel ◽  
Eduardo Pauls ◽  
Modesto Orozco-Ruiz ◽  
...  

AbstractChemical descriptors encode the physicochemical and structural properties of small molecules, and they are at the core of chemoinformatics. The broad release of bioactivity data has prompted enriched representations of compounds, reaching beyond chemical structures and capturing their known biological properties. Unfortunately, bioactivity descriptors are not available for most small molecules, which limits their applicability to a few thousand well characterized compounds. Here we present a collection of deep neural networks able to infer bioactivity signatures for any compound of interest, even when little or no experimental information is available for them. Our signaturizers relate to bioactivities of 25 different types (including target profiles, cellular response and clinical outcomes) and can be used as drop-in replacements for chemical descriptors in day-to-day chemoinformatics tasks. Indeed, we illustrate how inferred bioactivity signatures are useful to navigate the chemical space in a biologically relevant manner, unveiling higher-order organization in natural product collections, and to enrich mostly uncharacterized chemical libraries for activity against the drug-orphan target Snail1. Moreover, we implement a battery of signature-activity relationship (SigAR) models and show a substantial improvement in performance, with respect to chemistry-based classifiers, across a series of biophysics and physiology activity prediction benchmarks.

2020 ◽  
Author(s):  
Martino Bertoni ◽  
Miquel Duran-Frigola ◽  
Pau Badia-i-Mompel ◽  
Eduardo Pauls ◽  
Modesto Orozco-Ruiz ◽  
...  

AbstractChemical descriptors encode the physicochemical and structural properties of small molecules, and they are at the core of chemoinformatics. The broad release of bioactivity data has prompted enriched representations of compounds, reaching beyond chemical structures and capturing their known biological properties. Unfortunately, ‘bioactivity descriptors’ are not available for most small molecules, which limits their applicability to a few thousand well characterized compounds. Here we present a collection of deep neural networks able to infer bioactivity signatures for any compound of interest, even when little or no experimental information is available for them. Our ‘signaturizers’ relate to bioactivities of 25 different types (including target profiles, cellular response and clinical outcomes) and can be used as drop-in replacements for chemical descriptors in day-to-day chemoinformatics tasks. Indeed, we illustrate how inferred bioactivity signatures are useful to navigate the chemical space in a biologically relevant manner, unveiling higher-order organization in natural product collections, and to enrich mostly uncharacterized chemical libraries for activity against the drug-orphan target Snail1. Moreover, we implement a battery of signature-activity relationship (SigAR) models and show a substantial improvement in performance, with respect to chemistry-based classifiers, across a series of biophysics and physiology activity prediction benchmarks.


2021 ◽  
Author(s):  
Michael A. Skinnider ◽  
R. Greg Stacey ◽  
David S. Wishart ◽  
Leonard J. Foster

Deep generative models are powerful tools for the exploration of chemical space, enabling the on-demand gener- ation of molecules with desired physical, chemical, or biological properties. However, these models are typically thought to require training datasets comprising hundreds of thousands, or even millions, of molecules. This per- ception limits the application of deep generative models in regions of chemical space populated by only a small number of examples. Here, we systematically evaluate and optimize generative models of molecules for low-data settings. We carry out a series of systematic benchmarks, training more than 5,000 deep generative models and evaluating over 2.6 billion generated molecules. We find that robust models can be learned from far fewer examples than has been widely assumed. We further identify strategies that dramatically reduce the number of molecules required to learn a model of equivalent quality, and demonstrate the application of these principles by learning models of chemical structures found in bacterial, plant, and fungal metabolomes. The structure of our experiments also allows us to benchmark the metrics used to evaluate generative models themselves. We find that many of the most widely used metrics in the field fail to capture model quality, but identify a subset of well-behaved metrics that provide a sound basis for model development. Collectively, our work provides a foundation for directly learning generative models in sparsely populated regions of chemical space.


2018 ◽  
Vol 4 (5) ◽  
Author(s):  
Fernanda I. Saldívar-González ◽  
B. Angélica Pilón-Jiménez ◽  
José L. Medina-Franco

AbstractThe chemical space of naturally occurring compounds is vast and diverse. Other than biologics, naturally occurring small molecules include a large variety of compounds covering natural products from different sources such as plant, marine, and fungi, to name a few, and several food chemicals. The systematic exploration of the chemical space of naturally occurring compounds have significant implications in many areas of research including but not limited to drug discovery, nutrition, bio- and chemical diversity analysis. The exploration of the coverage and diversity of the chemical space of compound databases can be carried out in different ways. The approach will largely depend on the criteria to define the chemical space that is commonly selected based on the goals of the study. This chapter discusses major compound databases of natural products and cheminformatics strategies that have been used to characterize the chemical space of natural products. Recent exemplary studies of the chemical space of natural products from different sources and their relationships with other compounds are also discussed. We also present novel chemical descriptors and data mining approaches that are emerging to characterize the chemical space of naturally occurring compounds.


2020 ◽  
Vol 49 (D1) ◽  
pp. D1170-D1178
Author(s):  
Tianbiao Yang ◽  
Zhaojun Li ◽  
Yingjia Chen ◽  
Dan Feng ◽  
Guangchao Wang ◽  
...  

Abstract One of the most prominent topics in drug discovery is efficient exploration of the vast drug-like chemical space to find synthesizable and novel chemical structures with desired biological properties. To address this challenge, we created the DrugSpaceX (https://drugspacex.simm.ac.cn/) database based on expert-defined transformations of approved drug molecules. The current version of DrugSpaceX contains >100 million transformed chemical products for virtual screening, with outstanding characteristics in terms of structural novelty, diversity and large three-dimensional chemical space coverage. To illustrate its practical application in drug discovery, we used a case study of discoidin domain receptor 1 (DDR1), a kinase target implicated in fibrosis and other diseases, to show DrugSpaceX performing a quick search of initial hit compounds. Additionally, for ligand identification and optimization purposes, DrugSpaceX also provides several subsets for download, including a 10% diversity subset, an extended drug-like subset, a drug-like subset, a lead-like subset, and a fragment-like subset. In addition to chemical properties and transformation instructions, DrugSpaceX can locate the position of transformation, which will enable medicinal chemists to easily integrate strategy planning and protection design.


Synthesis ◽  
2018 ◽  
Vol 51 (01) ◽  
pp. 55-66 ◽  
Author(s):  
George Karageorgis ◽  
Herbert Waldmann

Biology-oriented-synthesis (BIOS), is a chemocentric approach to identifying structurally novel molecules as tools for chemical biology and medicinal chemistry research. The vast chemical space cannot be exhaustively covered by synthetic chemistry. Thus, methods which reveal biologically relevant portions of chemical space are of high value. Guided by structural conservation in the evolution of both proteins and natural products, BIOS classifies bioactive compound classes in a hierarchical manner based on molecular architecture and bioactivity. Biologically relevant scaffolds inspire and guide the synthesis of BIOS libraries, which calls for the development of suitable synthetic methodologies. These compound collections have enriched biological relevance, leading to the discovery of bioactive small molecules. These potent and selective modulators allow the study of complex biological pathways and may serve as starting points for drug discovery programs. Thus, BIOS can also be regarded as a hypothesis-generating tool, guiding the design and preparation of novel, bioactive molecular scaffolds. This review elaborates the principles of BIOS and highlights selected examples of their application, which have in turn created future opportunities for the expansion of BIOS and its combination with fragment-based compound discovery for the identification of biologically relevant small molecules with unprecedented molecular scaffolds.1 Introduction2 Structural Classification of Natural Products3 Implications and Opportunities for Biology-Oriented Synthesis4 Applications of Biology-Oriented Synthesis4.1 Chemical Structure and Bioactivity Guided Approaches4.2 Natural-Product-Derived Fragment-Based Approaches5 Conclusions and Outlook


2021 ◽  
Author(s):  
Michael A. Skinnider ◽  
R. Greg Stacey ◽  
David S. Wishart ◽  
Leonard J. Foster

Deep generative models are powerful tools for the exploration of chemical space, enabling the on-demand gener- ation of molecules with desired physical, chemical, or biological properties. However, these models are typically thought to require training datasets comprising hundreds of thousands, or even millions, of molecules. This per- ception limits the application of deep generative models in regions of chemical space populated by only a small number of examples. Here, we systematically evaluate and optimize generative models of molecules for low-data settings. We carry out a series of systematic benchmarks, training more than 5,000 deep generative models and evaluating over 2.6 billion generated molecules. We find that robust models can be learned from far fewer examples than has been widely assumed. We further identify strategies that dramatically reduce the number of molecules required to learn a model of equivalent quality, and demonstrate the application of these principles by learning models of chemical structures found in bacterial, plant, and fungal metabolomes. The structure of our experiments also allows us to benchmark the metrics used to evaluate generative models themselves. We find that many of the most widely used metrics in the field fail to capture model quality, but identify a subset of well-behaved metrics that provide a sound basis for model development. Collectively, our work provides a foundation for directly learning generative models in sparsely populated regions of chemical space.


2017 ◽  
Vol 68 (2) ◽  
pp. 317-322
Author(s):  
Anca Mihaela Mocanu ◽  
Constantin Luca ◽  
Alina Costina Luca

The purpose of this research is to synthetize, characterize and thermal degradation of new heterolytic derivates with potential biological properties. The derivates synthesis was done by obtaining new molecules with pyralozone structure which combine two pharmacophore entities: the amidosulfonyl-R1,R2 phenoxyacetil with the 3,5-dimethyl pyrazole which can have potential biological properties. The synthesis stages of the new products are presented as well as the elemental analysis data and IR, 1H-NMR spectral measurements made for elucidating the chemical structures and thermostability study which makes evident the temperature range proper for their use and storage. The obtained results were indicative of a good correlation of the structure with the thermal stability as estimated by means of the initial degradation temperatures as well as with the degradation mechanism by means of the TG-FTIR analysis.


2019 ◽  
Vol 14 (2) ◽  
pp. 93-116 ◽  
Author(s):  
Shabnam Mohebbi ◽  
Mojtaba Nasiri Nezhad ◽  
Payam Zarrintaj ◽  
Seyed Hassan Jafari ◽  
Saman Seyed Gholizadeh ◽  
...  

Biomedical engineering seeks to enhance the quality of life by developing advanced materials and technologies. Chitosan-based biomaterials have attracted significant attention because of having unique chemical structures with desired biocompatibility and biodegradability, which play different roles in membranes, sponges and scaffolds, along with promising biological properties such as biocompatibility, biodegradability and non-toxicity. Therefore, chitosan derivatives have been widely used in a vast variety of uses, chiefly pharmaceuticals and biomedical engineering. It is attempted here to draw a comprehensive overview of chitosan emerging applications in medicine, tissue engineering, drug delivery, gene therapy, cancer therapy, ophthalmology, dentistry, bio-imaging, bio-sensing and diagnosis. The use of Stem Cells (SCs) has given an interesting feature to the use of chitosan so that regenerative medicine and therapeutic methods have benefited from chitosan-based platforms. Plenty of the most recent discussions with stimulating ideas in this field are covered that could hopefully serve as hints for more developed works in biomedical engineering.


2015 ◽  
Vol 52 (1) ◽  
pp. 76-80 ◽  
Author(s):  
Fábio Vieira TEIXEIRA ◽  
Paulo Gustavo KOTZE ◽  
Aderson Omar Mourão Cintra DAMIÃO ◽  
Sender Jankiel MISZPUTEN

ABSTRACT Biosimilars are not generic drugs. These are more complex medications than small molecules, with identical chemical structures of monoclonal antibodies that lost their patency over time. Besides identical to the original product at the end, the process of achieving its final forms differs from the one used in the reference products. These differences in the formulation process can alter final outcomes such as safety and efficacy of the drugs. Recently, a biosimilar of Infliximab was approved in some countries, even to the management of inflammatory bowel diseases. However, this decision was based on studies performed in rheumatologic conditions such as rheumatoid arthritis and ankylosing spondylitis. Extrapolation of the indications from rheumatologic conditions was done for Crohn’s disease and ulcerative colitis based on these studies. In this article, the authors explain possible different mechanisms in the pathogenesis between rheumatologic conditions and inflammatory bowel diseases, that can lead to different actions of the medications in different diseases. The authors also alert the gastroenterological community for the problem of extrapolation of indications, and explain in full details the reasons for being care with the use of biosimilars in inflammatory bowel diseases without specific data from trials performed in this scenario.


Molecules ◽  
2021 ◽  
Vol 26 (15) ◽  
pp. 4445
Author(s):  
Tiphaine Wong ◽  
Lorette Brault ◽  
Eric Gasparotto ◽  
Romuald Vallée ◽  
Pierre-Yves Morvan ◽  
...  

Marine polysaccharides are part of the huge seaweeds resources and present many applications for several industries. In order to widen their potential as additives or bioactive compounds, some structural modifications have been studied. Among them, simple hydrophobization reactions have been developed in order to yield to grafted polysaccharides bearing acyl-, aryl-, alkyl-, and alkenyl-groups or fatty acid chains. The resulting polymers are able to present modified physicochemical and/or biological properties of interest in the current pharmaceutical, cosmetics, or food fields. This review covers the chemical structures of the main marine polysaccharides, and then focuses on their structural modifications, and especially on hydrophobization reactions mainly esterification, acylation, alkylation, amidation, or even cross-linking reaction on native hydroxyl-, amine, or carboxylic acid functions. Finally, the question of the necessary requirement for more sustainable processes around these structural modulations of marine polysaccharides is addressed, considering the development of greener technologies applied to traditional polysaccharides.


Sign in / Sign up

Export Citation Format

Share Document