chemical databases
Recently Published Documents


TOTAL DOCUMENTS

130
(FIVE YEARS 28)

H-INDEX

21
(FIVE YEARS 1)

Author(s):  
Gergely Takács ◽  
Márk Sándor ◽  
Zoltán Szalai ◽  
Róbert Kiss ◽  
György T. Balogh

AbstractPhysicochemical properties are fundamental to predict the pharmacokinetic and pharmacodynamic behavior of drug candidates. Easily calculated descriptors such as molecular weight and logP have been found to correlate with the success rate of clinical trials. These properties have been previously shown to highlight a sweet-spot in the chemical space associated with favorable pharmacokinetics, which is superior against other regions during hit identification and optimization. In this study, we applied self-organizing maps (SOMs) trained on sixteen calculated properties of a subset of known drugs for the analysis of commercially available compound databases, as well as public biological and chemical databases frequently used for drug discovery. Interestingly, several regions of the property space have been identified that are highly overrepresented by commercially available chemical libraries, while we found almost completely unoccupied regions of the maps (commercially neglected chemical space resembling the properties of known drugs). Moreover, these underrepresented portions of the chemical space are compatible with most rigorous property filters applied by the pharma industry in medicinal chemistry optimization programs. Our results suggest that SOMs may be directly utilized in the strategy of library design for drug discovery to sample previously unexplored parts of the chemical space to aim at yet-undruggable targets. Graphic abstract


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Jonathan M. Goodman ◽  
Igor Pletnev ◽  
Paul Thiessen ◽  
Evan Bolton ◽  
Stephen R. Heller

AbstractThe software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integration of large chemical databases. InChI version 1.05 was released in January 2017 and version 1.06 in December 2020. In this paper, we report on the current state of the InChI Software, the details of the improvements in the v1.06 release, and the results of a test of the InChI run on PubChem, a database of more than a hundred million molecules. The upgrade introduces significant new features, including support for pseudo-element atoms and an improved description of polymers. We expect that few, if any, applications using the standard InChI will need to change as a result of the changes in version 1.06. Numerical instability was discovered for 0.002% of this database, and a small number of other molecules were discovered for which the algorithm did not run smoothly. On the basis of PubChem data, we can demonstrate that InChI version 1.05 was 99.996% accurate, and InChI version 1.06 represents a step closer to perfection. Finally, we look forward to future releases and extensions for the InChI Chemical identifier.


2021 ◽  
Author(s):  
Wanutcha Lorpaiboon ◽  
Taweetham Limpanuparb

<p>The exhaustive enumeration of 3D chemical structures based on Z-matrix templates has recently been used in the quantum chemical investigation of constitutional isomers, diastereomers and rotamers. This simple yet powerful initial structure generation approach can apply beyond the investigation of compounds of identical formula by quantum chemical methods. This paper aims to provide a short description of the overall concept followed by a practical tutorial to the approach. </p> <p>· - The four steps required for Z-matrix template-based substitution are template construction, generation of tuples for substitution sites, removal of duplicate tuples and substitution on the template. </p> <p>· -The generated tuples can be used to create chemical identifiers to query compound properties from chemical databases. </p> <p>· - All of these steps are demonstrated in this paper by common model compounds and are very straightforward for an undergraduate audience to reproduce. A comparison of the approach in this tutorial and other options is also discussed.<b></b></p><br>


2021 ◽  
Author(s):  
Wanutcha Lorpaiboon ◽  
Taweetham Limpanuparb

<p>The exhaustive enumeration of 3D chemical structures based on Z-matrix templates has recently been used in the quantum chemical investigation of constitutional isomers, diastereomers and rotamers. This simple yet powerful initial structure generation approach can apply beyond the investigation of compounds of identical formula by quantum chemical methods. This paper aims to provide a short description of the overall concept followed by a practical tutorial to the approach. </p> <p>· - The four steps required for Z-matrix template-based substitution are template construction, generation of tuples for substitution sites, removal of duplicate tuples and substitution on the template. </p> <p>· -The generated tuples can be used to create chemical identifiers to query compound properties from chemical databases. </p> <p>· - All of these steps are demonstrated in this paper by common model compounds and are very straightforward for an undergraduate audience to reproduce. A comparison of the approach in this tutorial and other options is also discussed.<b></b></p><br>


2021 ◽  
Author(s):  
Evan Bolton ◽  
Jonathan M Goodman ◽  
Stephen R Heller ◽  
Igor Pletnev ◽  
Paul Thiessen

Abstract The software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integration of large chemical databases. InChI version 1.05 was released in January 2017 and v. 1.06 in December 2020. In this paper, we report on the current state of InChI Software, the details of the improvements in the v.1.06 release , and the results of a test of the InChI run on PubChem, a database of more than a hundred million molecules. The upgrade introduces significant new features, including support for pseudo-element atoms and an improved description of polymers. We expect that few, if any, applications using the standard InChI will need to change as a result of the changes in version 1.06. Numerical instability was discovered for 0.002 % of this database, and a small number of other molecules were discovered for which the algorithm did not run smoothly. On the basis of PubChem data, we can demonstrate that InChI version 1.05 was 99.996 % accurate, and InChI version 1.06 represents a step closer to perfection. Finally, we look forward to future releases and extensions for the InChI Chemical identifier.


Biomolecules ◽  
2021 ◽  
Vol 11 (3) ◽  
pp. 467
Author(s):  
Oscar Salvador Barrera-Vázquez ◽  
Juan Carlos Gómez-Verjan ◽  
Gil Alfonso Magos-Guerrero

Cellular senescence is a cellular condition that involves significant changes in gene expression and the arrest of cell proliferation. Recently, it has been suggested in experimental models that the elimination of senescent cells with pharmacological methods delays, prevents, and improves multiple adverse outcomes related to age. In this sense, the so-called senoylitic compounds are a class of drugs that selectively eliminates senescent cells (SCs) and that could be used in order to delay such adverse outcomes. Interestingly, the first senolytic drug (navitoclax) was discovered by using chemoinformatic and network analyses. Thus, in the present study, we searched for novel senolytic compounds through the use of chemoinformatic tools (fingerprinting and network pharmacology) over different chemical databases (InflamNat and BIOFACQUIM) coming from natural products (NPs) that have proven to be quite remarkable for drug development. As a result of screening, we obtained three molecules (hinokitiol, preussomerin C, and tanshinone I) that could be considered senolytic compound candidates since they share similarities in structure with senolytic leads (tunicamycin, ginsenoside Rb1, ABT 737, rapamycin, navitoclax, timosaponin A-III, digoxin, roxithromycin, and azithromycin) and targets involved in senescence pathways with potential use in the treatment of age-related diseases.


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Emma L. Schymanski ◽  
Todor Kondić ◽  
Steffen Neumann ◽  
Paul A. Thiessen ◽  
Jian Zhang ◽  
...  

AbstractCompound (or chemical) databases are an invaluable resource for many scientific disciplines. Exposomics researchers need to find and identify relevant chemicals that cover the entirety of potential (chemical and other) exposures over entire lifetimes. This daunting task, with over 100 million chemicals in the largest chemical databases, coupled with broadly acknowledged knowledge gaps in these resources, leaves researchers faced with too much—yet not enough—information at the same time to perform comprehensive exposomics research. Furthermore, the improvements in analytical technologies and computational mass spectrometry workflows coupled with the rapid growth in databases and increasing demand for high throughput “big data” services from the research community present significant challenges for both data hosts and workflow developers. This article explores how to reduce candidate search spaces in non-target small molecule identification workflows, while increasing content usability in the context of environmental and exposomics analyses, so as to profit from the increasing size and information content of large compound databases, while increasing efficiency at the same time. In this article, these methods are explored using PubChem, the NORMAN Network Suspect List Exchange and the in silico fragmentation approach MetFrag. A subset of the PubChem database relevant for exposomics, PubChemLite, is presented as a database resource that can be (and has been) integrated into current workflows for high resolution mass spectrometry. Benchmarking datasets from earlier publications are used to show how experimental knowledge and existing datasets can be used to detect and fill gaps in compound databases to progressively improve large resources such as PubChem, and topic-specific subsets such as PubChemLite. PubChemLite is a living collection, updating as annotation content in PubChem is updated, and exported to allow direct integration into existing workflows such as MetFrag. The source code and files necessary to recreate or adjust this are jointly hosted between the research parties (see data availability statement). This effort shows that enhancing the FAIRness (Findability, Accessibility, Interoperability and Reusability) of open resources can mutually enhance several resources for whole community benefit. The authors explicitly welcome additional community input on ideas for future developments.


Sign in / Sign up

Export Citation Format

Share Document