Global and Local Computational Models for Aqueous Solubility Prediction of Drug-Like Molecules

2004 ◽  
Vol 44 (4) ◽  
pp. 1477-1488 ◽  
Author(s):  
Christel A. S. Bergström ◽  
Carola M. Wassvik ◽  
Ulf Norinder ◽  
Kristina Luthman ◽  
Per Artursson
ChemInform ◽  
2004 ◽  
Vol 35 (39) ◽  
Author(s):  
Christel A. S. Bergstroem ◽  
Carola M. Wassvik ◽  
Ulf Norinder ◽  
Kristina Luthman ◽  
Per Artusson

Author(s):  
Nebojša Pavlović ◽  
Nastasija Milošević ◽  
Maja Đjanić ◽  
Svetlana Goločorbin-Kon ◽  
Bojan Stanimirov ◽  
...  

Background: Urokinase-type plasminogen activator (uPA) system is a crucial pathway for tumor invasion and metastasis. Recently, multiple anticancer effects of quercetin have been described, including inhibitory activity against uPA. However, the clinical use of this flavonoid has been limited due to its low oral bioavailability. Objective: The objectives of the study were to assess the antimetastatic potential of quercetin analogues by analyzing their binding affinity for uPA and to select the compounds with improved pharmacological profiles. Methods: Binding affinities of structural analogues of quercetin to uPA receptor were determined by molecular docking analysis using Molegro Virtual Docker software, and molecular descriptors relevant for estimating pharmacological profile were calculated from ligand structures using computational models. Results: Among 44 quercetin analogues, only one quercetin analogue (3,6,2’,4’,5’-pentahydroxyflavone) was found to possess both higher aqueous solubility and membrane permeability, and a stronger affinity for uPA than quercetin, which makes it the potential lead compound for anticancer drug development. Like quercetin, this compound has five hydroxyl groups but is arranged differently, which contributes to the higher aqueous solubility and higher amphiphilic moment compared to quercetin. Since membrane permeability is not recognized as the limiting factor for quercetin absorption, analogues with higher aqueous solubility and retained or stronger uPA inhibitory activity should also be further experimentally validated for potential therapeutic use. Conclusion: Identified quercetin analogues with better physicochemical and pharmacological properties have a high potential to succeed in later stages of research in biological systems as potential anticancer agents with antimetastatic activity.


Cells ◽  
2020 ◽  
Vol 9 (4) ◽  
pp. 881 ◽  
Author(s):  
Jihwan Ha ◽  
Chihyun Park ◽  
Chanyoung Park ◽  
Sanghyun Park

The identification of potential microRNA (miRNA)-disease associations enables the elucidation of the pathogenesis of complex human diseases owing to the crucial role of miRNAs in various biologic processes and it yields insights into novel prognostic markers. In the consideration of the time and costs involved in wet experiments, computational models for finding novel miRNA-disease associations would be a great alternative. However, computational models, to date, are biased towards known miRNA-disease associations; this is not suitable for rare miRNAs (i.e., miRNAs with a few known disease associations) and uncommon diseases (i.e., diseases with a few known miRNA associations). This leads to poor prediction accuracies. The most straightforward way of improving the performance is by increasing the number of known miRNA-disease associations. However, due to lack of information, increasing attention has been paid to developing computational models that can handle insufficient data via a technical approach. In this paper, we present a general framework—improved prediction of miRNA-disease associations (IMDN)—based on matrix completion with network regularization to discover potential disease-related miRNAs. The success of adopting matrix factorization is demonstrated by its excellent performance in recommender systems. This approach considers a miRNA network as additional implicit feedback and makes predictions for disease associations relevant to a given miRNA based on its direct neighbors. Our experimental results demonstrate that IMDN achieved excellent performance with reliable area under the receiver operating characteristic (ROC) area under the curve (AUC) values of 0.9162 and 0.8965 in the frameworks of global and local leave-one-out cross-validations (LOOCV), respectively. Further, case studies demonstrated that our method can not only validate true miRNA-disease associations but also suggest novel disease-related miRNA candidates.


2008 ◽  
Vol 43 (3) ◽  
pp. 501-512 ◽  
Author(s):  
Lei Du-Cuny ◽  
Jörg Huwyler ◽  
Michael Wiese ◽  
Manfred Kansy

2020 ◽  
Author(s):  
Murat Sorkun ◽  
J. M. Koelman ◽  
Süleyman Er

Abstract Accurate prediction of the solubility of chemical substances in solvents remains a challenge. The sparsity of high-quality solubility data is recognized as the biggest hurdle in the development of robust data-driven methods for practical use. Nonetheless, the effects of the quality and quantity of data on aqueous solubility predictions have not yet been scrutinized. In this study, the roles of the size and the quality of datasets on the performances of the solubility prediction models are unraveled, and the concepts of actual and observed performances are introduced. In an effort to curtail the gap between actual and observed performances, a quality-oriented data selection method, which evaluates the quality of data and extracts the most accurate part of it through statistical validation, is designed. Applying this method on the largest publicly available solubility database and using a consensus machine learning approach, a top-performing solubility prediction model is achieved.


2021 ◽  
Author(s):  
Elif Sorkun ◽  
Qi Zhang ◽  
Abhishek Khetan ◽  
murat cihan sorkun ◽  
Süleyman Er

An increasing number of electroactive compounds have recently been explored for their use in high-performance redox flow batteries for grid-scale energy storage. Given the vast and highly diverse chemical space of the candidate compounds, it is alluring to access their physicochemical properties in a speedy way. High-throughput virtual screening approaches, which use powerful combinatorial techniques for systematic enumerations of large virtual chemical libraries and respective property evaluations, are indispensable tools for an agile exploration of the designated chemical space. Herein, RedDB: a computational database that contains 31,677 molecules from two prominent classes of organic electroactive compounds, quinones and aza-aromatics, has been presented. RedDB incorporates miscellaneous physicochemical property information of the compounds that can potentially be employed as battery performance descriptors. RedDB’s development steps, including: i)chemical library generation, ii) molecular property prediction based on quantum chemical calculations, iii) aqueous solubility prediction using machine learning, and iv) data processing and database creation, have been described.


2017 ◽  
Vol 28 (8) ◽  
pp. 661-676 ◽  
Author(s):  
O. A. Raevsky ◽  
V. Y. Grigorev ◽  
D. E. Polianczyk ◽  
O. E. Raevskaja ◽  
J. C. Dearden

Author(s):  
Jen-Hao Chen ◽  
Yufeng Jane Tseng

Abstract Aqueous solubility is the key property driving many chemical and biological phenomena and impacts experimental and computational attempts to assess those phenomena. Accurate prediction of solubility is essential and challenging, even with modern computational algorithms. Fingerprint-based, feature-based and molecular graph-based representations have all been used with different deep learning methods for aqueous solubility prediction. It has been clearly demonstrated that different molecular representations impact the model prediction and explainability. In this work, we reviewed different representations and also focused on using graph and line notations for modeling. In general, one canonical chemical structure is used to represent one molecule when computing its properties. We carefully examined the commonly used simplified molecular-input line-entry specification (SMILES) notation representing a single molecule and proposed to use the full enumerations in SMILES to achieve better accuracy. A convolutional neural network (CNN) was used. The full enumeration of SMILES can improve the presentation of a molecule and describe the molecule with all possible angles. This CNN model can be very robust when dealing with large datasets since no additional explicit chemistry knowledge is necessary to predict the solubility. Also, traditionally it is hard to use a neural network to explain the contribution of chemical substructures to a single property. We demonstrated the use of attention in the decoding network to detect the part of a molecule that is relevant to solubility, which can be used to explain the contribution from the CNN.


Sign in / Sign up

Export Citation Format

Share Document