chemical ontology
Recently Published Documents


TOTAL DOCUMENTS

18
(FIVE YEARS 6)

H-INDEX

6
(FIVE YEARS 1)

2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Janna Hastings ◽  
Martin Glauer ◽  
Adel Memariani ◽  
Fabian Neuhaus ◽  
Till Mossakowski

AbstractChemical data is increasingly openly available in databases such as PubChem, which contains approximately 110 million compound entries as of February 2021. With the availability of data at such scale, the burden has shifted to organisation, analysis and interpretation. Chemical ontologies provide structured classifications of chemical entities that can be used for navigation and filtering of the large chemical space. ChEBI is a prominent example of a chemical ontology, widely used in life science contexts. However, ChEBI is manually maintained and as such cannot easily scale to the full scope of public chemical data. There is a need for tools that are able to automatically classify chemical data into chemical ontologies, which can be framed as a hierarchical multi-class classification problem. In this paper we evaluate machine learning approaches for this task, comparing different learning frameworks including logistic regression, decision trees and long short-term memory artificial neural networks, and different encoding approaches for the chemical structures, including cheminformatics fingerprints and character-based encoding from chemical line notation representations. We find that classical learning approaches such as logistic regression perform well with sets of relatively specific, disjoint chemical classes, while the neural network is able to handle larger sets of overlapping classes but needs more examples per class to learn from, and is not able to make a class prediction for every molecule. Future work will explore hybrid and ensemble approaches, as well as alternative network architectures including neuro-symbolic approaches.


2020 ◽  
Author(s):  
Janna Hastings ◽  
Martin Glauer ◽  
Adel Memariani ◽  
Fabian Neuhaus ◽  
Till Mossakowski

Abstract Chemical data is increasingly openly available in databases such as PubChem, which contains more than 110 million compound entries as of October 2020. With the availability of data at such scale, the burden has shifted to organisation, analysis and interpretation. Chemical ontologies provide structured classifications of chemical entities that can be used for navigation and filtering of the large chemical space. ChEBI is a prominent example of a chemical ontology, widely used in life science contexts. However, ChEBI is manually maintained and as such cannot easily scale to the full scope of public chemical data. There is a need for tools that are able to automatically classify chemical data into chemical ontologies, which can be framed as a hierarchical multi-class classification problem. In this paper we evaluate machine learning approaches for this task, comparing different learning frameworks including logistic regression, decision trees and long short-term memory articial neural networks, and different encoding approaches for the chemical structures, including cheminformatics fingerprints and character-based encoding from chemical line notation representations. We nd that classical learning approaches such as logistic regression perform well with sets of relatively specific, disjoint chemical classes, while the neural network is able to handle larger sets of overlapping classes but needs more examples per class to learn from, and is not able to make a class prediction for every molecule. Future work will explore hybrid and ensemble approaches, as well as alternative network architectures including neuro-symbolic approaches.


2020 ◽  
Vol 49 (D1) ◽  
pp. D461-D467
Author(s):  
Milton H Saier ◽  
Vamsee S Reddy ◽  
Gabriel Moreno-Hagelsieb ◽  
Kevin J Hendargo ◽  
Yichi Zhang ◽  
...  

Abstract The Transporter Classification Database (TCDB; tcdb.org) is a freely accessible reference resource, which provides functional, structural, mechanistic, medical and biotechnological information about transporters from organisms of all types. TCDB is the only transport protein classification database adopted by the International Union of Biochemistry and Molecular Biology (IUBMB) and now (October 1, 2020) consists of 20 653 proteins classified in 15 528 non-redundant transport systems with 1567 tabulated 3D structures, 18 336 reference citations describing 1536 transporter families, of which 26% are members of 82 recognized superfamilies. Overall, this is an increase of over 50% since the last published update of the database in 2016. This comprehensive update of the database contents and features include (i) adoption of a chemical ontology for substrates of transporters, (ii) inclusion of new superfamilies, (iii) a domain-based characterization of transporter families for the identification of new members as well as functional and evolutionary relationships between families, (iv) development of novel software to facilitate curation and use of the database, (v) addition of new subclasses of transport systems including 11 novel types of channels and 3 types of group translocators and (vi) the inclusion of many man-made (artificial) transmembrane pores/channels and carriers.


Author(s):  
Marina Paola Banchetti-Robino

This book examines the way in which Robert Boyle seeks to accommodate his complex chemical philosophy within the framework of a mechanistic theory of matter. More specifically, the book proposes that Boyle regards chemical qualities as properties that emerge from the mechanistic structure of chymical atoms. Within Boyle’s chemical ontology, chymical atoms are structured concretions of particles that Boyle regards as chemically elementary entities, that is, as chemical wholes that resist experimental analysis. Although this interpretation of Boyle’s chemical philosophy has already been suggested by other Boyle scholars, the present book provides a sustained philosophical argument to demonstrate that, for Boyle, chemical properties are dispositional, relational, emergent, and supervenient properties. This argument is strengthened by a detailed mereological analysis of Boylean chymical atoms that establishes the kind of theory of wholes and parts that is most consistent with his emergentist conception of chemical properties. The emergentist position that is being attributed to Boyle supports his view that chemical reactions resist direct explanation in terms of the mechanistic properties of fundamental particles, as well as his position regarding the scientific autonomy of chemistry from mechanics and physics.


Author(s):  
Marina Paola Banchetti-Robino

This chapter presents original arguments for the view that Boyle regarded chemical properties as being dispositional, relational, and emergent properties. The chapter begins by discussing the hierarchy of properties in Boyle’s chemical ontology and Boyle’s notion of sensible properties as being dispositional and relational. Both of these sections are informed by Peter Anstey’s discussion of these topics. The chapter then moves beyond Anstey’s discussion by arguing for the view that Boyle regarded chemical properties as dispositional, relational, emergent, and supervenient properties. The chapter cites many examples from Boyle’s writings to demonstrate that he considered chemical properties to display the various features that are required for emergence, that is, supervenience, non-summative difference, and underdetermination. After this extensive discussion, the chapter concludes by establishing that Boyle also considered cosmical qualities as dispositional and relational, thus demonstrating the considerable philosophical sophistication of Boyle’s natural philosophy and of his entire experimental programme.


2020 ◽  
Vol 36 (11) ◽  
pp. 3568-3569 ◽  
Author(s):  
Jian-Peng Zhou ◽  
Lei Chen ◽  
Tianyun Wang ◽  
Min Liu

Abstract Motivation Anatomical therapeutic chemical (ATC) classification system is very important for drug utilization and studies. Correct prediction of the 14 classes in the first level for given drugs is an essential problem for the study on such system. Several multi-label classifiers have been proposed in this regard. However, only two of them provided the web servers and their performance was not very high. On the other hand, although some rest classifiers can provide better performance, they were built based on some prior knowledge on drugs, such as information of chemical–chemical interaction and chemical ontology, leading to limited applications. Furthermore, provided codes of these classifiers are almost inaccessible for pharmacologists. Results In this study, we built a simple web server, namely iATC-FRAKEL. This web server only required the SMILES format of drugs as input and extracted their fingerprints for making prediction. The performance of the iATC-FRAKEL was much higher than all existing web servers and was comparable to the best multi-label classifier but had much wider applications. Such web server can be visited at http://cie.shmtu.edu.cn/iatc/index. Availability and implementation The web server is available at http://cie.shmtu.edu.cn/iatc/index. Contact [email protected] Supplementary information Supplementary data are available at Bioinformatics online.


BMC Genomics ◽  
2013 ◽  
Vol 14 (1) ◽  
pp. 513 ◽  
Author(s):  
David P Hill ◽  
Nico Adams ◽  
Mike Bada ◽  
Colin Batchelor ◽  
Tanya Z Berardini ◽  
...  

2012 ◽  
Vol 4 (1) ◽  
Author(s):  
Claudia Bobach ◽  
Timo Böhme ◽  
Ulf Laube ◽  
Anett Püschel ◽  
Lutz Weber

2012 ◽  
Vol 4 (S1) ◽  
Author(s):  
Lian Duan ◽  
Janna Hastings ◽  
Paula de Matos ◽  
Marcus Ennis ◽  
Christoph Steinbeck

Sign in / Sign up

Export Citation Format

Share Document