Can an InChI for Nano Address the Need for a Simplified Representation of Complex Nanomaterials across Experimental and Nanoinformatics Studies?

Nanomaterials, 2020, Vol 10 (12), pp. 2493
Author(s): Iseult Lynch, Antreas Afantitis, Thomas Exner, Martin Himly, Vladimir Lobaskin, ...

Chemoinformatics has developed efficient ways of representing chemical structures for small molecules as simple text strings, simplified molecular-input line-entry system (SMILES) and the IUPAC International Chemical Identifier (InChI), which are machine-readable. In particular, InChIs have been extended to encode formalized representations of mixtures and reactions, and work is ongoing to represent polymers and other macromolecules in this way. The next frontier is encoding the multi-component structures of nanomaterials (NMs) in a machine-readable format to enable linking of datasets for nanoinformatics and regulatory applications. A workshop organized by the H2020 research infrastructure NanoCommons and the nanoinformatics project NanoSolveIT analyzed issues involved in developing an InChI for NMs (NInChI). The layers needed to capture NM structures include but are not limited to: core composition (possibly multi-layered); surface topography; surface coatings or functionalization; doping with other chemicals; and representation of impurities. NM distributions (size, shape, composition, surface properties, etc.), types of chemical linkages connecting surface functionalization and coating molecules to the core, and various crystallographic forms exhibited by NMs also need to be considered. Six case studies were conducted to elucidate requirements for unambiguous description of NMs. The suggested NInChI layers are intended to stimulate further analysis that will lead to the first version of a “nano” extension to the InChI standard.
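As an illustration of the layered encoding the proposed NInChI would extend, a standard InChI already separates structural information into slash-delimited layers. A minimal stdlib-only sketch (using the well-known standard InChI for ethanol) splits and labels those layers; a NInChI would add nano-specific layers such as core composition, coating, and size distribution on top of this pattern:

```python
# Minimal sketch: splitting a standard InChI string into its named layers.
# The prefixes below cover only the common standard layers; a NInChI would
# add nano-specific layers (core composition, coating, size, ...).
LAYER_NAMES = {
    "c": "connectivity",
    "h": "hydrogen",
    "q": "charge",
    "p": "protonation",
    "t": "tetrahedral stereo",
    "b": "double-bond stereo",
    "i": "isotopic",
}

def split_inchi_layers(inchi: str) -> dict:
    """Split an InChI into {layer name: content}."""
    body = inchi.split("=", 1)[1]          # drop the "InChI" prefix
    parts = body.split("/")
    layers = {"version": parts[0], "formula": parts[1]}
    for part in parts[2:]:
        name = LAYER_NAMES.get(part[0], part[0])
        layers[name] = part[1:]
    return layers

# Ethanol: InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3
layers = split_inchi_layers("InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3")
print(layers["formula"])        # C2H6O
print(layers["connectivity"])   # 1-2-3
```

The same slash-delimited, prefix-tagged layout is what makes InChIs machine-readable and canonicalizable, which is why the workshop frames the NM description problem as one of defining additional layers rather than inventing a new notation.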

2021, Vol 22 (14), pp. 7590
Author(s): Liza Vinhoven, Frauke Stanke, Sylvia Hafkemeyer, Manuel Manfred Nietert

Several causative therapeutics for CF patients have been developed, but mutation-specific therapeutics are still lacking for some patients, especially those with rare CFTR mutations. For this purpose, high-throughput screens have been performed, yielding various candidate compounds with mostly unclear modes of action. In order to elucidate the mechanism of action of promising candidate substances and to be able to predict possible synergistic effects of substance combinations, we used a systems biology approach to create a model of the CFTR maturation pathway in cells in a standardized, human- and machine-readable format. It is composed of a core map, manually curated from small-scale experiments in human cells, and a coarse map including interactors identified in large-scale efforts. The manually curated core map includes 170 different molecular entities and 156 reactions from 221 publications. The coarse map encompasses 1384 unique proteins from four publications. The overlap between the two data sources amounts to 46 proteins. The CFTR Lifecycle Map can be used to support the identification of potential targets inside the cell and to elucidate the mode of action of candidate substances. It thereby provides both a backbone to structure available data and a tool to develop hypotheses regarding novel therapeutics.
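The core-map/coarse-map design described above can be pictured as entities plus reactions, with the reported overlap being a simple set intersection. The sketch below is illustrative only, not the authors' actual data model; all entity and reaction names are invented placeholders:

```python
# Illustrative sketch (not the CFTR Lifecycle Map's actual schema): a pathway
# map as entities plus sourced reactions; the core/coarse overlap is a set
# intersection. All names below are invented placeholders.
from dataclasses import dataclass

@dataclass(frozen=True)
class Reaction:
    reactants: tuple
    products: tuple
    source: str  # e.g. an identifier of the curating publication

core_reactions = [
    Reaction(("CFTR", "HSP70"), ("CFTR:HSP70",), "PMID:0000001"),  # placeholder
]
core_proteins = {"CFTR", "HSP70", "HSP90", "CHIP"}
coarse_proteins = {"CFTR", "HSP70", "AHA1", "DERL1"}

overlap = core_proteins & coarse_proteins
print(sorted(overlap))  # ['CFTR', 'HSP70']
```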


Molecules, 2020, Vol 25 (22), pp. 5450
Author(s): Wan Pyo Hong, Inji Shin, Hee Nam Lim

It is known that 2-quinolones are broadly applicable chemical structures in medicinal and agrochemical research as well as in various functional materials. A number of current publications on their synthesis and applications emphasize the importance of these small molecules. The early synthetic chemistry originated from the same principles as the classical Friedländer and Knorr procedures for the preparation of quinolines. Analogous processes were developed by applying new synthetic tools such as novel catalysts and microwave irradiation, whereas recent innovations in bond-forming reactions have enabled novel strategies to construct the core structures of 2-quinolones beyond the bond disconnections of the two classical reactions. Over the last few decades, some reviews on structure-based, catalyst-based, and bioactivity-based studies have been released. In this focused review, we extensively surveyed recent examples of one-pot reactions, particularly in view of modular approaches. Thus, the contents are categorized into three major sections (two-, three-, and four-component reactions) according to the number of reagents that ultimately contribute atoms to the core structures of 2-quinolones. The collected synthetic methods are discussed from the perspectives of strategy, efficiency, selectivity, and reaction mechanism.


2021, pp. 79-90
Author(s): Christian Zinke-Wehlmann, Amit Kirschenbaum, Raul Palma, Soumya Brahma, Karel Charvát, ...

Data is the basis for creating information and knowledge. Having data in a structured and machine-readable format facilitates its processing and analysis. Moreover, metadata (data about the data) can help in discovering data based on features such as by whom they were created, when, or for which purpose. These associated features make the data more interpretable and assist in turning it into useful information. This chapter briefly introduces the concepts of metadata and Linked Data (highly structured and interlinked data), their standards and their usages, with some elaboration on the role of Linked Data in the bioeconomy.
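To make the metadata idea concrete, dataset descriptions of the kind the chapter discusses are commonly serialized as JSON-LD with vocabulary terms such as Dublin Core. The sketch below uses only the Python standard library; the dataset, URI, and creator are made-up examples:

```python
# Sketch: minimal machine-readable dataset metadata as JSON-LD using Dublin
# Core terms. The dataset, URI, and creator are invented examples.
import json

metadata = {
    "@context": {"dct": "http://purl.org/dc/terms/"},
    "@id": "https://example.org/dataset/soil-samples",   # placeholder URI
    "dct:creator": "Example Research Group",
    "dct:issued": "2021-05-01",
    "dct:description": "Soil nutrient measurements (illustrative).",
}

serialized = json.dumps(metadata, indent=2)
print(serialized)
restored = json.loads(serialized)
print(restored["dct:creator"])  # Example Research Group
```

Because the `@context` maps the `dct:` prefix to a shared vocabulary, a Linked Data consumer can interpret these fields without prior agreement with the producer, which is the discoverability benefit the chapter describes.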


2021
Author(s): Theo Araujo, Jef Ausloos, Wouter van Atteveldt, Felicia Loecherbach, Judith Moeller, ...

The digital traces that people leave through their use of various online platforms provide tremendous opportunities for studying human behavior. However, the collection of these data is hampered by legal, ethical and technical challenges. We present a framework and tool for collecting these data through a data donation platform where consenting participants can securely submit their digital traces. This approach leverages recent developments in data rights that have given people more control over their own data, such as legislation that now mandates companies to make digital trace data available on request in a machine-readable format. By transparently requesting access to specific parts of this data for clearly communicated academic purposes, the data ownership and privacy of participants is respected and researchers are less dependent on commercial organizations that store this data in proprietary archives. In this paper we outline the general design principles, the current state of the tool, and future development goals.
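The data-minimization step that such a platform performs can be sketched in a few lines: filter a machine-readable export down to the fields the study actually needs before anything is submitted. The export format and field names below are hypothetical, not those of any real platform or of the authors' tool:

```python
# Sketch of a data-minimization step a donation platform might apply:
# keep only allow-listed fields from a (hypothetical) JSON takeout export.
import json

raw_export = json.dumps([
    {"timestamp": "2021-03-01T10:00:00", "url": "https://news.example/a",
     "query": "private search terms"},
    {"timestamp": "2021-03-01T11:30:00", "url": "https://news.example/b",
     "query": "more private terms"},
])

ALLOWED_FIELDS = {"timestamp", "url"}  # study-specific allow-list

def minimize(export_json: str) -> list:
    """Keep only the allow-listed fields from each record."""
    records = json.loads(export_json)
    return [{k: v for k, v in r.items() if k in ALLOWED_FIELDS} for r in records]

donated = minimize(raw_export)
print(donated[0])  # {'timestamp': '2021-03-01T10:00:00', 'url': 'https://news.example/a'}
```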



1977, Vol 35, pp. 104-119
Author(s): Anne B. Underhill, Jaylee M. Mead

Many catalogues of astronomical data appear in book form as well as in a machine-readable format. The latter form is popular because of the convenience of handling large bodies of data by machine and because it is an efficient way to transmit and make accessible data in books which are now out of print or very difficult to obtain. Some new catalogues are prepared entirely in a machine-readable form, and the book form, if it exists at all, is of secondary importance for the preservation of the data.

In this paper comments are given about the importance of prefaces for transmitting the results of a critical evaluation of a body of data, and it is noted that it is essential that this type of documentation be transferred with any machine-readable catalogue. The types of error sometimes encountered in handling machine-readable catalogues are noted. The procedures followed in developing the Goddard Cross Index of eleven star catalogues are outlined as one example of how star catalogues can be compared using computers. The classical approach to evaluating data critically is reviewed, and the types of question one should ask and answer for particular types of data are listed. Finally, a specific application of these precepts to the problem of line identifications is given.
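The core operation of a cross-index like the Goddard one can be sketched as matching entries across catalogues by a shared identifier. This is a toy illustration, not the Goddard procedure itself, and the catalogue entries are illustrative:

```python
# Toy sketch of cross-indexing: match stars across two catalogues by a
# shared identifier (HD numbers; entries and magnitudes are illustrative).
catalogue_a = {"HD 48915": "Sirius", "HD 172167": "Vega", "HD 34029": "Capella"}
catalogue_b = {"HD 48915": {"vmag": -1.46}, "HD 172167": {"vmag": 0.03}}

cross_index = {
    hd: (catalogue_a[hd], catalogue_b[hd])
    for hd in catalogue_a.keys() & catalogue_b.keys()
}
print(sorted(cross_index))  # ['HD 172167', 'HD 48915']
```

In practice, as the paper notes, the hard part is not the join itself but the errors and documentation gaps in the machine-readable catalogues being joined.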


2020
Author(s): Tim Cernak, Babak Mahjour

High throughput experimentation (HTE) is an increasingly important tool in the study of chemical synthesis. While the hardware for running HTE in the synthesis lab has evolved significantly in recent years, there remains a need for software solutions to navigate data-rich experiments. We have developed the software phactor™ to facilitate the performance and analysis of HTE in a chemical laboratory. phactor™ allows experimentalists to rapidly design arrays of chemical reactions in 24-, 96-, 384-, or 1,536-well plates. Users can access online reagent data, such as a lab inventory, to populate wells with experiments and produce instructions to perform the screen manually or with the assistance of a liquid handling robot. After completion of the screen, analytical results can be uploaded for facile evaluation and to guide the next series of experiments. All chemical data, metadata, and results are stored in a machine-readable format.
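The basic array-design task such software automates is a combinatorial layout: assigning every combination of two reagent dimensions to wells of a plate. This is a stdlib sketch of that idea, not phactor™'s actual code or API, and the reagent lists are arbitrary examples:

```python
# Sketch (not phactor's actual code): lay out a catalyst x base screen on a
# 24-well plate (4 rows x 6 columns) using itertools.product.
from itertools import product
from string import ascii_uppercase

catalysts = ["Pd(OAc)2", "Pd(dba)2", "NiCl2", "CuI"]
bases = ["K2CO3", "Cs2CO3", "Et3N", "DBU", "KOtBu", "NaOAc"]

ROWS, COLS = 4, 6  # 24-well plate
wells = {}
for i, (cat, base) in enumerate(product(catalysts, bases)):
    well = f"{ascii_uppercase[i // COLS]}{i % COLS + 1}"  # A1 ... D6
    wells[well] = {"catalyst": cat, "base": base}

print(len(wells))   # 24
print(wells["A1"])  # {'catalyst': 'Pd(OAc)2', 'base': 'K2CO3'}
```

Storing the layout as a well-keyed mapping like this is one way to keep design, execution instructions, and uploaded analytical results joinable in a machine-readable record.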


2011, Vol 7 (4), pp. 26-46
Author(s): M. Thangamani, P. Thangaraj

The increase in the number of documents has aggravated the difficulty of classifying those documents according to specific needs. Clustering analysis in a distributed environment is a thrust area in artificial intelligence and data mining. Its fundamental task is to use document features to compute the degree of relatedness between objects and to accomplish automatic classification without prior knowledge. Document clustering applies clustering techniques to group highly similar documents together by computing inter-document similarity. Recent studies have shown that ontologies are useful in improving the performance of document clustering. An ontology is the conceptualization of a domain into an individually identifiable, machine-readable format containing entities, attributes, relationships, and axioms. By analyzing various techniques for document clustering, a clustering technique based on a Genetic Algorithm (GA) is determined to perform better. A Non-Dominated Ranked Genetic Algorithm (NRGA) is used in this paper for clustering, which is capable of providing a better classification result. Experiments are conducted on the 20 Newsgroups data set to evaluate the proposed technique. The results show that the proposed approach is very effective in clustering documents in a distributed environment.
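The similarity computation that any such clustering builds on can be sketched with bag-of-words cosine similarity; the GA/NRGA optimization layer described in the abstract is not shown, and the sample texts are invented:

```python
# Sketch of the document-similarity step clustering relies on: cosine
# similarity between bag-of-words vectors, stdlib only. (The GA/NRGA
# optimization described in the abstract is not shown.)
from collections import Counter
from math import sqrt

def cosine(doc_a: str, doc_b: str) -> float:
    a, b = Counter(doc_a.lower().split()), Counter(doc_b.lower().split())
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

s1 = cosine("machine readable data formats", "machine readable catalogues")
s2 = cosine("machine readable data formats", "genetic algorithm clustering")
print(s1 > s2)  # True
```

An ontology-aware variant would map surface terms to shared concepts before counting, which is how the ontology is reported to improve clustering quality.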

