scholarly journals SemaTyP: a knowledge graph based literature mining method for drug discovery

2018 ◽  
Vol 19 (1) ◽  
Author(s):  
Shengtian Sang ◽  
Zhihao Yang ◽  
Lei Wang ◽  
Xiaoxia Liu ◽  
Hongfei Lin ◽  
...  
Genes ◽  
2021 ◽  
Vol 12 (7) ◽  
pp. 998
Author(s):  
Peng Zhang ◽  
Yi Bu ◽  
Peng Jiang ◽  
Xiaowen Shi ◽  
Bing Lun ◽  
...  

This study builds a coronavirus knowledge graph (KG) by merging two information sources. The first source is Analytical Graph (AG), which integrates more than 20 different public datasets related to drug discovery. The second source is CORD-19, a collection of published scientific articles related to COVID-19. We combined both chemo genomic entities in AG with entities extracted from CORD-19 to expand knowledge in the COVID-19 domain. Before populating KG with those entities, we perform entity disambiguation on CORD-19 collections using Wikidata. Our newly built KG contains at least 21,700 genes, 2500 diseases, 94,000 phenotypes, and other biological entities (e.g., compound, species, and cell lines). We define 27 relationship types and use them to label each edge in our KG. This research presents two cases to evaluate the KG’s usability: analyzing a subgraph (ego-centered network) from the angiotensin-converting enzyme (ACE) and revealing paths between biological entities (hydroxychloroquine and IL-6 receptor; chloroquine and STAT1). The ego-centered network captured information related to COVID-19. We also found significant COVID-19-related information in top-ranked paths with a depth of three based on our path evaluation.


2009 ◽  
Vol 42 (2) ◽  
pp. 219-227 ◽  
Author(s):  
Ingrid Petriĕ ◽  
Tanja Urbanĕiĕ ◽  
Bojan Cestnik ◽  
Marta Macedoni-Lukšiĕ

2011 ◽  
Vol 2011 ◽  
pp. 1-8 ◽  
Author(s):  
Marta Macedoni-Lukšič ◽  
Ingrid Petrič ◽  
Bojan Cestnik ◽  
Tanja Urbančič

In the field of autism, an enormous increase in available information makes it very difficult to connect fragments of knowledge into a more coherent picture. We present a literature mining method, RaJoLink, to search for matched themes in unrelated literature that may contribute to a better understanding of complex pathological conditions, such as autism. 214 full text articles on autism, published in PubMed, served as a source of data. Using ontology construction, we identified the main concepts of what is already known about autism. Then, the RaJoLink method, based on Swanson's ABC model, was used to reveal potentially interesting, but not yet investigated, connections between different concepts in research. Among the more interesting concepts identified with RaJoLink in our study were calcineurin and NF-kappaB. Both terms can be linked to neuro-immune abnormalities in the brain of patients with autism. Further research is needed to provide stronger evidence about calcineurin and NF-kappaB involvement in autism. However, the analysis presented confirms that this method could support experts on their way towards discovering hidden relationships and towards a better understanding of the disorder.


2008 ◽  
Vol 9 (6) ◽  
pp. 479-492 ◽  
Author(s):  
P. Agarwal ◽  
D. B. Searls

2019 ◽  
Author(s):  
Xiaoyang Ji ◽  
Zhendong Feng ◽  
Qiangzu Zhang ◽  
Zhonghai Zhang ◽  
Yanhui Fan ◽  
...  

AbstractCancer clinical practice guidelines recommend different treatment options for different cancer types and are mainly developed by clinicians. In theory, those recommendation schemes that are supported by scientific research should provide better efficacy for patients. However, in actual clinical practice: “Is the choice of a specific antineoplastic drug for a specific cancer supported by the results of molecular biology mechanisms or based on the subjective experience of the clinician?” Answering this question is of significant importance for guiding clinical practice, but there is currently no operational method to provide objective judgment in specific cases. This paper describes a literature mining method that collates information from specific antineoplastic drug-related literature to establish an antineoplastic drug-gene association matrix for global or specific cancer scenarios, and further establishes a standard model and scenario models. Based on the parameters of these models, we constructed a linear regression analysis method to evaluate whether the models in different scenarios deviated from a random distribution. Finally, we determined the possible efficacy of an antineoplastic drug in different cancer types, which was validated by the Genomics of Drug Sensitivity in Cancer (GDSC) database. Using our mining method, we tested 18 antineoplastic drugs in 16 cancer types. We found that cisplatin used in ovarian cancer was more efficacious and may benefit patients more than when used in breast cancer, which provides a new paradigm for rational knowledge-driven drug distribution patterns in clinical practice.


2010 ◽  
Vol 72 (2) ◽  
pp. 201-208 ◽  
Author(s):  
Deepak K. Rajpal ◽  
Vinod Kumar ◽  
Pankaj Agarwal

Author(s):  
Kara Schatz ◽  
Cleber Melo-Filho ◽  
Alexander Tropsha ◽  
Rada Chirkova

Pharmaceutics ◽  
2021 ◽  
Vol 13 (6) ◽  
pp. 794
Author(s):  
Kevin McCoy ◽  
Sateesh Gudapati ◽  
Lawrence He ◽  
Elaina Horlander ◽  
David Kartchner ◽  
...  

Link prediction in artificial intelligence is used to identify missing links or derive future relationships that can occur in complex networks. A link prediction model was developed using the complex heterogeneous biomedical knowledge graph, SemNet, to predict missing links in biomedical literature for drug discovery. A web application visualized knowledge graph embeddings and link prediction results using TransE, CompleX, and RotatE based methods. The link prediction model achieved up to 0.44 hits@10 on the entity prediction tasks. The recent outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), also known as COVID-19, served as a case study to demonstrate the efficacy of link prediction modeling for drug discovery. The link prediction algorithm guided identification and ranking of repurposed drug candidates for SARS-CoV-2 primarily by text mining biomedical literature from previous coronaviruses, including SARS and middle east respiratory syndrome (MERS). Repurposed drugs included potential primary SARS-CoV-2 treatment, adjunctive therapies, or therapeutics to treat side effects. The link prediction accuracy for nodes ranked highly for SARS coronavirus was 0.875 as calculated by human in the loop validation on existing COVID-19 specific data sets. Drug classes predicted as highly ranked include anti-inflammatory, nucleoside analogs, protease inhibitors, antimalarials, envelope proteins, and glycoproteins. Examples of highly ranked predicted links to SARS-CoV-2: human leukocyte interferon, recombinant interferon-gamma, cyclosporine, antiviral therapy, zidovudine, chloroquine, vaccination, methotrexate, artemisinin, alkaloids, glycyrrhizic acid, quinine, flavonoids, amprenavir, suramin, complement system proteins, fluoroquinolones, bone marrow transplantation, albuterol, ciprofloxacin, quinolone antibacterial agents, and hydroxymethylglutaryl-CoA reductase inhibitors. Approximately 40% of identified drugs were not previously connected to SARS, such as edetic acid or biotin. In summary, link prediction can effectively suggest repurposed drugs for emergent diseases.


2018 ◽  
Vol 62 (6) ◽  
pp. 8:1-8:12
Author(s):  
R. L. Martin ◽  
D. Martinez Iraola ◽  
E. Louie ◽  
D. Pierce ◽  
B. A. Tagtow ◽  
...  

IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 8404-8415 ◽  
Author(s):  
Shengtian Sang ◽  
Zhihao Yang ◽  
Xiaoxia Liu ◽  
Lei Wang ◽  
Hongfei Lin ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document