scholarly journals Development of data infrastructure and in silico prediction method to promote genomic medicine

2021 ◽  
Vol 41 ◽  
pp. 02001
Author(s):  
Mayumi Kamada

In genome medicine, which is now being implemented in medical care, variants detected by genome analysis such as next-generation sequencers are clinically interpreted to determine the diagnosis and treatment plan. The clinical interpretation is performed based on the detailed clinical background and the information from journal papers and public databases, such as frequencies in the population and their relationship to the disease. A large amount of genomic data has been accumulated so far, and many genomic variant databases related to diseases have been developed, including ClinVar. On the other hand, the genes and variants involved in diseases are different between populations with different genetic backgrounds. Furthermore, it has been reported that there is a racial bias in the information shared in current public databases, which affects clinical interpretation. Therefore, increasing the diversity of genomic variant data has become an important issue worldwide. In Japan, the Japan Agency for Medical Research and Development (AMED) launched a project to develop an integrated clinical genome information database in 2016. This project targeted “Cancer,” “Rare/Intractable diseases,” “Infectious diseases,” “Dementia,” and “Hearing loss”, and in collaboration with research institutes that provide genomic medicine in Japan, we developed an integrated database named MGeND (Medical Genomics Japan Database). The MGeND is a freely accessible database, which provides disease-related genomic information detected from the Japanese population. The MGeND widely collects variant data for monogenic diseases represented by rare diseases and polygenic diseases such as dementia and infectious disease. The genome variant data are integrated by genomic position for these diseases and can be searched across diseases. The useful genome analysis methods differ depending on the disease area. Therefore, in addition to “SNV, short indel, SV, and CNV” data handled by ClinVar, MGeND includes GWAS (Genome-Wide Association Study) data, which is widely used in studies of polygenic diseases, and HLA (Human Leukemia Virus) allele frequency data, which is used in immune-related diseases such as infectious diseases. As of September 2021, more than 150,000 variants have been registered in MGeND, and 60,000 unique variants have been made public. Of these variants, about 70% were variants registered only in MGeND and not registered in ClinVar. This fact shows the importance of the efforts to collect genomic information by each ethnic group. On the other hands, many variants have not been annotated with any clinical interpretation because the effects on molecular function and the mechanisms of disease are not clear at this time. These variants of uncertain significance (VUS) are a bottleneck for genomic medicine because they cannot be used for diagnosis or treatment selection. The evaluation of VUS requires detailed experimental validation and a vast amount of knowledge integration, which is costly. In order to understand the molecular function and disease relevance of VUS and to enable optimal drug selection, we have been developing a machine learning-based method for predicting the pathogenicity of variants and a computational platform for estimating the effect of variants on drug sensitivity. Many methods for predicting the pathogenicity of genomic variants using machine learning have been developed. Most of them use the conservation of amino acid or nucleotide sequences among closely related species, physicochemical properties of proteins as features for prediction. There are also many prediction methods based on ensemble learning that aggregate the predicted scores by existing tools. These approaches focus on individual genes and variants and evaluate their effects. However, in many diseases, multiple molecules play a complex role in the pathogenesis of the disease. In other words, to assess the pathological significance of variants more accurately, it is necessary to consider the molecular association. Therefore, we constructed a knowledge graph based on molecular networks, genomic variants, and predicted scores by existing methods and proposed a prediction model using Graph Convolutional Network (GCN). The prediction performance evaluation using a benchmark set showed that the GCN-based method outperformed existing methods. It is known that variants can affect the interaction between a molecule and a drug. For optimal drug selection, it is necessary to clarify the effect of the variant on drug affinity. It is time-consuming and costly to perform experiments on a large number of VUSs. Our previous studies show that molecular dynamics calculation can evaluate the affinity between mutants and drugs energetically and estimate with high accuracy. We are currently working on a project to estimate the effects of a large number of VUSs using the supercomputer Fugaku. To realize calculations for many VUS in this project, we are developing a data platform for seamlessly performing molecular dynamics simulation from genome information. Moreover, we are constructing a database to publish calculation results and their outcomes for contributing a selection of optimal drugs. In the presentation, I will introduce the development of the databases and prediction methods to improve the efficiency of genomic medicine.

Pathogens ◽  
2021 ◽  
Vol 10 (3) ◽  
pp. 259
Author(s):  
Arne Schwelm ◽  
Jutta Ludwig-Müller

Here we review the usefulness of the currently available genomic information for the molecular identification of pathotypes. We focused on effector candidates and genes implied to be pathotype specific and tried to connect reported marker genes to Plasmodiophora brassicae genome information. The potentials for practical applications, current obstacles and future perspectives are discussed.


2020 ◽  
Vol 34 (01) ◽  
pp. 598-605
Author(s):  
Chaoran Cheng ◽  
Fei Tan ◽  
Zhi Wei

We consider the problem of Named Entity Recognition (NER) on biomedical scientific literature, and more specifically the genomic variants recognition in this work. Significant success has been achieved for NER on canonical tasks in recent years where large data sets are generally available. However, it remains a challenging problem on many domain-specific areas, especially the domains where only small gold annotations can be obtained. In addition, genomic variant entities exhibit diverse linguistic heterogeneity, differing much from those that have been characterized in existing canonical NER tasks. The state-of-the-art machine learning approaches heavily rely on arduous feature engineering to characterize those unique patterns. In this work, we present the first successful end-to-end deep learning approach to bridge the gap between generic NER algorithms and low-resource applications through genomic variants recognition. Our proposed model can result in promising performance without any hand-crafted features or post-processing rules. Our extensive experiments and results may shed light on other similar low-resource NER applications.


CNS Spectrums ◽  
2006 ◽  
Vol 11 (S3) ◽  
pp. 3-4 ◽  
Author(s):  
David A. Mrazek

AbstractAlthough most patients with depression ultimately respond to antidepressant therapy, >50% have inadequate response to an individual antidepressant trial. The desire to avoid adverse drug reactions is common among patients, and is an important determinant of drug selection among psychiatrists. However, since the major classes of antidepressants and antipsychotics appear to be comparable in efficacy, clinicians have little basis for selecting the most effective agent for an individual patient. Pharmacogenetics, often described as the study of genetic variation that explains differential response to medication, represents an important new avenue toward improving treatment outcomes. Genetic variation in drug-metabolizing enzymes has been recognized for decades. The main focus of current psychiatric pharmacogenetic testing is on the cytochrome P450 (CYP) 2D6 and, to a somewhat lesser extent, on the 2C19 genes. Data suggest that poor metabolizer status can be associated with an increased risk of adverse drug reactions with certain medications, and that ultra-rapid metabolizers may require higher-than-usual doses to achieve a therapeutic response. The importance of CYP enzymes in the metabolism of several antidepressant and antipsychotic drugs suggest that genetic variation may aid in medication selection or dosing. Advances in pharmacogenetic research may facilitate the development of personalized medicine in which genetic information can inform drug selection, leading to optimal drug effectiveness and minimal drug toxicity.


Author(s):  
Janet Piñero ◽  
Juan Manuel Ramírez-Anguita ◽  
Josep Saüch-Pitarch ◽  
Francesco Ronzano ◽  
Emilio Centeno ◽  
...  

Abstract One of the most pressing challenges in genomic medicine is to understand the role played by genetic variation in health and disease. Thanks to the exploration of genomic variants at large scale, hundreds of thousands of disease-associated loci have been uncovered. However, the identification of variants of clinical relevance is a significant challenge that requires comprehensive interrogation of previous knowledge and linkage to new experimental results. To assist in this complex task, we created DisGeNET (http://www.disgenet.org/), a knowledge management platform integrating and standardizing data about disease associated genes and variants from multiple sources, including the scientific literature. DisGeNET covers the full spectrum of human diseases as well as normal and abnormal traits. The current release covers more than 24 000 diseases and traits, 17 000 genes and 117 000 genomic variants. The latest developments of DisGeNET include new sources of data, novel data attributes and prioritization metrics, a redesigned web interface and recently launched APIs. Thanks to the data standardization, the combination of expert curated information with data automatically mined from the scientific literature, and a suite of tools for accessing its publicly available data, DisGeNET is an interoperable resource supporting a variety of applications in genomic medicine and drug R&D.


2020 ◽  
Vol 10 (4) ◽  
pp. 195
Author(s):  
Kentaro Inamura

The development of high-throughput techniques has permitted the accumulation of enormous amounts of genomic information. As increasing numbers of studies aim to utilize individual genomic information for diagnostic, preventive, or therapeutic purposes, Institutional Review Boards (IRBs) have a greater opportunity to review such types of study protocols. An article published in the Journal of Personalized Medicine titled, “Ethical Considerations Related to Return of Results from Genomic Medicine Projects: The eMERGE Network (Phase III) Experience” identified the common concerns of IRBs in the process of reviewing such studies, and some concerns included the readability of informed consent materials, potential risks to participants, information sharing with family members, options for withdrawal or receiving limited results, and provisions to clear participant questions. Since there is an increase in the number of genomic medicine implementation studies worldwide, the insights provided by this study would assist future researchers in protocol preparation as well as aid project review by IRB members.


2019 ◽  
Vol 37 (15_suppl) ◽  
pp. 11045-11045
Author(s):  
Nathan David Seligson ◽  
Achal Awasthi ◽  
Sherri Z. Millis ◽  
David A. Liebner ◽  
John L. Hays ◽  
...  

11045 Background: Epithelioid hemangioendothelioma (EHE) is a rare vascular sarcoma characterized by the WWTR1- CAMTA1 fusion ( WC-F) in a majority of cases. EHE demonstrates a biphasic clinical course; remaining indolent for many years before suddenly demonstrating aggressive progression. Cell cycle mutations have been previously noted to account for some secondary alterations; however, little is known regarding the chronicity of these secondary alterations or their clinical implications. Here we present the largest assessment of secondary genomic variants and their clinical import. Methods: Comprehensive genomic profiling from 45 WC-F positive EHE patients (pts) were provided by Foundation Medicine (FMI). 8 of these 45 pts were treated at The Ohio State University (OSU) and were evaluated retrospectively through chart review. Known deleterious alterations, variants of unknown significance (VUS), and genomic copy quantification for the WC-F was included in our analysis. Targetable gene variants were defined by OncoKB. Chi-square and student’s t-tests were used as appropriate. Results: Genomic copy number of the WC-F best fit a log-normal distribution (range: 13-2,131 copies). 20 pts (44%) did not exhibit any secondary genomic variants. The most commonly altered genes included: CDKN2A/B (7 variants), RB1 (3 variants), and ATRX (3 variants). Commonly identified pathways included: cell cycle (9 pts, 20.0%), epigenetic modulators (7 pts, 15.6%), and DNA damage repair (7 pts, 15.6%). Eight pts exhibited targetable gene variants (18%) as defined by OncoKB. Subjects ≥50 years of age exhibited a greater proportion of clinically targetable variants (27.6% vs 0%; p = 0.02). Pts with a secondary genomic variant exhibited elevated WC-F copy numbers (p < 0.001). OSU pts with aggressive EHE were more likely to have a second genomic variant (80% vs 0%; p = 0.03) when compared to indolent EHE, with trends toward higher WC-F copy numbers (809±315 vs 207±147; p = 0.2) and older age at diagnosis (59.5±5.5 vs 36.7±8.8; p = 0.1). Conclusions: In this study, secondary genomic variants in WC-F driven EHE are more common in older patients ( > 50 yo). Further, the presence of secondary genomic variants is associated with an aggressive phenotype and may drive poor prognosis. Prospective research is needed to confirm these findings.


CNS Spectrums ◽  
2006 ◽  
Vol 11 (S3) ◽  
pp. 8-12 ◽  

AbstractAlthough most patients with depression ultimately respond to antidepressant therapy, >50% have inadequate response to an individual antidepressant trial. The desire to avoid adverse drug reactions is common among patients, and is an important determinant of drug selection among psychiatrists. However, since the major classes of antidepressants and antipsychotics appear to be comparable in efficacy, clinicians have little basis for selecting the most effective agent for an individual patient. Pharmacogenetics, often described as the study of genetic variation that explains differential response to medication, represents an important new avenue toward improving treatment outcomes. Genetic variation in drug-metabolizing enzymes has been recognized for decades. The main focus of current psychiatric pharmacogenetic testing is on the cytochrome P450 (CYP) 2D6 and, to a somewhat lesser extent, on the 2C19 genes. Data suggest that poor metabolizer status can be associated with an increased risk of adverse drug reactions with certain medications, and that ultra-rapid metabolizers may require higher-than-usual doses to achieve a therapeutic response. The importance of CYP enzymes in the metabolism of several antidepressant and antipsychotic drugs suggest that genetic variation may aid in medication selection or dosing. Advances in pharmacogenetic research may facilitate the development of personalized medicine in which genetic information can inform drug selection, leading to optimal drug effectiveness and minimal drug toxicity.In this monograph, David A. Mrazek, MD, provides an overview of the context of genetic testing in clinical psychiatric practice. Next, Jordan W. Smoller, MD, ScD, discusses some of the practical issues related to medication selection. Finally, Jose de Leon, MD, presents a comprehensive review of antidepressant and antipsychotic treatment based on drug metabolism, and reviews the available testing methods for CYP 2D6 and 2C19 genotypes.


Author(s):  
Wylie Burke

Genomic information is poised to play an increasing role in clinical care, extending beyond highly penetrant genetic conditions to less penetrant genotypes and common disorders. But with this shift, the question of clinical utility becomes a major challenge. A collaborative effort is necessary to determine the information needed to evaluate different uses of genomic information and then acquire that information. Another challenge must also be addressed if that process is to provide equitable benefits: the lack of diversity of genomic data. Current genomic knowledge comes primarily from populations of European descent, which poses the risk that most of the human population will be shortchanged when health benefits of genomics emerge. These two challenges have defined my career as a geneticist and have taught me that solutions must start with dialogue across disciplinary and social divides. Expected final online publication date for the Annual Review of Genomics and Human Genetics Volume 22 is August 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.


Sign in / Sign up

Export Citation Format

Share Document