A comprehensive and bias-free evaluation of genomic variant clinical interpretation tools

In genomic medicine, which is now being implemented in clinical care, variants detected by genome analysis methods such as next-generation sequencing are clinically interpreted to determine the diagnosis and treatment plan. Clinical interpretation draws on the detailed clinical background and on information from journal articles and public databases, such as population frequencies and associations with disease. A large amount of genomic data has been accumulated, and many disease-related genomic variant databases have been developed, including ClinVar. However, the genes and variants involved in diseases differ between populations with different genetic backgrounds. Furthermore, it has been reported that the information shared in current public databases is racially biased, which affects clinical interpretation. Increasing the diversity of genomic variant data has therefore become an important issue worldwide. In Japan, the Japan Agency for Medical Research and Development (AMED) launched a project in 2016 to develop an integrated clinical genome information database. This project targeted “Cancer,” “Rare/Intractable diseases,” “Infectious diseases,” “Dementia,” and “Hearing loss,” and in collaboration with research institutes that provide genomic medicine in Japan, we developed an integrated database named MGeND (Medical Genomics Japan Database). MGeND is a freely accessible database that provides disease-related genomic information detected in the Japanese population. It collects variant data broadly, covering monogenic diseases, typified by rare diseases, and polygenic diseases such as dementia and infectious diseases. The variant data are integrated by genomic position and can be searched across diseases. The most useful genome analysis methods differ by disease area. 
Therefore, in addition to the SNV, short indel, SV, and CNV data handled by ClinVar, MGeND includes GWAS (genome-wide association study) data, which are widely used in studies of polygenic diseases, and HLA (human leukocyte antigen) allele frequency data, which are used in immune-related diseases such as infectious diseases. As of September 2021, more than 150,000 variants had been registered in MGeND, and 60,000 unique variants had been made public. About 70% of these variants were registered only in MGeND and not in ClinVar, which shows the importance of collecting genomic information for each ethnic group. On the other hand, many variants have not been annotated with any clinical interpretation because their effects on molecular function and the mechanisms of disease are not yet clear. These variants of uncertain significance (VUS) are a bottleneck for genomic medicine because they cannot be used for diagnosis or treatment selection. Evaluating VUS requires detailed experimental validation and the integration of a vast amount of knowledge, which is costly. To understand the molecular function and disease relevance of VUS and to enable optimal drug selection, we have been developing a machine learning-based method for predicting the pathogenicity of variants and a computational platform for estimating the effect of variants on drug sensitivity. Many machine learning methods for predicting the pathogenicity of genomic variants have been developed. Most use the conservation of amino acid or nucleotide sequences among closely related species and the physicochemical properties of proteins as prediction features. Many others are ensemble methods that aggregate the scores predicted by existing tools. These approaches focus on individual genes and variants and evaluate their effects. 
However, in many diseases, multiple molecules interact in complex ways in pathogenesis. To assess the pathological significance of variants more accurately, it is therefore necessary to consider molecular associations. We constructed a knowledge graph from molecular networks, genomic variants, and the scores predicted by existing methods, and proposed a prediction model based on a graph convolutional network (GCN). In a performance evaluation on a benchmark set, the GCN-based method outperformed existing methods. Variants are also known to affect the interaction between a molecule and a drug. For optimal drug selection, the effect of a variant on drug affinity must be clarified, but performing experiments on a large number of VUS is time-consuming and costly. Our previous studies show that molecular dynamics calculations can evaluate the affinity between mutants and drugs energetically and estimate it with high accuracy. We are currently working on a project to estimate the effects of a large number of VUS using the supercomputer Fugaku. To make calculations for many VUS feasible, we are developing a data platform that runs molecular dynamics simulations seamlessly from genome information. We are also constructing a database to publish the calculation results and thereby contribute to the selection of optimal drugs. In this presentation, I will introduce the development of these databases and prediction methods for improving the efficiency of genomic medicine.
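
As a rough illustration of the graph convolution idea mentioned in the abstract above, the sketch below runs one generic GCN propagation step, ReLU(D^-1/2 (A + I) D^-1/2 H W), on a toy graph. It is not the authors' model: the three-node graph, the node features, the weight matrix, and the function names `normalize_adjacency`, `matmul`, and `gcn_layer` are all assumptions made up for this example.

```python
import math


def normalize_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2: add self-loops, then normalize symmetrically."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def gcn_layer(a_norm, features, weights):
    """One propagation step: ReLU(A_norm @ H @ W)."""
    z = matmul(matmul(a_norm, features), weights)
    return [[max(v, 0.0) for v in row] for row in z]


# Hypothetical toy graph: nodes 0 and 1 are interacting genes,
# node 2 is a variant attached to gene 0.
adj = [
    [0.0, 1.0, 1.0],
    [1.0, 0.0, 0.0],
    [1.0, 0.0, 0.0],
]
# Hypothetical node features, e.g. two scores from existing predictors.
features = [
    [0.2, 0.9],
    [0.5, 0.1],
    [0.8, 0.7],
]
# Hypothetical fixed weights; a real model would learn these.
weights = [
    [1.0, -1.0],
    [0.5, 0.5],
]

a_norm = normalize_adjacency(adj)
hidden = gcn_layer(a_norm, features, weights)
for row in hidden:
    print(row)
```

Each output row mixes a node's own features with those of its neighbors, which is how a variant's representation can pick up information from the molecular network around its gene.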

The clinical interpretation of common abnormalities in the serum concentrations of certain electrolytes

Medical Clinics of North America ◽

10.1016/s0025-7125(16)35217-8 ◽

1951 ◽

Vol 35 (6) ◽

pp. 1807-1828 ◽

Cited By ~ 3

Author(s):

Russell D. Squires ◽

J. Russell Elkinton

Keyword(s):

Serum Concentrations ◽

Clinical Interpretation

Review of Case studies of the clinical interpretation of the Bender Gestalt Test.

Contemporary Psychology ◽

10.1037/015833 ◽

1977 ◽

Vol 22 (3) ◽

pp. 230-230

Author(s):

James Bieri

Keyword(s):

Case Studies ◽

Clinical Interpretation ◽

Bender Gestalt Test

Review of The Halstead-Reitan Neuropsychological Test Battery: Theory and Clinical Interpretation.

Contemporary Psychology ◽

10.1037/024717 ◽

1986 ◽

Vol 31 (4) ◽

pp. 309-309

Author(s):

No authorship indicated

Keyword(s):

Neuropsychological Test ◽

Test Battery ◽

Clinical Interpretation ◽

Neuropsychological Test Battery

A Comprehensive Handbook for MMPI–2–RF Clinical Interpretation

PsycCRITIQUES ◽

10.1037/a0030420 ◽

2012 ◽

Vol 57 (47) ◽

Author(s):

James Moses

Keyword(s):

Clinical Interpretation

Review of WISC-IV advanced clinical interpretation.

Canadian Psychology/Psychologie canadienne ◽

10.1037/cp2007_1_51 ◽

2007 ◽

Vol 48 (1) ◽

pp. 51-53

Author(s):

Rebecca Pillai Riddell

Keyword(s):

Clinical Interpretation

Evaluation of management strategy results of laboratory studies in serum samples with hemolysis

Medical alphabet ◽

10.33667/2078-5631-2019-1-4(379)-43-45 ◽

2019 ◽

Vol 1 (4) ◽

pp. 43-45

Author(s):

O. A. Klimenkova ◽

V. P. Pashkova ◽

T. V. Vavilova ◽

V. S. Berestovskaya

Keyword(s):

Management Strategy ◽

Negative Consequences ◽

Laboratory Studies ◽

Serum Samples ◽

Ongoing Debate ◽

Clinical Interpretation ◽

Points Of View ◽

Test Result ◽

High Uncertainty

There is an ongoing debate about what the laboratory should do with hemolyzed samples. Several strategies have been proposed for managing the results obtained from such samples. The safest option from both the analytical and clinical points of view is to repeat the test on a new sample without hemolysis. Another approach is to run the test regardless, while flagging the limits on clinical interpretation of the result with a comment about possible hemoglobin interference. The choice of strategy should be based on comparing the risk of negative consequences when no test result is available with the likelihood of harm from reporting a result with high uncertainty to the clinician.

Enteropathogens in pediatric gastroenteritis: clinical interpretation of new tools results

10.26226/morressier.5ad774dcd462b80296ca67b4 ◽

2018 ◽

Author(s):

Anne Tilmanne

Keyword(s):

Clinical Interpretation

Complex biological patterns of hematology parameters in childhood necessitating age‐ and sex‐specific reference intervals for evidence‐based clinical interpretation

International Journal of Laboratory Hematology ◽

10.1111/ijlh.13306 ◽

2020 ◽

Vol 42 (6) ◽

pp. 750-760

Author(s):

Mary Kathryn Bohn ◽

Victoria Higgins ◽

Houman Tahmasebi ◽

Alexandra Hall ◽

En Liu ◽

...

Keyword(s):

Reference Intervals ◽

Specific Reference ◽

Evidence Based ◽

Clinical Interpretation ◽

Hematology Parameters ◽

Biological Patterns ◽

Age And Sex

More for less: predicting and maximizing genomic variant discovery via Bayesian nonparametrics

Biometrika ◽

10.1093/biomet/asab012 ◽

2021 ◽

Author(s):

Lorenzo Masoero ◽

Federico Camerlenghi ◽

Stefano Favaro ◽

Tamara Broderick

Keyword(s):

Optimal Allocation ◽

Bayesian Nonparametrics ◽

Allocation Of Resources ◽

Experimental Conditions ◽

Follow Up Study ◽

Variant Discovery ◽

Fixed Budget ◽

The Cost ◽

Genomic Variant

While the cost of sequencing genomes has decreased dramatically in recent years, this expense often remains non-trivial. Under a fixed budget, scientists face a natural trade-off between quantity and quality: spending resources to sequence a greater number of genomes or spending resources to sequence genomes with increased accuracy. Our goal is to find the optimal allocation of resources between quantity and quality. Optimizing resource allocation promises to reveal as many new variations in the genome as possible. In this paper, we introduce a Bayesian nonparametric methodology to predict the number of new variants in a follow-up study based on a pilot study. When experimental conditions are kept constant between the pilot and follow-up, we find that our prediction is competitive with the best existing methods. Unlike current methods, though, our new method allows practitioners to change experimental conditions between the pilot and the follow-up. We demonstrate how this distinction allows our method to be used for more realistic predictions and for optimal allocation of a fixed budget between quality and quantity.
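
To make the prediction task in the abstract above concrete, the sketch below uses the classical Good-Toulmin estimator, a frequency-based baseline, to estimate how many new variants a follow-up study might reveal. This is not the Bayesian nonparametric method the paper introduces, and it assumes experimental conditions do not change between pilot and follow-up. The function name `good_toulmin_new_variants`, the variant labels, and all counts are hypothetical.

```python
from collections import Counter


def good_toulmin_new_variants(variant_counts, n_pilot, m_followup):
    """Estimate the number of unseen variants expected in m_followup more genomes.

    variant_counts maps each variant observed in the pilot to the number of
    pilot genomes carrying it. The estimator is
        sum over k >= 1 of (-1)^(k+1) * t^k * phi_k,   with t = m_followup / n_pilot,
    where phi_k is the number of variants seen in exactly k pilot genomes.
    The alternating series is only reliable for t <= 1, so larger
    follow-ups are rejected here.
    """
    if n_pilot <= 0 or m_followup < 0:
        raise ValueError("n_pilot must be positive and m_followup non-negative")
    if m_followup > n_pilot:
        raise ValueError("this simple form is only used for m_followup <= n_pilot")
    t = m_followup / n_pilot
    freq_of_freq = Counter(variant_counts.values())  # k -> phi_k
    return sum((-1) ** (k + 1) * t ** k * phi_k
               for k, phi_k in freq_of_freq.items())


# Hypothetical pilot of 10 genomes: three singletons, one doubleton, one tripleton.
pilot_counts = {"v1": 1, "v2": 1, "v3": 1, "v4": 2, "v5": 3}
estimate = good_toulmin_new_variants(pilot_counts, n_pilot=10, m_followup=5)
print(estimate)  # 0.5*3 - 0.25*1 + 0.125*1 = 1.375
```

Variants seen only once in the pilot dominate the estimate, which reflects the intuition that many rare variants in the pilot suggest many more remain undiscovered.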
