scholarly journals Finding human gene-disease associations using a Network Enhanced Similarity Search (NESS) of multi-species heterogeneous functional genomics data

Author(s):  
Timothy Reynolds ◽  
Jason A. Bubier ◽  
Michael A. Langston ◽  
Elissa J. Chesler ◽  
Erich J. Baker

AbstractDisease diagnosis and treatment is challenging in part due to the misalignment of diagnostic categories with the underlying biology of disease. The evaluation of large-scale genomic experimental datasets is a compelling approach to refining the classification of biological concepts, such as disease. Well-established approaches, some of which rely on information theory or network analysis, quantitatively assess relationships among biological entities using gene annotations, structured vocabularies, and curated data sources. However, the gene annotations used in these evaluations are often sparse, potentially biased due to uneven study and representation in the literature, and constrained to the single species from which they were derived. In order to overcome these deficiencies inherent in the structure and sparsity of these annotated datasets, we developed a novel Network Enhanced Similarity Search (NESS) tool which takes advantage of multi-species networks of heterogeneous data to bridge sparsely populated datasets.NESS employs a random walk with restart algorithm across harmonized multi-species data, effectively compensating for sparsely populated and noisy genomic studies. We further demonstrate that it is highly resistant to spurious or sparse datasets and generates significantly better recapitulation of ground truth biological pathways than other similarity metrics alone. Furthermore, since NESS has been deployed as an embedded tool in the GeneWeaver environment, it can rapidly take advantage of curated multi-species networks to provide informative assertions of relatedness of any pair of biological entities or concepts, e.g., gene-gene, gene-disease, or phenotype-disease associations. NESS ultimately enables multi-species analysis applications to leverage model organism data to overcome the challenge of data sparsity in the study of human disease.Availability and ImplementationImplementation available at https://geneweaver.org/ness. Source code freely available at https://github.com/treynr/ness.Author summaryFinding consensus among large-scale genomic datasets is an ongoing challenge in the biomedical sciences. Harmonizing and analyzing such data is important because it allows researchers to mitigate the idiosyncrasies of experimental systems, alleviate study biases, and augment sparse datasets. Additionally, it allows researchers to utilize animal model studies and cross-species experiments to better understand biological function in health and disease. Here we provide a tool for integrating and analyzing heterogeneous functional genomics data using a graph-based model. We show how this type of analysis can be used to identify similar relationships among biological entities such as genes, processes, and disease through shared genomic associations. Our results indicate this approach is effective at reducing biases caused by sparse and noisy datasets. We show how this type of analysis can be used to aid the classification gene function and prioritization of genes involved in substance use disorders. In addition, our analysis reveals genes and biological pathways with shared association to multiple, co-occurring substance use disorders.

2021 ◽  
Author(s):  
Peter B. Barr ◽  
Tim B. Bigdeli ◽  
Jacquelyn M. Meyers

ABSTRACTImportanceAll of Us is a landmark initiative for population-scale research into the etiology of psychiatric disorders and disparities across various sociodemographic categories.ObjectiveTo estimate the prevalence, comorbidity, and demographic covariates of psychiatric and substance use disorders in the All of Us biobank.Design, Setting, and ParticipantsWe estimated prevalence, overlap, and demographic correlates for psychiatric disorders derived from electronic health records in the All of Us biobank (release 5; N = 331,380)ExposuresSocial and demographic covariates.Main Outcome and MeasuresPsychiatric disorders derived from ICD10CM codes and grouped into phecodes across six broad domains: mood disorders, anxiety disorders, substance use disorders, stress-related disorders, schizophrenia, and personality disorders.ResultsThe prevalence of various disorders ranges from approximately 15% to less than 1%, with mood and anxiety disorders being the most common, followed by substance use disorders, stress-related disorders, schizophrenia, and personality disorders. There is substantial overlap among disorders, with a large portion of those with a disorder (~57%) having two or more registered diagnoses and tetrachoric correlations ranging from 0.43 – 0.74. The prevalence of disorders across demographic categories demonstrates that non-Hispanic whites, those of low socioeconomic status, women and those assigned female at birth, and sexual minorities are at greatest risk for most disorders.Conclusions and RelevanceAlthough the rates of disorders in All of Us are lower than rates for disorders in the general population, there is considerable variation, comorbidity, and differences across social groups. Large-scale resources like All of Us will prove to be invaluable for understanding the causes and consequences of psychiatric conditions. As we move towards an era of precision medicine, we must work to ensure it is delivered in an equitable manner.


2021 ◽  
Author(s):  
Pavel P. Kuksa ◽  
Prabhakaran Gangadharan ◽  
Zivadin Katanic ◽  
Lauren Kleidermacher ◽  
Alexandre Amlie-Wolf ◽  
...  

AbstractMotivationQuerying massive collections of functional genomic and annotation data, linking and summarizing the query results across data sources and data types are important steps in high-throughput genomic and genetic analytical workflows. However, accomplishing these steps is difficult because of the heterogeneity and breadth of data sources, experimental assays, biological conditions (e.g., tissues, cell types), data types, and file formats.ResultsFunctIonaL gEnomics Repository (FILER) is a large-scale, harmonized functional genomics data catalog uniquely providing: 1) streamlined access to >50,000 harmonized, annotated functional genomic and annotation datasets across >20 integrated data sources, >1,100 biological conditions/tissues/cell types, and >20 experimental assays; 2) a scalable, indexing-based genomic querying interface; 3) ability for users to analyze and annotate their own experimental data against reference datasets. This rich resource spans >17 Billion genomic records for both GRCh37/hg19 and GRCh38/hg38 genome builds. FILER scales well with the experimental (query) data size and the number of reference datasets and data sources. When evaluated on large-scale analysis tasks, FILER demonstrated great efficiency as the observed running time for querying 1000x more genomic intervals (106 vs. 103) against all 7×109 hg19 FILER records increased sub-linearly by only a factor of 15x. Together, these features facilitate reproducible research and streamline querying, integrating, and utilizing large-scale functional genomics and annotation data.Availability and implementationFILER can be 1) freely accessed at https://lisanwanglab.org/FILER,2) deployed on cloud or local servers (https://bitbucket.org/wanglab-upenn/FILER), and 3) integrated with other pipelines using provided scripts.


Author(s):  
Anne Opsal ◽  
Øistein Kristensen ◽  
Thomas Clausen

Abstract Background Health care workers in the addiction field have long emphasised the importance of a patient’s motivation on the outcome of treatments for substance use disorders (SUDs). Many patients entering treatment are not yet ready to make the changes required for recovery and are often unprepared or sometimes unwilling to modify their behaviour. The present study compared stages of readiness to change and readiness to seek help among patients with SUDs involuntarily and voluntarily admitted to treatment to investigate whether changes in the stages of readiness at admission predict drug control outcomes at follow-up. Methods This prospective study included 65 involuntarily and 137 voluntarily admitted patients treated in three addiction centres in Southern Norway. Patients were evaluated using the Europ-ASI, Readiness to Change Questionnaire (RTCQ), and Treatment Readiness Tool (TReaT). Results The involuntarily admitted patients had significantly lower levels of motivation to change than the voluntarily admitted patients at the time of admission (39% vs. 59%). The majority of both involuntarily and voluntarily admitted patients were in the highest stage (preparation) for readiness to seek help at admission and continued to be in this stage at discharge. The stage of readiness to change at admission did not predict abstinence at follow-up. The only significant predictor of ongoing drug use at 6 months was SUD severity at baseline. Conclusions The majority of involuntarily admitted patients scored high on motivation to seek help. Their motivation was stable at a fairly high level during their stay, and even improved in some patients. Thus, they were approaching the motivation stage similar to the voluntarily admitted patients at the end of hospitalization. Therapists should focus on both motivating patients in treatment and adapting the treatment according to SUD severity. Trial registration ClinicalTrials.gov, NCT00970372. Registered 1 September 2008, https://clinicaltrials.gov/ct2/show/NCT00970372. The trial was registered before the first participant was enrolled. The fist participant was enrolled September 02, 2009.


2012 ◽  
Author(s):  
L. Michelle Tuten ◽  
Hendree E. Jones ◽  
Cindy M. Schaeffer ◽  
Maxine L. Stitzer

2014 ◽  
Author(s):  
L. C. van Boekel ◽  
E. P. M. Brouwers ◽  
J. van Weeghel ◽  
H. F. L. Garretsen

Sign in / Sign up

Export Citation Format

Share Document