Leveraging brain cortex-derived molecular data to elucidate epigenetic and transcriptomic drivers of neurological function and disease

Mapping Intimacies ◽

10.1101/429134 ◽

2018 ◽

Author(s):

Charlie Hatcher ◽

Caroline L. Relton ◽

Tom R. Gaunt ◽

Tom G. Richardson

Keyword(s):

Gene Expression ◽

Dna Methylation ◽

Histone Acetylation ◽

Genetic Variants ◽

Large Scale ◽

Genetic Variant ◽

Association Studies ◽

Genome Wide Association Studies ◽

Epigenetic Mechanisms ◽

Trait Variation

AbstractIntegrative approaches which harness large-scale molecular datasets can help develop mechanistic insight into findings from genome-wide association studies (GWAS). We have performed extensive analyses to uncover transcriptional and epigenetic processes which may play a role in neurological trait variation.This was undertaken by applying Bayesian multiple-trait colocalization systematically across the genome to identify genetic variants responsible for influencing intermediate molecular phenotypes as well as neurological traits. In this analysis we leveraged high dimensional quantitative trait loci data derived from prefrontal cortex tissue (concerning gene expression, DNA methylation and histone acetylation) and GWAS findings for 5 neurological traits (Neuroticism, Schizophrenia, Educational Attainment, Insomnia and Alzheimer’s disease).There was evidence of colocalization for 118 associations suggesting that the same underlying genetic variant influenced both nearby gene expression as well as neurological trait variation. Of these, 73 associations provided evidence that the genetic variant also influenced proximal DNA methylation and/or histone acetylation. These findings support previous evidence at loci where epigenetic mechanisms may putatively mediate effects of genetic variants on traits, such as KLC1 and schizophrenia. We also uncovered evidence implicating novel loci in neurological disease susceptibility, including genes expressed predominantly in brain tissue such as MDGA1, KIRREL3 and SLC12A5.An inverse relationship between DNA methylation and gene expression was observed more than can be accounted for by chance, supporting previous findings implicating DNA methylation as a transcriptional repressor. Our study should prove valuable in helping future studies prioritise candidate genes and epigenetic mechanisms for in-depth functional follow-up analyses.

FarmCPUpp: Efficient Large-Scale GWAS

10.1101/238832 ◽

2017 ◽

Author(s):

Aaron Kusmec ◽

Patrick S. Schnable

Keyword(s):

Genetic Variants ◽

Memory Management ◽

Large Scale ◽

Association Studies ◽

Genome Wide Association Studies ◽

Trait Variation ◽

R Programming Language ◽

Genome Wide ◽

R Programming ◽

Dense Marker

AbstractGenome-wide association studies (GWAS) are computationally demanding analyses that use large sample sizes and dense marker sets to discover associations between quantitative trait variation and genetic variants. FarmCPU is a powerful new method for performing GWAS. However, its performance is hampered by details of its implementation and its reliance on the R programming language. In this paper we present an efficient implementation of FarmCPU, called FarmCPUpp, that retains the R user interface but improves memory management and speed through the use of C++ code and parallel computing.

A transcriptome-wide Mendelian randomization study to uncover tissue-dependent regulatory mechanisms across the human phenome

10.1101/563379 ◽

2019 ◽

Cited By ~ 2

Author(s):

Tom G Richardson ◽

Gibran Hemani ◽

Tom R Gaunt ◽

Caroline L Relton ◽

George Davey Smith

Keyword(s):

Gene Expression ◽

Genetic Variants ◽

Complex Traits ◽

Mendelian Randomization ◽

Drug Repositioning ◽

Association Studies ◽

Thyroid Tissue ◽

Genome Wide Association Studies ◽

Tissue Specific ◽

Genome Wide

AbstractBackgroundDeveloping insight into tissue-specific transcriptional mechanisms can help improve our understanding of how genetic variants exert their effects on complex traits and disease. By applying the principles of Mendelian randomization, we have undertaken a systematic analysis to evaluate transcriptome-wide associations between gene expression across 48 different tissue types and 395 complex traits.ResultsOverall, we identified 100,025 gene-trait associations based on conventional genome-wide corrections (P < 5 × 10−08) that also provided evidence of genetic colocalization. These results indicated that genetic variants which influence gene expression levels in multiple tissues are more likely to influence multiple complex traits. We identified many examples of tissue-specific effects, such as genetically-predicted TPO, NR3C2 and SPATA13 expression only associating with thyroid disease in thyroid tissue. Additionally, FBN2 expression was associated with both cardiovascular and lung function traits, but only when analysed in heart and lung tissue respectively.We also demonstrate that conducting phenome-wide evaluations of our results can help flag adverse on-target side effects for therapeutic intervention, as well as propose drug repositioning opportunities. Moreover, we find that exploring the tissue-dependency of associations identified by genome-wide association studies (GWAS) can help elucidate the causal genes and tissues responsible for effects, as well as uncover putative novel associations.ConclusionsThe atlas of tissue-dependent associations we have constructed should prove extremely valuable to future studies investigating the genetic determinants of complex disease. The follow-up analyses we have performed in this study are merely a guide for future research. Conducting similar evaluations can be undertaken systematically at http://mrcieu.mrsoftware.org/Tissue_MR_atlas/.

Integrative Genomic Analysis for the Discovery of Biomarkers in Prostate Cancer

Biomarker Insights ◽

10.4137/bmi.s13729 ◽

2014 ◽

Vol 9 ◽

pp. BMI.S13729 ◽

Cited By ~ 5

Author(s):

Chindo Hicks ◽

Tejaswi Koganti ◽

Shankar Giri ◽

Memory Tekere ◽

Ritika Ramani ◽

...

Keyword(s):

Gene Expression ◽

Prostate Cancer ◽

Gene Expression Data ◽

Genetic Variants ◽

Association Studies ◽

Biological Pathways ◽

Great Success ◽

Genome Wide Association Studies ◽

Expression Data ◽

Increased Risk

Genome-wide association studies (GWAS) have achieved great success in identifying single nucleotide polymorphisms (SNPs, herein called genetic variants) and genes associated with risk of developing prostate cancer. However, GWAS do not typically link the genetic variants to the disease state or inform the broader context in which the genetic variants operate. Here, we present a novel integrative genomics approach that combines GWAS information with gene expression data to infer the causal association between gene expression and the disease and to identify the network states and biological pathways enriched for genetic variants. We identified gene regulatory networks and biological pathways enriched for genetic variants, including the prostate cancer, IGF-1, JAK2, androgen, and prolactin signaling pathways. The integration of GWAS information with gene expression data provides insights about the broader context in which genetic variants associated with an increased risk of developing prostate cancer operate.

Prioritization of SNPs for genome-wide association studies using an interaction model of genetic variation, gene expression, and trait variation

Molecules and Cells ◽

10.1007/s10059-012-2264-7 ◽

2012 ◽

Vol 33 (4) ◽

pp. 351-361 ◽

Cited By ~ 1

Author(s):

Hyojung Paik ◽

Junho Kim ◽

Sunjae Lee ◽

Hyoung-Sam Heo ◽

Cheol-Goo Hur ◽

...

Keyword(s):

Gene Expression ◽

Genetic Variation ◽

Association Studies ◽

Interaction Model ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Trait Variation ◽

Genome Wide

circVAR database: genome-wide archive of genetic variants for human circular RNAs

10.21203/rs.3.rs-48904/v2 ◽

2020 ◽

Author(s):

Min Zhao ◽

Hong Qu

Keyword(s):

Genetic Variants ◽

Complex Traits ◽

Large Scale ◽

Rna Binding ◽

Rna Binding Proteins ◽

Association Studies ◽

Chromosome 17 ◽

Circular Rnas ◽

Genome Wide Association Studies ◽

Genome Wide

Abstract Background: Circular RNAs (circRNAs) play important roles in regulating gene expression through binding miRNAs and RNA binding proteins. Genetic variation of circRNAs may affect complex traits/diseases by changing their binding efficiency to target miRNAs and proteins. There is a growing demand for investigations of the functions of genetic changes using large-scale experimental evidence. However, there is no online genetic resource for circRNA genes. Results: We performed extensive genetic annotation of 295,526 circRNAs integrated from circBase, circNet and circRNAdb. All pre-computed genetic variants were presented at our online resource, circVAR, with data browsing and search functionality. We explored the chromosome-based distribution of circRNAs and their associated variants. We found that, based on mapping to the 1000 Genomes and ClinVAR databases, chromosome 17 has a relatively large number of circRNAs and associated common and health-related genetic variants. Following the annotation of genome wide association studies (GWAS)-based circRNA variants, we found many non-coding variants within circRNAs, suggesting novel mechanisms for common diseases reported from GWAS studies. For cancer-based somatic variants, we found that chromosome 7 has many highly complex mutations that have been overlooked in previous research. Conclusion: We used the circVAR database to collect SNPs and small insertions and deletions (INDELs) in putative circRNA regions and to identify their potential phenotypic information. To provide a reusable resource for the circRNA research community, we have published all the pre-computed genetic data concerning circRNAs and associated genes together with data query and browsing functions at http://soft.bioinfo-minzhao.org/circvar .

A novel quantile regression approach for eQTL discovery

10.1101/070052 ◽

2016 ◽

Author(s):

Xiaoyu Song ◽

Gen Li ◽

Iuliana Ionita-Laza ◽

Ying Wei

Keyword(s):

Gene Expression ◽

Genetic Variation ◽

Linear Regression ◽

Genetic Variants ◽

Molecular Mechanisms ◽

Association Studies ◽

Expression Level ◽

Special Focus ◽

Genome Wide Association Studies ◽

Genome Wide

AbstractOver the past decade, there has been a remarkable improvement in our understanding of the role of genetic variation in complex human diseases, especially via genome-wide association studies. However, the underlying molecular mechanisms are still poorly characterized, impending the development of therapeutic interventions. Identifying genetic variants that influence the expression level of a gene, i.e. expression quantitative trait loci (eQTLs), can help us understand how genetic variants influence traits at the molecular level. While most eQTL studies focus on identifying mean effects on gene expression using linear regression, evidence suggests that genetic variation can impact the entire distribution of the expression level. Indeed, several studies have already investigated higher order associations with a special focus on detecting heteroskedasticity. In this paper, we develop a Quantile Rank-score Based Test (QRBT) to identify eQTLs that are associated with the conditional quantile functions of gene expression. We have applied the proposed QRBT to the Genotype-Tissue Expression project, an international tissue bank for studying the relationship between genetic variation and gene expression in human tissues, and found that the proposed QRBT complements the existing methods, and identifies new eQTLs with heterogeneous effects genome-wideacross different quantile levels. Notably, we show that the eQTLs identified by QRBT but missed by linear regression are more likely to be tissue specific, and also associated with greater enrichment in genome-wide significant SNPs from the GWAS catalog. An R package implementing QRBT is available on our website.

Functional Genetic Biomarkers of Alzheimer’s Disease and Gene Expression from Peripheral Blood

10.1101/2021.01.15.426891 ◽

2021 ◽

Author(s):

Andrew Ni ◽

Amish Sethi ◽

Keyword(s):

Gene Expression ◽

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Peripheral Blood ◽

Genetic Variants ◽

Cell Activation ◽

Association Studies ◽

Gene Set Enrichment Analysis ◽

Machine Learning Techniques ◽

Genome Wide Association Studies

AbstractDetecting Alzheimer’s Disease (AD) at the earliest possible stage is key in advancing AD prevention and treatment but is challenged by normal aging processes in addition to other confounding neurodegenerative diseases. Recent genome-wide association studies (GWAS) have identified associated alleles, but it has been difficult to transition from non-coding genetic variants to underlying mechanisms of AD. Here, we sought to reveal functional genetic variants and diagnostic biomarkers underlying AD using machine learning techniques. We first developed a Random Forest (RF) classifier using microarray gene expression data sampled from the peripheral blood of 744 participants in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort. After initial feature selection, 5-fold cross-validation of the 100-gene RF classifier achieved an accuracy of 99.04%. The high accuracy of the RF classifier supports the possibility of a powerful and minimally invasive tool for screening of AD. Next, unsupervised clustering was used to validate and identify relationships among differentially expressed genes (DEGs) the RF selected revealing 3 distinct AD clusters. Results suggest downregulation of global sulfatase and oxidoreductase activities in AD through mutations in SUMF1 and SMOX respectively. Then, we used Greedy Fast Causal Inference (GFCI) to find potential causes of AD within DEGs. In the causal graph, HLA-DPB1 and CYP4A11 emerge as hub genes, furthering the discussion of the immune system’s role in AD. Finally, we used Gene Set Enrichment Analysis (GSEA) to determine the biological pathways and processes underlying the DEGs that were highly correlated with AD. Cell activation in the immune system, glycosaminoglycan (GAG) binding, vascular dysfunction, oxidative stress, and the neuronal apoptotic process were revealed to be significantly enriched in AD. This study further advances the possibility of low-cost and noninvasive genetic screening for AD while also providing potential gene targets for further experimentation.

Large-scale integration of DNA methylation and gene expression array platforms

10.1101/2021.08.12.455267 ◽

2021 ◽

Author(s):

Eva E Lancaster ◽

Vladimir I Vladimirov ◽

Brien P Riley ◽

Joseph W Landry ◽

Roxann Roberson-Nay ◽

...

Keyword(s):

Gene Expression ◽

Dna Methylation ◽

Large Scale ◽

Association Studies ◽

Adolescent And Young Adult ◽

Expression Array ◽

Cpg Sites ◽

Disease States ◽

Large Scale Integration ◽

Scale Integration

Epigenome-wide association studies (EWAS) aim to provide evidence that marks of DNA methylation (DNAm) have downstream consequences that can result in the development of human diseases. Although these methods have been successful in identifying DNAm patterns associated with disease states, any further characterization of etiologic mechanisms remains elusive. This knowledge gap does not originate from a lack of DNAm-trait associations, but rather stems from study design issues that affect the interpretability of EWAS results. Despite known limitations in predicting the function of a particular CpG site, most EWAS maintain the broad assumption that altered DNAm results in a concomitant change of transcription at the most proximal gene. This study integrated DNAm and gene expression (GE) measurements in two cohorts, the Adolescent and Young Adult Twin Study (AYATS) and the Pregnancy, Race, Environment, Genes (PREG) study, to improve the understanding of epigenomic regulatory mechanisms. CpG sites associated with GE in cis were enriched in areas of transcription factor binding and areas of intermediate-to-low CpG density. CpG sites associated with trans GE were also enriched in areas of known regulatory significance, including enhancer regions. These results highlight issues with restricting DNAm-transcript annotations to small genomic intervals and question the validity of assuming a canonical cis DNAm-GE pathway. Based on these findings, the interpretation of EWAS results is limited in studies without multi-omic support and further research should identify genomic regions in which GE-associated DNAm is overrepresented.

Genetic correlations between subcortical brain volumes and psychiatric disorders

The British Journal of Psychiatry ◽

10.1192/bjp.2019.277 ◽

2020 ◽

Vol 216 (5) ◽

pp. 280-283

Author(s):

Kazutaka Ohi ◽

Takamitsu Shimada ◽

Yuzuru Kataoka ◽

Toshiki Yasuyama ◽

Yasuhiro Kawasaki ◽

...

Keyword(s):

Psychiatric Disorders ◽

Genetic Variants ◽

Large Scale ◽

Association Studies ◽

Genetic Correlations ◽

Intracranial Volume ◽

Genome Wide Association Studies ◽

Brain Volumes ◽

Subcortical Brain ◽

Genome Wide

SummaryPsychiatric disorders as well as subcortical brain volumes are highly heritable. Large-scale genome-wide association studies (GWASs) for these traits have been performed. We investigated the genetic correlations between five psychiatric disorders and the seven subcortical brain volumes and the intracranial volume from large-scale GWASs by linkage disequilibrium score regression. We revealed weak overlaps between the genetic variants associated with psychiatric disorders and subcortical brain and intracranial volumes, such as in schizophrenia and the hippocampus and bipolar disorder and the accumbens. We confirmed shared aetiology and polygenic architecture across the psychiatric disorders and the specific subcortical brain and intracranial volume.

Transcriptome-wide association supplements genome-wide association in Zea mays

10.1101/363242 ◽

2018 ◽

Cited By ~ 4

Author(s):

Karl A. G. Kremling ◽

Christine H. Diepenbrock ◽

Michael A. Gore ◽

Edward S. Buckler ◽

Nonoy B. Bandillo

Keyword(s):

Gene Expression ◽

Complex Traits ◽

Large Scale ◽

New Technologies ◽

Seed Quality ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Biological Organization ◽

Genome Wide

AbstractModern improvement of complex traits in agricultural species relies on successful associations of heritable molecular variation with observable phenotypes. Historically, this pursuit has primarily been based on easily measurable genetic markers. The recent advent of new technologies allows assaying and quantifying biological intermediates (hereafter endophenotypes) which are now readily measurable at a large scale across diverse individuals. The potential of using endophenotypes for dissecting traits of interest remains underexplored in plants. The work presented here illustrated the utility of a large-scale (299 genotype and 7 tissue) gene expression resource to dissect traits across multiple levels of biological organization. Using single-tissue- and multi-tissue-based transcriptome-wide association studies (TWAS), we revealed that about half of the functional variation for agronomic and seed quality (carotenoid, tocochromanol) traits is regulatory. Comparing the efficacy of TWAS with genome-wide association studies (GWAS) and an ensemble approach that combines both GWAS and TWAS, we demonstrated that results of TWAS in combination with GWAS increase the power to detect known genes and aid in prioritizing likely causal genes. Using a variance partitioning approach in the independent maize Nested Association Mapping (NAM) population, we also showed that the most strongly associated genes identified by combining GWAS and TWAS explain more heritable variance for a majority of traits, beating the heritability captured by the random genes and the genes identified by GWAS or TWAS alone. This improves not only the ability to link genes to phenotypes, but also highlights the phenotypic consequences of regulatory variation in plants.Author summaryWe examined the ability to associate variability in gene expression directly with terminal phenotypes of interest, as a supplement linking genotype to phenotype. We found that transcriptome-wide association studies (TWAS) are a useful accessory to genome-wide association studies (GWAS). In a combined test with GWAS results, TWAS improves the capacity to re-detect genes known to underlie quantitative trait loci for kernel and agronomic phenotypes. This improves not only the capacity to link genes to phenotypes, but also illustrates the widespread importance of regulation for phenotype.