scholarly journals On Using Local Ancestry to Characterize the Genetic Architecture of Human Phenotypes: Genetic Regulation of Gene Expression in Multiethnic or Admixed Populations as a Model

2018 ◽  
Author(s):  
Yizhen Zhong ◽  
Minoli Perera ◽  
Eric R. Gamazon

AbstractBackgroundUnderstanding the nature of the genetic regulation of gene expression promises to advance our understanding of the genetic basis of disease. However, the methodological impact of use of local ancestry on high-dimensional omics analyses, including most prominently expression quantitative trait loci (eQTL) mapping and trait heritability estimation, in admixed populations remains critically underexplored.ResultsHere we develop a statistical framework that characterizes the relationships among the determinants of the genetic architecture of an important class of molecular traits. We estimate the trait variance explained by ancestry using local admixture relatedness between individuals. Using National Institute of General Medical Sciences (NIGMS) and Genotype-Tissue Expression (GTEx) datasets, we show that use of local ancestry can substantially improve eQTL mapping and heritability estimation and characterize the sparse versus polygenic component of gene expression in admixed and multiethnic populations respectively. Using simulations of diverse genetic architectures to estimate trait heritability and the level of confounding, we show improved accuracy given individual-level data and evaluate a summary statistics based approach. Furthermore, we provide a computationally efficient approach to local ancestry analysis in eQTL mapping while increasing control of type I and type II error over traditional approaches.ConclusionOur study has important methodological implications on genetic analysis of omics traits across a range of genomic contexts, from a single variant to a prioritized region to the entire genome. Our findings highlight the importance of using local ancestry to better characterize the heritability of complex traits and to more accurately map genetic associations.

Science ◽  
2019 ◽  
Vol 364 (6447) ◽  
pp. 1287-1290 ◽  
Author(s):  
B. J. Strober ◽  
R. Elorbany ◽  
K. Rhodes ◽  
N. Krishnan ◽  
K. Tayeb ◽  
...  

Genetic regulation of gene expression is dynamic, as transcription can change during cell differentiation and across cell types. We mapped expression quantitative trait loci (eQTLs) throughout differentiation to elucidate the dynamics of genetic effects on cell type–specific gene expression. We generated time-series RNA sequencing data, capturing 16 time points during the differentiation of induced pluripotent stem cells to cardiomyocytes, in 19 human cell lines. We identified hundreds of dynamic eQTLs that change over time, with enrichment in enhancers of relevant cell types. We also found nonlinear dynamic eQTLs, which affect only intermediate stages of differentiation and cannot be found by using data from mature tissues. These fleeting genetic associations with gene regulation may explain some of the components of complex traits and disease. We highlight one example of a nonlinear eQTL that is associated with body mass index.


2018 ◽  
Author(s):  
Sini Nagpal ◽  
Xiaoran Meng ◽  
Michael P. Epstein ◽  
Lam C. Tsoi ◽  
Matthew Patrick ◽  
...  

AbstractThe transcriptome-wide association studies (TWAS) that test for association between the study trait and the imputed gene expression levels from cis-acting expression quantitative trait loci (cis-eQTL) genotypes have successfully enhanced the discovery of genetic risk loci for complex traits. By using the gene expression imputation models fitted from reference datasets that have both genetic and transcriptomic data, TWAS facilitates gene-based tests with GWAS data while accounting for the reference transcriptomic data. The existing TWAS tools like PrediXcan and FUSION use parametric imputation models that have limitations for modeling the complex genetic architecture of transcriptomic data. Therefore, we propose an improved Bayesian method that assumes a data-driven nonparametric prior to impute gene expression. Our method is general and flexible and includes both the parametric imputation models used by PrediXcan and FUSION as special cases. Our simulation studies showed that the nonparametric Bayesian model improved both imputation R2 for transcriptomic data and the TWAS power over PrediXcan. In real applications, our nonparametric Bayesian method fitted transcriptomic imputation models for 2X number of genes with 1.7X average regression R2 over PrediXcan, thus improving the power of follow-up TWAS. Hence, the nonparametric Bayesian model is preferred for modeling the complex genetic architecture of transcriptomes and is expected to enhance transcriptome-integrated genetic association studies. We implement our Bayesian approach in a convenient software tool “TIGAR” (Transcriptome-Integrated Genetic Association Resource), which imputes transcriptomic data and performs subsequent TWAS using individual-level or summary-level GWAS data.


Cells ◽  
2021 ◽  
Vol 10 (10) ◽  
pp. 2611
Author(s):  
Jayron J. Habibe ◽  
Maria P. Clemente-Olivo ◽  
Carlie J. de Vries

Susceptibility to complex pathological conditions such as obesity, type 2 diabetes and cardiovascular disease is highly variable among individuals and arises from specific changes in gene expression in combination with external factors. The regulation of gene expression is determined by genetic variation (SNPs) and epigenetic marks that are influenced by environmental factors. Aging is a major risk factor for many multifactorial diseases and is increasingly associated with changes in DNA methylation, leading to differences in gene expression. Four and a half LIM domains 2 (FHL2) is a key regulator of intracellular signal transduction pathways and the FHL2 gene is consistently found as one of the top hyper-methylated genes upon aging. Remarkably, FHL2 expression increases with methylation. This was demonstrated in relevant metabolic tissues: white adipose tissue, pancreatic β-cells, and skeletal muscle. In this review, we provide an overview of the current knowledge on regulation of FHL2 by genetic variation and epigenetic DNA modification, and the potential consequences for age-related complex multifactorial diseases.


2020 ◽  
Vol 35 (2) ◽  
pp. 377-393 ◽  
Author(s):  
Sally Mortlock ◽  
Raden I Kendarsari ◽  
Jenny N Fung ◽  
Greg Gibson ◽  
Fei Yang ◽  
...  

Abstract STUDY QUESTION Are genetic effects on endometrial gene expression tissue specific and/or associated with reproductive traits and diseases? SUMMARY ANSWER Analyses of RNA-sequence data and individual genotype data from the endometrium identified novel and disease associated, genetic mechanisms regulating gene expression in the endometrium and showed evidence that these mechanisms are shared across biologically similar tissues. WHAT IS KNOWN ALREADY The endometrium is a complex tissue vital for female reproduction and is a hypothesized source of cells initiating endometriosis. Understanding genetic regulation specific to, and shared between, tissue types can aid the identification of genes involved in complex genetic diseases. STUDY DESIGN, SIZE, DURATION RNA-sequence and genotype data from 206 individuals was analysed and results were compared with large publicly available datasets. PARTICIPANTS/MATERIALS, SETTING, METHODS RNA-sequencing and genotype data from 206 endometrial samples was used to identify the influence of genetic variants on gene expression, via expression quantitative trait loci (eQTL) analysis and to compare these endometrial eQTLs with those in other tissues. To investigate the association between endometrial gene expression regulation and reproductive traits and diseases, we conducted a tissue enrichment analysis, transcriptome-wide association study (TWAS) and summary data-based Mendelian randomisation (SMR) analyses. Transcriptomic data was used to test differential gene expression between women with and without endometriosis. MAIN RESULTS AND THE ROLE OF CHANCE A tissue enrichment analysis with endometriosis genome-wide association study summary statistics showed that genes surrounding endometriosis risk loci were significantly enriched in reproductive tissues. A total of 444 sentinel cis-eQTLs (P < 2.57 × 10−9) and 30 trans-eQTLs (P < 4.65 × 10−13) were detected, including 327 novel cis-eQTLs in endometrium. A large proportion (85%) of endometrial eQTLs are present in other tissues. Genetic effects on endometrial gene expression were highly correlated with the genetic effects on reproductive (e.g. uterus, ovary) and digestive tissues (e.g. salivary gland, stomach), supporting a shared genetic regulation of gene expression in biologically similar tissues. The TWAS analysis indicated that gene expression at 39 loci is associated with endometriosis, including five known endometriosis risk loci. SMR analyses identified potential target genes pleiotropically or causally associated with reproductive traits and diseases including endometriosis. However, without taking account of genetic variants, a direct comparison between women with and without endometriosis showed no significant difference in endometrial gene expression. LARGE SCALE DATA The eQTL dataset generated in this study is available at http://reproductivegenomics.com.au/shiny/endo_eqtl_rna/. Additional datasets supporting the conclusions of this article are included within the article and the supplementary information files, or are available on reasonable request. LIMITATIONS, REASONS FOR CAUTION Data are derived from fresh tissue samples and expression levels are an average of expression from different cell types within the endometrium. Subtle cell-specifc expression changes may not be detected and differences in cell composition between samples and across the menstrual cycle will contribute to sample variability. Power to detect tissue specific eQTLs and differences between women with and without endometriosis was limited by the sample size in this study. The statistical approaches used in this study identify the likely gene targets for specific genetic risk factors, but not the functional mechanism by which changes in gene expression may influence disease risk. WIDER IMPLICATIONS OF THE FINDINGS Our results identify novel genetic variants that regulate gene expression in endometrium and the majority of these are shared across tissues. This allows analysis with large publicly available datasets to identify targets for female reproductive traits and diseases. Much larger studies will be required to identify genetic regulation of gene expression that will be specific to endometrium. STUDY FUNDING/COMPETING INTEREST(S) This work was supported by the National Health and Medical Research Council (NHMRC) under project grants GNT1026033, GNT1049472, GNT1046880, GNT1050208, GNT1105321, GNT1083405 and GNT1107258. G.W.M is supported by a NHMRC Fellowship (GNT1078399). J.Y is supported by an ARC Fellowship (FT180100186). There are no competing interests.


Genetics ◽  
2019 ◽  
Vol 212 (3) ◽  
pp. 905-918 ◽  
Author(s):  
Biao Zeng ◽  
Luke R. Lloyd-Jones ◽  
Grant W. Montgomery ◽  
Andres Metspalu ◽  
Tonu Esko ◽  
...  

Expression QTL (eQTL) detection has emerged as an important tool for unraveling the relationship between genetic risk factors and disease or clinical phenotypes. Most studies are predicated on the assumption that only a single causal variant explains the association signal in each interval. This greatly simplifies the statistical modeling, but is liable to biases in scenarios where multiple local causal-variants are responsible. Here, our primary goal was to address the prevalence of secondary cis-eQTL signals regulating peripheral blood gene expression locally, utilizing two large human cohort studies, each >2500 samples with accompanying whole genome genotypes. The CAGE (Consortium for the Architecture of Gene Expression) dataset is a compendium of Illumina microarray studies, and the Framingham Heart Study is a two-generation Affymetrix dataset. We also describe Bayesian colocalization analysis of the extent of sharing of cis-eQTL detected in both studies as well as with the BIOS RNAseq dataset. Stepwise conditional modeling demonstrates that multiple eQTL signals are present for ∼40% of over 3500 eGenes in both microarray datasets, and that the number of loci with additional signals reduces by approximately two-thirds with each conditioning step. Although <20% of the peak signals across platforms fine map to the same credible interval, the colocalization analysis finds that as many as 50–60% of the primary eQTL are actually shared. Subsequently, colocalization of eQTL signals with GWAS hits detected 1349 genes whose expression in peripheral blood is associated with 591 human phenotype traits or diseases, including enrichment for genes with regulatory functions. At least 10%, and possibly as many as 40%, of eQTL-trait colocalized signals are due to nonprimary cis-eQTL peaks, but just one-quarter of these colocalization signals replicated across the gene expression datasets. Our results are provided as a web-based resource for visualization of multi-site regulation of gene expression and its association with human complex traits and disease states.


2017 ◽  
Vol 100 (6) ◽  
pp. 985-986 ◽  
Author(s):  
Chen Yao ◽  
Roby Joehanes ◽  
Andrew D. Johnson ◽  
Tianxiao Huan ◽  
Chunyu Liu ◽  
...  

2018 ◽  
Author(s):  
Heather E. Wheeler ◽  
Sally Ploch ◽  
Alvaro N. Barbeira ◽  
Rodrigo Bonazzola ◽  
Angela Andaleon ◽  
...  

AbstractRegulation of gene expression is an important mechanism through which genetic variation can affect complex traits. A substantial portion of gene expression variation can be explained by both local (cis) and distal (trans) genetic variation. Much progress has been made in uncovering cis-acting expression quantitative trait loci (cis-eQTL), but trans-eQTL have been more difficult to identify and replicate. Here we take advantage of our ability to predict the cis component of gene expression coupled with gene mapping methods such as PrediXcan to identify high confidence candidate trans-acting genes and their targets. That is, we correlate the cis component of gene expression with observed expression of genes in different chromosomes. Leveraging the shared cis-acting regulation across tissues, we combine the evidence of association across all available GTEx tissues and find 2356 trans-acting/target gene pairs with high mappability scores. Reassuringly, trans-acting genes are enriched in transcription and nucleic acid binding pathways and target genes are enriched in known transcription factor binding sites. Interestingly, trans-acting genes are more significantly associated with selected complex traits and diseases than target or background genes, consistent with percolating trans effects. Our scripts and summary statistics are publicly available for future studies of trans-acting gene regulation.


2017 ◽  
Author(s):  
Douglas R. Wilson ◽  
Wei Sun ◽  
Joseph G. Ibrahim

AbstractThe study of gene expression quantitative trait loci (eQTL) is an effective approach to illuminate the functional roles of genetic variants. Computational methods have been developed for eQTL mapping using gene expression data from microarray or RNA-seq technology. Application of these methods for eQTL mapping in tumor tissues is problematic because tumor tissues are composed of both tumor and infiltrating normal cells (e.g. immune cells) and eQTL effects may vary between tumor and infiltrating normal cells. To address this challenge, we have developed a new method for eQTL mapping using RNA-seq data from tumor samples. Our method separately estimates the eQTL effects in tumor and infiltrating normal cells using both total expression and allele-specific expression (ASE). We demonstrate that our method controls type I error rate and has higher power than some alternative approaches. We applied our method to study RNA-seq data from The Cancer Genome Atlas and illustrated the similarities and differences of eQTL effects in tumor and normal cells.


Sign in / Sign up

Export Citation Format

Share Document