scholarly journals Diverse environmental perturbations reveal the evolution and context-dependency of genetic effects on gene expression levels

2021 ◽  
Author(s):  
Amanda J Lea ◽  
Julie Peng ◽  
Julien J Ayroles

There is increasing appreciation that human complex traits are determined by poorly understood interactions between our genomes and daily environments. These "genotype x environment" (GxE) interactions remain difficult to map at the organismal level, but can be uncovered using molecular phenotypes. To do so at large-scale, we profiled transcriptomes across 12 cellular environments using 544 immortalized B cell lines from the 1000 Genomes Project. We mapped the genetic basis of gene expression across environments and revealed a context-dependent genetic architecture: the average heritability of gene expression levels increased in treatment relative to control conditions and, on average, each treatment revealed expression quantitative trait loci (eQTL) at 11% of genes. In total, 22% of all eQTL were context-dependent, and this group was enriched for trait- and disease-associated loci. Further, evolutionary analyses revealed that positive selection has shaped GxE loci involved in responding to immune challenges and hormones, but not man-made chemicals, suggesting there is reduced opportunity for selection to act on responses to molecules recently introduced into human environments. Together, our work highlights the importance of considering an exposure's evolutionary history when studying and interpreting GxE interactions, and provides new insight into the evolutionary mechanisms that maintain GxE loci in human populations.

2021 ◽  
Vol 12 ◽  
Author(s):  
Cheng Gao ◽  
Hairong Wei ◽  
Kui Zhang

Characterization of genetic variations that are associated with gene expression levels is essential to understand cellular mechanisms that underline human complex traits. Expression quantitative trait loci (eQTL) mapping attempts to identify genetic variants, such as single nucleotide polymorphisms (SNPs), that affect the expression of one or more genes. With the availability of a large volume of gene expression data, it is necessary and important to develop fast and efficient statistical and computational methods to perform eQTL mapping for such large scale data. In this paper, we proposed a new method, the low rank penalized regression method (LORSEN), for eQTL mapping. We evaluated and compared the performance of LORSEN with two existing methods for eQTL mapping using extensive simulations as well as real data from the HapMap3 project. Simulation studies showed that our method outperformed two commonly used methods for eQTL mapping, LORS and FastLORS, in many scenarios in terms of area under the curve (AUC). We illustrated the usefulness of our method by applying it to SNP variants data and gene expression levels on four chromosomes from the HapMap3 Project.


PLoS ONE ◽  
2008 ◽  
Vol 3 (10) ◽  
pp. e3344 ◽  
Author(s):  
Mark D. Alter ◽  
Daniel B. Rubin ◽  
Keri Ramsey ◽  
Rebecca Halpern ◽  
Dietrich A. Stephan ◽  
...  

2017 ◽  
Author(s):  
Luke J. O’Connor ◽  
Alexander Gusev ◽  
Xuanyao Liu ◽  
Po-Ru Loh ◽  
Hilary K. Finucane ◽  
...  

AbstractDisease risk variants identified by GWAS are predominantly noncoding, suggesting that gene regulation plays an important role. eQTL studies in unaffected individuals are often used to link disease-associated variants with the genes they regulate, relying on the hypothesis that noncoding regulatory effects are mediated by steady-state expression levels. To test this hypothesis, we developed a method to estimate the proportion of disease heritability mediated by the cis-genetic component of assayed gene expression levels. The method, gene expression co-score regression (GECS regression), relies on the idea that, for a gene whose expression level affects a phenotype, SNPs with similar effects on the expression of that gene will have similar phenotypic effects. In order to distinguish directional effects mediated by gene expression from non-directional pleiotropic or tagging effects, GECS regression operates on pairs of cis SNPs in linkage equilibrium, regressing pairwise products of disease effect sizes on products of cis-eQTL effect sizes. We verified that GECS regression produces robust estimates of mediated effects in simulations. We applied the method to eQTL data in 44 tissues from the GTEx consortium (average NeQTL = 158 samples) in conjunction with GWAS summary statistics for 30 diseases and complex traits (average NGWAS = 88K) with low pairwise genetic correlation, estimating the proportion of SNP-heritability mediated by the cis-genetic component of assayed gene expression in the union of the 44 tissues. The mean estimate was 0.21 (s.e. = 0.01) across 30 traits, with a significantly positive estimate (p < 0.001) for every trait. Thus, assayed gene expression in bulk tissues mediates a statistically significant but modest proportion of disease heritability, motivating the development of additional assays to capture regulatory effects and the use of our method to estimate how much disease heritability they mediate.


PLoS ONE ◽  
2013 ◽  
Vol 8 (8) ◽  
pp. e71462 ◽  
Author(s):  
Yan Guo ◽  
Quanhu Sheng ◽  
Jiang Li ◽  
Fei Ye ◽  
David C. Samuels ◽  
...  

Genes ◽  
2021 ◽  
Vol 12 (6) ◽  
pp. 854
Author(s):  
Yishu Wang ◽  
Lingyun Xu ◽  
Dongmei Ai

DNA methylation is an important regulator of gene expression that can influence tumor heterogeneity and shows weak and varying expression levels among different genes. Gastric cancer (GC) is a highly heterogeneous cancer of the digestive system with a high mortality rate worldwide. The heterogeneous subtypes of GC lead to different prognoses. In this study, we explored the relationships between DNA methylation and gene expression levels by introducing a sparse low-rank regression model based on a GC dataset with 375 tumor samples and 32 normal samples from The Cancer Genome Atlas database. Differences in the DNA methylation levels and sites were found to be associated with differences in the expressed genes related to GC development. Overall, 29 methylation-driven genes were found to be related to the GC subtypes, and in the prognostic model, we explored five prognoses related to the methylation sites. Finally, based on a low-rank matrix, seven subgroups were identified with different methylation statuses. These specific classifications based on DNA methylation levels may help to account for heterogeneity and aid in personalized treatments.


2021 ◽  
Vol 15 (1) ◽  
Author(s):  
Weitong Cui ◽  
Huaru Xue ◽  
Lei Wei ◽  
Jinghua Jin ◽  
Xuewen Tian ◽  
...  

Abstract Background RNA sequencing (RNA-Seq) has been widely applied in oncology for monitoring transcriptome changes. However, the emerging problem that high variation of gene expression levels caused by tumor heterogeneity may affect the reproducibility of differential expression (DE) results has rarely been studied. Here, we investigated the reproducibility of DE results for any given number of biological replicates between 3 and 24 and explored why a great many differentially expressed genes (DEGs) were not reproducible. Results Our findings demonstrate that poor reproducibility of DE results exists not only for small sample sizes, but also for relatively large sample sizes. Quite a few of the DEGs detected are specific to the samples in use, rather than genuinely differentially expressed under different conditions. Poor reproducibility of DE results is mainly caused by high variation of gene expression levels for the same gene in different samples. Even though biological variation may account for much of the high variation of gene expression levels, the effect of outlier count data also needs to be treated seriously, as outlier data severely interfere with DE analysis. Conclusions High heterogeneity exists not only in tumor tissue samples of each cancer type studied, but also in normal samples. High heterogeneity leads to poor reproducibility of DEGs, undermining generalization of differential expression results. Therefore, it is necessary to use large sample sizes (at least 10 if possible) in RNA-Seq experimental designs to reduce the impact of biological variability and DE results should be interpreted cautiously unless soundly validated.


Agronomy ◽  
2021 ◽  
Vol 11 (1) ◽  
pp. 92
Author(s):  
Joon Seon Lee ◽  
Lexuan Gao ◽  
Laura Melissa Guzman ◽  
Loren H. Rieseberg

Approximately 10% of agricultural land is subject to periodic flooding, which reduces the growth, survivorship, and yield of most crops, reinforcing the need to understand and enhance flooding resistance in our crops. Here, we generated RNA-Seq data from leaf and root tissue of domesticated sunflower to explore differences in gene expression and alternative splicing (AS) between a resistant and susceptible cultivar under both flooding and control conditions and at three time points. Using a combination of mixed model and gene co-expression analyses, we were able to separate general responses of sunflower to flooding stress from those that contribute to the greater tolerance of the resistant line. Both cultivars responded to flooding stress by upregulating expression levels of known submergence responsive genes, such as alcohol dehydrogenases, and slowing metabolism-related activities. Differential AS reinforced expression differences, with reduced AS frequencies typically observed for genes with upregulated expression. Significant differences were found between the genotypes, including earlier and stronger upregulation of the alcohol fermentation pathway and a more rapid return to pre-flooding gene expression levels in the resistant genotype. Our results show how changes in the timing of gene expression following both the induction of flooding and release from flooding stress contribute to increased flooding tolerance.


Sign in / Sign up

Export Citation Format

Share Document