scholarly journals The genetic architecture of gene expression levels in wild baboons

eLife ◽  
2015 ◽  
Vol 4 ◽  
Author(s):  
Jenny Tung ◽  
Xiang Zhou ◽  
Susan C Alberts ◽  
Matthew Stephens ◽  
Yoav Gilad

Primate evolution has been argued to result, in part, from changes in how genes are regulated. However, we still know little about gene regulation in natural primate populations. We conducted an RNA sequencing (RNA-seq)-based study of baboons from an intensively studied wild population. We performed complementary expression quantitative trait locus (eQTL) mapping and allele-specific expression analyses, discovering substantial evidence for, and surprising power to detect, genetic effects on gene expression levels in the baboons. eQTL were most likely to be identified for lineage-specific, rapidly evolving genes; interestingly, genes with eQTL significantly overlapped between baboons and a comparable human eQTL data set. Our results suggest that genes vary in their tolerance of genetic perturbation, and that this property may be conserved across species. Further, they establish the feasibility of eQTL mapping using RNA-seq data alone, and represent an important step towards understanding the genetic architecture of gene expression in primates.

2014 ◽  
Author(s):  
Jenny Tung ◽  
Xiang Zhou ◽  
Susan C Alberts ◽  
Matthew Stephens ◽  
Yoav Gilad

Gene expression variation is well documented in human populations and its genetic architecture has been extensively explored. However, we still know little about the genetic architecture of gene expression variation in other species, particularly our closest living relatives, the nonhuman primates. To address this gap, we performed an RNA sequencing (RNA-seq)-based study of 63 wild baboons, members of the intensively studied Amboseli baboon population in Kenya. Our study design allowed us to measure gene expression levels and identify genetic variants using the same data set, enabling us to perform complementary mapping of putative cis-acting expression quantitative trait loci (eQTL) and measurements of allele-specific expression (ASE) levels. We discovered substantial evidence for genetic effects on gene expression levels in this population. Surprisingly, we found more power to detect individual eQTL in the baboons relative to a HapMap human data set of comparable size, probably as a result of greater genetic variation, enrichment of SNPs with high minor allele frequencies, and longer-range linkage disequilibrium in the baboons. eQTL were most likely to be identified for lineage-specific, rapidly evolving genes. Interestingly, genes with eQTL significantly overlapped between the baboon and human data sets, suggesting that some genes may tolerate more genetic perturbation than others, and that this property may be conserved across species. Finally, we used a Bayesian sparse linear mixed model to partition genetic, demographic, and early environmental contributions to variation in gene expression levels. We found a strong genetic contribution to gene expression levels for almost all genes, while individual demographic and environmental effects tended to be more modest. Together, our results establish the feasibility of eQTL mapping using RNA-seq data alone, and act as an important first step towards understanding the genetic architecture of gene expression variation in nonhuman primates.


2021 ◽  
Vol 15 (1) ◽  
Author(s):  
Weitong Cui ◽  
Huaru Xue ◽  
Lei Wei ◽  
Jinghua Jin ◽  
Xuewen Tian ◽  
...  

Abstract Background RNA sequencing (RNA-Seq) has been widely applied in oncology for monitoring transcriptome changes. However, the emerging problem that high variation of gene expression levels caused by tumor heterogeneity may affect the reproducibility of differential expression (DE) results has rarely been studied. Here, we investigated the reproducibility of DE results for any given number of biological replicates between 3 and 24 and explored why a great many differentially expressed genes (DEGs) were not reproducible. Results Our findings demonstrate that poor reproducibility of DE results exists not only for small sample sizes, but also for relatively large sample sizes. Quite a few of the DEGs detected are specific to the samples in use, rather than genuinely differentially expressed under different conditions. Poor reproducibility of DE results is mainly caused by high variation of gene expression levels for the same gene in different samples. Even though biological variation may account for much of the high variation of gene expression levels, the effect of outlier count data also needs to be treated seriously, as outlier data severely interfere with DE analysis. Conclusions High heterogeneity exists not only in tumor tissue samples of each cancer type studied, but also in normal samples. High heterogeneity leads to poor reproducibility of DEGs, undermining generalization of differential expression results. Therefore, it is necessary to use large sample sizes (at least 10 if possible) in RNA-Seq experimental designs to reduce the impact of biological variability and DE results should be interpreted cautiously unless soundly validated.


2012 ◽  
Vol 90 (9) ◽  
pp. 1059-1071 ◽  
Author(s):  
Laia Navarro-Martín ◽  
Chantal Lanctôt ◽  
Christopher Edge ◽  
Jeff Houlahan ◽  
Vance L. Trudeau

Numerous studies using laboratory-reared tadpoles have shown the importance of thyroid hormones (TH), thyroid receptors (TR), and deiodinase (Dio) enzymes during anuran metamorphosis. Our study focuses on the analysis of thyroid-related genes in tadpoles of wild Wood Frogs ( Lithobates sylvaticus (LeConte, 1825); also known as Rana sylvatica (Cope, 1889)) during metamorphosis. Results showed that, in concordance with laboratory-reared studies, thyroid receptor beta (trb) gene expression profiles presented the most marked changes. At climax and compared with premetamorphic stages, brains, tails, and gonad–mesonephros complex (GMC) tissues increased trb expression levels 5-, 21-, and 41-fold, respectively (p < 0.05). In addition, gene expression levels of brain deiodinase type II and III showed opposite trends, where 3-fold decrease and 10-fold increase were, respectively, found. This finding supports the idea that thyroid hormone, as it has been demonstrated in laboratory-reared tadpoles, is also involved in natural metamorphosis in wild tadpoles. Interestingly, and contrary to our predictions, we observed that whole brain corticotropin-releasing factor (crf) and crf receptor 1 (crfr1) gene expression levels significantly decrease through metamorphosis in wild L. sylvaticus tadpoles. Further analyses are required to determine if a role of TH in the timing of anuran gonadal development exists, as well as the importance of cell-specific and tissue-specific expression of crf and crfr1 to metamorphosis.


2017 ◽  
Author(s):  
Douglas R. Wilson ◽  
Wei Sun ◽  
Joseph G. Ibrahim

AbstractThe study of gene expression quantitative trait loci (eQTL) is an effective approach to illuminate the functional roles of genetic variants. Computational methods have been developed for eQTL mapping using gene expression data from microarray or RNA-seq technology. Application of these methods for eQTL mapping in tumor tissues is problematic because tumor tissues are composed of both tumor and infiltrating normal cells (e.g. immune cells) and eQTL effects may vary between tumor and infiltrating normal cells. To address this challenge, we have developed a new method for eQTL mapping using RNA-seq data from tumor samples. Our method separately estimates the eQTL effects in tumor and infiltrating normal cells using both total expression and allele-specific expression (ASE). We demonstrate that our method controls type I error rate and has higher power than some alternative approaches. We applied our method to study RNA-seq data from The Cancer Genome Atlas and illustrated the similarities and differences of eQTL effects in tumor and normal cells.


2015 ◽  
Author(s):  
Jenny Tung ◽  
Xiang Zhou ◽  
Susan C Alberts ◽  
Matthew Stephens ◽  
Yoav Gilad

2021 ◽  
Author(s):  
Jian-Rong Li ◽  
Mabel Tang ◽  
Yafang Li ◽  
Christopher I Amos ◽  
Chao Cheng

Abstract Background: Expression quantitative trait loci (eQTLs) analyses have been widely used to identify genetic variants associated with gene expression levels to understand what molecular mechanisms underlie genetic traits. The resultant eQTLs might affect the expression of associated genes through transcriptional or post-transcriptional regulation. In this study, we attempt to distinguish these two types of regulation by identifying genetic variants associated with mRNA stability of genes (stQTLs).Results: Here, we presented a computational framework that take the advantage of recently developed methods to infer the mRNA stability of genes based on RNA-seq data and performed association analysis to identify stQTLs. Using the Genotype-Tissue Expression (GTEx) lung RNA-Seq data, we identified a total of 142,801 stQTLs for 3,942 genes and 186,132 eQTLs for 4,751 genes from 15,122,700 genetic variants for 13,476 genes, respectively. Interesting, our results indicated that stQTLs were enriched in the CDS and 3’UTR regions, while eQTLs are enriched in the CDS, 3’UTR, 5’UTR, and upstream regions. We also found that stQTLs are more likely than eQTLs to overlap with RNA binding protein (RBP) and microRNA (miRNA) binding sites. Our analyses demonstrate that simultaneous identification of stQTLs and eQTLs can provide more mechanistic insight on the association between genetic variants and gene expression levels.


2010 ◽  
Vol 08 (supp01) ◽  
pp. 177-192 ◽  
Author(s):  
XI WANG ◽  
ZHENGPENG WU ◽  
XUEGONG ZHANG

Due to its unprecedented high-resolution and detailed information, RNA-seq technology based on next-generation high-throughput sequencing significantly boosts the ability to study transcriptomes. The estimation of genes' transcript abundance levels or gene expression levels has always been an important question in research on the transcriptional regulation and gene functions. On the basis of the concept of Reads Per Kilo-base per Million reads (RPKM), taking the union-intersection genes (UI-based) and summing up inferred isoform abundance (isoform-based) are the two current strategies to estimate gene expression levels, but produce different estimations. In this paper, we made the first attempt to compare the two strategies' performances through a series of simulation studies. Our results showed that the isoform-based method gives not only more accurate estimation but also has less uncertainty than the UI-based strategy. If taking into account the non-uniformity of read distribution, the isoform-based method can further reduce estimation errors. We applied both strategies to real RNA-seq datasets of technical replicates, and found that the isoform-based strategy also displays a better performance. For a more accurate estimation of gene expression levels from RNA-seq data, even if the abundance levels of isoforms are not of interest, it is still better to first infer the isoform abundance and sum them up to get the expression level of a gene as a whole.


Blood ◽  
2016 ◽  
Vol 128 (22) ◽  
pp. 1042-1042
Author(s):  
Kohei Hosokawa ◽  
Sachiko Kajigaya ◽  
Keyvan Keyvanfar ◽  
Qiao Wangmin ◽  
Yanling Xie ◽  
...  

Abstract Background. Paroxysmal nocturnal hemoglobinuria (PNH) is a rare acquired blood disease, characterized by hemolytic anemia, bone marrow (BM) failure, and venous thrombosis. The etiology of PNH is a somatic mutation in the phosphatidylinositol glycan class A gene (PIG-A) on the X chromosome, which causes deficiency in glycosyl phosphatidylinositol-anchored proteins (GPI-APs). The involvement of T cells in PNH is strongly supported by clinical overlap between PNH and aplastic anemia (AA); the presence of GPI-AP deficient cells in AA associated with favorable response to immunosuppressive therapy; and an oligoclonal T cell repertoire in PNH patients. However, the molecular mechanisms responsible for the aberrant immune responses in PNH patients are not well understood. To identify aberrant molecular mechanisms involved in immune targeting of hematopoietic stem cells in BM, RNA sequencing (RNA-seq) was applied to examine the transcriptome of T cell subsets from PNH patients and healthy controls. Method. Blood samples were obtained after informed consent from 15 PNH patients and 15 age-matched healthy controls. For RNA extraction, freshly isolated peripheral blood mononuclear cells were sorted on the same day of blood draw to obtain four different T cell (CD3+ CD14- CD19- ViViD-) populations [CD4+ naïve (CD45RA+ CD45RO-), CD4+ memory (CD45RA- CD45RO+), CD8+ naïve (CD45RA+ CD45RO-), and CD8+ memory (CD45RA- CD45RO+) T cells] by fluorescence-activated cell sorter . RNA-Seq analysis from three PNH and three healthy controls was performed using the Illumina HiSeq™ 2000 platform. The Ingenuity® Pathway Analysis and Gene set enrichment analysis (GSEA) were employed to elucidate transcriptional pathways. RNA-seq data were validated by flow cytometry and quantitative real-time RT-PCR (RT-qPCR). Results and Discussion . Differentially expressed gene analysis of four T cell subsets showed distinct gene expression signatures in individual T cell subsets. In CD4+ naïve T cells, 11 gene expression levels were significantly different: five upregulated (including SRRM2 and TNFSF8) and six downregulated genes (including GIMAP6) (> 2 fold change, false discovery rate [FDR] < 0.05). In CD4+ memory T cells, 25 gene expression levels were significantly different: 15 upregulated (including JUND and TOB1) and 10 downregulated genes (including GIMAP4). In CD8+ naive T cells, only two gene expression levels were significantly different: upregulated CTSW and downregulated RPL9. In CD8+ memory T cells, seven gene expression levels were significantly different: two upregulated (CTSW and DPP4) and five downregulated genes (including SLC12A7). Further, differentially expressed gene analysis was performed by combining CD4+ naïve, CD4+ memory, CD8+ naïve, and CD8+ memory T cells from PNH or healthy controls, respectively. Out of 55 gene expression levels that were significantly different, 41 were upregulated (including TNFAIP3, JUN, JUND, TOB1, TNFSF8, and CD69) and 14 downregulated (including GIMAP4). By canonical pathways analysis, putative gene network interactions of differentially expressed genes were significantly enriched for canonical pathways of TNFR1, TNFR2, IL-17A, and CD27 signaling. By GSEA, the most significantly upregulated gene sets in CD4+ naïve, CD4+ memory, CD8+ naïve, and CD8+ memory T cells from PNH patients displayed gene signatures related to the "IGF1 pathway", "Pre-NOTCH expression and processing", "AP-1 pathway", and "ATF2 pathway", respectively. For validation of the RNA-seq data, we chose seven genes (TNFAIP3, JUN, JUND, TOB1, TNFSF8, CD69, and CTSW) because these are important mediators involved in regulation for T cells and dysregulation of these genes is associated with autoimmune diseases. Differential expression levels of TNFAIP3, JUN, and TOB1 were validated by RT-qPCR. By flow cytometry, higher expression of CD69 and TNFSF8 was confirmed in CD4+ and CD8+T cells from PNH compared to healthy controls. Conclusion. Using RNA-seq, we identified novel molecular mechanisms and pathways which may underlie the aberrant T cell immune status in PNH. Specific dysregulation of T cell intracellular signaling may contribute to BM failure and the inflammatory environment in PNH. Understanding these pathways may provide new therapeutic strategies to modulate T cell immune responses in BM failure. Disclosures Hosokawa: Aplastic Anemia and MDS International Foundation: Research Funding. Rios:GSK/Novartis: Research Funding. Weinstein:GSK/Novartis: Research Funding. Townsley:GSK/Novartis: Research Funding.


Sign in / Sign up

Export Citation Format

Share Document