scholarly journals Promoter G-quadruplexes and transcription factors cooperate to shape the cell type-specific transcriptome

2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Sara Lago ◽  
Matteo Nadai ◽  
Filippo M. Cernilogar ◽  
Maryam Kazerani ◽  
Helena Domíniguez Moreno ◽  
...  

AbstractCell identity is maintained by activation of cell-specific gene programs, regulated by epigenetic marks, transcription factors and chromatin organization. DNA G-quadruplex (G4)-folded regions in cells were reported to be associated with either increased or decreased transcriptional activity. By G4-ChIP-seq/RNA-seq analysis on liposarcoma cells we confirmed that G4s in promoters are invariably associated with high transcription levels in open chromatin. Comparing G4 presence, location and transcript levels in liposarcoma cells to available data on keratinocytes, we showed that the same promoter sequences of the same genes in the two cell lines had different G4-folding state: high transcript levels consistently associated with G4-folding. Transcription factors AP-1 and SP1, whose binding sites were the most significantly represented in G4-folded sequences, coimmunoprecipitated with their G4-folded promoters. Thus, G4s and their associated transcription factors cooperate to determine cell-specific transcriptional programs, making G4s to strongly emerge as new epigenetic regulators of the transcription machinery.

2020 ◽  
Author(s):  
Sara Lago ◽  
Filippo M. Cernilogar ◽  
Maryam Kazerani ◽  
Helena Domíniguez Moreno ◽  
Matteo Nadai ◽  
...  

AbstractCell identity is maintained by activation of cell-specific gene programs, regulated by epigenetic marks, transcription factors and chromatin organization1-3. DNA G-quadruplex (G4)-folded regions in cells were reported to be associated with either increased or decreased transcriptional activity4,5. By G4 ChIP-seq/RNA-seq analysis on liposarcoma cells we confirmed that G4s in promoters are invariably associated with high transcription levels in open chromatin. Comparing G4 presence, location and transcript levels in liposarcoma cells to available data on keratinocytes, we showed that the same promoter sequences of the same genes in the two cell lines had different G4-folding state: high transcript levels consistently associated with high G4-folding. Transcription factors AP-1 and SP1, whose binding sites were the most significantly represented in G4-folded sequences, coimmunoprecipitated with their G4-folded promoters. Thus G4s and their associated transcription factors cooperate to determine cell-specific transcriptional programs, making G4s strongly emerge as new epigenetic regulators of the transcription machinery.


2021 ◽  
Vol 22 (11) ◽  
pp. 5902
Author(s):  
Stefan Nagel ◽  
Claudia Pommerenke ◽  
Corinna Meyer ◽  
Hans G. Drexler

Recently, we documented a hematopoietic NKL-code mapping physiological expression patterns of NKL homeobox genes in human myelopoiesis including monocytes and their derived dendritic cells (DCs). Here, we enlarge this map to include normal NKL homeobox gene expressions in progenitor-derived DCs. Analysis of public gene expression profiling and RNA-seq datasets containing plasmacytoid and conventional dendritic cells (pDC and cDC) demonstrated HHEX activity in both entities while cDCs additionally expressed VENTX. The consequent aim of our study was to examine regulation and function of VENTX in DCs. We compared profiling data of VENTX-positive cDC and monocytes with VENTX-negative pDC and common myeloid progenitor entities and revealed several differentially expressed genes encoding transcription factors and pathway components, representing potential VENTX regulators. Screening of RNA-seq data for 100 leukemia/lymphoma cell lines identified prominent VENTX expression in an acute myelomonocytic leukemia cell line, MUTZ-3 containing inv(3)(q21q26) and t(12;22)(p13;q11) and representing a model for DC differentiation studies. Furthermore, extended gene analyses indicated that MUTZ-3 is associated with the subtype cDC2. In addition to analysis of public chromatin immune-precipitation data, subsequent knockdown experiments and modulations of signaling pathways in MUTZ-3 and control cell lines confirmed identified candidate transcription factors CEBPB, ETV6, EVI1, GATA2, IRF2, MN1, SPIB, and SPI1 and the CSF-, NOTCH-, and TNFa-pathways as VENTX regulators. Live-cell imaging analyses of MUTZ-3 cells treated for VENTX knockdown excluded impacts on apoptosis or induced alteration of differentiation-associated cell morphology. In contrast, target gene analysis performed by expression profiling of knockdown-treated MUTZ-3 cells revealed VENTX-mediated activation of several cDC-specific genes including CSFR1, EGR2, and MIR10A and inhibition of pDC-specific genes like RUNX2. Taken together, we added NKL homeobox gene activities for progenitor-derived DCs to the NKL-code, showing that VENTX is expressed in cDCs but not in pDCs and forms part of a cDC-specific gene regulatory network operating in DC differentiation and function.


2020 ◽  
Vol 22 (Supplement_3) ◽  
pp. iii314-iii314
Author(s):  
Amir Arabzade ◽  
Yanhua Zhao ◽  
Srinidhi Varadharajan ◽  
Hsiao-Chi Chen ◽  
Austin Stuckert ◽  
...  

Abstract RATIONALE Over 70% of supratentorial (ST) ependymoma are characterized by an oncogenic fusion between C11ORF95 and RELA. C11ORF95-RELA fusion is frequently the sole genetic driver detected in ST ependymoma, thus ranking this genomic event as a lead target for therapeutic investigation. RELA is a transcription factor (TF) central to mediating NF-kB pathway activation in processes such as inflammation, cellular metabolism, and chemotaxis. HYPOTHESIS: We posited that C11ORF95-RELA acts as an oncogenic TF that aberrantly shapes the tumor epigenome to drive aberrant transcription. Approach: To this end we developed an in utero electroporation (IUE) mouse model of ependymoma to express C11ORF95-RELA during embryonic development. Our IUE approach allowed us to develop C11ORF95-RELA driven tumor models and cell lines. We comprehensively characterized the epigenome and transcriptome of C11ORF95-RELA fusion driven mouse cells by H3K27ac ChIP-seq, ATAC-seq, and RNA-seq. RESULTS This data revealed that: 1) C11ORF95-RELA directly engages ‘open’ chromatin and is enriched at regions with known RELA TF binding sites as well as novel genomic loci/motifs, 2) C11ORF95-RELA preferentially binds to both H3K27ac (active) enhancers and promoters, and 3) Bound C11ORF95-RELA promoter loci are associated with increased transcription of genes shared with human ependymoma. CONCLUSION Our findings shed light on the transcriptional mechanisms of C11ORF95-RELA, and reveal downstream targets that may represent cancer dependency genes and molecular targets.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Rongxin Fang ◽  
Sebastian Preissl ◽  
Yang Li ◽  
Xiaomeng Hou ◽  
Jacinta Lucero ◽  
...  

AbstractIdentification of the cis-regulatory elements controlling cell-type specific gene expression patterns is essential for understanding the origin of cellular diversity. Conventional assays to map regulatory elements via open chromatin analysis of primary tissues is hindered by sample heterogeneity. Single cell analysis of accessible chromatin (scATAC-seq) can overcome this limitation. However, the high-level noise of each single cell profile and the large volume of data pose unique computational challenges. Here, we introduce SnapATAC, a software package for analyzing scATAC-seq datasets. SnapATAC dissects cellular heterogeneity in an unbiased manner and map the trajectories of cellular states. Using the Nyström method, SnapATAC can process data from up to a million cells. Furthermore, SnapATAC incorporates existing tools into a comprehensive package for analyzing single cell ATAC-seq dataset. As demonstration of its utility, SnapATAC is applied to 55,592 single-nucleus ATAC-seq profiles from the mouse secondary motor cortex. The analysis reveals ~370,000 candidate regulatory elements in 31 distinct cell populations in this brain region and inferred candidate cell-type specific transcriptional regulators.


2019 ◽  
Vol 47 (17) ◽  
pp. 9069-9086 ◽  
Author(s):  
Filippo M Cernilogar ◽  
Stefan Hasenöder ◽  
Zeyang Wang ◽  
Katharina Scheibner ◽  
Ingo Burtscher ◽  
...  

Abstract Pioneer transcription factors (PTF) can recognize their binding sites on nucleosomal DNA and trigger chromatin opening for recruitment of other non-pioneer transcription factors. However, critical properties of PTFs are still poorly understood, such as how these transcription factors selectively recognize cell type-specific binding sites and under which conditions they can initiate chromatin remodelling. Here we show that early endoderm binding sites of the paradigm PTF Foxa2 are epigenetically primed by low levels of active chromatin modifications in embryonic stem cells (ESC). Priming of these binding sites is supported by preferential recruitment of Foxa2 to endoderm binding sites compared to lineage-inappropriate binding sites, when ectopically expressed in ESCs. We further show that binding of Foxa2 is required for chromatin opening during endoderm differentiation. However, increased chromatin accessibility was only detected on binding sites which are synergistically bound with other endoderm transcription factors. Thus, our data suggest that binding site selection of PTFs is directed by the chromatin environment and that chromatin opening requires collaboration of PTFs with additional transcription factors.


2007 ◽  
Vol 4 (2) ◽  
pp. 1-23
Author(s):  
Amitava Karmaker ◽  
Kihoon Yoon ◽  
Mark Doderer ◽  
Russell Kruzelock ◽  
Stephen Kwek

Summary Revealing the complex interaction between trans- and cis-regulatory elements and identifying these potential binding sites are fundamental problems in understanding gene expression. The progresses in ChIP-chip technology facilitate identifying DNA sequences that are recognized by a specific transcription factor. However, protein-DNA binding is a necessary, but not sufficient, condition for transcription regulation. We need to demonstrate that their gene expression levels are correlated to further confirm regulatory relationship. Here, instead of using a linear correlation coefficient, we used a non-linear function that seems to better capture possible regulatory relationships. By analyzing tissue-specific gene expression profiles of human and mouse, we delineate a list of pairs of transcription factor and gene with highly correlated expression levels, which may have regulatory relationships. Using two closely-related species (human and mouse), we perform comparative genome analysis to cross-validate the quality of our prediction. Our findings are confirmed by matching publicly available TFBS databases (like TRANFAC and ConSite) and by reviewing biological literature. For example, according to our analysis, 80% and 85.71% of the targets genes associated with E2F5 and RELB transcription factors have the corresponding known binding sites. We also substantiated our results on some oncogenes with the biomedical literature. Moreover, we performed further analysis on them and found that BCR and DEK may be regulated by some common transcription factors. Similar results for BTG1, FCGR2B and LCK genes were also reported.


2019 ◽  
Author(s):  
Pawel F. Przytycki ◽  
Katherine S. Pollard

Single-cell and bulk genomics assays have complementary strengths and weaknesses, and alone neither strategy can fully capture regulatory elements across the diversity of cells in complex tissues. We present CellWalker, a method that integrates single-cell open chromatin (scATAC-seq) data with gene expression (RNA-seq) and other data types using a network model that simultaneously improves cell labeling in noisy scATAC-seq and annotates cell-type specific regulatory elements in bulk data. We demonstrate CellWalker’s robustness to sparse annotations and noise using simulations and combined RNA-seq and ATAC-seq in individual cells. We then apply CellWalker to the developing brain. We identify cells transitioning between transcriptional states, resolve enhancers to specific cell types, and observe that autism and other neurological traits can be mapped to specific cell types through their enhancers.


2021 ◽  
Author(s):  
Weixu Wang ◽  
Jun Yao ◽  
Yi Wang ◽  
Chao Zhang ◽  
Wei Tao ◽  
...  

Cell type-specific gene expression (CSE) brings novel biological insights into both physiological and pathological processes compared with bulk tissue gene expression. Although fluorescence-activated cell sorting (FACS) and single-cell RNA sequencing (scRNA-seq) are two widely used techniques to detect gene expression in a cell type-specific manner, the constraints of cost and labor force make it impractical as a routine on large patient cohorts. Here, we present ENIGMA, an algorithm that deconvolutes bulk RNA-seq into cell type-specific expression matrices and cell type fraction matrices without the need of physical sorting or sequencing of single cells. ENIGMA used cell type signature matrix generated from either FACS RNA-seq or scRNA-seq as reference, and applied matrix completion technique to achieve fast and accurate deconvolution. We demonstrated the superior performance of ENIGMA to previously published algorithms (TCA, bMIND and CIBERSORTx) while requiring much less running time on both simulated and realistic datasets. To prove its value in biological discovery, we applied ENIGMA to bulk RNA-seq from arthritis patients and revealed a pseudo-differentiation trajectory that could reflect monocyte to macrophage transition. We also applied ENIGMA to bulk RNA-seq data of pancreatic islet tissue from type 2 diabetes (T2D) patients and discovered a beta cell-specific gene co-expression module related to senescence and apoptosis that possibly contributed to the pathogenesis of T2D. Together, ENIGMA provides a new framework to improve the CSE estimation by integrating FACS RNA-seq and scRNA-seq with tissue bulk RNA-seq data, and will extend our understandings about cell heterogeneity on population level with no need for experimental tissue disaggregation.


2018 ◽  
Author(s):  
Xuran Wang ◽  
Jihwan Park ◽  
Katalin Susztak ◽  
Nancy R. Zhang ◽  
Mingyao Li

AbstractWe present MuSiC, a method that utilizes cell-type specific gene expression from single-cell RNA sequencing (RNA-seq) data to characterize cell type compositions from bulk RNA-seq data in complex tissues. When applied to pancreatic islet and whole kidney expression data in human, mouse, and rats, MuSiC outperformed existing methods, especially for tissues with closely related cell types. MuSiC enables characterization of cellular heterogeneity of complex tissues for identification of disease mechanisms.


Sign in / Sign up

Export Citation Format

Share Document