scholarly journals Quantification of differential transcription factor activity and multiomics-based classification into activators and repressors:diffTF

2018 ◽  
Author(s):  
Ivan Berest ◽  
Christian Arnold ◽  
Armando Reyes-Palomares ◽  
Giovanni Palla ◽  
Kasper Dindler Rasmussen ◽  
...  

Transcription factor (TF) activity is an important read-out of cellular signalling pathways and thus to assess regulatory differences across conditions. However, current technologies lack the ability to simultaneously assess activity changes for multiple TFs and in particular to determine whether a specific TF acts globally as transcriptional repressor or activator. To this end, we introduce a widely applicable genome-wide methoddiffTFto assess differential TF activity and to classify TFs as activator or repressor (available athttps://git.embl.de/grp-zaugg/diffTF). This is done by integrating any type of genome-wide chromatin accessibility data with RNA-Seq data and in-silico predicted TF binding sites. We corroborated the classification of TFs into repressors and activators by three independent analyses based on enrichments of active/repressive chromatin states, correlation of TF activity with gene expression, and activator-and repressor-specific chromatin footprints. To show the power ofdiffTF, we present two case studies: First, we applieddiffTFin to a large ATAC-Seq/RNA-Seq dataset comparing mutated and unmutated chronic lymphocytic leukemia samples, where we identified dozens of known (40%) and potentially novel (60%) TFs that are differentially active. We were also able to classify almost half of them as either repressor and activator. Second, we applieddiffTFto a small ATAC-Seq/RNA-Seq data set comparing two cell types along the hematopoietic differentiation trajectory (multipotent progenitors – MPP – versus granulocyte-macrophage progenitors – GMP). Here we identified the known drivers of differentiation and found that the majority of the differentially active TFs are transcriptional activators. Overall,diffTFwas able to recover the known TFs in both case studies, additionally identified TFs that have been less well characterized in the given condition, and provides a classification of the TFs into transcriptional activators and repressors.

2018 ◽  
Author(s):  
Hatice U. Osmanbeyoglu ◽  
Fumiko Shimizu ◽  
Angela Rynne-Vidal ◽  
Petar Jelinic ◽  
Samuel C. Mok ◽  
...  

ABSTRACTEpigenomic data on transcription factor occupancy and chromatin accessibility can elucidate the developmental origin of cancer cells and reveal the enhancer landscape of key oncogenic transcriptional regulators. However, in many cancers, epigenomic analyses have been limited, and computational methods to infer regulatory networks in tumors typically use expression data alone, or rely on transcription factor (TF) motifs in annotated promoter regions. Here, we develop a novel machine learning strategy called PSIONIC (patient-specific inference of networks informed by chromatin) to combine cell line chromatin accessibility data with large tumor expression data sets and model the effect of enhancers on transcriptional programs in multiple cancers. We generated a new ATAC-seq data set profiling chromatin accessibility in gynecologic and basal breast cancer cell lines and applied PSIONIC to 723 RNA-seq experiments from ovarian, uterine, and basal breast tumors as well as 96 cell line RNA-seq profiles. Our computational framework enables us to share information across tumors to learn patient-specific inferred TF activities, revealing regulatory differences between and within tumor types. Many of the identified TF regulators were significantly associated with survival outcome in basal breast, uterine serous and endometrioid carcinomas. Moreover, PSIONIC-predicted activity for MTF1 in cell line models correlated with sensitivity to MTF1 inhibition. Therefore computationally dissecting the role of TFs in gynecologic cancers may ultimately advance personalized therapy.


2020 ◽  
Author(s):  
Laiyi Fu ◽  
Lihua Zhang ◽  
Emmanuel Dollinger ◽  
Qinke Peng ◽  
Qing Nie ◽  
...  

AbstractCharacterizing genome-wide binding profiles of transcription factor (TF) is essential for understanding many biological processes. Although techniques have been developed to assess binding profiles within a population of cells, determining binding profiles at a single cell level remains elusive. Here we report scFAN (Single Cell Factor Analysis Network), a deep learning model that predicts genome-wide TF binding profiles in individual cells. scFAN is pre-trained on genome-wide bulk ATAC-seq, DNA sequence and ChIP-seq data, and utilizes single-cell ATAC-seq to predict TF binding in individual cells. We demonstrate the efficacy of scFAN by studying sequence motifs enriched within predicted binding peaks and investigating the effectiveness of predicted TF peaks for discovering cell types. We develop a new metric “TF activity score” to characterize each cell, and show that the activity scores can reliably capture cell identities. The method allows us to discover and study cellular identities and heterogeneity based on chromatin accessibility profiles.


2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Sarah E. Pierce ◽  
Jeffrey M. Granja ◽  
William J. Greenleaf

AbstractChromatin accessibility profiling can identify putative regulatory regions genome wide; however, pooled single-cell methods for assessing the effects of regulatory perturbations on accessibility are limited. Here, we report a modified droplet-based single-cell ATAC-seq protocol for perturbing and evaluating dynamic single-cell epigenetic states. This method (Spear-ATAC) enables simultaneous read-out of chromatin accessibility profiles and integrated sgRNA spacer sequences from thousands of individual cells at once. Spear-ATAC profiling of 104,592 cells representing 414 sgRNA knock-down populations reveals the temporal dynamics of epigenetic responses to regulatory perturbations in cancer cells and the associations between transcription factor binding profiles.


2018 ◽  
Vol 8 (1) ◽  
Author(s):  
Riccardo Lorrai ◽  
Francesco Gandolfi ◽  
Alessandra Boccaccini ◽  
Veronica Ruta ◽  
Marco Possenti ◽  
...  

2021 ◽  
Author(s):  
Peter Orchard ◽  
Nandini Manickam ◽  
Christa Ventresca ◽  
Swarooparani Vadlamudi ◽  
Arushi Varshney ◽  
...  

Skeletal muscle accounts for the largest proportion of human body mass, on average, and is a key tissue in complex diseases and mobility. It is composed of several different cell and muscle fiber types. Here, we optimize single-nucleus ATAC-seq (snATAC-seq) to map skeletal muscle cell–specific chromatin accessibility landscapes in frozen human and rat samples, and single-nucleus RNA-seq (snRNA-seq) to map cell-specific transcriptomes in human. We additionally perform multi-omics profiling (gene expression and chromatin accessibility) on human and rat muscle samples. We capture type I and type II muscle fiber signatures, which are generally missed by existing single-cell RNA-seq methods. We perform cross-modality and cross-species integrative analyses on 33,862 nuclei and identify seven cell types ranging in abundance from 59.6% to 1.0% of all nuclei. We introduce a regression-based approach to infer cell types by comparing transcription start site–distal ATAC-seq peaks to reference enhancer maps and show consistency with RNA-based marker gene cell type assignments. We find heterogeneity in enrichment of genetic variants linked to complex phenotypes from the UK Biobank and diabetes genome-wide association studies in cell-specific ATAC-seq peaks, with the most striking enrichment patterns in muscle mesenchymal stem cells (∼3.5% of nuclei). Finally, we overlay these chromatin accessibility maps on GWAS data to nominate causal cell types, SNPs, transcription factor motifs, and target genes for type 2 diabetes signals. These chromatin accessibility profiles for human and rat skeletal muscle cell types are a useful resource for nominating causal GWAS SNPs and cell types.


2021 ◽  
Author(s):  
Anna Reznichenko ◽  
Viji Nair ◽  
Sean Eddy ◽  
Mark Tomilo ◽  
Timothy Slidel ◽  
...  

Current classification of chronic kidney disease (CKD) into stages based on the indirect measures of kidney functional state, estimated glomerular filtration rate and albuminuria, is agnostic to the heterogeneity of underlying etiologies, histopathology, and molecular processes. We used genome-wide transcriptomics from patients kidney biopsies, directly reflecting kidney biological processes, to stratify patients from three independent CKD cohorts. Unsupervised Self-Organizing Maps (SOM), an artificial neural network algorithm, assembled CKD patients into four novel subgroups, molecular categories, based on the similarity of their kidney transcriptomics profiles. The unbiased, molecular categories were present across CKD stages and histopathological diagnoses, highlighting heterogeneity of conventional clinical subgroups at the molecular level. CKD molecular categories were distinct in terms of biological pathways, transcriptional regulation and associated kidney cell types, indicating that the molecular categorization is founded on biologically meaningful mechanisms. Importantly, our results revealed that not all biological pathways are equally activated in all patients; instead, different pathways could be more dominant in different subgroups and thereby differentially influencing disease progression and outcomes. This first kidney-centric unbiased categorization of CKD paves the way to an integrated clinical, morphological and molecular diagnosis. This is a key step towards enabling precision medicine for this heterogeneous condition with the potential to advance biological understanding, clinical management, and drug development, as well as establish a roadmap for molecular reclassification of CKD and other complex diseases.


2021 ◽  
Author(s):  
Dennis A Sun ◽  
Nipam H Patel

AbstractEmerging research organisms enable the study of biology that cannot be addressed using classical “model” organisms. The development of novel data resources can accelerate research in such animals. Here, we present new functional genomic resources for the amphipod crustacean Parhyale hawaiensis, facilitating the exploration of gene regulatory evolution using this emerging research organism. We use Omni-ATAC-Seq, an improved form of the Assay for Transposase-Accessible Chromatin coupled with next-generation sequencing (ATAC-Seq), to identify accessible chromatin genome-wide across a broad time course of Parhyale embryonic development. This time course encompasses many major morphological events, including segmentation, body regionalization, gut morphogenesis, and limb development. In addition, we use short- and long-read RNA-Seq to generate an improved Parhyale genome annotation, enabling deeper classification of identified regulatory elements. We leverage a variety of bioinformatic tools to discover differential accessibility, predict nucleosome positioning, infer transcription factor binding, cluster peaks based on accessibility dynamics, classify biological functions, and correlate gene expression with accessibility. Using a Minos transposase reporter system, we demonstrate the potential to identify novel regulatory elements using this approach, including distal regulatory elements. This work provides a platform for the identification of novel developmental regulatory elements in Parhyale, and offers a framework for performing such experiments in other emerging research organisms.Primary Findings-Omni-ATAC-Seq identifies cis-regulatory elements genome-wide during crustacean embryogenesis-Combined short- and long-read RNA-Seq improves the Parhyale genome annotation-ImpulseDE2 analysis identifies dynamically regulated candidate regulatory elements-NucleoATAC and HINT-ATAC enable inference of nucleosome occupancy and transcription factor binding-Fuzzy clustering reveals peaks with distinct accessibility and chromatin dynamics-Integration of accessibility and gene expression reveals possible enhancers and repressors-Omni-ATAC can identify known and novel regulatory elements


2020 ◽  
Vol 4 (Supplement_1) ◽  
Author(s):  
Frederique Murielle Ruf-Zamojski ◽  
Michel A Zamojski ◽  
German Nudelman ◽  
Yongchao Ge ◽  
Natalia Mendelev ◽  
...  

Abstract The pituitary gland is a critical regulator of the neuroendocrine system. To further our understanding of the classification, cellular heterogeneity, and regulatory landscape of pituitary cell types, we performed and computationally integrated single cell (SC)/single nucleus (SN) resolution experiments capturing RNA expression, chromatin accessibility, and DNA methylation state from mouse dissociated whole pituitaries. Both SC and SN transcriptome analysis and promoter accessibility identified the five classical hormone-producing cell types (somatotropes, gonadotropes (GT), lactotropes, thyrotropes, and corticotropes). GT cells distinctively expressed transcripts for Cga, Fshb, Lhb, Nr5a1, and Gnrhr in SC RNA-seq and SN RNA-seq. This was matched in SN ATAC-seq with GTs specifically showing open chromatin at the promoter regions for the same genes. Similarly, the other classically defined anterior pituitary cells displayed transcript expression and chromatin accessibility patterns characteristic of their own cell type. This integrated analysis identified additional cell-types, such as a stem cell cluster expressing transcripts for Sox2, Sox9, Mia, and Rbpms, and a broadly accessible chromatin state. In addition, we performed bulk ATAC-seq in the LβT2b gonadotrope-like cell line. While the FSHB promoter region was closed in the cell line, we identified a region upstream of Fshb that became accessible by the synergistic actions of GnRH and activin A, and that corresponded to a conserved region identified by a polycystic ovary syndrome (PCOS) single nucleotide polymorphism (SNP). Although this locus appears closed in deep sequencing bulk ATAC-seq of dissociated mouse pituitary cells, SN ATAC-seq of the same preparation showed that this site was specifically open in mouse GT, but closed in 14 other pituitary cell type clusters. This discrepancy highlighted the detection limit of a bulk ATAC-seq experiment in a subpopulation, as GT represented ~5% of this dissociated anterior pituitary sample. These results identified this locus as a candidate for explaining the dual dependence of Fshb expression on GnRH and activin/TGFβ signaling, and potential new evidence for upstream regulation of Fshb. The pituitary epigenetic landscape provides a resource for improved cell type identification and for the investigation of the regulatory mechanisms driving cell-to-cell heterogeneity. Additional authors not listed due to abstract submission restrictions: N. Seenarine, M. Amper, N. Jain (ISMMS).


2019 ◽  
Vol 19 (10) ◽  
pp. e8-e9
Author(s):  
Raphael Szalat ◽  
Matthew Lawlor ◽  
Mariateresa Fulciniti ◽  
Charles B. Epstein ◽  
Yan Xu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document