scholarly journals Testis single-cell RNA-seq reveals the dynamics of de novo gene transcription and germline mutational bias in Drosophila

2019 ◽  
Author(s):  
Evan Witt ◽  
Sigi Benjamin ◽  
Nicolas Svetec ◽  
Li Zhao

SummaryThe testis is a peculiar tissue in many respects. It shows patterns of rapid gene evolution and provides a hotspot for the origination of genetic novelties such as de novo genes, duplications and mutations. To investigate the expression patterns of genetic novelties across cell types, we performed single-cell RNA-sequencing of adult Drosophila testis. We found that new genes were expressed in various cell types, the patterns of which may be influenced by their mode of origination. In particular, lineage-specific de novo genes are commonly expressed in early spermatocytes, while young duplicated genes are often bimodally expressed. Analysis of germline substitutions suggests that spermatogenesis is a highly reparative process, with the mutational load of germ cells decreasing as spermatogenesis progresses. By elucidating the distribution of genetic novelties across spermatogenesis, this study provides a deeper understanding of how the testis maintains its core reproductive function while being a hotbed of evolutionary innovation.

eLife ◽  
2019 ◽  
Vol 8 ◽  
Author(s):  
Evan Witt ◽  
Sigi Benjamin ◽  
Nicolas Svetec ◽  
Li Zhao

The testis is a peculiar tissue in many respects. It shows patterns of rapid gene evolution and provides a hotspot for the origination of genetic novelties such as de novo genes, duplications and mutations. To investigate the expression patterns of genetic novelties across cell types, we performed single-cell RNA-sequencing of adult Drosophila testis. We found that new genes were expressed in various cell types, the patterns of which may be influenced by their mode of origination. In particular, lineage-specific de novo genes are commonly expressed in early spermatocytes, while young duplicated genes are often bimodally expressed. Analysis of germline substitutions suggests that spermatogenesis is a highly reparative process, with the mutational load of germ cells decreasing as spermatogenesis progresses. By elucidating the distribution of genetic novelties across spermatogenesis, this study provides a deeper understanding of how the testis maintains its core reproductive function while being a hotbed of evolutionary innovation.


2019 ◽  
Author(s):  
Chen Xie ◽  
Cemalettin Bekpen ◽  
Sven Künzel ◽  
Maryam Keshavarz ◽  
Rebecca Krebs-Wheaton ◽  
...  

AbstractThe de novo emergence of new transcripts has been well documented through genomic analyses. However, a functional analysis, especially of very young protein-coding genes, is still largely lacking. Here we focus on three loci that have evolved from previously intergenic sequences in the house mouse (Mus musculus) and are not present in its closest relatives. We have obtained knockouts and analyzed their phenotypes, including a deep transcriptomic analysis, based on a dedicated power analysis. We show that the transcriptional networks are significantly disturbed in the knockouts and that all three genes have effects on phenotypes that are related to their expression patterns. This includes behavioral effects, skeletal differences and the regulation of the reproduction cycle in females. Substitution analysis suggests that all three genes have directly obtained an activity, without new adaptive substitutions. Our findings support the hypothesis that de novo genes can quickly adopt functions without extensive adaptation.Impact statementNew protein-coding genes emerging out of non-coding sequences can become directly functional without signatures of adaptive protein changes


2019 ◽  
Author(s):  
Paco Majic ◽  
Joshua L. Payne

AbstractRegulatory networks control the spatiotemporal gene expression patterns that give rise to and define the individual cell types of multicellular organisms. In eumetazoa, distal regulatory elements called enhancers play a key role in determining the structure of such networks, particularly the wiring diagram of “who regulates whom.” Mutations that affect enhancer activity can therefore rewire regulatory networks, potentially causing changes in gene expression that are adaptive. Here, we use whole-tissue and single-cell transcriptomic and chromatin accessibility data from mouse to show that enhancers play an additional role in the evolution of regulatory networks: They facilitate network growth by creating transcriptionally active regions of open chromatin that are conducive to de novo gene evolution. Specifically, our comparative transcriptomic analysis with three other mammalian species shows that young, mouse-specific intergenic open reading frames are preferentially located near enhancers, whereas older open reading frames are not. Mouse-specific intergenic open reading frames that are proximal to enhancers are more highly and stably transcribed than those that are not proximal to enhancers or promoters, and they are transcribed in a limited diversity of cellular contexts. Furthermore, we report several instances of mouse-specific intergenic open reading frames that are proximal to promoters that show evidence of being repurposed enhancers. We also show that open reading frames gradually acquire specific interactions with enhancers over macro-evolutionary timescales, helping integrate new genes into existing regulatory networks. Taken together, our results highlight a dual role of enhancers in expanding and rewiring gene regulatory networks.


2021 ◽  
Vol 22 (S3) ◽  
Author(s):  
Yuanyuan Li ◽  
Ping Luo ◽  
Yi Lu ◽  
Fang-Xiang Wu

Abstract Background With the development of the technology of single-cell sequence, revealing homogeneity and heterogeneity between cells has become a new area of computational systems biology research. However, the clustering of cell types becomes more complex with the mutual penetration between different types of cells and the instability of gene expression. One way of overcoming this problem is to group similar, related single cells together by the means of various clustering analysis methods. Although some methods such as spectral clustering can do well in the identification of cell types, they only consider the similarities between cells and ignore the influence of dissimilarities on clustering results. This methodology may limit the performance of most of the conventional clustering algorithms for the identification of clusters, it needs to develop special methods for high-dimensional sparse categorical data. Results Inspired by the phenomenon that same type cells have similar gene expression patterns, but different types of cells evoke dissimilar gene expression patterns, we improve the existing spectral clustering method for clustering single-cell data that is based on both similarities and dissimilarities between cells. The method first measures the similarity/dissimilarity among cells, then constructs the incidence matrix by fusing similarity matrix with dissimilarity matrix, and, finally, uses the eigenvalues of the incidence matrix to perform dimensionality reduction and employs the K-means algorithm in the low dimensional space to achieve clustering. The proposed improved spectral clustering method is compared with the conventional spectral clustering method in recognizing cell types on several real single-cell RNA-seq datasets. Conclusions In summary, we show that adding intercellular dissimilarity can effectively improve accuracy and achieve robustness and that improved spectral clustering method outperforms the traditional spectral clustering method in grouping cells.


PLoS ONE ◽  
2021 ◽  
Vol 16 (7) ◽  
pp. e0254194
Author(s):  
Hong-Tae Park ◽  
Woo Bin Park ◽  
Suji Kim ◽  
Jong-Sung Lim ◽  
Gyoungju Nah ◽  
...  

Mycobacterium avium subsp. paratuberculosis (MAP) is a causative agent of Johne’s disease, which is a chronic and debilitating disease in ruminants. MAP is also considered to be a possible cause of Crohn’s disease in humans. However, few studies have focused on the interactions between MAP and human macrophages to elucidate the pathogenesis of Crohn’s disease. We sought to determine the initial responses of human THP-1 cells against MAP infection using single-cell RNA-seq analysis. Clustering analysis showed that THP-1 cells were divided into seven different clusters in response to phorbol-12-myristate-13-acetate (PMA) treatment. The characteristics of each cluster were investigated by identifying cluster-specific marker genes. From the results, we found that classically differentiated cells express CD14, CD36, and TLR2, and that this cell type showed the most active responses against MAP infection. The responses included the expression of proinflammatory cytokines and chemokines such as CCL4, CCL3, IL1B, IL8, and CCL20. In addition, the Mreg cell type, a novel cell type differentiated from THP-1 cells, was discovered. Thus, it is suggested that different cell types arise even when the same cell line is treated under the same conditions. Overall, analyzing gene expression patterns via scRNA-seq classification allows a more detailed observation of the response to infection by each cell type.


2020 ◽  
Author(s):  
Etienne Becht ◽  
Daniel Tolstrup ◽  
Charles-Antoine Dutertre ◽  
Florent Ginhoux ◽  
Evan W. Newell ◽  
...  

AbstractModern immunologic research increasingly requires high-dimensional analyses in order to understand the complex milieu of cell-types that comprise the tissue microenvironments of disease. To achieve this, we developed Infinity Flow combining hundreds of overlapping flow cytometry panels using machine learning to enable the simultaneous analysis of the co-expression patterns of 100s of surface-expressed proteins across millions of individual cells. In this study, we demonstrate that this approach allows the comprehensive analysis of the cellular constituency of the steady-state murine lung and to identify novel cellular heterogeneity in the lungs of melanoma metastasis bearing mice. We show that by using supervised machine learning, Infinity Flow enhances the accuracy and depth of clustering or dimensionality reduction algorithms. Infinity Flow is a highly scalable, low-cost and accessible solution to single cell proteomics in complex tissues.


2019 ◽  
Author(s):  
Alexandra Grubman ◽  
Gabriel Chew ◽  
John F. Ouyang ◽  
Guizhi Sun ◽  
Xin Yi Choo ◽  
...  

AbstractAlzheimer’s disease (AD) is a heterogeneous disease that is largely dependent on the complex cellular microenvironment in the brain. This complexity impedes our understanding of how individual cell types contribute to disease progression and outcome. To characterize the molecular and functional cell diversity in the human AD brain we utilized single nuclei RNA- seq in AD and control patient brains in order to map the landscape of cellular heterogeneity in AD. We detail gene expression changes at the level of cells and cell subclusters, highlighting specific cellular contributions to global gene expression patterns between control and Alzheimer’s patient brains. We observed distinct cellular regulation of APOE which was repressed in oligodendrocyte progenitor cells (OPCs) and astrocyte AD subclusters, and highly enriched in a microglial AD subcluster. In addition, oligodendrocyte and microglia AD subclusters show discordant expression of APOE. Integration of transcription factor regulatory modules with downstream GWAS gene targets revealed subcluster-specific control of AD cell fate transitions. For example, this analysis uncovered that astrocyte diversity in AD was under the control of transcription factor EB (TFEB), a master regulator of lysosomal function and which initiated a regulatory cascade containing multiple AD GWAS genes. These results establish functional links between specific cellular sub-populations in AD, and provide new insights into the coordinated control of AD GWAS genes and their cell-type specific contribution to disease susceptibility. Finally, we created an interactive reference web resource which will facilitate brain and AD researchers to explore the molecular architecture of subtype and AD-specific cell identity, molecular and functional diversity at the single cell level.HighlightsWe generated the first human single cell transcriptome in AD patient brainsOur study unveiled 9 clusters of cell-type specific and common gene expression patterns between control and AD brains, including clusters of genes that present properties of different cell types (i.e. astrocytes and oligodendrocytes)Our analyses also uncovered functionally specialized sub-cellular clusters: 5 microglial clusters, 8 astrocyte clusters, 6 neuronal clusters, 6 oligodendrocyte clusters, 4 OPC and 2 endothelial clusters, each enriched for specific ontological gene categoriesOur analyses found manifold AD GWAS genes specifically associated with one cell-type, and sets of AD GWAS genes co-ordinately and differentially regulated between different brain cell-types in AD sub-cellular clustersWe mapped the regulatory landscape driving transcriptional changes in AD brain, and identified transcription factor networks which we predict to control cell fate transitions between control and AD sub-cellular clustersFinally, we provide an interactive web-resource that allows the user to further visualise and interrogate our dataset.Data resource web interface:http://adsn.ddnetbio.com


Blood ◽  
2021 ◽  
Vol 138 (Supplement 1) ◽  
pp. 3458-3458
Author(s):  
Tsz-Kwong Man ◽  
Mohammad Javad Najaf Panah ◽  
Jessica L. Elswood ◽  
Pavel Sumazin ◽  
Michele S. Redell

Abstract Introduction - Acute myeloid leukemia (AML) is an aggressive disease with a relapse rate of approximately 40% in children. Progress in improving cure rates has been slow, in part because AML is very heterogeneous. Molecular studies consistently show that most cases are comprised of distinct subclones that diminish or expand over the course of therapy. Single-cell profiling methods now allow parsing of the leukemic population into subsets based on gene and/or protein expression patterns. We hypothesized that comparing the features of the subsets that are dominant at relapse with those that are dominant at diagnosis would reveal mechanisms of treatment failure. Methods - We profiled diagnosis-relapse pairs from 6 pediatric AML patients by Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-Seq). All patients were treated at Texas Children's Cancer Center and consented to banking of tissue for research. CITE-Seq was performed by Immunai (New York, NY) using a customized panel of 65 oligonucleotide-tagged antibodies, the 10x Genomics Chromium system for single-cell RNA library generation, and the Novaseq 6000 for sequencing. After data cleanup and normalization, clustering by scRNA-seq was done using the Seurat package. Cell-type identification of clusters was facilitated by published healthy bone marrow scRNA-seq datasets (van Galen et al, Cell 2019). Differentially expressed genes (DEGs) and proteins (DEPs) between diagnosis and relapse were determined using Wilcoxin ranked sum tests. Results - We generated single-cell transcriptomes for a total of 28,486 cells from 12 samples, with a mean of 2373 cells and 1416 genes per sample. Samples were integrated with batch effect correction, producing 30 distinct clusters (cell types) in total (Figure 1A). Cell types with expression profiles consistent with lymphocytes and erythroid precursors were identified in multiple patients, whereas AML cell types tended to be specific to individual patients (Figure 1B). For patients TCH1, TCH2 and TCH3, the most abundant cell types at diagnosis were rare at relapse, and cell types that were rare at diagnosis became dominant at relapse. For these 3 cases, we identified DEGs between the dominant diagnosis cell types and dominant relapse cell types. We found 18 genes that were upregulated at relapse in at least 2 of the cases. Several genes related to actin polymerization were enriched (ARPC1B, ACTB, PFN1), possibly reflecting an enhanced capacity for adhesion and migration. Also of note, macrophage migration inhibitory factor (MIF) and its receptor CD74 were upregulated at relapse, suggesting a role in chemoresistance. For patients TCH4, TCH5 and TCH6, the same cell types that were abundant at diagnosis were also abundant at relapse, and few genes were significantly altered between diagnosis and relapse in multiple cases. Only SRGN, which encodes the proteoglycan serglycin, and GAPDH were altered in 2 of these 3 cases, and both were downregulated at relapse. We performed similar comparisons to identify proteins that were differentially expressed between diagnosis and relapse pairs. The number of DEPs between the dominant diagnosis and relapse cell types ranged from 0 (TCH1 and TCH6) to 5 (TCH2). The only protein altered in more than one case was CD7, which was enriched at relapse in TCH2, TCH3 and TCH4. Conclusions - From CITE-Seq profiling of 6 pediatric AML cases we identified two distinct patterns of relapse. For 3 cases, relapse occurred by expansion of a subset that was small but present at diagnosis. Enrichment of genes associated with adhesion and survival signaling suggests that these cells survived because they were well-equipped to take advantage of interactions with the microenvironment. For 3 other cases, the population that was dominant at diagnosis persisted and expanded at relapse with few substantial changes in gene or protein expression profiles. This pattern suggests that these AML cells were a priori equipped to survive chemotherapy, even though bulk disease levels were transiently reduced below the limit of detection. Most profiled proteins did not change substantially between diagnosis and relapse. An exception is CD7, which was enriched at relapse in 50% of our cases and represents a potential therapeutic target. Analysis of more cases will refine these relapse patterns, reveal potential mechanisms of chemoresistance and inform the development of novel therapies. Figure 1 Figure 1. Disclosures No relevant conflicts of interest to declare.


Sign in / Sign up

Export Citation Format

Share Document