Content-based search of gene expression databases using binary fingerprints of differential expression profiles

Author(s):  
Francis Bell ◽  
Ahmet Sacan
2019 ◽  
Vol 20 (23) ◽  
pp. 6098 ◽  
Author(s):  
Amarinder Singh Thind ◽  
Kumar Parijat Tripathi ◽  
Mario Rosario Guarracino

The comparison of high throughput gene expression datasets obtained from different experimental conditions is a challenging task. It provides an opportunity to explore the cellular response to various biological events such as disease, environmental conditions, and drugs. There is a need for tools that allow the integration and analysis of such data. We developed the “RankerGUI pipeline”, a user-friendly web application for the biological community. It allows users to use various rank based statistical approaches for the comparison of full differential gene expression profiles between the same or different biological states obtained from different sources. The pipeline modules are an integration of various open-source packages, a few of which are modified for extended functionality. The main modules include rank rank hypergeometric overlap, enriched rank rank hypergeometric overlap and distance calculations. Additionally, preprocessing steps such as merging differential expression profiles of multiple independent studies can be added before running the main modules. Output plots show the strength, pattern, and trends among complete differential expression profiles. In this paper, we describe the various modules and functionalities of the developed pipeline. We also present a case study that demonstrates how the pipeline can be used for the comparison of differential expression profiles obtained from multiple platforms’ data of the Gene Expression Omnibus. Using these comparisons, we investigate gene expression patterns in kidney and lung cancers.


2018 ◽  
Vol 115 (21) ◽  
pp. 5492-5497 ◽  
Author(s):  
Iskander Said ◽  
Ashley Byrne ◽  
Victoria Serrano ◽  
Charis Cardeno ◽  
Christopher Vollmers ◽  
...  

Chromosomal inversions are widely thought to be favored by natural selection because they suppress recombination between alleles that have higher fitness on the same genetic background or in similar environments. Nonetheless, few selected alleles have been characterized at the molecular level. Gene expression profiling provides a powerful way to identify functionally important variation associated with inversions and suggests candidate phenotypes. However, altered genome structure itself might also impact gene expression by influencing expression profiles of the genes proximal to inversion breakpoint regions or by modifying expression patterns genome-wide due to rearranging large regulatory domains. In natural inversions, genetic differentiation and genome structure are inextricably linked. Here, we characterize differential expression patterns associated with two chromosomal inversions found in natural Drosophila melanogaster populations. To isolate the impacts of genome structure, we engineered synthetic chromosomal inversions on controlled genetic backgrounds with breakpoints that closely match each natural inversion. We find that synthetic inversions have negligible effects on gene expression. Nonetheless, natural inversions have broad-reaching regulatory impacts in cis and trans. Furthermore, we find that differentially expressed genes associated with both natural inversions are enriched for loci associated with immune response to bacterial pathogens. Our results support the idea that inversions in D. melanogaster experience natural selection to maintain associations between functionally related alleles to produce complex phenotypic outcomes.


Blood ◽  
2014 ◽  
Vol 124 (21) ◽  
pp. 1894-1894
Author(s):  
Hogune Im ◽  
Varsha Rao ◽  
Kunju Joshi Sridhar ◽  
Rui Chen ◽  
George Mias ◽  
...  

Abstract Background: Prior studies using microarray platforms have shown alterations of gene expression profiles (GEPs) in MDS CD34+ marrow cells related to clinical outcomes (Sridhar et al, Blood 2009, Pellagatti et al, JCO 2013). Given the increased sensitivity and accuracy of high-throughput RNA sequencing (RNA-Seq) (Mortazavi et al, Nat Meth 2008, Soon et al, Mol Syst Bio 2012) for detecting and quantifying mRNA transcripts, we applied this methodology for evaluating differential gene expression between MDS and normal CD34+ marrow cells. Methods:RNA was isolated from magnetic bead affinity-enriched CD34+ (>90%) marrow aspirate cells (Miltenyi Biotec, Auburn, CA) and amplified using the Smarter Kit (Clontech, Mt View, CA). The amplified product (ds DNA) was fragmented to a size distribution of ~200-300bp using the E220 Focused Ultrasonicator (Covaris Inc, Woburn, MA). End repair, adapter ligation and PCR amplification were performed using the NEBNext Ultra RNA library prep kit for Illumina (New England Biolabs, Ipswich, MA). The indexed cDNA libraries were sequenced (paired end, 100bp) on an Illumina HiSeq2000 platform with median read counts of 69 million. The sequences were aligned to Human Reference sequence hg19 using DNAnexus mapper with gene detection focused on known annotated genes. The differential expression was analyzed using edgeR. DAVID and Ingenuity IPA programs were used for pathway analyses. Gene Set Enrichment Analysis (GSEA) was used to identify biologic processes in our dataset present across phenotypes. Results: Correlations of RNA-Seq data from unamplified to amplified transcripts demonstrated high fidelity of transcripts obtained (Pearson and Spearman R2 = 0.80). After filtering samples for adequate read counts, 12,323 genes were evaluated. Differential expression analysis yielded 719 differentially expressed genes (DEGs) in MDS (n=30) vs normal (n=21) with FDR <.05. Among the DEGs, 548 and 171 were over- and under-expressed ≥2 fold in MDS vs Normal, respectively: 20% of the overexpressed genes were present in >50% of the patients. Hierarchical cluster analysis using these DEGs confirmed clear separation of MDS patients from normals, with 2 differential expression clusters—one region overexpressed and one underexpressed. A distinctive trend toward clustering of the patients was seen which related to their IPSS categories and marrow blast %. In functional pathway analysis of the 2 distinctive gene clusters which distinguished MDS from normal, the underexpressed MDS DEGs demonstrated enrichment of inflammatory cytokines, oxidative stress and interleukin signaling pathways, plus mitochondrial calcium transport; whereas the MDS overexpressed DEG cluster showed enrichment of adherens junction/cytokeletal remodeling, cell cycle control of chromosome replication and DNA damage response pathways. Using GSEA analysis, significantly increased numbers of genes in MDS vs normal, common to those in gene sets present within curated public databases, were involved with TP53 targets and mTOR signaling pathways. Conclusions: Our study demonstrated that RNA-Seq methodology, a high-throughput and more comprehensive technique than most gene expression microarrays, was capable of showing significant and distinctive differences in gene expression between MDS and normal marrow CD34+ cells. Specific clustering of the DEGs was demonstrated to distinguish patient subsets associated with their major clinical features. Further, the stringently identified DEGs shown to be engaged in functional pathways and biologic processes highly relevant for MDS were extant within the patients’ CD34+ cells. These transcriptomic data provide information complementary to exomic mutational findings contributing to improved understanding of biologic mechanisms underlying MDS. Disclosures No relevant conflicts of interest to declare.


2016 ◽  
Vol 1 (1) ◽  
pp. 34 ◽  
Author(s):  
Wiktor Mazin ◽  
Joseph A Tamm ◽  
Irina A Antonijevic ◽  
Aicha Abdourahman ◽  
Munish Das ◽  
...  

Gene expression profiles in blood are increasingly being used to identify biomarkers for different affective disorders. We have selected a set of 29 genes to generate expression profiles for healthy control subjects as well as for patients diagnosed with acute post-traumatic stress disorder (PTSD) and with borderline personality disorder (BPD). Measurements were performed by quantitative polymerase chain reaction (qPCR). Using the actual data in an anonym-ous form we constructed a series of artificial data sets with known gene expression profiles. These sets were used to test 14 classification algorithms and feature selection methods for their ability to identify the correct expression patterns. Application of the three most effective algorithms to the actual expression data showed that control subjects can be dis-tinguished from BPD patients based on differential expression levels of the gene transcripts Gi2, GR and MAPK14, targets that may have links to stress related diseases. Controls can also be distinguished from acute PTSD patients by differential expression levels of the transcripts for ERK2 and RGS2 that are known to be associated with mood disord-ers and social anxiety. We conclude that it is possible to identify informative transcription profiles in blood samples from individuals with affective disorders.


Blood ◽  
2010 ◽  
Vol 116 (21) ◽  
pp. 2435-2435
Author(s):  
Xiao-Jie Yan ◽  
Daniel Kalenscher ◽  
Erin Boyle ◽  
Arrti Damle ◽  
Sophie Yancopoulos ◽  
...  

Abstract Abstract 2435 Introduction: Chronic Lymphocyte Leukemia (CLL) is characterized by the slow clonal expansion of B-lymphocytes, which eventually overwhelm healthy immune cells, therefore hindering normal function. In order to determine the underlying mechanisms for this uncontrolled proliferation, gene expression data was gathered through cDNA microarray analysis of the B-cells from CLL patients. The data was split into two groups depending on expression levels of Ki67, a strong marker for cellular proliferation. In our study, high Ki67 expression only showed in 60% of unmutated CLL patients and higher in CD38+ CLL B cells. The expression of individuals with high levels of Ki67 was compared to the expression profiles of individuals with low levels of Ki67. This allowed for the determination of differences in gene expression correlated to an increase in proliferation of B-cells. Methods: 1) The RNA were purified from CLL B cells which were isolated by RosetteSep Human B cell Enrichment Cocktail (StemCell Technologies) during 4 hours after blood drawn from patients. Microarray assay was performed on Illumina HumanWG-6 expression beadchips. 2) The samples were devided to 2 groups according Ki67 surface expression, 9 samples with Ki67high (>5%) and 26 samples with Ki67low (<5%). 3) Differential expression of gene were first analyzed using Partek. 98 genes were selected as differentially expressed if the fold change was greater than 1.5-fold and p value is lower than <0.01. 4) This set of genes was further examined by pathway analysis (Ingenuity pathway analyses). Results: 1) Time to first time treatment (TTFT) were compared between Ki67high and Ki67low patients. Ki67high patients had a significantly shorter TTFT (2.76 yr) compared to Ki67low patients (23.46 years; P<0.0001). 2) 98 genes were selected as differentially expressed between Ki67high and Ki67low groups. 78 genes were upregulated and 20 genes downregulated in Ki67high group. For the location of these genes, Extracellular space: 9 genes; Plasma membrane: 22 genes; Cytoplasma: 21 genes and Nucleus: 18 genes. 3) Several networks, which involved in cell growth and proliferation, cell cycles and cellular development, has revealed in which there were significant changes in gene expression correlated to high levels of Ki-67 activity. 4) As would be expected, distinct changes in expression were those relating to cell proliferation (23 genes), cell movement and migration (14 genes) as well as, cell-signaling and interaction (20 genes) and cell death (23 genes). 5) To compare this gene list to another set microarray analyses in CLL patients. 28 genes were identified in both experiments. 6) One pathway which particular involved with NF-κB (nuclear factor kappa-light-chain-enhancer of activated B cells) was selected. The genes precipitated in this pathway are CDKN2C, CRY1, DFNA5, FKBP5, HIST2H2AA3, IL15, LPL, MPSK, MYLK, PAK1, ZAP70. NF-κB is a transcription factor involved in the regulation of various genes relating to apoptosis, cell survival, and proliferation. Although unchanged in its own expression, various regulators of NF-κB displayed a significant change in expression. Conclusion and discussion: Ki67 is a nuclear protein which upregulated in G1, S, G2, and M phases of the cell cycle but is absent from resting cells (G0 phase1). We reported that Ki67+ cell is enriched in CD38+ cells regardless of the percentage of CD38+ cells in a patient's CLL clone2. To analyze gene expression profile in Ki67high and Ki67low groups will determine the possible interactions responsible for the proliferative nature of B-cells in CLL. Misregulation of the NF-κB complex has been implicated in the development of various immunological diseases as well as cancer, therefore identifying it as a candidate for further research in CLL development and treatment. Disclosures: No relevant conflicts of interest to declare.


BMC Genomics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Rannyele Passos Ribeiro ◽  
Guillermo Ponz-Segrelles ◽  
Christoph Bleidorn ◽  
Maria Teresa Aguado

Abstract Background Annelids exhibit remarkable postembryonic developmental abilities. Most annelids grow during their whole life by adding segments through the action of a segment addition zone (SAZ) located in front of the pygidium. In addition, they show an outstanding ability to regenerate their bodies. Experimental evidence and field observations show that many annelids are able to regenerate their posterior bodies, while anterior regeneration is often limited or absent. Syllidae, for instance, usually show high abilities of posterior regeneration, although anterior regeneration varies across species. Some syllids are able to partially restore the anterior end, while others regenerate all lost anterior body after bisection. Here, we used comparative transcriptomics to detect changes in the gene expression profiles during anterior regeneration, posterior regeneration and regular growth of two syllid species: Sphaerosyllis hystrix and Syllis gracilis; which exhibit limited and complete anterior regeneration, respectively. Results We detected a high number of genes with differential expression: 4771 genes in S. hystrix (limited anterior regeneration) and 1997 genes in S. gracilis (complete anterior regeneration). For both species, the comparative transcriptomic analysis showed that gene expression during posterior regeneration and regular growth was very similar, whereas anterior regeneration was characterized by up-regulation of several genes. Among the up-regulated genes, we identified putative homologs of regeneration-related genes associated to cellular proliferation, nervous system development, establishment of body axis, and stem-cellness; such as rup and JNK (in S. hystrix); and glutamine synthetase, elav, slit, Hox genes, β-catenin and PL10 (in S. gracilis). Conclusions Posterior regeneration and regular growth show no significant differences in gene expression in the herein investigated syllids. However, anterior regeneration is associated with a clear change in terms of gene expression in both species. Our comparative transcriptomic analysis was able to detect differential expression of some regeneration-related genes, suggesting that syllids share some features of the regenerative mechanisms already known for other annelids and invertebrates.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Nathalie Charbonnel ◽  
Maxime Galan ◽  
Caroline Tatard ◽  
Anne Loiseau ◽  
Christophe Diagne ◽  
...  

Abstract Biological invasions are major anthropogenic changes associated with threats to biodiversity and health. However, what determines the successful establishment and spread of introduced populations remains unclear. Here, we explore several hypotheses linking invasion success and immune phenotype traits, including those based on the evolution of increased competitive ability concept. We compared gene expression profiles between anciently and recently established populations of two major invading species, the house mouse Mus musculus domesticus and the black rat Rattus rattus, in Senegal (West Africa). Transcriptome analyses identified differential expression between anciently and recently established populations for 364 mouse genes and 83 rat genes. All immune-related genes displaying differential expression along the mouse invasion route were overexpressed at three of the four recently invaded sites studied. Complement activation pathway genes were overrepresented among these genes. By contrast, no particular immunological process was found to be overrepresented among the differentially expressed genes of black rat. Changes in transcriptome profiles were thus observed along invasion routes, but with different specific patterns between the two invasive species. These changes may be driven by increases in infection risks at sites recently invaded by the house mouse, and by stochastic events associated with colonization history for the black rat. These results constitute a first step toward the identification of immune eco-evolutionary processes potentially involved in the invasion success of these two rodent species.


Sign in / Sign up

Export Citation Format

Share Document