Combining Hierarchical Clustering and Self-Organizing Maps for Exploratory Analysis of Gene Expression Patterns

2002 ◽  
Vol 1 (5) ◽  
pp. 467-470 ◽  
Author(s):  
Javier Herrero ◽  
Joaquín Dopazo
Blood ◽  
2005 ◽  
Vol 106 (11) ◽  
pp. 3524-3524
Author(s):  
Anil Potti ◽  
Holly K. Dressman ◽  
Murat O. Arcasoy

Abstract Hematopoietic proliferation, lineage commitment, and terminal differentiation are characterized by the emergence of a cell type-specific gene expression and transcriptional programs that determine the specific phenotype and function of cells in the erythroid lineage. Our objectives in this study were to identify unique gene expression patterns that characterize the transcriptional program of normal primary human erythroid precursors during terminal differentiation, and define the gene expression patterns seen in erythroblasts (EBL) of patients with polycythemia vera (PV). Homogenous populations of primary proEBL were generated from purified liquid cultures of CD34+ cells collected from healthy volunteers and PV patients. All patients with PV were diagnosed based on established criteria and had the JAK2-V617F mutation. Morphologic examination and surface expression of CD71 confirmed the purity of proEBL cell populations. ProEBL from normal individuals were induced to terminally differentiate generating orthochromatic EBL. RNA was extracted from normal proEBL, PV proEBL, and normal orthochromatic EBL. Affymetrix U133 Plus 2.0 arrays representing approximately 39,000 human genes were used for gene expression analysis. Four replicates from four independent primary cell cultures were analyzed for each comparison group (e.g. undifferentiated proEBL versus terminally differentiated orthochromatic EBL). Unsupervised hierarchical clustering showed distinct gene expression profiles in the proEBL and terminally differentiated EBL lineages. 1109 genes (2.0 fold change, P<0.01) were found to be differentially expressed. Numerous erythroid genes were found to be upregulated during terminal differentiation [e.g. globin genes, erythropoietin receptor, heme synthesis enzymes (ferrochelatase, ALAS2) erythrocyte membrane proteins (band 3, ankyrin, protein 4.1) and transcription factors (NFE2, Kruppel-like factors, myb, GATA2)]. As a proof of validation, the differential expression of 7 genes was verified by Northern blotting. To better understand the biologic role of the gene sets identified, using Ingenuity pathway analysis, individual genes were integrated into specific regulatory and signaling pathway networks. A total of 19 networks with significant scores (>23) were identified. Biological functions of the identified networks included RNA post-transcriptional regulation, cell cycle control, translational regulation, DNA replication and repair and cellular assembly/organization. In a proof of principle study, gene expression patterns in PV proEBL (n=6) were compared to normal proEBL (n=5). Unsupervised hierarchical clustering showed a distinct gene expression profile for PV. A binary regression predictive model was also developed to find gene expression patterns predictive for PV. Using this model a 150 gene predictor was found that could predict PV patients from control at 100% accuracy. Ingenuity pathways analysis of a subset of gene subsets demonstrated several biologically relevant networks that were distinct in patients with PV, including myc, CDC2, and JAK2. Deregulation of normal transcriptional mechanisms in hematopoietic cells is associated with the pathogenesis of PV. Further, our data shows that genomic studies provide new insights into transcriptional programs that govern erythroid differentiation, and identify biologically relevant deregulated pathways as potential targets for therapy in PV.


2021 ◽  
Vol 12 ◽  
Author(s):  
Jing Cao ◽  
Jiao Gong ◽  
Xinhua Li ◽  
Zhaoxia Hu ◽  
Yingjun Xu ◽  
...  

Objectives: The pathogenesis of heterogeneity in gastric cancer (GC) is not clear and presents as a significant obstacle in providing effective drug treatment. We aimed to identify subtypes of GC and explore the underlying pathogenesis.Methods: We collected two microarray datasets from GEO (GSE84433 and GSE84426), performed an unsupervised cluster analysis based on gene expression patterns, and identified related immune and stromal cells. Then, we explored the possible molecular mechanisms of each subtype by functional enrichment analysis and identified related hub genes.Results: First, we identified three clusters of GC by unsupervised hierarchical clustering, with average silhouette width of 0.96, and also identified their related representative genes and immune cells. We validated our findings using dataset GSE84426. Subtypes associated with the highest mortality (subtype 2 in the training group and subtype C in the validation group) showed high expression of SPARC, COL3A1, and CCN. Both subtypes also showed high infiltration of fibroblasts, endothelial cells, hematopoietic stem cells, and a high stromal score. Furthermore, subtypes with the best prognosis (subtype 3 in the training group and subtype A in the validation group) showed high expression of FGL2, DLGAP1-AS5, and so on. Both subtypes also showed high infiltration of CD4+ T cells, CD8+ T cells, NK cells, pDC, macrophages, and CD4+ T effector memory cells.Conclusion: We found that GC can be classified into three subtypes based on gene expression patterns and cell composition. Findings of this study help us better understand the tumor microenvironment and immune milieu associated with heterogeneity in GC and provide practical information to guide personalized treatment.


Pneumologie ◽  
2018 ◽  
Vol 72 (S 01) ◽  
pp. S8-S9
Author(s):  
M Bauer ◽  
H Kirsten ◽  
E Grunow ◽  
P Ahnert ◽  
M Kiehntopf ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document