Identifying Cancer Subtypes Using a Residual Graph Convolution Model on a Sample Similarity Network

Cancer subtype classification helps us to understand the pathogenesis of cancer and to develop new cancer drugs and treatments from which patients would benefit most. Most previous studies detect cancer subtypes by extracting features from individual samples, ignoring their associations with others. We believe that the interactions of cancer samples can help identify cancer subtypes. This work proposes a cancer subtype classification method based on a residual graph convolutional network and a sample similarity network. First, we constructed a sample similarity network based on cancer gene co-expression patterns. Then, the gene expression profiles of cancer samples, used as initial features, and the sample similarity network were passed into a two-layer graph convolutional network (GCN) model. We introduced the initial features to the GCN model to avoid over-smoothing during the training process. Finally, the classification of cancer subtypes was obtained through a softmax activation function. Our model was applied to breast invasive carcinoma (BRCA), glioblastoma multiforme (GBM) and lung cancer (LUNG) datasets. The accuracy values of our model reached 82.58%, 85.13% and 79.18% for BRCA, GBM and LUNG, respectively, which outperformed the existing methods. The survival analysis of our results proves the significant clinical features of the cancer subtypes identified by our model. Moreover, we can leverage our model to detect the essential genes enriched in gene ontology (GO) terms and the biological pathways related to a cancer subtype.
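
The forward pass described in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the symmetric normalization with self-loops, the mixing weight `alpha`, the projection matrix `w_res` that maps the initial features to the hidden size, and the random toy inputs are all assumptions made for illustration.

```python
import numpy as np

def normalize_adjacency(adj):
    """Symmetrically normalize an adjacency matrix after adding self-loops (assumed scheme)."""
    a_hat = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def softmax(z):
    """Row-wise softmax with the usual max-subtraction for numerical stability."""
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def residual_gcn_forward(adj, x, w0, w1, w_res, alpha=0.5):
    """Two-layer GCN in which the initial features x are mixed back into the hidden layer."""
    a_norm = normalize_adjacency(adj)
    hidden = np.maximum(a_norm @ x @ w0, 0.0)           # first graph convolution + ReLU
    hidden = (1 - alpha) * hidden + alpha * (x @ w_res)  # re-inject initial features (hypothetical form)
    return softmax(a_norm @ hidden @ w1)                 # second graph convolution + softmax

# Toy example: 4 samples on a path-shaped similarity network, 5 genes, 2 subtypes.
rng = np.random.default_rng(0)
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [0, 0, 1, 0]], dtype=float)
x = rng.normal(size=(4, 5))
probs = residual_gcn_forward(adj, x,
                             rng.normal(size=(5, 3)),
                             rng.normal(size=(3, 2)),
                             rng.normal(size=(5, 3)))
```

Each row of `probs` is a probability distribution over subtypes for one sample. In a real setting the weight matrices would be learned by minimizing cross-entropy on labeled samples.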

Cancer subtype classification and modeling by pathway attention and propagation

Bioinformatics ◽

10.1093/bioinformatics/btaa203 ◽

2020 ◽

Vol 36 (12) ◽

pp. 3818-3824

Author(s):

Sangseon Lee ◽

Sangsoo Lim ◽

Taeheon Lee ◽

Inyoung Sung ◽

Sun Kim

Keyword(s):

Expression Profiles ◽

Gene Expression Profiles ◽

Supplementary Information ◽

Convolutional Network ◽

Cancer Subtypes ◽

Subtype Classification ◽

Pathway Gene ◽

Human Genes ◽

Cancer Subtype ◽

Network Propagation

Abstract Motivation Biological pathways are an important form of curated knowledge about biological processes. Thus, cancer subtype classification based on pathways will be very useful for understanding differences in biological mechanisms among cancer subtypes. However, pathways include only a fraction of the entire gene set (only one-third of human genes in KEGG), and pathways are fragmented. For this reason, there are few computational methods that use pathways for cancer subtype classification. Results We present an explainable deep-learning model with attention mechanism and network propagation for cancer subtype classification. Each pathway is modeled by a graph convolutional network. Then, a multi-attention-based ensemble model combines several hundreds of pathways in an explainable manner. Lastly, network propagation on a pathway–gene network explains why gene expression profiles in subtypes are different. In experiments with five TCGA cancer datasets, our method achieved very good classification accuracies and, additionally, identified subtype-specific pathways and biological functions. Availability and implementation The source code is available at http://biohealth.snu.ac.kr/software/GCN_MAE. Supplementary information Supplementary data are available at Bioinformatics online.
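
Network propagation is commonly implemented as a random walk with restart. The sketch below shows that generic technique on a toy graph; the abstract does not state which propagation variant the authors use, so the restart probability, the column normalization, and the three-node graph are assumptions.

```python
import numpy as np

def propagate(adj, seed, restart=0.3, tol=1e-8, max_iter=1000):
    """Random walk with restart: spread seed scores over a network until they stabilize."""
    adj = np.asarray(adj, dtype=float)
    col_sums = adj.sum(axis=0)
    col_sums[col_sums == 0] = 1.0          # avoid division by zero for isolated nodes
    w = adj / col_sums                      # column-normalized transition matrix
    p0 = np.asarray(seed, dtype=float)
    p0 = p0 / p0.sum()
    p = p0.copy()
    for _ in range(max_iter):
        p_next = (1 - restart) * (w @ p) + restart * p0
        if np.abs(p_next - p).sum() < tol:  # L1 change below tolerance: converged
            return p_next
        p = p_next
    return p

# Toy path graph 0 - 1 - 2 with all seed weight on node 0 (hypothetical data).
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
scores = propagate(adj, seed=[1, 0, 0])
```

Nodes closer to the seed end up with higher scores, which is the property that lets propagated scores highlight genes near an informative pathway.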

Discovering lncRNA Mediated Sponge Interactions in Breast Cancer Molecular Subtypes

10.1101/209015 ◽

2017 ◽

Author(s):

Gulden Olgun ◽

Ozgur Sahin ◽

Oznur Tastan

Keyword(s):

Breast Cancer ◽

Hox Genes ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Functional Enrichment ◽

Breast Cancer Subtypes ◽

Integrative Approach ◽

Cancer Subtypes ◽

Significant Difference

Abstract Motivation Long non-coding RNAs (lncRNAs) can indirectly regulate mRNA expression levels by sequestering microRNAs (miRNAs), and act as competing endogenous RNAs (ceRNAs) or as sponges. Previous studies identified lncRNA-mediated sponge interactions in various cancers including breast cancer. However, breast cancer subtypes are quite distinct in terms of their molecular profiles; therefore, ceRNAs are expected to be subtype-specific as well. Results To find lncRNA-mediated ceRNA interactions in breast cancer subtypes, we develop an integrative approach. We conduct partial correlation analysis and kernel independence tests on patient gene expression profiles and further refine the candidate interactions with miRNA target information. We find that although there are sponges common to multiple subtypes, there are also distinct subtype-specific interactions. Functional enrichment of mRNAs that participate in these interactions highlights distinct biological processes for different subtypes. Interestingly, some of the ceRNAs also reside in close proximity in the genome; for example, those involving HOX genes, HOTAIR, miR-196a-1 and miR-196a-2. We also discover subtype-specific sponge interactions with high prognostic potential. For instance, when grouping is based on the expression patterns of specific sponge interactions, patients differ significantly in their survival distributions. If, on the other hand, patients are grouped based on the individual RNA expression profiles of the sponge participants, they do not exhibit a significant difference in survival. These results can help shed light on subtype-specific mechanisms of breast cancer, and the methodology developed herein can help uncover sponges in other diseases.
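
The idea behind the partial correlation step can be sketched with the standard first-order partial correlation formula. This is an illustration on simulated data, not the authors' pipeline: the variable names, noise levels, and sample size are assumptions, and the kernel independence test is not shown.

```python
import math
import random

def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)

def partial_corr(x, y, z):
    """Correlation of x and y after removing the linear effect of z."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))

# Simulated expression: one miRNA represses both a lncRNA and an mRNA (hypothetical data).
random.seed(0)
mirna = [random.gauss(0, 1) for _ in range(2000)]
lncrna = [-m + random.gauss(0, 0.5) for m in mirna]
mrna = [-m + random.gauss(0, 0.5) for m in mirna]

raw = pearson(lncrna, mrna)                  # strong, driven by the shared miRNA
partial = partial_corr(lncrna, mrna, mirna)  # close to zero once the miRNA is controlled for
```

A large drop from `raw` to `partial` is the signature of a correlation that is mediated by the miRNA, which is what makes a lncRNA–miRNA–mRNA triplet a sponge candidate.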

Characterization of Interactions between the Soybean Salt-Stress Responsive Membrane-Intrinsic Proteins GmPIP1 and GmPIP2

Agronomy ◽

10.3390/agronomy11071312 ◽

2021 ◽

Vol 11 (7) ◽

pp. 1312

Author(s):

Jia Liu ◽

Weicong Qi ◽

Haiying Lu ◽

Hongbo Shao ◽

Dayong Zhang

Keyword(s):

Plasma Membrane ◽

Salt Stress ◽

Salt Tolerance ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Early Gene ◽

Plant Responses ◽

Constitutive Overexpression ◽

Important Trait

Salt tolerance is an important trait in soybean cultivation and breeding. Plant responses to salt stress include physiological and biochemical changes that affect the movement of water across the plasma membrane. Plasma membrane intrinsic proteins (PIPs) localize to the plasma membrane and regulate the flow of water and solutes. In this study, quantitative real-time PCR and yeast two-hybrid assays were used to analyze the early gene expression profiles and interactions of a set of soybean PIPs (GmPIPs) in response to salt stress. A total of 20 GmPIP-encoding genes had varied expression profiles after salt stress. Among them, 13 genes exhibited a downregulated expression pattern, including GmPIP1;6, the constitutive overexpression of which could improve soybean salt tolerance, and its close homologs GmPIP1;7 and 1;5. Three genes showed upregulated patterns, including the GmPIP1;6 close homolog GmPIP1;4, while four genes showed expression that first increased and then decreased. GmPIP1;5 and GmPIP1;6 could both physically interact strongly with GmPIP2;2, GmPIP2;4, GmPIP2;6, GmPIP2;8, GmPIP2;9, GmPIP2;11, and GmPIP2;13. Definite interactions between GmPIP1;6 and GmPIP1;7 were detected and GmPIP2;9 performed homo-interaction. The interactions of GmPIP1;5 with GmPIP2;11 and 2;13, GmPIP1;6 with GmPIP2;9, 2;11 and GmPIP2;13, and GmPIP2;9 with itself were strengthened upon salt stress rather than osmotic stress. Taken together, we inferred that GmPIP1 type and GmPIP2 type could associate with each other to synergistically function in the plant cell; a salt-stress environment could promote part of their interactions. This result provided new clues to further understand the soybean PIP–isoform interactions, which lead to potentially functional homo- and heterotetramers for salt tolerance.

Discovering Distinct Patterns in Gene Expression Profiles

Journal of Integrative Bioinformatics ◽

10.1515/jib-2008-105 ◽

2008 ◽

Vol 5 (2) ◽

Cited By ~ 1

Author(s):

Li Teng ◽

Laiwan Chan

Keyword(s):

Gene Expression ◽

Large Scale ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Clustering Methods ◽

Gene Expressions ◽

Real Gene ◽

Large Scale Dataset ◽

Coexpressed Genes

Summary Traditional analysis of gene expression profiles uses clustering to find groups of coexpressed genes which have similar expression patterns. However, clustering is time consuming and can be difficult for very large-scale datasets. We propose the idea of Discovering Distinct Patterns (DDP) in gene expression profiles. Since the patterns shown by gene expression reveal the underlying regulatory mechanisms, it is important to find all the different patterns existing in the dataset when there is little prior knowledge. It is also a helpful start before taking on further analysis. We propose an algorithm for DDP that iteratively picks out pairs of gene expression patterns which have the largest dissimilarities. This method can also be used as preprocessing to initialize centers for clustering methods, like K-means. Experiments on both synthetic and real gene expression datasets show that our method is very effective in finding distinct patterns which have gene functional significance, and is also efficient.
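
The summary does not give the details of the DDP algorithm, so the sketch below swaps in a related and well-known technique, farthest-first selection: it starts from the most dissimilar pair of profiles and then repeatedly adds the profile farthest from everything already chosen. The function name, the Euclidean distance, and the toy profiles are assumptions.

```python
import math

def pick_distinct_patterns(profiles, k):
    """Pick k mutually dissimilar profiles by farthest-first selection (not the paper's DDP)."""
    n = len(profiles)
    dist = [[math.dist(profiles[a], profiles[b]) for b in range(n)] for a in range(n)]
    # Start from the single most dissimilar pair of profiles.
    first, second = max(((a, b) for a in range(n) for b in range(a + 1, n)),
                        key=lambda ab: dist[ab[0]][ab[1]])
    chosen = [first, second]
    # Repeatedly add the profile whose nearest chosen profile is farthest away.
    while len(chosen) < min(k, n):
        remaining = [g for g in range(n) if g not in chosen]
        chosen.append(max(remaining, key=lambda g: min(dist[g][c] for c in chosen)))
    return chosen[:k]

# Toy expression profiles forming three loose groups (hypothetical data).
profiles = [(0.0, 0.0), (0.1, 0.0), (10.0, 0.0), (10.0, 0.1), (0.0, 10.0)]
centers = pick_distinct_patterns(profiles, k=3)
```

The selected indices cover one profile from each group, so they could serve as initial centers for K-means, which is the preprocessing use the summary mentions.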

CFTR ΔF508 mutation has minimal effect on the gene expression profile of differentiated human airway epithelia

AJP Lung Cellular and Molecular Physiology ◽

10.1152/ajplung.00065.2005 ◽

2005 ◽

Vol 289 (4) ◽

pp. L545-L553 ◽

Cited By ~ 29

Author(s):

Joseph Zabner ◽

Todd E. Scheetz ◽

Hakeem G. Almabrazi ◽

Thomas L. Casavant ◽

Jian Huang ◽

...

Keyword(s):

Gene Expression ◽

Cystic Fibrosis ◽

Large Scale ◽

Expression Profiles ◽

Expression Patterns ◽

Primary Cultures ◽

Gene Expression Profiles ◽

Filter Method ◽

Tissue Destruction ◽

Airway Epithelia

Cystic fibrosis (CF) is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR), an epithelial chloride channel regulated by phosphorylation. Most of the disease-associated morbidity is the consequence of chronic lung infection with progressive tissue destruction. As an approach to investigate the cellular effects of CFTR mutations, we used large-scale microarray hybridization to contrast the gene expression profiles of well-differentiated primary cultures of human CF and non-CF airway epithelia grown under resting culture conditions. We surveyed the expression profiles for 10 non-CF and 10 ΔF508 homozygote samples. Of the 22,283 genes represented on the Affymetrix U133A GeneChip, we found evidence of significant changes in expression in 24 genes by two-sample t-test (P < 0.00001). A second, three-filter method of comparative analysis found no significant differences between the groups. The levels of CFTR mRNA were comparable in both groups. There were no significant differences in the gene expression patterns between male and female CF specimens. There were 18 genes with significant increases and 6 genes with decreases in CF relative to non-CF samples. Although the function of many of the differentially expressed genes is unknown, one transcript that was elevated in CF, the KCl cotransporter (KCC4), is a candidate for further study. Overall, the results indicate that CFTR dysfunction has little direct impact on airway epithelial gene expression in samples grown under these conditions.
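
As a rough check on the t-test threshold quoted above, the number of genes expected to pass by chance alone can be computed from the figures in the abstract. This is a back-of-the-envelope calculation that assumes every gene is truly unchanged and the p-values are well calibrated; it is not part of the authors' analysis.

```python
n_genes = 22_283       # genes on the array, from the abstract
p_threshold = 0.00001  # significance cutoff, from the abstract
observed = 24          # genes reported as significant, from the abstract

# Expected number of genes passing the cutoff if no gene were truly changed.
expected_false_positives = n_genes * p_threshold
print(f"{expected_false_positives:.2f}")  # about 0.22
```

Under these assumptions, fewer than one gene would be expected to pass by chance, so the 24 reported genes are well above that baseline while still being a very small fraction of the array.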

Comparative transcriptomics identifies differences in the regulation of the floral transition between Arabidopsis and Brassica rapa cultivars

10.1101/2020.08.26.266494 ◽

2020 ◽

Author(s):

Alexander Calderwood ◽

Jo Hepworth ◽

Shannon Woodhouse ◽

Lorelei Bilham ◽

D. Marc Jones ◽

...

Keyword(s):

Gene Expression ◽

Brassica Rapa ◽

Regulatory Networks ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Detailed Comparison ◽

Developmental Time ◽

Floral Transition ◽

Gene Regulatory

Abstract The timing of the floral transition affects reproduction and yield; however, its regulation in crops remains poorly understood. Here, we use RNA-Seq to determine and compare gene expression dynamics through the floral transition in the model species Arabidopsis thaliana and the closely related crop Brassica rapa. A direct comparison of gene expression over time between species shows little similarity, which could lead to the inference that different gene regulatory networks are at play. However, these differences can be largely resolved by synchronisation, through curve registration, of gene expression profiles. We find that different registration functions are required for different genes, indicating that there is no common ‘developmental time’ to which Arabidopsis and B. rapa can be mapped through gene expression. Instead, the expression patterns of different genes progress at different rates. We find that co-regulated genes show similar changes in synchronisation between species, suggesting that similar gene regulatory sub-network structures may be active with different wiring between them. A detailed comparison of the regulation of the floral transition between Arabidopsis and B. rapa, and between two B. rapa accessions, reveals different modes of regulation of the key floral integrator SOC1, and that the floral transition in the B. rapa accessions is triggered by different pathways, even when grown under the same environmental conditions. Our study adds to the mechanistic understanding of the regulatory network of flowering time in rapid cycling B. rapa under long days and highlights the importance of registration methods for the comparison of developmental gene expression data.
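
Curve registration can be illustrated in its simplest form as a search for the time shift that best aligns one expression time course with another. The sketch below is a simplification under stated assumptions: it considers shifts only (practical registration functions typically also stretch the time axis), uses linear interpolation and mean squared error, and runs on made-up time courses.

```python
def interpolate(times, values, t):
    """Linearly interpolate values at time t; return None outside the sampled range."""
    if t < times[0] or t > times[-1]:
        return None
    for k in range(len(times) - 1):
        t0, t1 = times[k], times[k + 1]
        if t0 <= t <= t1:
            frac = (t - t0) / (t1 - t0)
            return values[k] + frac * (values[k + 1] - values[k])
    return values[-1]

def best_shift(ref_times, ref_values, query_times, query_values, shifts, min_overlap=3):
    """Return (shift, mse) such that query(t) best matches ref(t + shift)."""
    best = None
    for s in shifts:
        errors = []
        for t, q in zip(query_times, query_values):
            r = interpolate(ref_times, ref_values, t + s)
            if r is not None:
                errors.append((q - r) ** 2)
        if len(errors) < min_overlap:   # too few overlapping points to compare
            continue
        mse = sum(errors) / len(errors)
        if best is None or mse < best[1]:
            best = (s, mse)
    return best

# Hypothetical gene: the query species follows the same curve, two time units ahead.
ref_times = list(range(11))
ref_values = [(t - 5) ** 2 for t in ref_times]
query_times = list(range(9))
query_values = [(t + 2 - 5) ** 2 for t in query_times]
shift, mse = best_shift(ref_times, ref_values, query_times, query_values, shifts=range(-3, 4))
```

Running the search per gene and comparing the resulting shifts is one way to see the abstract's point that different genes need different registration functions.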

Transcriptional and Physiological Analyses to Assess the Effects of a Novel Biostimulant in Tomato

Frontiers in Plant Science ◽

10.3389/fpls.2021.781993 ◽

2022 ◽

Vol 12 ◽

Author(s):

Maria Cristina Della Lucia ◽

Ali Baghdadi ◽

Francesca Mangione ◽

Matteo Borella ◽

Walter Zegada-Lizarazu ◽

...

Keyword(s):

Water Stress ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Principal Component ◽

Negative Effects ◽

Nutrient Metabolism ◽

Tomato Plants ◽

Reverse Transcription Pcr ◽

Osmotic Stress Tolerance

This work aimed to study the effects in tomato (Solanum lycopersicum L.) of foliar applications of a novel calcium-based biostimulant (SOB01) using an omics approach involving transcriptomics and physiological profiling. A calcium-chloride fertilizer (SOB02) was used as a product reference standard. Plants were grown under well-watered (WW) and water stress (WS) conditions in a growth chamber. We first compared the transcriptome profile of treated and untreated tomato plants using the software RStudio. In total, 968 and 1,657 differentially expressed genes (DEGs) (adj-p-value < 0.1 and |log2(fold change)| ≥ 1) were identified after SOB01 and SOB02 leaf treatments, respectively. Expression patterns of 9 DEGs involved in nutrient metabolism and osmotic stress tolerance were validated by real-time quantitative reverse transcription PCR (RT-qPCR) analysis. Principal component analysis (PCA) on RT-qPCR results highlighted that the gene expression profiles after SOB01 treatment in different water regimes clustered together, suggesting that the expression pattern of the analyzed genes in well-watered and water-stressed plants was similar in the presence of SOB01 treatment. Physiological analyses demonstrated that the biostimulant application increased the photosynthetic rate and the chlorophyll content under water deficiency compared to the standard fertilizer and led to a higher yield in terms of fruit dry matter and a reduction in the number of cracked fruits. In conclusion, transcriptome and physiological profiling provided comprehensive information on the biostimulant effects, highlighting that SOB01 applications improved the ability of the tomato plants to mitigate the negative effects of water stress.
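
The DEG thresholds quoted in this abstract (adjusted p-value < 0.1 and |log2(fold change)| ≥ 1) can be applied with a short filter. The sketch below is in Python for illustration only, whereas the authors report working in RStudio; the record layout and the gene names are hypothetical.

```python
def select_degs(results, padj_cutoff=0.1, lfc_cutoff=1.0):
    """Keep genes with adjusted p-value below the cutoff and |log2 fold change| at or above the cutoff."""
    return [r["gene"] for r in results
            if r["padj"] < padj_cutoff and abs(r["log2fc"]) >= lfc_cutoff]

# Hypothetical differential expression results.
results = [
    {"gene": "geneA", "padj": 0.01, "log2fc": 2.3},   # passes both filters
    {"gene": "geneB", "padj": 0.04, "log2fc": -1.0},  # passes: |log2fc| is exactly 1
    {"gene": "geneC", "padj": 0.20, "log2fc": 3.0},   # fails the adjusted p-value filter
    {"gene": "geneD", "padj": 0.001, "log2fc": 0.4},  # fails the fold-change filter
]
degs = select_degs(results)
```

Note that the fold-change test uses the absolute value, so strongly downregulated genes are retained alongside upregulated ones.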

Identify Breast Cancer Subtypes by Gene Expression Profiles

Journal of Data Science ◽

10.6339/jds.2004.02(2).210 ◽

2021 ◽

Vol 2 (2) ◽

pp. 165-175

Author(s):

Grace S. Shieh ◽

Chy-Huei Bai ◽

Chih Lee

Keyword(s):

Breast Cancer ◽

Gene Expression ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Breast Cancer Subtypes ◽

Cancer Subtypes

Spatial Transcriptomics analysis of uterine gene expression in enhancer of Zeste homolog 2 (Ezh2) conditional knockout mice

Biology of Reproduction ◽

10.1093/biolre/ioab147 ◽

2021 ◽

Author(s):

Ana M Mesa ◽

Jiude Mao ◽

Theresa I Medrano ◽

Nathan J Bivens ◽

Alexander Jurkevich ◽

...

Keyword(s):

Gene Expression ◽

Epithelial Cells ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Reproductive Organs ◽

Conditional Knockout ◽

Estrogen Signaling ◽

Uterine Epithelial Cell ◽

Stromal Tissue

Abstract Histone proteins undergo various modifications that alter chromatin structure, including addition of methyl groups. Enhancer of zeste homolog 2 (EZH2) is a histone methyltransferase that methylates lysine residue 27 and thereby suppresses gene expression. EZH2 plays an integral role in the uterus and other reproductive organs. We have previously shown that conditional deletion of uterine EZH2 results in increased proliferation of luminal and glandular epithelial cells, and RNAseq analyses reveal several uterine transcriptomic changes in Ezh2 conditional (c) knockout (KO) mice that can affect estrogen signaling pathways. To pinpoint the origin of such gene expression changes, we used the recently developed spatial transcriptomics (ST) method with the hypotheses that Ezh2cKO mice would predominantly demonstrate changes in epithelial cells and/or ablation of this gene would disrupt normal epithelial/stromal gene expression patterns. Uteri were collected from ovariectomized adult WT and Ezh2cKO mice and analyzed by ST. Asb4, Cxcl14, Dio2, and Igfbp5 were increased, and Sult1d1, Mt3, and Lcn2 were reduced, in Ezh2cKO uterine epithelium vs. WT epithelium. For Ezh2cKO uterine stroma, differentially expressed key hub genes included Cald1, Fbln1, Myh11, Acta2, and Tagln. Conditional loss of uterine Ezh2 also appears to shift the balance of gene expression profiles in epithelial vs. stromal tissue toward uterine epithelial cell and gland development and proliferation, consistent with uterine gland hyperplasia in these mice. Current findings provide further insight into how EZH2 may selectively affect uterine epithelial and stromal compartments. Additionally, these transcriptome data might provide the mechanistic understanding and valuable biomarkers for human endometrial disorders with epigenetic underpinnings.

Building Gene Networks by Analyzing Gene Expression Profiles

Advanced Methodologies and Technologies in Medicine and Healthcare - Advances in Medical Diagnosis, Treatment, and Care ◽

10.4018/978-1-5225-7489-7.ch003 ◽

2019 ◽

pp. 27-44

Author(s):

Crescenzio Gallo

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Networks ◽

Dna Microarrays ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Expression Data ◽

Gene Expressions ◽

Over Time

The possible applications of modeling and simulation in the field of bioinformatics are very extensive, ranging from understanding basic metabolic pathways to exploring genetic variability. Experiments carried out with DNA microarrays allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. In this chapter, the authors examine various methods for analyzing gene expression data, addressing the important topics of (1) selecting the most differentially expressed genes, (2) grouping them by means of their relationships, and (3) classifying samples based on gene expressions.
