Identifying functional modules using energy minimization with graph cuts

Aims: The aim of this article was to find functional (or disease-relevant) modules using gene expression data. Background: Biotechnological developments are leading to a rapid increase in the volume of transcriptome data and thus driving the growth of interactome data. This has made it possible to perform transcriptomic analysis by integrating interactome data. Considering that genes do not exist nor operate in isolation, and instead participate in biological networks, interactomics is equally important to expression profiles. Objective: We constructed a network-based method based on gene expression data in order to identify functional (or disease-relevant) modules. Method: We used the energy minimization with graph cuts method by integrating gene interaction networks under the assumption of the ‘guilt by association’ principle. Result: Our method performs well in an independent simulation experiment and has the ability to identify strongly disease-relevant modules in real experiments. Our method is able to find important functional modules associated with two subtypes of lymphoma in a lymphoma microarray dataset. Moreover, the method can identify the biological subnetworks and most of the genes associated with Duchenne muscular dystrophy. Conclusion: We successfully adapted the energy minimization with the graph cuts method to identify functionally important genes from genomic data by integrating gene interaction networks.

Download Full-text

Modelling gene interaction networks from time-series gene expression data using evolving spiking neural networks

Evolving Systems ◽

10.1007/s12530-019-09269-6 ◽

2019 ◽

Vol 11 (4) ◽

pp. 599-613 ◽

Cited By ~ 1

Author(s):

Elisa Capecci ◽

Jesus L. Lobo ◽

Ibai Laña ◽

Josafath I. Espinosa-Ramos ◽

Nikola Kasabov

Keyword(s):

Gene Expression ◽

Neural Networks ◽

Time Series ◽

Gene Expression Data ◽

Gene Interaction ◽

Interaction Networks ◽

Spiking Neural Networks ◽

Expression Data ◽

Gene Interaction Networks ◽

Time Series Gene Expression

Download Full-text

Inferring weighted and directed gene interaction networks from gene expression data using the phi-mixing coefficient

Proceedings 2012 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS) ◽

10.1109/gensips.2012.6507755 ◽

2012 ◽

Cited By ~ 3

Author(s):

Nitin Singh ◽

Mehmet Eren Ahsen ◽

Shiva Mankala ◽

M. Vidyasagar ◽

Michael White

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Interaction ◽

Interaction Networks ◽

Expression Data ◽

Mixing Coefficient ◽

Gene Interaction Networks

Download Full-text

Network-based cancer genomic data integration for pattern discovery

BMC Genomic Data ◽

10.1186/s12863-021-01004-y ◽

2021 ◽

Vol 22 (S1) ◽

Author(s):

Fangfang Zhu ◽

Jiang Li ◽

Juan Liu ◽

Wenwen Min

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Expression Profiles ◽

Mirna Gene ◽

Interaction Network ◽

Gene Interaction ◽

Functional Modules ◽

Expression Data ◽

Gene Interaction Network ◽

Gene Modules

Abstract Background Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has been proposed to bicluster gene expression data to identify gene modules. However, this model can only handle the gene expression data where no gene interaction information is integrated. Ignoring the prior gene interaction information may produce the identified gene modules hard to be biologically interpreted. Results In this paper, we develop a Sparse Network-regularized SVD (SNSVD) method that integrates a prior gene interaction network from a protein protein interaction network and gene expression data to identify underlying gene functional modules. The results on a set of simulated data show that SNSVD is more effective than the traditional SVD-based methods. The further experiment results on real cancer genomic data show that most co-expressed modules are not only significantly enriched on GO/KEGG pathways, but also correspond to dense sub-networks in the prior gene interaction network. Besides, we also use our method to identify ten differentially co-expressed miRNA-gene modules by integrating matched miRNA and mRNA expression data of breast cancer from The Cancer Genome Atlas (TCGA). Several important breast cancer related miRNA-gene modules are discovered. Conclusions All the results demonstrate that SNSVD can overcome the drawbacks of SSVD and capture more biologically relevant functional modules by incorporating a prior gene interaction network. These identified functional modules may provide a new perspective to understand the diagnostics, occurrence and progression of cancer.

Download Full-text

Construction of Gene Interaction Networks from Gene Expression Data Based on Evolutionary Computation

Journal of Control Automation and Systems Engineering ◽

10.5302/j.icros.2004.10.12.1189 ◽

2004 ◽

Vol 10 (12) ◽

pp. 1189-1195

Keyword(s):

Gene Expression ◽

Evolutionary Computation ◽

Gene Expression Data ◽

Gene Interaction ◽

Interaction Networks ◽

Expression Data ◽

Gene Interaction Networks

Download Full-text

Predicting Host Immune Cell Dynamics and Key Disease-Associated Genes Using Tissue Transcriptional Profiles

Processes ◽

10.3390/pr7050301 ◽

2019 ◽

Vol 7 (5) ◽

pp. 301

Author(s):

Muying Wang ◽

Satoshi Fukuyama ◽

Yoshihiro Kawaoka ◽

Jason E. Shoemaker

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Immune Cell ◽

Mean Squared Error ◽

Expression Profiles ◽

Statistical Tests ◽

Critical Factor ◽

Expression Data ◽

Cell Dynamics ◽

Cell Counts

Motivation: Immune cell dynamics is a critical factor of disease-associated pathology (immunopathology) that also impacts the levels of mRNAs in diseased tissue. Deconvolution algorithms attempt to infer cell quantities in a tissue/organ sample based on gene expression profiles and are often evaluated using artificial, non-complex samples. Their accuracy on estimating cell counts given temporal tissue gene expression data remains not well characterized and has never been characterized when using diseased lung. Further, how to remove the effects of cell migration on transcript counts to improve discovery of disease factors is an open question. Results: Four cell count inference (i.e., deconvolution) tools are evaluated using microarray data from influenza-infected lung sampled at several time points post-infection. The analysis finds that inferred cell quantities are accurate only for select cell types and there is a tendency for algorithms to have a good relative fit (R 2 ) but a poor absolute fit (normalized mean squared error; NMSE), which suggests systemic biases exist. Nonetheless, using cell fraction estimates to adjust gene expression data, we show that genes associated with influenza virus replication and increased infection pathology are more likely to be identified as significant than when applying traditional statistical tests.

Download Full-text

Connecting gene expression data from connectivity map and in silico target predictions for small molecule mechanism-of-action analysis

Molecular BioSystems ◽

10.1039/c4mb00328d ◽

2015 ◽

Vol 11 (1) ◽

pp. 86-96 ◽

Cited By ~ 17

Author(s):

Aakash Chavan Ravindranath ◽

Nolen Perualila-Tan ◽

Adetayo Kasim ◽

Georgios Drakakis ◽

Sonia Liggi ◽

...

Keyword(s):

Gene Expression ◽

Ligand Binding ◽

Gene Expression Data ◽

Mechanism Of Action ◽

In Silico ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Expression Data ◽

Connectivity Map ◽

Action Analysis

Integrating gene expression profiles with certain proteins can improve our understanding of the fundamental mechanisms in protein–ligand binding.

Download Full-text

A Graph Feature Auto-Encoder for the prediction of unobserved node features on biological networks

BMC Bioinformatics ◽

10.1186/s12859-021-04447-3 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Ramin Hasibi ◽

Tom Michoel

Keyword(s):

Gene Expression ◽

Neural Networks ◽

Gene Expression Data ◽

Biological Networks ◽

Molecular Interaction ◽

Interaction Networks ◽

Omics Data ◽

Expression Data ◽

Molecular Interaction Networks ◽

Graph Neural Networks

Abstract Background Molecular interaction networks summarize complex biological processes as graphs, whose structure is informative of biological function at multiple scales. Simultaneously, omics technologies measure the variation or activity of genes, proteins, or metabolites across individuals or experimental conditions. Integrating the complementary viewpoints of biological networks and omics data is an important task in bioinformatics, but existing methods treat networks as discrete structures, which are intrinsically difficult to integrate with continuous node features or activity measures. Graph neural networks map graph nodes into a low-dimensional vector space representation, and can be trained to preserve both the local graph structure and the similarity between node features. Results We studied the representation of transcriptional, protein–protein and genetic interaction networks in E. coli and mouse using graph neural networks. We found that such representations explain a large proportion of variation in gene expression data, and that using gene expression data as node features improves the reconstruction of the graph from the embedding. We further proposed a new end-to-end Graph Feature Auto-Encoder framework for the prediction of node features utilizing the structure of the gene networks, which is trained on the feature prediction task, and showed that it performs better at predicting unobserved node features than regular MultiLayer Perceptrons. When applied to the problem of imputing missing data in single-cell RNAseq data, the Graph Feature Auto-Encoder utilizing our new graph convolution layer called FeatGraphConv outperformed a state-of-the-art imputation method that does not use protein interaction information, showing the benefit of integrating biological networks and omics data with our proposed approach. Conclusion Our proposed Graph Feature Auto-Encoder framework is a powerful approach for integrating and exploiting the close relation between molecular interaction networks and functional genomics data.

Download Full-text

Building Gene Networks by Analyzing Gene Expression Profiles

Advanced Methodologies and Technologies in Medicine and Healthcare - Advances in Medical Diagnosis, Treatment, and Care ◽

10.4018/978-1-5225-7489-7.ch003 ◽

2019 ◽

pp. 27-44

Author(s):

Crescenzio Gallo

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Networks ◽

Dna Microarrays ◽

Expression Profiles ◽

Expression Patterns ◽

Gene Expression Profiles ◽

Expression Data ◽

Gene Expressions ◽

Over Time

The possible applications of modeling and simulation in the field of bioinformatics are very extensive, ranging from understanding basic metabolic paths to exploring genetic variability. Experimental results carried out with DNA microarrays allow researchers to measure expression levels for thousands of genes simultaneously, across different conditions and over time. A key step in the analysis of gene expression data is the detection of groups of genes that manifest similar expression patterns. In this chapter, the authors examine various methods for analyzing gene expression data, addressing the important topics of (1) selecting the most differentially expressed genes, (2) grouping them by means of their relationships, and (3) classifying samples based on gene expressions.

Download Full-text

Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data

eLife ◽

10.7554/elife.26476 ◽

2017 ◽

Vol 6 ◽

Cited By ~ 107

Author(s):

Julien Racle ◽

Kaat de Jonge ◽

Petra Baumgaertner ◽

Daniel E Speiser ◽

David Gfeller

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Immune Cell ◽

Expression Profiles ◽

Cell Types ◽

Response To Therapy ◽

Expression Data ◽

Cell Type ◽

Tumor Gene Expression ◽

Tumor Gene

Immune cells infiltrating tumors can have important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org).

Download Full-text

ExAtlas: An interactive online tool for meta-analysis of gene expression data

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720015500195 ◽

2015 ◽

Vol 13 (06) ◽

pp. 1550019 ◽

Cited By ~ 37

Author(s):

Alexei A. Sharov ◽

David Schlessinger ◽

Minoru S. H. Ko

Keyword(s):

Gene Expression ◽

Gene Ontology ◽

Gene Expression Data ◽

Fixed Effects ◽

Expression Profiles ◽

Meta Analysis ◽

Data Sets ◽

Expression Data ◽

Gene Set ◽

Public Data

We have developed ExAtlas, an on-line software tool for meta-analysis and visualization of gene expression data. In contrast to existing software tools, ExAtlas compares multi-component data sets and generates results for all combinations (e.g. all gene expression profiles versus all Gene Ontology annotations). ExAtlas handles both users’ own data and data extracted semi-automatically from the public repository (GEO/NCBI database). ExAtlas provides a variety of tools for meta-analyses: (1) standard meta-analysis (fixed effects, random effects, z-score, and Fisher’s methods); (2) analyses of global correlations between gene expression data sets; (3) gene set enrichment; (4) gene set overlap; (5) gene association by expression profile; (6) gene specificity; and (7) statistical analysis (ANOVA, pairwise comparison, and PCA). ExAtlas produces graphical outputs, including heatmaps, scatter-plots, bar-charts, and three-dimensional images. Some of the most widely used public data sets (e.g. GNF/BioGPS, Gene Ontology, KEGG, GAD phenotypes, BrainScan, ENCODE ChIP-seq, and protein–protein interaction) are pre-loaded and can be used for functional annotations.

Download Full-text