Classification of micro-array gene expression data using neural networks

Abstract: Convolutional neural networks (CNNs) represent a major breakthrough in image classification. However, there has not been similar progress in applying CNNs, or neural networks of any kind, to classification of tabular data. We developed and evaluated a novel method, TAbular Convolution (TAC), for classification of such data using CNNs by transforming tabular data to images and then classifying the images using CNNs. The transformation is performed by treating each row of tabular data (i.e., vector of features) as an image filter (kernel), and applying the filter to a fixed base image. A CNN is then trained to classify the filtered images. We applied TAC to classification of gene expression data derived from blood samples of patients with bacterial or viral infections. Our results demonstrate that off-the-shelf ResNet can classify the gene expression data as accurately as the current non-CNN state-of-the-art classifiers.
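
The row-to-image transformation described in the abstract can be illustrated with a minimal sketch. The abstract does not say how a feature vector is shaped into a kernel, which base image is used, or whether any normalization is applied, so everything here is an assumption made for illustration: the helper names `row_to_kernel`, `apply_kernel`, and `tabular_to_images` are hypothetical, rows are zero-padded into the smallest square kernel that fits, and the filter is applied as a plain "valid" cross-correlation. The CNN training step is left out.

```python
import math


def row_to_kernel(row):
    """Zero-pad a feature vector and reshape it into a square kernel.

    Assumption: the abstract does not specify the kernel shape, so the
    smallest square that holds all features is used here.
    """
    n = len(row)
    if n == 0:
        raise ValueError("row must contain at least one feature")
    side = math.isqrt(n - 1) + 1  # ceil(sqrt(n)) for n >= 1
    padded = list(row) + [0.0] * (side * side - n)
    return [padded[i * side:(i + 1) * side] for i in range(side)]


def apply_kernel(image, kernel):
    """Apply a square kernel to a 2D image ('valid' cross-correlation)."""
    k = len(kernel)
    height, width = len(image), len(image[0])
    if height < k or width < k:
        raise ValueError("base image is smaller than the kernel")
    filtered = []
    for i in range(height - k + 1):
        out_row = []
        for j in range(width - k + 1):
            # Weighted sum of the k x k patch whose top-left corner is (i, j).
            out_row.append(sum(kernel[a][b] * image[i + a][j + b]
                               for a in range(k) for b in range(k)))
        filtered.append(out_row)
    return filtered


def tabular_to_images(rows, base_image):
    """Turn every table row into one filtered copy of the same base image."""
    return [apply_kernel(base_image, row_to_kernel(row)) for row in rows]


# Hypothetical toy data: two samples with four features each, 4x4 base image.
base_image = [[r * 4 + c for c in range(4)] for r in range(4)]
images = tabular_to_images([[1, 0, 0, 0], [0, 0, 0, 1]], base_image)
```

Each element of `images` is a 3x3 grid that could then be passed, with its class label, to an image classifier such as a ResNet.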

Attribute Selection and Classification of Prostate Cancer Gene Expression Data Using Artificial Neural Networks

Lecture Notes in Computer Science - Trends and Applications in Knowledge Discovery and Data Mining

DOI: 10.1007/978-3-319-42996-0_3

2016, pp. 26-34

Cited By ~ 1

Author(s): Sreenivas Sremath Tirumala, A. Narayanan

Keyword(s): Gene Expression, Prostate Cancer, Neural Networks, Artificial Neural Networks, Gene Expression Data, Attribute Selection, Cancer Gene, Expression Data, Artificial Neural

A class imbalance-aware Relief algorithm for the classification of tumors using microarray gene expression data

Computational Biology and Chemistry

DOI: 10.1016/j.compbiolchem.2019.03.017

2019, Vol 80, pp. 121-127

Cited By ~ 3

Author(s): Yuanyu He, Junhai Zhou, Yaping Lin, Tuanfei Zhu

Keyword(s): Gene Expression, Gene Expression Data, Class Imbalance, Microarray Gene Expression Data, Expression Data, Microarray Gene Expression, Relief Algorithm, Classification Of Tumors, Microarray Gene

Improving the Performance of Principal Components for Classification of Gene Expression Data Through Feature Selection

Studies in Classification, Data Analysis, and Knowledge Organization - Data Science and Classification

DOI: 10.1007/3-540-34416-0_35

2006, pp. 325-332

Author(s): Edgar Acuña, Jaime Porras

Keyword(s): Gene Expression, Feature Selection, Gene Expression Data, Principal Components, Expression Data

An Effective Classification Model for Cancer Diagnosis Using Micro Array Gene Expression Data

2009 International Conference on Computer Engineering and Technology

DOI: 10.1109/iccet.2009.38

2009

Cited By ~ 1

Author(s): V. Saravanan, R. Mallika

Keyword(s): Gene Expression, Gene Expression Data, Cancer Diagnosis, Classification Model, Expression Data, Micro Array

Classification of Microarray Gene Expression Data by MultiBlock Dimension Reduction

Communications for Statistical Applications and Methods

DOI: 10.5351/ckss.2006.13.3.567

2006, Vol 13 (3), pp. 567-576

Author(s): Mi-Ra Oh, Seo-Young Kim, Kyung-Sook Kim, Jang-Sun Baek, Young-Sook Son

Keyword(s): Gene Expression, Dimension Reduction, Gene Expression Data, Microarray Gene Expression Data, Expression Data, Microarray Gene Expression, Microarray Gene

A Graph Feature Auto-Encoder for the prediction of unobserved node features on biological networks

BMC Bioinformatics

DOI: 10.1186/s12859-021-04447-3

2021, Vol 22 (1)

Author(s): Ramin Hasibi, Tom Michoel

Keyword(s): Gene Expression, Neural Networks, Gene Expression Data, Biological Networks, Molecular Interaction, Interaction Networks, Omics Data, Expression Data, Molecular Interaction Networks, Graph Neural Networks

Abstract

Background: Molecular interaction networks summarize complex biological processes as graphs, whose structure is informative of biological function at multiple scales. Simultaneously, omics technologies measure the variation or activity of genes, proteins, or metabolites across individuals or experimental conditions. Integrating the complementary viewpoints of biological networks and omics data is an important task in bioinformatics, but existing methods treat networks as discrete structures, which are intrinsically difficult to integrate with continuous node features or activity measures. Graph neural networks map graph nodes into a low-dimensional vector space representation, and can be trained to preserve both the local graph structure and the similarity between node features.

Results: We studied the representation of transcriptional, protein–protein and genetic interaction networks in E. coli and mouse using graph neural networks. We found that such representations explain a large proportion of variation in gene expression data, and that using gene expression data as node features improves the reconstruction of the graph from the embedding. We further proposed a new end-to-end Graph Feature Auto-Encoder framework for the prediction of node features utilizing the structure of the gene networks, which is trained on the feature prediction task, and showed that it performs better at predicting unobserved node features than regular MultiLayer Perceptrons. When applied to the problem of imputing missing data in single-cell RNAseq data, the Graph Feature Auto-Encoder utilizing our new graph convolution layer called FeatGraphConv outperformed a state-of-the-art imputation method that does not use protein interaction information, showing the benefit of integrating biological networks and omics data with our proposed approach.

Conclusion: Our proposed Graph Feature Auto-Encoder framework is a powerful approach for integrating and exploiting the close relation between molecular interaction networks and functional genomics data.
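
The idea of using network structure to inform node features can be illustrated with a generic neighbour-averaging step, the unweighted core of many graph convolution layers. This is not the FeatGraphConv layer from the paper, whose definition is not given in the abstract; the function name `mean_aggregate`, the undirected edge list, and the toy values are all assumptions, and no learned weights are included.

```python
def mean_aggregate(features, edges):
    """One propagation step over an undirected graph.

    Each node's new feature vector is the mean of its own vector and the
    vectors of its neighbours (a self-loop is always included).
    """
    n = len(features)
    neighbours = [{i} for i in range(n)]  # start with the self-loop
    for u, v in edges:
        neighbours[u].add(v)
        neighbours[v].add(u)
    dim = len(features[0])
    return [
        [sum(features[j][d] for j in nbrs) / len(nbrs) for d in range(dim)]
        for nbrs in neighbours
    ]


# Hypothetical toy network: three genes on a path 0 - 1 - 2,
# each with a single expression value as its node feature.
smoothed = mean_aggregate([[1.0], [3.0], [5.0]], [(0, 1), (1, 2)])
```

In a trained model, a step like this would be combined with learned weight matrices and a non-linearity, and the output would be compared against held-out node features.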
