Increasing Power by Sharing Information from Genetic Background and Treatment in Clustering of Gene Expression Time Series

Clustering of gene expression time series gives insight into which genes may be co-regulated, allowing us to discern the activity of pathways in a given microarray experiment. Of particular interest is how a given group of genes varies with different conditions or genetic background. This paper develops a new clustering method that allows each cluster to be parameterised according to whether the behaviour of the genes across conditions is correlated or anti-correlated. By specifying correlation between such genes,more information is gain within the cluster about how the genes interrelate. Amyotrophic lateral sclerosis (ALS) is an irreversible neurodegenerative disorder that kills the motor neurons and results in death within 2 to 3 years from the symptom onset. Speed of progression for different patients are heterogeneous with significant variability. The SOD1G93A transgenic mice from different backgrounds (129Sv and C57) showed consistent phenotypic differences for disease progression. A hierarchy of Gaussian isused processes to model condition-specific and gene-specific temporal co-variances. This study demonstrated about finding some significant gene expression profiles and clusters of associated or co-regulated gene expressions together from four groups of data (SOD1G93A and Ntg from 129Sv and C57 backgrounds). Our study shows the effectiveness of sharing information between replicates and different model conditions when modelling gene expression time series. Further gene enrichment score analysis and ontology pathway analysis of some specified clusters for a particular group may lead toward identifying features underlying the differential speed of disease progression.

Download Full-text

TWO-PASS IMPUTATION ALGORITHM FOR MISSING VALUE ESTIMATION IN GENE EXPRESSION TIME SERIES

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720007003053 ◽

2007 ◽

Vol 05 (05) ◽

pp. 1005-1022 ◽

Cited By ~ 20

Author(s):

ELENA TSIPORKOVA ◽

VESELKA BOEVA

Keyword(s):

Gene Expression ◽

Time Series ◽

Missing Values ◽

Time Series Data ◽

Expression Profiles ◽

Series Data ◽

Gene Expression Time Series ◽

Value Estimation ◽

Missing Value Estimation ◽

Expression Time

Gene expression microarray experiments frequently generate datasets with multiple values missing. However, most of the analysis, mining, and classification methods for gene expression data require a complete matrix of gene array values. Therefore, the accurate estimation of missing values in such datasets has been recognized as an important issue, and several imputation algorithms have already been proposed to the biological community. Most of these approaches, however, are not particularly suitable for time series expression profiles. In view of this, we propose a novel imputation algorithm, which is specially suited for the estimation of missing values in gene expression time series data. The algorithm utilizes Dynamic Time Warping (DTW) distance in order to measure the similarity between time expression profiles, and subsequently selects for each gene expression profile with missing values a dedicated set of candidate profiles for estimation. Three different DTW-based imputation (DTWimpute) algorithms have been considered: position-wise, neighborhood-wise, and two-pass imputation. These have initially been prototyped in Perl, and their accuracy has been evaluated on yeast expression time series data using several different parameter settings. The experiments have shown that the two-pass algorithm consistently outperforms, in particular for datasets with a higher level of missing entries, the neighborhood-wise and the position-wise algorithms. The performance of the two-pass DTWimpute algorithm has further been benchmarked against the weighted K-Nearest Neighbors algorithm, which is widely used in the biological community; the former algorithm has appeared superior to the latter one. Motivated by these findings, indicating clearly the added value of the DTW techniques for missing value estimation in time series data, we have built an optimized C++ implementation of the two-pass DTWimpute algorithm. The software also provides for a choice between three different initial rough imputation methods.

Download Full-text

An Integrative DTW-based imputation method for gene expression time series data

2012 6th IEEE INTERNATIONAL CONFERENCE INTELLIGENT SYSTEMS ◽

10.1109/is.2012.6335145 ◽

2012 ◽

Cited By ~ 3

Author(s):

Elena Kostadinova ◽

Veselka Boeva ◽

Liliana Boneva ◽

Elena Tsiporkova

Keyword(s):

Gene Expression ◽

Time Series ◽

Time Series Data ◽

Imputation Method ◽

Series Data ◽

Gene Expression Time Series ◽

Expression Time

Download Full-text

GeneShelf: A Web-based Visual Interface for Large Gene Expression Time-Series Data Repositories

IEEE Transactions on Visualization and Computer Graphics ◽

10.1109/tvcg.2009.146 ◽

2009 ◽

Vol 15 (6) ◽

pp. 905-912 ◽

Cited By ~ 9

Author(s):

Bohyoung Kim ◽

Bongshin Lee ◽

S. Knoblach ◽

E. Hoffman ◽

Jinwook Seo

Keyword(s):

Gene Expression ◽

Time Series ◽

Time Series Data ◽

Series Data ◽

Data Repositories ◽

Web Based ◽

Large Gene ◽

Gene Expression Time Series ◽

Visual Interface ◽

Expression Time

Download Full-text

Parallel e-CCC-Biclustering: Mining Approximate Temporal Patterns in Gene Expression Time Series Using Parallel Biclustering

Advances in Intelligent and Soft Computing - 6th International Conference on Practical Applications of Computational Biology & Bioinformatics ◽

10.1007/978-3-642-28839-5_3 ◽

2012 ◽

pp. 21-31 ◽

Cited By ~ 1

Author(s):

Filipe Cristóvão ◽

Sara C. Madeira

Keyword(s):

Gene Expression ◽

Time Series ◽

Temporal Patterns ◽

Gene Expression Time Series ◽

Expression Time

Download Full-text

A Tutorial to Identify Nonlinear Associations in Gene Expression Time Series Data

Transcription Factor Regulatory Networks - Methods in Molecular Biology ◽

10.1007/978-1-4939-0805-9_8 ◽

2014 ◽

pp. 87-95

Author(s):

André Fujita ◽

Satoru Miyano

Keyword(s):

Gene Expression ◽

Time Series ◽

Time Series Data ◽

Series Data ◽

Gene Expression Time Series ◽

Nonlinear Associations ◽

Expression Time

Download Full-text

Estimation and inversion of the effects of cell population asynchrony in gene expression time-series

Signal Processing ◽

10.1016/s0165-1684(02)00471-1 ◽

2003 ◽

Vol 83 (4) ◽

pp. 835-858 ◽

Cited By ~ 4

Author(s):

Harri Lähdesmäki ◽

Heikki Huttunen ◽

Tommi Aho ◽

Marja-Leena Linne ◽

Jari Niemi ◽

...

Keyword(s):

Gene Expression ◽

Time Series ◽

Cell Population ◽

Gene Expression Time Series ◽

And Inversion ◽

Expression Time

Download Full-text

BTW: a web server for Boltzmann time warping of gene expression time series

Nucleic Acids Research ◽

10.1093/nar/gkl162 ◽

2006 ◽

Vol 34 (Web Server) ◽

pp. W482-W485 ◽

Cited By ~ 3

Author(s):

F. Ferre ◽

P. Clote

Keyword(s):

Gene Expression ◽

Time Series ◽

Web Server ◽

Time Warping ◽

Gene Expression Time Series ◽

Expression Time

Download Full-text

Stochastic Dynamic Modeling of Short Gene Expression Time-Series Data

IEEE Transactions on NanoBioscience ◽

10.1109/tnb.2008.2000149 ◽

2008 ◽

Vol 7 (1) ◽

pp. 44-55 ◽

Cited By ~ 55

Author(s):

Zidong Wang* ◽

Fuwen Yang ◽

Daniel W. C. Ho ◽

Stephen Swift ◽

Allan Tucker ◽

...

Keyword(s):

Gene Expression ◽

Time Series ◽

Dynamic Modeling ◽

Time Series Data ◽

Series Data ◽

Stochastic Dynamic ◽

Gene Expression Time Series ◽

Expression Time

Download Full-text

NETWORKS FROM GENE EXPRESSION TIME SERIES: CHARACTERIZATION OF CORRELATION PATTERNS

International Journal of Bifurcation and Chaos ◽

10.1142/s0218127407018543 ◽

2007 ◽

Vol 17 (07) ◽

pp. 2477-2483 ◽

Cited By ~ 4

Author(s):

D. REMONDINI ◽

N. NERETTI ◽

C. FRANCESCHI ◽

P. TIERI ◽

J. M. SEDIVY ◽

...

Keyword(s):

Gene Expression ◽

Time Series ◽

Large Scale ◽

Molecular Mechanisms ◽

Single Gene ◽

Biological Information ◽

Reconstruction Method ◽

Gene Expression Time Series ◽

Expression Time

We address the problem of finding large-scale functional and structural relationships between genes, given a time series of gene expression data, namely mRNA concentration values measured from genetically engineered rat fibroblasts cell lines responding to conditional cMyc proto-oncogene activation. We show how it is possible to retrieve suitable information about molecular mechanisms governing the cell response to conditional perturbations. This task is complex because typical high-throughput genomics experiments are performed with high number of probesets (103–104 genes) and a limited number of observations (< 102 time points). In this paper, we develop a deepest analysis of our previous work [Remondini et al., 2005] in which we characterized some of the main features of a gene-gene interaction network reconstructed from temporal correlation of gene expression time series. One first advancement is based on the comparison of the reconstructed network with networks obtained from randomly generated data, in order to characterize which features retrieve real biological information, and which are instead due to the characteristics of the network reconstruction method. The second and perhaps more relevant advancement is the characterization of the global change in co-expression pattern following cMyc activation as compared to a basal unperturbed state. We propose an analogy with a physical system in a critical state close to a phase transition (e.g. Potts ferromagnet), since the cell responds to the stimulus with high susceptibility, such that a single gene activation propagates to almost the entire genome. Our result is relative to temporal properties of gene network dynamics, and there are experimental evidence that this can be related to spatial properties regarding the global organization of chromatine structure [Knoepfler et al., 2006].

Download Full-text

Gene Time E pression Warper: a tool for alignment, template matching and visualization of gene expression time series

Bioinformatics ◽

10.1093/bioinformatics/bti787 ◽

2005 ◽

Vol 22 (2) ◽

pp. 251-252 ◽

Cited By ~ 21

Author(s):

J. Criel ◽

E. Tsiporkova

Keyword(s):

Gene Expression ◽

Time Series ◽

Template Matching ◽

Gene Expression Time Series ◽

Expression Time

Download Full-text