Oligonucleotide microarray data mining: search for age-dependent gene expression

The advent of gene expression microarray technology enables the simultaneous measurement of expression levels for thousands or tens of thousands of genes in a single experiment (Schena, et al., 1995). Analysis of gene expression microarray data presents unprecedented opportunities and challenges for data mining in areas such as gene clustering (Eisen, et al., 1998; Tamayo, et al., 1999), sample clustering and class discovery (Alon, et al., 1999; Golub, et al., 1999), sample class prediction (Golub, et al., 1999; Wu, et al., 2003), and gene selection (Xing, Jordan, & Karp, 2001; Yu & Liu, 2004). This article introduces the basic concepts of gene expression microarray data and describes relevant data-mining tasks. It briefly reviews the state-of-the-art methods for each data-mining task and identifies emerging challenges and future research directions in microarray data analysis.

Download Full-text

Data Mining and Meta-Analysis on DNA Microarray Data

International Journal of Systems Biology and Biomedical Technologies ◽

10.4018/ijsbbt.2012070101 ◽

2012 ◽

Vol 1 (3) ◽

pp. 1-39

Author(s):

Triantafyllos Paparountas ◽

Maria Nefeli Nikolaidou-Katsaridou ◽

Gabriella Rustici ◽

Vasilis Aidinis

Keyword(s):

Gene Expression ◽

Data Mining ◽

Microarray Data ◽

Meta Analysis ◽

Biological Significance ◽

Biological Information ◽

Expression Arrays ◽

Dna Microarray Data ◽

Gene Expression Arrays ◽

Expression Genetics

Microarray technology enables high-throughput parallel gene expression analysis, and use has grown exponentially thanks to the development of a variety of applications for expression, genetics and epigenetic studies. A wealth of data is now available from public repositories, providing unprecedented opportunities for meta-analysis approaches, which could generate new biological information, unrelated to the original scope of individual studies. This study provides a guideline for identification of biological significance of the statistically-selected differentially-expressed genes derived from gene expression arrays as well as to suggest further analysis pathways. The authors review the prerequisites for data-mining and meta-analysis, summarize the conceptual methods to derive biological information from microarray data and suggest software for each category of data mining or meta-analysis.

Download Full-text

Microarray Data Mining

Knowledge Discovery Practices and Emerging Applications of Data Mining - Advances in Data Mining and Database Management ◽

10.4018/978-1-60960-067-9.ch002 ◽

2010 ◽

pp. 23-47

Author(s):

Giulia Bruno ◽

Alessandro Fiori

Keyword(s):

Gene Expression ◽

Data Mining ◽

Microarray Data ◽

Regulatory Networks ◽

Single Experiment ◽

Classification Feature ◽

Building Models ◽

Critical Issues ◽

Highly Correlated ◽

Genetic Profiles

Microarray technology is a powerful tool to analyze thousands of gene expression values with a single experiment. Due to the huge amount of data, most of recent studies are focused on the analysis and the extraction of useful and interesting information from microarray data. Examples of applications include detecting genes highly correlated to diseases, selecting genes which show a similar behavior under specific conditions, building models to predict the disease outcome based on genetic profiles, and inferring regulatory networks. This chapter presents a review of four popular data mining techniques (i.e., Classification, Feature Selection, Clustering and Association Rule Mining) applied to microarray data. It describes the main characteristics of microarray data in order to understand the critical issues which are introduced by gene expression values analysis. Each technique is analyzed and examples of pertinent literature are reported. Finally, prospects of data mining research on microarray data are provided.

Download Full-text

Text Mining Perspectives in Microarray Data Mining

ISRN Computational Biology ◽

10.1155/2013/159135 ◽

2013 ◽

Vol 2013 ◽

pp. 1-5 ◽

Cited By ~ 1

Author(s):

Jeyakumar Natarajan

Keyword(s):

Gene Expression ◽

Data Mining ◽

Text Mining ◽

Gene Expression Data ◽

Microarray Data ◽

Machine Learning Algorithms ◽

Microarray Data Analysis ◽

Expression Data ◽

Related Data ◽

Mining Methods

Current microarray data mining methods such as clustering, classification, and association analysis heavily rely on statistical and machine learning algorithms for analysis of large sets of gene expression data. In recent years, there has been a growing interest in methods that attempt to discover patterns based on multiple but related data sources. Gene expression data and the corresponding literature data are one such example. This paper suggests a new approach to microarray data mining as a combination of text mining (TM) and information extraction (IE). TM is concerned with identifying patterns in natural language text and IE is concerned with locating specific entities, relations, and facts in text. The present paper surveys the state of the art of data mining methods for microarray data analysis. We show the limitations of current microarray data mining methods and outline how text mining could address these limitations.

Download Full-text

Knowledge discovery in gene-expression-microarray data: mining the information output of the genome

Trends in Biotechnology ◽

10.1016/s0167-7799(99)01359-1 ◽

1999 ◽

Vol 17 (11) ◽

pp. 429-436 ◽

Cited By ~ 45

Author(s):

Gary Zweiger

Keyword(s):

Gene Expression ◽

Data Mining ◽

Knowledge Discovery ◽

Microarray Data ◽

Gene Expression Microarray ◽

Expression Microarray ◽

Gene Expression Microarray Data

Download Full-text

Information Extraction from Microarray Data

Journal of Database Management ◽

10.4018/jdm.2014010102 ◽

2014 ◽

Vol 25 (1) ◽

pp. 29-58 ◽

Cited By ~ 2

Author(s):

Alessandro Fiori ◽

Alberto Grand ◽

Giulia Bruno ◽

Francesco Gavino Brundu ◽

Domenico Schioppa ◽

...

Keyword(s):

Gene Expression ◽

Data Mining ◽

Microarray Data ◽

Regulatory Networks ◽

Molecular Data ◽

Experimental Conditions ◽

Single Experiment ◽

Building Models ◽

Critical Issues ◽

Highly Correlated

Nowadays, a huge amount of high throughput molecular data are available for analysis and provide novel and useful insights into complex biological systems, through the acquisition of a high-resolution picture of their molecular status in defined experimental conditions. In this context, microarrays are a powerful tool to analyze thousands of gene expression values with a single experiment. A number of approaches have been developed to detecting genes highly correlated to diseases, selecting genes that exhibit a similar behavior under specific conditions, building models to predict disease outcome based on genetic profiles, and inferring regulatory networks. This paper discusses popular and recent data mining techniques (i.e., Feature Selection, Clustering, Classification, and Association Rule Mining) applied to microarray data. The main characteristics of microarray data and preprocessing procedures are presented to understand the critical issues introduced by gene expression values analysis. Each technique is analyzed, and relevant examples of pertinent literature are reported. Moreover, real use cases exploiting analytic pipelines that use these methods are also introduced. Finally, future directions of data mining research on microarray data are envisioned.

Download Full-text

Information Extraction from Microarray Data

Business Intelligence ◽

10.4018/978-1-4666-9562-7.ch060 ◽

2016 ◽

pp. 1180-1211 ◽

Cited By ~ 1

Author(s):

Alessandro Fiori ◽

Alberto Grand ◽

Giulia Bruno ◽

Francesco Gavino Brundu ◽

Domenico Schioppa ◽

...

Keyword(s):

Gene Expression ◽

Data Mining ◽

Microarray Data ◽

Regulatory Networks ◽

Molecular Data ◽

Experimental Conditions ◽

Single Experiment ◽

Building Models ◽

Critical Issues ◽

Highly Correlated

Nowadays, a huge amount of high throughput molecular data are available for analysis and provide novel and useful insights into complex biological systems, through the acquisition of a high-resolution picture of their molecular status in defined experimental conditions. In this context, microarrays are a powerful tool to analyze thousands of gene expression values with a single experiment. A number of approaches have been developed to detecting genes highly correlated to diseases, selecting genes that exhibit a similar behavior under specific conditions, building models to predict disease outcome based on genetic profiles, and inferring regulatory networks. This paper discusses popular and recent data mining techniques (i.e., Feature Selection, Clustering, Classification, and Association Rule Mining) applied to microarray data. The main characteristics of microarray data and preprocessing procedures are presented to understand the critical issues introduced by gene expression values analysis. Each technique is analyzed, and relevant examples of pertinent literature are reported. Moreover, real use cases exploiting analytic pipelines that use these methods are also introduced. Finally, future directions of data mining research on microarray data are envisioned.

Download Full-text

Data Mining and Meta-Analysis on DNA Microarray Data

Bioinformatics ◽

10.4018/978-1-4666-3604-0.ch062 ◽

2013 ◽

pp. 1196-1236

Author(s):

Triantafyllos Paparountas ◽

Maria Nefeli Nikolaidou-Katsaridou ◽

Gabriella Rustici ◽

Vasilis Aidinis

Keyword(s):

Gene Expression ◽

Data Mining ◽

Microarray Data ◽

Meta Analysis ◽

Biological Significance ◽

Biological Information ◽

Dna Microarray Data ◽

Gene Expression Arrays ◽

Expression Genetics ◽

Public Repositories

Microarray technology enables high-throughput parallel gene expression analysis, and use has grown exponentially thanks to the development of a variety of applications for expression, genetics and epigenetic studies. A wealth of data is now available from public repositories, providing unprecedented opportunities for meta-analysis approaches, which could generate new biological information, unrelated to the original scope of individual studies. This study provides a guideline for identification of biological significance of the statistically-selected differentially-expressed genes derived from gene expression arrays as well as to suggest further analysis pathways. The authors review the prerequisites for data-mining and meta-analysis, summarize the conceptual methods to derive biological information from microarray data and suggest software for each category of data mining or meta-analysis.

Download Full-text