Visualising Inconsistency and Incompleteness in RDF Gene Expression Data using FCA

Honour Chika Nwagwu

doi:10.4018/ijcssa.2014010105

Using Formal Concept Analysis to Identify Negative Correlations in Gene Expression Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2015.2443805 ◽

2016 ◽

Vol 13 (2) ◽

pp. 380-391 ◽

Cited By ~ 4

Author(s):

Xudong Tu ◽

Yuanliang Wang ◽

Maolan Zhang ◽

Jinchuan Wu

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Formal Concept Analysis ◽

Concept Analysis ◽

Formal Concept ◽

Expression Data

Download Full-text

Mining gene expression data with pattern structures in formal concept analysis

Information Sciences ◽

10.1016/j.ins.2010.07.007 ◽

2011 ◽

Vol 181 (10) ◽

pp. 1989-2001 ◽

Cited By ~ 111

Author(s):

Mehdi Kaytoue ◽

Sergei O. Kuznetsov ◽

Amedeo Napoli ◽

Sébastien Duplessis

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Formal Concept Analysis ◽

Concept Analysis ◽

Formal Concept ◽

Expression Data ◽

Pattern Structures

Download Full-text

Clustering Genes Using Heterogeneous Data Sources

International Journal of Knowledge Discovery in Bioinformatics ◽

10.4018/jkdb.2010040102 ◽

2010 ◽

Vol 1 (2) ◽

pp. 12-28 ◽

Cited By ~ 3

Author(s):

Erliang Zeng ◽

Chengyong Yang ◽

Tao Li ◽

Giri Narasimhan

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Incomplete Data ◽

Clustering Algorithm ◽

Biological Data ◽

Exploratory Analysis ◽

Data Sources ◽

Modular Organization ◽

Constrained Clustering ◽

Expression Data

Clustering of gene expression data is a standard exploratory technique used to identify closely related genes. Many other sources of data are also likely to be of great assistance in the analysis of gene expression data. This data provides a mean to begin elucidating the large-scale modular organization of the cell. The authors consider the challenging task of developing exploratory analytical techniques to deal with multiple complete and incomplete information sources. The Multi-Source Clustering (MSC) algorithm developed performs clustering with multiple, but complete, sources of data. To deal with incomplete data sources, the authors adopted the MPCK-means clustering algorithms to perform exploratory analysis on one complete source and other potentially incomplete sources provided in the form of constraints. This paper presents a new clustering algorithm MSC to perform exploratory analysis using two or more diverse but complete data sources, studies the effectiveness of constraints sets and robustness of the constrained clustering algorithm using multiple sources of incomplete biological data, and incorporates such incomplete data into constrained clustering algorithm in form of constraints sets.

Download Full-text

Clustering Genes Using Heterogeneous Data Sources

Computational Knowledge Discovery for Bioinformatics Research ◽

10.4018/978-1-4666-1785-8.ch005 ◽

2013 ◽

pp. 67-83

Author(s):

Erliang Zeng ◽

Chengyong Yang ◽

Tao Li ◽

Giri Narasimhan

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Incomplete Data ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Exploratory Analysis ◽

Data Sources ◽

Constrained Clustering ◽

Expression Data ◽

Multiple Sources

Clustering of gene expression data is a standard exploratory technique used to identify closely related genes. Many other sources of data are also likely to be of great assistance in the analysis of gene expression data. This data provides a mean to begin elucidating the large-scale modular organization of the cell. The authors consider the challenging task of developing exploratory analytical techniques to deal with multiple complete and incomplete information sources. The Multi-Source Clustering (MSC) algorithm developed performs clustering with multiple, but complete, sources of data. To deal with incomplete data sources, the authors adopted the MPCK-means clustering algorithms to perform exploratory analysis on one complete source and other potentially incomplete sources provided in the form of constraints. This paper presents a new clustering algorithm MSC to perform exploratory analysis using two or more diverse but complete data sources, studies the effectiveness of constraints sets and robustness of the constrained clustering algorithm using multiple sources of incomplete biological data, and incorporates such incomplete data into constrained clustering algorithm in form of constraints sets.

Download Full-text

Inference of Gene Regulatory Networks by Topological Prior Information and Data Integration

Biotechnology ◽

10.4018/978-1-5225-8903-7.ch010 ◽

2019 ◽

pp. 265-304

Author(s):

David Correa Martins Jr. ◽

Fabricio Martins Lopes ◽

Shubhra Sankar Ray

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Prior Information ◽

Heterogeneous Data ◽

Data Sources ◽

Expression Data ◽

Heterogeneous Data Sources ◽

Gene Regulatory

The inference of Gene Regulatory Networks (GRNs) is a very challenging problem which has attracted increasing attention since the development of high-throughput sequencing and gene expression measurement technologies. Many models and algorithms have been developed to identify GRNs using mainly gene expression profile as data source. As the gene expression data usually has limited number of samples and inherent noise, the integration of gene expression with several other sources of information can be vital for accurately inferring GRNs. For instance, some prior information about the overall topological structure of the GRN can guide inference techniques toward better results. In addition to gene expression data, recently biological information from heterogeneous data sources have been integrated by GRN inference methods as well. The objective of this chapter is to present an overview of GRN inference models and techniques with focus on incorporation of prior information such as, global and local topological features and integration of several heterogeneous data sources.

Download Full-text

Inference of Gene Regulatory Networks by Topological Prior Information and Data Integration

Advances in Medical Technologies and Clinical Practice - Emerging Research in the Analysis and Modeling of Gene Regulatory Networks ◽

10.4018/978-1-5225-0353-8.ch001 ◽

2016 ◽

pp. 1-51

Author(s):

David Correa Martins Jr. ◽

Fabricio Martins Lopes ◽

Shubhra Sankar Ray

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Gene Regulatory Networks ◽

Regulatory Networks ◽

Prior Information ◽

Heterogeneous Data ◽

Data Sources ◽

Expression Data ◽

Heterogeneous Data Sources ◽

Gene Regulatory

The inference of Gene Regulatory Networks (GRNs) is a very challenging problem which has attracted increasing attention since the development of high-throughput sequencing and gene expression measurement technologies. Many models and algorithms have been developed to identify GRNs using mainly gene expression profile as data source. As the gene expression data usually has limited number of samples and inherent noise, the integration of gene expression with several other sources of information can be vital for accurately inferring GRNs. For instance, some prior information about the overall topological structure of the GRN can guide inference techniques toward better results. In addition to gene expression data, recently biological information from heterogeneous data sources have been integrated by GRN inference methods as well. The objective of this chapter is to present an overview of GRN inference models and techniques with focus on incorporation of prior information such as, global and local topological features and integration of several heterogeneous data sources.

Download Full-text

Gene Expression Array Exploration Using $\mathcal{K}$ -Formal Concept Analysis

Formal Concept Analysis - Lecture Notes in Computer Science ◽

10.1007/978-3-642-20514-9_11 ◽

2011 ◽

pp. 119-134 ◽

Cited By ~ 1

Author(s):

José María González Calabozo ◽

Carmen Peláez-Moreno ◽

Francisco José Valverde-Albacete

Keyword(s):

Gene Expression ◽

Formal Concept Analysis ◽

Concept Analysis ◽

Formal Concept ◽

Gene Expression Array ◽

Expression Array

Download Full-text

An Algorithm for Recomputing Concepts in Microarray Data Analysis by Biological Lattice

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2013.p0761 ◽

2013 ◽

Vol 17 (5) ◽

pp. 761-771 ◽

Cited By ~ 1

Author(s):

Hidenobu Hashikami ◽

◽

Takanari Tanabata ◽

Fumiaki Hirose ◽

Nur Hasanah ◽

...

Keyword(s):

Gene Expression ◽

Microarray Data ◽

Formal Concept Analysis ◽

Concept Analysis ◽

Microarray Gene Expression Data ◽

Microarray Data Analysis ◽

Formal Concept ◽

Two Phase ◽

Microarray Gene Expression ◽

Formal Concepts

A data-analytic system is proposed for microarray gene expression data based on Formal Concept Analysis (FCA). The purpose of the system is to systematically organize data and to build a complete lattice that analyzes complex relations among genes and give biological interpretation of microarray data. In the system, formal concept analysis handles complex relations, so the microarray data is binarized by setting up a threshold. When change occurs in a conventional algorithm, formal concepts that are nodes of the lattice were calculated from the beginning, but the calculation is inefficient. This paper proposes a new algorithm that has two phase of matrix detection and updating concepts to efficiently update only altered concepts from previously generated concepts. Experiments on run time show that the algorithm takes an average of 0.94 seconds to process real microarray data containing of 43,734 genes and 6 gene expression values.

Download Full-text

Interactive Data Mining Tool for Microarray Data Analysis Using Formal Concept Analysis

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2012.p0273 ◽

2012 ◽

Vol 16 (2) ◽

pp. 273-281 ◽

Cited By ~ 1

Author(s):

Takanari Tanabata ◽

◽

Fumiaki Hirose ◽

Hidenobu Hashikami ◽

Hajime Nobuhara ◽

...

Keyword(s):

Gene Expression ◽

Formal Concept Analysis ◽

Function Analysis ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Concept Analysis ◽

Microarray Data Analysis ◽

Formal Concept ◽

Gene Expressions ◽

Gene Functions

The DNA microarray analysis can explain gene functions by measuring tens of thousands of gene expressions at once and analyzing gene expression profiles that are obtained from the measurement. However, gene expression profiles have such a vast amount of information and therefore most analyses work are done on the data narrowed down by statistical methods, there remains a possibility ofmissing out on genes that consist the factors of phenomena from their evaluations. This study propose a method based on a formal concept analysis to visualize all gene expression profiles and characteristic information that can be obtained from annotation information of each gene so that the user can overview them. In the formal concept analysis, a lattice structure that allows genes to be hierarchically classified and made viewable is built based on the inclusion relations of attributes from a context table in which gene is the object and the attributes are expression profiles and binarized characteristic information. With the proposed method, the user can change the overview state by adjusting the expression ratio and the binary state of characteristic information, understand the relational structure of gene expressions, and carry out analyses of gene functions. We develop software to practice the proposed method, and then ask a biologist to evaluate effectiveness of proposed method applied to a function analysis of genes related to blue light signaling of rice seedlings.

Download Full-text

Interactive knowledge discovery and data mining on genomic expression data with numeric formal concept analysis

BMC Bioinformatics ◽

10.1186/s12859-016-1234-z ◽

2016 ◽

Vol 17 (1) ◽

Cited By ~ 3

Author(s):

Jose M González-Calabozo ◽

Francisco J Valverde-Albacete ◽

Carmen Peláez-Moreno

Keyword(s):

Data Mining ◽

Knowledge Discovery ◽

Formal Concept Analysis ◽

Concept Analysis ◽

Formal Concept ◽

Expression Data ◽

Genomic Expression ◽

Genomic Expression Data

Download Full-text