Probability Based Most Informative Gene Selection From Microarray Data

Microarray datasets have a wide application in bioinformatics research. Analysis to measure the expression level of thousands of genes of this kind of high-throughput data can help for finding the cause and subsequent treatment of any disease. There are many techniques in gene analysis to extract biologically relevant information from inconsistent and ambiguous data. In this paper, the concepts of functional dependency and closure of an attribute of database technology are used for finding the most important set of genes for cancer detection. Firstly, the method computes similarity factor between each pair of genes. Based on the similarity factors a set of gene dependency is formed from which closure set is obtained. Subsequently, conditional probability based interestingness measurements are used to determine the most informative gene for disease classification. The proposed method is applied on some publicly available cancerous gene expression dataset. The result shows the effectiveness and robustness of the algorithm.

Download Full-text

A phase diagram for gene selection and disease classification

10.1101/002360 ◽

2014 ◽

Author(s):

Hong-Dong Li ◽

Qing-Song Xu ◽

Yi-Zeng Liang

Keyword(s):

Phase Diagram ◽

Gene Selection ◽

Population Analysis ◽

Predictive Ability ◽

Disease Diagnosis ◽

Disease Classification ◽

Small Subset ◽

Analysis Framework ◽

Source Codes ◽

Microarray Datasets

Identifying a small subset of discriminate genes is important for predicting clinical outcomes and facilitating disease diagnosis. Based on the model population analysis framework, we present a method, called PHADIA, which is able to output a phase diagram displaying the predictive ability of each variable, which provides an intuitive way for selecting informative variables. Using two publicly available microarray datasets, its demonstrated that our method can selects a few informative genes and achieves significantly better or comparable classification accuracy compared to the reported results in the literature. The source codes are freely available at: www.libpls.net.

Download Full-text

Incremental Search for Informative Gene Selection in Cancer Classification

Annals of Emerging Technologies in Computing ◽

10.33166/aetic.2021.02.002 ◽

2021 ◽

Vol 5 (2) ◽

pp. 15-21

Author(s):

Fathima Fajila ◽

Yuhanis Yusof

Keyword(s):

Data Analysis ◽

Microarray Data ◽

Gene Selection ◽

Subset Selection ◽

Cancer Classification ◽

Microarray Data Analysis ◽

Informative Gene ◽

Incremental Search ◽

Selection Approach ◽

Microarray Datasets

Although numerous methods of using microarray data analysis for classification have been reported, there is space in the field of cancer classification for new inventions in terms of informative gene selection. This study introduces a new incremental search-based gene selection approach for cancer classification. The strength of wrappers in determining relevant genes in a gene pool can be increased as they evaluate each possible gene’s subset. Nevertheless, the searching algorithms play a major role in gene’s subset selection. Hence, there is the possibility of finding more informative genes with incremental application. Thus, we introduce an approach which utilizes two searching algorithms in gene’s subset selection. The approach was efficient enough to classify five out of six microarray datasets with 100% accuracy using only a few biomarkers while the rest classified with only one misclassification.

Download Full-text

NoRCE: non-coding RNA sets cis enrichment tool

BMC Bioinformatics ◽

10.1186/s12859-021-04112-9 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Gulden Olgun ◽

Afshan Nabi ◽

Oznur Tastan

Keyword(s):

Expression Patterns ◽

Target Prediction ◽

Enrichment Analysis ◽

Fruit Fly ◽

Relevant Information ◽

R Package ◽

Data Repository ◽

Biologically Relevant ◽

Gene Sets ◽

Data Files

Abstract Background While some non-coding RNAs (ncRNAs) are assigned critical regulatory roles, most remain functionally uncharacterized. This presents a challenge whenever an interesting set of ncRNAs needs to be analyzed in a functional context. Transcripts located close-by on the genome are often regulated together. This genomic proximity on the sequence can hint at a functional association. Results We present a tool, NoRCE, that performs cis enrichment analysis for a given set of ncRNAs. Enrichment is carried out using the functional annotations of the coding genes located proximal to the input ncRNAs. Other biologically relevant information such as topologically associating domain (TAD) boundaries, co-expression patterns, and miRNA target prediction information can be incorporated to conduct a richer enrichment analysis. To this end, NoRCE includes several relevant datasets as part of its data repository, including cell-line specific TAD boundaries, functional gene sets, and expression data for coding & ncRNAs specific to cancer. Additionally, the users can utilize custom data files in their investigation. Enrichment results can be retrieved in a tabular format or visualized in several different ways. NoRCE is currently available for the following species: human, mouse, rat, zebrafish, fruit fly, worm, and yeast. Conclusions NoRCE is a platform-independent, user-friendly, comprehensive R package that can be used to gain insight into the functional importance of a list of ncRNAs of any type. The tool offers flexibility to conduct the users’ preferred set of analyses by designing their own pipeline of analysis. NoRCE is available in Bioconductor and https://github.com/guldenolgun/NoRCE.

Download Full-text

Gene Selection and Classification in Microarray Datasets using a Hybrid Approach of PCC-BPSO/GA with Multi Classifiers

Journal of Computer Science ◽

10.3844/jcssp.2018.868.880 ◽

2018 ◽

Vol 14 (6) ◽

pp. 868-880 ◽

Cited By ~ 3

Author(s):

Shilan S. Hameed ◽

Fahmi F. Muhammad ◽

Rohayanti Hassan ◽

Faisal Saeed

Keyword(s):

Gene Selection ◽

Hybrid Approach ◽

Microarray Datasets

Download Full-text

Gene Selection in Cancer Classification Using Sparse Logistic Regression with L1/2 Regularization

Applied Sciences ◽

10.3390/app8091569 ◽

2018 ◽

Vol 8 (9) ◽

pp. 1569 ◽

Cited By ~ 3

Author(s):

Shengbing Wu ◽

Hongkun Jiang ◽

Haiwei Shen ◽

Ziyi Yang

Keyword(s):

Logistic Regression ◽

Gene Selection ◽

Classification Performance ◽

Cancer Classification ◽

Sparse Logistic Regression ◽

The Subject ◽

Selection For ◽

Microarray Datasets ◽

Sparse Methods

In recent years, gene selection for cancer classification based on the expression of a small number of gene biomarkers has been the subject of much research in genetics and molecular biology. The successful identification of gene biomarkers will help in the classification of different types of cancer and improve the prediction accuracy. Recently, regularized logistic regression using the L 1 regularization has been successfully applied in high-dimensional cancer classification to tackle both the estimation of gene coefficients and the simultaneous performance of gene selection. However, the L 1 has a biased gene selection and dose not have the oracle property. To address these problems, we investigate L 1 / 2 regularized logistic regression for gene selection in cancer classification. Experimental results on three DNA microarray datasets demonstrate that our proposed method outperforms other commonly used sparse methods ( L 1 and L E N ) in terms of classification performance.

Download Full-text

Noise matters: elephants show risk-avoidance behaviour in response to human-generated seismic cues

Proceedings of The Royal Society B Biological Sciences ◽

10.1098/rspb.2021.0774 ◽

2021 ◽

Vol 288 (1953) ◽

pp. 20210774

Author(s):

Beth Mortimer ◽

James A. Walker ◽

David S. Lolchuragi ◽

Michael Reinwald ◽

David Daballen

Keyword(s):

Seismic Noise ◽

Information Transfer ◽

Conservation Management ◽

Sensory Ecology ◽

Loxodonta Africana ◽

Relevant Information ◽

African Elephants ◽

Risk Avoidance ◽

Biologically Relevant ◽

Seismic Vibrations

African elephants ( Loxodonta africana ) use many sensory modes to gather information about their environment, including the detection of seismic, or ground-based, vibrations. Seismic information is known to include elephant-generated signals, but also potentially encompasses biotic cues that are commonly referred to as ‘noise’. To investigate seismic information transfer in elephants beyond communication, here we tested the hypothesis that wild elephants detect and discriminate between seismic vibrations that differ in their noise types, whether elephant- or human-generated. We played three types of seismic vibrations to elephants: seismic recordings of elephants (elephant-generated), white noise (human-generated) and a combined track (elephant- and human-generated). We found evidence of both detection of seismic noise and discrimination between the two treatments containing human-generated noise. In particular, we found evidence of retreat behaviour, where seismic tracks with human-generated noise caused elephants to move further away from the trial location. We conclude that seismic noise are cues that contain biologically relevant information for elephants that they can associate with risk. This expands our understanding of how elephants use seismic information, with implications for elephant sensory ecology and conservation management.

Download Full-text

A Multi-population χ 2 Test Approach to Informative Gene Selection

Lecture Notes in Computer Science - Intelligent Data Engineering and Automated Learning - IDEAL 2005 ◽

10.1007/11508069_53 ◽

2005 ◽

pp. 406-413 ◽

Cited By ~ 2

Author(s):

Jun Luo ◽

Jinwen Ma

Keyword(s):

Gene Selection ◽

Informative Gene

Download Full-text

Principal component analysis in metabolomics: from multidimensional data toward biologically relevant information

Identification and Data Processing Methods in Metabolomics ◽

10.4155/fseb2013.14.149 ◽

2015 ◽

pp. 82-95 ◽

Cited By ~ 1

Author(s):

Renata Bujak ◽

Michal Jan Markuszewski

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Relevant Information ◽

Component Analysis ◽

Multidimensional Data ◽

Biologically Relevant

Download Full-text

Collecting biologically relevant information: DNA to population density.

APA handbook of comparative psychology: Basic concepts, methods, neural substrate, and behavior. ◽

10.1037/0000011-005 ◽

2017 ◽

pp. 87-113

Author(s):

Tobias Deschner ◽

Mimi Arandjelovic ◽

Hjalmar S. Kühl

Keyword(s):

Population Density ◽

Relevant Information ◽

Biologically Relevant

Download Full-text

Exploring Dimensions of the Media Dream

Exploring the Collective Unconscious in the Age of Digital Media - Advances in Psychology, Mental Health, and Behavioral Studies ◽

10.4018/978-1-4666-9891-8.ch001 ◽

2016 ◽

pp. 1-39 ◽

Cited By ~ 4

Author(s):

Rollin McCraty ◽

Stephen Brock Schafer

Keyword(s):

Heart Rate ◽

Heart Rate Variability ◽

Electromagnetic Coupling ◽

Relevant Information ◽

Living Systems ◽

Biologically Relevant ◽

Dream Analysis ◽

Psychological Analysis ◽

Dramatic Structure ◽

The Media

The earth's magnetic fields are carriers of biologically relevant information that connects all living systems. The electromagnetic coupling of the human brain, cardiovascular and nervous systems, and geomagnetic frequencies supports the hypothesis that the mediated reality of electromagnetic bandwidths can be correlated with bio-energetic and geomagnetic frequencies. Understood as bio-energetic functions (Thinking, Feeling, Sensing, & Intuiting), the media-sphere becomes measurable according to principles of coherency (measured as heart-rate variability, HRV) and principles of Jungian dream analysis (compensation and dramatic structure). It has been demonstrated that the rhythmic patterns in beat-to-beat heart rate variability reflect emotional functions, permeate every bodily cell, and play a central role in the generation and transmission of system-wide information via the electromagnetic field. So, the “media dream” becomes susceptible to psychological analysis leading to a better understanding of unconscious cognitive archetypal patterns of contextual collectives.

Download Full-text