Novel framework for image attribute annotation with gene selection XGBoost algorithm and relative attribute model

Abstract Background In recent years, to investigate challenging bioinformatics problems, the utilization of multiple genomic and proteomic sources has become immensely popular among researchers. One such issue is feature or gene selection and identifying relevant and non-redundant marker genes from high dimensional gene expression data sets. In that context, designing an efficient feature selection algorithm exploiting knowledge from multiple potential biological resources may be an effective way to understand the spectrum of cancer or other diseases with applications in specific epidemiology for a particular population. Results In the current article, we design the feature selection and marker gene detection as a multi-view multi-objective clustering problem. Regarding that, we propose an Unsupervised Multi-View Multi-Objective clustering-based gene selection approach called UMVMO-select. Three important resources of biological data (gene ontology, protein interaction data, protein sequence) along with gene expression values are collectively utilized to design two different views. UMVMO-select aims to reduce gene space without/minimally compromising the sample classification efficiency and determines relevant and non-redundant gene markers from three cancer gene expression benchmark data sets. Conclusion A thorough comparative analysis has been performed with five clustering and nine existing feature selection methods with respect to several internal and external validity metrics. Obtained results reveal the supremacy of the proposed method. Reported results are also validated through a proper biological significance test and heatmap plotting.

Download Full-text

Gene Selection for Cancer Classification using a New Hybrid of Binary Black Hole Algorithm

2020 28th Signal Processing and Communications Applications Conference (SIU) ◽

10.1109/siu49456.2020.9302351 ◽

2020 ◽

Author(s):

Elnaz Pashaei ◽

Elham Pashaei

Keyword(s):

Black Hole ◽

Gene Selection ◽

Cancer Classification ◽

Binary Black Hole ◽

Selection For ◽

Black Hole Algorithm

Download Full-text

An Adaptive Unsupervised Feature Selection Algorithm Based on MDS for Tumor Gene Data Classification

Sensors ◽

10.3390/s21113627 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3627

Author(s):

Bo Jin ◽

Chunling Fu ◽

Yong Jin ◽

Wei Yang ◽

Shengbin Li ◽

...

Keyword(s):

Feature Selection ◽

Local Structure ◽

Gene Selection ◽

Dimensional Space ◽

Original Data ◽

Global Structure ◽

Biological Data ◽

Special Treatment ◽

Selection Scheme ◽

Unsupervised Feature Selection

Identifying the key genes related to tumors from gene expression data with a large number of features is important for the accurate classification of tumors and to make special treatment decisions. In recent years, unsupervised feature selection algorithms have attracted considerable attention in the field of gene selection as they can find the most discriminating subsets of genes, namely the potential information in biological data. Recent research also shows that maintaining the important structure of data is necessary for gene selection. However, most current feature selection methods merely capture the local structure of the original data while ignoring the importance of the global structure of the original data. We believe that the global structure and local structure of the original data are equally important, and so the selected genes should maintain the essential structure of the original data as far as possible. In this paper, we propose a new, adaptive, unsupervised feature selection scheme which not only reconstructs high-dimensional data into a low-dimensional space with the constraint of feature distance invariance but also employs ℓ2,1-norm to enable a matrix with the ability to perform gene selection embedding into the local manifold structure-learning framework. Moreover, an effective algorithm is developed to solve the optimization problem based on the proposed scheme. Comparative experiments with some classical schemes on real tumor datasets demonstrate the effectiveness of the proposed method.

Download Full-text

Modelling the Process of Determining the Tourist Demand Based on Fishbein's Multi-Attribute Model

2020 IEEE 15th International Conference on Computer Sciences and Information Technologies (CSIT) ◽

10.1109/csit49958.2020.9321898 ◽

2020 ◽

Author(s):

Tetiana Hovorushchenko ◽

Vladyslav Glukhov ◽

Olha Hovorushchenko

Keyword(s):

Attribute Model ◽

Tourist Demand

Download Full-text