Subset Selection by Pareto Optimization with Recombination

2020 · Vol. 34 (03) · pp. 2408–2415
Author(s): Chao Qian, Chao Bian, Chao Feng

Subset selection, i.e., selecting a limited number of items to optimize some given objective function, is a fundamental problem with various applications such as unsupervised feature selection and sparse regression. By employing a multi-objective evolutionary algorithm (EA) with mutation only to optimize the given objective function and minimize the number of selected items simultaneously, the recently proposed POSS algorithm achieves state-of-the-art performance for subset selection. In this paper, we propose the PORSS algorithm by incorporating recombination, a characterizing feature of EAs, into POSS. We prove that PORSS achieves the same optimal polynomial-time approximation guarantee as POSS when the objective function is monotone, and can find an optimal solution efficiently in some cases where POSS cannot. Extensive experiments on unsupervised feature selection and sparse regression show the superiority of PORSS over POSS. Our analysis also theoretically discloses that recombination from diverse solutions is more likely than mutation alone to generate a variety of offspring, thereby leading to better exploration; this may be of independent interest for understanding the influence of recombination.
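A minimal sketch of this bi-objective scheme, assuming a boolean-vector encoding, uniform recombination, and standard bit-wise mutation (the operator details here are illustrative assumptions, not the authors' exact specification):

```python
import numpy as np

def porss_sketch(objective, n, budget, iters=2000, seed=0):
    # Illustrative PORSS-style loop: maximize objective(x) and minimize |x|
    # simultaneously, keeping a Pareto archive of non-dominated subsets.
    rng = np.random.default_rng(seed)

    def fitness(x):
        k = int(x.sum())
        val = objective(x) if k <= budget else float("-inf")  # penalize oversize
        return (val, -k)  # both components are maximized

    x0 = np.zeros(n, dtype=bool)
    archive = [(x0, fitness(x0))]  # pairs of (solution, fitness)

    for _ in range(iters):
        p1, _ = archive[rng.integers(len(archive))]
        p2, _ = archive[rng.integers(len(archive))]
        mask = rng.random(n) < 0.5                  # uniform recombination
        child = np.where(mask, p1, p2)
        child = child ^ (rng.random(n) < 1.0 / n)   # bit-wise mutation
        fc = fitness(child)
        # discard the child if some archived solution weakly dominates it
        if any(fa[0] >= fc[0] and fa[1] >= fc[1] for _, fa in archive):
            continue
        # otherwise remove everything the child weakly dominates, then add it
        archive = [(a, fa) for a, fa in archive
                   if not (fc[0] >= fa[0] and fc[1] >= fa[1])]
        archive.append((child, fc))

    # return the best feasible subset found
    return max((t for t in archive if t[0].sum() <= budget),
               key=lambda t: t[1][0])[0]
```

Any set function accepting a boolean mask can serve as `objective`, e.g. a regression-quality score for sparse regression.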

Author(s): Chao Bian, Chao Qian, Frank Neumann, Yang Yu

Subset selection with cost constraints is a fundamental problem with various applications such as influence maximization and sensor placement. The goal is to select a subset from a ground set to maximize a monotone objective function such that a monotone cost function is upper bounded by a budget. Previous algorithms with bounded approximation guarantees include the generalized greedy algorithm, POMC, and EAMC, all of which can achieve the best known approximation guarantee. In real-world scenarios, resources often vary, i.e., the budget changes over time, requiring algorithms to adapt their solutions quickly. However, when the budget changes dynamically, all three of these algorithms either achieve arbitrarily bad approximation guarantees or require a long running time. In this paper, we propose a new algorithm, FPOMC, which combines the merits of the generalized greedy algorithm and POMC: FPOMC introduces a greedy selection strategy into POMC. We prove that FPOMC efficiently maintains the best known approximation guarantee.
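For context, a sketch of the generalized greedy rule mentioned above, which repeatedly adds the item with the best marginal-gain-to-marginal-cost ratio that still fits in the budget (a simplified reading: the full algorithm also compares the greedy result against the best single item, and FPOMC embeds such a rule inside a Pareto-optimization loop rather than using it stand-alone):

```python
def generalized_greedy(gain, cost, ground_set, budget):
    # gain/cost are monotone set functions evaluated on lists of items.
    selected = []
    remaining = set(ground_set)
    while remaining:
        def ratio(v):
            dg = gain(selected + [v]) - gain(selected)
            dc = cost(selected + [v]) - cost(selected)
            return dg / dc if dc > 0 else float("inf")
        best = max(remaining, key=ratio)           # best bang-for-buck item
        if cost(selected + [best]) <= budget:      # add it only if affordable
            selected.append(best)
        remaining.discard(best)
    return selected
```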


2014 · Vol. 44 (6) · pp. 793–804
Author(s): Chenping Hou, Feiping Nie, Xuelong Li, Dongyun Yi, Yi Wu

Author(s): Chao Qian, Chao Feng, Ke Tang

The problem of selecting a sequence of items from a universe that maximizes some given objective function arises in many real-world applications. In this paper, we propose an anytime randomized iterative approach, POSeqSel, which maximizes the given objective function and minimizes the sequence length simultaneously. We prove that, within a reasonable running time, POSeqSel can always reach or improve upon the best known approximation guarantee for any previously studied objective function. Empirical results exhibit the superior performance of POSeqSel.
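A hedged sketch of how such an anytime bi-objective sequence search might look; random insertion and deletion are assumed as the variation operators (the paper's actual operators and archive rules may differ):

```python
import random

def poseqsel_sketch(objective, universe, max_len, iters=2000, seed=0):
    # Maintain a Pareto archive over (objective value, -sequence length).
    random.seed(seed)
    archive = [((), (objective(()), 0))]
    for _ in range(iters):
        seq, _ = random.choice(archive)
        seq = list(seq)
        if seq and random.random() < 0.5:
            del seq[random.randrange(len(seq))]            # delete a position
        elif len(seq) < max_len:
            seq.insert(random.randrange(len(seq) + 1),     # insert an item
                       random.choice(universe))
        child = tuple(seq)
        fc = (objective(child), -len(child))
        if any(f[0] >= fc[0] and f[1] >= fc[1] for _, f in archive):
            continue  # weakly dominated by an archived sequence
        archive = [(s, f) for s, f in archive
                   if not (fc[0] >= f[0] and fc[1] >= f[1])]
        archive.append((child, fc))
    return max(archive, key=lambda t: t[1][0])[0]  # best sequence found
```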


Author(s): Chao Feng, Chao Qian, Ke Tang

Dimensionality reduction is often employed to deal with data having a huge number of features, and can generally be divided into two categories: feature transformation and feature selection. Owing to its interpretability, its efficiency during inference, and the abundance of unlabeled data, unsupervised feature selection has attracted much attention. In this paper, we consider its natural formulation, column subset selection (CSS), which minimizes the reconstruction error of a data matrix by selecting a subset of features. We propose an anytime randomized iterative approach, POCSS, which minimizes the reconstruction error and the number of selected features simultaneously. Its approximation guarantee is well bounded. Empirical results exhibit the superior performance of POCSS over state-of-the-art algorithms.
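The CSS objective itself is compact to state in code. A minimal version, assuming the reconstruction error is measured by least-squares projection onto the span of the selected columns:

```python
import numpy as np

def reconstruction_error(X, cols):
    # Squared Frobenius-norm error of reconstructing X from its columns `cols`.
    C = X[:, list(cols)]                       # m x k matrix of chosen columns
    P, *_ = np.linalg.lstsq(C, X, rcond=None)  # least-squares coefficients
    return float(np.linalg.norm(X - C @ P, "fro") ** 2)
```

Plugged into a bi-objective loop such as the PORSS-style sketch above (with the sign flipped, since the error is minimized), this yields a POCSS-like procedure.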


2018 · Vol. 10 (10) · p. 1564
Author(s): Patrick Bradley, Sina Keller, Martin Weinmann

In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which rely only on the given feature vectors, not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data, where the topology is described with an ultrametricity index. Specifically, we consider the Murtagh Ultrametricity Index (MUI), defined on the basis of triangles within the given data, and the Topological Ultrametricity Index (TUI), defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The results reveal that supervised and unsupervised feature selection techniques achieve similar classification performance, while the latter select features independently of the given classification task and thus deliver generally versatile features.
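A sketch of a triangle-based ultrametricity index in the spirit of the MUI; the sampling scheme and tolerance below are assumptions, and Murtagh's exact definition differs in detail:

```python
import numpy as np

def ultrametricity_index(X, n_triangles=5000, tol=0.02, seed=0):
    # Sample random triangles of data points and report the fraction that are
    # approximately ultrametric: isosceles with the two largest pairwise
    # distances (nearly) equal. Values near 1 suggest tree-like topology.
    rng = np.random.default_rng(seed)
    n, hits = len(X), 0
    for _ in range(n_triangles):
        i, j, k = rng.choice(n, size=3, replace=False)
        d = sorted([np.linalg.norm(X[i] - X[j]),
                    np.linalg.norm(X[j] - X[k]),
                    np.linalg.norm(X[i] - X[k])])
        if d[2] - d[1] <= tol * d[2]:   # two largest sides nearly equal
            hits += 1
    return hits / n_triangles
```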


Stats · 2021 · Vol. 4 (2) · pp. 359–384
Author(s): Manabu Ichino, Kadri Umbleja, Hiroyuki Yaguchi

This paper presents an unsupervised feature selection method for multi-dimensional histogram-valued data. We define a multi-role measure, called the compactness, based on the concept size of given objects and/or clusters described using a fixed number of equal-probability bin-rectangles. In each step of clustering, we agglomerate objects and/or clusters so as to minimize the compactness of the generated cluster. This means that the compactness plays the role of a similarity measure between the objects and/or clusters to be merged. Minimizing the compactness is equivalent to maximizing the dissimilarity of the generated cluster, i.e., concept, against the whole concept in each step. In this sense, the compactness also plays the role of a cluster-quality measure. We further show that the average compactness of each feature with respect to objects and/or clusters across several clustering steps is useful as a feature-effectiveness criterion. Features having small average compactness are mutually covariate and are able to detect a geometrically thin structure embedded in the given multi-dimensional histogram-valued data. We obtain a thorough understanding of the given data via visualization using dendrograms and scatter diagrams with respect to the selected informative features. We illustrate the effectiveness of the proposed method using an artificial data set and real histogram-valued data sets.
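A deliberately simplified sketch of the agglomeration loop: here each cluster is summarized by a plain bounding box and its "compactness" by the mean side length, purely to make the merge-the-most-compact-pair idea concrete (the paper's measure over equal-probability bin-rectangles for histogram-valued data is substantially richer):

```python
import numpy as np

def agglomerate_by_compactness(X, n_clusters=2):
    # Each cluster is a (lower, upper) bounding box over the features.
    clusters = [(X[i], X[i]) for i in range(len(X))]
    members = [[i] for i in range(len(X))]

    def merged_size(a, b):  # stand-in "compactness" of the merged concept
        lo, hi = np.minimum(a[0], b[0]), np.maximum(a[1], b[1])
        return float(np.mean(hi - lo))

    while len(clusters) > n_clusters:
        # merge the pair whose union forms the smallest (most compact) concept
        _, i, j = min((merged_size(clusters[i], clusters[j]), i, j)
                      for i in range(len(clusters))
                      for j in range(i + 1, len(clusters)))
        lo = np.minimum(clusters[i][0], clusters[j][0])
        hi = np.maximum(clusters[i][1], clusters[j][1])
        clusters[i] = (lo, hi)
        members[i] += members[j]
        del clusters[j], members[j]
    return members
```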



Particle Swarm Optimization (PSO) is a nature-inspired stochastic evolutionary algorithm that iteratively tries to improve a solution with respect to a given objective function. The problem becomes challenging if the objective function is neither properly identified nor properly evaluated, which results in slow convergence and an inability to find the optimal solution. Hence, we propose a novel rough-set-based particle swarm optimization algorithm using the golden ratio principle for an efficient feature selection process that focuses on two objectives: first, producing a reduced subset of features without compromising the originality of the data, and second, yielding a highly optimal result. Since many subsets of features might yield a meaningful solution, we use the golden ratio principle to extract the most reduced subset with a highly optimal solution. The algorithm has been tested on several benchmark datasets. The results show that the proposed algorithm identifies a small set of features without compromising the optimal solution, thus achieving the stated objectives.
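A generic binary-PSO sketch for feature selection, with the golden ratio standing in as a weight that trades fitness against subset size (an illustrative stand-in only; the paper's rough-set evaluation and its golden-ratio selection rule are not reproduced here):

```python
import numpy as np

GOLDEN = (1 + 5 ** 0.5) / 2  # golden ratio, used as an illustrative weight

def bpso_feature_selection(fitness, n_features, n_particles=20,
                           iters=100, seed=0):
    # Particles are bit masks over features; velocities pass through a
    # sigmoid to give per-bit activation probabilities (standard binary PSO).
    rng = np.random.default_rng(seed)
    pos = rng.random((n_particles, n_features)) < 0.5
    vel = rng.normal(0.0, 1.0, (n_particles, n_features))

    def score(x):  # higher fitness and fewer selected features are better
        return fitness(x) - x.sum() / (GOLDEN * n_features)

    pbest = pos.copy()
    pscore = np.array([score(x) for x in pos])
    gbest = pbest[pscore.argmax()].copy()

    for _ in range(iters):
        r1, r2 = rng.random(vel.shape), rng.random(vel.shape)
        vel = (0.7 * vel
               + 1.5 * r1 * (pbest.astype(int) - pos.astype(int))
               + 1.5 * r2 * (gbest.astype(int) - pos.astype(int)))
        vel = np.clip(vel, -6.0, 6.0)                    # keep sigmoid stable
        pos = rng.random(vel.shape) < 1.0 / (1.0 + np.exp(-vel))
        s = np.array([score(x) for x in pos])
        improved = s > pscore
        pbest[improved], pscore[improved] = pos[improved], s[improved]
        gbest = pbest[pscore.argmax()].copy()
    return gbest  # best feature mask found
```

Here `fitness` is any hypothetical evaluator of a boolean feature mask, e.g. cross-validated accuracy of a downstream classifier.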

