Optimality of Spectrum Pursuit for Column Subset Selection Problem: Theoretical Guarantees and Applications in Deep Learning

10.36227/techrxiv.13253945.v1 ◽

2020 ◽

Author(s):

Mohsen Joneidi ◽

Saeed Vahidian ◽

Ashkan Esmaeili ◽

Siavash Khodadadeh

Keyword(s):

Deep Learning ◽

Upper Bound ◽

Linear Complexity ◽

Subset Selection ◽

Selection Problem ◽

Theoretical Methods ◽

Novel Technique ◽

Original Dataset ◽

Column Subset Selection ◽

Minimum Number

We propose a novel technique for finding representatives in a large, unsupervised dataset. The approach is based on the concept of self-rank, defined as the minimum number of samples needed to reconstruct all samples with an accuracy proportional to that of the rank-$K$ approximation. The proposed algorithm has linear complexity with respect to the size of the original dataset and, at the same time, provides an adaptive upper bound on the approximation ratio. These properties close a long-standing gap between practical and theoretical methods for finding representatives.
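
As a rough illustration of the approximation ratio mentioned in the abstract, the sketch below compares the error of reconstructing all samples from a chosen column subset with the error of the best rank-$K$ approximation. It assumes samples are stored as columns and that errors are measured in the Frobenius norm; the function names and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np

def subset_error(A, cols):
    """Frobenius error of projecting every column of A onto span(A[:, cols])."""
    C = A[:, cols]
    # Least-squares coefficients X minimising ||A - C X||_F
    X, *_ = np.linalg.lstsq(C, A, rcond=None)
    return np.linalg.norm(A - C @ X)

def rank_k_error(A, k):
    """Frobenius error of the best rank-k approximation (Eckart-Young theorem)."""
    s = np.linalg.svd(A, compute_uv=False)
    return np.sqrt(np.sum(s[k:] ** 2))

def approximation_ratio(A, cols):
    """Subset reconstruction error divided by the rank-K error, with K = len(cols)."""
    return subset_error(A, cols) / rank_k_error(A, len(cols))

# Toy data (an assumption for illustration): 50 samples in 20 dimensions,
# close to rank 3 with a small amount of noise.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 3)) @ rng.standard_normal((3, 50))
A += 0.01 * rng.standard_normal(A.shape)

ratio = approximation_ratio(A, [0, 1, 2])
print(f"approximation ratio for columns 0, 1, 2: {ratio:.3f}")
```

The ratio can never fall below 1, because projecting onto $K$ selected columns is itself an approximation of rank at most $K$; a good selection keeps the ratio close to 1.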

A Comparison of Differential Evolution and Genetic Algorithms for the Column Subset Selection Problem

Advances in Intelligent Systems and Computing - Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015 ◽

10.1007/978-3-319-26227-7_21 ◽

2016 ◽

pp. 223-232 ◽

Cited By ~ 1

Author(s):

Pavel Krömer ◽

Jan Platoš

Keyword(s):

Genetic Algorithms ◽

Differential Evolution ◽

Subset Selection ◽

Selection Problem ◽

Column Subset Selection

Genetic Algorithm for the Column Subset Selection Problem

2014 Eighth International Conference on Complex, Intelligent and Software Intensive Systems ◽

10.1109/cisis.2014.3 ◽

2014 ◽

Cited By ~ 6

Author(s):

Pavel Krömer ◽

Jan Platoš ◽

Václav Snášel

Keyword(s):

Genetic Algorithm ◽

Subset Selection ◽

Selection Problem ◽

Column Subset Selection

An Improved Approximation Algorithm for the Column Subset Selection Problem

Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms ◽

10.1137/1.9781611973068.105 ◽

2009 ◽

Cited By ~ 92

Author(s):

Christos Boutsidis ◽

Michael W. Mahoney ◽

Petros Drineas

Keyword(s):

Approximation Algorithm ◽

Subset Selection ◽

Selection Problem ◽

Column Subset Selection

Select to better learn: Fast and accurate deep learning using data selection from nonlinear manifolds

10.36227/techrxiv.12084027 ◽

2020 ◽

Author(s):

Mohsen Joneidi ◽

Saeed Vahidian ◽

Ashkan Esmaeili ◽

Weijia Wang ◽

Nazanin Rahnavard ◽

...

Keyword(s):

Deep Learning ◽

Small Subset ◽

Original Dataset ◽

Wide Range ◽

Spectral Components ◽

Column Subset Selection ◽

Important Open Problem ◽

Data Points ◽

Using Data ◽

Nonlinear Manifolds

Finding a small subset of data whose linear combinations span the other data points, known as the column subset selection problem (CSSP), is an important open problem in computer science with many applications in computer vision and deep learning. Some existing studies solve CSSP with polynomial time complexity with respect to the size of the original dataset. We propose a simple and efficient selection algorithm with linear complexity, referred to as spectrum pursuit (SP), which pursues the spectral components of the dataset using the available sample points. The proposed non-greedy algorithm iteratively finds K data samples whose span is close to that of the first K spectral components of the entire dataset. SP has no parameter to be fine-tuned, and this desirable property makes it problem-independent. The simplicity of SP enables us to extend the underlying linear model to more complex models such as nonlinear manifolds and graph-based models. The nonlinear extension of SP is introduced as kernel-SP (KSP). The superiority of the proposed algorithms is demonstrated in a wide range of applications.
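
The iterative, non-greedy selection described in the abstract can be sketched roughly as follows. This is a simplified illustration, not the authors' implementation: the function name, the fixed number of sweeps, the random initialisation, and the correlation-based replacement rule are all assumptions, and the dense SVD used here does not reproduce the linear complexity the abstract reports.

```python
import numpy as np

def spectrum_pursuit_sketch(A, K, n_sweeps=5, seed=0):
    """Pick K column indices of A (samples are columns) by repeatedly
    revisiting each selected slot and replacing it with the sample most
    aligned with the leading spectral component of the residual."""
    rng = np.random.default_rng(seed)
    n = A.shape[1]
    # Unit-norm copies of the columns, used only for the alignment score
    norms = np.maximum(np.linalg.norm(A, axis=0, keepdims=True), 1e-12)
    An = A / norms
    selected = [int(i) for i in rng.choice(n, size=K, replace=False)]

    for _ in range(n_sweeps):
        for k in range(K):
            others = [s for i, s in enumerate(selected) if i != k]
            if others:
                # Orthonormal basis for the other K-1 selections
                # (assumes those columns are linearly independent)
                Q, _ = np.linalg.qr(A[:, others])
                E = A - Q @ (Q.T @ A)  # residual after projecting them out
            else:
                E = A
            # Leading left singular vector of the residual
            u = np.linalg.svd(E, full_matrices=False)[0][:, 0]
            scores = np.abs(u @ An)
            if others:
                scores[others] = -np.inf  # never pick the same sample twice
            selected[k] = int(np.argmax(scores))
    return sorted(selected)

# Toy data (hypothetical): 50 samples in 20 dimensions, close to rank 3
rng = np.random.default_rng(1)
A = rng.standard_normal((20, 3)) @ rng.standard_normal((3, 50))
A += 0.01 * rng.standard_normal(A.shape)

sel = spectrum_pursuit_sketch(A, K=3)
print("selected sample indices:", sel)
```

Because every slot is revisited on each sweep, an early choice can be replaced later, which is the sense in which the sketch is non-greedy.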

Column Subset Selection Problem is UG-hard

Journal of Computer and System Sciences ◽

10.1016/j.jcss.2014.01.004 ◽

2014 ◽

Vol 80 (4) ◽

pp. 849-859 ◽

Cited By ~ 10

Author(s):

A. Çivril

Keyword(s):

Subset Selection ◽

Selection Problem ◽

Column Subset Selection

Select to better learn: Fast and accurate deep learning using data selection from nonlinear manifolds

10.36227/techrxiv.12084027.v1 ◽

2020 ◽

Author(s):

Mohsen Joneidi ◽

Saeed Vahidian ◽

Ashkan Esmaeili ◽

Weijia Wang ◽

Nazanin Rahnavard ◽

...

Keyword(s):

Deep Learning ◽

Small Subset ◽

Original Dataset ◽

Wide Range ◽

Spectral Components ◽

Column Subset Selection ◽

Important Open Problem ◽

Data Points ◽

Using Data ◽

Nonlinear Manifolds

Finding a small subset of data whose linear combinations span the other data points, known as the column subset selection problem (CSSP), is an important open problem in computer science with many applications in computer vision and deep learning. Some existing studies solve CSSP with polynomial time complexity with respect to the size of the original dataset. We propose a simple and efficient selection algorithm with linear complexity, referred to as spectrum pursuit (SP), which pursues the spectral components of the dataset using the available sample points. The proposed non-greedy algorithm iteratively finds K data samples whose span is close to that of the first K spectral components of the entire dataset. SP has no parameter to be fine-tuned, and this desirable property makes it problem-independent. The simplicity of SP enables us to extend the underlying linear model to more complex models such as nonlinear manifolds and graph-based models. The nonlinear extension of SP is introduced as kernel-SP (KSP). The superiority of the proposed algorithms is demonstrated in a wide range of applications.

A Near-optimal Protocol for the Subset Selection Problem in RFID Systems

2020 16th International Conference on Mobility, Sensing and Networking (MSN) ◽

10.1109/msn50589.2020.00022 ◽

2020 ◽

Author(s):

Xiujun Wang ◽

Zhi Liu ◽

Susumu Ishihara ◽

Zhe Dang ◽

Jie Li

Keyword(s):

Subset Selection ◽

Selection Problem ◽

Rfid Systems ◽

Optimal Protocol

Structured (De)composable Representations Trained with Neural Networks

Computers ◽

10.3390/computers9040079 ◽

2020 ◽

Vol 9 (4) ◽

pp. 79

Author(s):

Graham Spinks ◽

Marie-Francine Moens

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Contextual Information ◽

Learning To Learn ◽

Class Label ◽

Impact Performance ◽

Novel Technique ◽

Language Data ◽

Concept Classes ◽

Generic Representation

This paper proposes a novel technique for representing templates and instances of concept classes. A template representation is the generic representation that captures the characteristics of an entire class. The proposed technique uses end-to-end deep learning to learn structured and composable representations from input images and discrete labels. The obtained representations are based on distance estimates between the distributions given by the class label and those given by contextual information, which are modeled as environments. We prove that the representations have a clear structure that allows them to be decomposed into factors representing classes and environments. We evaluate the technique on classification and retrieval tasks involving different modalities (visual and language data). In various experiments, we show how the representations can be compressed and how different hyperparameters impact performance.

Evolving Deep Learning Convolutional Neural Networks for Early COVID-19 Detection in Chest X-ray Images

Mathematics ◽

10.3390/math9091002 ◽

2021 ◽

Vol 9 (9) ◽

pp. 1002

Author(s):

Mohammad Khishe ◽

Fabio Caraffini ◽

Stefan Kuhn

Keyword(s):

Deep Learning ◽

Early Detection ◽

Iterative Process ◽

High Accuracy ◽

X Ray ◽

Starting Point ◽

Reliable Model ◽

Minimum Number ◽

Chest X Ray ◽

Deep Learning Model

This article proposes a framework that automatically designs classifiers for the early detection of COVID-19 from chest X-ray images. To do this, our approach repeatedly applies an optimisation heuristic to efficiently find the best combination of hyperparameters for a convolutional deep learning model. The framework starts by optimising a basic convolutional neural network, which is the starting point of the evolution process. Subsequently, at most two additional convolutional layers are added at a time to the previous convolutional structure as the result of a further optimisation phase. Each phase maximises the accuracy of the system and therefore requires training and assessing the new, gradually deeper model on relevant COVID-19 chest X-ray images. This iterative process ends when no improvement in accuracy is recorded. Hence, the proposed method evolves the best-performing network with the minimum number of convolutional layers. In this way, we achieve high accuracy while minimising the presence of redundant layers, which guarantees a fast but reliable model. Our results show that the proposed implementation of the framework achieves accuracy of up to 99.11%, making it particularly suitable for the early detection of COVID-19.
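
The stopping rule described in the abstract (add at most two layers per phase, stop when accuracy no longer improves) can be sketched as a simple loop. This is only an outline under stated assumptions: `evaluate` is a hypothetical stand-in for training and assessing a network with a given number of convolutional layers, and `toy_accuracy` is an invented function used purely to make the example runnable. The per-phase hyperparameter optimisation mentioned in the abstract is not modelled.

```python
from typing import Callable, List, Tuple

def grow_until_no_improvement(
    evaluate: Callable[[int], float],
    start_layers: int = 1,
    max_added_per_phase: int = 2,
) -> Tuple[int, float, List[Tuple[int, float]]]:
    """Deepen a model phase by phase until accuracy stops improving."""
    best_layers = start_layers
    best_acc = evaluate(best_layers)
    history = [(best_layers, best_acc)]
    while True:
        # Each phase tries adding 1..max_added_per_phase layers
        candidates = [
            (evaluate(best_layers + added), best_layers + added)
            for added in range(1, max_added_per_phase + 1)
        ]
        # Highest accuracy wins; ties go to the shallower network
        acc, layers = max(candidates, key=lambda c: (c[0], -c[1]))
        if acc <= best_acc:
            break  # no improvement recorded, so the process ends
        best_layers, best_acc = layers, acc
        history.append((best_layers, best_acc))
    return best_layers, best_acc, history

def toy_accuracy(n_layers: int) -> float:
    """Hypothetical accuracy curve that peaks at four layers."""
    return 0.99 - 0.02 * abs(n_layers - 4)

layers, acc, history = grow_until_no_improvement(toy_accuracy)
print(layers, acc, history)
```

Breaking ties in favour of the shallower candidate reflects the stated goal of reaching the best accuracy with the minimum number of convolutional layers.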
