ON THE CHOICE OF PARTIAL ORDERS ON FEATURE VALUES SETS IN THE SUPERVISED CLASSIFICATION PROBLEM

Author(s):  
Jaromír Janisch ◽  
Tomáš Pevný ◽  
Viliam Lisý

We study a classification problem where each feature can be acquired for a cost and the goal is to optimize a trade-off between the expected classification error and the feature cost. We revisit an earlier approach that framed the problem as a sequential decision-making problem and solved it by Q-learning with a linear approximation, where each action either requests a feature value or terminates the episode by providing a classification decision. On a set of eight problems, we demonstrate that by replacing the linear approximation with neural networks the approach becomes comparable to the state-of-the-art algorithms developed specifically for this problem. The approach is flexible, as it can be improved with any new reinforcement learning enhancement, it allows inclusion of a pre-trained high-performance classifier, and, unlike prior art, its performance is robust across all evaluated datasets.
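The sequential framing in this abstract can be illustrated with a minimal episode sketch. The abstract does not give the reward design, so everything here is an assumption made for illustration: the class name `CostlyFeatureEpisode`, the trade-off weight `lam`, a reward of `-lam * cost` for acquiring a feature, and a reward of 0 or -1 for a correct or incorrect classification. The Q-learning agent itself is not shown.

```python
import numpy as np


class CostlyFeatureEpisode:
    """One episode: classify a single sample while paying for each revealed feature.

    Hypothetical sketch; the reward values are assumptions, not the paper's definition.
    """

    def __init__(self, x, y, costs, n_classes, lam=1.0):
        self.x = np.asarray(x, dtype=float)          # full feature vector (hidden at start)
        self.y = int(y)                              # true class label
        self.costs = np.asarray(costs, dtype=float)  # acquisition cost per feature
        self.n_classes = n_classes
        self.lam = lam                               # assumed cost/error trade-off weight
        self.mask = np.zeros(len(self.x), dtype=bool)
        self.done = False

    def observation(self):
        # Unacquired features are zeroed; the mask tells the agent which values are real.
        return np.concatenate([np.where(self.mask, self.x, 0.0), self.mask.astype(float)])

    def step(self, action):
        """Actions 0..n_features-1 request a feature; the remaining actions classify."""
        if self.done:
            raise RuntimeError("episode already terminated")
        n_features = len(self.x)
        if action < n_features:
            if self.mask[action]:
                raise ValueError("feature already acquired")
            self.mask[action] = True
            return self.observation(), -self.lam * self.costs[action], False
        predicted = action - n_features
        if predicted >= self.n_classes:
            raise ValueError("unknown action")
        self.done = True
        reward = 0.0 if predicted == self.y else -1.0
        return self.observation(), reward, True


# Usage: reveal feature 1, then classify the sample as class 1.
episode = CostlyFeatureEpisode(x=[0.5, 2.0], y=1, costs=[0.1, 0.3], n_classes=2, lam=2.0)
obs, cost_reward, finished = episode.step(1)
final_obs, class_reward, terminated = episode.step(3)
```

Under these assumptions the return of an episode is the negative weighted feature cost plus the classification penalty, so maximizing it corresponds to the cost/error trade-off the abstract describes.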


Author(s):  
Amparo Baillo ◽  
Antonio Cuevas ◽  
Ricardo Fraiman

This article reviews the literature concerning supervised and unsupervised classification of functional data. It first explains the meaning of unsupervised classification vs. supervised classification before discussing the supervised classification problem in the infinite-dimensional case, showing that its formal statement generally coincides with that of discriminant analysis in the classical multivariate case. It then considers the optimal classifier and plug-in rules, empirical risk and empirical minimization rules, linear discrimination rules, the k nearest neighbor (k-NN) method, and kernel rules. It also describes classification based on partial least squares, classification based on reproducing kernels, and depth-based classification. Finally, it examines unsupervised classification methods, focusing on K-means for functional data, K-means for data in a Hilbert space, and impartial trimmed K-means for functional data. Some practical issues, in particular real-data examples and simulations, are reviewed and some selected proofs are given.
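As an illustration of one method listed in this abstract, the k-NN rule can be applied to curves by replacing the Euclidean distance with an L2 distance between functions. The sketch below assumes every curve is sampled on a common grid `t` and approximates the integral with the trapezoidal rule; the function names and the toy sine and cosine curves are hypothetical and not taken from the article.

```python
import numpy as np


def l2_distance(f, g, t):
    """Approximate the L2 distance between two curves sampled on the grid t."""
    d2 = (np.asarray(f) - np.asarray(g)) ** 2
    # Trapezoidal rule for the integral of the squared difference.
    return np.sqrt(0.5 * np.sum((d2[:-1] + d2[1:]) * np.diff(t)))


def knn_classify(curve, train_curves, train_labels, t, k=3):
    """Majority vote among the k training curves closest in L2 distance."""
    dists = np.array([l2_distance(curve, c, t) for c in train_curves])
    nearest = np.argsort(dists)[:k]
    labels, counts = np.unique(np.asarray(train_labels)[nearest], return_counts=True)
    return labels[np.argmax(counts)]


# Toy data: class 0 curves are shifted sines, class 1 curves are shifted cosines.
t = np.linspace(0.0, 1.0, 50)
offsets = [-0.1, 0.0, 0.1]
train_curves = [np.sin(2 * np.pi * t) + o for o in offsets]
train_curves += [np.cos(2 * np.pi * t) + o for o in offsets]
train_labels = [0, 0, 0, 1, 1, 1]

predicted = knn_classify(np.sin(2 * np.pi * t) + 0.05, train_curves, train_labels, t, k=3)
```

Other choices of distance, for example one based on derivatives of the curves, would fit the same structure by swapping `l2_distance`.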


2021 ◽  
Author(s):  
Zakaria Mehrab ◽  
Aniruddha Adiga ◽  
Madhav Marathe ◽  
Srinivasan Venkatramanan ◽  
Samarth Swarup

High-resolution mobility datasets have become increasingly available in the past few years and have enabled detailed models of infectious disease spread, including those for COVID-19. However, there are open questions about how such mobility data can be used effectively within epidemic models and for which tasks they are best suited. In this paper, we extract a number of graph-based proximity metrics from high-resolution cellphone trace data from X-Mode and use them to study COVID-19 epidemic spread in 50 land-grant university counties in the US. We present an approach to estimate the effect of mobility on cases by fitting an ODE-based model and performing multivariate linear regression to explain the estimated time-varying transmissibility. We find that, while mobility plays a significant role, the contribution is heterogeneous across the counties, as exemplified by a subsequent correlation analysis. We then evaluate the metrics' utility for case surge prediction, defined as a supervised classification problem, and show that the learnt model can predict surges with 95% accuracy and 87% F1-score.
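Accuracy and F1-score, the two metrics reported in this abstract, can diverge when one class is rare, which is plausible for surge events. The sketch below computes both for binary labels; the function name and the ten toy labels are hypothetical and unrelated to the paper's data.

```python
def accuracy_and_f1(y_true, y_pred):
    """Return (accuracy, F1) for binary labels, where 1 marks the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, f1


# Toy example: two positives out of ten samples, one of which is missed.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0]
accuracy, f1 = accuracy_and_f1(y_true, y_pred)
```

In this toy case a single missed positive lowers the F1-score far more than the accuracy, because F1 ignores the many correctly predicted negatives.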


2020 ◽  
Vol 14 ◽  
pp. 174830262097153
Author(s):  
Carlos Brito-Pacheco ◽  
Carlos Brito-Loeza ◽  
Anabel Martin-Gonzalez

In this work, we introduce a new regularized logistic model for the supervised classification problem. Logistic models have become the preferred tools for supervised classification in many situations. They mostly use either L1 or L2 regularization of the weight vector of parameters. Here we take a different approach by applying regularization not to the weight vector but to the gradient vector of the function representing the separating hyper-surface. We present the mathematical analysis of the model in its continuous setting and provide experimental evidence to show that the new model is competitive with state-of-the-art models.


2021 ◽  
Vol 62 ◽  
pp. 36-43
Author(s):  
Eglė Zikarienė ◽  
Kęstutis Dučinskas

In this paper, spatial data specified by auto-beta models are analysed by considering the supervised classification problem of assigning a feature observation to one of two populations. Two classification rules, based on the conditional Bayes discriminant function (BDF) and the linear discriminant function (LDF), are proposed. These classification rules are compared by the values of their actual error rates in a simulation study.


2019 ◽  
Vol 29 (2) ◽  
pp. 151-176
Author(s):  
Wiame Ech-Chelfi ◽  
Hammoumi El

In this work, we develop the CASVM and CANN algorithms for the semi-supervised classification problem. The algorithms are based on a combination of ensemble clustering and kernel methods. A probabilistic model of classification using a cluster ensemble is proposed. Within the model, the error probability of CANN is studied, and assumptions under which the probability of error converges to zero are formulated. The proposed algorithms are experimentally tested on a hyperspectral image. It is shown that CASVM and CANN are more noise-resistant than standard SVM and kNN.


2012 ◽  
Vol 580 ◽  
pp. 369-373
Author(s):  
Liang Huang ◽  
Rui Xuan Li ◽  
Kun Mei Wen ◽  
Xi Wu Gu

With the large amount of complex network data becoming available on the web, link prediction has become a popular research field in data mining. We focus on the link prediction task, which can be formulated as a binary classification problem in social networks. To treat this problem, a sparse semi-supervised classification algorithm called Self Training Semi-supervised Truncated Kernel Projection Machine (STKPM), based on empirical feature selection, is proposed for link prediction. Experimental results show that the proposed algorithm outperforms several leading learning algorithms, with smaller test errors and greater stability.
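The formulation of link prediction as binary classification can be sketched by turning every node pair into one labeled example. This is not the STKPM algorithm from the abstract: the common-neighbor count used as the single feature, the function name, and the toy graph are assumptions chosen only to show the shape of the resulting dataset.

```python
from itertools import combinations


def pair_examples(nodes, edges):
    """Build one (pair, feature, label) example per unordered node pair.

    feature: number of common neighbors (an assumed, illustrative feature)
    label:   1 if the pair is linked in the graph, else 0
    """
    edge_set = {frozenset(e) for e in edges}
    neighbors = {n: set() for n in nodes}
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    examples = []
    for u, v in combinations(sorted(nodes), 2):
        common = len(neighbors[u] & neighbors[v])
        label = 1 if frozenset((u, v)) in edge_set else 0
        examples.append(((u, v), common, label))
    return examples


# Toy graph: a triangle a-b-c with a pendant node d attached to c.
examples = pair_examples(
    nodes=["a", "b", "c", "d"],
    edges=[("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")],
)
```

In a realistic setup the features would be computed on an earlier snapshot of the network and the labels on later links, and in a semi-supervised setting many pairs would carry no label at all.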

