On the Logical Analysis of Partially Ordered Data in the Supervised Classification Problem

2019 ◽  
Vol 59 (9) ◽  
pp. 1542-1552
Author(s):  
E. V. Djukova ◽  
G. O. Masliakov ◽  
P. A. Prokofyev
Author(s):  
Amparo Baillo ◽  
Antonio Cuevas ◽  
Ricardo Fraiman

This article reviews the literature concerning supervised and unsupervised classification of functional data. It first explains the meaning of unsupervised classification vs. supervised classification before discussing the supervised classification problem in the infinite-dimensional case, showing that its formal statement generally coincides with that of discriminant analysis in the classical multivariate case. It then considers the optimal classifier and plug-in rules, empirical risk and empirical minimization rules, linear discrimination rules, the k nearest neighbor (k-NN) method, and kernel rules. It also describes classification based on partial least squares, classification based on reproducing kernels, and depth-based classification. Finally, it examines unsupervised classification methods, focusing on K-means for functional data, K-means for data in a Hilbert space, and impartial trimmed K-means for functional data. Some practical issues, in particular real-data examples and simulations, are reviewed and some selected proofs are given.


2021 ◽  
Author(s):  
Zakaria Mehrab ◽  
Aniruddha Adiga ◽  
Madhav Marathe ◽  
Srinivasan Venkatramanan ◽  
Samarth Swarup

High resolution mobility datasets have become increasingly available in the past few years and have enabled detailed models for infectious disease spread including those for COVID-19. However, there are open questions on how such a mobility data can be used effectively within epidemic models and for which tasks they are best suited. In this paper, we extract a number of graph-based proximity metrics from high resolution cellphone trace data from X-Mode and use it to study COVID-19 epidemic spread in 50 land grant university counties in the US. We present an approach to estimate the effect of mobility on cases by fitting an ODE based model and performing multivariate linear regression to explain the estimated time varying transmissibility. We find that, while mobility plays a significant role, the contribution is heterogeneous across the counties, as exemplified by a subsequent correlation analysis. We subsequently evaluate the metrics' utility for case surge prediction defined as a supervised classification problem, and show that the learnt model can predict surges with 95% accuracy and 87% F1-score.


2020 ◽  
Vol 39 (24) ◽  
pp. 3313-3328
Author(s):  
Edward H. Ip ◽  
Shyh‐Huei Chen ◽  
Karen Bandeen‐Roche ◽  
Jaime L. Speiser ◽  
Li Cai ◽  
...  

2020 ◽  
Vol 14 ◽  
pp. 174830262097153
Author(s):  
Carlos Brito-Pacheco ◽  
Carlos Brito-Loeza ◽  
Anabel Martin-Gonzalez

In this work, we introduce a new regularized logistic model for the supervised classification problem. Current logistic models have become the preferred tools for supervised classification in many situations. They mostly use either L1 or L2 regularization of the weight vector of parameters. Here we take a different approach by applying regularization not to the weight vector but to the gradient vector of the function representing the separating hyper-surface. We present the mathematical analysis of the model in its continuous setting and provide experimental evidence to show that the new model is competitive with state of the art models.


Sign in / Sign up

Export Citation Format

Share Document