low rank approximations
Recently Published Documents


TOTAL DOCUMENTS

91
(FIVE YEARS 30)

H-INDEX

15
(FIVE YEARS 2)

Author(s):  
Michał Dereziński ◽  
Rajiv Khanna ◽  
Michael W. Mahoney

The Column Subset Selection Problem (CSSP) and the Nystrom method are among the leading tools for constructing interpretable low-rank approximations of large datasets by selecting a small but representative set of features or instances. A fundamental question in this area is: what is the cost of this interpretability, i.e., how well can a data subset of size k compete with the best rank k approximation? We develop techniques which exploit spectral properties of the data matrix to obtain improved approximation guarantees which go beyond the standard worst-case analysis. Our approach leads to significantly better bounds for datasets with known rates of singular value decay, e.g., polynomial or exponential decay. Our analysis also reveals an intriguing phenomenon: the cost of interpretability as a function of k may exhibit multiple peaks and valleys, which we call a multiple-descent curve. A lower bound we establish shows that this behavior is not an artifact of our analysis, but rather it is an inherent property of the CSSP and Nystrom tasks. Finally, using the example of a radial basis function (RBF) kernel, we show that both our improved bounds and the multiple-descent curve can be observed on real datasets simply by varying the RBF parameter.


2021 ◽  
Author(s):  
Iván Fernández-Val ◽  
Hugo Freeman ◽  
Martin Weidner

Abstract We provide estimation methods for nonseparable panel models based on low-rank factor structure approximations. The factor structures are estimated by matrix-completion methods to deal with the computational challenges of principal component analysis in the presence of missing data. We show that the resulting estimators are consistent in large panels, but suffer from approximation and shrinkage biases. We correct these biases using matching and difference-in-differences approaches. Numerical examples and an empirical application to the effect of election day registration on voter turnout in the U.S. illustrate the properties and usefulness of our methods.


2021 ◽  
Vol 42 (1) ◽  
pp. 330-350
Author(s):  
Michael S. Floater ◽  
Carla Manni ◽  
Espen Sande ◽  
Hendrik Speleers

2020 ◽  
pp. 1
Author(s):  
Markus Bachmayr ◽  
Henrik Eisenmann ◽  
Emil Kieri ◽  
André Uschmajew

Sign in / Sign up

Export Citation Format

Share Document