Algorithms and Literate Programs for Weighted Low-Rank Approximation with Missing Data

Background A lifelogs-based wellness index (LWI) is a function for calculating wellness scores based on health behavior lifelogs (eg, daily walking steps and sleep times collected via a smartwatch). A wellness score intuitively shows the users of smart wellness services the overall condition of their health behaviors. LWI development includes estimation (ie, estimating coefficients in LWI with data). A panel data set comprising health behavior lifelogs allows LWI estimation to control for unobserved variables, thereby resulting in less bias. However, these data sets typically have missing data due to events that occur in daily life (eg, smart devices stop collecting data when batteries are depleted), which can introduce biases into LWI coefficients. Thus, the appropriate choice of method to handle missing data is important for reducing biases in LWI estimations with panel data. However, there is a lack of research in this area. Objective This study aims to identify a suitable missing-data handling method for LWI estimation with panel data. Methods Listwise deletion, mean imputation, expectation maximization–based multiple imputation, predictive-mean matching–based multiple imputation, k-nearest neighbors–based imputation, and low-rank approximation–based imputation were comparatively evaluated by simulating an existing case of LWI development. A panel data set comprising health behavior lifelogs of 41 college students over 4 weeks was transformed into a reference data set without any missing data. Then, 200 simulated data sets were generated by randomly introducing missing data at proportions from 1% to 80%. The missing-data handling methods were each applied to transform the simulated data sets into complete data sets, and coefficients in a linear LWI were estimated for each complete data set. For each proportion for each method, a bias measure was calculated by comparing the estimated coefficient values with values estimated from the reference data set. Results Methods performed differently depending on the proportion of missing data. For 1% to 30% proportions, low-rank approximation–based imputation, predictive-mean matching–based multiple imputation, and expectation maximization–based multiple imputation were superior. For 31% to 60% proportions, low-rank approximation–based imputation and predictive-mean matching–based multiple imputation performed best. For over 60% proportions, only low-rank approximation–based imputation performed acceptably. Conclusions Low-rank approximation–based imputation was the best of the 6 data-handling methods regardless of the proportion of missing data. This superiority is generalizable to other panel data sets comprising health behavior lifelogs given their verified low-rank nature, for which low-rank approximation–based imputation is known to perform effectively. This result will guide missing-data handling in reducing coefficient biases in new development cases of linear LWIs with panel data.

Download Full-text

Structured Low-Rank Approximation with Missing Data

SIAM Journal on Matrix Analysis and Applications ◽

10.1137/120883050 ◽

2013 ◽

Vol 34 (2) ◽

pp. 814-830 ◽

Cited By ~ 40

Author(s):

Ivan Markovsky ◽

Konstantin Usevich

Keyword(s):

Missing Data ◽

Low Rank ◽

Low Rank Approximation ◽

Rank Approximation ◽

Structured Low Rank Approximation

Download Full-text

Tensor Recovery via Nonconvex Low-Rank Approximation

2020 28th European Signal Processing Conference (EUSIPCO) ◽

10.23919/eusipco47968.2020.9287404 ◽

2021 ◽

Author(s):

Lin Chen ◽

Xue Jiang ◽

Xingzhao Liu ◽

Zhixin Zhou

Keyword(s):

Low Rank ◽

Low Rank Approximation ◽

Tensor Recovery ◽

Rank Approximation

Download Full-text

Improved MR image denoising via low- rank approximation and Laplacian-of-Gaussian edge detector

IET Image Processing ◽

10.1049/iet-ipr.2019.1648 ◽

2020 ◽

Vol 14 (12) ◽

pp. 2791-2798

Author(s):

Xiaoqun Qiu ◽

Zhen Chen ◽

Saifullah Adnan ◽

Hongwei He

Keyword(s):

Image Denoising ◽

Low Rank ◽

Edge Detector ◽

Low Rank Approximation ◽

Mr Image ◽

Rank Approximation

Download Full-text

Robust topology optimization with low rank approximation using artificial neural networks

Computational Mechanics ◽

10.1007/s00466-021-02069-3 ◽

2021 ◽

Author(s):

Vahid Keshavarzzadeh ◽

Robert M. Kirby ◽

Akil Narayan

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Topology Optimization ◽

Low Rank ◽

Low Rank Approximation ◽

Robust Topology Optimization ◽

Rank Approximation ◽

Artificial Neural

Download Full-text

Low-Complexity DNN-Based End-to-End Automatic Speech Recognition using Low-Rank Approximation

2020 International SoC Design Conference (ISOCC) ◽

10.1109/isocc50952.2020.9332970 ◽

2020 ◽

Author(s):

Jongmin Park ◽

Youngjoo Lee

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Low Complexity ◽

Low Rank ◽

Low Rank Approximation ◽

Rank Approximation ◽

End To End

Download Full-text

Fast Joint Multiband Reconstruction From Wideband Images Based on Low-Rank Approximation

IEEE Transactions on Computational Imaging ◽

10.1109/tci.2020.2998170 ◽

2020 ◽

Vol 6 ◽

pp. 922-933

Author(s):

M. Amine Hadj-Youcef ◽

Francois Orieux ◽

Alain Abergel ◽

Aurelia Fraysse

Keyword(s):

Low Rank ◽

Low Rank Approximation ◽

Rank Approximation

Download Full-text

Low-Rank Approximation of Difference between Correlation Matrices Using Inner Product

Applied Sciences ◽

10.3390/app11104582 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4582

Author(s):

Kensuke Tanioka ◽

Satoru Hiwa

Keyword(s):

Dimensional Reduction ◽

Low Rank ◽

Inner Product ◽

Low Rank Approximation ◽

Correlation Matrices ◽

Rank Matrix ◽

Clustering Approach ◽

Rank Approximation ◽

The Difference ◽

Low Rank Matrix

In the domain of functional magnetic resonance imaging (fMRI) data analysis, given two correlation matrices between regions of interest (ROIs) for the same subject, it is important to reveal relatively large differences to ensure accurate interpretation. However, clustering results based only on differences tend to be unsatisfactory and interpreting the features tends to be difficult because the differences likely suffer from noise. Therefore, to overcome these problems, we propose a new approach for dimensional reduction clustering. Methods: Our proposed dimensional reduction clustering approach consists of low-rank approximation and a clustering algorithm. The low-rank matrix, which reflects the difference, is estimated from the inner product of the difference matrix, not only from the difference. In addition, the low-rank matrix is calculated based on the majorize–minimization (MM) algorithm such that the difference is bounded within the range −1 to 1. For the clustering process, ordinal k-means is applied to the estimated low-rank matrix, which emphasizes the clustering structure. Results: Numerical simulations show that, compared with other approaches that are based only on differences, the proposed method provides superior performance in recovering the true clustering structure. Moreover, as demonstrated through a real-data example of brain activity measured via fMRI during the performance of a working memory task, the proposed method can visually provide interpretable community structures consisting of well-known brain functional networks, which can be associated with the human working memory system. Conclusions: The proposed dimensional reduction clustering approach is a very useful tool for revealing and interpreting the differences between correlation matrices, even when the true differences tend to be relatively small.

Download Full-text

$$\ell _p$$ ℓ p Regularized low-rank approximation via iterative reweighted singular value minimization

Computational Optimization and Applications ◽

10.1007/s10589-017-9933-6 ◽

2017 ◽

Vol 68 (3) ◽

pp. 619-642 ◽

Cited By ~ 6

Author(s):

Zhaosong Lu ◽

Yong Zhang ◽

Jian Lu

Keyword(s):

Singular Value ◽

Low Rank ◽

Low Rank Approximation ◽

Rank Approximation

Download Full-text

hIPPYlib

ACM Transactions on Mathematical Software ◽

10.1145/3428447 ◽

2021 ◽

Vol 47 (2) ◽

pp. 1-34

Author(s):

Umberto Villa ◽

Noemi Petra ◽

Omar Ghattas

Keyword(s):

Inverse Problems ◽

Large Scale ◽

Low Rank ◽

Scalable Algorithms ◽

Low Rank Approximation ◽

Dimensional Parameter ◽

Infinite Dimensional ◽

Bayesian Inverse Problems ◽

Rank Approximation ◽

Extensible Software

We present an extensible software framework, hIPPYlib, for solution of large-scale deterministic and Bayesian inverse problems governed by partial differential equations (PDEs) with (possibly) infinite-dimensional parameter fields (which are high-dimensional after discretization). hIPPYlib overcomes the prohibitively expensive nature of Bayesian inversion for this class of problems by implementing state-of-the-art scalable algorithms for PDE-based inverse problems that exploit the structure of the underlying operators, notably the Hessian of the log-posterior. The key property of the algorithms implemented in hIPPYlib is that the solution of the inverse problem is computed at a cost, measured in linearized forward PDE solves, that is independent of the parameter dimension. The mean of the posterior is approximated by the MAP point, which is found by minimizing the negative log-posterior with an inexact matrix-free Newton-CG method. The posterior covariance is approximated by the inverse of the Hessian of the negative log posterior evaluated at the MAP point. The construction of the posterior covariance is made tractable by invoking a low-rank approximation of the Hessian of the log-likelihood. Scalable tools for sample generation are also discussed. hIPPYlib makes all of these advanced algorithms easily accessible to domain scientists and provides an environment that expedites the development of new algorithms.

Download Full-text