Change-point estimation in high dimensional linear regression models via sparse group Lasso

Abstract Background Statistical analyses of biological problems in life sciences often lead to high-dimensional linear models. To solve the corresponding system of equations, penalization approaches are often the methods of choice. They are especially useful in case of multicollinearity, which appears if the number of explanatory variables exceeds the number of observations or for some biological reason. Then, the model goodness of fit is penalized by some suitable function of interest. Prominent examples are the lasso, group lasso and sparse-group lasso. Here, we offer a fast and numerically cheap implementation of these operators via proximal gradient descent. The grid search for the penalty parameter is realized by warm starts. The step size between consecutive iterations is determined with backtracking line search. Finally, seagull -the R package presented here- produces complete regularization paths. Results Publicly available high-dimensional methylation data are used to compare seagull to the established R package SGL. The results of both packages enabled a precise prediction of biological age from DNA methylation status. But even though the results of seagull and SGL were very similar (R2 > 0.99), seagull computed the solution in a fraction of the time needed by SGL. Additionally, seagull enables the incorporation of weights for each penalized feature. Conclusions The following operators for linear regression models are available in seagull: lasso, group lasso, sparse-group lasso and Integrative LASSO with Penalty Factors (IPF-lasso). Thus, seagull is a convenient envelope of lasso variants.

Download Full-text

Robust change point estimation in two-phase linear regression models: An application to metabolic pathway data

Journal of Computational and Applied Mathematics ◽

10.1016/j.cam.2019.06.020 ◽

2020 ◽

Vol 363 ◽

pp. 337-349

Author(s):

Sukru Acitas ◽

Birdal Senoglu

Keyword(s):

Linear Regression ◽

Metabolic Pathway ◽

Change Point ◽

Regression Models ◽

Point Estimation ◽

Linear Regression Models ◽

Two Phase ◽

Change Point Estimation ◽

Pathway Data

Download Full-text

Multiple Change-Points Estimation in Linear Regression Models via Sparse Group Lasso

IEEE Transactions on Signal Processing ◽

10.1109/tsp.2015.2411220 ◽

2015 ◽

Vol 63 (9) ◽

pp. 2209-2224 ◽

Cited By ~ 18

Author(s):

Bingwen Zhang ◽

Jun Geng ◽

Lifeng Lai

Keyword(s):

Linear Regression ◽

Regression Models ◽

Group Lasso ◽

Change Points ◽

Linear Regression Models ◽

Sparse Group Lasso

Download Full-text

Post-l1-penalized estimators in high-dimensional linear regression models

10.1920/wp.cem.2010.1310 ◽

2010 ◽

Cited By ~ 1

Author(s):

Victor Chernozhukov ◽

Alexandre Belloni

Keyword(s):

Linear Regression ◽

Regression Models ◽

High Dimensional ◽

Linear Regression Models

Download Full-text

Change point estimation in high dimensional Markov random-field models

Journal of the Royal Statistical Society Series B (Statistical Methodology) ◽

10.1111/rssb.12205 ◽

2016 ◽

Vol 79 (4) ◽

pp. 1187-1206 ◽

Cited By ~ 13

Author(s):

Sandipan Roy ◽

Yves Atchadé ◽

George Michailidis

Keyword(s):

Random Field ◽

Markov Random Field ◽

Change Point ◽

Point Estimation ◽

High Dimensional ◽

Change Point Estimation ◽

Markov Random

Download Full-text

Sequential Model Averaging for High Dimensional Linear Regression Models

SSRN Electronic Journal ◽

10.2139/ssrn.2896533 ◽

2017 ◽

Author(s):

Wei Lan ◽

Yingying Ma ◽

Junlong Zhao ◽

Hansheng Wang ◽

Chih-Ling Tsai

Keyword(s):

Linear Regression ◽

Regression Models ◽

Model Averaging ◽

High Dimensional ◽

Linear Regression Models ◽

Sequential Model

Download Full-text

An Improved Forward Regression Variable Selection Algorithm for High-Dimensional Linear Regression Models

IEEE Access ◽

10.1109/access.2020.3009377 ◽

2020 ◽

Vol 8 ◽

pp. 129032-129042

Author(s):

Yanxi Xie ◽

Yuewen Li ◽

Zhijie Xia ◽

Ruixia Yan

Keyword(s):

Linear Regression ◽

Variable Selection ◽

Regression Models ◽

High Dimensional ◽

Linear Regression Models ◽

Selection Algorithm

Download Full-text

BAYESIAN IDENTIFICATION OF OUTLIERS AND CHANGE-POINTS IN MEASUREMENT ERROR MODELS

Advances in Complex Systems ◽

10.1142/s0219525905000567 ◽

2005 ◽

Vol 08 (04) ◽

pp. 433-449 ◽

Cited By ~ 8

Author(s):

FERNANDO A. QUINTANA ◽

PILAR L. IGLESIAS ◽

HELENO BOLFARINE

Keyword(s):

Linear Regression ◽

Outlier Detection ◽

Change Point ◽

Regression Models ◽

Point Problem ◽

Linear Regression Models ◽

Outlier Identification ◽

Change Point Problem ◽

Error In Variables ◽

The Relationship

The problem of outlier and change-point identification has received considerable attention in traditional linear regression models from both, classical and Bayesian standpoints. In contrast, for the case of regression models with measurement errors, also known as error-in-variables models, the corresponding literature is scarce and largely focused on classical solutions for the normal case. The main object of this paper is to propose clustering algorithms for outlier detection and change-point identification in scale mixture of error-in-variables models. We propose an approach based on product partition models (PPMs) which allows one to study clustering for the models under consideration. This includes the change-point problem and outlier detection as special cases. The outlier identification problem is approached by adapting the algorithms developed by Quintana and Iglesias [32] for simple linear regression models. A special algorithm is developed for the change-point problem which can be applied in a more general setup. The methods are illustrated with two applications: (i) outlier identification in a problem involving the relationship between two methods for measuring serum kanamycin in blood samples from babies, and (ii) change-point identification in the relationship between the monthly dollar volume of sales on the Boston Stock Exchange and the combined monthly dollar volumes for the New York and American Stock Exchanges.

Download Full-text