scholarly journals Statistical Analysis Based on Lake Michigan Fish Acoustic Data Using LASSO Method

Author(s):  
Liming Xie

LASSO method is one of the most popular and more extensive regressions. It has been applied to many fields. However, it is rare seen to research with complicated big data in biology. This paper is to apply LASSO method to Lake Michigan Fish acoustic data. The main techniques include: Elastic Net selection, which tests validation from the average square error (ASE) to predict the error for the model by computing separately for each of these subsets; defaulting group LASSO to test multiple parameters by splitting a couple constituent parameters, such as successive intervals, multiple continuous depth layers, to estimate the Schwarz Bayesian information criterion (SBC) to find the lowest value for the model; The adaptive LASSO selection, which is applied to each of the parameters in constructing the LASSO constraint for weights, that is, the response y has mean zero and the regressor x are scaled to have mean zero and common standard deviation. The empirical results show that the fish density (Y) has strong relationships with area backscattering coefficient (PRC_ABC), secondly, significant interactions with PRC_ABC and Exclude below line depth mean), among PRC_ABC, fish density in the intervals and layers of acoustic survey transect of Lake Michigan.

2020 ◽  
Vol 79 (Suppl 1) ◽  
pp. 1252.2-1253
Author(s):  
R. Garofoli ◽  
M. Resche-Rigon ◽  
M. Dougados ◽  
D. Van der Heijde ◽  
C. Roux ◽  
...  

Background:Axial spondyloarthritis (axSpA) is a chronic rheumatic disease that encompasses various clinical presentations: inflammatory chronic back pain, peripheral manifestations and extra-articular manifestations. The current nomenclature divides axSpA in radiographic (in the presence of radiographic sacroiliitis) and non-radiographic (in the absence of radiographic sacroiliitis, with or without MRI sacroiliitis. Given that the functional burden of the disease appears to be greater in patients with radiographic forms, it seems crucial to be able to predict which patients will be more likely to develop structural damage over time. Predictive factors for radiographic progression in axSpA have been identified through use of traditional statistical models like logistic regression. However, these models present some limitations. In order to overcome these limitations and to improve the predictive performance, machine learning (ML) methods have been developed.Objectives:To compare ML models to traditional models to predict radiographic progression in patients with early axSpA.Methods:Study design: prospective French multicentric cohort study (DESIR cohort) with 5years of follow-up. Patients: all patients included in the cohort, i.e. 708 patients with inflammatory back pain for >3 months but <3 years, highly suggestive of axSpA. Data on the first 5 years of follow-up was used. Statistical analyses: radiographic progression was defined as progression either at the spine (increase of at least 1 point per 2 years of mSASSS scores) or at the sacroiliac joint (worsening of at least one grade of the mNY score between 2 visits). Traditional modelling: we first performed a bivariate analysis between our outcome (radiographic progression) and explanatory variables at baseline to select the variables to be included in our models and then built a logistic regression model (M1). Variable selection for traditional models was performed with 2 different methods: stepwise selection based on Akaike Information Criterion (stepAIC) method (M2), and the Least Absolute Shrinkage and Selection Operator (LASSO) method (M3). We also performed sensitivity analysis on all patients with manual backward method (M4) after multiple imputation of missing data. Machine learning modelling: using the “SuperLearner” package on R, we modelled radiographic progression with stepAIC, LASSO, random forest, Discrete Bayesian Additive Regression Trees Samplers (DBARTS), Generalized Additive Models (GAM), multivariate adaptive polynomial spline regression (polymars), Recursive Partitioning And Regression Trees (RPART) and Super Learner. Finally, the accuracy of traditional and ML models was compared based on their 10-foldcross-validated AUC (cv-AUC).Results:10-fold cv-AUC for traditional models were 0.79 and 0.78 for M2 and M3, respectively. The 3 best models in the ML algorithm were the GAM, the DBARTS and the Super Learner models, with 10-fold cv-AUC of: 0.77, 0.76 and 0.74, respectively (Table 1).Table 1.Comparison of 10-fold cross-validated AUC between best traditional and machine learning models.Best modelsCross-validated AUCTraditional models M2 (step AIC method)0.79 M3 (LASSO method)0.78Machine learning approach SL Discrete Bayesian Additive Regression Trees Samplers (DBARTS)0.76 SL Generalized Additive Models (GAM)0.77 Super Learner0.74AUC: Area Under the Curve; AIC: Akaike Information Criterion; LASSO: Least Absolute Shrinkage and Selection Operator; SL: SuperLearner. N = 295.Conclusion:Traditional models predicted better radiographic progression than ML models in this early axSpA population. Further ML algorithms image-based or with other artificial intelligence methods (e.g. deep learning) might perform better than traditional models in this setting.Acknowledgments:Thanks to the French National Society of Rheumatology and the DESIR cohort.Disclosure of Interests:Romain Garofoli: None declared, Matthieu resche-rigon: None declared, Maxime Dougados Grant/research support from: AbbVie, Eli Lilly, Merck, Novartis, Pfizer and UCB Pharma, Consultant of: AbbVie, Eli Lilly, Merck, Novartis, Pfizer and UCB Pharma, Speakers bureau: AbbVie, Eli Lilly, Merck, Novartis, Pfizer and UCB Pharma, Désirée van der Heijde Consultant of: AbbVie, Amgen, Astellas, AstraZeneca, BMS, Boehringer Ingelheim, Celgene, Cyxone, Daiichi, Eisai, Eli-Lilly, Galapagos, Gilead Sciences, Inc., Glaxo-Smith-Kline, Janssen, Merck, Novartis, Pfizer, Regeneron, Roche, Sanofi, Takeda, UCB Pharma; Director of Imaging Rheumatology BV, Christian Roux: None declared, Anna Moltó Grant/research support from: Pfizer, UCB, Consultant of: Abbvie, BMS, MSD, Novartis, Pfizer, UCB


2015 ◽  
Vol 32 (1) ◽  
pp. 243-259 ◽  
Author(s):  
Anders Bredahl Kock

We show that the adaptive Lasso is oracle efficient in stationary and nonstationary autoregressions. This means that it estimates parameters consistently, selects the correct sparsity pattern, and estimates the coefficients belonging to the relevant variables at the same asymptotic efficiency as if only these had been included in the model from the outset. In particular, this implies that it is able to discriminate between stationary and nonstationary autoregressions and it thereby constitutes an addition to the set of unit root tests. Next, and important in practice, we show that choosing the tuning parameter by Bayesian Information Criterion (BIC) results in consistent model selection.However, it is also shown that the adaptive Lasso has no power against shrinking alternatives of the form c/T if it is tuned to perform consistent model selection. We show that if the adaptive Lasso is tuned to perform conservative model selection it has power even against shrinking alternatives of this form and compare it to the plain Lasso.


2018 ◽  
Vol 61 (4) ◽  
pp. 451-458
Author(s):  
Suna Akkol

Abstract. The least absolute selection and shrinkage operator (LASSO) and adaptive LASSO methods have become a popular model in the last decade, especially for data with a multicollinearity problem. This study was conducted to estimate the live weight (LW) of Hair goats from biometric measurements and to select variables in order to reduce the model complexity by using penalized regression methods: LASSO and adaptive LASSO for γ=0.5 and γ=1. The data were obtained from 132 adult goats in Honaz district of Denizli province. Age, gender, forehead width, ear length, head length, chest width, rump height, withers height, back height, chest depth, chest girth, and body length were used as explanatory variables. The adjusted coefficient of determination (Radj2), root mean square error (RMSE), Akaike's information criterion (AIC), Schwarz Bayesian criterion (SBC), and average square error (ASE) were used in order to compare the effectiveness of the methods. It was concluded that adaptive LASSO (γ=1) estimated the LW with the highest accuracy for both male (Radj2=0.9048; RMSE = 3.6250; AIC = 79.2974; SBC = 65.2633; ASE = 7.8843) and female (Radj2=0.7668; RMSE = 4.4069; AIC = 392.5405; SBC = 308.9888; ASE = 18.2193) Hair goats when all the criteria were considered.


1991 ◽  
Vol 48 (5) ◽  
pp. 894-908 ◽  
Author(s):  
Stephen B. Brandt ◽  
Doran M. Mason ◽  
E. Vincent Patrick ◽  
Ray L. Argyle ◽  
L. Wells ◽  
...  

Based on acoustic data taken at night and vertically stratified by bottom depth (3–110 m only), the total number (± 95% CI) of pelagic fishes in Lake Michigan was 43.4 ± 10.1 × 109 or 226.0 ± 55.2 kt in spring (mean density 0.7–3.8 fish∙m−2 or 1.6–12.8 g∙m−2) and 115.8 ± 18.3 × 109 or 313.2 ± 74.3 kt in late summer, 1987 (mean density 1.1–7.9 fish∙m−2 or 3.0–13.2 g∙m−2); approximately 30% of this increase in numbers (35% of biomass) occurred within Green Bay. Abundance estimates from horizontally stratified (by water column depth) data were within 9–11% of vertically stratified estimates during spring but over 20% higher during summer. By extrapolation to all water depths, we estimated total pelagic biomass as 274.6 kt for spring and 410.8 kt for summer. During both seasons, smaller fishes were nearer to the surface and nearer shore than larger individuals, and acoustic measures of size approximated the sizes of fishes caught in trawls. Bioenergetic model simulations suggest that 60% of the available production of alewife (Alosa pseudoharengus) was either consumed by stocked salmonines (52.9%) or commercially harvested (7.1%) in 1987. Underwater acoustics proved a valuable tool for lakewide assessments of fish abundances in the Great Lakes.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Émeline Courtois ◽  
Pascale Tubert-Bitter ◽  
Ismaïl Ahmed

Abstract Background Adverse effects of drugs are often identified after market introduction. Post-marketing pharmacovigilance aims to detect them as early as possible and relies on spontaneous reporting systems collecting suspicious cases. Signal detection tools have been developed to mine these large databases and counts of reports are analysed with disproportionality methods. To address disproportionality method biases, recent methods apply to individual observations taking into account all exposures for the same patient. In particular, the logistic lasso provides an efficient variable selection framework, yet the choice of the regularization parameter is a challenging issue and the lasso variable selection may give inconsistent results. Methods We propose a new signal detection methodology based on the adaptive lasso. We derived two new adaptive weights from (i) a lasso regression using the Bayesian Information Criterion (BIC), and (ii) the class-imbalanced subsampling lasso (CISL), an extension of stability selection. The BIC is used in the adaptive lasso stage for variable selection. We performed an extensive simulation study and an application to real data, where we compared our methods to the existing adaptive lasso, and recent detection approaches based on lasso regression or propensity scores in high dimension. For both studies, we evaluate the methods in terms of false discoveries and sensitivity. Results In the simulations and the application, both proposed adaptive weights show equivalent or better performances than the other competitors, with an advantage for the CISL-based adaptive weights. CISL and lasso regression using BIC are solid alternatives. Conclusion Our proposed adaptive lasso is an appealing methodology for signal detection in pharmacovigilance. Although we cannot rely on test theory, our approaches show a low and stable False Discovery Rate in all simulation settings. All methods evaluated in this work are implemented in the adapt4pv R package.


2016 ◽  
Vol 124 (1) ◽  
pp. 120-125 ◽  
Author(s):  
Lingzhen Dai ◽  
Petros Koutrakis ◽  
Brent A. Coull ◽  
David Sparrow ◽  
Pantel S. Vokonas ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document