scholarly journals Screening the Influence of Biomarkers for Metabolic Syndrome in Occupational Population Based on the Lasso Algorithm

2021 ◽  
Vol 9 ◽  
Author(s):  
Qiao-Ying Xie ◽  
Ming-Wei Wang ◽  
Zu-Ying Hu ◽  
Cheng-Jian Cao ◽  
Cong Wang ◽  
...  

Aim: Metabolic syndrome (MS) screening is essential for the early detection of the occupational population. This study aimed to screen out biomarkers related to MS and establish a risk assessment and prediction model for the routine physical examination of an occupational population.Methods: The least absolute shrinkage and selection operator (Lasso) regression algorithm of machine learning was used to screen biomarkers related to MS. Then, the accuracy of the logistic regression model was further verified based on the Lasso regression algorithm. The areas under the receiving operating characteristic curves were used to evaluate the selection accuracy of biomarkers in identifying MS subjects with risk. The screened biomarkers were used to establish a logistic regression model and calculate the odds ratio (OR) of the corresponding biomarkers. A nomogram risk prediction model was established based on the selected biomarkers, and the consistency index (C-index) and calibration curve were derived.Results: A total of 2,844 occupational workers were included, and 10 biomarkers related to MS were screened. The number of non-MS cases was 2,189 and that of MS was 655. The area under the curve (AUC) value for non-Lasso and Lasso logistic regression was 0.652 and 0.907, respectively. The established risk assessment model revealed that the main risk biomarkers were absolute basophil count (OR: 3.38, CI:1.05–6.85), platelet packed volume (OR: 2.63, CI:2.31–3.79), leukocyte count (OR: 2.01, CI:1.79–2.19), red blood cell count (OR: 1.99, CI:1.80–2.71), and alanine aminotransferase level (OR: 1.53, CI:1.12–1.98). Furthermore, favorable results with C-indexes (0.840) and calibration curves closer to ideal curves indicated the accurate predictive ability of this nomogram.Conclusions: The risk assessment model based on the Lasso logistic regression algorithm helped identify MS with high accuracy in physically examining an occupational population.

2020 ◽  
Author(s):  
Qiao-Ying Xie ◽  
Ming-Wei Wang ◽  
Zu-Ying Hu ◽  
Yan-Ming Chu ◽  
Cheng-Jian Cao ◽  
...  

Abstract Background: Metabolic syndrome (MS) screening is important for the early detection of occupational population. This study aimed to screen out biomarkers related to MS and establish a risk assessment and prediction model for the routine physical examination of an occupational population.Methods: The least absolute shrinkage and selection operator (Lasso) regression algorithm of machine learning was used to screen biomarkers related to MS. Then, the accuracy of the logistic regression model was further verified based on the Lasso regression algorithm. Finally, the screened biomarkers were used to establish a logistic regression model and calculate the odds ratio (OR) of the corresponding biomarkers. Results: A total of 2844 occupational workers were included, and 10 biomarkers related to MS were screened. The area under the curve (AUC) value for non-Lasso and Lasso regression was 0.652 and 0.907, respectively. The established risk assessment model revealed that the main risk factors were basophil absolute count (OR: 3.38), platelet packed volume (OR: 2.63), leukocyte count (OR: 2.01), red blood cell count (OR: 1.99), and alanine aminotransferase level (OR: 1.53). Conclusion: The risk assessment model based on the Lasso regression algorithm helped identify Metabolic syndrome with high accuracy in physically examining an occupational population.


2019 ◽  
Vol 18 ◽  
pp. 153303381984663 ◽  
Author(s):  
Chang-Liang Luo ◽  
Yuan Rong ◽  
Hao Chen ◽  
Wu-Wen Zhang ◽  
Long Wu ◽  
...  

α-Fetoprotein is commonly used in the diagnosis of hepatocellular carcinoma. However, the diagnostic significance of α-fetoprotein has been questioned because a number of patients with hepatocellular carcinoma are α-fetoprotein negative. It is therefore necessary to develop novel noninvasive techniques for the early diagnosis of hepatocellular carcinoma, particularly when α-fetoprotein level is low or negative. The current study aimed to evaluate the diagnostic efficiency of hematological parameters to determine which can act as surrogate markers in α-fetoprotein–negative hepatocellular carcinoma. Therefore, a retrospective study was conducted on a training set recruited from Zhongnan Hospital of Wuhan University—including 171 α-fetoprotein–negative patients with hepatocellular carcinoma and 102 healthy individuals. The results show that mean values of mean platelet volume, red blood cell distribution width, mean platelet volume–PC ratio, neutrophils–lymphocytes ratio, and platelet count–lymphocytes ratio were significantly higher in patients with hepatocellular carcinoma in comparison to the healthy individuals. Most of these parameters showed moderate area under the curve in α-fetoprotein–negative patients with hepatocellular carcinoma, but their sensitivities or specificities were not satisfactory enough. So, we built a logistic regression model combining multiple hematological parameters. This model presented better diagnostic efficiency with area under the curve of 0.922, sensitivity of 83.0%, and specificity of 93.1%. In addition, the 4 validation sets from different hospitals were used to validate the model. They all showed good area under the curve with satisfactory sensitivities or specificities. These data indicate that the logistic regression model combining multiple hematological parameters has better diagnostic efficiency, and they might be helpful for the early diagnosis for α-fetoprotein–negative hepatocellular carcinoma.


2020 ◽  
Vol 71 (1) ◽  
pp. 299-305
Author(s):  
Fernando González-Mohíno ◽  
Jesús Santos del Cerro ◽  
Andrew Renfree ◽  
Inmaculada Yustres ◽  
José Mª González-Ravé

AbstractThe purpose of this analysis was to quantify the probability of achieving a top-3 finishing position during 800-m races at a global championship, based on dispersion of the runners during the first and second laps and the difference in split times between laps. Overall race times, intermediate and finishing positions and 400 m split times were obtained for 43 races over 800 m (21 men’s and 22 women’s) comprising 334 individual performances, 128 of which resulted in higher positions (top-3) and 206 the remaining positions. Intermediate and final positions along with times, the dispersion of the runners during the intermediate and final splits (SS1 and SS2), as well as differences between the two split times (Dsplits) were calculated. A logistic regression model was created to determine the influence of these factors in achieving a top-3 position. The final position was most strongly associated with SS2, but also with SS1 and Dsplits. The Global Significance Test showed that the model was significant (p < 0.001) with a predictive ability of 91.08% and an area under the curve coefficient of 0.9598. The values of sensitivity and specificity were 96.8% and 82.5%, respectively. The model demonstrated that SS1, SS2 and Dplits explained the finishing position in the 800-m event in global championships.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Jianjiang Qi ◽  
Di He ◽  
Dagan Yang ◽  
Mengyan Wang ◽  
Wenjun Ma ◽  
...  

Abstract Background The severity of COVID-19 associates with the clinical decision making and the prognosis of COVID-19 patients, therefore, early identification of patients who are likely to develop severe or critical COVID-19 is critical in clinical practice. The aim of this study was to screen severity-associated markers and construct an assessment model for predicting the severity of COVID-19. Methods 172 confirmed COVID-19 patients were enrolled from two designated hospitals in Hangzhou, China. Ordinal logistic regression was used to screen severity-associated markers. Least Absolute Shrinkage and Selection Operator (LASSO) regression was performed for further feature selection. Assessment models were constructed using logistic regression, ridge regression, support vector machine and random forest. The area under the receiver operator characteristic curve (AUROC) was used to evaluate the performance of different models. Internal validation was performed by using bootstrap with 500 re-sampling in the training set, and external validation was performed in the validation set for the four models, respectively. Results Age, comorbidity, fever, and 18 laboratory markers were associated with the severity of COVID-19 (all P values < 0.05). By LASSO regression, eight markers were included for the assessment model construction. The ridge regression model had the best performance with AUROCs of 0.930 (95% CI, 0.914–0.943) and 0.827 (95% CI, 0.716–0.921) in the internal and external validations, respectively. A risk score, established based on the ridge regression model, had good discrimination in all patients with an AUROC of 0.897 (95% CI 0.845–0.940), and a well-fitted calibration curve. Using the optimal cutoff value of 71, the sensitivity and specificity were 87.1% and 78.1%, respectively. A web-based assessment system was developed based on the risk score. Conclusions Eight clinical markers of lactate dehydrogenase, C-reactive protein, albumin, comorbidity, electrolyte disturbance, coagulation function, eosinophil and lymphocyte counts were associated with the severity of COVID-19. An assessment model constructed with these eight markers would help the clinician to evaluate the likelihood of developing severity of COVID-19 at admission and early take measures on clinical treatment.


2020 ◽  
Vol 35 (6) ◽  
pp. 933-933
Author(s):  
Rolin S ◽  
Kitchen Andren K ◽  
Mullen C ◽  
Kurniadi N ◽  
Davis J

Abstract Objective Previous research in a Veterans Affairs sample proposed using single items on the Neurobehavioral Symptom Inventory (NSI) to screen for anxiety (item 19) and depression (item 20). This study examined the approach in an outpatient physical medicine and rehabilitation sample. Method Participants (N = 84) underwent outpatient neuropsychological evaluation using the NSI, BDI-II, GAD-7, MMPI-2-RF, and Memory Complaints Inventory (MCI) among other measures. Anxiety and depression were psychometrically determined via cutoffs on the GAD-7 (&gt;4) and MMPI-2-RF ANX (&gt;64 T), and BDI-II (&gt;13) and MMPI-2-RF RC2 (&gt;64 T), respectively. Analyses included receiver operating characteristic analysis (ROC) and logistic regression. Logistic regression models used dichotomous anxiety and depression as outcomes and relevant NSI items and MCI average score as predictors. Results ROC analysis using NSI items to classify cases showed area under the curve (AUC) values of .77 for anxiety and .85 for depression. The logistic regression model predicting anxiety correctly classified 80% of cases with AUC of .86. The logistic regression model predicting depression correctly classified 79% of cases with AUC of .88. Conclusion Findings support the utility of NSI anxiety and depression items as screening measures in a rehabilitation population. Consideration of symptom validity via the MCI improved classification accuracy of the regression models. The approach may be useful in other clinical settings for quick assessment of psychological issues warranting further evaluation.


2019 ◽  
Vol 48 (4) ◽  
pp. 030006051989317
Author(s):  
Xindan Wang ◽  
Jing Huang ◽  
Zhao Bingbing ◽  
Shape Li ◽  
Li Li

Objective This study aimed to investigate a suitable risk assessment model to predict deep vein thrombosis (DVT) in patients with gynecological cancer. Methods Data from 212 patients with gynecological cancer in the Affiliated Tumor Hospital of Guangxi Medical University were retrospectively analyzed. Patients were risk-stratified with three different risk assessment models individually, including the Caprini model, Wells DVT model, and Khorana model. Results The difference in risk level evaluated by the Caprini model was not different between the DVT and control groups. However, the DVT group had a significantly higher risk level than the control group with the Wells DVT or Khorana model. The Wells DVT model was more effective for stratifying patients in the DVT group into the higher risk level and for stratifying those in the control group into the lower risk level. Receiver operating curve analysis showed that the area under the curve of the Wells DVT, Khorana, and Caprini models was 0.995 ± 0.002, 0.642 ± 0.038, and 0.567 ± 0.039, respectively. Conclusion The Wells DVT model is the most suitable risk assessment model for predicting DVT. Clinicians could also combine the Caprini and Wells DVT models to effectively identify high-risk patients and eliminate patients without DVT.


Sign in / Sign up

Export Citation Format

Share Document