Multiple Linear Regression in Predicting Motor Assessment Scale of Stroke Patients

Author(s):  
Sulaiman Mazlan ◽  
◽  
Hisyam Abdul Rahman ◽  
Babul Salam Ksm Kader Ibrahim ◽  
Yeong Che Fai ◽  
...  

The Multiple Linear Regression (MLR) is a predictive model that was commonly used to predict the clinical score of stroke patients. However, the performance of the predictive model slightly depends on the method of feature selection on the data as input predictor to the model. Therefore, appropriate feature selection method needs to be investigated in order to give an optimum performance of the prediction. This paper aims (i) to develop predictive model for Motor Assessment Scale (MAS) prediction of stroke patients, (ii) to establish relationship between kinematic variables and MAS score using a predictive model, (iii) to evaluate the prediction performance of a predictive model based on root mean squared error (RMSE) and coefficient of determination R2. Three types of feature selection methods involve in this study which are the combination of all kinematic variables, the combination of the best four or less kinematic variables, and the combination of kinematic variables based on p < 0.05. The prediction performance of MLR model between two assessment devices (iRest and ReHAD) has been compared. As the result, MLR model for ReHAD with the combination of kinematic variables that has p < 0.05 as input predictor has the best performance with Draw I (RMSEte = 1.9228, R2 = 0.8623), Draw Diamond (RMSEte = 2.6136, R2 = 0.7477), and Draw Circle (RMSEte = 2.1756, R2 = 0.8268). These finding suggest that the relationship between kinematic variables and MAS score of stoke patients is strong, and the MLR model with feature selection of kinematic variables that has p < 0.05 is able to predict the MAS score of stroke patients using the kinematic variables extracted from the assessment device.

2002 ◽  
Vol 12 (2) ◽  
pp. 250-256 ◽  
Author(s):  
Hudson Minshew ◽  
John Selker ◽  
Delbert Hemphill ◽  
Richard P. Dick

Predicting leaching of residual soil nitrate-nitrogen (NO3-N) in wet climates is important for reducing risks of groundwater contamination and conserving soil N. The goal of this research was to determine the potential to use easily measurable or readily available soilclimatic-plant data that could be put into simple computer models and used to predict NO3 leaching under various management systems. Two computer programs were compared for their potential to predict monthly NO3-N leaching losses in western Oregon vegetable systems with or without cover crops. The models were a statistical multiple linear regression (MLR) model and the commercially available Nitrate Leaching and Economical Analysis Package model (NLEAP 1.13). The best MLR model found using stepwise regression to predict annual leachate NO3-N had four independent variables (log transformed fall soil NO3-N, leachate volume, summer crop N uptake, and N fertilizer rate) (P < 0.001, R2 = 0.57). Comparisons were made between NLEAP and field data for mass of NO3-N leached between the months of September and May from 1992 to 1997. Predictions with NLEAP showed greater correlation to observed data during high-rainfall years compared to dry or averagerainfall years. The model was found to be sensitive to yield estimates, but vegetation management choices were limiting for vegetable crops and for systems that included a cover crop.


2017 ◽  
Vol 37 (1) ◽  
pp. 109 ◽  
Author(s):  
Yohanita Maulina Akbar ◽  
Dr. Rudiati Evi Masithoh ◽  
Nafis Khuriyati

In this research, Multiple Linear Regression (MLR) model was used to predict Brix and pH of banana based on RGB and Lab color values. Banana samples varied in color and ripening level from less ripen to ripen. RGB and Lab values were measured non-destructively using colormeter, while Brix and pH were determined using conventional method in laboratory. Multivariate analysis was done using the Unscrambler ® X 10.3 (CAMO, AS, OLSO, Norway, and trial version). Results showed that calibration model using MLR was able to predict Brix and pH of banana based on RGB and Lab color values. Furthermore, validation data were used to test the selected models. MLR model to predict Brix based on RGB and Lab validation resulted in 0.8 and 0.84 of determination coefficient between observation and prediction data. The model was also able to predict pH based on RGB and Lab values with 0.71 and 0.79 of determination coefficient between observation and prediction data. ABSTRAKPada penelitian ini, model Multiple Linear Regression (MLR) digunakan untuk memprediksi Brix dan pH pada buah pisang berdasarkan nilai warna Red Green Blue (RGB) dan Lab. Pisang yang dianalisis mempunyai variasi warna dari kurang masak sampai masak. Parameter warna RGB dan Lab dilakukan secara non-destruktif dengan menggunakan colormeter, sedangkan pengukuran kualitas internal yaitu Brix dan pH ditentukan secara destruktif atau dengan prosedur konvensional di laboratorium. Aplikasi analisis multivariat yang digunakan adalah Unscrambler ® X 10.3 (CAMO, AS, OLSO, Norway, versi trial). Analisis data menunjukkan bahwa model kalibrasi MLR dapat digunakan untuk memprediksi Brix dan pH berdasarkan parameter warna RGB dan Lab pada buah pisang. Selanjutnya, data validasi digunakan untuk menguji model MLR terpilih. Model kalibrasi MLR dapat memprediksi Brix berdasarkan nilai RGB dan Lab dengan nilai koefisien determinasi (R2) sebesar 0,8 dan 0,84, secara berurutan. Sedangkan koefisien determinasi (R2) untuk pH berdasarkan warna RGB dan Lab adalah 0,71 dan 0,79.


2021 ◽  
Vol 21 (1) ◽  
Author(s):  
Wissanupong Kliengchuay ◽  
Rachodbun Srimanus ◽  
Wechapraan Srimanus ◽  
Sarima Niampradit ◽  
Nopadol Preecha ◽  
...  

Abstract Background The northern regions of Thailand have been facing haze episodes and transboundary air pollution every year in which particulate matter, particularly PM10, accumulates in the air, detrimentally affecting human health. Chiang Rai province is one of the country’s most popular tourist destinations as well as an important economic hub. This study aims to develop and compare the best-fitted model for PM10 prediction for different seasons using meteorological factors. Method The air pollution and weather data acquired from the Pollution Control Department (PCD) spanned from the years 2011 until 2018 at two stations on an hourly basis. Four different stepwise Multiple Linear Regression (MLR) models for predicting the PM10 concentration were then developed, namely annual, summer, rainy, and winter seasons. Results The maximum daily PM10 concentration was observed in the summer season for both stations. The minimum daily concentration was detected in the rainy season. The seasonal variation of PM10 was significantly different for both stations. CO was moderately related to PM10 in the summer season. The PM10 summer model was the best MLR model to predict PM10 during haze episodes. In both stations, it revealed an R2 of 0.73 and 0.61 in stations 65 and 71, respectively. Relative humidity and atmospheric pressure display negative relationships, although temperature is positively correlated with PM10 concentrations in summer and rainy seasons. Whereas pressure plays a positive relationship with PM10 in the winter season. Conclusions In conclusion, the MLR models are effective at estimating PM10 concentrations at the local level for each seasonal. The annual MLR model at both stations indicates a good prediction with an R2 of 0.61 and 0.52 for stations 65 and 73, respectively.


2001 ◽  
Vol 1 (1-2) ◽  
pp. 45-51 ◽  
Author(s):  
Izumi Ohtsuru ◽  
Fumio Eto ◽  
Naoki Wada ◽  
Ikuko Saotome ◽  
Teruhito Furuichi

2020 ◽  
Vol 6 (4) ◽  
pp. 1981-1989
Author(s):  
Sarat Kumar Allu ◽  
Shailaja Srinivasan ◽  
Rama Krishna Maddala ◽  
Aparna Reddy ◽  
Gangagni Rao Anupoju

2021 ◽  
Author(s):  
Marta Ferreira ◽  
Pierre Lovinfosse ◽  
Johanne Hermesse ◽  
Marjolein Decuypere ◽  
Caroline Rousseau ◽  
...  

Abstract Background Features reproducibility and the generalizability of the models are currently among the most important limitations when integrating radiomics into the clinics. Radiomic features are sensitive to imaging acquisition protocols, reconstruction algorithms and parameters, as well as by the different steps of the usual radiomics workflow. We propose a framework for comparing the reproducibility of different pre-processing steps in PET/CT radiomic analysis in the prediction of disease free survival (DFS) across multi-scanners/centers. Results We evaluated and compared the prediction performance of several models that differ in i) the type of intensity discretization, ii) feature selection method, iii) features type i.e, original or tumour to liver ratio radiomic features (OR or TLR). We trained our models using data from one scanner/center and tested on two external scanner/centers. Our results show that there is a low reproducibility in predictions across scanners and discretization methods. Despite of this, TLR based models were generally more robust than OR. Maximum relevance minimum redundancy (MRMR) forward feature selection with Pearson correlation was the feature selection method that had the best mean area under the precision recall curve when using it combining the features from all discretization’s bin’s number (D_All_FBN) with TLR features for two of the four classifiers. Conclusion We evaluated and compared the prediction performance of several models in a data set containing hundred fifty-eight patients with locally advanced cervical cancer (LACC) from three distinct scanners. In our cohort of LAAC patients pre-processing of radiomic features in [18F]FDG PET affects DFS predictions performances across scanners and combining the D_All_FBN TLR approach with the MRMR forward Pearson feature selection method might help increasing robustness of radiomic studies.


2021 ◽  
Vol 36 ◽  
pp. 01014
Author(s):  
Fung Yuen Chin ◽  
Yong Kheng Goh

Feature selection is a process of selecting a group of relevant features by removing unnecessary features for use in constructing the predictive model. However, high dimensional data increases the difficulty of feature selection due to the curse of dimensionality. From the past research, the performance of the predictive model is always compared with the existing results. When attempting to model a new dataset, the current practice is to benchmark for the dataset obtained by including all the features, including redundant features and noise. Here we propose a new optimal baseline for the dataset by mean of ranked features using a mutual information score. The quality of a dataset depends on the information contained in the dataset, and the more information contains in the dataset, the better the performance of the predictive model. The number of features to achieve this new optimal baseline will be obtained at the same time, and serve as the guideline on the number of features needed in a feature selection method. We will also show some experimental results that the proposed method provides a better baseline with fewer features compared to the existing benchmark using all the features.


Sign in / Sign up

Export Citation Format

Share Document