Machine learning approaches in predicting ambulatory same day discharge patients after total hip arthroplasty

2021 ◽  
pp. rapm-2021-102715
Author(s):  
Haoyan Zhong ◽  
Jashvant Poeran ◽  
Alex Gu ◽  
Lauren A Wilson ◽  
Alejandro Gonzalez Della Valle ◽  
...  

BackgroundWith continuing financial and regulatory pressures, practice of ambulatory total hip arthroplasty is increasing. However, studies focusing on selection of optimal candidates are burdened by limitations related to traditional statistical approaches. Hereby we aimed to apply machine learning algorithm to identify characteristics associated with optimal candidates.MethodsThis retrospective cohort study included elective total hip arthroplasty (n=63 859) recorded in National Surgical Quality Improvement Program dataset from 2017 to 2018. The main outcome was length of stay. A total of 40 candidate variables were considered. We applied machine learning algorithms (multivariable logistic regression, artificial neural networks, and random forest models) to predict length of stay=0 day. Models’ accuracies and area under the curve were calculated.ResultsApplying machine learning models to compare length of stay=0 day to length of stay=1–3 days cases, we found area under the curve of 0.715, 0.762, and 0.804, accuracy of 0.65, 0.73, and 0.81 for logistic regression, artificial neural networks, and random forest model, respectively. Regarding the most important predictive features, anesthesia type, body mass index, age, ethnicity, white blood cell count, sodium level, and alkaline phosphatase were highlighted in machine learning models.ConclusionsMachine learning algorithm exhibited acceptable model quality and accuracy. Machine learning algorithms highlighted the as yet unrecognized impact of laboratory testing on future patient ambulatory pathway assignment.

2019 ◽  
Author(s):  
Mohammed Moreb ◽  
Oguz Ata

Abstract Background We propose a novel framework for health Informatics: framework and methodology of Software Engineering for machine learning in Health Informatics (SEMLHI). This framework shed light on its features, that allow users to study and analyze the requirements, determine the function of objects related to the system and determine the machine learning algorithms that will be used for the dataset.Methods Based on original data that collected from the hospital in Palestine government in the past three years, first the data validated and all outlier removed, analyzed using develop framework in order to compare ML provide patients with real-time. Our proposed module comparison with three Systems Engineering Methods Vee, agile and SEMLHI. The result used by implement prototype system, which require machine learning algorithm, after development phase, questionnaire deliver to developer to indicate the result using three methodology. SEMLHI framework, is composed into four components: software, machine learning model, machine learning algorithms, and health informatics data, Machine learning Algorithm component used five algorithms use to evaluate the accuracy for machine learning models on component.Results we compare our approach with the previously published systems in terms of performance to evaluate the accuracy for machine learning models, the results of accuracy with different algorithms applied for 750 case, linear SVG have about 0.57 value compared with KNeighbors classifier, logistic regression, multinomial NB, random forest classifier. This research investigates the interaction between SE, and ML within the context of health informatics, our proposed framework define the methodology for developers to analyzing and developing software for the health informatic model, and create a space, in which software engineering, and ML experts could work on the ML model lifecycle, on the disease level and the subtype level.Conclusions This article is an ongoing effort towards defining and translating an existing research pipeline into four integrated modules, as framework system using the dataset from healthcare to reduce cost estimation by using a new suggested methodology. The framework is available as open source software, licensed under GNU General Public License Version 3 to encourage others to contribute to the future development of the SEMLHI framework.


Author(s):  
Isaac Kofi Nti ◽  
◽  
Owusu N yarko-Boateng ◽  
Justice Aning

The numerical value of k in a k-fold cross-validation training technique of machine learning predictive models is an essential element that impacts the model’s performance. A right choice of k results in better accuracy, while a poorly chosen value for k might affect the model’s performance. In literature, the most commonly used values of k are five (5) or ten (10), as these two values are believed to give test error rate estimates that suffer neither from extremely high bias nor very high variance. However, there is no formal rule. To the best of our knowledge, few experimental studies attempted to investigate the effect of diverse k values in training different machine learning models. This paper empirically analyses the prevalence and effect of distinct k values (3, 5, 7, 10, 15 and 20) on the validation performance of four well-known machine learning algorithms (Gradient Boosting Machine (GBM), Logistic Regression (LR), Decision Tree (DT) and K-Nearest Neighbours (KNN)). It was observed that the value of k and model validation performance differ from one machine-learning algorithm to another for the same classification task. However, our empirical suggest that k = 7 offers a slight increase in validations accuracy and area under the curve measure with lesser computational complexity than k = 10 across most MLA. We discuss in detail the study outcomes and outline some guidelines for beginners in the machine learning field in selecting the best k value and machine learning algorithm for a given task.


Arthroplasty ◽  
2021 ◽  
Vol 3 (1) ◽  
Author(s):  
Siyuan Zhang ◽  
Jerry Yongqiang Chen ◽  
Hee Nee Pang ◽  
Ngai Nung Lo ◽  
Seng Jin Yeo ◽  
...  

Abstract Background Patient satisfaction is a unique and important measure of success after total hip arthroplasty (THA). Our study aimed to evaluate the use of machine learning (ML) algorithms to predict patient satisfaction after THA. Methods Prospectively collected data of 1508 primary THAs performed between 2006 and 2018 were extracted from our joint replacement registry and split into training (80%) and test (20%) sets. Supervised ML algorithms (Random Forest, Extreme Gradient Boosting, Support Vector Machines, Logistic LASSO) were developed with the training set, using patient demographics, comorbidities and preoperative patient reported outcome measures (PROMs) (Short Form-36 [SF-36], physical component summary [PCS] and mental component summary [MCS], Western Ontario and McMaster’s Universities Osteoarthritis Index [WOMAC] and Oxford Hip Score [OHS]) to predict patient satisfaction at 2 years postoperatively. Predictive performance was evaluated using the independent test set. Results Preoperative models demonstrated fair discriminative ability in predicting patient satisfaction, with the LASSO model achieving a maximum AUC of 0.76. Permutation importance revealed that the most important predictors of dissatisfaction were (1) patient’s age, (2) preoperative WOMAC, (3) number of comorbidities, (4) preoperative MCS, (5) previous lumbar spine surgery, and (6) low BMI (< 18.5). Conclusion Machine learning algorithms demonstrated fair discriminative ability in predicting patient satisfaction after THA. We have identified modifiable and non-modifiable predictors of postoperative satisfaction which could enhance preoperative counselling and improve health optimization prior to THA.


Hypertension ◽  
2021 ◽  
Vol 78 (5) ◽  
pp. 1595-1604
Author(s):  
Fabrizio Buffolo ◽  
Jacopo Burrello ◽  
Alessio Burrello ◽  
Daniel Heinrich ◽  
Christian Adolf ◽  
...  

Primary aldosteronism (PA) is the cause of arterial hypertension in 4% to 6% of patients, and 30% of patients with PA are affected by unilateral and surgically curable forms. Current guidelines recommend screening for PA ≈50% of patients with hypertension on the basis of individual factors, while some experts suggest screening all patients with hypertension. To define the risk of PA and tailor the diagnostic workup to the individual risk of each patient, we developed a conventional scoring system and supervised machine learning algorithms using a retrospective cohort of 4059 patients with hypertension. On the basis of 6 widely available parameters, we developed a numerical score and 308 machine learning-based models, selecting the one with the highest diagnostic performance. After validation, we obtained high predictive performance with our score (optimized sensitivity of 90.7% for PA and 92.3% for unilateral PA [UPA]). The machine learning-based model provided the highest performance, with an area under the curve of 0.834 for PA and 0.905 for diagnosis of UPA, with optimized sensitivity of 96.6% for PA, and 100.0% for UPA, at validation. The application of the predicting tools allowed the identification of a subgroup of patients with very low risk of PA (0.6% for both models) and null probability of having UPA. In conclusion, this score and the machine learning algorithm can accurately predict the individual pretest probability of PA in patients with hypertension and circumvent screening in up to 32.7% of patients using a machine learning-based model, without omitting patients with surgically curable UPA.


Author(s):  
A. Khanwalkar ◽  
R. Soni

Purpose: Diabetes is a chronic disease that pays for a large proportion of the nation's healthcare expenses when people with diabetes want medical care continuously. Several complications will occur if the polymer disorder is not treated and unrecognizable. The prescribed condition leads to a diagnostic center and a doctor's intention. One of the real-world subjects essential is to find the first phase of the polytechnic. In this work, basically a survey that has been analyzed in several parameters within the poly-infected disorder diagnosis. It resembles the classification algorithms of data collection that plays an important role in the data collection method. Automation of polygenic disorder analysis, as well as another machine learning algorithm. Design/methodology/approach: This paper provides extensive surveys of different analogies which have been used for the analysis of medical data, For the purpose of early detection of polygenic disorder. This paper takes into consideration methods such as J48, CART, SVMs and KNN square, this paper also conducts a formal surveying of all the studies, and provides a conclusion at the end. Findings: This surveying has been analyzed on several parameters within the poly-infected disorder diagnosis. It resembles that the classification algorithms of data collection plays an important role in the data collection method in Automation of polygenic disorder analysis, as well as another machine learning algorithm. Practical implications: This paper will help future researchers in the field of Healthcare, specifically in the domain of diabetes, to understand differences between classification algorithms. Originality/value: This paper will help in comparing machine learning algorithms by going through results and selecting the appropriate approach based on requirements.


Sign in / Sign up

Export Citation Format

Share Document