Application of Various Machine Learning Models for Process Stability of Bio-Electrochemical Anaerobic Digestion

The application of a machine learning (ML) model to bio-electrochemical anaerobic digestion (BEAD) is a future-oriented approach for improving process stability by predicting performances that have nonlinear relationships with various operational parameters. Five ML models, which included tree-, regression-, and neural network-based algorithms, were applied to predict the methane yield in BEAD reactor. The results showed that various 1-step ahead ML models, which utilized prior data of BEAD performances, could enhance prediction accuracy. In addition, 1-step ahead with retraining algorithm could improve prediction accuracy by 37.3% compared with the conventional multi-step ahead algorithm. The improvement was particularly noteworthy in tree- and regression-based ML models. Moreover, 1-step ahead with retraining algorithm showed high potential of achieving efficient prediction using pH as a single input data, which is plausibly an easier monitoring parameter compared with the other parameters required in bioprocess models.

Download Full-text

Retraining prior state performances of anaerobic digestion improves prediction accuracy of methane yield in various machine learning models

Applied Energy ◽

10.1016/j.apenergy.2021.117250 ◽

2021 ◽

Vol 298 ◽

pp. 117250

Author(s):

Jun-Gyu Park ◽

Hang-Bae Jun ◽

Tae-Young Heo

Keyword(s):

Machine Learning ◽

Anaerobic Digestion ◽

Prediction Accuracy ◽

Methane Yield ◽

Learning Models ◽

Prior State ◽

Machine Learning Models

Download Full-text

Automated Retraining of Machine Learning Models

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l3322.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 445-452

Keyword(s):

Machine Learning ◽

Input Data ◽

Research Work ◽

Learning Models ◽

Machine Learning Methods ◽

Machine Learning Model ◽

Crucial Component ◽

Conventional Machine ◽

Over Time ◽

Machine Learning Models

Data is the most crucial component of a successful ML system. Once a machine learning model is developed, it gets obsolete over time due to presence of new input data being generated every second. In order to keep our predictions accurate we need to find a way to keep our models up to date. Our research work involves finding a mechanism which can retrain the model with new data automatically. This research also involves exploring the possibilities of automating machine learning processes. We started this project by training and testing our model using conventional machine learning methods. The outcome was then compared with the outcome of those experiments conducted using the AutoML methods like TPOT. This helped us in finding an efficient technique to retrain our models. These techniques can be used in areas where people do not deal with the actual working of a ML model but only require the outputs of ML processes

Download Full-text

Online learning behavior analysis based on machine learning

Asian Association of Open Universities Journal ◽

10.1108/aaouj-08-2019-0029 ◽

2019 ◽

Vol 14 (2) ◽

pp. 97-106

Author(s):

Ning Yan ◽

Oliver Tat-Sheung Au

Keyword(s):

Machine Learning ◽

Online Learning ◽

Correlation Analysis ◽

Prediction Accuracy ◽

Classification Models ◽

Limited Data ◽

Learning Models ◽

Learning Behavior ◽

Content Type ◽

Machine Learning Models

Purpose The purpose of this paper is to make a correlation analysis between students’ online learning behavior features and course grade, and to attempt to build some effective prediction model based on limited data. Design/methodology/approach The prediction label in this paper is the course grade of students, and the eigenvalues available are student age, student gender, connection time, hits count and days of access. The machine learning model used in this paper is the classical three-layer feedforward neural networks, and the scaled conjugate gradient algorithm is adopted. Pearson correlation analysis method is used to find the relationships between course grade and the student eigenvalues. Findings Days of access has the highest correlation with course grade, followed by hits count, and connection time is less relevant to students’ course grade. Student age and gender have the lowest correlation with course grade. Binary classification models have much higher prediction accuracy than multi-class classification models. Data normalization and data discretization can effectively improve the prediction accuracy of machine learning models, such as ANN model in this paper. Originality/value This paper may help teachers to find some clue to identify students with learning difficulties in advance and give timely help through the online learning behavior data. It shows that acceptable prediction models based on machine learning can be built using a small and limited data set. However, introducing external data into machine learning models to improve its prediction accuracy is still a valuable and hard issue.

Download Full-text

Modeling and Optimization of Anaerobic Digestion: A Review

E3S Web of Conferences ◽

10.1051/e3sconf/202122901022 ◽

2021 ◽

Vol 229 ◽

pp. 01022

Author(s):

Fatima Walid ◽

Sanaa El Fkihi ◽

Houda Benbrahim ◽

Hicham Tagemouati

Keyword(s):

Mathematical Modeling ◽

Machine Learning ◽

Anaerobic Digestion ◽

Biogas Production ◽

Data Driven ◽

Management Technique ◽

Learning Models ◽

Process Dynamics ◽

Metaheuristic Techniques ◽

Machine Learning Models

Anaerobic digestion is recognized as being an advantageous waste management technique representing a source of clean and renewable energy. However, biogas production through such practice is complex and it relies on the interaction of several factors including changes in operating and monitoring parameters. Enormous researchers have focused and gave their full attention to mathematical modeling of anaerobic digestion to get good insights about process dynamics, aiming to optimize its efficiency. This paper gives an overview of the different approaches applied to tackle this challenge including mechanistic and data-driven models. This review has led us to conclude that neural networks combined with metaheuristic techniques has the potential to outperform mechanistic and classical machine learning models.

Download Full-text

Accurately Differentiating Between Patients With COVID-19, Patients With Other Viral Infections, and Healthy Individuals: Multimodal Late Fusion Learning Approach

Journal of Medical Internet Research ◽

10.2196/25535 ◽

2021 ◽

Vol 23 (1) ◽

pp. e25535

Author(s):

Ming Xu ◽

Liu Ouyang ◽

Lei Han ◽

Kai Sun ◽

Tingting Yu ◽

...

Keyword(s):

Machine Learning ◽

Prediction Accuracy ◽

Clinical Applications ◽

Viral Pneumonia ◽

High Dimensional ◽

Healthy Individuals ◽

Late Fusion ◽

Single Feature ◽

Multimodal Features ◽

Machine Learning Models

Background Effectively identifying patients with COVID-19 using nonpolymerase chain reaction biomedical data is critical for achieving optimal clinical outcomes. Currently, there is a lack of comprehensive understanding in various biomedical features and appropriate analytical approaches for enabling the early detection and effective diagnosis of patients with COVID-19. Objective We aimed to combine low-dimensional clinical and lab testing data, as well as high-dimensional computed tomography (CT) imaging data, to accurately differentiate between healthy individuals, patients with COVID-19, and patients with non-COVID viral pneumonia, especially at the early stage of infection. Methods In this study, we recruited 214 patients with nonsevere COVID-19, 148 patients with severe COVID-19, 198 noninfected healthy participants, and 129 patients with non-COVID viral pneumonia. The participants’ clinical information (ie, 23 features), lab testing results (ie, 10 features), and CT scans upon admission were acquired and used as 3 input feature modalities. To enable the late fusion of multimodal features, we constructed a deep learning model to extract a 10-feature high-level representation of CT scans. We then developed 3 machine learning models (ie, k-nearest neighbor, random forest, and support vector machine models) based on the combined 43 features from all 3 modalities to differentiate between the following 4 classes: nonsevere, severe, healthy, and viral pneumonia. Results Multimodal features provided substantial performance gain from the use of any single feature modality. All 3 machine learning models had high overall prediction accuracy (95.4%-97.7%) and high class-specific prediction accuracy (90.6%-99.9%). Conclusions Compared to the existing binary classification benchmarks that are often focused on single-feature modality, this study’s hybrid deep learning-machine learning framework provided a novel and effective breakthrough for clinical applications. Our findings, which come from a relatively large sample size, and analytical workflow will supplement and assist with clinical decision support for current COVID-19 diagnostic methods and other clinical applications with high-dimensional multimodal biomedical features.

Download Full-text

An Evaluation of Wearable Inertial Sensor Configuration and Supervised Machine Learning Models for Automatic Punch Classification in Boxing

IoT ◽

10.3390/iot1020021 ◽

2020 ◽

Vol 1 (2) ◽

pp. 360-381

Author(s):

Matthew T. O. Worsey ◽

Hugo G. Espinosa ◽

Jonathan B. Shepherd ◽

David V. Thiel

Keyword(s):

Machine Learning ◽

Prediction Accuracy ◽

Inertial Sensors ◽

Inertial Sensor ◽

Supervised Machine Learning ◽

Learning Models ◽

Significant Statistical Difference ◽

Sensor Configuration ◽

Wearable Inertial Sensors ◽

Machine Learning Models

Machine learning is a powerful tool for data classification and has been used to classify movement data recorded by wearable inertial sensors in general living and sports. Inertial sensors can provide valuable biofeedback in combat sports such as boxing; however, the use of such technology has not had a global uptake. If simple inertial sensor configurations can be used to automatically classify strike type, then cumbersome tasks such as video labelling can be bypassed and the foundation for automated workload monitoring of combat sport athletes is set. This investigation evaluates the classification performance of six different supervised machine learning models (tuned and untuned) when using two simple inertial sensor configurations (configuration 1—inertial sensor worn on both wrists; configuration 2—inertial sensor worn on both wrists and third thoracic vertebrae [T3]). When trained on one athlete, strike prediction accuracy was good using both configurations (sensor configuration 1 mean overall accuracy: 0.90 ± 0.12; sensor configuration 2 mean overall accuracy: 0.87 ± 0.09). There was no significant statistical difference in prediction accuracy between both configurations and tuned and untuned models (p > 0.05). Moreover, there was no significant statistical difference in computational training time for tuned and untuned models (p > 0.05). For sensor configuration 1, a support vector machine (SVM) model with a Gaussian rbf kernel performed the best (accuracy = 0.96), for sensor configuration 2, a multi-layered perceptron neural network (MLP-NN) model performed the best (accuracy = 0.98). Wearable inertial sensors can be used to accurately classify strike-type in boxing pad work, this means that cumbersome tasks such as video and notational analysis can be bypassed. Additionally, automated workload and performance monitoring of athletes throughout training camp is possible. Future investigations will evaluate the performance of this algorithm on a greater sample size and test the influence of impact window-size on prediction accuracy. Additionally, supervised machine learning models should be trained on data collected during sparring to see if high accuracy holds in a competition setting. This can help move closer towards automatic scoring in boxing.

Download Full-text

Comparison of machine-learning models for predicting short-term building heating load using operational parameters

Energy and Buildings ◽

10.1016/j.enbuild.2021.111505 ◽

2021 ◽

pp. 111505

Author(s):

Yong Zhou ◽

Yanfeng Liu ◽

Dengjia Wang ◽

Xiaojun Liu

Keyword(s):

Machine Learning ◽

Learning Models ◽

Operational Parameters ◽

Short Term ◽

Heating Load ◽

Machine Learning Models

Download Full-text

Predicting High-Flow Nasal Cannula Failure in an ICU Using a Recurrent Neural Network with Transfer Learning and Input Data Perseveration: A Retrospective Analysis (Preprint)

10.2196/preprints.31760 ◽

2021 ◽

Author(s):

George Pappy ◽

Melissa Aczon ◽

Randall Wetzel ◽

David Ledbetter

Keyword(s):

Machine Learning ◽

Transfer Learning ◽

Airway Pressure ◽

Input Data ◽

Positive Airway Pressure ◽

High Flow ◽

Learning Models ◽

High Flow Nasal Cannula ◽

Non Invasive ◽

Machine Learning Models

BACKGROUND High Flow Nasal Cannula (HFNC) provides non-invasive respiratory support for critically ill children who may tolerate it more readily than other Non-Invasive (NIV) techniques such as Bilevel Positive Airway Pressure (BiPAP) and Continuous Positive Airway Pressure (CPAP). Moreover, HFNC may preclude the need for mechanical ventilation (intubation). Nevertheless, NIV or intubation may ultimately be necessary for certain patients. Timely prediction of HFNC failure can provide an indication for increasing respiratory support. OBJECTIVE This work developed and compared machine learning models to predict HFNC failure. METHODS A retrospective study was conducted using the Virtual Pediatric Intensive Care Unit database of Electronic Medical Records (EMR) of patients admitted to a tertiary pediatric ICU from January 2010 to February 2020. Patients <19 years old, without apnea, and receiving HFNC treatment were included. A Long Short-Term Memory (LSTM) model using 517 variables (vital signs, laboratory data and other clinical parameters) was trained to generate a continuous prediction of HFNC failure, defined as escalation to NIV or intubation within 24 hours of HFNC initiation. For comparison, seven other models were trained: a Logistic Regression (LR) using the same 517 variables, another LR using only 14 variables, and five additional LSTM-based models using the same 517 variables as the first LSTM and incorporating additional ML techniques (transfer learning, input perseveration, and ensembling). Performance was assessed using the area under the receiver operating curve (AUROC) at various times following HFNC initiation. The sensitivity, specificity, positive and negative predictive values (PPV, NPV) of predictions at two hours after HFNC initiation were also evaluated. These metrics were also computed in a cohort with primarily respiratory diagnoses. RESULTS 834 HFNC trials [455 training, 173 validation, 206 test] met the inclusion criteria, of which 175 [103, 30, 42] (21.0%) escalated to NIV or intubation. The LSTM models trained with transfer learning generally performed better than the LR models, with the best LSTM model achieving an AUROC of 0.78, vs 0.66 for the 14-variable LR and 0.71 for the 517-variable LR, two hours after initiation. All models except for the 14-variable LR achieved higher AUROCs in the respiratory cohort than in the general ICU population. CONCLUSIONS Machine learning models trained using EMR data were able to identify children at risk for failing HFNC within 24 hours of initiation. LSTM models that incorporated transfer learning, input data perseveration and ensembling showed improved performance than the LR and standard LSTM models.

Download Full-text

A Dynamic Convolutional Neural Network Based Shared-Bike Demand Forecasting Model

ACM Transactions on Intelligent Systems and Technology ◽

10.1145/3447988 ◽

2021 ◽

Vol 12 (6) ◽

pp. 1-24

Author(s):

Shaojie Qiao ◽

Nan Han ◽

Jianbin Huang ◽

Kun Yue ◽

Rui Mao ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Prediction Accuracy ◽

Demand Forecasting ◽

Forecasting Model ◽

Learning Models ◽

Bike Sharing ◽

Demand Forecasting Model ◽

Machine Learning Models

Bike-sharing systems are becoming popular and generate a large volume of trajectory data. In a bike-sharing system, users can borrow and return bikes at different stations. In particular, a bike-sharing system will be affected by weather, the time period, and other dynamic factors, which challenges the scheduling of shared bikes. In this article, a new shared-bike demand forecasting model based on dynamic convolutional neural networks, called SDF , is proposed to predict the demand of shared bikes. SDF chooses the most relevant weather features from real weather data by using the Pearson correlation coefficient and transforms them into a two-dimensional dynamic feature matrix, taking into account the states of stations from historical data. The feature information in the matrix is extracted, learned, and trained with a newly proposed dynamic convolutional neural network to predict the demand of shared bikes in a dynamical and intelligent fashion. The phase of parameter update is optimized from three aspects: the loss function, optimization algorithm, and learning rate. Then, an accurate shared-bike demand forecasting model is designed based on the basic idea of minimizing the loss value. By comparing with classical machine learning models, the weight sharing strategy employed by SDF reduces the complexity of the network. It allows a high prediction accuracy to be achieved within a relatively short period of time. Extensive experiments are conducted on real-world bike-sharing datasets to evaluate SDF. The results show that SDF significantly outperforms classical machine learning models in prediction accuracy and efficiency.

Download Full-text