Machine Learning for Air Transport Planning and Management

In this work we compare the performance of several machine learning algorithms applied to the problem of modelling air transport demand. Forecasting in the air transport industry is an essential part of planning and managing because of the economic and financial aspects of the industry. The traditional approach used in airline operations as specified by the International Civil Aviation Organization is the use of a multiple linear regression (MLR) model, utilizing cost variables and economic factors. Here, the performance of models utilizing an artificial neural network (ANN), an adaptive neuro-fuzzy inference system (ANFIS), a genetic algorithm, a support vector machine, and a regression tree are compared to MLR. The ANN and ANFIS had the best performance in terms of the lowest mean squared error.

On the Prediction of Biogas Production from Vegetables, Fruits, and Food Wastes by ANFIS- and LSSVM-Based Models

BioMed Research International ◽

10.1155/2021/9202127 ◽

2021 ◽

Vol 2021 ◽

pp. 1-8

Author(s):

Yong Yang ◽

Shuaishuai Zheng ◽

Zhilu Ai ◽

Mohammad Mahdi Molla Jafari

Keyword(s):

Mean Squared Error ◽

Fuzzy Inference ◽

Biogas Production ◽

Input Parameter ◽

Machine Learning Algorithms ◽

Least Square ◽

Support Vector ◽

Robust Algorithms ◽

Inference System ◽

Proposed Model

This study aims to model biodigestion systems as a function of the most influential parameters, yielding two robust algorithms based on machine learning: the adaptive network-based fuzzy inference system (ANFIS) and the least square support vector machine (LSSVM). The models are assessed using multiple statistical analyses of the actual values and the model outcomes. Results from the suggested models indicate a strong capability for predicting biogas production from vegetables, fruits, and food wastes over a wide range of input parameters. The calculated mean relative error (MRE %) and mean squared error (MSE) were 29.318 and 0.0039 for ANFIS, and 2.951 and 0.0001 for LSSVM, which shows that the latter model predicts the target data better. Finally, for additional certainty, outlier identification and sensitivity analyses were performed on the input parameter data; these showed that the model proposed in this paper is more reliable in assessing output values than the previous model.
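
The two error measures quoted in this abstract can be sketched as follows. This is a minimal illustration, assuming MRE % is the mean of |actual - predicted| / |actual| times 100 (the paper may define it differently); the function names and sample values are hypothetical and not taken from the study.

```python
def mean_relative_error_pct(actual, predicted):
    """Mean of |actual - predicted| / |actual|, in percent (assumed definition)."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


def mean_squared_error(actual, predicted):
    """Mean of the squared differences between actual and predicted values."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical measurements, for illustration only.
actual = [2.0, 4.0, 5.0]
predicted = [1.0, 4.0, 6.0]
print(mean_relative_error_pct(actual, predicted))
print(mean_squared_error(actual, predicted))
```

Because MRE is scale-free while MSE depends on the units of the target, the two figures are best compared model against model rather than against each other.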

Sentiment Analysis

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Handbook of Research on Pattern Engineering System Development for Big Data Analytics ◽

10.4018/978-1-5225-3870-7.ch009 ◽

2018 ◽

pp. 130-152 ◽

Cited By ~ 1

Author(s):

Syed Muzamil Basha ◽

Dharmendra Singh Rajput

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Daily Activity ◽

Opinion Mining ◽

Fuzzy Inference ◽

Human Life ◽

Machine Learning Algorithms ◽

Support Vector ◽

Inference System ◽

Neural Fuzzy

E-commerce has become a daily activity in human life. The opinions and past experiences of others with a particular product play a prominent role in selecting that product from the online market. In this chapter, the authors consider tweets as a source through which users express emotions on particular subjects. The tweets are scored with different sentiment scoring techniques. Since the patterns used in social media are relatively short, exact matches are uncommon, and taking advantage of partial matches significantly improves the accuracy of sentiment analysis. The authors also focus on applying an artificial neural fuzzy inference system (ANFIS) to train the model for better opinion mining. The scored sentiments are then classified using machine learning algorithms such as support vector machine (SVM), decision tree, and naive Bayes.

Using Machine Learning-Based Algorithms to Analyze Erosion Rates of a Watershed in Northern Taiwan

Sustainability ◽

10.3390/su12052022 ◽

2020 ◽

Vol 12 (5) ◽

pp. 2022 ◽

Cited By ~ 1

Author(s):

Kieu Anh Nguyen ◽

Walter Chen ◽

Bor-Shiun Lin ◽

Uma Seeboonruang

Keyword(s):

Machine Learning ◽

Random Forest ◽

Fuzzy Inference ◽

Absolute Error ◽

Organic Content ◽

Rank Test ◽

Support Vector ◽

Inference System ◽

Erosion Rates ◽

Northern Taiwan

This study continues a previous study with further analysis of watershed-scale erosion pin measurements. Three machine learning (ML) algorithms—Support Vector Machine (SVM), Adaptive Neuro-Fuzzy Inference System (ANFIS), and Artificial Neural Network (ANN)—were used to analyze depth of erosion of a watershed (Shihmen reservoir) in northern Taiwan. In addition to three previously used statistical indexes (Mean Absolute Error, Root Mean Square of Error, and R-squared), Nash–Sutcliffe Efficiency (NSE) was calculated to compare the predictive performances of the three models. To see if there was a statistical difference between the three models, the Wilcoxon signed-rank test was used. The research utilized 14 environmental attributes as the input predictors of the ML algorithms. They are distance to river, distance to road, type of slope, sub-watershed, slope direction, elevation, slope class, rainfall, epoch, lithology, and the amount of organic content, clay, sand, and silt in the soil. Additionally, measurements of a total of 550 erosion pins installed on 55 slopes were used as the target variable of the model prediction. The dataset was divided into a training set (70%) and a testing set (30%) using the stratified random sampling with sub-watershed as the stratification variable. The results showed that the ANFIS model outperforms the other two algorithms in predicting the erosion rates of the study area. The average RMSE of the test data is 2.05 mm/yr for ANFIS, compared to 2.36 mm/yr and 2.61 mm/yr for ANN and SVM, respectively. Finally, the results of this study (ANN, ANFIS, and SVM) were compared with the previous study (Random Forest, Decision Tree, and multiple regression). It was found that Random Forest remains the best predictive model, and ANFIS is the second-best among the six ML algorithms.
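
The 70/30 stratified split and the Nash–Sutcliffe Efficiency mentioned in this abstract can be sketched as below. This is an illustrative sketch only: the record layout, the field names `sub_watershed` and `erosion_mm_per_yr`, the seed, and the per-stratum rounding rule are assumptions, not details from the study.

```python
import random
from collections import defaultdict


def stratified_split(records, key, train_fraction=0.7, seed=0):
    """Split records into train/test, sampling each stratum separately."""
    rng = random.Random(seed)
    groups = defaultdict(list)
    for record in records:
        groups[key(record)].append(record)
    train, test = [], []
    for members in groups.values():
        shuffled = members[:]
        rng.shuffle(shuffled)
        cut = round(train_fraction * len(shuffled))  # assumed rounding rule
        train.extend(shuffled[:cut])
        test.extend(shuffled[cut:])
    return train, test


def nash_sutcliffe_efficiency(observed, simulated):
    """NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    variance = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - residual / variance


# Hypothetical erosion-pin records, for illustration only.
pins = [
    {"sub_watershed": "A" if i < 10 else "B", "erosion_mm_per_yr": float(i)}
    for i in range(20)
]
train, test = stratified_split(pins, key=lambda r: r["sub_watershed"])
print(len(train), len(test))
```

An NSE of 1 means a perfect fit, while 0 means the model does no better than predicting the observed mean.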

Spatiotemporal Approaches for Quality Control and Error Correction of Atmospheric Data through Machine Learning

Computational Intelligence and Neuroscience ◽

10.1155/2020/7980434 ◽

2020 ◽

Vol 2020 ◽

pp. 1-12 ◽

Cited By ~ 2

Author(s):

Hye-Jin Kim ◽

Sung Min Park ◽

Byung Jin Choi ◽

Seung-Hyun Moon ◽

Yong-Hyuk Kim

Keyword(s):

Machine Learning ◽

Time Series ◽

Quality Control ◽

Mean Squared Error ◽

Machine Learning Algorithms ◽

Support Vector ◽

Weather Element ◽

Applied Machine Learning ◽

Squared Error ◽

Atmospheric Data

We propose three quality control (QC) techniques using machine learning that depend on the type of input data used for training. These include QC based on time series of a single weather element, QC based on time series in conjunction with other weather elements, and QC using spatiotemporal characteristics. We performed machine learning-based QC on each weather element of atmospheric data, such as temperature, acquired from seven types of IoT sensors and applied machine learning algorithms, such as support vector regression, on data with errors to make meaningful estimates from them. By using the root mean squared error (RMSE), we evaluated the performance of the proposed techniques. As a result, the QC done in conjunction with other weather elements had 0.14% lower RMSE on average than QC conducted with only a single weather element. In the case of QC with spatiotemporal characteristic considerations, the QC done via training with AWS data showed performance with 17% lower RMSE than QC done with only raw data.

Evaluating essential features of proppant transport at engineering scales combining field measurements with machine learning algorithms

10.31223/x5r64t ◽

2021 ◽

Author(s):

Xiaoyu Wang ◽

Lei Hou ◽

Xueyu Geng ◽

Peibin Gong ◽

Honglei Liu

Keyword(s):

Machine Learning ◽

Mean Squared Error ◽

Learning Algorithms ◽

Field Measurements ◽

Stratified Flow ◽

Machine Learning Algorithms ◽

Support Vector ◽

Proppant Transport ◽

Control Variate ◽

Injection Process

The characterization of proppant transport at a field-engineering scale is still challenging due to the lack of direct subsurface measurements. Features that control the proppant transport may link the experimental and numerical observations to practical operations at a field scale. To improve the numerical and laboratory simulations, we propose a machine-learning-based workflow to evaluate the essential features of proppant transport and their corresponding calculations. The proppant flow in fractures is estimated by applying the gated recurrent unit (GRU) and support vector machine (SVM) algorithms to the measurements obtained from shale gas fracturing operations. Over 430,000 groups of fracturing data are collected and pre-processed by the proppant transport models to calculate key features, including settlement, stratified flow, and inception of settled particles. The features are then fed into machine learning algorithms for pressure prediction. The root mean squared error (RMSE) is used as the criterion for ranking selected features via the control variate method. Our results show that the stratified-flow feature (fracture-level) offers a better interpretation of the proppant transport, in which the Bi-power model helps to produce the best predictions. The settlement and inception features (particle-level) perform better in cases where the pressure fluctuates significantly, indicating that more complex fractures may have been generated. Moreover, our analyses of the remaining errors in the pressure-ascending cases suggest that (1) introducing the alternate-injection process and (2) improving the calculation of proppant transport in complex fracture networks and highly-filled fractures will benefit both experimental observations and field applications.

Automated Detection of Multiple Sclerosis Lesions Using Texture-based Features and a Hybrid Classifier

Caspian Journal of Neurological Sciences ◽

10.32598/cjns.6.20.220.1 ◽

2020 ◽

Vol 6 (1) ◽

pp. 16-30

Author(s):

Somayeh Raiesdana

Keyword(s):

Machine Learning ◽

Multiple Sclerosis ◽

Fuzzy Inference ◽

Brain MRI ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Inference System ◽

Common Technique ◽

Magnetic Resonance Imaging (MRI)

Background: Multiple Sclerosis (MS) is the most frequent non-traumatic neurological disease capable of causing disability in young adults. Detection of MS lesions with magnetic resonance imaging (MRI) is the most common technique. However, manual interpretation of vast amounts of data is often tedious and error-prone. Furthermore, changes in lesions are often subtle and extremely unrepresentative. Objectives: To develop an automated non-subjective method for the detection and quantification of MS lesions. Materials & Methods: This paper focuses on the automatic detection and classification of MS lesions in brain MRI images. Two datasets, one simulated and the other recorded in hospital, are utilized in this work. A novel hybrid algorithm combining image processing and machine learning techniques is implemented. To this end, first, intricate morphological patterns are extracted from MRI images via texture analysis. Then, statistical texture-based features are extracted. Afterward, two supervised machine learning algorithms, i.e., the Hidden Markov Model (HMM) and Adaptive Neuro-Fuzzy Inference System (ANFIS), are employed within a hybrid platform. The hybrid system makes decisions based on ensemble learning. The stacking technique is used to apply predictions from both models to train a perceptron as a decisive model. Results: Experimental results on both datasets indicate that the proposed hybrid method outperforms the HMM and ANFIS classifiers by reducing false positives. Furthermore, the performance of the proposed method was confirmed by comparison with state-of-the-art methods. Conclusion: The remarkable results of the proposed method motivate advanced detection systems employing other MRI sequences and their combination.
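
The stacking step described in this abstract, where the outputs of two base models become the inputs of a perceptron, can be sketched roughly as follows. This is not the author's implementation: the base-model scores stand in for HMM and ANFIS outputs and are invented for illustration, and the learning rate, epoch limit, and function names are assumptions.

```python
def train_perceptron(features, labels, max_epochs=1000, learning_rate=1.0):
    """Train a single perceptron with the classic mistake-driven update rule."""
    weights = [0.0] * len(features[0])
    bias = 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in zip(features, labels):
            activation = sum(w * xi for w, xi in zip(weights, x)) + bias
            predicted = 1 if activation > 0 else 0
            error = y - predicted  # -1, 0, or +1
            if error:
                weights = [w + learning_rate * error * xi
                           for w, xi in zip(weights, x)]
                bias += learning_rate * error
                mistakes += 1
        if mistakes == 0:  # a full pass with no mistakes: converged
            break
    return weights, bias


def predict(weights, bias, x):
    """Return 1 (lesion) or 0 (no lesion) for one stacked feature vector."""
    return 1 if sum(w * xi for w, xi in zip(weights, x)) + bias > 0 else 0


# Hypothetical base-model scores per sample: (model A score, model B score).
stacked = [(0.9, 0.8), (0.8, 0.3), (0.2, 0.9), (0.7, 0.9), (0.1, 0.2), (0.95, 0.6)]
labels = [1, 0, 0, 1, 0, 1]

weights, bias = train_perceptron(stacked, labels)
print([predict(weights, bias, x) for x in stacked])
```

In this toy data a sample is positive only when both base models score it highly, which is one way a meta-learner can cut false positives raised by a single base model.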

Assessment of Machine Learning Algorithms for Prediction of Breast Cancer Malignancy Based on Mammogram Numeric Data

10.1101/2020.01.08.20016949 ◽

2020 ◽

Cited By ~ 1

Author(s):

Peter T. Habib ◽

Alsamman M. Alsamman ◽

Sameh E. Hassnein ◽

Ghada A. Shereif ◽

Aladdin Hamwieh

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Cross Validation ◽

Mean Squared Error ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Adjusted Rand Index ◽

Support Vector ◽

Cancer Information ◽

Term Care

Abstract: Breast cancer, with an estimated 268,600 new cases in 2019, is one of the most common cancers and one of the world's leading causes of death for women. Classification and data mining are efficient ways to classify information, particularly in the medical field, where prediction techniques are commonly used for early detection and effective treatment in diagnosis and research. This paper tests models for the mammogram analysis of breast cancer information from 23 of the more widely used machine learning algorithms, such as Decision Tree, Random Forest, K-nearest neighbors, and support vector machine. Results are obtained from random splits produced by a replicated 10-fold cross-validation method. Accuracy is calculated with regression metrics such as Mean Absolute Error, Mean Squared Error, and R2 score, and with clustering metrics such as Adjusted Rand Index, Homogeneity, and V-measure; it is also checked with F-Measure, AUC, and cross-validation. Thus, proper identification of patients with breast cancer would create care opportunities; for example, the supervision and implementation of intervention plans could benefit the quality of long-term care. Experimental results reveal that the maximum precision of 100% with the lowest error rate is obtained with the AdaBoost classifier.
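
A replicated 10-fold cross-validation, as mentioned in this abstract, might generate its train/test index splits along the following lines. This is a minimal sketch: the number of repetitions, the seed, and the function name are assumptions, since the abstract does not state them.

```python
import random


def repeated_kfold(n_samples, n_splits=10, n_repeats=3, seed=0):
    """Yield (train_indices, test_indices) for each fold of each repetition."""
    rng = random.Random(seed)
    for _ in range(n_repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)  # a fresh random partition per repetition
        for fold in range(n_splits):
            test = indices[fold::n_splits]  # every n_splits-th shuffled index
            held_out = set(test)
            train = [i for i in indices if i not in held_out]
            yield train, test


# Hypothetical dataset size, for illustration only.
splits = list(repeated_kfold(n_samples=25, n_splits=10, n_repeats=2))
print(len(splits))
```

Within one repetition every sample lands in exactly one test fold, so averaging a metric over the folds uses each sample for testing once per repetition.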

Estimation of Potato Yield Using Satellite Data at a Municipal Level: A Machine Learning Approach

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9060343 ◽

2020 ◽

Vol 9 (6) ◽

pp. 343 ◽

Cited By ~ 1

Author(s):

Pablo Salvador ◽

Diego Gómez ◽

Julia Sanz ◽

José Luis Casanova

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Mean Squared Error ◽

Irrigation System ◽

Meteorological Data ◽

Potato Yield ◽

Machine Learning Algorithms ◽

Support Vector ◽

Improved Model ◽

Municipal Level

Crop growth modeling and yield forecasting are essential to improve food security policies worldwide. To estimate potato (Solanum tuberosum L.) yield over Mexico at a municipal level, we used meteorological data provided by the ERA5 (ECMWF Re-Analysis) dataset developed by the Copernicus Climate Change Service, satellite imagery from the TERRA platform, and field information. Five different machine learning algorithms were used to build the models: random forest (rf), support vector machine linear (svmL), support vector machine polynomial (svmP), support vector machine radial (svmR), and general linear model (glm). The optimized models were tested using independent data (2017 and 2018) not used in the training and optimization phase (2004–2016). In terms of percent root mean squared error (%RMSE), the best results were obtained by the rf algorithm in the winter cycle using variables from the first three months of the cycle (R2 = 0.757 and %RMSE = 18.9). For the summer cycle, the best performing model was svmP, which used the first five months of the cycle as variables (R2 = 0.858 and %RMSE = 14.9). Our results indicated that adding predictor variables from the last two months before the harvest did not significantly improve model performance. These results demonstrate that our models can predict potato yield by analyzing the yield of the previous year, the general conditions of NDVI, meteorology, and information related to the irrigation system at a municipal level.
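
The year-based holdout and the %RMSE score described in this abstract can be illustrated with the sketch below. It assumes %RMSE means RMSE divided by the mean observed value, times 100, which the abstract does not spell out; the row layout, field names, and sample values are hypothetical.

```python
import math


def split_by_year(rows, test_years):
    """Hold out whole years for testing; train on all remaining years."""
    train = [r for r in rows if r["year"] not in test_years]
    test = [r for r in rows if r["year"] in test_years]
    return train, test


def percent_rmse(observed, predicted):
    """RMSE as a percentage of the mean observed value (assumed definition)."""
    n = len(observed)
    rmse = math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)
    return 100.0 * rmse / (sum(observed) / n)


# Hypothetical yearly records, for illustration only.
rows = [{"year": year, "yield_t_ha": 25.0} for year in range(2004, 2019)]
train, test = split_by_year(rows, test_years={2017, 2018})
print(len(train), len(test))
print(percent_rmse([20.0, 30.0], [25.0, 25.0]))
```

Holding out entire years, rather than random rows, keeps the test data independent of the seasons seen during training and optimization.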

Demand forecasting at retail stage for selected vegetables: a performance analysis

Journal of Modelling in Management ◽

10.1108/jm2-11-2018-0192 ◽

2019 ◽

Vol 14 (4) ◽

pp. 1042-1063 ◽

Cited By ~ 1

Author(s):

Rahul Priyadarshi ◽

Akash Panigrahi ◽

Srikanta Routroy ◽

Girish Kant Garg

Keyword(s):

Machine Learning ◽

Performance Analysis ◽

Forecast Error ◽

Demand Forecasting ◽

Machine Learning Algorithms ◽

Gradient Boosting ◽

Support Vector ◽

Forecasting Model ◽

Content Type ◽

Forecasting Models

Purpose: The purpose of this study is to select the appropriate forecasting model at the retail stage for selected vegetables on the basis of performance analysis. Design/methodology/approach: Various forecasting models, such as the Box–Jenkins-based auto-regressive integrated moving average model and machine learning-based algorithms such as long short-term memory (LSTM) networks, support vector regression (SVR), random forest regression, gradient boosting regression (GBR), and extreme GBR (XGBoost/XGBR), were proposed and applied (i.e. modeling, training, testing, and predicting) at the retail stage for selected vegetables to forecast demand. The performance analysis (i.e. forecasting error analysis) was carried out to select the appropriate forecasting model at the retail stage for selected vegetables. Findings: From the results obtained for a case environment, it was observed that the machine learning algorithms LSTM and SVR produced better results than the other demand forecasting models. Research limitations/implications: The results obtained from the case environment cannot be generalized. However, the approach may be used for forecasting different agricultural produce at the retail stage, capturing its demand environment. Practical implications: The implementation of LSTM and SVR for the case situation at the retail stage will reduce the forecast error, daily retail inventory, and fresh produce wastage, and will increase the daily revenue. Originality/value: The demand forecasting model selection for agricultural produce at the retail stage on the basis of performance analysis is a unique study in which both traditional and non-traditional models were analyzed and compared.

Estimating the Total Organic Carbon for Unconventional Shale Resources During the Drilling Process: A Machine Learning Approach

Journal of Energy Resources Technology ◽

10.1115/1.4051737 ◽

2021 ◽

pp. 1-26

Author(s):

Ahmed Mahmoud ◽

Hany Gamal ◽

Salaheldin Elkatatny ◽

Ahmed Alsaihati

Keyword(s):

Machine Learning ◽

Organic Carbon ◽

Total Organic Carbon ◽

Gamma Ray ◽

Fuzzy Inference ◽

Support Vector ◽

Drilling Process ◽

Validation Data ◽

Inference System ◽

Data Points

Abstract: Total organic carbon (TOC) is an essential parameter that indicates the quality of unconventional reservoirs. In this study, four machine learning (ML) algorithms, namely the adaptive neuro-fuzzy inference system (ANFIS), support vector regression (SVR), functional neural networks (FNN), and random forests (RF), were optimized to evaluate the TOC. The novelty of this work is that the optimized models predict the TOC from the bulk gamma-ray (GR) and spectral GR logs of uranium, thorium, and potassium only. The ML algorithms were trained on 749 datasets from Well-1, tested on 226 datasets from Well-2, and validated on 73 data points from Well-3. The predictability of the optimized algorithms was also compared with the available equations. The results of this study indicated that the optimized ANFIS, SVR, and RF models outperformed the available empirical equations in predicting the TOC. For the validation data of Well-3, the optimized ANFIS, SVR, and RF algorithms predicted the TOC with AAPEs of 10.6%, 12.0%, and 8.9%, respectively, compared with an AAPE of 21.1% when the FNN model was used. For the same data, the TOC was assessed with AAPEs of 48.6%, 24.6%, 20.2%, and 17.8% when the Schmoker model, the ΔlogR method, the Zhao et al. correlation, and the Mahmoud et al. correlation were used, respectively. The optimized models could be applied to estimate the TOC during the drilling process if the drillstring is equipped with GR and spectral GR logging tools.
