Machine Learning Models for Predicting and Classifying the Tensile Strength of Polymeric Films Fabricated via Different Production Processes

Safwan Altarazi; Rula Allaf; Firas Alhindawi

doi:10.3390/ma12091475

Machine Learning Models for Predicting and Classifying the Tensile Strength of Polymeric Films Fabricated via Different Production Processes

Materials ◽

10.3390/ma12091475 ◽

2019 ◽

Vol 12 (9) ◽

pp. 1475 ◽

Cited By ~ 3

Author(s):

Safwan Altarazi ◽

Rula Allaf ◽

Firas Alhindawi

Keyword(s):

Machine Learning ◽

Tensile Strength ◽

Predictive Ability ◽

Classification Performance ◽

Machine Learning Algorithms ◽

Polymeric Films ◽

Coefficient Of Determination ◽

Percentage Error ◽

Support Vector ◽

Extrusion Blow Molding

In this study, machine learning algorithms (MLA) were employed to predict and classify the tensile strength of polymeric films of different compositions as a function of processing conditions. Two film production techniques were investigated, namely compression molding and extrusion-blow molding. Multi-factor experiments were designed with corresponding parameters. A tensile test was conducted on samples and the tensile strength was recorded. Predictive and classification models from nine MLA were developed. Performance analysis demonstrated the superior predictive ability of the support vector machine (SVM) algorithm, in which a coefficient of determination and mean absolute percentage error of 96% and 4%, respectively were obtained for the extrusion-blow molded films. The classification performance of the MLA was also evaluated, with several algorithms exhibiting excellent performance.

Get full-text (via PubEx)

Machine Learning for Sensorless Temperature Estimation of a BLDC Motor

Sensors ◽

10.3390/s21144655 ◽

2021 ◽

Vol 21 (14) ◽

pp. 4655

Author(s):

Dariusz Czerwinski ◽

Jakub Gęca ◽

Krzysztof Kolano

Keyword(s):

Machine Learning ◽

Temperature Measurement ◽

Stochastic Gradient Descent ◽

Estimation Accuracy ◽

Coefficient Of Determination ◽

Percentage Error ◽

Support Vector ◽

Bldc Motor ◽

Temperature Estimation ◽

Motor Operation

In this article, the authors propose two models for BLDC motor winding temperature estimation using machine learning methods. For the purposes of the research, measurements were made for over 160 h of motor operation, and then, they were preprocessed. The algorithms of linear regression, ElasticNet, stochastic gradient descent regressor, support vector machines, decision trees, and AdaBoost were used for predictive modeling. The ability of the models to generalize was achieved by hyperparameter tuning with the use of cross-validation. The conducted research led to promising results of the winding temperature estimation accuracy. In the case of sensorless temperature prediction (model 1), the mean absolute percentage error MAPE was below 4.5% and the coefficient of determination R2 was above 0.909. In addition, the extension of the model with the temperature measurement on the casing (model 2) allowed reducing the error value to about 1% and increasing R2 to 0.990. The results obtained for the first proposed model show that the overheating protection of the motor can be ensured without direct temperature measurement. In addition, the introduction of a simple casing temperature measurement system allows for an estimation with accuracy suitable for compensating the motor output torque changes related to temperature.

Get full-text (via PubEx)

Prediction of Healing Performance of Autogenous Healing Concrete Using Machine Learning

Materials ◽

10.3390/ma14154068 ◽

2021 ◽

Vol 14 (15) ◽

pp. 4068

Author(s):

Xu Huang ◽

Mirna Wasouf ◽

Jessada Sresakoolchai ◽

Sakdirat Kaewunruen

Keyword(s):

Machine Learning ◽

Search Algorithm ◽

Weather Conditions ◽

Prediction Performance ◽

Machine Learning Algorithms ◽

Coefficient Of Determination ◽

Gradient Boosting ◽

Support Vector ◽

Self Healing ◽

Artificial Neural Network Ann

Cracks typically develop in concrete due to shrinkage, loading actions, and weather conditions; and may occur anytime in its life span. Autogenous healing concrete is a type of self-healing concrete that can automatically heal cracks based on physical or chemical reactions in concrete matrix. It is imperative to investigate the healing performance that autogenous healing concrete possesses, to assess the extent of the cracking and to predict the extent of healing. In the research of self-healing concrete, testing the healing performance of concrete in a laboratory is costly, and a mass of instances may be needed to explore reliable concrete design. This study is thus the world’s first to establish six types of machine learning algorithms, which are capable of predicting the healing performance (HP) of self-healing concrete. These algorithms involve an artificial neural network (ANN), a k-nearest neighbours (kNN), a gradient boosting regression (GBR), a decision tree regression (DTR), a support vector regression (SVR) and a random forest (RF). Parameters of these algorithms are tuned utilising grid search algorithm (GSA) and genetic algorithm (GA). The prediction performance indicated by coefficient of determination (R2) and root mean square error (RMSE) measures of these algorithms are evaluated on the basis of 1417 data sets from the open literature. The results show that GSA-GBR performs higher prediction performance (R2GSA-GBR = 0.958) and stronger robustness (RMSEGSA-GBR = 0.202) than the other five types of algorithms employed to predict the healing performance of autogenous healing concrete. Therefore, reliable prediction accuracy of the healing performance and efficient assistance on the design of autogenous healing concrete can be achieved.

Get full-text (via PubEx)

Performance Comparison of Machine Learning Algorithms for Estimating the Soil Salinity of Salt-Affected Soil Using Field Spectral Data

Remote Sensing ◽

10.3390/rs11222605 ◽

2019 ◽

Vol 11 (22) ◽

pp. 2605 ◽

Cited By ~ 3

Author(s):

Wang ◽

Chen ◽

Wang ◽

Keyword(s):

Machine Learning ◽

Spectral Data ◽

Soil Salinity ◽

Hyperspectral Data ◽

Machine Learning Algorithms ◽

Percentage Error ◽

Support Vector ◽

Boosted Regression Tree ◽

Data Noise ◽

Salt Affected Soil

Salt-affected soil is a prominent ecological and environmental problem in dry farming areas throughout the world. China has nearly 9.9 million km2 of salt-affected land. The identification, monitoring, and utilization of soil salinization have become important research topics for promoting sustainable progress. In this paper, using field-measured spectral data and soil salinity parameter data, through analysis and transformation of spectral data, five machine learning models, namely, random forest regression (RFR), support vector regression (SVR), gradient-boosted regression tree (GBRT), multilayer perceptron regression (MLPR), and least angle regression (Lars) are compared. The following performance measures of each model were evaluated: the collinear problems, handling data noise, stability, and the accuracy. In terms of these four aspects, the performance of each model on estimating soil salinity is evaluated. The results demonstrate that among the five models, RFR has the best performance in dealing with collinearity, RFR and MLPR have the best performance in dealing with data noise, and the SVR model is the most stable. The Lars model has the highest accuracy, with a determination coefficient (R2) of 0.87, ratio of performance to deviation (RPD) of 2.67, root mean square error (RMSE) of 0.18, and mean absolute percentage error (MAPE) of 0.11. Then, the comprehensive comparison and analysis of the five models are carried out, and it is found that the comprehensive performance of RFR model is the best; hence, this method is most suitable for estimating soil salinity using hyperspectral data. This study can provide a reference for the selection of regression methods in subsequent studies on estimating soil salinity using hyperspectral data.

Get full-text (via PubEx)

Air Quality Index and Air Pollutant Concentration Prediction Based on Machine Learning Algorithms

Applied Sciences ◽

10.3390/app9194069 ◽

2019 ◽

Vol 9 (19) ◽

pp. 4069 ◽

Cited By ~ 2

Author(s):

Huixiang Liu ◽

Qing Li ◽

Dongbing Yu ◽

Yu Gu

Keyword(s):

Machine Learning ◽

Air Pollution ◽

Air Quality ◽

Regression Models ◽

Quality Index ◽

Air Pollutant ◽

Air Quality Index ◽

Machine Learning Algorithms ◽

Coefficient Of Determination ◽

Support Vector

Air pollution has become an important environmental issue in recent decades. Forecasts of air quality play an important role in warning people about and controlling air pollution. We used support vector regression (SVR) and random forest regression (RFR) to build regression models for predicting the Air Quality Index (AQI) in Beijing and the nitrogen oxides (NOX) concentration in an Italian city, based on two publicly available datasets. The root-mean-square error (RMSE), correlation coefficient (r), and coefficient of determination (R2) were used to evaluate the performance of the regression models. Experimental results showed that the SVR-based model performed better in the prediction of the AQI (RMSE = 7.666, R2 = 0.9776, and r = 0.9887), and the RFR-based model performed better in the prediction of the NOX concentration (RMSE = 83.6716, R2 = 0.8401, and r = 0.9180). This work also illustrates that combining machine learning with air quality prediction is an efficient and convenient way to solve some related environment problems.

Get full-text (via PubEx)

Automatic recognition of self-acknowledged limitations in clinical research literature

Journal of the American Medical Informatics Association ◽

10.1093/jamia/ocy038 ◽

2018 ◽

Vol 25 (7) ◽

pp. 855-861 ◽

Cited By ~ 4

Author(s):

Halil Kilicoglu ◽

Graciela Rosemblat ◽

Mario Malički ◽

Gerben ter Riet

Keyword(s):

Machine Learning ◽

Clinical Research ◽

Binary Classification ◽

Classification Performance ◽

Research Literature ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Support Vector ◽

Rule Based ◽

Research Transparency

Abstract Objective To automatically recognize self-acknowledged limitations in clinical research publications to support efforts in improving research transparency. Methods To develop our recognition methods, we used a set of 8431 sentences from 1197 PubMed Central articles. A subset of these sentences was manually annotated for training/testing, and inter-annotator agreement was calculated. We cast the recognition problem as a binary classification task, in which we determine whether a given sentence from a publication discusses self-acknowledged limitations or not. We experimented with three methods: a rule-based approach based on document structure, supervised machine learning, and a semi-supervised method that uses self-training to expand the training set in order to improve classification performance. The machine learning algorithms used were logistic regression (LR) and support vector machines (SVM). Results Annotators had good agreement in labeling limitation sentences (Krippendorff’s α = 0.781). Of the three methods used, the rule-based method yielded the best performance with 91.5% accuracy (95% CI [90.1-92.9]), while self-training with SVM led to a small improvement over fully supervised learning (89.9%, 95% CI [88.4-91.4] vs 89.6%, 95% CI [88.1-91.1]). Conclusions The approach presented can be incorporated into the workflows of stakeholders focusing on research transparency to improve reporting of limitations in clinical studies.

Get full-text (via PubEx)

Survey of Machine Learning Algorithms to Detect Malware in Consumer Internet of Things Devices

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213021500202 ◽

2021 ◽

Vol 30 (04) ◽

pp. 2150020

Author(s):

Luke Holbrook ◽

Miltiadis Alamaniotis

Keyword(s):

Neural Network ◽

Machine Learning ◽

Internet Of Things ◽

Deep Neural Network ◽

Learning Algorithms ◽

Cyber Attacks ◽

Machine Learning Algorithms ◽

Coefficient Of Determination ◽

Support Vector ◽

Iot Devices

With the increase of cyber-attacks on millions of Internet of Things (IoT) devices, the poor network security measures on those devices are the main source of the problem. This article aims to study a number of these machine learning algorithms available for their effectiveness in detecting malware in consumer internet of things devices. In particular, the Support Vector Machines (SVM), Random Forest, and Deep Neural Network (DNN) algorithms are utilized for a benchmark with a set of test data and compared as tools in safeguarding the deployment for IoT security. Test results on a set of 4 IoT devices exhibited that all three tested algorithms presented here detect the network anomalies with high accuracy. However, the deep neural network provides the highest coefficient of determination R2, and hence, it is identified as the most precise among the tested algorithms concerning the security of IoT devices based on the data sets we have undertaken.

Get full-text (via PubEx)

Machine Learning Algorithms for the Forecasting of Wastewater Quality Indicators

Water ◽

10.3390/w9020105 ◽

2017 ◽

Vol 9 (2) ◽

pp. 105 ◽

Cited By ~ 44

Author(s):

Francesco Granata ◽

Stefano Papirio ◽

Giovanni Esposito ◽

Rudy Gargano ◽

Giovanni De Marinis

Keyword(s):

Machine Learning ◽

Support Vector Regression ◽

Quality Indicators ◽

Drainage Basin ◽

Learning Algorithms ◽

Oxygen Demand ◽

Machine Learning Algorithms ◽

Coefficient Of Determination ◽

Support Vector ◽

Wastewater Quality

Stormwater runoff is often contaminated by human activities. Stormwater discharge into water bodies significantly contributes to environmental pollution. The choice of suitable treatment technologies is dependent on the pollutant concentrations. Wastewater quality indicators such as biochemical oxygen demand (BOD5), chemical oxygen demand (COD), total suspended solids (TSS), and total dissolved solids (TDS) give a measure of the main pollutants. The aim of this study is to provide an indirect methodology for the estimation of the main wastewater quality indicators, based on some characteristics of the drainage basin. The catchment is seen as a black box: the physical processes of accumulation, washing, and transport of pollutants are not mathematically described. Two models deriving from studies on artificial intelligence have been used in this research: Support Vector Regression (SVR) and Regression Trees (RT). Both the models showed robustness, reliability, and high generalization capability. However, with reference to coefficient of determination R2 and root‐mean square error, Support Vector Regression showed a better performance than Regression Tree in predicting TSS, TDS, and COD. As regards BOD5, the two models showed a comparable performance. Therefore, the considered machine learning algorithms may be useful for providing an estimation of the values to be considered for the sizing of the treatment units in absence of direct measures.

Get full-text (via PubEx)

Comparative Study of Hybrid Artificial Intelligence Approaches for Predicting Peak Shear Strength Along Soil-Geocomposite Drainage Layer Interfaces

International Journal of Geosynthetics and Ground Engineering ◽

10.1007/s40891-021-00299-2 ◽

2021 ◽

Vol 7 (3) ◽

Author(s):

Zhiming Chao ◽

Gary Fowmes ◽

S. M. Dassanayake

Keyword(s):

Machine Learning ◽

Shear Strength ◽

Empirical Equation ◽

Soil Layer ◽

Coefficient Of Determination ◽

Percentage Error ◽

Support Vector ◽

Mechanism Analysis ◽

Peak Shear Strength ◽

Engineering Structures

AbstractPeak shear strength of soil-Geocomposite Drain Layer (GDL) interfaces is an important parameter in the designing and operating related engineering structures. In this paper, a database compiled from 316 large direct shear tests on soil-GDL interfaces has been established. Based on this database, five different machine learning models: Back Propagation Artificial Neural Network (BPANN) and Support Vector Machine (SVM), with hyperparameters optimised by Particle Swarm Optimisation Algorithm (PSO) and Genetic Algorithm (GA), respectively, and Extreme Learning Machine (ELM) optimised by Exhaustive Method, were adopt to assess the peak shear strength of soil-GDL interfaces. Then, a comprehensive investigation and comparison of the predictive performance for the models was conducted. Also, based on the selected optimal machine learning model, sensitivity analysis was conducted, and an empirical equation developed based on it. The research indicated that GA and PSO could significantly increase forecasting precision in a small number of iterations. The BPANN model optimised by PSO has the highest forecasting precision based on the statistics criteria: Root-Mean-Square Error, Correlation Coefficient, Coefficient of Determination, Wilmot’s Index of Agreement, and Mean Absolute Percentage Error. The normal stress has the biggest impact on the peak shear strength, followed by drainage core type, moisture saturation of the soil layer, shearing surface, soil type, consolidation condition, geotextile specification, soil density and drainage core thickness, and the ranking is affected partly by the data distribution of input parameters in the database based on mechanism analysis. An empirical equation developed from the optimal model was proposed to estimate the peak shear strength, which provides convenience for geotechnical engineering personnel with limited knowledge of machine learning technique.

Get full-text (via PubEx)

Identification of Risk Factors for Suicidal Ideation and Attempt Based on Machine Learning Algorithms: A Longitudinal Survey in Korea (2007–2019)

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph182312772 ◽

2021 ◽

Vol 18 (23) ◽

pp. 12772

Author(s):

Junggu Choi ◽

Seoyoung Cho ◽

Inhwan Ko ◽

Sanghoon Han

Keyword(s):

Machine Learning ◽

Risk Factors ◽

Suicidal Ideation ◽

Suicide Risk ◽

Large Scale ◽

Learning Algorithms ◽

Sociodemographic Factors ◽

Classification Performance ◽

Machine Learning Algorithms ◽

Support Vector

Investigating suicide risk factors is critical for socioeconomic and public health, and many researchers have tried to identify factors associated with suicide. In this study, the risk factors for suicidal ideation were compared, and the contributions of different factors to suicidal ideation and attempt were investigated. To reflect the diverse characteristics of the population, the large-scale and longitudinal dataset used in this study included both socioeconomic and clinical variables collected from the Korean public. Three machine learning algorithms (XGBoost classifier, support vector classifier, and logistic regression) were used to detect the risk factors for both suicidal ideation and attempt. The importance of the variables was determined using the model with the best classification performance. In addition, a novel risk-factor score, calculated from the rank and importance scores of each variable, was proposed. Socioeconomic and sociodemographic factors showed a high correlation with risks for both ideation and attempt. Mental health variables ranked higher than other factors in suicidal attempts, posing a relatively higher suicide risk than ideation. These trends were further validated using the conditions from the integrated and yearly dataset. This study provides novel insights into suicidal risk factors for suicidal ideations and attempts.

Get full-text (via PubEx)

A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering

Methods of Information in Medicine ◽

10.3414/me16-01-0116 ◽

2017 ◽

Vol 56 (03) ◽

pp. 209-216 ◽

Cited By ~ 10

Author(s):

Said Ouatik El Alaoui ◽

Mourad Sarrouti

Keyword(s):

Machine Learning ◽

Question Answering ◽

Classification Performance ◽

Machine Learning Algorithms ◽

Question Type ◽

Support Vector ◽

Learning Approaches ◽

Answer Extraction ◽

Improved Performance ◽

Type Classification

SummaryBackground and Objective: Biomedical question type classification is one of the important components of an automatic biomedical question answering system. The performance of the latter depends directly on the performance of its biomedical question type classification system, which consists of assigning a category to each question in order to determine the appropriate answer extraction algorithm. This study aims to automatically classify biomedical questions into one of the four categories: (1) yes/no, (2) factoid, (3) list, and (4) summary.Methods: In this paper, we propose a biomedical question type classification method based on machine learning approaches to automatically assign a category to a biomedical question. First, we extract features from biomedical questions using the proposed handcrafted lexico-syntactic patterns. Then, we feed these features for machine- learning algorithms. Finally, the class label is predicted using the trained classifiers.Results: Experimental evaluations performed on large standard annotated datasets of biomedical questions, provided by the BioASQ challenge, demonstrated that our method exhibits significant improved performance when compared to four baseline systems. The proposed method achieves a roughly 10-point increase over the best baseline in terms of accuracy. Moreover, the obtained results show that using handcrafted lexico-syntactic patterns as features’ provider of support vector machine (SVM) lead to the highest accuracy of 89.40%.Conclusion: The proposed method can automatically classify BioASQ questions into one of the four categories: yes/no, factoid, list, and summary. Furthermore, the results demonstrated that our method produced the best classification performance compared to four baseline systems.

Get full-text (via PubEx)