regression algorithms
Recently Published Documents


TOTAL DOCUMENTS

259
(FIVE YEARS 138)

H-INDEX

21
(FIVE YEARS 7)

Complexity ◽  
2022 ◽  
Vol 2022 ◽  
pp. 1-11
Author(s):  
Marium Mehmood ◽  
Nasser Alshammari ◽  
Saad Awadh Alanazi ◽  
Fahad Ahmad

The liver is the human body’s mandatory organ, but detecting liver disease at an early stage is very difficult due to the hiddenness of symptoms. Liver diseases may cause loss of energy or weakness when some irregularities in the working of the liver get visible. Cancer is one of the most common diseases of the liver and also the most fatal of all. Uncontrolled growth of harmful cells is developed inside the liver. If diagnosed late, it may cause death. Treatment of liver diseases at an early stage is, therefore, an important issue as is designing a model to diagnose early disease. Firstly, an appropriate feature should be identified which plays a more significant part in the detection of liver cancer at an early stage. Therefore, it is essential to extract some essential features from thousands of unwanted features. So, these features will be mined using data mining and soft computing techniques. These techniques give optimized results that will be helpful in disease diagnosis at an early stage. In these techniques, we use feature selection methods to reduce the dataset’s feature, which include Filter, Wrapper, and Embedded methods. Different Regression algorithms are then applied to these methods individually to evaluate the result. Regression algorithms include Linear Regression, Ridge Regression, LASSO Regression, Support Vector Regression, Decision Tree Regression, Multilayer Perceptron Regression, and Random Forest Regression. Based on the accuracy and error rates generated by these Regression algorithms, we have evaluated our results. The result shows that Random Forest Regression with the Wrapper Method from all the deployed Regression techniques is the best and gives the highest R2-Score of 0.8923 and lowest MSE of 0.0618.


2022 ◽  
pp. 108076
Author(s):  
João V.C. Moraes ◽  
Jéssica T.S. Reinaldo ◽  
Manuel Ferreira-Junior ◽  
Telmo Silva Filho ◽  
Ricardo B.C. Prudêncio

Author(s):  
Nader S. Santarisi ◽  
Sinan S. Faouri

In order to monitor the performance and related efficiency of a combined cycle power plant (CCPP), in addition to the best utilization of its power output, it is vital to predict its full load electrical power output. In this paper, the full load electrical power output of CCPP was predicted employing practically efficient machine learning algorithms, including linear regression, ridge regression, lasso regression, elastic net regression, random forest regression, and gradient boost regression. The original data came from an actual confidential power plant, which was working on a full load for 6 years, with four major features: ambient temperature, relative humidity, atmospheric pressure, and exhaust vacuum, and one target (electrical power output per hour). Different regression performance measures were used, including R2 (coefficient of determination), MAE (Mean Absolute Error), MSE (Mean Squared Error), RMSE (Root Mean Squared Error), and MAPE (Mean Absolute Percentage Error). Research results revealed that the gradient boost regression model outperformed other models with and without using the dimensionality reduction technique (PCA) with the highest R2 of 0.912 and 0.872, respectively, and had the lowest MAPE of 0.872 % and 1.039 %, respectively. Moreover, prediction performance dropped slightly after using the dimensionality reduction technique almost in all regression algorithms used. The novelty in this work is summarized in predicting electrical power output in a CCPP based on a few features using simpler algorithms than reported deep learning and neural networks algorithms combined. That means a lower cost and less complicated procedure as per each, however, resulting in practically accepted results according to the evaluation metrics used.


2021 ◽  
Vol Publish Ahead of Print ◽  
Author(s):  
Jacobien H.F. Oosterhoff ◽  
Benjamin Y. Gravesteijn ◽  
Aditya V. Karhade ◽  
Ruurd L. Jaarsma ◽  
Gino M.M.J. Kerkhoffs ◽  
...  

2021 ◽  
Vol 24 ◽  
pp. 1-12
Author(s):  
Anna Aksenova

The agreement of subject and predicate in Russian is actually much less trivial than it might seem at first glance. This paper deals with the case when the subject is realized by a combination of a noun with a quantifier. I analyze a set of examples with the words двое, трое, пара, тройка, десяток, сотня, тысяча, миллион and миллиард where there is a variation in predicate number agreement. Using Random Forest, CIT and Logistic Regression algorithms I prove that collective (двое, трое) and non-collective (пара, тройка, десяток, сотня, тысяча, миллион, миллиард) quantifiers exhibit different patterns of agreement. The first group tends to trigger more plural agreement, while for the second one singular agreement is more typical. Moreover, the quantifier phrase position relative to the predicate can also influence the choice of number marker on the verb.


2021 ◽  
Vol 06 (12) ◽  
Author(s):  
AKINWOLE Agnes Kikelomo ◽  

This work focused on the designing of medical diagnosis system using Supervised Machine Learning. Logistics Regression Algorithms (LRA) was adopted, the label inputs for the data set which the symptoms were trained and mapped with the input of the user. Diagnosis of malaria was considered in this work; the system verified the value of the logical regression in the medical decision support system. Medical practitioners and other health workers can use this system to make better decisions in medical diagnosis for malaria. Adoption of this system will reduce stress of diagnoses malaria from patient and reduce congestion in our hospitals.


2021 ◽  
Vol 931 (1) ◽  
pp. 012013
Author(s):  
Le Thi Nhut Suong ◽  
A V Bondarev ◽  
E V Kozlova

Abstract Geochemical studies of organic matter in source rocks play an important role in predicting the oil and gas accumulation of any territory, especially in oil and gas shale. For deep understanding, pyrolytic analyses are often carried out on samples before and after extraction of hydrocarbon with chloroform. However, extraction is a laborious and time-consuming process and the workload of laboratory equipment and time doubles. In this work, machine learning regression algorithms is applied for forecasting S2ex based on the pyrolytic analytic result of non-extracted samples. This study is carried out using more than 300 samples from 3 different wells in Bazhenov formation, Western Siberia. For developing a prediction model, 5 different machine learning regression algorithms including Multiple Linear Regression, Polynomial Regression, Support vector regression, Decision tree and Random forest have been tested and compared. The performance of these algorithms is examined by R-squared coefficient. The data of the X2 well was used for building a model. Simultaneously, this data is divided into 2 parts – 80% for training and 20% for checking. The model also was used for prediction of wells X1 and X3. Then, these predictive results were compared with the real results, which had been obtained from standard experiments. Despite limited amount of data, the result exceeded all expectations. The result of prediction also showcases that the relationship between before and after extraction parameters are complex and non-linear. The proof is R2 value of Multiple Linear Regression and Polynomial Regression is negative, which means the model is broken. However, Random forest and Decision tree give us a good performance. With the same algorithms, we can apply for prediction all geochemical parameters by depth or utilize them for well-logging data.


2021 ◽  
Author(s):  
Xinxing Chen ◽  
Zijian Liu ◽  
Jiale Zhu ◽  
Kuangen Zhang ◽  
Yuquan Leng ◽  
...  

Author(s):  
Elizalde Lopez Piol ◽  
◽  
Luisito Lolong Lacatan ◽  
Jaime P. Pulumbarit

— By fitting a linear equation to observable values, linear regression determines the relationship between two variables. The Department of Education enrollment data in the Philippines, specifically in the School Division of Batangas, is needed to produce modules. The data collected is from the division office, where data cleaning was applied. Deep Learning, Decision Tree, Random Forest, Gradient Boosted Tree, Support Vector Machine, and Linear Regression were used to perform the prediction, and linear regression performed the best with an absolute value of 14.465 and a relative error of 84.81%. Keywords— Prediction, Information Management, Linear Regression, Cloud Computing, LDM


Sign in / Sign up

Export Citation Format

Share Document