regression trees
Recently Published Documents


TOTAL DOCUMENTS

842
(FIVE YEARS 232)

H-INDEX

51
(FIVE YEARS 7)

2022 ◽  
Vol 2161 (1) ◽  
pp. 012053
Author(s):  
B P Ashwini ◽  
R Sumathi ◽  
H S Sudhira

Abstract Congested roads are a global problem, and increased usage of private vehicles is one of the main reasons for congestion. Public transit modes of travel are a sustainable and eco-friendly alternative for private vehicle usage, but attracting commuters towards public transit mode is a mammoth task. Commuters expect the public transit service to be reliable, and to provide a reliable service it is necessary to fine-tune the transit operations and provide well-timed necessary information to commuters. In this context, the public transit travel time is predicted in Tumakuru, a tier-2 city of Karnataka, India. As this is one of the initial studies in the city, the performance comparison of eight Machines Learning models including four linear namely, Linear Regression, Ridge Regression, Least Absolute Shrinkage and Selection Operator Regression, and Support Vector Regression; and four non-linear models namely, k-Nearest Neighbors, Regression Trees, Random Forest Regression, and Gradient Boosting Regression Trees is conducted to identify a suitable model for travel time predictions. The data logs of one month (November 2020) of the Tumakuru city service, provided by Tumakuru Smart City Limited are used for the study. The time-of-the-day (trip start time), day-of-the-week, and direction of travel are used for the prediction. Travel time for both upstream and downstream are predicted, and the results are evaluated based on the performance metrics. The results suggest that the performance of non-linear models is superior to linear models for predicting travel times, and Random Forest Regression was found to be a better model as compared to other models.


2021 ◽  
Vol 6 (2) ◽  
pp. 127-136
Author(s):  
Pungkas Subarkah ◽  
Ali Nur Ikhsan

With the increase in internet users and the development of technology, the threats to its security are increasingly diverse. One of them is phishing which is the most important issue in cyberspace. Phishing is a threatening and trapping activity someone by luring the target to indirectly provide information to the trapper. The number of phishing crimes, this has the potential to cause several losses, one of which is namely about the loss of privacy of a person or company. This study aims to identify phishing websites. The Classification And Regression Trees (CART) algorithm is one of the classification algorithms, and the dataset in this research taken from the UCI Repository Learning obtained from the University of Huddersfield. The method used in this research is problem identification, data collection, pre-processing stage, use of the CART algorithm, validation and evaluation and withdrawal conclusion. Based on the test results obtained the value of accuracy of 95.28%. Thus the value of the accuracy obtained using the CART algorithm of 95.28% categorized very good classification.


Tribologia ◽  
2021 ◽  
Vol 297 (3) ◽  
pp. 19-26
Author(s):  
Michał Kekez ◽  
Wojciech Jurczak ◽  
Dariusz Ozimina

The paper presents an analysis of the sound level recorded during dry sliding friction conditions. Balls with a diameter of 6 mm placed on pins were made of 100Cr6 steel, silicon carbide (SiC), and corundum (Al2O3), while rotating discs with a height of 6 mm and a diameter of 42 mm were made of 100Cr6 steel. Each pin and disc system was tested for two values of the relative humidity of the air (50 ± 5% and 90 ± 5%). Models of the A-sound level were developed using regression trees and random forest. The paper presents an analysis of the accuracy of the models obtained. Classifications of the six tests performed on the basis of sound level descriptors were also carried out.


Ground Water ◽  
2021 ◽  
Author(s):  
Katherine J. Knierim ◽  
James A. Kingsbury ◽  
Kenneth Belitz ◽  
Paul E. Stackelberg ◽  
Burke J. Minsley ◽  
...  

2021 ◽  
Vol 20 ◽  
pp. 650-656
Author(s):  
Eva Fadilah Ramadhani ◽  
Adji Achmad Rinaldo Fernandes ◽  
Ni Wayan Surya Wardhani

This study aims to determine the best classification results among discriminant analysis, CART, and Adaboost CART on Bank X's Home Ownership Credit (KPR) customers. This study uses secondary data which contains notes on the 5C assessment (Collateral, Character, Capacity, Condition, Capital) and collectibility of current and non-current loans. The sample used in this study was from 2000 debtors. Comparison of classifications based on model accuracy, sensitivity, and overall specificity shows that Adaboost CART is the best method for classifying credit collectibility at Bank X. This is due to the class imbalance in the data. This study compares the classification results between parametric statistics, namely discriminant analysis and non-parametric statistics, namely CART and Adaboost CART. The results of the research can be used as material for consideration and evaluation for banks in determining the policy for providing credit to prospective borrowers from the classification results of KPR Bank X consumers.


2021 ◽  
pp. 395-414
Author(s):  
Carlos M. Carvalho ◽  
Edward I. George ◽  
P. Richard Hahn ◽  
Robert E. McCulloch

2021 ◽  
Vol 66 (3) ◽  
pp. 597-608
Author(s):  
Adrianna Zańko ◽  
Karolina Milewska ◽  
Marcin Warpechowski ◽  
Robert Milewski

Abstract Many studies confirm the fact that women do not have sufficient knowledge about reproductive health, which is a significant problem nowadays due to the large percentage of people who suffer from infertility. A sources of knowledge from which information about health, including reproductive health, is obtained have various levels of reliability. The aim of the study was to use regression trees to find which of the analysed parameters had the greatest impact on the level of respondents’ knowledge about fertility and the impact of diet on fertility. The study was conducted among women who practice dance in Max Dance studio in Białystok. The group consisted of 42 women with an average age of 26.3 years, dancing in various dance styles at various levels of proficiency. A questionnaire on lifestyle and a sources of information on fertility was used; the questionnaire also contained a knowledge test focused on reproductive health and the impact of diet on fertility, in which the questions were based on information from the latest research. Three regression trees were created for three indicators determining the level of respondents’ knowledge. The obtained results revealed certain areas that have a significant impact on the level of knowledge about reproductive health, which may require additional education. The use of the regression trees method made it possible to determine the relationships between the analysed data that were not fully visible after standard biostatistical analyses had been performed. The created trees can be useful in improving the process of disseminating knowledge about reproductive health among women of childbearing age.


Sign in / Sign up

Export Citation Format

Share Document