Two-Level Regression Method Using Ensembles of Trees with Optimal Divergence

Abstract The article discusses a new two-level regression analysis method in which a corrective procedure is applied to optimal ensembles of regression trees. Optimization is carried out based on the simultaneous achievement of the divergence of the algorithms in the forecast space and a good approximation of the data by individual algorithms of the ensemble. Simple averaging, random regression forest, and gradient boosting are used as corrective procedures. Experiments are presented comparing the proposed method with the standard decision forest and the standard gradient boosting method for decision trees.

Download Full-text

Analisa Pengaruh Motivasi Konsumen, Persepsi Kualitas, Dan Sikap Konsumen Terhadap Keputusan Pembelian Sepeda Motor PT. Yamaha Indonesia MFG

CICES ◽

10.33050/cices.v5i2.619 ◽

2019 ◽

Vol 5 (2) ◽

pp. 188-203

Author(s):

Ria Wulandari ◽

M. Ifran Sanni ◽

Dani Ramadhan

Keyword(s):

Decision Making ◽

Regression Analysis ◽

Qualitative Analysis ◽

Purchase Decisions ◽

Analysis Method ◽

Purchasing Decisions ◽

Processing Data ◽

Customer Purchases ◽

Customer Attitudes ◽

Quantitative And Qualitative Methods

This research is motivated by a decline in motorcycle sales produced by PT. Yamaha Indonesia MFG in the 2014-2018 period. In this research there was a decrease in the decision on the power of interest in customer purchases on PT. Yamaha Indonesia MFG so that later can be analyzed in the formulation of this paper, that how customer take motorcycle purchase decisions amid the phenomenon of competition and increasingly crowded sales rivalries. The purpose of this research was to analyze the influence of motivation, perceived quality, and customer attitudes toward decisions in purchasing Yamaha motorbikes. This research uses quantitative and qualitative methods. The respondents in this research were 100 people who could meet one to five criteria consisting of; initiator (initiator), influencer (influencer), decision making (decider), purchase (buyer), user (user) motorcycle production PT. Yamaha Indonesia MFG. There are 3 hypotheses formulated and tested using the Regression Analysis method. In qualitative analysis it is obtained from the interpretation of processing data by providing information and explanation. In the results of this research shows the results of Motivation, Quality Perception, and Customer Attitudes have a relationship that has a significant impact on Purchasing Decisions.

Download Full-text

Adiksi Internet pada Remaja

TAZKIYA JOURNAL OF PSYCHOLOGY ◽

10.15408/tazkiya.v7i1.13501 ◽

2019 ◽

Vol 7 (1) ◽

pp. 72-78

Author(s):

Ismalia Prambayu ◽

Mulia Sari Dewi

Keyword(s):

Regression Analysis ◽

Social Skills ◽

Multiple Regression Analysis ◽

Multiple Regression ◽

Parenting Style ◽

Internet Addiction ◽

Psychological Factors ◽

Social Skill ◽

Analysis Method ◽

Loneliness Scale

AbstractInternet addiction has become a worrying phenomenon for Indonesian teenagers. This research was conducted to determine whether the psychological factors will influence internet addiction in adolescents. This research uses quantitative with multiple regression analysis method. The winning sample is 200 adolescents. The instrument collects data using a scale internet addiction scale that compiled by Griffiths (2005) and developed by Lemmens (2009), Parenting Authority Questionnaire (PAQ) developed by Buri (1991), Social Skill Inventory (SSI) developed by Riggio (1986), and A Rasch-Type Loneliness Scale compiled by De Jong Gierveld (2006). The results showed that there were significant differences in the parenting style, social skills, and loneliness on the tendency of internet addiction in adolescents.AbstrakAdiksi Internet menjadi salah satu fenomena yang mengkhawatirkan untuk remaja Indonesia. Penelitian ini dilakukan untuk mengetahui faktor psikologis apakah yang memberikan pengaruh terhadap kecenderungan adiksi internet pada remaja. Sampel pada penelitian ini berjumlah 200 remaja dengan menggunakan metode analisis kuantitatif. Penelitian ini menggunakan alat ukur sebagai berikut, alat ukur adiksi internet yang dikembangkan oleh Lemmens (2009), Parenting Authority Questionnaire (PAQ) yang dikembangkan oleh Buri (1991), Social Skill Inventory (SSI) yang dikembangkan oleh Riggio (1986), dan A Rasch-Type Loneliness Scale yang disusun oleh De Jong Gierveld (2006). Berdasarkan hasil pengujian ditemukan pengaruh signifikan gaya pengasuhan, keterampilan sosial, dan kesepian terhadap kecenderungan adiksi internet pada remaja.

Download Full-text

GRADIENT BOOSTING METHOD APPLICATION TO SUPPORT PROCESS DECISIONS IN THE ELECTRON-BEAM WELDING PROCESS

Siberian Journal of Science and Technology ◽

10.31772/2587-6066-2020-21-2-206-214 ◽

2020 ◽

Vol 21 (2) ◽

pp. 206-214

Author(s):

V. S. Tynchenko ◽

◽

I. A. Golovenok ◽

V. E. Petrenko ◽

A. V. Milov ◽

...

Keyword(s):

Electron Beam ◽

Electron Beam Welding ◽

Welding Process ◽

Gradient Boosting ◽

Boosting Method

Download Full-text

PENGARUH TRUST, PRICE DAN SERVICE QUALITY TERHADAP INTENTION TO PURCHASE PELANGGAN BHINNEKA.COM

Jurnal Riset Manajemen dan Bisnis (JRMB) Fakultas Ekonomi UNIAT ◽

10.36226/jrmb.v2i3.73 ◽

2017 ◽

Vol 2 (3) ◽

pp. 391-400

Author(s):

Rianto Nurcahyo ◽

Dennis Andry ◽

Kevin Kevin

Keyword(s):

Regression Analysis ◽

Data Analysis ◽

Service Quality ◽

Multiple Regression Analysis ◽

Multiple Regression ◽

Positive Influence ◽

Quantitative Approach ◽

Analysis Method ◽

Intention To Purchase ◽

Trust Intention

The purpose of this study is to identify and understand the factors that can influence the intention to purchase through trust, price, and service quality on consumers Bhinneka.com. This research uses quantitative approach by distributing questionnaires to 100 respondents Bhinneka.com. Data analysis method used in this research is simple and multiple regression analysis. The result shows that the three variables used have a positive influence on intention to purchase variable. The most dominant variables in explaining the variation of intention to purchase are service quality variable of 35.5%, the price of 17.2%, and trust of 24.6%. Keywords: service quality, price, trust, intention to purchase

Download Full-text

A Machine Learning Method for Predicting Vegetation Indices in China

Remote Sensing ◽

10.3390/rs13061147 ◽

2021 ◽

Vol 13 (6) ◽

pp. 1147

Author(s):

Xiangqian Li ◽

Wenping Yuan ◽

Wenjie Dong

Keyword(s):

Machine Learning ◽

Growing Season ◽

Crop Growth ◽

Spatiotemporal Distribution ◽

Coefficient Of Determination ◽

Gradient Boosting ◽

Severe Drought ◽

Vegetation Growth ◽

Extreme Gradient Boosting ◽

Boosting Method

To forecast the terrestrial carbon cycle and monitor food security, vegetation growth must be accurately predicted; however, current process-based ecosystem and crop-growth models are limited in their effectiveness. This study developed a machine learning model using the extreme gradient boosting method to predict vegetation growth throughout the growing season in China from 2001 to 2018. The model used satellite-derived vegetation data for the first month of each growing season, CO2 concentration, and several meteorological factors as data sources for the explanatory variables. Results showed that the model could reproduce the spatiotemporal distribution of vegetation growth as represented by the satellite-derived normalized difference vegetation index (NDVI). The predictive error for the growing season NDVI was less than 5% for more than 98% of vegetated areas in China; the model represented seasonal variations in NDVI well. The coefficient of determination (R2) between the monthly observed and predicted NDVI was 0.83, and more than 69% of vegetated areas had an R2 > 0.8. The effectiveness of the model was examined for a severe drought year (2009), and results showed that the model could reproduce the spatiotemporal distribution of NDVI even under extreme conditions. This model provides an alternative method for predicting vegetation growth and has great potential for monitoring vegetation dynamics and crop growth.

Download Full-text

Comparing the Principal Regression Analysis Method with Ridge Regression Analysis in Overcoming Multicollinearity on Human Development Index (HDI) Data in Regency/City of East Java in 2018

Journal of Physics Conference Series ◽

10.1088/1742-6596/1872/1/012024 ◽

2021 ◽

Vol 1872 (1) ◽

pp. 012024

Author(s):

A F Yoantika ◽

Susiswo

Keyword(s):

Regression Analysis ◽

Human Development ◽

Ridge Regression ◽

Human Development Index ◽

Analysis Method ◽

Development Index ◽

Ridge Regression Analysis

Download Full-text

Proposing a machine-learning based method to predict stillbirth before and during delivery and ranking the features: nationwide retrospective cross-sectional study

BMC Pregnancy and Childbirth ◽

10.1186/s12884-021-03658-z ◽

2021 ◽

Vol 21 (1) ◽

Author(s):

Toktam Khatibi ◽

Elham Hanifi ◽

Mohammad Mehdi Sepehri ◽

Leila Allahqoli

Keyword(s):

Machine Learning ◽

External Validation ◽

Fetal Loss ◽

Null Distribution ◽

Training Dataset ◽

Gradient Boosting ◽

Support Vector ◽

Cross Sectional ◽

Boosting Method ◽

Demographic Features

Abstract Background Stillbirth is defined as fetal loss in pregnancy beyond 28 weeks by WHO. In this study, a machine-learning based method is proposed to predict stillbirth from livebirth and discriminate stillbirth before and during delivery and rank the features. Method A two-step stack ensemble classifier is proposed for classifying the instances into stillbirth and livebirth at the first step and then, classifying stillbirth before delivery from stillbirth during the labor at the second step. The proposed SE has two consecutive layers including the same classifiers. The base classifiers in each layer are decision tree, Gradient boosting classifier, logistics regression, random forest and support vector machines which are trained independently and aggregated based on Vote boosting method. Moreover, a new feature ranking method is proposed in this study based on mean decrease accuracy, Gini Index and model coefficients to find high-ranked features. Results IMAN registry dataset is used in this study considering all births at or beyond 28th gestational week from 2016/04/01 to 2017/01/01 including 1,415,623 live birth and 5502 stillbirth cases. A combination of maternal demographic features, clinical history, fetal properties, delivery descriptors, environmental features, healthcare service provider descriptors and socio-demographic features are considered. The experimental results show that our proposed SE outperforms the compared classifiers with the average accuracy of 90%, sensitivity of 91%, specificity of 88%. The discrimination of the proposed SE is assessed and the average AUC of ±95%, CI of 90.51% ±1.08 and 90% ±1.12 is obtained on training dataset for model development and test dataset for external validation, respectively. The proposed SE is calibrated using isotopic nonparametric calibration method with the score of 0.07. The process is repeated 10,000 times and AUC of SE classifiers using random different training datasets as null distribution. The obtained p-value to assess the specificity of the proposed SE is 0.0126 which shows the significance of the proposed SE. Conclusions Gestational age and fetal height are two most important features for discriminating livebirth from stillbirth. Moreover, hospital, province, delivery main cause, perinatal abnormality, miscarriage number and maternal age are the most important features for classifying stillbirth before and during delivery.

Download Full-text

Comparison of Ensemble Machine Learning Methods for Soil Erosion Pin Measurements

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10010042 ◽

2021 ◽

Vol 10 (1) ◽

pp. 42

Author(s):

Kieu Anh Nguyen ◽

Walter Chen ◽

Bor-Shiun Lin ◽

Uma Seeboonruang

Keyword(s):

Machine Learning ◽

Soil Erosion ◽

Ensemble Methods ◽

Machine Learning Algorithms ◽

Multivariate Adaptive Regression Splines ◽

Gradient Boosting ◽

Support Vector ◽

Ensemble Machine Learning ◽

Boosting Method ◽

Bagging Method

Although machine learning has been extensively used in various fields, it has only recently been applied to soil erosion pin modeling. To improve upon previous methods of quantifying soil erosion based on erosion pin measurements, this study explored the possible application of ensemble machine learning algorithms to the Shihmen Reservoir watershed in northern Taiwan. Three categories of ensemble methods were considered in this study: (a) Bagging, (b) boosting, and (c) stacking. The bagging method in this study refers to bagged multivariate adaptive regression splines (bagged MARS) and random forest (RF), and the boosting method includes Cubist and gradient boosting machine (GBM). Finally, the stacking method is an ensemble method that uses a meta-model to combine the predictions of base models. This study used RF and GBM as the meta-models, decision tree, linear regression, artificial neural network, and support vector machine as the base models. The dataset used in this study was sampled using stratified random sampling to achieve a 70/30 split for the training and test data, and the process was repeated three times. The performance of six ensemble methods in three categories was analyzed based on the average of three attempts. It was found that GBM performed the best among the ensemble models with the lowest root-mean-square error (RMSE = 1.72 mm/year), the highest Nash-Sutcliffe efficiency (NSE = 0.54), and the highest index of agreement (d = 0.81). This result was confirmed by the spatial comparison of the absolute differences (errors) between model predictions and observations using GBM and RF in the study area. In summary, the results show that as a group, the bagging method and the boosting method performed equally well, and the stacking method was third for the erosion pin dataset considered in this study.

Download Full-text