The ensemble of neural network and gradient boosting for the prediction of customer profitability: A two-stage modeling approach

2018 ◽  
Vol 13 (4) ◽  
pp. 329-340
Author(s):  
Sunny Lam
Author(s):  
Chih-Hsiang Yang ◽  
Jaclyn P Maher ◽  
Aditya Ponnada ◽  
Eldin Dzubur ◽  
Rachel Nordgren ◽  
...  

Abstract People differ from each other to the extent to which momentary factors, such as context, mood, and cognitions, influence momentary health behaviors. However, statistical models to date are limited in their ability to test whether the association between two momentary variables (i.e., subject-level slopes) predicts a subject-level outcome. This study demonstrates a novel two-stage statistical modeling strategy that is capable of testing whether subject-level slopes between two momentary variables predict subject-level outcomes. An empirical case study application is presented to examine whether there are differences in momentary moderate-to-vigorous physical activity (MVPA) levels between the outdoor and indoor context in adults and whether these momentary differences predict mean daily MVPA levels 6 months later. One hundred and eight adults from a multiwave longitudinal study provided 4 days of ecological momentary assessment (during baseline) and accelerometry data (both at baseline and 6 month follow-up). Multilevel data were analyzed using an open-source program (MixWILD) to test whether momentary strength between outdoor context and MVPA during baseline was associated with average daily MVPA levels measured 6 months later. During baseline, momentary MVPA levels were higher in outdoor contexts as compared to indoor contexts (b = 0.07, p < .001). Participants who had more momentary MVPA when outdoors (vs. indoors) during baseline (i.e., a greater subject-level slope) had higher daily MVPA at the 6 month follow-up (b = 0.09, p < .05). This empirical example shows that the subject-level momentary association between specific context (i.e., outdoors) and health behavior (i.e., physical activity) may contribute to overall engagement in that behavior in the future. The demonstrated two-stage modeling approach has extensive applications in behavioral medicine to analyze intensive longitudinal data collected from wearable sensors and mobile devices.


2001 ◽  
Vol 5 (1-2) ◽  
pp. 69-79 ◽  
Author(s):  
Amy McCullough ◽  
Conrado M. Gempesaw ◽  
William H. Daniels ◽  
J. Richard Bacon

2020 ◽  
Vol 17 (2) ◽  
pp. 337-357
Author(s):  
Marta Galvani ◽  
Chiara Bardelli ◽  
Sara Bottiroli ◽  
Silvia Figini

Processes ◽  
2021 ◽  
Vol 9 (4) ◽  
pp. 655
Author(s):  
Huanhuan Zhang ◽  
Jigeng Li ◽  
Mengna Hong

With the global energy crisis and environmental pollution intensifying, tissue papermaking enterprises urgently need to save energy. The energy consumption model is essential for the energy saving of tissue paper machines. The energy consumption of tissue paper machine is very complicated, and the workload and difficulty of using the mechanism model to establish the energy consumption model of tissue paper machine are very large. Therefore, this article aims to build an empirical energy consumption model for tissue paper machines. The energy consumption of this model includes electricity consumption and steam consumption. Since the process parameters have a great influence on the energy consumption of the tissue paper machines, this study uses three methods: linear regression, artificial neural network and extreme gradient boosting tree to establish the relationship between process parameters and power consumption, and process parameters and steam consumption. Then, the best power consumption model and the best steam consumption model are selected from the models established by linear regression, artificial neural network and the extreme gradient boosting tree. Further, they are combined into the energy consumption model of the tissue paper machine. Finally, the models established by the three methods are evaluated. The experimental results show that using the empirical model for tissue paper machine energy consumption modeling is feasible. The result also indicates that the power consumption model and steam consumption model established by the extreme gradient boosting tree are better than the models established by linear regression and artificial neural network. The experimental results show that the power consumption model and steam consumption model established by the extreme gradient boosting tree are better than the models established by linear regression and artificial neural network. The mean absolute percentage error of the electricity consumption model and the steam consumption model built by the extreme gradient boosting tree is approximately 2.72 and 1.87, respectively. The root mean square errors of these two models are about 4.74 and 0.03, respectively. The result also indicates that using the empirical model for tissue paper machine energy consumption modeling is feasible, and the extreme gradient boosting tree is an efficient method for modeling energy consumption of tissue paper machines.


2021 ◽  
Vol 11 (9) ◽  
pp. 4243
Author(s):  
Chieh-Yuan Tsai ◽  
Yi-Fan Chiu ◽  
Yu-Jen Chen

Nowadays, recommendation systems have been successfully adopted in variant online services such as e-commerce, news, and social media. The recommenders provide users a convenient and efficient way to find their exciting items and increase service providers’ revenue. However, it is found that many recommenders suffered from the cold start (CS) problem where only a small number of ratings are available for some new items. To conquer the difficulties, this research proposes a two-stage neural network-based CS item recommendation system. The proposed system includes two major components, which are the denoising autoencoder (DAE)-based CS item rating (DACR) generator and the neural network-based collaborative filtering (NNCF) predictor. In the DACR generator, a textual description of an item is used as auxiliary content information to represent the item. Then, the DAE is applied to extract the content features from high-dimensional textual vectors. With the compact content features, a CS item’s rating can be efficiently derived based on the ratings of similar non-CS items. Second, the NNCF predictor is developed to predict the ratings in the sparse user–item matrix. In the predictor, both spare binary user and item vectors are projected to dense latent vectors in the embedding layer. Next, latent vectors are fed into multilayer perceptron (MLP) layers for user–item matrix learning. Finally, appropriate item suggestions can be accurately obtained. The extensive experiments show that the DAE can significantly reduce the computational time for item similarity evaluations while keeping the original features’ characteristics. Besides, the experiments show that the proposed NNCF predictor outperforms several popular recommendation algorithms. We also demonstrate that the proposed CS item recommender can achieve up to 8% MAE improvement compared to adding no CS item rating.


2020 ◽  
Vol 2020 ◽  
pp. 1-19
Author(s):  
De-Cheng Feng ◽  
Bo Fu

In this paper, an intelligent modeling approach is presented to predict the shear strength of the internal reinforced concrete (RC) beam-column joints and used to analyze the sensitivity of the influence factors on the shear strength. The proposed approach is established based on the famous boosting-family ensemble machine learning (ML) algorithms, i.e., gradient boosting regression tree (GBRT), which generates a strong predictive model by integrating several weak predictors, which are obtained by the well-known individual ML algorithms, e.g., DT, ANN, and SVM. The strong model is boosted as each weak predictor has its own weight in the final combination according to the performance. Compared with the conventional mechanical-driven shear strength models, e.g., the well-known modified compression field theory (MCFT), the proposed model can avoid the complicated derivation process of shear mechanism and calibration of the involved empirical parameters; thus, it provides a more convenient, fast, and robust alternative way for predicting the shear strength of the internal RC joints. To train and test the GBRT model, a total of 86 internal RC joint specimens are collected from the literatures, and four traditional ML models and the MCFT model are also employed as comparisons. The results indicate that the GBRT model is superior to both the traditional ML models and MCFT model, as its degree-of-fitting is the highest and the predicting dispersion is the lowest. Finally, the model is used to investigate the influences of different parameters on the shear strength of the internal RC joint, and the sensitivity and importance of the corresponding parameters are obtained.


Sign in / Sign up

Export Citation Format

Share Document