An empirical study of software change classification with imbalance data-handling methods

Author(s):  
Xiaoyan Zhu ◽  
Binbin Niu ◽  
E. James Whitehead ◽  
Zhongbin Sun
Author(s):  
Craig K. Enders ◽  
Amanda N. Baraldi

2011 ◽  
Vol 690 (2) ◽  
pp. 148-161 ◽  
Author(s):  
Christophe Tistaert ◽  
Bieke Dejaegher ◽  
Yvan Vander Heyden

SIMULATION ◽  
2020 ◽  
Vol 96 (10) ◽  
pp. 825-839
Author(s):  
Hao Cheng

Missing data is almost inevitable for various reasons in many applications. For hierarchical latent variable models, there usually exist two kinds of missing data problems. One is manifest variables with incomplete observations, the other is latent variables which cannot be observed directly. Missing data in manifest variables can be handled by different methods. For latent variables, there exist several kinds of partial least square (PLS) algorithms which have been widely used to estimate the value of latent variables. In this paper, we not only combine traditional linear regression type PLS algorithms with missing data handling methods, but also introduce quantile regression to improve the performances of PLS algorithms when the relationships among manifest and latent variables are not fixed according to the explored quantile of interest. Thus, we can get the overall view of variables’ relationships at different levels. The main challenges lie in how to introduce quantile regression in PLS algorithms correctly and how well the PLS algorithms perform when missing manifest variables occur. By simulation studies, we compare all the PLS algorithms with missing data handling methods in different settings, and finally build a business sophistication hierarchical latent variable model based on real data.


Sign in / Sign up

Export Citation Format

Share Document