mixed variables
Recently Published Documents


TOTAL DOCUMENTS

93
(FIVE YEARS 19)

H-INDEX

15
(FIVE YEARS 2)

2021 ◽  
Vol 20 (Number 3) ◽  
pp. 305-327
Author(s):  
Hashibah Hamid ◽  
Nor Idayu Mahat ◽  
Safwati Ibrahim

The strategy surrounding the extraction of a number of mixed variables is examined in this paper in building a model for Linear Discriminant Analysis (LDA). Two methods for extracting crucial variables from a dataset with categorical and continuous variables were employed, namely, multiple correspondence analysis (MCA) and principal component analysis (PCA). However, in this case, direct use of either MCA or PCA on mixed variables is impossible due to restrictions on the structure of data that each method could handle. Therefore, this paper executes some adjustments including a strategy for managing mixed variables so that those mixed variables are equivalent in values. With this, both MCA and PCA can be performed on mixed variables simultaneously. The variables following this strategy of extraction were then utilised in the construction of the LDA model before applying them to classify objects going forward. The suggested models, using three real sets of medical data were then tested, where the results indicated that using a combination of the two methods of MCA and PCA for extraction and LDA could reduce the model’s size, having a positive effect on classifying and better performance of the model since it leads towards minimising the leave-one-out error rate. Accordingly, the models proposed in this paper, including the strategy that was adapted was successful in presenting good results over the full LDA model. Regarding the indicators that were used to extract and to retain the variables in the model, cumulative variance explained (CVE), eigenvalue, and a non-significant shift in the CVE (constant change), could be considered a useful reference or guideline for practitioners experiencing similar issues in future.


Author(s):  
Hashibah Hamid Et.al

The original purpose of the location model is to deal with mixed variables discrimination for classification purposes. Due to the problem of empty cells, smoothed location model is introduced. However, the smoothed location model had smoothed all the cells either empty or not, where the smoothing process causing changesto the original information of the non-empty cells. As it is well known that those original informationis a valuable source and important in any study that should be maintained. To address the aforementioned issues, an amalgamationof maximum likelihood and smoothing estimations is introduced to construct a new location model. The amalgamation of both estimations is expected could handle all situations whether the cells are empty or not based on several settings of sample size and number of variables.


2021 ◽  
Vol 28 (3) ◽  
pp. 034501
Author(s):  
M. D. J. Cole ◽  
A. Mishchenko ◽  
A. Bottino ◽  
C. S. Chang
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document