Accurate Confidence Regions for Principal Components Factors*

Author(s):  
Javier Maldonado ◽  
Esther Ruiz
2014 ◽  
Vol 2014 ◽  
pp. 1-10 ◽  
Author(s):  
João Carlos G. D. Costa ◽  
Paulo José G. Da-Silva ◽  
Renan Moritz V. R. Almeida ◽  
Antonio Fernando C. Infantosi

The well-known multivariate technique Principal Components Analysis (PCA) is usually applied to a sample, and so component scores are subjected to sampling variability. However, few studies address their stability, an important topic when the sample size is small. This work presents three validation procedures applied to PCA, based on confidence regions generated by a variant of a nonparametric bootstrap called the partial bootstrap: (i) the assessment of PC scores variability by the spread and overlapping of “confidence regions” plotted around these scores; (ii) the use of the confidence regions centroids as a validation set; and (iii) the definition of the number of nontrivial axes to be retained for analysis. The methods were applied to EEG data collected during a postural control protocol with twenty-four volunteers. Two axes were retained for analysis, with 91.6% of explained variance. Results showed that the area of the confidence regions provided useful insights on the variability of scores and suggested that some subjects were not distinguishable from others, which was not evident from the principal planes. In addition, potential outliers, initially suggested by an analysis of the first principal plane, could not be confirmed by the confidence regions.


2006 ◽  
Vol 27 (2) ◽  
pp. 87-92 ◽  
Author(s):  
Willem K.B. Hofstee ◽  
Dick P.H. Barelds ◽  
Jos M.F. Ten Berge

Hofstee and Ten Berge (2004a) have proposed a new look at personality assessment data, based on a bipolar proportional (-1, .. . 0, .. . +1) scale, a corresponding coefficient of raw-scores likeness L = ΢XY/N, and raw-scores principal component analysis. In a normal sample, the approach resulted in a structure dominated by a first principal component, according to which most people are faintly to mildly socially desirable. We hypothesized that a more differentiated structure would arise in a clinical sample. We analyzed the scores of 775 psychiatric clients on the 132 items of the Dutch Personality Questionnaire (NPV). In comparison to a normative sample (N = 3140), the eigenvalue for the first principal component appeared to be 1.7 times as small, indicating that such clients have less personality (social desirability) in common. Still, the match between the structures in the two samples was excellent after oblique rotation of the loadings. We applied the abridged m-dimensional circumplex design, by which persons are typed by their two highest scores on the principal components, to the scores on the first four principal components. We identified five types: Indignant (1-), Resilient (1-2+), Nervous (1-2-), Obsessive-Compulsive (1-3-), and Introverted (1-4-), covering 40% of the psychiatric sample. Some 26% of the individuals had negligible scores on all type vectors. We discuss the potential and the limitations of our approach in a clinical context.


Methodology ◽  
2016 ◽  
Vol 12 (1) ◽  
pp. 11-20 ◽  
Author(s):  
Gregor Sočan

Abstract. When principal component solutions are compared across two groups, a question arises whether the extracted components have the same interpretation in both populations. The problem can be approached by testing null hypotheses stating that the congruence coefficients between pairs of vectors of component loadings are equal to 1. Chan, Leung, Chan, Ho, and Yung (1999) proposed a bootstrap procedure for testing the hypothesis of perfect congruence between vectors of common factor loadings. We demonstrate that the procedure by Chan et al. is both theoretically and empirically inadequate for the application on principal components. We propose a modification of their procedure, which constructs the resampling space according to the characteristics of the principal component model. The results of a simulation study show satisfactory empirical properties of the modified procedure.


1980 ◽  
Vol 19 (04) ◽  
pp. 205-209
Author(s):  
L. A. Abbott ◽  
J. B. Mitton

Data taken from the blood of 262 patients diagnosed for malabsorption, elective cholecystectomy, acute cholecystitis, infectious hepatitis, liver cirrhosis, or chronic renal disease were analyzed with three numerical taxonomy (NT) methods : cluster analysis, principal components analysis, and discriminant function analysis. Principal components analysis revealed discrete clusters of patients suffering from chronic renal disease, liver cirrhosis, and infectious hepatitis, which could be displayed by NT clustering as well as by plotting, but other disease groups were poorly defined. Sharper resolution of the same disease groups was attained by discriminant function analysis.


2018 ◽  
Vol 13 (4) ◽  
pp. 17-23
Author(s):  
E. A. Abidova ◽  
O. I. Bejsug ◽  
O. E. Draka ◽  
А. E. Dembickij ◽  
O. Yu. Pugachyova

Sign in / Sign up

Export Citation Format

Share Document