Stability of Nonlinear Principal Components Analysis: An Empirical Study Using the Balanced Bootstrap
Principal components analysis (PCA) is used to explore the structure of data sets containing linearly related numeric variables. Alternatively, nonlinear PCA can handle possibly nonlinearly related numeric as well as nonnumeric variables. For linear PCA, the stability of its solution can be established under the assumption of multivariate normality. For nonlinear PCA, however, standard options for establishing stability are not provided. The authors use the nonparametric bootstrap procedure to assess the stability of nonlinear PCA results, applied to empirical data. They use confidence intervals for the variable transformations and confidence ellipses for the eigenvalues, the component loadings, and the person scores. They discuss the balanced version of the bootstrap, bias estimation, and Procrustes rotation. To provide a benchmark, the same bootstrap procedure is applied to linear PCA on the same data. On the basis of the results, the authors advise using at least 1,000 bootstrap samples, using Procrustes rotation on the bootstrap results, examining the bootstrap distributions along with the confidence regions, and merging categories with small marginal frequencies to reduce the variance of the bootstrap results.
|Keywords||CATPCA, balanced bootstrap, categorical variables, nonlinear principal components analysis, stability|
|Persistent URL||dx.doi.org/10.1037/1082-989X.12.3.359, hdl.handle.net/1765/19263|
Linting, M., Meulman, J.J., Groenen, P.J.F., & van der Kooij, A.J.. (2007). Stability of Nonlinear Principal Components Analysis: An Empirical Study Using the Balanced Bootstrap. Psychological Methods, 12(3), 359–379. doi:10.1037/1082-989X.12.3.359