scholarly journals Different Methods for Determining the Dimensionality of Multivariate Models

2021 ◽  
Vol 1 ◽  
Author(s):  
Douglas N. Rutledge ◽  
Jean-Michel Roger ◽  
Matthieu Lesnoff

A tricky aspect in the use of all multivariate analysis methods is the choice of the number of Latent Variables to use in the model, whether in the case of exploratory methods such as Principal Components Analysis (PCA) or predictive methods such as Principal Components Regression (PCR), Partial Least Squares regression (PLS). For exploratory methods, we want to know which Latent Variables deserve to be selected for interpretation and which contain only noise. For predictive methods, we want to ensure that we include all the variability of interest for the prediction, without introducing variability that would lead to a reduction in the quality of the predictions for samples other than those used to create the multivariate model.

1995 ◽  
Vol 32 (9-10) ◽  
pp. 341-348
Author(s):  
V. Librando ◽  
G. Magazzù ◽  
A. Puglisi

The monitoring of water quality today provides a great quantity of data consisting of the values of the parameters measured as a function of time. In the marine environment, and especially in the suspended material, increasing importance is being given to the presence of organic micropollutants, particularly since some are known to be carcinogenic. As the number of measured parameters increases examining the data and their consequent interpretation becomes more difficult. To overcome such difficulties, numerous chemometric techniques have been introduced in environmental chemistry, such as Multivariate Data Analysis (MVDA), Principal Component Analysis (PCA) and Partial Least Squares Regression (PLSR). The use of the first technique in this work has been applied to the interpretation of the quality of Augusta bay, by measuring the concentration of numerous organic micropollutants, together with the classical water pollution parameters, in different sites and at different times. The MVDA has highlighted the difference between various sampling sites whose data were initially thought to be similar. Furthermore, it has allowed a choice of more significant parameters for future monitoring and more suitable sampling site locations.


Sign in / Sign up

Export Citation Format

Share Document