Concepedia

TLDR

Clustering variables around latent components is motivated by the need to group correlated variables, address negative correlations, and incorporate external data into the clustering process. This study investigates clustering of variables around latent components to organize multivariate data into meaningful structures. The authors perform hierarchical clustering followed by a partitioning algorithm, both maximizing a criterion that measures how strongly variables in a cluster relate to its latent variable. Illustrations demonstrate the method on real sensory data sets.

Abstract

Abstract Clustering of variables around latent components is investigated as a means to organize multivariate data into meaningful structures. The coverage includes (i) the case where it is desirable to lump together correlated variables no matter whether the correlation coefficient is positive or negative; (ii) the case where negative correlation shows high disagreement among variables; (iii) an extension of the clustering techniques which makes it possible to explain the clustering of variables taking account of external data. The strategy basically consists in performing a hierarchical cluster analysis, followed by a partitioning algorithm. Both algorithms aim at maximizing the same criterion which reflects the extent to which variables in each cluster are related to the latent variable associated with this cluster. Illustrations are outlined using real data sets from sensory studies.

References

YearCitations

Page 1