Publication | Closed Access

Feature Selection as a Preprocessing Step for Hierarchical Clustering

Citations: 91

References: 0

Year: 1999

Luis Talavera

Unknown Venue

Abstract

Although feature selection is a central problem in inductive learning, as the growing amount of research in this area suggests, most of the work has been carried out under the supervised learning paradigm, with little attention paid to unsupervised learning tasks and, in particular, clustering tasks. In this paper, we analyze the particular benefits that feature selection may provide in hierarchical clustering tasks and explore the power of feature selection methods applied as a preprocessing step under the proposed dimensions. Instead of only predicting class labels, the focus is on more general inference tasks over all the features. Empirical results suggest that feature selection as preprocessing provides only limited improvements in the performance task. In addition, they raise the problem of the notion of irrelevance in unsupervised settings.

1 INTRODUCTION

Inductive learning systems are a powerful approach for automatically extracting useful information from data or for assisti...