Publication | Closed Access
Bayesian k-Means as a “Maximization-Expectation” Algorithm
Citations: 69
References: 19
Year: 2009
Topics: Cluster Computing, Bayesian Statistic, Engineering, Machine Learning, Bayesian Inference, Optimization-based Data Mining, Bayesian Optimization, Data Science, Data Mining, Mixture Analysis, Bayesian Methods, Public Health, Statistics, Bayesian Hierarchical Modeling, Document Clustering, Knowledge Discovery, Computer Science, Bayesian Statistics, Classical Expectation-maximization Algorithm, Mixture Distribution, Hidden Variables, New Class, Statistical Inference
We introduce a new class of "maximization-expectation" (ME) algorithms where we maximize over hidden variables but marginalize over random parameters. This reverses the roles of expectation and maximization in the classical expectation-maximization algorithm. In the context of clustering, we argue that these hard assignments open the door to very fast implementations based on data structures such as kd-trees and conga lines. The marginalization over parameters ensures that we retain the ability to infer model structure (i.e., number of clusters). As an important example, we discuss a top-down Bayesian k-means algorithm and a bottom-up agglomerative clustering algorithm. In experiments, we compare these algorithms against a number of alternative algorithms that have recently appeared in the literature.
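The core idea in the abstract, hard (maximized) assignments combined with parameters that are integrated out, can be illustrated with a toy sketch. This is not the paper's Bayesian k-means or its kd-tree implementation. It assumes 1-D data, a known noise variance, a conjugate Normal prior on each cluster mean, and a uniform prior over assignments. The names `log_predictive` and `me_cluster` and all default values are hypothetical.

```python
import math


def log_predictive(x, n, total, m0=0.0, s0_sq=100.0, sigma_sq=1.0):
    """Log density of x under a cluster whose mean has been integrated out.

    Assumed model (illustrative only): the cluster holds n points summing
    to `total`, its mean has a Normal(m0, s0_sq) prior, and the noise
    variance sigma_sq is known.
    """
    precision = 1.0 / s0_sq + n / sigma_sq
    post_mean = (m0 / s0_sq + total / sigma_sq) / precision
    pred_var = 1.0 / precision + sigma_sq  # posterior variance plus noise
    return -0.5 * (math.log(2 * math.pi * pred_var)
                   + (x - post_mean) ** 2 / pred_var)


def me_cluster(data, k, max_sweeps=100):
    """Coordinate ascent over hard assignments with marginalized means."""
    labels = [i % k for i in range(len(data))]  # arbitrary initial labels
    counts = [0] * k
    sums = [0.0] * k
    for x, z in zip(data, labels):
        counts[z] += 1
        sums[z] += x

    for _ in range(max_sweeps):
        changed = False
        for i, x in enumerate(data):
            old = labels[i]
            # Remove the point from its cluster before scoring it.
            counts[old] -= 1
            sums[old] -= x
            # "Maximization" step: pick the cluster with the highest
            # marginal predictive density; no point estimate of the mean.
            scores = [log_predictive(x, counts[j], sums[j]) for j in range(k)]
            new = max(range(k), key=scores.__getitem__)
            counts[new] += 1
            sums[new] += x
            labels[i] = new
            changed = changed or new != old
        if not changed:
            break
    return labels


labels = me_cluster([-5.2, -4.8, -5.0, 5.1, 4.9, 5.0], k=2)
```

Because each update only needs a cluster's count and sum, the cluster parameters never appear explicitly. A cluster that loses all its points is scored under the prior alone, which gives a rough intuition for how marginalization can leave some clusters unused.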