Publication | Closed Access
Minimum spanning tree partitioning algorithm for microaggregation
Citations: 220
References: 7
Year: 2005
Mathematical Programming, Cluster Computing, Engineering, Network Analysis, Discrete Optimization, Text Mining, Cluster Technology, Optimization-based Data Mining, Data Science, Data Mining, Minimum Group Size, Combinatorial Optimization, Data Management, Statistics, Pronounced Clustering Effects, Social Network Analysis, Document Clustering, Knowledge Discovery, Computer Science, MST Partitioning Algorithm, Network Science, Graph Theory, Network Algorithm, Partition (Database), Business, Big Data
This paper presents a clustering algorithm for partitioning a minimum spanning tree with a constraint on minimum group size. The problem is motivated by microaggregation, a disclosure limitation technique in which similar records are aggregated into groups containing a minimum of k records. Heuristic clustering methods are needed since the minimum information loss microaggregation problem is NP-hard. Our MST partitioning algorithm for microaggregation is sufficiently efficient to be practical for large data sets and yields results that are comparable to the best available heuristic methods for microaggregation. For data that contain pronounced clustering effects, our method results in significantly lower information loss. Our algorithm is general enough to accommodate different measures of information loss and can be used for other clustering applications that have a constraint on minimum group size.
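The general idea of partitioning a minimum spanning tree under a minimum group size can be illustrated with a short sketch. This is not the paper's algorithm, whose details are not given in the abstract. It is a simplified greedy heuristic that assumes Euclidean distance between records, builds the MST with Prim's algorithm, and then removes edges from longest to shortest whenever both resulting components keep at least k records. The function names (`mst_edges`, `partition_mst`, `sse`) are hypothetical, and within-group sum of squared errors is used as one possible measure of information loss.

```python
import math

def mst_edges(points):
    """Prim's algorithm on the complete Euclidean graph; returns (dist, u, v) edges."""
    n = len(points)
    if n == 0:
        return []
    in_tree = [False] * n
    best = [math.inf] * n      # cheapest known link from the tree to each node
    parent = [-1] * n
    best[0] = 0.0
    edges = []
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if parent[u] >= 0:
            edges.append((best[u], parent[u], u))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best[v]:
                    best[v], parent[v] = d, u
    return edges

def component(start, adj):
    """Set of nodes reachable from start in an adjacency dict of sets."""
    seen, stack = {start}, [start]
    while stack:
        u = stack.pop()
        for v in adj[u]:
            if v not in seen:
                seen.add(v)
                stack.append(v)
    return seen

def partition_mst(points, k):
    """Assumed greedy rule: cut the longest MST edges first, but only if
    both sides of the cut still contain at least k records."""
    n = len(points)
    edges = mst_edges(points)
    adj = {i: set() for i in range(n)}
    for _, u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    for _, u, v in sorted(edges, reverse=True):
        adj[u].discard(v)
        adj[v].discard(u)
        if len(component(u, adj)) < k or len(component(v, adj)) < k:
            adj[u].add(v)      # cut would violate the size constraint; restore
            adj[v].add(u)
    groups, seen = [], set()
    for i in range(n):
        if i not in seen:
            g = component(i, adj)
            seen |= g
            groups.append(sorted(g))
    return groups

def sse(points, groups):
    """Within-group sum of squared distances to each group's centroid."""
    total = 0.0
    for g in groups:
        dim = len(points[g[0]])
        centroid = [sum(points[i][d] for i in g) / len(g) for d in range(dim)]
        total += sum(math.dist(points[i], centroid) ** 2 for i in g)
    return total

records = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
groups = partition_mst(records, k=3)
```

With these six records and k = 3, the only edge that can be cut is the long one joining the two clusters, so `groups` is `[[0, 1, 2], [3, 4, 5]]`. This sketch recomputes components after every candidate cut, so it is quadratic in the number of records and would need a more careful traversal to be practical on large data sets.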