Publication | Closed Access
Approximation Bounds for Minimum Information Loss Microaggregation
12
Citations
17
References
2009
Year
Mathematical ProgrammingEngineeringComputational ComplexityApproximation BoundsData ScienceCombinatorial OptimizationComputational GeometryApproximation TheoryStatisticsDistance MeasureInformation TheoryLower BoundNp-hard Microaggregation ProblemComputer ScienceApproximation AlgorithmsAlgorithmic Information TheoryInteger ProgrammingLocal Search (Optimization)Squared Distance MeasureOptimization ProblemStatistical Inference
The NP-hard microaggregation problem seeks a partition of data points into groups of minimum specified size k, so as to minimize the sum of the squared euclidean distances of every point to its group's centroid. One recent heuristic provides an O(k <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">3</sup> ) guarantee for this objective function and an O(k <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> ) guarantee for a version of the problem that seeks to minimize the sum of the distances of the points to its group's centroid. This paper establishes approximation bounds for another microaggregation heuristic, providing better approximation guarantees of O(k <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> ) for the squared distance measure and O(k) for the distance measure.
| Year | Citations | |
|---|---|---|
Page 1
Page 1