Publication | Closed Access
Fast, small-space algorithms for approximate histogram maintenance
247
Citations
16
References
2002
Year
Unknown Venue
EngineeringRobust HistogramStreaming AlgorithmComputational ComplexityRange SearchingImage AnalysisRobust Histogram ApproximationData SciencePattern RecognitionApproximate ComputingPattern AnalysisComputational GeometrySublinear AlgorithmMachine VisionSketching AlgorithmComputer ScienceImage SimilarityMedical Image ComputingComputer VisionComputational ScienceGeometric AlgorithmNatural SciencesMathematical FoundationsApproximate Histogram Maintenance
(MATH) A vector A of length N is defined implicitly, via a stream of updates of the form "add 5 to A3." We give a sketching algorithm, that constructs a small sketch from the stream of updates, and a reconstruction algorithm, that produces a B-bucket piecewise-constant representation (histogram) H for A from the sketch, such that ||A—H||≤(1+ε)||A—Hopt||, where the error ||A—H|| is either $\ell_1$ (absolute) or $\ell_2$ (root-mean-square) error. The time to process a single update, time to reconstruct the histogram, and size of the sketch are each bounded by poly(B,log(N),log||A,1/ε. Our result is obtained in two steps. First we obtain what we call a robust histogram approximation for A, a histogram such that adding a small number of buckets does not help improve the representation quality significantly. From the robust histogram, we cull a histogram of desired accruacy and B buckets in the second step. This technique also provides similar results for Haar wavelet representations, under $\ell_2$ error. Our results have applications in summarizing data distributions fast and succinctly even in distributed settings.
| Year | Citations | |
|---|---|---|
Page 1
Page 1