Publication | Closed Access
Testing that distributions are close
259
Citations
20
References
2002
Year
Unknown Venue
EngineeringTesting TechniqueSoftware TestingData DistributionRandomized AlgorithmComparative TestLower BoundAlgorithmic Information TheoryComputational ComplexityProbability TheoryComputer ScienceSample SizeStochastic GeometryN ElementApproximation TheoryStatisticsSublinear Algorithm
Given two distributions over an n element set, we wish to check whether these distributions are statistically close by only sampling. We give a sublinear algorithm which uses O(n/sup 2/3//spl epsiv//sup -4/ log n) independent samples from each distribution, runs in time linear in the sample size, makes no assumptions about the structure of the distributions, and distinguishes the cases when the distance between the distributions is small (less than max(/spl epsiv//sup 2//32/sup 3//spl radic/n,/spl epsiv//4/spl radic/n=)) or large (more than /spl epsiv/) in L/sub 1/-distance. We also give an /spl Omega/(n/sup 2/3//spl epsiv//sup -2/3/) lower bound. Our algorithm has applications to the problem of checking whether a given Markov process is rapidly mixing. We develop sublinear algorithms for this problem as well.
| Year | Citations | |
|---|---|---|
Page 1
Page 1