Concepedia

Publication | Closed Access

Unbiased estimation of size and other aggregates over hidden web databases

56

Citations

27

References

2010

Year

Abstract

Many websites provide restrictive form-like interfaces which allow users to execute search queries on the underlying hidden databases. In this paper, we consider the problem of estimating the size of a hidden database through its web interface. We propose novel techniques which use a small number of queries to produce unbiased estimates with small variance. These techniques can also be used for approximate query processing over hidden databases. We present theoretical analysis and extensive experiments to illustrate the effectiveness of our approach.

References

YearCitations

Page 1