Publication | Closed Access
Unbiased estimation of size and other aggregates over hidden web databases
56
Citations
27
References
2010
Year
Unknown Venue
EngineeringSearch QueriesData AggregationOther AggregatesApproximate Query ProcessingInformation RetrievalData ScienceData MiningHidden DatabaseManagementData IntegrationData ManagementStatisticsSearch TechnologyVery Large DatabaseKnowledge DiscoveryWebometricsData PrivacyComputer ScienceQuery AnalysisQuery OptimizationHidden Web DatabasesStatistical InferenceStatistical DatabaseApproximate Query AnsweringBig Data
Many websites provide restrictive form-like interfaces which allow users to execute search queries on the underlying hidden databases. In this paper, we consider the problem of estimating the size of a hidden database through its web interface. We propose novel techniques which use a small number of queries to produce unbiased estimates with small variance. These techniques can also be used for approximate query processing over hidden databases. We present theoretical analysis and extensive experiments to illustrate the effectiveness of our approach.
| Year | Citations | |
|---|---|---|
Page 1
Page 1