Publication | Open Access
Estimating the Impact of Unknown Unknowns on Aggregate Query Results
23
Citations
46
References
2016
Year
Unknown Venue
EngineeringData AggregationBusiness IntelligenceData PreparationUncertain DataData EcosystemAggregate FunctionInformation RetrievalData ScienceData MiningManagementData IntegrationData ManagementStatisticsData ModelingUnknown UnknownsData ScientistsEstimation StatisticKnowledge DiscoveryData QualityData CleansingIntegrated DataData TreatmentDisparate Data SourcesStatistical InferenceApproximate Query AnsweringBig Data
It is common practice for data scientists to acquire and integrate disparate data sources to achieve higher quality results. But even with a perfectly cleaned and merged data set, two fundamental questions remain: (1) is the integrated data set complete and (2) what is the impact of any unknown (i.e., unobserved) data on query results?
| Year | Citations | |
|---|---|---|
Page 1
Page 1