Publication | Closed Access
Statistical model criticism using kernel two sample tests
64
Citations
26
References
2015
Year
EngineeringMachine LearningAutoencodersStatistical FoundationKernel TwoData ScienceAdversarial Machine LearningGenerative ModelStatistical Model CriticismStatisticsSupervised LearningPredictive AnalyticsKnowledge DiscoveryModel ComparisonDeep LearningSynthetic DataMaximum Mean DiscrepancyReproducing Kernel MethodSample TestsStatistical Inference
We propose an exploratory approach to statistical model criticism using maximum mean discrepancy (MMD) two sample tests. Typical approaches to model criticism require a practitioner to select a statistic by which to measure discrepancies between data and a statistical model. MMD two sample tests are instead constructed as an analytic maximisation over a large space of possible statistics and therefore automatically select the statistic which most shows any discrepancy. We demonstrate on synthetic data that the selected statistic, called the witness function, can be used to identify where a statistical model most misrepresents the data it was trained on. We then apply the procedure to real data where the models being assessed are restricted Boltzmann machines, deep belief networks and Gaussian process regression and demonstrate the ways in which these models fail to capture the properties of the data they are trained on.
| Year | Citations | |
|---|---|---|
Page 1
Page 1