Publication | Closed Access
Test-Time Training for Out-of-Distribution Generalization
50
Citations
21
References
2019
Year
Unknown Venue
EngineeringMachine LearningGeneral ApproachDistribution ShiftsImage ClassificationImage AnalysisData SciencePattern RecognitionSelf-supervised LearningTest-time TrainingSemi-supervised LearningStatisticsSupervised LearningMachine VisionComputational Learning TheoryFeature LearningComputer ScienceStatistical Learning TheoryMedical Image ComputingDeep LearningComputer VisionStatistical Inference
We introduce a general approach, called test-time training, for improving the performance of predictive models when test and training data come from different distributions. Test-time training turns a single unlabeled test instance into a self-supervised learning problem, on which we update the model parameters before making a prediction on the test sample. We show that this simple idea leads to surprising improvements on diverse image classification benchmarks aimed at evaluating robustness to distribution shifts. Theoretical investigations on a convex model reveal helpful intuitions for when we can expect our approach to help.
| Year | Citations | |
|---|---|---|
Page 1
Page 1