Publication | Closed Access
Image to Image Translation for Domain Adaptation
567
Citations
29
References
2018
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningAutoencodersImage AnalysisUnsupervised Domain AdaptationData SciencePattern RecognitionVideo TransformerMachine TranslationSynthetic Image GenerationMachine VisionFeature LearningComputer ScienceImage TranslationHuman Image SynthesisDeep LearningMedical Image ComputingGeneral FrameworkComputer VisionDomain Adaptation
Many recent works are specific instances of the proposed general framework for unsupervised domain adaptation. The study proposes a general unsupervised domain adaptation framework that uses unpaired image‑to‑image translation to constrain encoder features, enabling source‑trained networks to operate on target domains without target annotations. The method adds auxiliary networks and losses that enforce reconstruction of both domains and indistinguishable feature distributions, leveraging unpaired image‑to‑image translation to regularize the encoder. The approach achieves state‑of‑the‑art performance on MNIST, USPS, SVHN, Office, GTA5, and Cityscapes datasets.
We propose a general framework for unsupervised domain adaptation, which allows deep neural networks trained on a source domain to be tested on a different target domain without requiring any training annotations in the target domain. This is achieved by adding extra networks and losses that help regularize the features extracted by the backbone encoder network. To this end we propose the novel use of the recently proposed unpaired image-to-image translation framework to constrain the features extracted by the encoder network. Specifically, we require that the features extracted are able to reconstruct the images in both domains. In addition we require that the distribution of features extracted from images in the two domains are indistinguishable. Many recent works can be seen as specific cases of our general framework. We apply our method for domain adaptation between MNIST, USPS, and SVHN datasets, and Amazon, Webcam and DSLR Office datasets in classification tasks, and also between GTA5 and Cityscapes datasets for a segmentation task. We demonstrate state of the art performance on each of these datasets.
| Year | Citations | |
|---|---|---|
Page 1
Page 1