Publication | Closed Access
StarGAN v2: Diverse Image Synthesis for Multiple Domains
1.6K
Citations
34
References
2020
Year
Unknown Venue
EngineeringMachine LearningBiometricsVisual QualityImage AnalysisData SciencePattern RecognitionImage HallucinationSynthetic Image GenerationMachine VisionImage SynthesisVision Language ModelComputer ScienceHuman Image SynthesisMedical Image ComputingDeep LearningComputer VisionDifferent Visual DomainsGenerative Adversarial NetworkStargan V2
A good image-to-image translation model should learn a mapping between different visual domains while satisfying the following properties: 1) diversity of generated images and 2) scalability over multiple domains. Existing methods address either of the issues, having limited diversity or multiple models for all domains. We propose StarGAN v2, a single framework that tackles both and shows significantly improved results over the baselines. Experiments on CelebA-HQ and a new animal faces dataset (AFHQ) validate our superiority in terms of visual quality, diversity, and scalability. To better assess image-to-image translation models, we release AFHQ, high-quality animal faces with large inter- and intra-domain differences. The code, pretrained models, and dataset are available at https://github.com/clovaai/stargan-v2.
| Year | Citations | |
|---|---|---|
Page 1
Page 1