Publication | Open Access
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
1.2K
Citations
4
References
2016
Year
Artificial IntelligenceEngineeringMachine LearningGenerative SystemRepresentation LearningData SciencePattern RecognitionGenerative ModelDisentangled RepresentationsSynthetic Image GenerationGenerative ModelsComputer ScienceHuman Image SynthesisDeep LearningComputer VisionGenerative Adversarial NetworkInterpretable Representation LearningMutual InformationGenerative Ai
This paper introduces InfoGAN, an information‑theoretic extension to the Generative Adversarial Network that learns disentangled representations in a fully unsupervised manner. InfoGAN maximizes the mutual information between a small subset of latent variables and the generated data by optimizing a tractable lower bound, effectively implementing a Wake‑Sleep style training procedure. Experiments demonstrate that InfoGAN disentangles factors such as digit shape and writing style on MNIST, pose and lighting on 3D renders, background digits on SVHN, and facial attributes on CelebA, achieving representations competitive with supervised methods.
This paper describes InfoGAN, an information-theoretic extension to the Generative Adversarial Network that is able to learn disentangled representations in a completely unsupervised manner. InfoGAN is a generative adversarial network that also maximizes the mutual information between a small subset of the latent variables and the observation. We derive a lower bound to the mutual information objective that can be optimized efficiently, and show that our training procedure can be interpreted as a variation of the Wake-Sleep algorithm. Specifically, InfoGAN successfully disentangles writing styles from digit shapes on the MNIST dataset, pose from lighting of 3D rendered images, and background digits from the central digit on the SVHN dataset. It also discovers visual concepts that include hair styles, presence/absence of eyeglasses, and emotions on the CelebA face dataset. Experiments show that InfoGAN learns interpretable representations that are competitive with representations learned by existing fully supervised methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1