Publication | Open Access
Semantic Image Synthesis with Spatially-Adaptive Normalization
261
Citations
48
References
2019
Year
Image AnalysisMachine LearningInput Semantic LayoutEngineeringGenerative Adversarial NetworkSemantic Image SynthesisImage SynthesisScene UnderstandingInput LayoutsSemantic LayoutStyle TransferHuman Image SynthesisDeep LearningComputer VisionSynthetic Image Generation
Previous methods directly feed the semantic layout into a deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers. The study proposes spatially‑adaptive normalization to synthesize photorealistic images from semantic layouts. This layer modulates activations in normalization layers via a spatially‑adaptive, learned transformation based on the input layout. Experiments on several challenging datasets demonstrate that the proposed method outperforms existing approaches in visual fidelity and alignment with input layouts, and enables user control over both semantic and style. Code is available at https://github.com/NVlabs/SPADE.
We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers. We show that this is suboptimal as the normalization layers tend to ``wash away'' semantic information. To address the issue, we propose using the input layout for modulating the activations in normalization layers through a spatially-adaptive, learned transformation. Experiments on several challenging datasets demonstrate the advantage of the proposed method over existing approaches, regarding both visual fidelity and alignment with input layouts. Finally, our model allows user control over both semantic and style. Code is available at https://github.com/NVlabs/SPADE .
| Year | Citations | |
|---|---|---|
Page 1
Page 1