Publication | Closed Access
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
136
Citations
55
References
2023
Year
Unknown Venue
Numerical AnalysisEngineeringMachine LearningMultimodal LearningStyle TransferOptimal TransportNatural Language ProcessingMultimodal LlmImage AnalysisData ScienceCompact Parameter SpaceModel StorageSynthetic Image GenerationComputer ScienceHuman Image SynthesisMedical Image ComputingDeep LearningComputer VisionSvdiff MethodDiffusion ProcessDiffusion-based ModelingDiffusion ModelsMultiscale Modeling
Diffusion models have achieved remarkable success in text-to-image generation, enabling the creation of high-quality images from text prompts or other modalities. However, existing methods for customizing these models are limited by handling multiple personalized subjects and the risk of overfitting. Moreover, their large number of parameters is inefficient for model storage. In this paper, we propose a novel approach to address these limitations in existing text-to-image diffusion models for personalization. Our method involves fine-tuning the singular values of the weight matrices, leading to a compact and efficient parameter space that reduces the risk of overfitting and language-drifting. We also propose a Cut-Mix-Unmix data-augmentation technique to enhance the quality of multi-subject image generation and a simple text-based image editing framework. Our proposed SVDiff method has a significantly smaller model size compared to existing methods (≈2,200 times fewer parameters compared with vanilla DreamBooth), making it more practical for real-world applications.
| Year | Citations | |
|---|---|---|
Page 1
Page 1