Data-Free Learning of Student Networks

TLDR

Portable neural networks are essential for computer vision on edge devices, yet most compression methods require access to training data and the architecture of the target network, which are often unavailable due to privacy, legal, or transmission constraints. This work introduces a data‑free learning framework that trains efficient student networks without any training data. The framework treats a pre‑trained teacher as a fixed discriminator and uses a generative adversarial network to synthesize samples that elicit maximal responses from the teacher, then trains a compact student on these generated samples while leveraging the teacher. The resulting student networks achieve 92.22 % on CIFAR‑10, 74.47 % on CIFAR‑100, and 80.56 % on CelebA, demonstrating the effectiveness of the data‑free approach.

Abstract

Learning portable neural networks is very essential for computer vision for the purpose that pre-trained heavy deep models can be well applied on edge devices such as mobile phones and micro sensors. Most existing deep neural network compression and speed-up methods are very effective for training compact deep models, when we can directly access the training dataset. However, training data for the given deep network are often unavailable due to some practice problems (\eg privacy, legal issue, and transmission), and the architecture of the given network are also unknown except some interfaces. To this end, we propose a novel framework for training efficient deep neural networks by exploiting generative adversarial networks (GANs). To be specific, the pre-trained teacher networks are regarded as a fixed discriminator and the generator is utilized for derivating training samples which can obtain the maximum response on the discriminator. Then, an efficient network with smaller model size and computational complexity is trained using the generated data and the teacher network, simultaneously. Efficient student networks learned using the proposed Data-Free Learning (DFL) method achieve 92.22% and 74.47% accuracies without any training data on the CIFAR-10 and CIFAR-100 datasets, respectively. Meanwhile, our student network obtains an 80.56% accuracy on the CelebA benchmark.

References

Page 1

	Year	Citations

Page 1