Publication | Closed Access
Adaptive Iterative Pruning for Accelerating Deep Neural Networks
15
Citations
12
References
2019
Year
Unknown Venue
Artificial IntelligenceConvolutional Neural NetworkEngineeringMachine LearningNeural Networks (Machine Learning)Adaptive Iterative PruningDeep Learning ModelSocial SciencesBatch SizeData SciencePattern RecognitionSparse Neural NetworkData AugmentationComputer ScienceNeural Networks (Computational Neuroscience)Deep LearningNeural Architecture SearchStandard Iterative Pruning-Model CompressionDeep Neural NetworksLimited Data Learning
The new adaptive iterative pruning (AIP) approach is proposed. It differs from the standard iterative pruning- retraining scheme by the adaptively changing hyperparameters by linear, multiplicative, or adaptive laws (batch size, number of epochs, a minimal number of channels per layer) that compensate the loss of accuracy due to excessive retraining of the survived weights. It was shown, for example of classification task by VGG-16 model on MNIST dataset, that there is some domain of values where the loss of accuracy could be not higher than 2% with the inference speedup up to ×17 and the loss of accuracy could be not higher than 4% with the inference speedup up to ×30. In the more general sense, AIP can be widely applied for any deep learning model containing the convolutional layers, i.e. for any convolutional neural networks (CNNs), with the similar speedup of inference time and potentially faster convergence during training.
| Year | Citations | |
|---|---|---|
Page 1
Page 1