Publication | Closed Access
Fixed-point optimization of deep neural networks with adaptive step size retraining
33
Citations
19
References
2017
Year
Unknown Venue
Artificial IntelligenceEngineeringMachine LearningComputer ArchitectureGradual Quantization SchemeSparse Neural NetworkEmbedded Machine LearningNeural Scaling LawQuantization Step SizeComputer EngineeringLarge Scale OptimizationComputer ScienceDeep LearningNeural Architecture SearchModel CompressionAdaptive OptimizationDeep Neural NetworksAdaptive Step SizeFixed-point Optimization
Fixed-point optimization of deep neural networks plays an important role in hardware based design and low-power implementations. Many deep neural networks show fairly good performance even with 2- or 3-bit precision when quantized weights are fine-tuned by retraining. We propose an improved fixed-point optimization algorithm that estimates the quantization step size dynamically during the retraining. In addition, a gradual quantization scheme is also tested, which sequentially applies fixed-point optimizations from high- to low-precision. The experiments are conducted for feed-forward deep neural networks (FFDNNs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs).
| Year | Citations | |
|---|---|---|
Page 1
Page 1