Publication | Closed Access
HNPU: An Adaptive DNN Training Processor Utilizing Stochastic Dynamic Fixed-Point and Active Bit-Precision Searching
56
Citations
24
References
2021
Year
Artificial IntelligenceEngineeringMachine LearningNeural Networks (Machine Learning)Active Bit-precision SearchingComputer ArchitectureSocial SciencesSparse Neural NetworkComputing SystemsEmbedded Machine LearningPerformance ImprovementActual Dnn TrainingComputer EngineeringLayer-wise Adaptive PrecisionDnn InferenceComputer ScienceNeural Networks (Computational Neuroscience)Deep LearningNeural Architecture SearchDeep Neural NetworksHardware Acceleration
This article presents HNPU, which is an energy-efficient deep neural network (DNN) training processor by adopting algorithm-hardware co-design. The HNPU supports stochastic dynamic fixed-point representation and layer-wise adaptive precision searching unit for low-bit-precision training. It additionally utilizes slice-level reconfigurability and sparsity to maximize its efficiency both in DNN inference and training. Adaptive bandwidth reconfigurable accumulation network enables reconfigurable DNN allocation and maintains its high core utilization even in various bit-precision conditions. Fabricated in a 28-nm process, the HNPU accomplished at least <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$5.9\times $ </tex-math></inline-formula> higher energy efficiency and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$2.5\times $ </tex-math></inline-formula> higher area efficiency in actual DNN training compared with the previous state-of-the-art on-chip learning processors.
| Year | Citations | |
|---|---|---|
Page 1
Page 1