Publication | Closed Access
Neuro.ZERO
42
Citations
89
References
2019
Year
Unknown Venue
Artificial IntelligenceEngineeringMachine LearningNeural Networks (Machine Learning)Computer ArchitectureDeep Neural Network1Hardware SystemsSocial SciencesComputing SystemsEmbedded Machine LearningPerformance ImprovementAccelerator MicrocontrollerComputer EngineeringNeural Networks (Computational Neuroscience)Computer ScienceDeep LearningModel CompressionHardware AccelerationDomain-specific AcceleratorEnsemble Inference
We introduce Neuro.ZERO---a co-processor architecture consisting of a main microcontroller (MCU) that executes scaled-down versions of a deep neural network1 (DNN) inference task, and an accelerator microcontroller that is powered by harvested energy and follows the intermittent computing paradigm [76]. The goal of the accelerator is to enhance the inference performance of the DNN that is running on the main microcontroller. Neuro.ZERO opportunistically accelerates the run-time performance of a DNN via one of its four acceleration modes: extended inference, expedited inference, ensemble inference, and latent training. To enable these modes, we propose two sets of algorithms: (1) energy and intermittence-aware DNN inference and training algorithms, and (2) a fast and high-precision adaptive fixed-point arithmetic that beats existing floating-point and fixed-point arithmetic in terms of speed and precision, respectively, and achieves the best of both. To evaluate Neuro.ZERO, we implement low-power image and audio recognition applications and demonstrate that their inference speedup increases by 1.6× and 1.7×, respectively, and the inference accuracy increases by 10% and 16%, respectively, when compared to battery-powered single-MCU systems.
| Year | Citations | |
|---|---|---|
Page 1
Page 1