Publication | Closed Access
9.2 A 28nm 12.1TOPS/W Dual-Mode CNN Processor Using Effective-Weight-Based Convolution and Error-Compensation-Based Prediction
61
Citations
10
References
2021
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningHardware AccelerationEdge ComputingSparse Neural NetworkConvolutional Neural NetworksComputer EngineeringComputer ArchitectureError-compensation-based PredictionQuantized CnnsEmbedded Machine LearningComputer ScienceDeep LearningNeural Architecture SearchCnn ProcessorsModel Compression
To deploy convolutional neural networks (CNNs) on edge devices efficiently, most existing CNN processors were built on quantized CNNs to optimize the inference operations. However, three issues (Fig. 9.2.1) have not been well addressed: 1) Duplicate weights in each kernel after quantization yielding repetitive multiplications; 2) a huge number of unnecessary MACs caused by ReLU activation functions; 3) frequent off-chip memory access in residual blocks.
| Year | Citations | |
|---|---|---|
Page 1
Page 1