Publication | Closed Access
Running sparse and low-precision neural network: When algorithm meets hardware
14
Citations
23
References
2018
Year
Artificial IntelligenceConvolutional Neural NetworkEngineeringMachine LearningHardware AlgorithmComputer ArchitectureDnn AlgorithmApproximate ComputingSparse Neural NetworkEmbedded Machine LearningParallel ComputingComputer EngineeringComputer ScienceDeep LearningNeural Architecture SearchModel CompressionDeep Neural NetworksHardware AccelerationParallel ProgrammingModel RedundancyAlgorithm Meets
Deep Neural Networks (DNNs) are pervasively applied in many artificial intelligence (AI) applications. The high performance of DNNs comes at the cost of larger size and higher compute complexity. Recent studies show that DNNs have much redundancy, such as the zero-value parameters and excessive numerical precision. To reduce computing complexity, many redundancy reduction techniques have been proposed, including pruning and data quantization. In this paper, we demonstrate our co-optimization of the DNN algorithm and hardware which exploits the model redundancy to accelerate DNNs.
| Year | Citations | |
|---|---|---|
Page 1
Page 1