Publication | Open Access
Maximizing CNN Accelerator Efficiency Through Resource Partitioning
292
Citations
32
References
2017
Year
Unknown Venue
Single ProcessorConvolutional Neural NetworkEngineeringMachine LearningHardware AccelerationCnn LayersConvolutional Neural NetworksComputer EngineeringComputer ArchitectureHardware AlgorithmDomain-specific AcceleratorComputer ScienceParallel ComputingDeep LearningNeural Architecture SearchModel CompressionComputer Vision
Convolutional neural networks (CNNs) are revolutionizing machine learning, but they present significant computational challenges. Recently, many FPGA-based accelerators have been proposed to improve the performance and efficiency of CNNs. Current approaches construct a single processor that computes the CNN layers one at a time; the processor is optimized to maximize the throughput at which the collection of layers is computed. However, this approach leads to inefficient designs because the same processor structure is used to compute CNN layers of radically varying dimensions.
| Year | Citations | |
|---|---|---|
Page 1
Page 1