Publication | Open Access
Hybrid network-on-chip architectures for accelerating deep learning kernels on heterogeneous manycore platforms
39
Citations
35
References
2016
Year
Unknown Venue
EngineeringMachine LearningComputer ArchitectureDeep Learning KernelsHigh-performance ArchitectureEmbedded Machine LearningParallel ComputingManycore ProcessorComputer EngineeringNetwork On ChipHybrid Network-on-chip ArchitecturesComputer ScienceDeep LearningNeural Architecture SearchHardware AccelerationHeterogeneous Manycore PlatformsEdge ComputingMany-core ArchitectureParallel ProgrammingHybrid Noc
In recent years, designing specialized manycore heterogeneous architectures for deep learning kernels has become an area of great interest. However, the typical on-chip communication infrastructures employed on conventional manycore platforms are unable to handle both CPU and GPU communication requirements efficiently. Hence, in this paper, our aim is to enhance the performance of heterogeneous manycore architectures through the design of a hybrid NoC consisting of both wireline and wireless links. To this end, we specifically target the resource-intensive backpropagation algorithm commonly used as the training method in deep learning. For backpropagation, the proposed hybrid NoC achieves 1.9X reduction in network latency and improves the network throughput by a factor of 2 with respect to a highly optimized mesh NoC. These network level improvements translate into 25% savings in full system energy-delay-product (EDP). This demonstrates the capability of the proposed hybrid and heterogeneous manycore architecture in accelerating deep learning kernels in an energy-efficient manner.
| Year | Citations | |
|---|---|---|
Page 1
Page 1