Publication | Closed Access
3D NoC-Enabled Heterogeneous Manycore Architectures for Accelerating CNN Training
17
Citations
25
References
2017
Year
Unknown Venue
Accelerating Cnn TrainingEngineeringEnergy EfficiencyComputer ArchitectureDeep Learning TechnologyHigh-performance ArchitectureParallel ComputingComputer EngineeringNetwork On ChipComputer ScienceDeep LearningGpu ClusterGpu ArchitectureHardware AccelerationEdge ComputingConvolutional Neural NetworksMany-core ArchitectureDomain-specific AcceleratorParallel Programming
As deep learning technology is increasingly employed in diverse applications domains, the demand for computational power to enable these algorithms also increases. In this respect, high-performance three-dimensional (3D) heterogeneous manycore systems present a promising direction. However, deep learning on these systems pose several design challenges. First, the network-on-chip (NoC) must handle the traffic requirements of both CPU and GPU communications. Second, 3D system designs must address thermal issues resulting from high-power density. In this work, we propose a design methodology for a heterogeneous 3D NoC architecture that not only satisfies the traffic requirements of both CPUs and GPUs, but also reduces thermal hotspots. To this end, we target the training of two widely employed convolutional neural networks (CNN), namely, LeNet and CIFAR. By using our joint performance-thermal optimization methodology to create a 3D NoC for training CNNs, we reduce the maximum temperature by 22% while incurring only 5% full-system energy-delay-product degradation over a solely performance optimized 3D NoC. This demonstrates that, our design methodology achieves considerable temperature reduction with negligible loss in performance.
| Year | Citations | |
|---|---|---|
Page 1
Page 1