Publication | Closed Access
A 65nm 39GOPS/W 24-core processor with 11Tb/s/W packet-controlled circuit-switched double-layer network-on-chip and heterogeneous execution array
36
Citations
5
References
2013
Year
Unknown Venue
Cluster ComputingCmos LpEngineeringEnergy EfficiencyComputer ArchitectureInterconnection Network ArchitectureEmbedded SystemsProcessor ArchitectureHigh-performance ArchitectureComputing Systems24-Core ProcessorParallel ComputingManycore ProcessorHeterogeneous Execution ArrayTechnology Co-optimizationComputer EngineeringNetwork On ChipProgrammable Multi-core ProcessorsMany-core Architecture
With the increasing complexity and variety of applications, programmable multi-core processors are drawing attention due to their high flexibility and low implementation cost, yet their performance and energy efficiency still cannot fulfill the demands of many compute-intensive applications. This paper describes a high-performance energy-efficient 24-core processor for multi-media and communication applications, with the following key features: (1) a packet-controlled circuit-switched double-layer network-on-chip (NoC) which provides 11Tb/s/W energy efficiency with 435Gb/s bisection-bandwidth; (2) a cluster-shared NoC-connected heterogeneous reconfigurable execution array, which can improve the performance of frequently used computations in multimedia and communication applications by over 6×; (3) memory hierarchy improvements, including a multi-page foreground and background register file, and memory splitting and sharing. The processor, implemented in TSMC 65nm CMOS LP and occupying 18.8mm <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> (Fig. 3.6.7) operates at 850MHz at 1.2V, with 523mW power dissipation and 39GOPS/W (26pJ/operation) energy efficiency, which is 1.75× better than our former 16-core processor [3].
| Year | Citations | |
|---|---|---|
Page 1
Page 1