Publication | Open Access
Reactive NUCA
382
Citations
47
References
2009
Year
Unknown Venue
Hardware SecurityCluster ComputingEngineeringAggregate Cache CapacityEdge ComputingHigh-performance ArchitectureCloud ComputingCoherence MechanismsComputer ArchitectureComputer EngineeringCachingMany-core ArchitectureNetwork On ChipParallel ProgrammingParallel ComputingShared Cache DesignManycore Processor
Increases in on-chip communication delay and the large working sets of server and scientific workloads complicate the design of the on-chip last-level cache for multicore processors. The large working sets favor a shared cache design that maximizes the aggregate cache capacity and minimizes off-chip memory requests. At the same time, the growing on-chip communication delay favors core-private caches that replicate data to minimize delays on global wires. Recent hybrid proposals offer lower average latency than conventional designs, but they address the placement requirements of only a subset of the data accessed by the application, require complex lookup and coherence mechanisms that increase latency, or fail to scale to high core counts.
| Year | Citations | |
|---|---|---|
Page 1
Page 1