Publication | Closed Access
The multicluster architecture: reducing cycle time through partitioning
129
Citations
10
References
2002
Year
Unknown Venue
Cluster ComputingEngineeringComputer ArchitectureProcessor ArchitectureCluster TechnologyHigh-performance ArchitectureParallel ComputingManycore ProcessorMassively-parallel ComputingComputer EngineeringStatic InstructionComputer ScienceMulticluster ArchitectureSpec92 BenchmarksProgram AnalysisPartition (Database)Cloud ComputingMany-core ArchitectureParallel ProgrammingSystem Software
The multicluster architecture that we introduce offers a decentralized, dynamically scheduled architecture, in which the register files, dispatch queue, and functional units of the architecture are distributed across multiple clusters, and each cluster is assigned a subset of the architectural registers. The motivation for the multicluster architecture is to reduce the clock cycle time, relative to a single-cluster architecture with the same number of hardware resources, by reducing the size and complexity of components on critical timing paths. Resource partitioning, however, introduces instruction-execution overhead and may reduce the number of concurrently executing instructions. To counter these two negative by-products of partitioning, we developed a static instruction scheduling algorithm. We describe this algorithm, and using trace-driven simulations of SPEC92 benchmarks, evaluate its effectiveness. This evaluation indicates that for the configurations considered the multicluster architecture may have significant performance advantages at feature sizes below 0.35 /spl mu/m, and warrants further investigation.
| Year | Citations | |
|---|---|---|
Page 1
Page 1