Publication | Closed Access
An empirical study of the scalability aspects of instruction distribution algorithms for clustered processors
36
Citations
15
References
2005
Year
Unknown Venue
Cluster ComputingEngineeringWire DelaysSub-micron Technology EraComputer ArchitectureComputational ComplexityClustered ProcessorsInterconnection Network ArchitectureProcessor ArchitectureHardware SecurityInstruction Distribution AlgorithmsHigh-performance ArchitectureSystems EngineeringParallel ComputingManycore ProcessorInstruction-level ParallelismComputer EngineeringInterconnection NetworkComputer SciencePerformance ScalabilityScalability AspectsParallel Performance EvaluationCluster Window SizeMultiprocessor SystemParallel Programming
In the sub-micron technology era, wire delays are becoming much more important than gate delays, making it particularly attractive to go for decentralized processors. A number of algorithms have already been proposed for distributing instructions among multiple clusters. In this paper we qualitatively and quantitatively analyze the effect of various hardware parameters on the scalability of different instruction distribution algorithms. Using a set of realistic system parameters, we examine performance differences resulting from different distribution algorithms as well as from specific implementation issues such as the type of interconnect, the fetch size, the cluster issue width, and the cluster window size. Our studies have found that those distribution algorithms that perform relatively better with 4 or fewer clusters are generally not the best ones for a larger number of clusters. Also, the relative performance and scalability of the algorithms are sensitive to different hardware parameters. We also found that, among the existing algorithms, there is no single algorithm that works uniformly best across all hardware configurations. This motivates the need to develop alternate interconnects and instruction distribution algorithms.
| Year | Citations | |
|---|---|---|
Page 1
Page 1