Publication | Closed Access
Application Specific Datapath Extension with Distributed I/O Functional Units
32
Citations
16
References
2007
Year
Unknown Venue
Cluster ComputingEngineeringComputer ArchitectureComputational ComplexityProcessor ArchitectureSpeedup PotentialDatacenter-scale ComputingHardware SecurityHigh-performance ArchitectureSystems EngineeringParallel ComputingData ManagementInstruction-level ParallelismMassively-parallel ComputingData CommunicationComputer EngineeringComputer ScienceSpeedup Potential GrowsLegal Operation ClusterParallel Performance EvaluationParallel Programming
Performance of an application can be improved through augmenting the processor with application specific functional units (AFUs). Usually a cluster of operations identified from the application forms the behavior of an AFU. Several researchers studied the impact of input and output (I/O) constraints for a legal operation cluster on the overall achievable speedup. The general observation is that the speedup potential grows with the relaxation of I/O constraints. Going further, in this paper, the authors investigate the speedup potential of AFUs in the absence of I/O constraints. Design challenge in the absence of I/O constraints is addressed in a very practical manner, through the identification of maximal convex subgraphs. Usually the available register ports are few but the number of inputs/outputs of the identified patterns are likely to be large. The authors solve the register port limitation by the design of distributed I/O functional units, in which the operands are communicated in multiple cycles. The experimental results show that selection of maximal clusters achieves average 50% higher speedup than selecting I/O constrained operation clusters. Also, our identification algorithm runs 2 to 3 orders faster than an exhaustive identification approach
| Year | Citations | |
|---|---|---|
Page 1
Page 1