Publication | Closed Access
Hardware Implementation on FPGA for Task-Level Parallel Dataflow Execution Engine
20
Citations
36
References
2015
Year
Heterogeneous ComputingEngineeringHardware AlgorithmComputer ArchitectureParallel ImplementationHardware SecurityHigh-performance ArchitectureHeterogeneous Multicore PlatformParallel ComputingManycore ProcessorTask DependenciesHardware ImplementationComputer EngineeringComputer ScienceFpga DesignHardware AccelerationProgram AnalysisInstruction LevelMany-core ArchitectureParallel ProgrammingData-level Parallelism
Heterogeneous multicore platform has been widely used in various areas to achieve both power efficiency and high performance. However, it poses significant challenges to researchers to uncover more coarse-grained task level parallelization. In order to support automatic task parallel execution, this paper proposes a FPGA implementation of a hardware out-of-order scheduler on heterogeneous multicore platform. The scheduler is capable of exploring potential inter-task dependency, leading to a significant acceleration of dependence-aware applications. With the help of renaming scheme, the task dependencies are detected automatically during execution, and then task-level Write-After-Write (WAW) and Write-After-Read (WAR) dependencies can be eliminated dynamically. We extended the instruction level renaming techniques to perform task-level out-of-order execution, and implemented a prototype on a state-of-art Xilinx Virtex-5 FPGA device. Given the reconfigurable characteristic of FPGA, our scheduler supports changing accelerators at runtime to improve the flexibility. Experimental results demonstrate that our scheduler is efficient at both performance and resources usage.
| Year | Citations | |
|---|---|---|
Page 1
Page 1