Publication | Closed Access
Design of a parallel vector access unit for SDRAM memory systems
52
Citations
14
References
2002
Year
Unknown Venue
Hardware SecuritySdram SystemEngineeringHardware AccelerationMemory BottleneckHigh-performance ArchitectureCloud ComputingComputer EngineeringComputer ArchitecturePva DesignParallel ProgrammingComputer ScienceParallel ComputingVector ProcessingVirtual MemoryMemory ArchitectureSdram Memory SystemsIn-memory Computing
We are attacking the memory bottleneck by building a "smart" memory controller that improves effective memory bandwidth, bus utilization, and cache efficiency by letting applications dictate how their data is accessed and cached. This paper describes a parallel vector access unit (PVA), the vector memory subsystem that efficiently "gathers" sparse, strided data structures in parallel on a multi-bank SDRAM memory. We have validated our PVA design via gate-level simulation, and have evaluated its performance via functional simulation and formal analysis. On unit-stride vectors, PVA performance equals or exceeds that of an SDRAM system optimized for cache line fills. On vectors with larger strides, the PVA is up to 32.8 times faster. Our design is up to 3.3 times faster than a pipelined, serial SDRAM memory system that gathers sparse vector data, and the gathering mechanism is two to five times faster than in other PVAs with similar goals. Our PVA only slightly increases hardware complexity with respect to these other systems, and the scalable design is appropriate for a range of computing platforms, from vector supercomputers to commodity PCs.
| Year | Citations | |
|---|---|---|
Page 1
Page 1