Publication | Closed Access
Memory system characterization of big data workloads
32
Citations
23
References
2013
Year
Unknown Venue
Cluster ComputingEngineeringIn-memory DatabaseComputer ArchitectureData ScienceMemory FootprintsParallel ComputingData ManagementMemory Access PatternsComputer EngineeringComputer ScienceData-intensive ComputingMemory ArchitectureAnalysis MethodologyExternal-memory AlgorithmCloud ComputingBig Data WorkloadsParallel ProgrammingIn-storage ComputingBig Data
Two recent trends that have emerged include (1) Rapid growth in big data technologies with new types of computing models to handle unstructured data, such as map-reduce and noSQL (2) A growing focus on the memory subsystem for performance and power optimizations, particularly with emerging memory technologies offering different characteristics from conventional DRAM (bandwidths, read/write asymmetries). This paper examines how these trends may intersect by characterizing the memory access patterns of various Hadoop and noSQL big data workloads. Using memory DIMM traces collected using special hardware, we analyze the spatial and temporal reference patterns to bring out several insights related to memory and platform usages, such as memory footprints, read-write ratios, bandwidths, latencies, etc. We develop an analysis methodology to understand how conventional optimizations such as caching, prediction, and prefetching may apply to these workloads, and discuss the implications on software and system design.
| Year | Citations | |
|---|---|---|
Page 1
Page 1