Publication | Closed Access
Design considerations for distributed caching on the Internet
182
Citations
40
References
2003
Year
Unknown Venue
Distributed File SystemCluster ComputingEngineeringEdge ComputingDistributed CachingCloud ComputingIntegrated ArchitectureComputer ArchitectureDistributed Data StoreCachingMobile ComputingCache SystemsParallel ComputingData ReplicationData ManagementResponse TimeWeb Cache
We describe the design and implementation of an integrated architecture for cache systems that scale to hundreds or thousands of caches with thousands to millions of users. Rather than simply try to maximize hit rates, we take an end-to-end approach to improving response time by also considering hit times and miss times. We begin by studying several Internet caches and workloads, and we derive three core design principles for large scale distributed caches: minimize the number of hops to locate and access data on both hits and misses; share data among many users and scale to many caches; and cache data close to clients. Our strategies for addressing these issues are built around a scalable, high-performance data-location service that tracks where objects are replicated. We describe how to construct such a service and how to use this service to provide direct access to remote data and push-based data replication. We evaluate our system through trace-driven simulation and find that these strategies together provide response time speedups of 1.27 to 2.43 compared to a traditional three-level cache hierarchy for a range of trace workloads and simulated environments.
| Year | Citations | |
|---|---|---|
Page 1
Page 1