Publication | Open Access
Scale-up graph processing in the cloud
11
Citations
4
References
2014
Year
Unknown Venue
Cluster ComputingEngineeringDistributed AlgorithmsLarge GraphsScale-up Graph ProcessingGraph DatabaseGraph ProcessingData ScienceData-intensive PlatformParallel ComputingData ManagementComputer EngineeringGraph Processing AlgorithmComputer ScienceSecondary StorageData-intensive ComputingGraph AlgorithmScalable ComputingGraph TheoryCloud ComputingParallel ProgrammingDistributed Data StoreBig Data
Processing large graphs is an important part of the big-data problem. Recently a number of scale-up systems such as X-Stream, Graphchi and Turbograph have been proposed for processing large graphs using secondary storage on a single machine. The design and evaluation of these systems however have focused on physical machines. We expect that a natural evolution of such systems is to the cloud where a virtual machine would run the graph processing algorithm and access the graph from secondary storage remotely connected through the network. We evaluate a state of the art graph processing system called X-Stream in EC2 to identify challenges in this space. Our primary finding is that the network bandwidth between a virtual machine and remote storage becomes the limiter for performance. We show that this bottleneck can be somewhat alleviated through the use of VM local instance storage, network provisioning and compression.
| Year | Citations | |
|---|---|---|
Page 1
Page 1