Publication | Closed Access
An Improved Replica Placement Policy for Hadoop Distributed File System Running on Cloud Platforms
34
Citations
13
References
2017
Year
Unknown Venue
Distributed File SystemCluster ComputingReplica AssignmentEngineeringEdge ComputingCloud ComputingFile SystemsStorage ManagementDistributed Data StoreCloud PlatformsCloud Load BalancingLoad Balance UtilityParallel StorageDistributed CloudParallel ComputingLoad BalanceParallel File SystemData Management
Load balance is a crucial issue for data-intensive computing on cloud platforms, because a load balanced cluster can significantly improve the completion time of data-intensive jobs. In this paper, we present an improved replica placement policy for Hadoop Distributed File System (HDFS), which is specifically designed for heterogeneous clusters. The HDFS replica placement policy cannot generate balanced replica assignment, and hence has to rely on a load balance utility to balance the load among cluster nodes. In contrast, our proposed policy can generate perfectly even replica assignment, and also achieve load balance among cluster nodes in any heterogeneous or homogeneous environments without the running of the load balance utility.
| Year | Citations | |
|---|---|---|
Page 1
Page 1