Concepedia

Publication | Closed Access

Exploiting hierarchy in parallel computer networks to optimize collective operation performance

131

Citations

0

References

2000

Year

Abstract

The ecient implementation of collective communication operations has received much attention. Initial eorts modeled network communication and produced \\optimal" trees based on those models. However, the models used by these initial eorts assumed equal point-to-point latencies between any two processes. This assumption is violated in heterogeneous systems such as clusters of SMPs and wide-area \\computational grids", and as a result, collective operations that utilize the trees generated by these models perform suboptimally. In response, more recent work has focused on creating topology-aware trees for collective operations that minimize communication across slower channels (e.g., a wide-area network). While these efforts have signicant communication benets, they all limit their view of the network to only two layers. We present a strategy based upon a multilayer view of the network. By creating multilevel topology trees we take advantage of communication cost dierences at every lev...