Publication | Open Access
Scalable and fault tolerant orthogonalization based on randomized distributed data aggregation
30
Citations
28
References
2013
Year
Cluster ComputingEngineeringData AggregationDistributed AlgorithmsNetwork AnalysisFault ToleranceDistributed Data ProcessingFault-tolerant MessagingDistributed Data AnalyticsData ScienceMatrix ComputationsParallel ComputingFault Tolerant OrthogonalizationComputer EngineeringNew Aggregation AlgorithmComputer ScienceHypercube TopologySignal ProcessingDistributed ProcessingComputational ScienceFault-tolerant NetworkDistributed ComputingParallel Programming
The construction of distributed algorithms for matrix computations built on top of distributed data aggregation algorithms with randomized communication schedules is investigated. For this purpose, a new aggregation algorithm for summing or averaging distributed values, the push-flow algorithm, is developed, which achieves superior resilience properties with respect to failures compared to existing aggregation methods. It is illustrated that on a hypercube topology it asymptotically requires the same number of iterations as the optimal all-to-all reduction operation and that it scales well with the number of nodes. Orthogonalization is studied as a prototypical matrix computation task. A new fault tolerant distributed orthogonalization method rdmGS, which can produce accurate results even in the presence of node failures, is built on top of distributed data aggregation algorithms.
| Year | Citations | |
|---|---|---|
Page 1
Page 1