Publication | Open Access
Optimal algorithms for smooth and strongly convex distributed optimization in networks
189
Citations
16
References
2017
Year
Mathematical ProgrammingAccelerated Gradient DescentEngineeringDistributed AlgorithmsOptimal AlgorithmsNetwork AnalysisLocal ComputationsParallel ComputingCombinatorial OptimizationNetwork OptimizationApproximation TheoryContinuous OptimizationOptimal Convergence RatesDistributed OptimizationDistributed Constraint OptimizationComputer ScienceNetwork ScienceGraph TheoryOptimization ProblemConvex OptimizationBusiness
In this paper, we determine the optimal convergence rates for strongly convex and smooth distributed optimization in two settings: centralized and decentralized communications over a network. For centralized (i.e. master/slave) algorithms, we show that distributing Nesterov's accelerated gradient descent is optimal and achieves a precision $\varepsilon > 0$ in time $O(\sqrt{κ_g}(1+Δτ)\ln(1/\varepsilon))$, where $κ_g$ is the condition number of the (global) function to optimize, $Δ$ is the diameter of the network, and $τ$ (resp. $1$) is the time needed to communicate values between two neighbors (resp. perform local computations). For decentralized algorithms based on gossip, we provide the first optimal algorithm, called the multi-step dual accelerated (MSDA) method, that achieves a precision $\varepsilon > 0$ in time $O(\sqrt{κ_l}(1+\fracτ{\sqrtγ})\ln(1/\varepsilon))$, where $κ_l$ is the condition number of the local functions and $γ$ is the (normalized) eigengap of the gossip matrix used for communication between nodes. We then verify the efficiency of MSDA against state-of-the-art methods for two problems: least-squares regression and classification by logistic regression.
| Year | Citations | |
|---|---|---|
Page 1
Page 1