Publication | Open Access
Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization
16
Citations
10
References
2017
Year
Mathematical ProgrammingNumerical AnalysisLarge-scale Global OptimizationEngineeringMachine LearningScalable Parallel MethodStructural OptimizationUnconstrained OptimizationGroup LassoData ScienceSparse Neural NetworkEmpirical Risk MinimizationDerivative-free OptimizationParallel ComputingIncremental Gradient AlgorithmContinuous OptimizationComputer EngineeringLarge Scale OptimizationInverse ProblemsComputer ScienceDeep LearningNondifferentiable OptimizationAdaptive OptimizationModel OptimizationParallel LearningParallel ProgrammingNonsmooth BarrierComposite Optimization
Due to their simplicity and excellent performance, parallel asynchronous variants of stochastic gradient descent have become popular methods to solve a wide range of large-scale optimization problems on multi-core architectures. Yet, despite their practical success, support for nonsmooth objectives is still lacking, making them unsuitable for many problems of interest in machine learning, such as the Lasso, group Lasso or empirical risk minimization with convex constraints. In this work, we propose and analyze ProxASAGA, a fully asynchronous sparse method inspired by SAGA, a variance reduced incremental gradient algorithm. The proposed method is easy to implement and significantly outperforms the state of the art on several nonsmooth, large-scale problems. We prove that our method achieves a theoretical linear speedup with respect to the sequential version under assumptions on the sparsity of gradients and block-separability of the proximal term. Empirical benchmarks on a multi-core architecture illustrate practical speedups of up to 12x on a 20-core machine.
| Year | Citations | |
|---|---|---|
Page 1
Page 1