Publication | Open Access
Finding Low-rank Solutions to Matrix Problems, Efficiently and Provably.
32
Citations
71
References
2016
Year
Mathematical ProgrammingLarge-scale Global OptimizationEngineeringMachine LearningMatrix ProblemsGeneric Convex FMatrix FactorizationProduct UvRank-r MatricesDerivative-free OptimizationLarge Scale OptimizationInverse ProblemsComputer ScienceMatrix AnalysisApproximation TheoryLow-rank Approximation
A rank-r matrix X \in R^{m x n} can be written as a product UV', where U \in R^{m x r} and V \in R^{n x r}. One could exploit this observation in optimization: e.g., consider the minimization of a convex function f(X) over rank-r matrices, where the scaffold of rank-r matrices is modeled via the factorization in U and V variables. Such heuristic has been widely used before for specific problem instances, where the solution sought is (approximately) low-rank. Though such parameterization reduces the number of variables and is more efficient in computational speed and memory requirement (of particular interest is the case r << min{m, n}), it comes at a cost: f(UV') becomes a non-convex function w.r.t. U and V. In this paper, we study such parameterization in optimization of generic convex f and focus on first-order, gradient descent algorithmic solutions. We propose an algorithm we call the Bi-Factored Gradient Descent (BFGD) algorithm, an efficient first-order method that operates on the U, V factors. We show that when f is smooth, BFGD has local sublinear convergence, and linear convergence when f is both smooth and strongly convex. Moreover, for several key applications, we provide simple and efficient initialization schemes that provide approximate solutions good enough for the above convergence results to hold.
| Year | Citations | |
|---|---|---|
Page 1
Page 1