Publication | Closed Access
Input sparsity time low-rank approximation via ridge leverage score sampling
64
Citations
0
References
2017
Year
Mathematical ProgrammingEngineeringData ScienceRegularization (Mathematics)Approximation TheoryStatisticsLow-rank ApproximationInverse ProblemsComputer ScienceDimensionality ReductionNew AlgorithmRepresentative SubsetRidge LeverageSparse RepresentationHigh-dimensional MethodMatrix FactorizationCompressive SensingStatistical InferenceMatrix A
We present a new algorithm for finding a near optimal low-rank approximation of a matrix A in O(nnz(A)) time. Our method is based on a recursive sampling scheme for computing a representative subset of A's columns, which is then used to find a low-rank approximation.This approach differs substantially from prior O(nnz(A)) time algorithms, which are all based on fast Johnson-Lindenstrauss random projections. Our algorithm matches the guarantees of the random projection methods while offering a number of advantages.In addition to better performance on sparse and structured data, sampling algorithms can be applied in settings where random projections cannot. For example, we give new streaming algorithms for the column subset selection and projection-cost preserving sample problems. Our method has also been used in the fastest algorithms for provably accurate Nystrom approximation of kernel matrices [56].