Publication | Closed Access
Locality of Reference in LU Decomposition with Partial Pivoting
158
Citations
9
References
1997
Year
Mathematical ProgrammingCluster ComputingEngineeringLu DecompositionComputer ArchitectureComputational ComplexityArray ComputingHigh-performance ArchitectureMultilinear Subspace LearningParallel ComputingLow-rank ApproximationMassively-parallel ComputingPartitioned AlgorithmComputer EngineeringInverse ProblemsComputer ScienceNew AlgorithmMemory ArchitectureExternal-memory AlgorithmMatrix FactorizationParallel Programming
This paper presents a new partitioned algorithm for LU decomposition with partial pivoting. The new algorithm, called the recursively partitioned algorithm, is based on a recursive partitioning of the matrix. The paper analyzes the locality of reference in the new algorithm and the locality of reference in a known and widely used partitioned algorithm for LU decomposition called the right-looking algorithm. The analysis reveals that the new algorithm performs a factor of $\Theta(\sqrt{M/n})$ fewer I/O operations (or cache misses) than the right-looking algorithm, where n is the order of the matrix and M is the size of primary memory. The analysis also determines the optimal block size for the right-looking algorithm. Experimental comparisons between the new algorithm and the right-looking algorithm show that an implementation of the new algorithm outperforms a similarly coded right-looking algorithm on six different RISC architectures, that the new algorithm performs fewer cache misses than any other algorithm tested, and that it benefits more from Strassen's matrix-multiplication algorithm.
| Year | Citations | |
|---|---|---|
Page 1
Page 1