Publication | Closed Access
Principal Curve Algorithms for Partitioning High-Dimensional Data Spaces
14
Citations
35
References
2011
Year
EngineeringComplexity ReductionLinear Partitioning StrategiesUnsupervised Machine LearningImage AnalysisData ScienceData MiningPattern RecognitionPrincipal Component AnalysisComputational GeometryGeometric ModelingComputer EngineeringMost Partitioning AlgorithmsComputer ScienceDimensionality ReductionMedical Image ComputingNonlinear Dimensionality ReductionNatural SciencesPrincipal Curve AlgorithmsPrincipal Curve Tree
Most partitioning algorithms iteratively partition a space into cells that contain underlying linear or nonlinear structures using linear partitioning strategies. The compactness of each cell depends on how well the (locally) linear partitioning strategy approximates the intrinsic structure. To partition a compact structure for complex data in a nonlinear context, this paper proposes a nonlinear partition strategy. This is a principal curve tree (PC-tree), which is implemented iteratively. Given that a PC passes through the middle of the data distribution, it allows for partitioning based on the arc length of the PC. To enhance the partitioning of a given space, a residual version of the PC-tree algorithm is developed, denoted here as the principal component analysis tree (PCR-tree) algorithm. Because of its residual property, the PCR-tree can yield the intrinsic dimension of high-dimensional data. Comparisons presented in this paper confirm that the proposed PC-tree and PCR-tree approaches show a better performance than several other competing partitioning algorithms in terms of vector quantization error and nearest neighbor search. The comparison also shows that the proposed algorithms outperform competing linear methods in total average coverage which measures the nonlinear compactness of partitioning algorithms.
| Year | Citations | |
|---|---|---|
Page 1
Page 1