Publication | Open Access
UMAP as a Dimensionality Reduction Tool for Molecular Dynamics Simulations of Biomacromolecules: A Comparison Study
108
Citations
54
References
2021
Year
EngineeringMolecular BiologyComputational ChemistryMolecular DynamicsMolecular DesignMolecular GraphicData ScienceMolecular SimulationPrincipal Component AnalysisComputational BiochemistryBiophysicsSystems BiologyManifold LearningProtein ModelingDimensionality ReductionNonlinear Dimensionality ReductionBioinformaticsDimensionality Reduction ToolComputational BiologyDimensionality Reduction MethodMolecular Dynamics SimulationsComparison StudyMedicineComputational Biophysics
Proteins are the molecular machines of life. The multitude of possible conformations that proteins can adopt determines their free-energy landscapes. However, the inherently high dimensionality of a protein free-energy landscape poses a challenge to deciphering how proteins perform their functions. For this reason, dimensionality reduction is an active field of research for molecular biologists. The uniform manifold approximation and projection (UMAP) is a dimensionality reduction method based on a fuzzy topological analysis of data. In the present study, the performance of UMAP is compared with that of other popular dimensionality reduction methods such as t-distributed stochastic neighbor embedding (t-SNE), principal component analysis (PCA), and time-structure independent components analysis (tICA) in the context of analyzing molecular dynamics simulations of the circadian clock protein VIVID. A good dimensionality reduction method should accurately represent the data structure on the projected components. The comparison of the raw high-dimensional data with the projections obtained using different dimensionality reduction methods based on various metrics showed that UMAP has superior performance when compared with linear reduction methods (PCA and tICA) and has competitive performance and scalable computational cost.
| Year | Citations | |
|---|---|---|
Page 1
Page 1