Publication | Closed Access
Using Spearman's correlation coefficients for exploratory data analysis on big dataset
307
Citations
11
References
2015
Year
Social Data AnalysisEngineeringApplication DomainCorrelation CoefficientsSummary Correlation AnalysisBig Data InfrastructureUnsupervised Machine LearningBig Data ModelData ScienceData MiningManagementExploratory Data AnalysisData IntegrationBig DataData ManagementStatisticsSocial Network AnalysisPredictive AnalyticsKnowledge DiscoveryBig DatasetBig Data AcquisitionNetwork ScienceData AnalyticsData Modeling
Correlation analysis is widely used and valuable for exploratory data analysis in social networking research. The study compares Pearson, Spearman, and Kendall correlation coefficients in terms of definition and application domain. The authors apply the nonparametric Spearman rank correlation to a pump vibration dataset of 207,880 variables to assess relationships with pump working states. High Spearman correlation percentages were identified for various state comparisons, providing valuable insights for future unsupervised machine learning research. © 2015 John Wiley & Sons, Ltd.
Summary Correlation analysis is both popular and useful in a number of social networking research, particularly in the exploratory data analysis. In this paper, three well‐known and often‐used correlation coefficients, Pearson product–moment correlation coefficient, Spearman, and Kendall rank correlation coefficients, are compared from definition to application domain. Based on the characteristics of the pump's vibration dataset, the nonparametric and distribution‐free Spearman rank correlation coefficient is introduced to analyze the relationship between the pump's working state and each of the 207′880 variables. The percentage of variables and exact variables' tables with high Spearman's correlation coefficients for states I and II, states I and III, states II and III, and three states in different files are obtained respectively, which has important valuation for the future research of the unsupervised machine learning system. Copyright © 2015 John Wiley & Sons, Ltd.
| Year | Citations | |
|---|---|---|
Page 1
Page 1