Publication | Closed Access
Correlation Analysis of Big Data to Support Machine Learning
Citations: 11
References: 5
Year: 2015
Unknown Venue
Big Data Acquisition, Data Modeling, Engineering, Data Science, Data Mining, Predictive Analytics, Knowledge Discovery, Management, Multidimensional Analysis, Regression Analysis, Data Analytics, Big Data Need, Support Machine Learning, Functional Data Analysis, Statistics, Quantitative Variables, Big Data, Big Data Model
The size and complexity of Big Data datasets call for specialized statistical tools, and we use R for the correlation analysis of our data set. This paper explores correlation analysis through best-fit linear regression of quantitative variables, demonstrated with scatter plots and the fitted regression line. The analysis shown here scales to Big Data in any other context where the quantitative variables are clearly delineated. R offers many techniques for statistical analysis and inference on a dataset; this paper, however, focuses on the correlation between quantitative variables, establishing the extent of dependence between them using R functions. The correlation and best-fit line functions of R, i.e. cor() and abline(lmout) respectively, are explored in detail.
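The two quantities the abstract refers to, the correlation coefficient and the best-fit line, can be sketched in a few lines. The example below is a minimal Python analogue, not the paper's R code: the function names `pearson_r` and `best_fit_line` and the sample values are hypothetical, chosen only for illustration. In R the same quantities would typically come from `cor(x, y)` and `lm(y ~ x)`, with `abline()` drawing the fitted line over a scatter plot.

```python
import math


def _centered_sums(xs, ys):
    """Return (mean_x, mean_y, Sxy, Sxx, Syy) for two equal-length samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length, at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return mean_x, mean_y, sxy, sxx, syy


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two quantitative variables."""
    _, _, sxy, sxx, syy = _centered_sums(xs, ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return sxy / math.sqrt(sxx * syy)


def best_fit_line(xs, ys):
    """Least-squares line y = intercept + slope * x, as (intercept, slope)."""
    mean_x, mean_y, sxy, sxx, _ = _centered_sums(xs, ys)
    if sxx == 0:
        raise ValueError("slope is undefined when x is constant")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


if __name__ == "__main__":
    # Hypothetical sample data, not taken from the paper.
    x = [1.0, 2.0, 3.0, 4.0, 5.0]
    y = [2.1, 3.9, 6.2, 7.8, 10.1]
    print("r =", pearson_r(x, y))
    print("intercept, slope =", best_fit_line(x, y))
```

A coefficient near +1 or -1 indicates a strong linear relationship between the two variables, while the slope and intercept define the line that would be overlaid on the scatter plot.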