Publication | Closed Access
A Divided Regression Analysis for Big Data
23
Citations
5
References
2015
Year
EngineeringBig Data AnalyticsMany Statistical MethodsBig Data InfrastructureBig Data ModelData ScienceData MiningManagementDivided Regression AnalysisBig DataData ManagementStatisticsPredictive AnalyticsKnowledge DiscoveryDivided Regression ModelComputer ScienceBig Data AcquisitionCloud ComputingStatistical InferenceMassive Data ProcessingData Modeling
Statistics is an important part in big data because many statistical methods are used for big data analysis. The aim of statistics is to estimate population using the sample extracted from the population, so statistics is to analyze not the population but the sample. But in big data environment, we can get the big data set closed to the population by the advanced computing systems such as cloud computing and high-speed internet. According to the circumstances, we can analyze entire part of big data like the population of statistics. But we may be impossible to analyze the entire data because of its huge data volume. So, in this paper, we propose a new analytical methodology for big data analysis in regression problem for reducing the computing burden. We call this a divided regression analysis. To verify the performance of our divided regression model, we carry out experiment and simulation.
| Year | Citations | |
|---|---|---|
Page 1
Page 1