Publication | Closed Access
Evaluating R-Based Big Data Analytic Frameworks
Citations: 10
References: 5
Year: 2015
Venue: Unknown
Cluster Computing, Engineering, Big Data Analytics, Map-reduce, Big Data Infrastructure, Big Data Model, Data Science, Data-intensive Platform, Management, Big Data Architecture, Data Integration, Big Data, Data Management, Statistics, Analytic Question, Computer Science, Cloud Computing, Parallel Programming, Runtime Performance, Vanilla Implementation, Massive Data Processing, Data Modeling
We study two approaches, rHadoop and H2O, to integrating R, a popular statistical programming environment, into the Hadoop Big Data ecosystem. We implement the solution to an analytic question on the on-time airline performance data set with each approach and with a vanilla MapReduce implementation. We then evaluate the differences in runtime performance and explain their causes in terms of the design principles of rHadoop and H2O.