Publication | Closed Access
VisReduce: Fast and responsive incremental information visualization of large datasets
39
Citations
33
References
2013
Year
Unknown Venue
Cluster ComputingEngineeringBig Data AnalyticsInteractive Data ExplorationData VisualizationVisual Analytics SytemsVisualization (Data Visualization)Interactive VisualizationData ScienceLarge DatasetsManagementData IntegrationBig DataData ManagementVisual AnalyticsBusiness VisualizationHigh-performance Data AnalyticsVisualization (Cognitive Psychology)Knowledge DiscoveryVisual Data MiningComputer ScienceVisualization (Biomedical Imaging)Parallel VisualizationData ProcessingCloudera ImpalaEnd-user ResponsivenessData Modeling
Performance and responsiveness of visual analytics sytems for exploratory data analysis of large datasets has been a long standing problem. We propose a method for incrementally computing visualizations in a distributed fashion by combining a modified MapReduce-style algorithm with a compressed columnar data store, resulting in significant improvements in performance and responsiveness for constructing commonly encountered information visualizations, e.g. bar charts, scatterplots, heat maps, cartograms and parallel coordinate plots. We compare our method with one that queries three other readily available database and data warehouse systems — PostgreSQL, Cloudera Impala and the MapReduce-based Apache Hive — in order to build visualizations. We show that our end-to-end approach allows for greater speed and guaranteed end-user responsiveness, even in the face of large, long-running queries.
| Year | Citations | |
|---|---|---|
Page 1
Page 1