Publication | Closed Access
Scalable Visual Analytics of Massive Textual Datasets
21
Citations
16
References
2007
Year
Unknown Venue
Cluster ComputingEngineeringInteractive Data ExplorationText MiningInteractive VisualizationInformation RetrievalData ScienceFirst Scalable ImplementationData IntegrationVisual AnalyticsKnowledge DiscoveryVisual Data MiningComputer ScienceParallel VisualizationText Processing EngineVisual Analytics ToolsParallel ProgrammingScalable Visual AnalyticsBig Data
This paper describes the first scalable implementation of a text processing engine used in visual analytics tools. These tools aid information analysts in interacting with and understanding large textual information content through visual interfaces. By developing a parallel implementation of the text processing engine, we enabled visual analytics tools to exploit cluster architectures and handle massive datasets. The paper describes key elements of our parallelization approach and demonstrates virtually linear scaling when processing multi-gigabyte data sets such as Pubmed. This approach enables interactive analysis of large datasets beyond capabilities of existing state-of-the art visual analytics tools.
| Year | Citations | |
|---|---|---|
Page 1
Page 1