Publication | Closed Access
Druid
165
Citations
32
References
2014
Year
Unknown Venue
Cluster ComputingEngineeringData ScienceFlexible FiltersBig Data IndexingData IntegrationParallel ProgrammingComputer ScienceMap-reduceParallel ComputingData ManagementColumn-oriented Storage LayoutData-intensive ComputingReal-time Exploratory AnalyticsMassive Data ProcessingBig DataHigh-performance Data Analytics
Druid is an open source data store designed for real-time exploratory analytics on large data sets. The system combines a column-oriented storage layout, a distributed, shared-nothing architecture, and an advanced indexing structure to allow for the arbitrary exploration of billion-row tables with sub-second latencies. In this paper, we describe Druid's architecture, and detail how it supports fast aggregations, flexible filters, and low latency data ingestion.
| Year | Citations | |
|---|---|---|
Page 1
Page 1