Publication | Closed Access
StreamDM: Advanced Data Mining in Spark Streaming
49
Citations
10
References
2015
Year
Unknown Venue
Cluster ComputingEngineeringMachine LearningStreaming AlgorithmData Streaming ArchitectureStreaming DataReal-time AnalyticsData ScienceData MiningManagementData IntegrationData ManagementKnowledge DiscoveryComputer ScienceData Stream ManagementHuawei NoahSpark StreamingData Stream MiningBig Data
Real-time analytics are becoming increasingly important due to the large amount of data that is being created continuously. Drawing from our experiences at Huawei Noah's Ark Lab, we present and demonstrate here StreamDM, a new open source data mining and machine learning library, designed on top of Spark Streaming, an extension of the core Spark API that enables scalable stream processing of data streams. StreamDM is designed to be easily extended and used, either practitioners, developers, or researchers, and is the first library to contain advanced stream mining algorithms for Spark Streaming.
| Year | Citations | |
|---|---|---|
Page 1
Page 1