Publication | Closed Access
End-to-End Machine Learning with Apache AsterixDB
13
Citations
16
References
2018
Year
Unknown Venue
Artificial IntelligenceEngineeringMachine LearningMachine Learning ToolBig Data AnalyticsBig Data InfrastructureBig Data ModelInformation RetrievalData ScienceData MiningManagementBig Data ArchitectureData IntegrationBig DataData ManagementHigh-performance Data AnalyticsBenchmark DatasetsMachine Learning ModelKnowledge DiscoveryComputer ScienceApache AsterixdbCloud ComputingMassive Data ProcessingData Modeling
Recent developments in machine learning and data science provide a foundation for extracting underlying information from Big Data. Unfortunately, current platforms and tools often require data scientists to glue together and maintain custom-built platforms consisting of multiple Big Data component technologies. In this paper, we explain how Apache AsterixDB, an open source Big Data Management System, can help to reduce the burden involved in using machine learning algorithms in Big Data analytics. In particular, we describe how AsterixDB's built-in support for user-defined functions (UDFs), the availability of UDFs in data ingestion pipelines and queries, and the provision of machine learning platform and notebook inter-operation capabilities can together enable data analysts to more easily create and manage end-to-end analytical dataflows.
| Year | Citations | |
|---|---|---|
Page 1
Page 1