Publication | Closed Access
INFless: a native serverless system for low-latency, high-throughput inference
Citations: 114
References: 15
Year: 2022
Unknown Venue
Artificial Intelligence, Serverless Architecture, Engineering, Machine Learning, Native Serverless System, Machine Learning Tool, Computer Architecture, Data Science, High-performance Architecture, Serverless Computing, Embedded Machine Learning, Parallel Computing, ML Services, Machine Learning Model, Computer Engineering, Low Latency, Computer Science, Modern Websites, Cloud Computing
Modern websites increasingly rely on machine learning (ML) to improve their business efficiency. Developing and maintaining ML services incurs high costs for developers. Although serverless systems are a promising way to reduce these costs, we find that current general-purpose serverless systems cannot meet the low-latency, high-throughput demands of ML services.