Concepedia

Publication | Closed Access

INFless: a native serverless system for low-latency, high-throughput inference

114

Citations

15

References

2022

Year

Abstract

Modern websites increasingly rely on machine learning (ML) to improve their business efficiency. Developing and maintaining ML services incurs high costs for developers. Although serverless systems are a promising solution to reduce costs, we find that the current general purpose serverless systems cannot meet the low latency, high throughput demands of ML services.

References

YearCitations

Page 1