Publication | Closed Access
Flexible high-resolution object detection on edge devices with tunable latency
96
Citations
33
References
2021
Year
Unknown Venue
Convolutional Neural NetworkEngineeringFeature DetectionMachine LearningVideo InterpretationEdge DevicesImage AnalysisPattern RecognitionEdge DetectionVision SensorMachine VisionObject Detection ModelsObject DetectionComputer EngineeringComputer ScienceVideo UnderstandingNeural NetworksComputer VisionEdge ComputingObject Recognition
Object detection is a fundamental building block of video analytics applications. While Neural Networks (NNs)-based object detection models have shown excellent accuracy on benchmark datasets, they are not well positioned for high-resolution images inference on resource-constrained edge devices. Common approaches, including down-sampling inputs and scaling up neural networks, fall short of adapting to video content changes and various latency requirements. This paper presents Remix, a flexible framework for high-resolution object detection on edge devices. Remix takes as input a latency budget, and come up with an image partition and model execution plan which runs off-the-shelf neural networks on non-uniformly partitioned image blocks. As a result, it maximizes the overall detection accuracy by allocating various amount of compute power onto different areas of an image. We evaluate Remix on public dataset as well as real-world videos collected by ourselves. Experimental results show that Remix can either improve the detection accuracy by 18%-120% for a given latency budget, or achieve up to 8.1× inference speedup with accuracy on par with the state-of-the-art NNs.
| Year | Citations | |
|---|---|---|
Page 1
Page 1