Publication | Closed Access
SubFlow: A Dynamic Induced-Subgraph Strategy Toward Real-Time DNN Inference and Training
47
Citations
112
References
2020
Year
Unknown Venue
Artificial IntelligenceConvolutional Neural NetworkDeep Neural NetworksEngineeringMachine LearningData ScienceGraph Neural NetworkSparse Neural NetworkComputer EngineeringNeural Architecture SearchEmbedded Machine LearningComputer ScienceDeep LearningSubflow OperationsDeep Neural NetworkModel CompressionEnable SubflowSpeech Recognition
We introduce SubFlow-a dynamic adaptation and execution strategy for a deep neural network (DNN), which enables real-time DNN inference and training. The goal of SubFlow is to complete the execution of a DNN task within a timing constraint that may dynamically change while ensuring comparable performance to executing the full network by executing a subset of the DNN at run-time. To this end, we propose two online algorithms that enable SubFlow: 1) dynamic construction of a sub-network which constructs the best subnetwork of the DNN in terms of size and configuration, and 2) time-bound execution which executes the sub-network within a given time budget either for inference or training. We implement and open-source SubFlow by extending TensorFlow with full compatibility by adding SubFlow operations for convolutional and fully-connected layers of a DNN. We evaluate SubFlow with three popular DNN models (LeNet-5, AlexNet, and KWS), which shows that it provides flexible run-time execution and increases the utility of a DNN under dynamic timing constraints, e.g., lx-6.7x range of dynamic execution speed with average -3% of performance (inference accuracy) difference. We also implement an autonomous robot as an example system that uses SubFlow and demonstrate that its obstacle detection DNN is flexibly executed to meet a range of deadlines that varies depending on its running sped.
| Year | Citations | |
|---|---|---|
Page 1
Page 1