Publication | Closed Access
Convolutional Neural Networks on Dataflow Engines
10
Citations
14
References
2017
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningData ScienceHardware AccelerationEdge ComputingDfe Chip PropertiesConvolutional Neural NetworksComputer ArchitectureComputer EngineeringEmbedded Machine LearningComputer ScienceLatest GenerationParallel ComputingDeep LearningNeural Architecture Search
In this paper we discuss a high performance implementation for Convolutional Neural Networks (CNNs) inference on the latest generation of Dataflow Engines (DFEs). We discuss the architectural choices made during the design phase taking into account the DFE chip properties. We then perform design space exploration, considering the memory bandwidth and resources utilisation constraints derived from the used DFE and the chosen architecture. Finally, we discuss the high performance implementation and compare the obtained performance against other implementations, showing that our proposed design reaches 2,450 GOPS when running VGG16 as a test case.
| Year | Citations | |
|---|---|---|
Page 1
Page 1