Publication | Closed Access
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs
44
Citations
18
References
2018
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningBiometricsHardware AlgorithmFace RecognitionInception ModulesFast Fourier TransformFace DetectionFacial Recognition SystemImage AnalysisPattern RecognitionVideo TransformerMachine VisionComputer EngineeringComputer ScienceSwiss KnifeDeep LearningNeural Architecture SearchComputer VisionHardware Acceleration
Deep Convolutional Neural Networks (CNN) have become a Swiss knife in solving critical arti cial intelligence tasks. However, deploying deep CNN models for latency-critical tasks remains to be challenging because of the complex nature of CNNs. Recently, FPGA has become a favorable device to accelerate deep CNNs thanks to its high parallel processing capability and energy e ciency. In this work, we explore di erent fast convolution algorithms including Winograd and Fast Fourier Transform (FFT), and nd an optimal strategy to apply them together on di erent types of convolutions. We also propose an optimization scheme to exploit parallelism on novel CNN architectures such as Inception modules in GoogLeNet. We implement a con gurable IP-based face recognition acceler- ation system based on FaceNet using High-Level Synthesis. Our implementation on a Xilinx Ultrascale device achieves 3.75x la- tency speedup compared to a high-end NVIDIA GPU and surpasses previous FPGA results signi cantly.
| Year | Citations | |
|---|---|---|
Page 1
Page 1