Concepedia

Publication | Closed Access

AKG: automatic kernel generation for neural processing units using polyhedral transformations

66

Citations

62

References

2021

Year

Abstract

Existing tensor compilers have proven their effectiveness in deploying deep neural networks on general-purpose hardware like CPU and GPU, but optimizing for neural processing units (NPUs) is still challenging due to the heterogeneous compute units and complicated memory hierarchy.

References

YearCitations

Page 1