Publication | Closed Access
Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators
23
Citations
22
References
2023
Year
Unknown Venue
Artificial IntelligenceEngineeringMachine LearningData ScienceEnergy EfficiencyHardware AccelerationSparse Neural NetworkHigh-performance ArchitectureComputer EngineeringComputer ArchitectureNeural Architecture SearchDomain-specific AcceleratorEmbedded Machine LearningParallel ProgrammingComputer SciencePortable MeanDeep LearningDeep Learning Accelerators
Deep Learning Accelerators (DLAs) are effective to improve both performance and energy efficiency of compute-intensive deep learning algorithms. A flexible and portable mean to exploit DLAs is using high-performance software libraries with well-established APIs, which are typically either manually implemented or automatically generated by exploration-based compilation approaches. Though exploration-based approaches significantly reduce programming efforts, they fail to find optimal or near-optimal programs from a large but low-quality search space because the massive inherent constraints of DLAs cannot be accurately characterized.
| Year | Citations | |
|---|---|---|
Page 1
Page 1