Publication | Closed Access
7.3 A 28nm 38-to-102-TOPS/W 8b Multiply-Less Approximate Digital SRAM Compute-In-Memory Macro for Neural-Network Inference
46
Citations
4
References
2023
Year
Unknown Venue
EngineeringComputer ArchitectureIntegrated CircuitsNeural-network InferenceApproximate ComputingConventional Digital ArchitecturesComputing SystemsParallel ComputingPerformance ImprovementElectrical EngineeringComputer EngineeringOperand PrecisionComputer ScienceMicroelectronicsSignal ProcessingMemory Architecture38-To-102-tops/w 8BHardware AccelerationVlsi ArchitecturePrior Digital Cims
This paper presents a 2-to-8-b scalable digital SRAM-based CIM macro that is co-designed with a multiply-less neural-network (NN) design methodology and incorporates dynamic-logic-based approximate circuits for vector-vector operations. Digital CIMs enable high throughput and reliable matrix-vector multiplications (MVMs); however, digital CIMs face three major challenges to obtain further aggressive gains over conventional digital architectures: (1) prior digital CIMs exploiting approximate computation suffer from accuracy degradation [1]; (2) digital [2] and, as [3] predicted, mixed-signal CIMs [4], suffer from quadratic energy scaling with improving operand precision; (3) the tight and regular memory layout prevent <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">s</sup> CIMs from leveraging unstructured bit-level statistics.
| Year | Citations | |
|---|---|---|
Page 1
Page 1