Publication | Closed Access
A 19.4-nJ/Decision, 364-K Decisions/s, In-Memory Random Forest Multi-Class Inference Accelerator
59
Citations
23
References
2018
Year
364-K Decisions/sEngineeringMachine LearningVlsi DesignAnalog DesignComputer ArchitectureIntegrated CircuitsRf ClassifierHardware SystemsData ScienceData MiningDecision TreeComputing SystemsDecision Tree LearningEmbedded Machine LearningParallel ComputingPerformance ImprovementMultiple Classifier SystemElectrical EngineeringBenchmark DatasetsComputer EngineeringComputer ScienceDeep LearningSignal ProcessingHardware AccelerationVlsi ArchitectureRandom ForestEnsemble Algorithm
This paper presents an integrated circuit (IC) realization of a random forest (RF) machine learning classifier in a 65-nm CMOS. Algorithm, architecture, and circuits are co-optimized to achieve aggressive energy and delay benefits by taking advantage of the inherent error resiliency derived from the ensemble nature of an RF classifier. Deterministic subsampling (DSS) and regularized decision trees reduce interconnect complexity, and avoid irregular memory access patterns and computations, thereby reducing the energy-delay product (EDP). The prototype IC also employs low-swing analog in-memory computations embedded in a standard 6T SRAM to enable massively parallel tree node comparisons, thereby minimizing the memory fetches and reducing the EDP further. The 65-nm CMOS prototype IC achieves a 3.1× and 2.2× improved energy efficiency and throughput leading to 6.8× lower EDP compared to a conventional digital system at the same accuracies of 94% and 97.5% for two tasks: 1) eight-class traffic sign recognition and 2) face detection, respectively.
| Year | Citations | |
|---|---|---|
Page 1
Page 1