Publication | Open Access
Lightning: A Reconfigurable Photonic-Electronic SmartNIC for Fast and Energy-Efficient Inference
22
Citations
47
References
2023
Year
Unknown Venue
EngineeringAdvanced ComputingComputer ArchitectureReal-time Inference QueriesProgrammable PhotonicsOptical ComputingQuantum ComputingSystems EngineeringEmbedded Machine LearningPhotonic Integrated CircuitParallel ComputingPhotonicsPhysicsComputer EngineeringComputer ScienceReconfigurable ArchitectureDeep LearningPhotonic DeviceEnergy-efficient InferenceHardware AccelerationEdge ComputingDomain-specific AcceleratorChip SynthesisOptoelectronicsMassive Growth
The massive growth of machine learning-based applications and the end of Moore's law have created a pressing need to redesign computing platforms. We propose Lightning, the first reconfigurable photonic-electronic smartNIC to serve real-time deep neural network inference requests. Lightning uses a fast datapath to feed traffic from the NIC into the photonic domain without creating digital packet processing and data movement bottlenecks. To do so, Lightning leverages a novel reconfigurable count-action abstraction that keeps track of the required computation operations of each inference packet. Our count-action abstraction decouples the compute control plane from the data plane by counting the number of operations in each task and triggers the execution of the next task(s) without interrupting the dataflow. We evaluate Lightning's performance using four platforms: a prototype, chip synthesis, emulations, and simulations. Our prototype demonstrates the feasibility of performing 8-bit photonic multiply-accumulate operations with 99.25% accuracy. To the best of our knowledge, our prototype is the highest-frequency photonic computing system, capable of serving real-time inference queries at 4.055 GHz end-to-end. Our simulations with large DNN models show that compared to Nvidia A100 GPU, A100X DPU, and Brainwave smartNIC, Lightning accelerates the average inference serve time by 337×, 329×, and 42×, while consuming 352×, 419×, and 54× less energy, respectively.
| Year | Citations | |
|---|---|---|
Page 1
Page 1