Publication | Open Access
Characterizing Power Management Opportunities for LLMs in the Cloud
40
Citations
33
References
2024
Year
Unknown Venue
Cluster ComputingEngineeringComputer ArchitectureDatacenter GpusHigh Performance ComputingPower Management OpportunitiesCloud Resource ManagementDatacenter-scale ComputingLarge Language ModelsData ScienceDistributed CloudParallel ComputingData ManagementPower ManagementComputer EngineeringComputer ScienceRecent InnovationScalable ComputingData Center ManagementSmart GridEnergy ManagementEdge ComputingCloud ComputingParallel ProgrammingBig Data
Recent innovation in large language models (LLMs), and their myriad use cases have rapidly driven up the compute demand for datacenter GPUs. Several cloud providers and other enterprises plan to substantially grow their datacenter capacity to support these new workloads. A key bottleneck resource in datacenters is power, which LLMs are quickly saturating due to their rapidly increasing model sizes.
| Year | Citations | |
|---|---|---|
Page 1
Page 1