Abstract
The explosive growth of artificial intelligence (AI) services has led to massive scaling of GPU computing clusters, causing sharp rises in power consumption and carbon emissions. Although hardware-level accelerator enhancements and deep neural network (DNN) model compression techniques can improve power efficiency, they often encounter deployment barriers and risks of accuracy loss in practice. To address these issues without altering hardware or model architectures, we propose a novel Carbon-Aware Resource Management (CA-RM) framework for GPU clusters. In order to minimize the carbon emission, the CA-RM framework dynamically adjusts energy usage by combining real-time GPU core frequency scaling with intelligent workload placement, aligning computation with the temporal availability of renewable generation. We introduce a new metric, performance-per-carbon (PPC), and develop three optimization formulations: carbon-constrained, performance-constrained, and PPC-driven objectives that simultaneously respect DNN model training deadlines, inference latency requirements, and carbon emission budgets. Through extensive simulations using real-world renewable energy traces and profiling data collected from NVIDIA RTX4090 GPU running representative DNN workloads, we show that the CA-RM framework substantially reduces carbon emission while satisfying service-level agreement (SLA) targets across a wide range of workload characteristics. Through experimental evaluation, we verify that the proposed CA-RM framework achieves approximately 35% carbon reduction on average, compared to competing approaches, while still ensuring acceptable processing performance across diverse workload behaviors.
| Original language | English |
|---|---|
| Article number | 633 |
| Journal | Applied Sciences (Switzerland) |
| Volume | 16 |
| Issue number | 2 |
| DOIs | |
| State | Published - 2026.01 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
-
SDG 13 Climate Action
Keywords
- carbon emission
- deep neural network
- frequency scaling
- inference
- renewable energy
- training
Fingerprint
Dive into the research topics of 'A Carbon-Efficient Framework for Deep Learning Workloads on GPU Clusters'. Together they form a unique fingerprint.Press/Media
-
Data from Jeonbuk National University Update Knowledge in Applied Sciences (A Carbon-Efficient Framework for Deep Learning Workloads on GPU Clusters)
26.02.5
1 item of Media coverage
Press/Media
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver