Skip to main navigation Skip to search Skip to main content

A Carbon-Efficient Framework for Deep Learning Workloads on GPU Clusters

  • Dong Ki Kang
  • , Yong Hyuk Moon*
  • *Corresponding author for this work
  • Sungshin Women's University

Research output: Contribution to journalJournal articlepeer-review

Abstract

The explosive growth of artificial intelligence (AI) services has led to massive scaling of GPU computing clusters, causing sharp rises in power consumption and carbon emissions. Although hardware-level accelerator enhancements and deep neural network (DNN) model compression techniques can improve power efficiency, they often encounter deployment barriers and risks of accuracy loss in practice. To address these issues without altering hardware or model architectures, we propose a novel Carbon-Aware Resource Management (CA-RM) framework for GPU clusters. In order to minimize the carbon emission, the CA-RM framework dynamically adjusts energy usage by combining real-time GPU core frequency scaling with intelligent workload placement, aligning computation with the temporal availability of renewable generation. We introduce a new metric, performance-per-carbon (PPC), and develop three optimization formulations: carbon-constrained, performance-constrained, and PPC-driven objectives that simultaneously respect DNN model training deadlines, inference latency requirements, and carbon emission budgets. Through extensive simulations using real-world renewable energy traces and profiling data collected from NVIDIA RTX4090 GPU running representative DNN workloads, we show that the CA-RM framework substantially reduces carbon emission while satisfying service-level agreement (SLA) targets across a wide range of workload characteristics. Through experimental evaluation, we verify that the proposed CA-RM framework achieves approximately 35% carbon reduction on average, compared to competing approaches, while still ensuring acceptable processing performance across diverse workload behaviors.

Original languageEnglish
Article number633
JournalApplied Sciences (Switzerland)
Volume16
Issue number2
DOIs
StatePublished - 2026.01

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy
  2. SDG 13 - Climate Action
    SDG 13 Climate Action

Keywords

  • carbon emission
  • deep neural network
  • frequency scaling
  • inference
  • renewable energy
  • training

Fingerprint

Dive into the research topics of 'A Carbon-Efficient Framework for Deep Learning Workloads on GPU Clusters'. Together they form a unique fingerprint.

Cite this