Skip to main navigation Skip to search Skip to main content

Reinforcement Learning-Based Optimization of Environmental Control Systems in Battery Energy Storage Rooms

  • So Yeon Park
  • , Deun Chan Kim
  • , Jun Ho Bang*
  • *Corresponding author for this work
  • Jeonbuk National University

Research output: Contribution to journalJournal articlepeer-review

Abstract

This study proposes a reinforcement learning (RL)-based optimization framework for the environmental control system of battery rooms in Energy Storage Systems (ESS). Conventional rule-based air-conditioning strategies are unable to adapt to real-time temperature and humidity fluctuations, often leading to excessive energy consumption or insufficient thermal protection. To overcome these limitations, both value-based (DQN, Double DQN, Dueling DQN) and policy-based (Policy Gradient, PPO, TRPO) RL algorithms are implemented and systematically compared. The algorithms are trained and evaluated using one year of real ESS operational data and corresponding meteorological data sampled at 15-min intervals. Performance is assessed in terms of convergence speed, learning stability, and cooling-energy consumption. The experimental results show that the DQN algorithm reduces time-averaged cooling power consumption by 46.5% compared to conventional rule-based control, while maintaining temperature, humidity, and dew-point constraint violation rates below 1% throughout the testing period. Among the policy-based methods, the Policy Gradient algorithm demonstrates competitive energy-saving performance but requires longer training time and exhibits higher reward variance. These findings confirm that RL-based control can effectively adapt to dynamic environmental conditions, thereby improving both energy efficiency and operational safety in ESS battery rooms. The proposed framework offers a practical and scalable solution for intelligent thermal management in ESS facilities.

Original languageEnglish
Article number516
JournalEnergies
Volume19
Issue number2
DOIs
StatePublished - 2026.01

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Policy Gradient (PG)
  • air conditioning system optimization
  • deep Q-Network (DQN)
  • energy storage system (ESS)
  • reinforcement learning (RL)

Fingerprint

Dive into the research topics of 'Reinforcement Learning-Based Optimization of Environmental Control Systems in Battery Energy Storage Rooms'. Together they form a unique fingerprint.

Cite this