Towards Safe Model-Free Building Energy Management using Masked Reinforcement Learning

Sharath Ram Kumar*, Rémy Rigo-Mariani*, Benoit Delinchant*, Arvind Easwaran

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Autonomous control of building energy resources including HVAC and battery storage systems has the potential to optimize operations and achieve objectives such as cost minimization. Existing approaches either require an explicit mathematical model of the building, or resort to simple rule-based controls (RBC) which may be sub-optimal. Model-free reinforcement learning (RL) is a promising method to overcome these limitations - however, it often requires a large number of interactions with the real environment before learning a functional policy. In this work, we investigate’Action Masking’, a technique to improve the learning efficiency of RL algorithms while respecting safety rules during the learning phase. Our solution achieves a cost reduction of 6% compared to a baseline rule-based controller, and also outperforms a popular transfer learning strategy. This suggests that model-free RL approaches are feasible and practical for problems in this domain.

Original languageEnglish
Title of host publicationProceedings of 2023 IEEE PES Innovative Smart Grid Technologies Europe, ISGT EUROPE 2023
PublisherIEEE Computer Society
ISBN (Electronic)9798350396782
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event2023 IEEE PES Innovative Smart Grid Technologies Europe, ISGT EUROPE 2023 - Grenoble, France
Duration: Oct 23 2023Oct 26 2023

Publication series

NameIEEE PES Innovative Smart Grid Technologies Conference Europe

Conference

Conference2023 IEEE PES Innovative Smart Grid Technologies Europe, ISGT EUROPE 2023
Country/TerritoryFrance
CityGrenoble
Period10/23/2310/26/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

ASJC Scopus Subject Areas

  • Computer Networks and Communications
  • Information Systems

Fingerprint

Dive into the research topics of 'Towards Safe Model-Free Building Energy Management using Masked Reinforcement Learning'. Together they form a unique fingerprint.

Cite this