Conference paper
Addressing partial observability in reinforcement learning for energy management
1. Department of Technology, Management and Economics, Technical University of Denmark
2. Sustainability, Department of Technology, Management and Economics, Technical University of Denmark
3. Energy Economics and System Analysis, Sustainability, Society and Economics, Department of Technology, Management and Economics, Technical University of Denmark
4. Energy Systems Analysis, Sustainability, Department of Technology, Management and Economics, Technical University of Denmark
5. Northumbria University
6. Norwegian University of Science and Technology
Automatic control of energy systems is affected by uncertainty in multiple factors, including weather, prices and human activities. The literature relies on Markov-based control, which takes into account only the current state. This limits control performance, since previous states provide additional context for decision making.
We present two ways to learn non-Markovian policies, based on recurrent neural networks and variational inference. We evaluate both methods on a simulated data centre HVAC control task. The results show that the off-policy stochastic latent actor-critic algorithm can, within three months of training and without prior knowledge, maintain the temperature in the predefined range while reducing energy consumption by more than 5% compared to Markovian policies.
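To make the idea of a non-Markovian policy concrete, the sketch below conditions the action on the observation history through a recurrent network instead of on the current state alone. This is a minimal illustration only, not the paper's implementation: the class name, observation/action dimensions, network widths and the PyTorch framework choice are assumptions made for the example.

```python
# Minimal sketch (assumed setup, not the paper's code): a recurrent policy that
# summarises the observation history, addressing partial observability.
import torch
import torch.nn as nn

class RecurrentPolicy(nn.Module):
    def __init__(self, obs_dim: int, act_dim: int, hidden_dim: int = 128):
        super().__init__()
        # LSTM compresses past observations into a belief-like hidden state
        self.lstm = nn.LSTM(obs_dim, hidden_dim, batch_first=True)
        # Head maps the hidden state to a bounded continuous action (e.g. HVAC setpoints)
        self.head = nn.Sequential(
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, act_dim), nn.Tanh(),
        )

    def forward(self, obs_seq, hidden=None):
        # obs_seq: (batch, time, obs_dim) -- the history a Markovian policy would discard
        out, hidden = self.lstm(obs_seq, hidden)
        return self.head(out[:, -1]), hidden  # action from the latest hidden state

# Usage: step through observations one at a time, carrying the hidden state forward.
policy = RecurrentPolicy(obs_dim=8, act_dim=2)
hidden = None
obs = torch.zeros(1, 1, 8)  # placeholder observation (e.g. temperatures, prices, time features)
action, hidden = policy(obs, hidden)
```

The variational-inference alternative discussed in the paper (stochastic latent actor-critic) would instead learn a latent state-space model and act on the inferred latent state; the recurrent encoder above is only the simpler of the two approaches.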
| Field | Value |
|---|---|
| Language | English |
| Publisher | ACM |
| Year | 2021 |
| Pages | 324-328 |
| Types | Conference paper |
| DOI | 10.1145/3486611.3488730 |
| ORCIDs | Biemann, Marco and Liu, Xiufeng |
Keywords: Computing methodologies; HVAC control; Learning paradigms; Machine learning; Markov processes; Mathematics of computing; POMDP; Probabilistic reasoning algorithms; Probability and statistics; Reinforcement learning; Stochastic processes; Variational methods; energy management; recurrent neural networks; reinforcement learning; variational inference