Concepedia

Publication | Closed Access

Reinforcement Learning Approach for Optimal Distributed Energy Management in a Microgrid

283

Citations

26

References

2018

Year

TLDR

Distributed energy management in microgrids involves autonomous suppliers and consumers making local profit‑maximizing decisions amid limited information and renewable generation variability. The study aims to develop a reinforcement‑learning algorithm that enables generation units, storage, and consumers to devise optimal energy‑management strategies without prior knowledge of each other or the microgrid. Using a multiagent model, the authors implement a reinforcement‑learning framework that trains agents to coordinate load scheduling and resource dispatch in the microgrid. Case studies demonstrate that the agents’ strategies converge to a Nash equilibrium, improving overall performance for all participants.

Abstract

In this paper, a multiagent-based model is used to study distributed energy management in a microgrid (MG). The suppliers and consumers of electricity are modeled as autonomous agents, capable of making local decisions in order to maximize their own profit in a multiagent environment. For every supplier, a lack of information about customers and other suppliers creates challenges to optimal decision making in order to maximize its return. Similarly, customers face difficulty in scheduling their energy consumption without any information about suppliers and electricity prices. Additionally, there are several uncertainties involved in the nature of MGs due to variability in renewable generation output power and continuous fluctuation of customers' consumption. In order to prevail over these challenges, a reinforcement learning algorithm was developed to allow generation resources, distributed storages, and customers to develop optimal strategies for energy management and load scheduling without prior information about each other and the MG system. Case studies are provided to show how the overall performance of all entities converges as an emergent behavior to the Nash equilibrium, benefiting all agents.

References

YearCitations

Page 1