A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems

Multiple microgrids can be interconnected to form a networked-microgrid (NMG) system. In this paper, a data-driven decentralized economic frequency control method is proposed for isolated NMG systems. Based on multi-agent deep reinforcement learning (MA-DRL) framework, each DRL agent controls the ge...

Full description

Saved in:
Bibliographic Details
Main Authors: Xia, Yang, Xu, Yan, Wang, Yu, Mondal, Suman, Dasgupta, Souvik, Gupta, Amit K., Gupta, Gaurav M.
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/170509
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Multiple microgrids can be interconnected to form a networked-microgrid (NMG) system. In this paper, a data-driven decentralized economic frequency control method is proposed for isolated NMG systems. Based on multi-agent deep reinforcement learning (MA-DRL) framework, each DRL agent controls the generator and energy storage system (ESS) in each microgrid of the NMG. For offline training stage, the optimal control strategy is learned by soft actor-critic (SAC) algorithm with a global reward, which aims to restore system frequency while considering economy. Besides, to satisfy system constraints and avoid high learning costs, a safety model scheme is designed and trained to support each DRL agent. Since the agents are trained in the centralized learning process at offline stage, they are able to coordinate in a decentralized manner for online application, which only requires local information to generate optimal control actions. Also, the trained safety model can be applied for online stage to monitor and guide online actions. Finally, numerical tests are conducted to demonstrate the feasibility and effectiveness of the proposed method under the variation of renewable generation and load demand.