A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems
Multiple microgrids can be interconnected to form a networked-microgrid (NMG) system. In this paper, a data-driven decentralized economic frequency control method is proposed for isolated NMG systems. Based on multi-agent deep reinforcement learning (MA-DRL) framework, each DRL agent controls the ge...
Saved in:
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/170509 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-170509 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1705092023-09-18T00:50:36Z A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems Xia, Yang Xu, Yan Wang, Yu Mondal, Suman Dasgupta, Souvik Gupta, Amit K. Gupta, Gaurav M. School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Networked-Microgrid Economic Frequency Control Multiple microgrids can be interconnected to form a networked-microgrid (NMG) system. In this paper, a data-driven decentralized economic frequency control method is proposed for isolated NMG systems. Based on multi-agent deep reinforcement learning (MA-DRL) framework, each DRL agent controls the generator and energy storage system (ESS) in each microgrid of the NMG. For offline training stage, the optimal control strategy is learned by soft actor-critic (SAC) algorithm with a global reward, which aims to restore system frequency while considering economy. Besides, to satisfy system constraints and avoid high learning costs, a safety model scheme is designed and trained to support each DRL agent. Since the agents are trained in the centralized learning process at offline stage, they are able to coordinate in a decentralized manner for online application, which only requires local information to generate optimal control actions. Also, the trained safety model can be applied for online stage to monitor and guide online actions. Finally, numerical tests are conducted to demonstrate the feasibility and effectiveness of the proposed method under the variation of renewable generation and load demand. Nanyang Technological University This work was supported by the RR@NTU Corporate Lab Phase II, Nanyang Technological University, Singapore. Paper no. TSTE-00057-2022. 2023-09-18T00:50:36Z 2023-09-18T00:50:36Z 2022 Journal Article Xia, Y., Xu, Y., Wang, Y., Mondal, S., Dasgupta, S., Gupta, A. K. & Gupta, G. M. (2022). A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems. IEEE Transactions On Sustainable Energy, 13(4), 1982-1993. https://dx.doi.org/10.1109/TSTE.2022.3178415 1949-3029 https://hdl.handle.net/10356/170509 10.1109/TSTE.2022.3178415 2-s2.0-85139500095 4 13 1982 1993 en IEEE Transactions on Sustainable Energy © 2022 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Networked-Microgrid Economic Frequency Control |
spellingShingle |
Engineering::Electrical and electronic engineering Networked-Microgrid Economic Frequency Control Xia, Yang Xu, Yan Wang, Yu Mondal, Suman Dasgupta, Souvik Gupta, Amit K. Gupta, Gaurav M. A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems |
description |
Multiple microgrids can be interconnected to form a networked-microgrid (NMG) system. In this paper, a data-driven decentralized economic frequency control method is proposed for isolated NMG systems. Based on multi-agent deep reinforcement learning (MA-DRL) framework, each DRL agent controls the generator and energy storage system (ESS) in each microgrid of the NMG. For offline training stage, the optimal control strategy is learned by soft actor-critic (SAC) algorithm with a global reward, which aims to restore system frequency while considering economy. Besides, to satisfy system constraints and avoid high learning costs, a safety model scheme is designed and trained to support each DRL agent. Since the agents are trained in the centralized learning process at offline stage, they are able to coordinate in a decentralized manner for online application, which only requires local information to generate optimal control actions. Also, the trained safety model can be applied for online stage to monitor and guide online actions. Finally, numerical tests are conducted to demonstrate the feasibility and effectiveness of the proposed method under the variation of renewable generation and load demand. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Xia, Yang Xu, Yan Wang, Yu Mondal, Suman Dasgupta, Souvik Gupta, Amit K. Gupta, Gaurav M. |
format |
Article |
author |
Xia, Yang Xu, Yan Wang, Yu Mondal, Suman Dasgupta, Souvik Gupta, Amit K. Gupta, Gaurav M. |
author_sort |
Xia, Yang |
title |
A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems |
title_short |
A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems |
title_full |
A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems |
title_fullStr |
A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems |
title_full_unstemmed |
A safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems |
title_sort |
safe policy learning-based method for decentralized and economic frequency control in isolated networked-microgrid systems |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/170509 |
_version_ |
1779156303065645056 |