Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions

Nowadays, multiagent reinforcement learning (MARL) have shared significant advances in the adaptive traffic signal control (ATSC) problems. For most of the researches, agents are all isomorphic, which disregards the situation in which isomerous intersections cooperative together in a real ATSC scena...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG, Chengwei, TIAN, Yu, ZHANG, Zhibin, XUE, Wanli, XIE, Xiaofei, YANG, Tianpei, GE, Xin, CHEN, Rong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9852
https://ink.library.smu.edu.sg/context/sis_research/article/10852/viewcontent/T_ITS_NeighbourhoodCooperativeMultiagent_av.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10852
record_format dspace
spelling sg-smu-ink.sis_research-108522024-12-24T03:21:09Z Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions ZHANG, Chengwei TIAN, Yu ZHANG, Zhibin XUE, Wanli XIE, Xiaofei YANG, Tianpei GE, Xin CHEN, Rong Nowadays, multiagent reinforcement learning (MARL) have shared significant advances in the adaptive traffic signal control (ATSC) problems. For most of the researches, agents are all isomorphic, which disregards the situation in which isomerous intersections cooperative together in a real ATSC scenario, especially in epidemic regions where different intersections have quite different levels of importance. To this end, this paper models the ATSC problem as a networked Markov game (NMG), in which agents take into account information, including traffic conditions of it and its connected neighbors. A cooperative MARL framework named neighborhood cooperative hysteretic DQN (NC-HDQN) is proposed. Specifically, for each NC-HDQN agent in the NMG, first, the framework analyses correlation degrees with their connected neighbors and weighs observations and rewards by these correlations. Second, NC-HDQN agents independently optimize their strategies on the weighted information using hysteretic DQN (HDQN), which is designed to learn optimal joint strategies in cooperative multiagent games. Third, a rule-based NC-HDQN method and a Pearson correlation coefficient based NC-HDQN method, i.e., empirical NC-HDQN (ENC-HDQN) and Pearson NC-HDQN (PNC-HDQN), respectively, are designed. The first method maps the correlation degree between two connected agents according to vehicle numbers on roads between the two agents. In contrast, the second method uses the Pearson correlation coefficient to calculate the correlation degree adaptively. Our methods are empirically evaluated in both a synthetic scenario and two real-world traffic scenarios and give better performances in almost every standard test metric for ATSC. 2022-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9852 info:doi/10.1109/TITS.2022.3173490 https://ink.library.smu.edu.sg/context/sis_research/article/10852/viewcontent/T_ITS_NeighbourhoodCooperativeMultiagent_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Correlation Games Training Markov processes Epidemics Roads Electronic mail Multi-agent learning cooperative Markov game independent reinforcement learning adaptive traffic signal control Artificial Intelligence and Robotics Theory and Algorithms Transportation
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Correlation
Games
Training
Markov processes
Epidemics
Roads
Electronic mail
Multi-agent learning
cooperative Markov game
independent reinforcement learning
adaptive traffic signal control
Artificial Intelligence and Robotics
Theory and Algorithms
Transportation
spellingShingle Correlation
Games
Training
Markov processes
Epidemics
Roads
Electronic mail
Multi-agent learning
cooperative Markov game
independent reinforcement learning
adaptive traffic signal control
Artificial Intelligence and Robotics
Theory and Algorithms
Transportation
ZHANG, Chengwei
TIAN, Yu
ZHANG, Zhibin
XUE, Wanli
XIE, Xiaofei
YANG, Tianpei
GE, Xin
CHEN, Rong
Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
description Nowadays, multiagent reinforcement learning (MARL) have shared significant advances in the adaptive traffic signal control (ATSC) problems. For most of the researches, agents are all isomorphic, which disregards the situation in which isomerous intersections cooperative together in a real ATSC scenario, especially in epidemic regions where different intersections have quite different levels of importance. To this end, this paper models the ATSC problem as a networked Markov game (NMG), in which agents take into account information, including traffic conditions of it and its connected neighbors. A cooperative MARL framework named neighborhood cooperative hysteretic DQN (NC-HDQN) is proposed. Specifically, for each NC-HDQN agent in the NMG, first, the framework analyses correlation degrees with their connected neighbors and weighs observations and rewards by these correlations. Second, NC-HDQN agents independently optimize their strategies on the weighted information using hysteretic DQN (HDQN), which is designed to learn optimal joint strategies in cooperative multiagent games. Third, a rule-based NC-HDQN method and a Pearson correlation coefficient based NC-HDQN method, i.e., empirical NC-HDQN (ENC-HDQN) and Pearson NC-HDQN (PNC-HDQN), respectively, are designed. The first method maps the correlation degree between two connected agents according to vehicle numbers on roads between the two agents. In contrast, the second method uses the Pearson correlation coefficient to calculate the correlation degree adaptively. Our methods are empirically evaluated in both a synthetic scenario and two real-world traffic scenarios and give better performances in almost every standard test metric for ATSC.
format text
author ZHANG, Chengwei
TIAN, Yu
ZHANG, Zhibin
XUE, Wanli
XIE, Xiaofei
YANG, Tianpei
GE, Xin
CHEN, Rong
author_facet ZHANG, Chengwei
TIAN, Yu
ZHANG, Zhibin
XUE, Wanli
XIE, Xiaofei
YANG, Tianpei
GE, Xin
CHEN, Rong
author_sort ZHANG, Chengwei
title Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_short Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_full Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_fullStr Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_full_unstemmed Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_sort neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/9852
https://ink.library.smu.edu.sg/context/sis_research/article/10852/viewcontent/T_ITS_NeighbourhoodCooperativeMultiagent_av.pdf
_version_ 1820027799774167040