Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions

Nowadays, multiagent reinforcement learning (MARL) have shared significant advances in the adaptive traffic signal control (ATSC) problems. For most of the researches, agents are all isomorphic, which disregards the situation in which isomerous intersections cooperative together in a real ATSC scena...

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHANG, Chengwei, TIAN, Yu, ZHANG, Zhibin, XUE, Wanli, XIE, Xiaofei, YANG, Tianpei, GE, Xin, CHEN, Rong
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	Correlation Games Training Markov processes Epidemics Roads Electronic mail Multi-agent learning cooperative Markov game independent reinforcement learning adaptive traffic signal control Artificial Intelligence and Robotics Theory and Algorithms Transportation
Online Access:	https://ink.library.smu.edu.sg/sis_research/9852 https://ink.library.smu.edu.sg/context/sis_research/article/10852/viewcontent/T_ITS_NeighbourhoodCooperativeMultiagent_av.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10852
record_format	dspace
spelling	sg-smu-ink.sis_research-108522024-12-24T03:21:09Z Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions ZHANG, Chengwei TIAN, Yu ZHANG, Zhibin XUE, Wanli XIE, Xiaofei YANG, Tianpei GE, Xin CHEN, Rong Nowadays, multiagent reinforcement learning (MARL) have shared significant advances in the adaptive traffic signal control (ATSC) problems. For most of the researches, agents are all isomorphic, which disregards the situation in which isomerous intersections cooperative together in a real ATSC scenario, especially in epidemic regions where different intersections have quite different levels of importance. To this end, this paper models the ATSC problem as a networked Markov game (NMG), in which agents take into account information, including traffic conditions of it and its connected neighbors. A cooperative MARL framework named neighborhood cooperative hysteretic DQN (NC-HDQN) is proposed. Specifically, for each NC-HDQN agent in the NMG, first, the framework analyses correlation degrees with their connected neighbors and weighs observations and rewards by these correlations. Second, NC-HDQN agents independently optimize their strategies on the weighted information using hysteretic DQN (HDQN), which is designed to learn optimal joint strategies in cooperative multiagent games. Third, a rule-based NC-HDQN method and a Pearson correlation coefficient based NC-HDQN method, i.e., empirical NC-HDQN (ENC-HDQN) and Pearson NC-HDQN (PNC-HDQN), respectively, are designed. The first method maps the correlation degree between two connected agents according to vehicle numbers on roads between the two agents. In contrast, the second method uses the Pearson correlation coefficient to calculate the correlation degree adaptively. Our methods are empirically evaluated in both a synthetic scenario and two real-world traffic scenarios and give better performances in almost every standard test metric for ATSC. 2022-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9852 info:doi/10.1109/TITS.2022.3173490 https://ink.library.smu.edu.sg/context/sis_research/article/10852/viewcontent/T_ITS_NeighbourhoodCooperativeMultiagent_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Correlation Games Training Markov processes Epidemics Roads Electronic mail Multi-agent learning cooperative Markov game independent reinforcement learning adaptive traffic signal control Artificial Intelligence and Robotics Theory and Algorithms Transportation
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Correlation Games Training Markov processes Epidemics Roads Electronic mail Multi-agent learning cooperative Markov game independent reinforcement learning adaptive traffic signal control Artificial Intelligence and Robotics Theory and Algorithms Transportation
spellingShingle	Correlation Games Training Markov processes Epidemics Roads Electronic mail Multi-agent learning cooperative Markov game independent reinforcement learning adaptive traffic signal control Artificial Intelligence and Robotics Theory and Algorithms Transportation ZHANG, Chengwei TIAN, Yu ZHANG, Zhibin XUE, Wanli XIE, Xiaofei YANG, Tianpei GE, Xin CHEN, Rong Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
description	Nowadays, multiagent reinforcement learning (MARL) have shared significant advances in the adaptive traffic signal control (ATSC) problems. For most of the researches, agents are all isomorphic, which disregards the situation in which isomerous intersections cooperative together in a real ATSC scenario, especially in epidemic regions where different intersections have quite different levels of importance. To this end, this paper models the ATSC problem as a networked Markov game (NMG), in which agents take into account information, including traffic conditions of it and its connected neighbors. A cooperative MARL framework named neighborhood cooperative hysteretic DQN (NC-HDQN) is proposed. Specifically, for each NC-HDQN agent in the NMG, first, the framework analyses correlation degrees with their connected neighbors and weighs observations and rewards by these correlations. Second, NC-HDQN agents independently optimize their strategies on the weighted information using hysteretic DQN (HDQN), which is designed to learn optimal joint strategies in cooperative multiagent games. Third, a rule-based NC-HDQN method and a Pearson correlation coefficient based NC-HDQN method, i.e., empirical NC-HDQN (ENC-HDQN) and Pearson NC-HDQN (PNC-HDQN), respectively, are designed. The first method maps the correlation degree between two connected agents according to vehicle numbers on roads between the two agents. In contrast, the second method uses the Pearson correlation coefficient to calculate the correlation degree adaptively. Our methods are empirically evaluated in both a synthetic scenario and two real-world traffic scenarios and give better performances in almost every standard test metric for ATSC.
format	text
author	ZHANG, Chengwei TIAN, Yu ZHANG, Zhibin XUE, Wanli XIE, Xiaofei YANG, Tianpei GE, Xin CHEN, Rong
author_facet	ZHANG, Chengwei TIAN, Yu ZHANG, Zhibin XUE, Wanli XIE, Xiaofei YANG, Tianpei GE, Xin CHEN, Rong
author_sort	ZHANG, Chengwei
title	Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_short	Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_full	Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_fullStr	Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_full_unstemmed	Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
title_sort	neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions
publisher	Institutional Knowledge at Singapore Management University
publishDate	2022
url	https://ink.library.smu.edu.sg/sis_research/9852 https://ink.library.smu.edu.sg/context/sis_research/article/10852/viewcontent/T_ITS_NeighbourhoodCooperativeMultiagent_av.pdf
_version_	1820027799774167040

Neighborhood cooperative multiagent reinforcement learning for adaptive traffic signal control in epidemic regions

Similar Items