STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL

Traffic congestion has been known notoriously to cause severe losses in various sectors. One of its primary causes is conflicting vehicle flows at intersections, where traffic light control was implemented to solve these conflicts. Recent developments in machine learning, especially reinforcement le...

Full description

Saved in:

Bibliographic Details
Main Author:	Junaedi, Dandi
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/65288
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:65288
spelling	id-itb.:652882022-06-22T08:49:56ZSTUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL Junaedi, Dandi Indonesia Final Project traffic congestion, traffic light control, coordination, reinforcement learning, Cooperative Double Q-learning, vehicle’s travel time, vehicle throughput INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/65288 Traffic congestion has been known notoriously to cause severe losses in various sectors. One of its primary causes is conflicting vehicle flows at intersections, where traffic light control was implemented to solve these conflicts. Recent developments in machine learning, especially reinforcement learning (RL), have shown a stupendous ability to learn and solve problems in complex models. This potential can be applied to traffic light control to help manage large-scale traffic control in a coordinated manner. This research proposed the utilization of mean-field theory in RL to enhance the learning process by sharing parameters’ information between neighboring agents to improve coordination between intersections with Cooperative Double Q-Learning (Co-DQL) algorithm. This research was conducted on an area of Central Jakarta by replicating its network and traffic conditions on traffic simulators, VISSIM and SUMO. Sydney Coordinated Adaptive Traffic System (SCATS) was implemented in VISSIM to simulate the traffic conditions that the chosen algorithms will face. Co-DQL was implemented in SUMO and was compared to other RL algorithms (Deep Deterministic Policy Gradient (DDPG) and Deep Q-learning (DQN)) and conventional algorithms (Max-pressure (MP), Uniform, and Webster’s). From the comparison of the vehicles’ travel time and vehicle throughput, the simulations show that DDPG, Uniform, and Webster’s have better performance than the remaining algorithms (over 40.000 vehicle throughput and small distribution of vehicles that has over 25.000 seconds of travel time). These algorithms that determine the change in green light’s duration for the next cycle as the control action are performing better than the remaining algorithms that adaptively change the phases of traffic lights. This type of action is almost boundless in its decision to change phases. Therefore, if a bottleneck in vehicle flow happens, the traffic light can hold its phase for an uncertain amount of time. This result shows that city-scale traffic control requires more developments to achieve better results, especially on the network model and the control action type. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	Traffic congestion has been known notoriously to cause severe losses in various sectors. One of its primary causes is conflicting vehicle flows at intersections, where traffic light control was implemented to solve these conflicts. Recent developments in machine learning, especially reinforcement learning (RL), have shown a stupendous ability to learn and solve problems in complex models. This potential can be applied to traffic light control to help manage large-scale traffic control in a coordinated manner. This research proposed the utilization of mean-field theory in RL to enhance the learning process by sharing parameters’ information between neighboring agents to improve coordination between intersections with Cooperative Double Q-Learning (Co-DQL) algorithm. This research was conducted on an area of Central Jakarta by replicating its network and traffic conditions on traffic simulators, VISSIM and SUMO. Sydney Coordinated Adaptive Traffic System (SCATS) was implemented in VISSIM to simulate the traffic conditions that the chosen algorithms will face. Co-DQL was implemented in SUMO and was compared to other RL algorithms (Deep Deterministic Policy Gradient (DDPG) and Deep Q-learning (DQN)) and conventional algorithms (Max-pressure (MP), Uniform, and Webster’s). From the comparison of the vehicles’ travel time and vehicle throughput, the simulations show that DDPG, Uniform, and Webster’s have better performance than the remaining algorithms (over 40.000 vehicle throughput and small distribution of vehicles that has over 25.000 seconds of travel time). These algorithms that determine the change in green light’s duration for the next cycle as the control action are performing better than the remaining algorithms that adaptively change the phases of traffic lights. This type of action is almost boundless in its decision to change phases. Therefore, if a bottleneck in vehicle flow happens, the traffic light can hold its phase for an uncertain amount of time. This result shows that city-scale traffic control requires more developments to achieve better results, especially on the network model and the control action type.
format	Final Project
author	Junaedi, Dandi
spellingShingle	Junaedi, Dandi STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL
author_facet	Junaedi, Dandi
author_sort	Junaedi, Dandi
title	STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL
title_short	STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL
title_full	STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL
title_fullStr	STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL
title_full_unstemmed	STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL
title_sort	study in implementation of reinforcement learning for coordinated traffic light control in multi-intersection road network model
url	https://digilib.itb.ac.id/gdl/view/65288
_version_	1823648070691192832

STUDY IN IMPLEMENTATION OF REINFORCEMENT LEARNING FOR COORDINATED TRAFFIC LIGHT CONTROL IN MULTI-INTERSECTION ROAD NETWORK MODEL

Similar Items