A multi-agent reinforcement learning approach for flight speed control systems

With increasing air traffic, there is an ever-growing need for Air Traffic Controllers (ATCO) to efficiently manage air traffic and congestion. Congestion often leads to increase in delays in the Terminal Maneuvering Area (TMA), which is one of the primary challenges that is being faced by ATCO. Int...

Full description

Saved in:
Bibliographic Details
Main Author: Kanupriya, Malhotra
Other Authors: Sameer Alam
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/159164
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-159164
record_format dspace
spelling sg-ntu-dr.10356-1591642023-03-04T20:11:55Z A multi-agent reinforcement learning approach for flight speed control systems Kanupriya, Malhotra Sameer Alam School of Mechanical and Aerospace Engineering Air Traffic Management Research Institute sameeralam@ntu.edu.sg Engineering::Aeronautical engineering::Air navigation Engineering::Computer science and engineering::Computing methodologies::Simulation and modeling With increasing air traffic, there is an ever-growing need for Air Traffic Controllers (ATCO) to efficiently manage air traffic and congestion. Congestion often leads to increase in delays in the Terminal Maneuvering Area (TMA), which is one of the primary challenges that is being faced by ATCO. Introduction of approaches such as the Extended Arrival Manager (E-AMAN) propose solutions whereby flights are scheduled early, much before they enter the TMA, to absorb flight delays in the TMA that could potentially cause large amounts of fuel burn and have detrimental environmental impacts. However, for practical implications of such an approach, a speed management system is required, whereby, flights can coordinate to effectively absorb system-level delays in their en-route phase. This report proposes a Multi-Agent System (MAS) approach using Deep Reinforcement Learning to model and train flights as agents which can coordinate with each other to effectively absorb system-level delays. The simulations use MultiAgent POsthumous Credit Assignment (MA-POCA) in Unity to model flights as agents that are part of a cooperative MAS. Two reward approaches have been compared, where flights are either provided with complete penalty for exceeding the required delay, or partial penalty for exceeding the required delay. Findings highlight the percentage of maximum delay absorbed by each flight, along with the distribution of flight delay absorption in the flight plan. Initial testings reveal an average of 3.3 minutes of system-level delay absorptions from a required 4 minutes of delay. Bachelor of Engineering (Mechanical Engineering) 2022-06-10T12:51:09Z 2022-06-10T12:51:09Z 2022 Final Year Project (FYP) Kanupriya, M. (2022). A multi-agent reinforcement learning approach for flight speed control systems. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159164 https://hdl.handle.net/10356/159164 en C053 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Aeronautical engineering::Air navigation
Engineering::Computer science and engineering::Computing methodologies::Simulation and modeling
spellingShingle Engineering::Aeronautical engineering::Air navigation
Engineering::Computer science and engineering::Computing methodologies::Simulation and modeling
Kanupriya, Malhotra
A multi-agent reinforcement learning approach for flight speed control systems
description With increasing air traffic, there is an ever-growing need for Air Traffic Controllers (ATCO) to efficiently manage air traffic and congestion. Congestion often leads to increase in delays in the Terminal Maneuvering Area (TMA), which is one of the primary challenges that is being faced by ATCO. Introduction of approaches such as the Extended Arrival Manager (E-AMAN) propose solutions whereby flights are scheduled early, much before they enter the TMA, to absorb flight delays in the TMA that could potentially cause large amounts of fuel burn and have detrimental environmental impacts. However, for practical implications of such an approach, a speed management system is required, whereby, flights can coordinate to effectively absorb system-level delays in their en-route phase. This report proposes a Multi-Agent System (MAS) approach using Deep Reinforcement Learning to model and train flights as agents which can coordinate with each other to effectively absorb system-level delays. The simulations use MultiAgent POsthumous Credit Assignment (MA-POCA) in Unity to model flights as agents that are part of a cooperative MAS. Two reward approaches have been compared, where flights are either provided with complete penalty for exceeding the required delay, or partial penalty for exceeding the required delay. Findings highlight the percentage of maximum delay absorbed by each flight, along with the distribution of flight delay absorption in the flight plan. Initial testings reveal an average of 3.3 minutes of system-level delay absorptions from a required 4 minutes of delay.
author2 Sameer Alam
author_facet Sameer Alam
Kanupriya, Malhotra
format Final Year Project
author Kanupriya, Malhotra
author_sort Kanupriya, Malhotra
title A multi-agent reinforcement learning approach for flight speed control systems
title_short A multi-agent reinforcement learning approach for flight speed control systems
title_full A multi-agent reinforcement learning approach for flight speed control systems
title_fullStr A multi-agent reinforcement learning approach for flight speed control systems
title_full_unstemmed A multi-agent reinforcement learning approach for flight speed control systems
title_sort multi-agent reinforcement learning approach for flight speed control systems
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/159164
_version_ 1759854511220850688