Automatic speech recognition and chat bot for air traffic control

Artificial Intelligence (AI) has demonstrated the ability to manage complex processes highly effectively and thus is widely seen as a key component in future airport ATM systems. Future AI tools for ATMs will rely on digital data, such as surveillance, radar, weather, and flight plans, for the...

Full description

Saved in:
Bibliographic Details
Main Author: Low, Ashton Kin Yun
Other Authors: Sameer Alam
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/177842
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-177842
record_format dspace
spelling sg-ntu-dr.10356-1778422024-05-31T12:34:43Z Automatic speech recognition and chat bot for air traffic control Low, Ashton Kin Yun Sameer Alam School of Mechanical and Aerospace Engineering sameeralam@ntu.edu.sg Engineering Automatic speech recognition Artificial Intelligence (AI) has demonstrated the ability to manage complex processes highly effectively and thus is widely seen as a key component in future airport ATM systems. Future AI tools for ATMs will rely on digital data, such as surveillance, radar, weather, and flight plans, for their operation. However, the foundational Air Traffic Control Officer (ATCo)-pilot communication medium is voice, which is a vital source of situational data. Controller Pilot Data Link Communications (CPDLC) has been developed as an alternative, text-based communication delivery method, however, ATCo-pilot communications will not be completely transitioned to this framework in the near-term future. Moreover, as CPDLC is a one-to-one communication paradigm, the additional situational awareness of other traffic provided by traditional party-line VHF communications is potentially lost. Therefore, an automated speech-to-text translation tool can be seen as a missing link, enabling traditional ATCo-pilot voice communications to be automatically translated and input into a datalink system such as CPDLC. To this end this paper presents a Machine Learning (ML) based Automatic Speech Recognition (ASR) framework that is able to accurately translate VHF-quality ATCo-pilot speech communication to text, achieving a Word Error Rate of only 6.13%. Moreover, the presented model is able to extract crucial information with an accuracy and F1-score of 95.2% and 90.5% respectively. A detailed design of the framework is provided to enable its replication by the wider research community. Bachelor's degree 2024-05-31T12:34:42Z 2024-05-31T12:34:42Z 2024 Final Year Project (FYP) Low, A. K. Y. (2024). Automatic speech recognition and chat bot for air traffic control. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/177842 https://hdl.handle.net/10356/177842 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Automatic speech recognition
spellingShingle Engineering
Automatic speech recognition
Low, Ashton Kin Yun
Automatic speech recognition and chat bot for air traffic control
description Artificial Intelligence (AI) has demonstrated the ability to manage complex processes highly effectively and thus is widely seen as a key component in future airport ATM systems. Future AI tools for ATMs will rely on digital data, such as surveillance, radar, weather, and flight plans, for their operation. However, the foundational Air Traffic Control Officer (ATCo)-pilot communication medium is voice, which is a vital source of situational data. Controller Pilot Data Link Communications (CPDLC) has been developed as an alternative, text-based communication delivery method, however, ATCo-pilot communications will not be completely transitioned to this framework in the near-term future. Moreover, as CPDLC is a one-to-one communication paradigm, the additional situational awareness of other traffic provided by traditional party-line VHF communications is potentially lost. Therefore, an automated speech-to-text translation tool can be seen as a missing link, enabling traditional ATCo-pilot voice communications to be automatically translated and input into a datalink system such as CPDLC. To this end this paper presents a Machine Learning (ML) based Automatic Speech Recognition (ASR) framework that is able to accurately translate VHF-quality ATCo-pilot speech communication to text, achieving a Word Error Rate of only 6.13%. Moreover, the presented model is able to extract crucial information with an accuracy and F1-score of 95.2% and 90.5% respectively. A detailed design of the framework is provided to enable its replication by the wider research community.
author2 Sameer Alam
author_facet Sameer Alam
Low, Ashton Kin Yun
format Final Year Project
author Low, Ashton Kin Yun
author_sort Low, Ashton Kin Yun
title Automatic speech recognition and chat bot for air traffic control
title_short Automatic speech recognition and chat bot for air traffic control
title_full Automatic speech recognition and chat bot for air traffic control
title_fullStr Automatic speech recognition and chat bot for air traffic control
title_full_unstemmed Automatic speech recognition and chat bot for air traffic control
title_sort automatic speech recognition and chat bot for air traffic control
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/177842
_version_ 1800916392896102400