Automatic speech recognition and chat bot for air traffic control
Artificial Intelligence (AI) has demonstrated the ability to manage complex processes highly effectively and thus is widely seen as a key component in future airport ATM systems. Future AI tools for ATMs will rely on digital data, such as surveillance, radar, weather, and flight plans, for the...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/177842 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-177842 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1778422024-05-31T12:34:43Z Automatic speech recognition and chat bot for air traffic control Low, Ashton Kin Yun Sameer Alam School of Mechanical and Aerospace Engineering sameeralam@ntu.edu.sg Engineering Automatic speech recognition Artificial Intelligence (AI) has demonstrated the ability to manage complex processes highly effectively and thus is widely seen as a key component in future airport ATM systems. Future AI tools for ATMs will rely on digital data, such as surveillance, radar, weather, and flight plans, for their operation. However, the foundational Air Traffic Control Officer (ATCo)-pilot communication medium is voice, which is a vital source of situational data. Controller Pilot Data Link Communications (CPDLC) has been developed as an alternative, text-based communication delivery method, however, ATCo-pilot communications will not be completely transitioned to this framework in the near-term future. Moreover, as CPDLC is a one-to-one communication paradigm, the additional situational awareness of other traffic provided by traditional party-line VHF communications is potentially lost. Therefore, an automated speech-to-text translation tool can be seen as a missing link, enabling traditional ATCo-pilot voice communications to be automatically translated and input into a datalink system such as CPDLC. To this end this paper presents a Machine Learning (ML) based Automatic Speech Recognition (ASR) framework that is able to accurately translate VHF-quality ATCo-pilot speech communication to text, achieving a Word Error Rate of only 6.13%. Moreover, the presented model is able to extract crucial information with an accuracy and F1-score of 95.2% and 90.5% respectively. A detailed design of the framework is provided to enable its replication by the wider research community. Bachelor's degree 2024-05-31T12:34:42Z 2024-05-31T12:34:42Z 2024 Final Year Project (FYP) Low, A. K. Y. (2024). Automatic speech recognition and chat bot for air traffic control. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/177842 https://hdl.handle.net/10356/177842 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering Automatic speech recognition |
spellingShingle |
Engineering Automatic speech recognition Low, Ashton Kin Yun Automatic speech recognition and chat bot for air traffic control |
description |
Artificial Intelligence (AI) has demonstrated the ability to manage complex processes
highly effectively and thus is widely seen as a key component in future airport ATM
systems. Future AI tools for ATMs will rely on digital data, such as surveillance, radar,
weather, and flight plans, for their operation. However, the foundational Air Traffic
Control Officer (ATCo)-pilot communication medium is voice, which is a vital source
of situational data. Controller Pilot Data Link Communications (CPDLC) has been
developed as an alternative, text-based communication delivery method, however,
ATCo-pilot communications will not be completely transitioned to this framework in
the near-term future. Moreover, as CPDLC is a one-to-one communication paradigm,
the additional situational awareness of other traffic provided by traditional party-line
VHF communications is potentially lost. Therefore, an automated speech-to-text
translation tool can be seen as a missing link, enabling traditional ATCo-pilot voice
communications to be automatically translated and input into a datalink system such
as CPDLC. To this end this paper presents a Machine Learning (ML) based Automatic
Speech Recognition (ASR) framework that is able to accurately translate VHF-quality
ATCo-pilot speech communication to text, achieving a Word Error Rate of only
6.13%. Moreover, the presented model is able to extract crucial information with an
accuracy and F1-score of 95.2% and 90.5% respectively. A detailed design of the
framework is provided to enable its replication by the wider research community. |
author2 |
Sameer Alam |
author_facet |
Sameer Alam Low, Ashton Kin Yun |
format |
Final Year Project |
author |
Low, Ashton Kin Yun |
author_sort |
Low, Ashton Kin Yun |
title |
Automatic speech recognition and chat bot for air traffic control |
title_short |
Automatic speech recognition and chat bot for air traffic control |
title_full |
Automatic speech recognition and chat bot for air traffic control |
title_fullStr |
Automatic speech recognition and chat bot for air traffic control |
title_full_unstemmed |
Automatic speech recognition and chat bot for air traffic control |
title_sort |
automatic speech recognition and chat bot for air traffic control |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/177842 |
_version_ |
1800916392896102400 |