Determining human intention in videos 1

With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of ped...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Rachel
Other Authors: Cham Tat Jen
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/166016
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-166016
record_format dspace
spelling sg-ntu-dr.10356-1660162023-04-21T15:38:41Z Determining human intention in videos 1 Tan, Rachel Cham Tat Jen School of Computer Science and Engineering ASTJCham@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of pedestrians and other vehicles within a short period. To address this problem, this paper studied the Trajectron++ model, a machine learning model which has outperformed other popular and traditional machine learning models in trajectory prediction. In an attempt to improve on the Trajectron++ model, the Long-Short-Term Memory (LSTM) Network is replaced with Transformers as Transformers are found to outperform LSTMs in other similar problems such as Natural Language Processing. Evaluation results proved that the replacement of LSTMs with Transformers result in a less accurate prediction than the original Trajectron++ model. This is deduced from the existence of a larger maximum value for the Average Displacement Error and Final Displacement Error in most datasets and a generally higher Kernel Displacement Error Negative Log-Likelihood (KDE NLL) value for the Trajectron++ with Transformers model. Notably, there was a contradiction observed in the KDE NLL values for different datasets which suggest that the modified Trajectron++ model does perform better for one particular dataset. To explain what caused this contradiction and explore possible areas of improvement to the modified Trajectron++ model, future research could be performed. Bachelor of Science in Mathematical and Computer Sciences 2023-04-18T12:59:57Z 2023-04-18T12:59:57Z 2023 Final Year Project (FYP) Tan, R. (2023). Determining human intention in videos 1. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/166016 https://hdl.handle.net/10356/166016 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Tan, Rachel
Determining human intention in videos 1
description With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of pedestrians and other vehicles within a short period. To address this problem, this paper studied the Trajectron++ model, a machine learning model which has outperformed other popular and traditional machine learning models in trajectory prediction. In an attempt to improve on the Trajectron++ model, the Long-Short-Term Memory (LSTM) Network is replaced with Transformers as Transformers are found to outperform LSTMs in other similar problems such as Natural Language Processing. Evaluation results proved that the replacement of LSTMs with Transformers result in a less accurate prediction than the original Trajectron++ model. This is deduced from the existence of a larger maximum value for the Average Displacement Error and Final Displacement Error in most datasets and a generally higher Kernel Displacement Error Negative Log-Likelihood (KDE NLL) value for the Trajectron++ with Transformers model. Notably, there was a contradiction observed in the KDE NLL values for different datasets which suggest that the modified Trajectron++ model does perform better for one particular dataset. To explain what caused this contradiction and explore possible areas of improvement to the modified Trajectron++ model, future research could be performed.
author2 Cham Tat Jen
author_facet Cham Tat Jen
Tan, Rachel
format Final Year Project
author Tan, Rachel
author_sort Tan, Rachel
title Determining human intention in videos 1
title_short Determining human intention in videos 1
title_full Determining human intention in videos 1
title_fullStr Determining human intention in videos 1
title_full_unstemmed Determining human intention in videos 1
title_sort determining human intention in videos 1
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/166016
_version_ 1764208135443054592