Determining human intention in videos 1
With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of ped...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/166016 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-166016 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1660162023-04-21T15:38:41Z Determining human intention in videos 1 Tan, Rachel Cham Tat Jen School of Computer Science and Engineering ASTJCham@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of pedestrians and other vehicles within a short period. To address this problem, this paper studied the Trajectron++ model, a machine learning model which has outperformed other popular and traditional machine learning models in trajectory prediction. In an attempt to improve on the Trajectron++ model, the Long-Short-Term Memory (LSTM) Network is replaced with Transformers as Transformers are found to outperform LSTMs in other similar problems such as Natural Language Processing. Evaluation results proved that the replacement of LSTMs with Transformers result in a less accurate prediction than the original Trajectron++ model. This is deduced from the existence of a larger maximum value for the Average Displacement Error and Final Displacement Error in most datasets and a generally higher Kernel Displacement Error Negative Log-Likelihood (KDE NLL) value for the Trajectron++ with Transformers model. Notably, there was a contradiction observed in the KDE NLL values for different datasets which suggest that the modified Trajectron++ model does perform better for one particular dataset. To explain what caused this contradiction and explore possible areas of improvement to the modified Trajectron++ model, future research could be performed. Bachelor of Science in Mathematical and Computer Sciences 2023-04-18T12:59:57Z 2023-04-18T12:59:57Z 2023 Final Year Project (FYP) Tan, R. (2023). Determining human intention in videos 1. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/166016 https://hdl.handle.net/10356/166016 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Tan, Rachel Determining human intention in videos 1 |
description |
With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of pedestrians and other vehicles within a short period. To address this problem, this paper studied the Trajectron++ model, a machine learning model which has outperformed other popular and traditional machine learning models in trajectory prediction. In an attempt to improve on the Trajectron++ model, the Long-Short-Term Memory (LSTM) Network is replaced with Transformers as Transformers are found to outperform LSTMs in other similar problems such as Natural Language Processing. Evaluation results proved that the replacement of LSTMs with Transformers result in a less accurate prediction than the original Trajectron++ model. This is deduced from the existence of a larger maximum value for the Average Displacement Error and Final Displacement Error in most datasets and a generally higher Kernel Displacement Error Negative Log-Likelihood (KDE NLL) value for the Trajectron++ with Transformers model. Notably, there was a contradiction observed in the KDE NLL values for different datasets which suggest that the modified Trajectron++ model does perform better for one particular dataset. To explain what caused this contradiction and explore possible areas of improvement to the modified Trajectron++ model, future research could be performed. |
author2 |
Cham Tat Jen |
author_facet |
Cham Tat Jen Tan, Rachel |
format |
Final Year Project |
author |
Tan, Rachel |
author_sort |
Tan, Rachel |
title |
Determining human intention in videos 1 |
title_short |
Determining human intention in videos 1 |
title_full |
Determining human intention in videos 1 |
title_fullStr |
Determining human intention in videos 1 |
title_full_unstemmed |
Determining human intention in videos 1 |
title_sort |
determining human intention in videos 1 |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/166016 |
_version_ |
1764208135443054592 |