Determining human intention in videos 1

With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of ped...

Full description

Saved in:

Bibliographic Details
Main Author:	Tan, Rachel
Other Authors:	Cham Tat Jen
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2023
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/166016
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-166016
record_format	dspace
spelling	sg-ntu-dr.10356-1660162023-04-21T15:38:41Z Determining human intention in videos 1 Tan, Rachel Cham Tat Jen School of Computer Science and Engineering ASTJCham@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of pedestrians and other vehicles within a short period. To address this problem, this paper studied the Trajectron++ model, a machine learning model which has outperformed other popular and traditional machine learning models in trajectory prediction. In an attempt to improve on the Trajectron++ model, the Long-Short-Term Memory (LSTM) Network is replaced with Transformers as Transformers are found to outperform LSTMs in other similar problems such as Natural Language Processing. Evaluation results proved that the replacement of LSTMs with Transformers result in a less accurate prediction than the original Trajectron++ model. This is deduced from the existence of a larger maximum value for the Average Displacement Error and Final Displacement Error in most datasets and a generally higher Kernel Displacement Error Negative Log-Likelihood (KDE NLL) value for the Trajectron++ with Transformers model. Notably, there was a contradiction observed in the KDE NLL values for different datasets which suggest that the modified Trajectron++ model does perform better for one particular dataset. To explain what caused this contradiction and explore possible areas of improvement to the modified Trajectron++ model, future research could be performed. Bachelor of Science in Mathematical and Computer Sciences 2023-04-18T12:59:57Z 2023-04-18T12:59:57Z 2023 Final Year Project (FYP) Tan, R. (2023). Determining human intention in videos 1. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/166016 https://hdl.handle.net/10356/166016 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Tan, Rachel Determining human intention in videos 1
description	With the recent heightened interest in autonomous vehicles in consumer markets, there are many attempts to reduce the accident rates for both semi-autonomous and fully-autonomous vehicles. One such attempt is in developing the ability of the vehicle to accurately predict the future trajectory of pedestrians and other vehicles within a short period. To address this problem, this paper studied the Trajectron++ model, a machine learning model which has outperformed other popular and traditional machine learning models in trajectory prediction. In an attempt to improve on the Trajectron++ model, the Long-Short-Term Memory (LSTM) Network is replaced with Transformers as Transformers are found to outperform LSTMs in other similar problems such as Natural Language Processing. Evaluation results proved that the replacement of LSTMs with Transformers result in a less accurate prediction than the original Trajectron++ model. This is deduced from the existence of a larger maximum value for the Average Displacement Error and Final Displacement Error in most datasets and a generally higher Kernel Displacement Error Negative Log-Likelihood (KDE NLL) value for the Trajectron++ with Transformers model. Notably, there was a contradiction observed in the KDE NLL values for different datasets which suggest that the modified Trajectron++ model does perform better for one particular dataset. To explain what caused this contradiction and explore possible areas of improvement to the modified Trajectron++ model, future research could be performed.
author2	Cham Tat Jen
author_facet	Cham Tat Jen Tan, Rachel
format	Final Year Project
author	Tan, Rachel
author_sort	Tan, Rachel
title	Determining human intention in videos 1
title_short	Determining human intention in videos 1
title_full	Determining human intention in videos 1
title_fullStr	Determining human intention in videos 1
title_full_unstemmed	Determining human intention in videos 1
title_sort	determining human intention in videos 1
publisher	Nanyang Technological University
publishDate	2023
url	https://hdl.handle.net/10356/166016
_version_	1764208135443054592

Determining human intention in videos 1

Similar Items