Big data analytics for smart transportation

As the cost of storage decreases, more data is being collected and if these data can provide us with insights, we will be able to make more well-informed decisions. Therefore, being able to draw insights from traffic data will enable road users to better plan their routes while the relevant authorit...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Hong Yee
Other Authors: Mo Li
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/138640
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-138640
record_format dspace
spelling sg-ntu-dr.10356-1386402022-09-21T05:59:11Z Big data analytics for smart transportation Lim, Hong Yee Mo Li School of Computer Science and Engineering limo@ntu.edu.sg Engineering::Computer science and engineering As the cost of storage decreases, more data is being collected and if these data can provide us with insights, we will be able to make more well-informed decisions. Therefore, being able to draw insights from traffic data will enable road users to better plan their routes while the relevant authorities in charge of road planning will be better able to allocate land for transport more efficiently. In this report, we will look at two different approaches available for traffic flow forecasting: parametric and non-parametric. All the models will be tested using a traffic flow dataset which is converted from raw GPS coordinates of taxis in Singapore. Both approaches will be evaluated against the Historical Average Model, which is the selected baseline model, using a set of evaluation metrics which are root mean squared error (RMSE), mean absolute error (MAE), accuracy, variance and R2 score. For the parametric approach, models such as the AutoRegressive Integrated Moving Average (ARIMA) was tested. For the non-parametric approach, models such as the Long Short Term Memory (LSTM) that capture only the temporal features of the dataset as well as models such as the Temporal Graph Convolutional Network (T-GCN) that capture both the temporal and spatial features of the dataset were tested. Therefore, the focus of this project is to determine which approach is more suitable for traffic flow forecasting and whether capturing both temporal and spatial features of the dataset will improve the model’s performance based on the set of evaluation metrics mentioned above. Bachelor of Engineering (Computer Science) 2020-05-11T06:03:07Z 2020-05-11T06:03:07Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/138640 en SCSE 19-0477 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Lim, Hong Yee
Big data analytics for smart transportation
description As the cost of storage decreases, more data is being collected and if these data can provide us with insights, we will be able to make more well-informed decisions. Therefore, being able to draw insights from traffic data will enable road users to better plan their routes while the relevant authorities in charge of road planning will be better able to allocate land for transport more efficiently. In this report, we will look at two different approaches available for traffic flow forecasting: parametric and non-parametric. All the models will be tested using a traffic flow dataset which is converted from raw GPS coordinates of taxis in Singapore. Both approaches will be evaluated against the Historical Average Model, which is the selected baseline model, using a set of evaluation metrics which are root mean squared error (RMSE), mean absolute error (MAE), accuracy, variance and R2 score. For the parametric approach, models such as the AutoRegressive Integrated Moving Average (ARIMA) was tested. For the non-parametric approach, models such as the Long Short Term Memory (LSTM) that capture only the temporal features of the dataset as well as models such as the Temporal Graph Convolutional Network (T-GCN) that capture both the temporal and spatial features of the dataset were tested. Therefore, the focus of this project is to determine which approach is more suitable for traffic flow forecasting and whether capturing both temporal and spatial features of the dataset will improve the model’s performance based on the set of evaluation metrics mentioned above.
author2 Mo Li
author_facet Mo Li
Lim, Hong Yee
format Final Year Project
author Lim, Hong Yee
author_sort Lim, Hong Yee
title Big data analytics for smart transportation
title_short Big data analytics for smart transportation
title_full Big data analytics for smart transportation
title_fullStr Big data analytics for smart transportation
title_full_unstemmed Big data analytics for smart transportation
title_sort big data analytics for smart transportation
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/138640
_version_ 1745574626691383296