Big data analytics for smart transportation
As the cost of storage decreases, more data is being collected and if these data can provide us with insights, we will be able to make more well-informed decisions. Therefore, being able to draw insights from traffic data will enable road users to better plan their routes while the relevant authorit...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/138640 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-138640 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1386402022-09-21T05:59:11Z Big data analytics for smart transportation Lim, Hong Yee Mo Li School of Computer Science and Engineering limo@ntu.edu.sg Engineering::Computer science and engineering As the cost of storage decreases, more data is being collected and if these data can provide us with insights, we will be able to make more well-informed decisions. Therefore, being able to draw insights from traffic data will enable road users to better plan their routes while the relevant authorities in charge of road planning will be better able to allocate land for transport more efficiently. In this report, we will look at two different approaches available for traffic flow forecasting: parametric and non-parametric. All the models will be tested using a traffic flow dataset which is converted from raw GPS coordinates of taxis in Singapore. Both approaches will be evaluated against the Historical Average Model, which is the selected baseline model, using a set of evaluation metrics which are root mean squared error (RMSE), mean absolute error (MAE), accuracy, variance and R2 score. For the parametric approach, models such as the AutoRegressive Integrated Moving Average (ARIMA) was tested. For the non-parametric approach, models such as the Long Short Term Memory (LSTM) that capture only the temporal features of the dataset as well as models such as the Temporal Graph Convolutional Network (T-GCN) that capture both the temporal and spatial features of the dataset were tested. Therefore, the focus of this project is to determine which approach is more suitable for traffic flow forecasting and whether capturing both temporal and spatial features of the dataset will improve the model’s performance based on the set of evaluation metrics mentioned above. Bachelor of Engineering (Computer Science) 2020-05-11T06:03:07Z 2020-05-11T06:03:07Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/138640 en SCSE 19-0477 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Lim, Hong Yee Big data analytics for smart transportation |
description |
As the cost of storage decreases, more data is being collected and if these data can provide us with insights, we will be able to make more well-informed decisions. Therefore, being able to draw insights from traffic data will enable road users to better plan their routes while the relevant authorities in charge of road planning will be better able to allocate land for transport more efficiently.
In this report, we will look at two different approaches available for traffic flow forecasting: parametric and non-parametric. All the models will be tested using a traffic flow dataset which is converted from raw GPS coordinates of taxis in Singapore. Both approaches will be evaluated against the Historical Average Model, which is the selected baseline model, using a set of evaluation metrics which are root mean squared error (RMSE), mean absolute error (MAE), accuracy, variance and R2 score. For the parametric approach, models such as the AutoRegressive Integrated Moving Average (ARIMA) was tested. For the non-parametric approach, models such as the Long Short Term Memory (LSTM) that capture only the temporal features of the dataset as well as models such as the Temporal Graph Convolutional Network (T-GCN) that capture both the temporal and spatial features of the dataset were tested.
Therefore, the focus of this project is to determine which approach is more suitable for traffic flow forecasting and whether capturing both temporal and spatial features of the dataset will improve the model’s performance based on the set of evaluation metrics mentioned above. |
author2 |
Mo Li |
author_facet |
Mo Li Lim, Hong Yee |
format |
Final Year Project |
author |
Lim, Hong Yee |
author_sort |
Lim, Hong Yee |
title |
Big data analytics for smart transportation |
title_short |
Big data analytics for smart transportation |
title_full |
Big data analytics for smart transportation |
title_fullStr |
Big data analytics for smart transportation |
title_full_unstemmed |
Big data analytics for smart transportation |
title_sort |
big data analytics for smart transportation |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/138640 |
_version_ |
1745574626691383296 |