ANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA
There are several types of public transportation in Indonesia, such as Angkot/Angkutan Kota (City Transportation) becoming the focus of this study. The angkot data used in this study was those collected from the applications, namely SEMUT and GPS Tracker, installed in every angkot. This GPS se...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/55196 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:55196 |
---|---|
spelling |
id-itb.:551962021-06-16T09:27:57ZANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA Rin Nurmalasari, Rin Indonesia Theses Data Analysis, LSTM, Autoencoder, Anomaly behavior angkot, ID3 INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/55196 There are several types of public transportation in Indonesia, such as Angkot/Angkutan Kota (City Transportation) becoming the focus of this study. The angkot data used in this study was those collected from the applications, namely SEMUT and GPS Tracker, installed in every angkot. This GPS sensor data can be used for facilitating the monitoring of all activities and behaviors done by angkot. This study would create a model that could detect anomaly behavior angkot done by angkot. An anomaly is defined as abnormal behavior. The model was constructed to provide information if the anomaly behavior angkot was identified. Before conducting data analysis, the data taken from SEMUT should pass the data cleaning process, such as parsing, correction and standardization, enhancement, matching, and consolidation. The pre-processing was thereupon conducted including attribute selection, Exploratory Data Analysis (EDA), determining data training, and data testing. After conducting the EDA stage, the activities and behaviors done by angkot could be observed and defined as anomaly behavior angkot. Subsequently, the data separation for the normal behavior in the angkot to construct the anomaly behavior angkot detection using the Long-Short Term Memory (LSTM) autoencoder. This model was used for training the normal data without any label. The result from the model evaluation had a gradual drop in the loss MAE value. The final value in the training reached 0.0099. The output from the result of the anomaly behavior angkot detection using the long-short term memory autoencoder was the data label. The data would be automatically labeled with true or false. The true label was provided if the loss MAE value was more than the threshold value (0.1), indicating that there was anomaly behavior angkot. Meanwhile, the false label was given when the loss MAE in the tested data was less than the threshold value of 0.1, indicating that the angkot had normal behavior. After testing and checking by comparing the output of the model and the checking result using a manual method using the data exploration, the Long-Short Term Memory autoencoder had an accuracy of 89.6%. Further, the anomaly behavior angkot using the Iterative Dichotomiser Three (ID3) method was constructed. The anomaly behavior angkot done by the angkot consisted of off-Track, waiting time, and Speeding up. This training process was done by data normalization and nondata normalization. The evaluation technique for the training and testing was 10- fold cross-validation. To know the difference between the ID3 performance using normalization and the ID3 performance without normalization, the t-test was used.iv Based on the testing, since the sig. value of ? ?, namely sig. (p-value) of 0.001 or less than 0.05, the H0 was rejected and H1 was accepted (a significant difference identified). It can conclude that the performance of the ID3 using normalization is higher than the performance of the ID3 without normalization. The ID3 using normalization had an accuracy of 98%, while the ID3 without normalization had an accuracy of 95%. Furthermore, manual testing was applied to the models by comparing time of the model with time of the data exploration. The result of the checking and testing had an accuracy of 91.2%. Lastly, the result of EDA and the model is presented in the form of a web dashboard. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
There are several types of public transportation in Indonesia, such as
Angkot/Angkutan Kota (City Transportation) becoming the focus of this study. The
angkot data used in this study was those collected from the applications, namely
SEMUT and GPS Tracker, installed in every angkot. This GPS sensor data can be
used for facilitating the monitoring of all activities and behaviors done by angkot.
This study would create a model that could detect anomaly behavior angkot done
by angkot. An anomaly is defined as abnormal behavior. The model was
constructed to provide information if the anomaly behavior angkot was identified.
Before conducting data analysis, the data taken from SEMUT should pass the data
cleaning process, such as parsing, correction and standardization, enhancement,
matching, and consolidation. The pre-processing was thereupon conducted
including attribute selection, Exploratory Data Analysis (EDA), determining data
training, and data testing. After conducting the EDA stage, the activities and
behaviors done by angkot could be observed and defined as anomaly behavior
angkot. Subsequently, the data separation for the normal behavior in the angkot to
construct the anomaly behavior angkot detection using the Long-Short Term
Memory (LSTM) autoencoder. This model was used for training the normal data
without any label. The result from the model evaluation had a gradual drop in the
loss MAE value. The final value in the training reached 0.0099. The output from
the result of the anomaly behavior angkot detection using the long-short term
memory autoencoder was the data label. The data would be automatically labeled
with true or false. The true label was provided if the loss MAE value was more than
the threshold value (0.1), indicating that there was anomaly behavior angkot.
Meanwhile, the false label was given when the loss MAE in the tested data was less
than the threshold value of 0.1, indicating that the angkot had normal behavior.
After testing and checking by comparing the output of the model and the checking
result using a manual method using the data exploration, the Long-Short Term
Memory autoencoder had an accuracy of 89.6%. Further, the anomaly behavior
angkot using the Iterative Dichotomiser Three (ID3) method was constructed. The
anomaly behavior angkot done by the angkot consisted of off-Track, waiting time,
and Speeding up. This training process was done by data normalization and nondata normalization. The evaluation technique for the training and testing was 10-
fold cross-validation. To know the difference between the ID3 performance using
normalization and the ID3 performance without normalization, the t-test was used.iv
Based on the testing, since the sig. value of ? ?, namely sig. (p-value) of 0.001 or
less than 0.05, the H0 was rejected and H1 was accepted (a significant difference
identified). It can conclude that the performance of the ID3 using normalization is
higher than the performance of the ID3 without normalization. The ID3 using
normalization had an accuracy of 98%, while the ID3 without normalization had
an accuracy of 95%. Furthermore, manual testing was applied to the models by
comparing time of the model with time of the data exploration. The result of the
checking and testing had an accuracy of 91.2%. Lastly, the result of EDA and the
model is presented in the form of a web dashboard. |
format |
Theses |
author |
Rin Nurmalasari, Rin |
spellingShingle |
Rin Nurmalasari, Rin ANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA |
author_facet |
Rin Nurmalasari, Rin |
author_sort |
Rin Nurmalasari, Rin |
title |
ANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA |
title_short |
ANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA |
title_full |
ANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA |
title_fullStr |
ANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA |
title_full_unstemmed |
ANOMALY BEHAVIOR ANGKOT DETECTION OF ANGKOT USING AUTOENCODER LONG SHORT TERM MEMORY AND ID3 BASED ON THE TRAVEL HISTORICAL DATA |
title_sort |
anomaly behavior angkot detection of angkot using autoencoder long short term memory and id3 based on the travel historical data |
url |
https://digilib.itb.ac.id/gdl/view/55196 |
_version_ |
1822929834066575360 |