DATA QUALITY IMPROVEMENT FOR ANGKOT ACTIVITY ANALYSIS AND THE CREATION OF ANGKOT BEHAVIOR DETECTION ALGORITHM
In previous studies, geolocation data was collected from two sources: the angkot Android application and an ESP8266 WiFi module used as a GPS tracker. The collected data is stored in a MongoDB database. Inspection of the stored data found 15 types of ...
Main Author: | Taufiq Al Ghifari, Nasy`an |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/70695 |
Institution: | Institut Teknologi Bandung |
id |
id-itb.:70695 |
---|---|
spelling |
id-itb.:70695; 2023-01-19T09:44:03Z; DATA QUALITY IMPROVEMENT FOR ANGKOT ACTIVITY ANALYSIS AND THE CREATION OF ANGKOT BEHAVIOR DETECTION ALGORITHM; Taufiq Al Ghifari, Nasy`an; Indonesia; Theses; geolocation data, public transportation, data quality, data cleaning, classification; INSTITUT TEKNOLOGI BANDUNG; https://digilib.itb.ac.id/gdl/view/70695; text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
In previous studies, geolocation data was collected from two sources: the angkot
Android application and an ESP8266 WiFi module used as a GPS tracker. The collected
data is stored in a MongoDB database. Inspection of the stored data found 15 types
of 'dirty data' according to Lin Li's dirty data taxonomy, so the data must be
cleaned before it can yield accurate analysis results. Before cleaning, the quality
of the data is measured using Redman's theory. Data quality is then improved
through data cleaning. The cleaning process is tailored both to the purpose of the
processing, which is to analyze angkot activity from the available data, and to the
condition of the existing data. The data cleaning method applied to the angkot data
successfully cleared 86.6% of the previously identified types of 'dirty data'.
After cleaning, the angkot data passes directly to the data pre-processing stage,
whose output is angkot data ready to be displayed in a web application as the final
result of the data visualization process. The web application shows angkot activity
and makes angkot travel patterns observable. From these observations, normal and
abnormal angkot behavior can be defined, which supports the creation of an angkot
behavior detection algorithm. Once the algorithm was developed, an experiment
compared the abnormal times it reported with the abnormal times found by exploring
the data in the web application. The accuracy of the angkot behavior detection
algorithm is 63%. |
format |
Theses |
author |
Taufiq Al Ghifari, Nasy`an |
title |
DATA QUALITY IMPROVEMENT FOR ANGKOT ACTIVITY ANALYSIS AND THE CREATION OF ANGKOT BEHAVIOR DETECTION ALGORITHM |
url |
https://digilib.itb.ac.id/gdl/view/70695 |
_version_ |
1822991691916771328 |
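The evaluation described in the abstract compares the abnormal times reported by the detection algorithm with those found by manual exploration in the web application. The record does not say how agreement was scored, so the following is only a minimal sketch under stated assumptions: the function name `match_rate`, the five-minute `tolerance`, and all timestamps are hypothetical and are not taken from the thesis.

```python
from datetime import datetime, timedelta


def match_rate(detected, observed, tolerance=timedelta(minutes=5)):
    """Fraction of manually observed abnormal times matched by a detection.

    Assumption: a detection "matches" an observation when the two timestamps
    differ by at most `tolerance`. The thesis may define agreement differently.
    """
    if not observed:
        raise ValueError("observed must contain at least one timestamp")
    matched = sum(
        1 for obs in observed
        if any(abs(obs - det) <= tolerance for det in detected)
    )
    return matched / len(observed)


# Made-up timestamps for illustration only; not data from the thesis.
observed = [
    datetime(2020, 1, 6, 8, 0),
    datetime(2020, 1, 6, 12, 30),
    datetime(2020, 1, 6, 17, 45),
    datetime(2020, 1, 6, 21, 10),
]
detected = [
    datetime(2020, 1, 6, 8, 3),
    datetime(2020, 1, 6, 12, 50),
    datetime(2020, 1, 6, 17, 44),
]

rate = match_rate(detected, observed)
print(f"{rate:.0%}")  # 2 of the 4 observed times are matched -> 50%
```

Scoring against the manually observed times treats the web exploration as the reference; a stricter evaluation would also count detections that have no matching observation.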