DEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK
Data digitalization has become a crucial topic, especially in the current modern era. Manual document processing requires more time and effort compared to automated processing utilizing digital data. In the context of document digitalization, such as ID cards (KTP), driver's licenses (SIM),...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/78168 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:78168 |
---|---|
spelling |
id-itb.:781682023-09-18T10:28:14ZDEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK Riemenn, Louis Indonesia Final Project optical character recognition, synthetic data, text detection, text recognition, document, KK INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/78168 Data digitalization has become a crucial topic, especially in the current modern era. Manual document processing requires more time and effort compared to automated processing utilizing digital data. In the context of document digitalization, such as ID cards (KTP), driver's licenses (SIM), and family cards (KK), optical character recognition is employed to transform text in images into a digital format. This digital format can then be further processed digitally. In this final project, modules for generating synthetic data for KTP, SIM, and KK; text detection for KK documents; and text recognition for KK documents were developed. Synthetic data generation was accomplished using synthetic composites techniques, involving original images that were modified by adding synthetic elements that were previously absent. The original images (images from each KTP, SIM, and KK document) were modified by removing specific information in the images. False information was then added to the modified images, followed by introducing noise and tilting. The text detection and text recognition modules were built in three stages. The first stage involved selecting the most suitable model through benchmarking. The second stage consisted of training the chosen model using a KK dataset. The final stage was model evaluation to ensure that the model exhibited good performance and improvement compared to the previous state. Based on the conducted benchmarking, the selected model for text detection was DB with the MobileNetV3 backbone. Meanwhile, the chosen model for text recognition was SVTR with the SVTR-Tiny backbone. The selected models also demonstrated improved performance after training with the KK dataset, with the text detection model achieving precision of 97.80%, recall of 97.29%, and an F1 score of 97.54%, while the text recognition model achieved an accuracy of 99.99%. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Data digitalization has become a crucial topic, especially in the current modern era. Manual
document processing requires more time and effort compared to automated processing utilizing
digital data. In the context of document digitalization, such as ID cards (KTP), driver's licenses
(SIM), and family cards (KK), optical character recognition is employed to transform text in
images into a digital format. This digital format can then be further processed digitally. In this
final project, modules for generating synthetic data for KTP, SIM, and KK; text detection for KK
documents; and text recognition for KK documents were developed. Synthetic data generation
was accomplished using synthetic composites techniques, involving original images that were
modified by adding synthetic elements that were previously absent. The original images (images
from each KTP, SIM, and KK document) were modified by removing specific information in the
images. False information was then added to the modified images, followed by introducing noise
and tilting.
The text detection and text recognition modules were built in three stages. The first stage
involved selecting the most suitable model through benchmarking. The second stage consisted of
training the chosen model using a KK dataset. The final stage was model evaluation to ensure
that the model exhibited good performance and improvement compared to the previous state.
Based on the conducted benchmarking, the selected model for text detection was DB with the
MobileNetV3 backbone. Meanwhile, the chosen model for text recognition was SVTR with the
SVTR-Tiny backbone. The selected models also demonstrated improved performance after
training with the KK dataset, with the text detection model achieving precision of 97.80%, recall
of 97.29%, and an F1 score of 97.54%, while the text recognition model achieved an accuracy of
99.99%. |
format |
Final Project |
author |
Riemenn, Louis |
spellingShingle |
Riemenn, Louis DEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK |
author_facet |
Riemenn, Louis |
author_sort |
Riemenn, Louis |
title |
DEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK |
title_short |
DEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK |
title_full |
DEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK |
title_fullStr |
DEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK |
title_full_unstemmed |
DEVELOPMENT OF AN OCR MODULE FOR READING KK AND A SYNTHETIC DATA GENERATION MODULE FOR KTP, SIM, AND KK |
title_sort |
development of an ocr module for reading kk and a synthetic data generation module for ktp, sim, and kk |
url |
https://digilib.itb.ac.id/gdl/view/78168 |
_version_ |
1822995648429948928 |