DEVELOPMENT OF A SYSTEM FOR UNDERSTANDING BPKB FORMAL DOCUMENTS AND HANDWRITTEN TEXT USING OPTICAL CHARACTER RECOGNITION (OCR) TECHNIQUES
Main Author: | Ananda Pratama Resyaly, Daffa |
---|---|
Format: | Final Project |
Language: | Indonesian |
Online Access: | https://digilib.itb.ac.id/gdl/view/75327 |
Institution: | Institut Teknologi Bandung |
id |
id-itb.:75327 |
---|---|
spelling |
id-itb.:75327 2023-07-26T15:20:01Z Ananda Pratama Resyaly, Daffa. Keywords: formal document understanding, BPKB, handwritten text, OCR, text detection, text recognition, named entity recognition. |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesian |
description |
This final project develops a system for understanding formal documents, specifically the
BPKB (Buku Pemilik Kendaraan Bermotor, the Indonesian vehicle ownership book), and
handwritten text using optical character recognition (OCR) techniques. The system
requires an internet connection during the document recognition phase. Processing
involves three main stages: text detection, text recognition, and named entity
recognition.
In the text detection phase, the system uses a text detection algorithm to identify areas of the
formal document that potentially contain text. In the text recognition phase, an OCR
algorithm is then applied to recognize the text in each detected area. Finally, in the
named entity recognition phase, the system applies a natural language processing (NLP)
algorithm to recognize specific entities, such as the owner's name in the formal document.
Testing was carried out using various formal documents with image quality variations, and the
test results showed a reasonable level of accuracy and inference time.
The system is expected to make understanding and extracting
information from formal documents more efficient and accurate. It can be
used to automate and digitize formal documents, improve information
accessibility, and reduce human error in data processing. |
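The three stages described in the abstract (text detection, text recognition, named entity recognition) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the thesis: the `Document` stand-in, the stub stage functions, and the `NAMA PEMILIK` field label are assumptions chosen so the example runs without any OCR library. In particular, a regular expression stands in for the NLP-based entity recognizer, and the timing only hints at how inference time might be measured.

```python
import re
import time
from dataclasses import dataclass
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) of a text region


@dataclass
class Document:
    """Hypothetical stand-in for a scanned page: region -> text it contains."""
    regions: Dict[Box, str]


def detect_text(doc: Document) -> List[Box]:
    """Stage 1 (stub): return candidate text regions, top to bottom."""
    return sorted(doc.regions, key=lambda box: (box[1], box[0]))


def recognize_text(doc: Document, box: Box) -> str:
    """Stage 2 (stub): return the text inside one detected region."""
    return doc.regions[box]


# Assumed field label; real BPKB layouts and wording may differ.
OWNER_PATTERN = re.compile(r"NAMA PEMILIK\s*:\s*(.+)", re.IGNORECASE)


def extract_entities(lines: List[str]) -> Dict[str, str]:
    """Stage 3 (rule-based stand-in for NER): pull out the owner's name."""
    entities: Dict[str, str] = {}
    for line in lines:
        match = OWNER_PATTERN.search(line)
        if match:
            entities["owner_name"] = match.group(1).strip()
    return entities


def understand(doc: Document) -> Tuple[Dict[str, str], float]:
    """Run detection, recognition, and entity extraction; time the whole run."""
    start = time.perf_counter()
    boxes = detect_text(doc)
    lines = [recognize_text(doc, box) for box in boxes]
    entities = extract_entities(lines)
    elapsed = time.perf_counter() - start
    return entities, elapsed


# Made-up sample data, purely for illustration.
sample = Document(regions={
    (40, 120, 300, 20): "ALAMAT : JL. CONTOH NO. 1",
    (40, 90, 300, 20): "NAMA PEMILIK : BUDI SANTOSO",
})
entities, seconds = understand(sample)
print(entities)  # {'owner_name': 'BUDI SANTOSO'}
```

Keeping each stage as a separate function means a real detector, OCR engine, or NER model could replace the corresponding stub without changing `understand`.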
format |
Final Project |
author |
Ananda Pratama Resyaly, Daffa |
title |
DEVELOPMENT OF A SYSTEM FOR UNDERSTANDING BPKB FORMAL DOCUMENTS AND HANDWRITTEN TEXT USING OPTICAL CHARACTER RECOGNITION (OCR) TECHNIQUES |