Machine learning based characters recognition

This final year project aims to study and implement some machine learning techniques for character recognition. The author was tasked to develop a mobile app for a business card scanner based on these techniques. The author has chosen to do research on Tesseract, which is an open-source optical char...

Full description

Saved in:
Bibliographic Details
Main Author: Song, Tianyi
Other Authors: Huang Guangbin
Format: Final Year Project
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/75485
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-75485
record_format dspace
spelling sg-ntu-dr.10356-754852023-07-07T16:19:53Z Machine learning based characters recognition Song, Tianyi Huang Guangbin School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering This final year project aims to study and implement some machine learning techniques for character recognition. The author was tasked to develop a mobile app for a business card scanner based on these techniques. The author has chosen to do research on Tesseract, which is an open-source optical character recognition (OCR) engine sponsored by Google and has embedded the Tess-two library locally into the business card scanner. The scanner was developed for Android systems. It is able to scan characters on business cards, distinguish the information and save it into the entry attributes for a new contact. It includes functions of photo cropping and saving, character recognition, information extraction and contact adding. The app design, app structure, key codes and testing results will be included in this report. Since OCR is the key technology of the application, its principle and development will be discussed for basic understanding as well as future improvement of the scanner. Bachelor of Engineering 2018-05-31T08:34:57Z 2018-05-31T08:34:57Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/75485 en Nanyang Technological University 42 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Song, Tianyi
Machine learning based characters recognition
description This final year project aims to study and implement some machine learning techniques for character recognition. The author was tasked to develop a mobile app for a business card scanner based on these techniques. The author has chosen to do research on Tesseract, which is an open-source optical character recognition (OCR) engine sponsored by Google and has embedded the Tess-two library locally into the business card scanner. The scanner was developed for Android systems. It is able to scan characters on business cards, distinguish the information and save it into the entry attributes for a new contact. It includes functions of photo cropping and saving, character recognition, information extraction and contact adding. The app design, app structure, key codes and testing results will be included in this report. Since OCR is the key technology of the application, its principle and development will be discussed for basic understanding as well as future improvement of the scanner.
author2 Huang Guangbin
author_facet Huang Guangbin
Song, Tianyi
format Final Year Project
author Song, Tianyi
author_sort Song, Tianyi
title Machine learning based characters recognition
title_short Machine learning based characters recognition
title_full Machine learning based characters recognition
title_fullStr Machine learning based characters recognition
title_full_unstemmed Machine learning based characters recognition
title_sort machine learning based characters recognition
publishDate 2018
url http://hdl.handle.net/10356/75485
_version_ 1772827935881822208