Vietnamese character recognition based on cnn model with reduced character classes

This article will detail the steps to build and train the convolutional neural network (CNN) model for Vietnamese character recognition in educational books. Based on this model, a mobile application for extracting text content from images in Vietnamese textbooks was built using OpenCV and Canny edg...

Full description

Saved in:
Bibliographic Details
Main Authors: Phan, T.H., Tran, D.C., Hassan, M.F.
Format: Article
Published: Institute of Advanced Engineering and Science 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85102988809&doi=10.11591%2feei.v10i2.2810&partnerID=40&md5=5e693f828b523a177314ee98545d5589
http://eprints.utp.edu.my/23853/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Petronas
Description
Summary:This article will detail the steps to build and train the convolutional neural network (CNN) model for Vietnamese character recognition in educational books. Based on this model, a mobile application for extracting text content from images in Vietnamese textbooks was built using OpenCV and Canny edge detection algorithm. There are 178 characters classes in Vietnamese with accents. However, within the scope of Vietnamese character recognition in textbooks, some classes of characters only differ in terms of actual sizes, such as �c� and �C�, �o� and �O�. Therefore, the authors built the classification model for 138 Vietnamese character classes after filtering out similar character classes to increase the model's effectiveness. © 2021, Institute of Advanced Engineering and Science. All rights reserved.