Mobile application on a scene text spotting

Scene text spotting serves as an important concept in many practical applications. In particular, the applications of text spotting may include but not limited to reducing human labor of manual text extraction tasks, retrieving information from images for image context analysis, or automatically ide...

Full description

Saved in:
Bibliographic Details
Main Author: Nguyen Doan Hoang Lam
Other Authors: Loke Yuan Ren
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/147958
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Scene text spotting serves as an important concept in many practical applications. In particular, the applications of text spotting may include but not limited to reducing human labor of manual text extraction tasks, retrieving information from images for image context analysis, or automatically identifying human identity by reading their identification card. With many important applications of text spotting, many attempts on implementing software applications that adopted different text spotting methods have been made. While these existing applications have eased many human activities that require text extraction from natural scenes, they also experienced some limitations. Some applications still require too many manual actions from users and cannot spot text in real-time while others may suffer from accuracy and performance issues due to the obsolete text spotting methods. To resolve those issues, this project proposed the implementation of a mobile application that adopted a well-known text spotting approach known as the Adaptive Bezier Curve Network. The performance of this approach, which has been evaluated on TotalText and CTW1500 dataset, proved to achieve a state-of-the-art accuracy while having considerably high inference speed compared to the other existing state-of-the-art methods. In addition to adopting this approach, the project has successfully built an application and a text spotting server using socket programming method as well as our own defined image streaming protocol. Finally, the experiments conducted to measure the performance of the application shows that it is capable of real-time text spotting with up to eight frames per seconds on average while retaining the state-of-the-art text spotting accuracy.