Mobile application on a scene text spotting

Scene text spotting serves as an important concept in many practical applications. In particular, applications of text spotting include, but are not limited to, reducing the human labor of manual text extraction, retrieving information from images for image context analysis, or automatically ide...

Bibliographic Details
Main Author: Nguyen Doan Hoang Lam
Other Authors: Loke Yuan Ren
Format: Final Year Project
Language: English
Published: Nanyang Technological University 2021
Subjects: Engineering::Computer science and engineering::Software::Software engineering
Online Access: https://hdl.handle.net/10356/147958
Institution: Nanyang Technological University
id sg-ntu-dr.10356-147958
record_format dspace
spelling sg-ntu-dr.10356-147958 2021-04-20T08:02:24Z Mobile application on a scene text spotting Nguyen Doan Hoang Lam Loke Yuan Ren School of Computer Science and Engineering yrloke@ntu.edu.sg Engineering::Computer science and engineering::Software::Software engineering Bachelor of Engineering (Computer Science) 2021-04-20T08:02:24Z 2021-04-20T08:02:24Z 2021 Final Year Project (FYP) Nguyen Doan Hoang Lam (2021). Mobile application on a scene text spotting. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/147958 https://hdl.handle.net/10356/147958 en SCSE20-0091 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Software::Software engineering
description Scene text spotting underpins many practical applications. These include, but are not limited to, reducing the human labor of manual text extraction, retrieving information from images for image context analysis, and automatically identifying a person by reading their identification card. Given these uses, many software applications adopting different text spotting methods have been implemented. While these applications have eased many activities that require extracting text from natural scenes, they also have limitations: some still require too many manual actions from users and cannot spot text in real time, while others suffer from accuracy and performance issues because they rely on obsolete text spotting methods. To resolve these issues, this project proposes a mobile application that adopts a well-known text spotting approach, the Adaptive Bezier Curve Network. This approach, evaluated on the TotalText and CTW1500 datasets, achieves state-of-the-art accuracy with considerably higher inference speed than other existing state-of-the-art methods. In addition to adopting this approach, the project built a mobile application and a text spotting server using socket programming and a custom-defined image streaming protocol. Finally, experiments measuring the application's performance show that it is capable of real-time text spotting at up to eight frames per second on average while retaining state-of-the-art text spotting accuracy.
author2 Loke Yuan Ren
format Final Year Project
author Nguyen Doan Hoang Lam
title Mobile application on a scene text spotting
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/147958