Training deep network models for accurate recognition of texts in scene images

With the advent of the industrial 4.0 era, computer vision and its applications have played an increasingly significant role in the digitization of industries. Images that contain text can be used as sources of information for automating processes. Text recognition from scene images constitute...

Full description

Saved in:

Bibliographic Details
Main Author:	Zhang, Weilun
Other Authors:	Lu Shijian
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Online Access:	https://hdl.handle.net/10356/156999
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-156999
record_format	dspace
spelling	sg-ntu-dr.10356-1569992022-05-06T04:29:42Z Training deep network models for accurate recognition of texts in scene images Zhang, Weilun Lu Shijian School of Computer Science and Engineering Shijian.Lu@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision With the advent of the industrial 4.0 era, computer vision and its applications have played an increasingly significant role in the digitization of industries. Images that contain text can be used as sources of information for automating processes. Text recognition from scene images constitutes still a very challenging task because of various backgrounds, variations in size and space, as well as irregular arrangements. The first half of this paper will study and analyse a deep learning model for word recognition in scene images using deep learning. As part of this work, CRNN, one of the most extensively used state-of-the-art deep learning models for text recognition, will be implemented and trained. A number of hyperparameters will be tweaked to obtain optimum performance. In addition, further improvements to CRNN recognition accuracy will be explored. Different image super-resolution techniques and the ImageFilter models will be utilized and compared as a pre-processing stage for recognition in the second part of this paper. Bachelor of Engineering (Computer Science) 2022-05-06T04:29:42Z 2022-05-06T04:29:42Z 2022 Final Year Project (FYP) Zhang, W. (2022). Training deep network models for accurate recognition of texts in scene images. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156999 https://hdl.handle.net/10356/156999 en PSCSE20-0069 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Zhang, Weilun Training deep network models for accurate recognition of texts in scene images
description	With the advent of the industrial 4.0 era, computer vision and its applications have played an increasingly significant role in the digitization of industries. Images that contain text can be used as sources of information for automating processes. Text recognition from scene images constitutes still a very challenging task because of various backgrounds, variations in size and space, as well as irregular arrangements. The first half of this paper will study and analyse a deep learning model for word recognition in scene images using deep learning. As part of this work, CRNN, one of the most extensively used state-of-the-art deep learning models for text recognition, will be implemented and trained. A number of hyperparameters will be tweaked to obtain optimum performance. In addition, further improvements to CRNN recognition accuracy will be explored. Different image super-resolution techniques and the ImageFilter models will be utilized and compared as a pre-processing stage for recognition in the second part of this paper.
author2	Lu Shijian
author_facet	Lu Shijian Zhang, Weilun
format	Final Year Project
author	Zhang, Weilun
author_sort	Zhang, Weilun
title	Training deep network models for accurate recognition of texts in scene images
title_short	Training deep network models for accurate recognition of texts in scene images
title_full	Training deep network models for accurate recognition of texts in scene images
title_fullStr	Training deep network models for accurate recognition of texts in scene images
title_full_unstemmed	Training deep network models for accurate recognition of texts in scene images
title_sort	training deep network models for accurate recognition of texts in scene images
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/156999
_version_	1734310319419817984

Training deep network models for accurate recognition of texts in scene images

Similar Items