Text restoration using image super resolution

Text Recognition and scene text recognition have gained high prominence with the emergence of advanced deep learning techniques, such as CNNs. However, when the scene data is of low resolution, most models fail to provide accurate results. To this extent, super resolution is proposed as a pre proces...

全面介紹

Saved in:

書目詳細資料
主要作者:	Bodipati, Kiran
其他作者:	Chen Change Loy
格式:	Final Year Project
語言:	English
出版:	Nanyang Technological University 2023
主題:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
在線閱讀:	https://hdl.handle.net/10356/166103
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Nanyang Technological University
語言:	English

實物特徵
總結:	Text Recognition and scene text recognition have gained high prominence with the emergence of advanced deep learning techniques, such as CNNs. However, when the scene data is of low resolution, most models fail to provide accurate results. To this extent, super resolution is proposed as a pre processing technique to improve the resolution of the images. Traditional Super Resolution models are developed for natural scenes and tend to fail in the case of scene text, due to several characteristics of the text that make it challenging for text super resolution. The lack of high quality datasets for this task is a factor in the poor performance of existing models. In our study, we provide a comprehensive review of existing super resolution techniques and the techniques specific to the context of scene text data. In this study, we build a new practical dataset that can be used to this extent. We create high resolution synthetic text data and collect high resolution images crawling the web. The corresponding low resolution images are created using a practical higher order degradation model. We train on the architecture of Real-ESRGAN and provide a qualitative and qualitative study of the datasets proposed and demonstrate the performance of the new models. Comparisons against the pre-trained Real-ESRGAN model is provided. The limitations of the proposed datasets and models are discussed.

Text restoration using image super resolution

相似書籍