Deformable scene text detection using harmonic features and modified pixel aggregation network

Although text detection methods have addressed several challenges in the past, there is a dearth of effective methods for text detection in deformable images, such as images containing text embedded on cloth, banners, rubber, sports jerseys, uniforms, etc. This is because deformable regions contain...

Bibliographic Details
Main Authors: Jain, Tanmay; Palaiahnakote, Shivakumara; Pal, Umapada; Liu, Cheng-Lin
Format: Article
Published: Elsevier 2021
Subjects: QA75 Electronic computers. Computer science
Online Access: http://eprints.um.edu.my/26409/
Institution: Universiti Malaya
id my.um.eprints.26409
record_format eprints
spelling my.um.eprints.26409 2022-02-28T00:48:37Z http://eprints.um.edu.my/26409/ Deformable scene text detection using harmonic features and modified pixel aggregation network Jain, Tanmay Palaiahnakote, Shivakumara Pal, Umapada Liu, Cheng-Lin QA75 Electronic computers. Computer science Although text detection methods have addressed several challenges in the past, there is a dearth of effective methods for text detection in deformable images, such as images containing text embedded on cloth, banners, rubber, sports jerseys, uniforms, etc. This is because deformable regions contain surfaces of arbitrary shapes, which lead to poor text quality. This paper presents a new method for deformable text detection in natural scene images. It is observed that although the shapes of characters change in a deformable region, the pixel values and spatial relationship between the pixels do not change. This motivated us to explore extraction of Maximally Stable Extremal Regions (MSER) in an image in which pixels that share common features are grouped into components. The unique character shape variations led us to explore harmonic features to represent the component shape variations, which a classifier then uses to separate text from non-text components in the output of the MSER step. Additionally, the objective of developing a lightweight method with low computational cost motivated us to introduce a modified Pixel Aggregation Network (PAN) for deformable text detection at a component level. Comprehensive experiments on our Deformable Text Dataset (DTD) and on standard natural scene text datasets, namely MSRA-TD500, ICDAR 2019 MLT, Total-Text, CTW1500, ICDAR 2019 ArT and DSTA1500, show that the proposed model outperforms the existing methods on our dataset as well as on the standard datasets. (c) 2021 Elsevier B.V. All rights reserved.
Elsevier 2021-12 Article PeerReviewed Jain, Tanmay and Palaiahnakote, Shivakumara and Pal, Umapada and Liu, Cheng-Lin (2021) Deformable scene text detection using harmonic features and modified pixel aggregation network. Pattern Recognition Letters, 152. pp. 135-142. ISSN 0167-8655, DOI https://doi.org/10.1016/j.patrec.2021.10.006
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
description Although text detection methods have addressed several challenges in the past, there is a dearth of effective methods for text detection in deformable images, such as images containing text embedded on cloth, banners, rubber, sports jerseys, uniforms, etc. This is because deformable regions contain surfaces of arbitrary shapes, which lead to poor text quality. This paper presents a new method for deformable text detection in natural scene images. It is observed that although the shapes of characters change in a deformable region, the pixel values and spatial relationship between the pixels do not change. This motivated us to explore extraction of Maximally Stable Extremal Regions (MSER) in an image in which pixels that share common features are grouped into components. The unique character shape variations led us to explore harmonic features to represent the component shape variations, which a classifier then uses to separate text from non-text components in the output of the MSER step. Additionally, the objective of developing a lightweight method with low computational cost motivated us to introduce a modified Pixel Aggregation Network (PAN) for deformable text detection at a component level. Comprehensive experiments on our Deformable Text Dataset (DTD) and on standard natural scene text datasets, namely MSRA-TD500, ICDAR 2019 MLT, Total-Text, CTW1500, ICDAR 2019 ArT and DSTA1500, show that the proposed model outperforms the existing methods on our dataset as well as on the standard datasets. (c) 2021 Elsevier B.V. All rights reserved.
format Article
author Jain, Tanmay
Palaiahnakote, Shivakumara
Pal, Umapada
Liu, Cheng-Lin
title Deformable scene text detection using harmonic features and modified pixel aggregation network
publisher Elsevier
publishDate 2021
url http://eprints.um.edu.my/26409/
_version_ 1735409408952762368