Detection and rectification of arbitrary shaped scene texts by using text keypoints and links

Detection and recognition of scene texts of arbitrary shapes remain a grand challenge due to the super-rich text shape variation in text line orientations, lengths, curvatures, etc. This paper presents a mask-guided multi-task network that detects and rectifies scene texts of arbitrary shapes reliab...

全面介紹

Saved in:

書目詳細資料
Main Authors:	Xue, Chuhui, Lu, Shijian, Hoi, Steven
其他作者:	School of Computer Science and Engineering
格式:	Article
語言:	English
出版:	2022
主題:	Engineering::Computer science and engineering Scene Text Recognition Deep Learning
在線閱讀:	https://hdl.handle.net/10356/161795
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Nanyang Technological University
語言:	English

id	sg-ntu-dr.10356-161795
record_format	dspace
spelling	sg-ntu-dr.10356-1617952022-09-20T05:36:48Z Detection and rectification of arbitrary shaped scene texts by using text keypoints and links Xue, Chuhui Lu, Shijian Hoi, Steven School of Computer Science and Engineering Engineering::Computer science and engineering Scene Text Recognition Deep Learning Detection and recognition of scene texts of arbitrary shapes remain a grand challenge due to the super-rich text shape variation in text line orientations, lengths, curvatures, etc. This paper presents a mask-guided multi-task network that detects and rectifies scene texts of arbitrary shapes reliably. Three types of keypoints are detected which specify the centre line and so the shape of text instances accurately. In addition, four types of keypoint links are detected of which the horizontal links associate the detected keypoints of each text instance and the vertical links predict a pair of landmark points (for each keypoint) along the upper and lower text boundary, respectively. Scene texts can be located and rectified by linking up the associated landmark points (giving localization polygon boxes) and transforming the polygon boxes via thin plate spline, respectively. Extensive experiments over several public datasets show that the use of text keypoints is tolerant to the variation in text orientations, lengths, and curvatures, and it achieves competitive scene text detection and rectification performance as compared with state-of-the-art methods. 2022-09-20T05:36:48Z 2022-09-20T05:36:48Z 2022 Journal Article Xue, C., Lu, S. & Hoi, S. (2022). Detection and rectification of arbitrary shaped scene texts by using text keypoints and links. Pattern Recognition, 124, 108494-. https://dx.doi.org/10.1016/j.patcog.2021.108494 0031-3203 https://hdl.handle.net/10356/161795 10.1016/j.patcog.2021.108494 2-s2.0-85122476297 124 108494 en Pattern Recognition © 2021 Elsevier Ltd. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering Scene Text Recognition Deep Learning
spellingShingle	Engineering::Computer science and engineering Scene Text Recognition Deep Learning Xue, Chuhui Lu, Shijian Hoi, Steven Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
description	Detection and recognition of scene texts of arbitrary shapes remain a grand challenge due to the super-rich text shape variation in text line orientations, lengths, curvatures, etc. This paper presents a mask-guided multi-task network that detects and rectifies scene texts of arbitrary shapes reliably. Three types of keypoints are detected which specify the centre line and so the shape of text instances accurately. In addition, four types of keypoint links are detected of which the horizontal links associate the detected keypoints of each text instance and the vertical links predict a pair of landmark points (for each keypoint) along the upper and lower text boundary, respectively. Scene texts can be located and rectified by linking up the associated landmark points (giving localization polygon boxes) and transforming the polygon boxes via thin plate spline, respectively. Extensive experiments over several public datasets show that the use of text keypoints is tolerant to the variation in text orientations, lengths, and curvatures, and it achieves competitive scene text detection and rectification performance as compared with state-of-the-art methods.
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Xue, Chuhui Lu, Shijian Hoi, Steven
format	Article
author	Xue, Chuhui Lu, Shijian Hoi, Steven
author_sort	Xue, Chuhui
title	Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
title_short	Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
title_full	Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
title_fullStr	Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
title_full_unstemmed	Detection and rectification of arbitrary shaped scene texts by using text keypoints and links
title_sort	detection and rectification of arbitrary shaped scene texts by using text keypoints and links
publishDate	2022
url	https://hdl.handle.net/10356/161795
_version_	1745574614754394112

Detection and rectification of arbitrary shaped scene texts by using text keypoints and links

相似書籍