A new ring radius transform-based thinning method for multi-oriented video characters

Thinning that preserves visual topology of characters in video is challenging in the field of document analysis and video text analysis due to low resolution and complex background. This paper proposes to explore ring radius transform (RRT) to generate a radius map from Canny edges of each input ima...

Full description

Saved in:

Bibliographic Details
Main Authors:	Wu, Y., Shivakumara, P., Wei, W., Lu, T., Pal, U.
Format:	Article
Published:	Springer Verlag (Germany) 2015
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://eprints.um.edu.my/19427/ http://dx.doi.org/10.1007/s10032-015-0238-y
Tags:	Add Tag No Tags, Be the first to tag this record!

id	my.um.eprints.19427
record_format	eprints
spelling	my.um.eprints.194272018-09-26T04:44:36Z http://eprints.um.edu.my/19427/ A new ring radius transform-based thinning method for multi-oriented video characters Wu, Y. Shivakumara, P. Wei, W. Lu, T. Pal, U. QA75 Electronic computers. Computer science Thinning that preserves visual topology of characters in video is challenging in the field of document analysis and video text analysis due to low resolution and complex background. This paper proposes to explore ring radius transform (RRT) to generate a radius map from Canny edges of each input image to obtain its medial axis. A radius value contained in the radius map here is the nearest distance to the edge pixels on contours. For the radius map, the method proposes a novel idea for identifying medial axis (middle pixels between two strokes) for arbitrary orientations of the character. Iterative-maximal-growing is then proposed to connect missing medial axis pixels at junctions and intersections. Next, we perform histogram on color information of medial axes with clustering to eliminate false medial axis segments. The method finally restores the shape of the character through radius values of medial axis pixels for the purpose of recognition with the Google Open source OCR (Tesseract). The method has been tested on video, natural scene and handwritten characters from ICDAR 2013, SVT, arbitrary-oriented data from MSRA-TD500, multi-script character data and MPEG7 object data to evaluate its performances at thinning level as well as recognition level. Experimental results comparing with the state-of-the-art methods show that the proposed method is generic and outperforms the existing methods in terms of obtaining skeleton, preserving visual topology and recognition rate. The method is also robust to handle characters of arbitrary orientations. Springer Verlag (Germany) 2015 Article PeerReviewed Wu, Y. and Shivakumara, P. and Wei, W. and Lu, T. and Pal, U. (2015) A new ring radius transform-based thinning method for multi-oriented video characters. International Journal on Document Analysis and Recognition (IJDAR), 18 (2). pp. 137-151. ISSN 1433-2833 http://dx.doi.org/10.1007/s10032-015-0238-y doi:10.1007/s10032-015-0238-y
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Research Repository
url_provider	http://eprints.um.edu.my/
topic	QA75 Electronic computers. Computer science
spellingShingle	QA75 Electronic computers. Computer science Wu, Y. Shivakumara, P. Wei, W. Lu, T. Pal, U. A new ring radius transform-based thinning method for multi-oriented video characters
description	Thinning that preserves visual topology of characters in video is challenging in the field of document analysis and video text analysis due to low resolution and complex background. This paper proposes to explore ring radius transform (RRT) to generate a radius map from Canny edges of each input image to obtain its medial axis. A radius value contained in the radius map here is the nearest distance to the edge pixels on contours. For the radius map, the method proposes a novel idea for identifying medial axis (middle pixels between two strokes) for arbitrary orientations of the character. Iterative-maximal-growing is then proposed to connect missing medial axis pixels at junctions and intersections. Next, we perform histogram on color information of medial axes with clustering to eliminate false medial axis segments. The method finally restores the shape of the character through radius values of medial axis pixels for the purpose of recognition with the Google Open source OCR (Tesseract). The method has been tested on video, natural scene and handwritten characters from ICDAR 2013, SVT, arbitrary-oriented data from MSRA-TD500, multi-script character data and MPEG7 object data to evaluate its performances at thinning level as well as recognition level. Experimental results comparing with the state-of-the-art methods show that the proposed method is generic and outperforms the existing methods in terms of obtaining skeleton, preserving visual topology and recognition rate. The method is also robust to handle characters of arbitrary orientations.
format	Article
author	Wu, Y. Shivakumara, P. Wei, W. Lu, T. Pal, U.
author_facet	Wu, Y. Shivakumara, P. Wei, W. Lu, T. Pal, U.
author_sort	Wu, Y.
title	A new ring radius transform-based thinning method for multi-oriented video characters
title_short	A new ring radius transform-based thinning method for multi-oriented video characters
title_full	A new ring radius transform-based thinning method for multi-oriented video characters
title_fullStr	A new ring radius transform-based thinning method for multi-oriented video characters
title_full_unstemmed	A new ring radius transform-based thinning method for multi-oriented video characters
title_sort	new ring radius transform-based thinning method for multi-oriented video characters
publisher	Springer Verlag (Germany)
publishDate	2015
url	http://eprints.um.edu.my/19427/ http://dx.doi.org/10.1007/s10032-015-0238-y
_version_	1643690985401614336

A new ring radius transform-based thinning method for multi-oriented video characters

Similar Items