A new ring radius transform-based thinning method for multi-oriented video characters

Thinning that preserves visual topology of characters in video is challenging in the field of document analysis and video text analysis due to low resolution and complex background. This paper proposes to explore ring radius transform (RRT) to generate a radius map from Canny edges of each input ima...

Full description

Saved in:
Bibliographic Details
Main Authors: Wu, Y., Shivakumara, P., Wei, W., Lu, T., Pal, U.
Format: Article
Published: Springer Verlag (Germany) 2015
Subjects:
Online Access:http://eprints.um.edu.my/19427/
http://dx.doi.org/10.1007/s10032-015-0238-y
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaya
id my.um.eprints.19427
record_format eprints
spelling my.um.eprints.194272018-09-26T04:44:36Z http://eprints.um.edu.my/19427/ A new ring radius transform-based thinning method for multi-oriented video characters Wu, Y. Shivakumara, P. Wei, W. Lu, T. Pal, U. QA75 Electronic computers. Computer science Thinning that preserves visual topology of characters in video is challenging in the field of document analysis and video text analysis due to low resolution and complex background. This paper proposes to explore ring radius transform (RRT) to generate a radius map from Canny edges of each input image to obtain its medial axis. A radius value contained in the radius map here is the nearest distance to the edge pixels on contours. For the radius map, the method proposes a novel idea for identifying medial axis (middle pixels between two strokes) for arbitrary orientations of the character. Iterative-maximal-growing is then proposed to connect missing medial axis pixels at junctions and intersections. Next, we perform histogram on color information of medial axes with clustering to eliminate false medial axis segments. The method finally restores the shape of the character through radius values of medial axis pixels for the purpose of recognition with the Google Open source OCR (Tesseract). The method has been tested on video, natural scene and handwritten characters from ICDAR 2013, SVT, arbitrary-oriented data from MSRA-TD500, multi-script character data and MPEG7 object data to evaluate its performances at thinning level as well as recognition level. Experimental results comparing with the state-of-the-art methods show that the proposed method is generic and outperforms the existing methods in terms of obtaining skeleton, preserving visual topology and recognition rate. The method is also robust to handle characters of arbitrary orientations. Springer Verlag (Germany) 2015 Article PeerReviewed Wu, Y. and Shivakumara, P. and Wei, W. and Lu, T. and Pal, U. (2015) A new ring radius transform-based thinning method for multi-oriented video characters. International Journal on Document Analysis and Recognition (IJDAR), 18 (2). pp. 137-151. ISSN 1433-2833 http://dx.doi.org/10.1007/s10032-015-0238-y doi:10.1007/s10032-015-0238-y
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Wu, Y.
Shivakumara, P.
Wei, W.
Lu, T.
Pal, U.
A new ring radius transform-based thinning method for multi-oriented video characters
description Thinning that preserves visual topology of characters in video is challenging in the field of document analysis and video text analysis due to low resolution and complex background. This paper proposes to explore ring radius transform (RRT) to generate a radius map from Canny edges of each input image to obtain its medial axis. A radius value contained in the radius map here is the nearest distance to the edge pixels on contours. For the radius map, the method proposes a novel idea for identifying medial axis (middle pixels between two strokes) for arbitrary orientations of the character. Iterative-maximal-growing is then proposed to connect missing medial axis pixels at junctions and intersections. Next, we perform histogram on color information of medial axes with clustering to eliminate false medial axis segments. The method finally restores the shape of the character through radius values of medial axis pixels for the purpose of recognition with the Google Open source OCR (Tesseract). The method has been tested on video, natural scene and handwritten characters from ICDAR 2013, SVT, arbitrary-oriented data from MSRA-TD500, multi-script character data and MPEG7 object data to evaluate its performances at thinning level as well as recognition level. Experimental results comparing with the state-of-the-art methods show that the proposed method is generic and outperforms the existing methods in terms of obtaining skeleton, preserving visual topology and recognition rate. The method is also robust to handle characters of arbitrary orientations.
format Article
author Wu, Y.
Shivakumara, P.
Wei, W.
Lu, T.
Pal, U.
author_facet Wu, Y.
Shivakumara, P.
Wei, W.
Lu, T.
Pal, U.
author_sort Wu, Y.
title A new ring radius transform-based thinning method for multi-oriented video characters
title_short A new ring radius transform-based thinning method for multi-oriented video characters
title_full A new ring radius transform-based thinning method for multi-oriented video characters
title_fullStr A new ring radius transform-based thinning method for multi-oriented video characters
title_full_unstemmed A new ring radius transform-based thinning method for multi-oriented video characters
title_sort new ring radius transform-based thinning method for multi-oriented video characters
publisher Springer Verlag (Germany)
publishDate 2015
url http://eprints.um.edu.my/19427/
http://dx.doi.org/10.1007/s10032-015-0238-y
_version_ 1643690985401614336