Bottom-up scene text detection with Markov clustering networks
A novel detection framework named Markov Clustering Network (MCN) is proposed for fast and robust scene text detection. Different from the traditional top-down scene text detection approaches that inherit from the classic object detection, MCN detects scene text objects in a bottom-up manner. MCN pr...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/161604 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-161604 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1616042022-09-13T08:45:53Z Bottom-up scene text detection with Markov clustering networks Liu, Zichuan Lin, Guosheng Goh, Wang Ling School of Computer Science and Engineering School of Electrical and Electronic Engineering Engineering::Computer science and engineering Neural Networks Markov Clustering A novel detection framework named Markov Clustering Network (MCN) is proposed for fast and robust scene text detection. Different from the traditional top-down scene text detection approaches that inherit from the classic object detection, MCN detects scene text objects in a bottom-up manner. MCN predicts instance-level bounding boxes by firstly converting an image into a stochastic flow graph where Markov Clustering is performed based on the predicted stochastic flows. The stochastic flows encode the local correlation and semantic information of scene text objects. An object is modeled as strongly connected nodes by flows, which allows flexible and bottom-up detection for scale-varying and rotated text objects without prior knowledge of object size. The flow prediction is supported by the advanced Convolutional Neural Networks architectures and Position-aware spatial attention mechanism, which provides enhanced flow prediction by adaptively fusing spatial representations. The experimental evaluation on public benchmarks shows that our MCN method achieves the state-of-art performance on public benchmarks, especially in retrieving long and oriented texts. Ministry of Education (MOE) National Research Foundation (NRF) This research is supported by the National Research Foundation Singapore under its AI Singapore Programme (Award Number: AISG-RP-2018-003) and the MOE Tier-1 research Grants: RG126/17 (S) and RG28/18 (S). 2022-09-09T07:55:06Z 2022-09-09T07:55:06Z 2020 Journal Article Liu, Z., Lin, G. & Goh, W. L. (2020). Bottom-up scene text detection with Markov clustering networks. International Journal of Computer Vision, 128(6), 1786-1809. https://dx.doi.org/10.1007/s11263-020-01298-y 0920-5691 https://hdl.handle.net/10356/161604 10.1007/s11263-020-01298-y 2-s2.0-85079452321 6 128 1786 1809 en AISG-RP-2018-003 RG126/17 (S) RG28/18 (S) International Journal of Computer Vision © 2020 Springer Science+Business Media, LLC, part of Springer Nature. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Neural Networks Markov Clustering |
spellingShingle |
Engineering::Computer science and engineering Neural Networks Markov Clustering Liu, Zichuan Lin, Guosheng Goh, Wang Ling Bottom-up scene text detection with Markov clustering networks |
description |
A novel detection framework named Markov Clustering Network (MCN) is proposed for fast and robust scene text detection. Different from the traditional top-down scene text detection approaches that inherit from the classic object detection, MCN detects scene text objects in a bottom-up manner. MCN predicts instance-level bounding boxes by firstly converting an image into a stochastic flow graph where Markov Clustering is performed based on the predicted stochastic flows. The stochastic flows encode the local correlation and semantic information of scene text objects. An object is modeled as strongly connected nodes by flows, which allows flexible and bottom-up detection for scale-varying and rotated text objects without prior knowledge of object size. The flow prediction is supported by the advanced Convolutional Neural Networks architectures and Position-aware spatial attention mechanism, which provides enhanced flow prediction by adaptively fusing spatial representations. The experimental evaluation on public benchmarks shows that our MCN method achieves the state-of-art performance on public benchmarks, especially in retrieving long and oriented texts. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Liu, Zichuan Lin, Guosheng Goh, Wang Ling |
format |
Article |
author |
Liu, Zichuan Lin, Guosheng Goh, Wang Ling |
author_sort |
Liu, Zichuan |
title |
Bottom-up scene text detection with Markov clustering networks |
title_short |
Bottom-up scene text detection with Markov clustering networks |
title_full |
Bottom-up scene text detection with Markov clustering networks |
title_fullStr |
Bottom-up scene text detection with Markov clustering networks |
title_full_unstemmed |
Bottom-up scene text detection with Markov clustering networks |
title_sort |
bottom-up scene text detection with markov clustering networks |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/161604 |
_version_ |
1744365392861593600 |