Video text detection and segmentation for optical character recognition

In this paper, we present approaches to detecting and segmenting text in videos. The proposed video-text-detection technique is capable of adaptively applying appropriate operators for video frames of different modalities by classifying the background complexities. Effective operators such as the re...

Full description

Saved in:
Bibliographic Details
Main Authors: NGO, Chong-wah, CHAN, Chi-Kwong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2005
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6352
https://ink.library.smu.edu.sg/context/sis_research/article/7355/viewcontent/ms05.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7355
record_format dspace
spelling sg-smu-ink.sis_research-73552021-11-23T04:02:48Z Video text detection and segmentation for optical character recognition NGO, Chong-wah CHAN, Chi-Kwong In this paper, we present approaches to detecting and segmenting text in videos. The proposed video-text-detection technique is capable of adaptively applying appropriate operators for video frames of different modalities by classifying the background complexities. Effective operators such as the repeated shifting operations are applied for the noise removal of images with high edge density. Meanwhile, a text-enhancement technique is used to highlight the text regions of low-contrast images. A coarse-to-fine projection technique is then employed to extract text lines from video frames. Experimental results indicate that the proposed text-detection approach is superior to the machine-learning-based (such as SVM and neural network), multiresolution-based, and DCT-based approaches in terms of detection and false-alarm rates. Besides text detection, a technique for text segmentation is also proposed based on adaptive thresholding. A commercial OCR package is then used to recognize the segmented foreground text. A satisfactory character-recognition rate is reported in our experiments. 2005-03-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6352 info:doi/10.1007/s00530-004-0157-0 https://ink.library.smu.edu.sg/context/sis_research/article/7355/viewcontent/ms05.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University video text detection text segmentation text recognition Computer Sciences Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic video text detection
text segmentation
text recognition
Computer Sciences
Graphics and Human Computer Interfaces
spellingShingle video text detection
text segmentation
text recognition
Computer Sciences
Graphics and Human Computer Interfaces
NGO, Chong-wah
CHAN, Chi-Kwong
Video text detection and segmentation for optical character recognition
description In this paper, we present approaches to detecting and segmenting text in videos. The proposed video-text-detection technique is capable of adaptively applying appropriate operators for video frames of different modalities by classifying the background complexities. Effective operators such as the repeated shifting operations are applied for the noise removal of images with high edge density. Meanwhile, a text-enhancement technique is used to highlight the text regions of low-contrast images. A coarse-to-fine projection technique is then employed to extract text lines from video frames. Experimental results indicate that the proposed text-detection approach is superior to the machine-learning-based (such as SVM and neural network), multiresolution-based, and DCT-based approaches in terms of detection and false-alarm rates. Besides text detection, a technique for text segmentation is also proposed based on adaptive thresholding. A commercial OCR package is then used to recognize the segmented foreground text. A satisfactory character-recognition rate is reported in our experiments.
format text
author NGO, Chong-wah
CHAN, Chi-Kwong
author_facet NGO, Chong-wah
CHAN, Chi-Kwong
author_sort NGO, Chong-wah
title Video text detection and segmentation for optical character recognition
title_short Video text detection and segmentation for optical character recognition
title_full Video text detection and segmentation for optical character recognition
title_fullStr Video text detection and segmentation for optical character recognition
title_full_unstemmed Video text detection and segmentation for optical character recognition
title_sort video text detection and segmentation for optical character recognition
publisher Institutional Knowledge at Singapore Management University
publishDate 2005
url https://ink.library.smu.edu.sg/sis_research/6352
https://ink.library.smu.edu.sg/context/sis_research/article/7355/viewcontent/ms05.pdf
_version_ 1770575940073553920