Document image segmentation and classification
a fast speed and robust document image segmentation and classification algorithm based on bottom-up strategy is proposed. Several techniques are used to overcome the slow speed limitation and large memory space requirement of the traditional bottom-up strategy. In line segment extraction, byte-based...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/19756 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-19756 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-197562023-07-04T15:54:49Z Document image segmentation and classification Chang, Kim Wah. Shao, Lejun School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing a fast speed and robust document image segmentation and classification algorithm based on bottom-up strategy is proposed. Several techniques are used to overcome the slow speed limitation and large memory space requirement of the traditional bottom-up strategy. In line segment extraction, byte-based operation is used instead of bit-based operation, precomputed tables are used where the data byte of the document image is used as an index into the table, and the attributes of line segment(s) contained in the data byte are returned, state machine is used in conjunction with the look-up tables to form linked lists of line segments. In connected component forming process, line segments formed in two consecutive scan lines will be merged into connected components immediately. This greatly reduced the memory space requirement. In classification stage, attributes extracted out from the data byte in the segmentation process are used. This makes the classification an easy task. Master of Engineering 2009-12-14T06:34:09Z 2009-12-14T06:34:09Z 1994 1994 Thesis http://hdl.handle.net/10356/19756 en NANYANG TECHNOLOGICAL UNIVERSITY 138 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Chang, Kim Wah. Document image segmentation and classification |
description |
a fast speed and robust document image segmentation and classification algorithm based on bottom-up strategy is proposed. Several techniques are used to overcome the slow speed limitation and large memory space requirement of the traditional bottom-up strategy. In line segment extraction, byte-based operation is used instead of bit-based operation, precomputed tables are used where the data byte of the document image is used as an index into the table, and the attributes of line segment(s) contained in the data byte are returned, state machine is used in conjunction with the look-up tables to form linked lists of line segments. In connected component forming process, line segments formed in two consecutive scan lines will be merged into connected components immediately. This greatly reduced the memory space requirement. In classification stage, attributes extracted out from the data byte in the segmentation process are used. This makes the classification an easy task. |
author2 |
Shao, Lejun |
author_facet |
Shao, Lejun Chang, Kim Wah. |
format |
Theses and Dissertations |
author |
Chang, Kim Wah. |
author_sort |
Chang, Kim Wah. |
title |
Document image segmentation and classification |
title_short |
Document image segmentation and classification |
title_full |
Document image segmentation and classification |
title_fullStr |
Document image segmentation and classification |
title_full_unstemmed |
Document image segmentation and classification |
title_sort |
document image segmentation and classification |
publishDate |
2009 |
url |
http://hdl.handle.net/10356/19756 |
_version_ |
1772825631984189440 |