Offline text-independent Chinese writer identification method with two-tier image retrieval
Writer identification is essential today to identify the authenticity of a document in forensic expert decision-making. However, handwriting in various languages specifically Chinese poses a different challenge in identifying the writer. The main challenge faced by current researchers is that they f...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/96211/1/TanGloriaJennisPSC2019.pdf.pdf http://eprints.utm.my/id/eprint/96211/ http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:142139 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
Language: | English |
id |
my.utm.96211 |
---|---|
record_format |
eprints |
spelling |
my.utm.962112022-07-05T03:07:20Z http://eprints.utm.my/id/eprint/96211/ Offline text-independent Chinese writer identification method with two-tier image retrieval Tan, Gloria Jennis QA75 Electronic computers. Computer science Writer identification is essential today to identify the authenticity of a document in forensic expert decision-making. However, handwriting in various languages specifically Chinese poses a different challenge in identifying the writer. The main challenge faced by current researchers is that they fail to adopt traditional methods over an offline text independent Chinese writer identification scheme due to the complexity of Chinese writing structure and style. Furthermore, the previous method relies heavily on the selection of window size, which causes an ambiguity and leads to inconsistent results if the previous method is applied on a large image repository while finding the best-matched document from the database. Thus, much uncertainty still exists about the insurmountable searching space and the method has failed to show the effectiveness in searching relevant documents from a large image repository. This research attempted to solve problems by developing a new identification scheme for offline text-independent Chinese writer identification with the enhancement of feature extraction method and two-tier image retrieval mechanism to reduce search space and increase identification rates. The technique involved three essential steps. Firstly, the first-tier phase used Slantlet Transform based Local Binary Pattern (SLT-LBP) to bring out fine details. Then, sixty matching handwriting images were short-listed for the second-tier phase using Hierarchical Centroid (HC) of image pixels method for feature extraction. Finally, thirty shortlisted images were used as the input in the identification phase using Gray-Level Difference Method (GLDM) features. Experiment results had remarkably improved as compared to the previous method and the increase was from 95.4% to 96.68% in terms of identification rate as reported in the HIT-MW dataset. The contribution of this study is that it highlights the importance of using a two-tier retrieval mechanism to reduce search space in a large database in order to achieve higher accuracy. Besides, the development of a size-independent writer identification mechanism is a novelty as it can corroborate real-world application. 2019 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/96211/1/TanGloriaJennisPSC2019.pdf.pdf Tan, Gloria Jennis (2019) Offline text-independent Chinese writer identification method with two-tier image retrieval. PhD thesis, Universiti Teknologi Malaysia. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:142139 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Tan, Gloria Jennis Offline text-independent Chinese writer identification method with two-tier image retrieval |
description |
Writer identification is essential today to identify the authenticity of a document in forensic expert decision-making. However, handwriting in various languages specifically Chinese poses a different challenge in identifying the writer. The main challenge faced by current researchers is that they fail to adopt traditional methods over an offline text independent Chinese writer identification scheme due to the complexity of Chinese writing structure and style. Furthermore, the previous method relies heavily on the selection of window size, which causes an ambiguity and leads to inconsistent results if the previous method is applied on a large image repository while finding the best-matched document from the database. Thus, much uncertainty still exists about the insurmountable searching space and the method has failed to show the effectiveness in searching relevant documents from a large image repository. This research attempted to solve problems by developing a new identification scheme for offline text-independent Chinese writer identification with the enhancement of feature extraction method and two-tier image retrieval mechanism to reduce search space and increase identification rates. The technique involved three essential steps. Firstly, the first-tier phase used Slantlet Transform based Local Binary Pattern (SLT-LBP) to bring out fine details. Then, sixty matching handwriting images were short-listed for the second-tier phase using Hierarchical Centroid (HC) of image pixels method for feature extraction. Finally, thirty shortlisted images were used as the input in the identification phase using Gray-Level Difference Method (GLDM) features. Experiment results had remarkably improved as compared to the previous method and the increase was from 95.4% to 96.68% in terms of identification rate as reported in the HIT-MW dataset. The contribution of this study is that it highlights the importance of using a two-tier retrieval mechanism to reduce search space in a large database in order to achieve higher accuracy. Besides, the development of a size-independent writer identification mechanism is a novelty as it can corroborate real-world application. |
format |
Thesis |
author |
Tan, Gloria Jennis |
author_facet |
Tan, Gloria Jennis |
author_sort |
Tan, Gloria Jennis |
title |
Offline text-independent Chinese writer identification method with two-tier image retrieval |
title_short |
Offline text-independent Chinese writer identification method with two-tier image retrieval |
title_full |
Offline text-independent Chinese writer identification method with two-tier image retrieval |
title_fullStr |
Offline text-independent Chinese writer identification method with two-tier image retrieval |
title_full_unstemmed |
Offline text-independent Chinese writer identification method with two-tier image retrieval |
title_sort |
offline text-independent chinese writer identification method with two-tier image retrieval |
publishDate |
2019 |
url |
http://eprints.utm.my/id/eprint/96211/1/TanGloriaJennisPSC2019.pdf.pdf http://eprints.utm.my/id/eprint/96211/ http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:142139 |
_version_ |
1738510338003828736 |