Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine

In this paper, we introduce a two-stage recognition process for classification of 164 classes of mixing of printed Thai and English characters. Various structural features based on image ratios, image projections, outer boundaries, Pyramid Histogram of Oriented Gradients (PHOG) are extracted from im...

Full description

Saved in:

Bibliographic Details
Main Authors:	Chayut Wiwatcharakoses, Karn Patanukhom
Format:	Conference Proceeding
Published:	2018
Online Access:	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84894122646&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/47413
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Chiang Mai University

id	th-cmuir.6653943832-47413
record_format	dspace
spelling	th-cmuir.6653943832-474132018-04-25T08:39:44Z Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine Chayut Wiwatcharakoses Karn Patanukhom In this paper, we introduce a two-stage recognition process for classification of 164 classes of mixing of printed Thai and English characters. Various structural features based on image ratios, image projections, outer boundaries, Pyramid Histogram of Oriented Gradients (PHOG) are extracted from images. In the first stage, Fuzzy C Mean Clustering (FCM) is applied to create prototypes of every character. The class of nearest neighbor prototype is determined and used as the first stage classification output. A hybrid structure of nearest neighbor classifier and Support Vector Machine (SVM) are proposed for the second stage. Based on classification results obtained from the first stage, the suitable classifiers can be selected. For SVM classifier, possible class candidates for each prototype are analyzed from confusion matrices of the first stage result. For nearest neighbor classifier, in order to refine the result, accurate search on a limited set of training samples corresponding to the nearest prototypes obtained in the first stage is performed. According to experiments on data set of more than 500,000 character images with various font styles, sizes, and resolutions, we obtain the accuracy of 88.09% in the first stage and the result is improved to 97.06% in the second stage. The experiments also show improvement of the proposed scheme in comparison with conventional schemes. © 2013 IEEE. 2018-04-25T08:39:44Z 2018-04-25T08:39:44Z 2013-12-01 Conference Proceeding 2-s2.0-84894122646 10.1109/SITIS.2013.23 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84894122646&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/47413
institution	Chiang Mai University
building	Chiang Mai University Library
country	Thailand
collection	CMU Intellectual Repository
description	In this paper, we introduce a two-stage recognition process for classification of 164 classes of mixing of printed Thai and English characters. Various structural features based on image ratios, image projections, outer boundaries, Pyramid Histogram of Oriented Gradients (PHOG) are extracted from images. In the first stage, Fuzzy C Mean Clustering (FCM) is applied to create prototypes of every character. The class of nearest neighbor prototype is determined and used as the first stage classification output. A hybrid structure of nearest neighbor classifier and Support Vector Machine (SVM) are proposed for the second stage. Based on classification results obtained from the first stage, the suitable classifiers can be selected. For SVM classifier, possible class candidates for each prototype are analyzed from confusion matrices of the first stage result. For nearest neighbor classifier, in order to refine the result, accurate search on a limited set of training samples corresponding to the nearest prototypes obtained in the first stage is performed. According to experiments on data set of more than 500,000 character images with various font styles, sizes, and resolutions, we obtain the accuracy of 88.09% in the first stage and the result is improved to 97.06% in the second stage. The experiments also show improvement of the proposed scheme in comparison with conventional schemes. © 2013 IEEE.
format	Conference Proceeding
author	Chayut Wiwatcharakoses Karn Patanukhom
spellingShingle	Chayut Wiwatcharakoses Karn Patanukhom Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine
author_facet	Chayut Wiwatcharakoses Karn Patanukhom
author_sort	Chayut Wiwatcharakoses
title	Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine
title_short	Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine
title_full	Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine
title_fullStr	Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine
title_full_unstemmed	Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine
title_sort	two-stage recognition for printed thai and english characters using nearest neighbor and support vector machine
publishDate	2018
url	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84894122646&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/47413
_version_	1681423056294117376

Two-stage recognition for printed Thai and English characters using nearest neighbor and support vector machine

Similar Items