A unified framework for thai metadata extraction using case-based reasoning

Metadata is a very popular word in information technology today because it helps users to differentiate significant documents from non-significant documents. With the growth of the Internet and related tools, there has been a rapid growth of online resources. However, lack of metadata available for...

Full description

Saved in:
Bibliographic Details
Main Authors: Khankasikam K., Chakpitak N.
Format: Conference or Workshop Item
Language:English
Published: 2014
Online Access:http://www.scopus.com/inward/record.url?eid=2-s2.0-62949102193&partnerID=40&md5=02028c179f81b74b83fa9ecbd9012650
http://cmuir.cmu.ac.th/handle/6653943832/953
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
Language: English
Description
Summary:Metadata is a very popular word in information technology today because it helps users to differentiate significant documents from non-significant documents. With the growth of the Internet and related tools, there has been a rapid growth of online resources. However, lack of metadata available for these resources stops their discovery and dissemination over the Internet. The process for manual metadata extraction is time-consuming, costly, and labor-extensive. This paper describes a framework for automatic metadata extraction from electronic Thai documents. The system consists of three main components: a case retrieval module for comparing problem case and stored case using nearest neighbor retrieval technique, a metadata creation module for automatically extracting metadata from electronic Thai documents using Thai information extraction techniques, and a metadata verification module for correcting the errors in extracted metadata. The experimental results show that using the proposed framework could reduce the labor work of Thai metadata creation process. © 2008 IEEE.