A unified framework for Thai metadata extraction using case-based reasoning

Bibliographic Details
Main Authors: Krisda Khankasikam, Nopasit Chakpitak
Format: Conference Proceeding
Published: 2018
Online Access: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=62949102193&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/60274
Institution: Chiang Mai University
Description
Summary: Metadata is a widely used term in information technology today because it helps users distinguish significant documents from non-significant ones. With the growth of the Internet and related tools, online resources have grown rapidly. However, the lack of metadata available for these resources hinders their discovery and dissemination over the Internet. Manual metadata extraction is time-consuming, costly, and labor-intensive. This paper describes a framework for automatic metadata extraction from electronic Thai documents. The system consists of three main components: a case retrieval module that compares the problem case with stored cases using a nearest-neighbor retrieval technique, a metadata creation module that automatically extracts metadata from electronic Thai documents using Thai information extraction techniques, and a metadata verification module that corrects errors in the extracted metadata. The experimental results show that the proposed framework could reduce the manual labor involved in the Thai metadata creation process. © 2008 IEEE.
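
Illustration: the summary names nearest-neighbor retrieval as the technique used by the case retrieval module but gives no details. The sketch below shows one common form of that idea, assuming each case is described by a numeric feature vector and compared by Euclidean distance. The `Case` class, its fields, the feature values, and the rule-set labels are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Sequence, Tuple


@dataclass
class Case:
    """A stored case (hypothetical structure, not the paper's)."""
    name: str
    features: Tuple[float, ...]  # assumed numeric description of a document
    solution: str                # assumed label of extraction rules to reuse


def distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def retrieve(problem: Sequence[float], case_base: Sequence[Case]) -> Case:
    """Return the stored case whose features are closest to the problem case."""
    if not case_base:
        raise ValueError("case base is empty")
    return min(case_base, key=lambda case: distance(problem, case.features))


# Made-up case base with two document types.
case_base = [
    Case("thesis", (1.0, 0.0, 0.2), "thesis-rules"),
    Case("report", (0.0, 1.0, 0.8), "report-rules"),
]

# A new document whose (made-up) features resemble the "thesis" case.
best = retrieve((0.9, 0.1, 0.3), case_base)
print(best.name, best.solution)
```

In a case-based reasoning cycle, the solution attached to the retrieved case would then be reused for the new document and checked afterwards, which loosely corresponds to the creation and verification modules the summary describes.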