Automatic classification using concept knowledge of web documents
In order to classify web documents, we suggest a method using concept knowledge of category.In our study, the concept relations between keywords are extracted using hyperlink information and after the extracted keywords are classified into each category, these are used as an index.Then TFIDF for eac...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2004
|
Subjects: | |
Online Access: | http://repo.uum.edu.my/13843/1/KM112.pdf http://repo.uum.edu.my/13843/ http://www.kmice.cms.net.my |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Utara Malaysia |
Language: | English |
id |
my.uum.repo.13843 |
---|---|
record_format |
eprints |
spelling |
my.uum.repo.138432015-04-13T08:56:36Z http://repo.uum.edu.my/13843/ Automatic classification using concept knowledge of web documents Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae QA76 Computer software In order to classify web documents, we suggest a method using concept knowledge of category.In our study, the concept relations between keywords are extracted using hyperlink information and after the extracted keywords are classified into each category, these are used as an index.Then TFIDF for each category is extended to determine index weight value.The system is constructed for experimenting and estimating,which is consist of web robot, indexer, concept knowledge database for each category and the document classifier.Our system to be applied the extended TFIDF method shows an accuracy of 88% in automatic classifying of web documents. 2004-02-14 Conference or Workshop Item PeerReviewed application/pdf en http://repo.uum.edu.my/13843/1/KM112.pdf Choi, Sang-Ho and Park, Sa-Joon and Hwang, Su-Cheol and Kim, Ki-Tae (2004) Automatic classification using concept knowledge of web documents. In: Knowledge Management International Conference and Exhibition 2004 (KMICE 2004), 14-15 February 2004, Evergreen Laurel Hotel, Penang. http://www.kmice.cms.net.my |
institution |
Universiti Utara Malaysia |
building |
UUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Utara Malaysia |
content_source |
UUM Institutionali Repository |
url_provider |
http://repo.uum.edu.my/ |
language |
English |
topic |
QA76 Computer software |
spellingShingle |
QA76 Computer software Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae Automatic classification using concept knowledge of web documents |
description |
In order to classify web documents, we suggest a method using concept knowledge of category.In our study, the concept relations between keywords are extracted using hyperlink information and after the extracted keywords are classified into each category, these are used as an index.Then TFIDF for each category is extended to determine index weight value.The system is constructed for experimenting and estimating,which is consist of web robot, indexer, concept knowledge database for each category and the document classifier.Our system to be applied the extended TFIDF method shows an accuracy of 88% in automatic classifying of web documents. |
format |
Conference or Workshop Item |
author |
Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae |
author_facet |
Choi, Sang-Ho Park, Sa-Joon Hwang, Su-Cheol Kim, Ki-Tae |
author_sort |
Choi, Sang-Ho |
title |
Automatic classification using concept knowledge of web documents |
title_short |
Automatic classification using concept knowledge of web documents |
title_full |
Automatic classification using concept knowledge of web documents |
title_fullStr |
Automatic classification using concept knowledge of web documents |
title_full_unstemmed |
Automatic classification using concept knowledge of web documents |
title_sort |
automatic classification using concept knowledge of web documents |
publishDate |
2004 |
url |
http://repo.uum.edu.my/13843/1/KM112.pdf http://repo.uum.edu.my/13843/ http://www.kmice.cms.net.my |
_version_ |
1644281297053417472 |