A knowledge based system for automatic classification of web pages

The paper describes the design and implementation of a new knowledge-based system for the Automatic Information Retrieval DataBase (AIRDB). AIRDB helps the end user cluster and classify web pages on the basis of information filtering combined with an Artificial Neural Network (ANN). The classification de...

Bibliographic Details
Main Author: Fathy, Sherif Kassem
Format: Conference or Workshop Item
Language: English
Published: 2006
Subjects: T Technology (General)
Online Access: http://repo.uum.edu.my/11537/1/564.pdf
http://repo.uum.edu.my/11537/
http://www.kmice.cms.net.my/
Institution: Universiti Utara Malaysia
id my.uum.repo.11537
record_format eprints
spelling my.uum.repo.11537 2014-07-02T08:31:42Z http://repo.uum.edu.my/11537/ A knowledge based system for automatic classification of web pages Fathy, Sherif Kassem T Technology (General) 2006-06-06 Conference or Workshop Item PeerReviewed application/pdf en http://repo.uum.edu.my/11537/1/564.pdf Fathy, Sherif Kassem (2006) A knowledge based system for automatic classification of web pages. In: Knowledge Management International Conference and Exhibition 2006 (KMICE 2006), 6-8 June 2006, The Legend Hotel Kuala Lumpur. http://www.kmice.cms.net.my/
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutional Repository
url_provider http://repo.uum.edu.my/
language English
topic T Technology (General)
description The paper describes the design and implementation of a new knowledge-based system for the Automatic Information Retrieval DataBase (AIRDB). AIRDB helps the end user cluster and classify web pages on the basis of information filtering combined with an Artificial Neural Network (ANN). The classification depends mainly on keyword indexes. A large sample set of 11043 web pages in several formats was collected automatically and randomly from various sources. The AIRDB feature selection algorithm is summarized. Feature selection depends on stemming the words of a web page. Each stem is generated with a local profile, which contains information indicating the weight of the stem with respect to each possibly related class of web pages. A statistical analysis process is illustrated for reducing noise stems. The various components of AIRDB are described. The knowledge-based system is tested with various web pages that disseminate their content in English. The average discrimination performance of AIRDB reaches 84%.
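The abstract does not give the actual AIRDB algorithm, so the following is only a minimal sketch of the general idea it describes: stem the words of each page, keep a per-stem profile of class weights, and discard stems that do not discriminate between classes. Everything here is an assumption made for illustration, including the function names (`toy_stem`, `build_profiles`, `drop_noise`), the crude suffix-stripping stemmer, the weight definition (share of a stem's occurrences per class), and the `min_weight` threshold. The neural network stage is not shown.

```python
import re
from collections import Counter, defaultdict


def toy_stem(word):
    """Crude suffix stripping; a stand-in for a real stemmer (assumption)."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_profiles(labeled_pages):
    """Map each stem to {class: weight}.

    The weight is the share of the stem's occurrences that fall in pages
    of that class (an assumed definition, not taken from the paper).
    """
    counts = defaultdict(Counter)
    for text, label in labeled_pages:
        for word in re.findall(r"[a-z]+", text.lower()):
            counts[toy_stem(word)][label] += 1
    profiles = {}
    for stem, per_class in counts.items():
        total = sum(per_class.values())
        profiles[stem] = {c: n / total for c, n in per_class.items()}
    return profiles


def drop_noise(profiles, min_weight=0.75):
    """Keep only stems whose strongest class weight reaches min_weight."""
    return {s: p for s, p in profiles.items() if max(p.values()) >= min_weight}


# Hypothetical toy corpus: (page text, class label).
pages = [
    ("Football players scored goals", "sports"),
    ("The players trained for the match", "sports"),
    ("Investors traded stocks and bonds", "finance"),
    ("The market traded lower for investors", "finance"),
]
profiles = build_profiles(pages)
selected = drop_noise(profiles)
```

In this toy corpus, stems that appear in both classes, such as "the" and "for", fall below the threshold and are dropped, while class-specific stems such as "player" are kept. A real system would likely use an established stemmer and a statistically grounded noise test in place of the fixed threshold.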
format Conference or Workshop Item
author Fathy, Sherif Kassem
title A knowledge based system for automatic classification of web pages
publishDate 2006
url http://repo.uum.edu.my/11537/1/564.pdf
http://repo.uum.edu.my/11537/
http://www.kmice.cms.net.my/