Clustering techniques for web mining

In an effort to keep up with the fast growth of World Wide Web, many Web Document Clustering techniques have been designed. These techniques can be used to increase the accuracy and efficiency of the users to find the relevant information they want from the internet. In this dissertation, a Web docu...

Full description

Saved in:
Bibliographic Details
Main Author: Lu, Jiao.
Other Authors: Chen Lihui
Format: Final Year Project
Language:English
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10356/18909
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-18909
record_format dspace
spelling sg-ntu-dr.10356-189092023-07-07T15:48:14Z Clustering techniques for web mining Lu, Jiao. Chen Lihui School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems In an effort to keep up with the fast growth of World Wide Web, many Web Document Clustering techniques have been designed. These techniques can be used to increase the accuracy and efficiency of the users to find the relevant information they want from the internet. In this dissertation, a Web document clustering approach based on a phrase-based document Indexing has been implemented based on three merits. The first is the new document representation called Document index Graph (DIG), which is used to represent the document. The second is a new similarity measure between documents which is based on the matching phrases and their weights. The third concept is theincremental document clustering method. The objective of this dissertation is to design and implement the clustering system based on the concepts above. The implementation details, the experimental results and performance evaluation are reported. Bachelor of Engineering 2009-08-17T05:52:53Z 2009-08-17T05:52:53Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/18909 en Nanyang Technological University 68 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Lu, Jiao.
Clustering techniques for web mining
description In an effort to keep up with the fast growth of World Wide Web, many Web Document Clustering techniques have been designed. These techniques can be used to increase the accuracy and efficiency of the users to find the relevant information they want from the internet. In this dissertation, a Web document clustering approach based on a phrase-based document Indexing has been implemented based on three merits. The first is the new document representation called Document index Graph (DIG), which is used to represent the document. The second is a new similarity measure between documents which is based on the matching phrases and their weights. The third concept is theincremental document clustering method. The objective of this dissertation is to design and implement the clustering system based on the concepts above. The implementation details, the experimental results and performance evaluation are reported.
author2 Chen Lihui
author_facet Chen Lihui
Lu, Jiao.
format Final Year Project
author Lu, Jiao.
author_sort Lu, Jiao.
title Clustering techniques for web mining
title_short Clustering techniques for web mining
title_full Clustering techniques for web mining
title_fullStr Clustering techniques for web mining
title_full_unstemmed Clustering techniques for web mining
title_sort clustering techniques for web mining
publishDate 2009
url http://hdl.handle.net/10356/18909
_version_ 1772829007575777280