Clustering techniques for web mining
In an effort to keep up with the fast growth of World Wide Web, many Web Document Clustering techniques have been designed. These techniques can be used to increase the accuracy and efficiency of the users to find the relevant information they want from the internet. In this dissertation, a Web docu...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/18909 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-18909 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-189092023-07-07T15:48:14Z Clustering techniques for web mining Lu, Jiao. Chen Lihui School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems In an effort to keep up with the fast growth of World Wide Web, many Web Document Clustering techniques have been designed. These techniques can be used to increase the accuracy and efficiency of the users to find the relevant information they want from the internet. In this dissertation, a Web document clustering approach based on a phrase-based document Indexing has been implemented based on three merits. The first is the new document representation called Document index Graph (DIG), which is used to represent the document. The second is a new similarity measure between documents which is based on the matching phrases and their weights. The third concept is theincremental document clustering method. The objective of this dissertation is to design and implement the clustering system based on the concepts above. The implementation details, the experimental results and performance evaluation are reported. Bachelor of Engineering 2009-08-17T05:52:53Z 2009-08-17T05:52:53Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/18909 en Nanyang Technological University 68 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems Lu, Jiao. Clustering techniques for web mining |
description |
In an effort to keep up with the fast growth of World Wide Web, many Web Document Clustering techniques have been designed. These techniques can be used to increase the accuracy and efficiency of the users to find the relevant information they want from the internet. In this dissertation, a Web document clustering approach based on a phrase-based document Indexing has been implemented based on three merits. The first is the new document representation called Document index Graph (DIG), which is used to represent the document. The second is a new similarity measure between documents which is based on the matching phrases and their weights. The third concept is theincremental document clustering method. The objective of this dissertation is to design and implement the clustering system based on the concepts above. The implementation details, the experimental results and performance evaluation are reported. |
author2 |
Chen Lihui |
author_facet |
Chen Lihui Lu, Jiao. |
format |
Final Year Project |
author |
Lu, Jiao. |
author_sort |
Lu, Jiao. |
title |
Clustering techniques for web mining |
title_short |
Clustering techniques for web mining |
title_full |
Clustering techniques for web mining |
title_fullStr |
Clustering techniques for web mining |
title_full_unstemmed |
Clustering techniques for web mining |
title_sort |
clustering techniques for web mining |
publishDate |
2009 |
url |
http://hdl.handle.net/10356/18909 |
_version_ |
1772829007575777280 |