Toolkits development for high dimensional data analysis

In an effort to keep up with the fast growth of World Wide Web, data analysis has become a widely used and necessary aspect of the web usage. Many web document data analysis toolkits have been developed. These toolkits can be used to increase the accuracy and efficiency for the users to find the rel...

Full description

Saved in:
Bibliographic Details
Main Author: Lin, Si Jie.
Other Authors: Chen Lihui
Format: Final Year Project
Language:English
Published: 2010
Subjects:
Online Access:http://hdl.handle.net/10356/40890
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-40890
record_format dspace
spelling sg-ntu-dr.10356-408902023-07-07T16:51:26Z Toolkits development for high dimensional data analysis Lin, Si Jie. Chen Lihui School of Electrical and Electronic Engineering DRNTU::Engineering In an effort to keep up with the fast growth of World Wide Web, data analysis has become a widely used and necessary aspect of the web usage. Many web document data analysis toolkits have been developed. These toolkits can be used to increase the accuracy and efficiency for the users to find the relevant information they want from the internet. This report mainly consists of four parts that corresponds to four high dimensional data analysis toolkits designed and developed for various purposes. In the first part, data analysis toolkits with different document representation models and clustering methods are developed. In the second part, some evaluation toolkits are developed. In the third part, the data extraction toolkits based on the MEAD system are developed. Additionally, adding additional functions into an existing system called iSEARCH, a search system with returned results in a clustered way. In this report, the design and implement of each part based on the requirements will be explained. The performance of each system is evaluated by the standard evaluation metrics. The report concludes with the objective achieved along with some recommendations for future development. Bachelor of Engineering 2010-06-23T07:24:05Z 2010-06-23T07:24:05Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40890 en Nanyang Technological University 90 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Lin, Si Jie.
Toolkits development for high dimensional data analysis
description In an effort to keep up with the fast growth of World Wide Web, data analysis has become a widely used and necessary aspect of the web usage. Many web document data analysis toolkits have been developed. These toolkits can be used to increase the accuracy and efficiency for the users to find the relevant information they want from the internet. This report mainly consists of four parts that corresponds to four high dimensional data analysis toolkits designed and developed for various purposes. In the first part, data analysis toolkits with different document representation models and clustering methods are developed. In the second part, some evaluation toolkits are developed. In the third part, the data extraction toolkits based on the MEAD system are developed. Additionally, adding additional functions into an existing system called iSEARCH, a search system with returned results in a clustered way. In this report, the design and implement of each part based on the requirements will be explained. The performance of each system is evaluated by the standard evaluation metrics. The report concludes with the objective achieved along with some recommendations for future development.
author2 Chen Lihui
author_facet Chen Lihui
Lin, Si Jie.
format Final Year Project
author Lin, Si Jie.
author_sort Lin, Si Jie.
title Toolkits development for high dimensional data analysis
title_short Toolkits development for high dimensional data analysis
title_full Toolkits development for high dimensional data analysis
title_fullStr Toolkits development for high dimensional data analysis
title_full_unstemmed Toolkits development for high dimensional data analysis
title_sort toolkits development for high dimensional data analysis
publishDate 2010
url http://hdl.handle.net/10356/40890
_version_ 1772826655554797568