Toolkits development for high dimensional data analysis
To keep pace with the rapid growth of the World Wide Web, data analysis has become a widely used and necessary part of web usage. Many toolkits for analysing web documents have been developed. These toolkits can be used to increase the accuracy and efficiency with which users find the rel...
Main Author: | Lin, Si Jie. |
---|---|
Other Authors: | Chen Lihui |
Format: | Final Year Project |
Language: | English |
Published: | 2010 |
Subjects: | DRNTU::Engineering |
Online Access: | http://hdl.handle.net/10356/40890 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-40890 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-408902023-07-07T16:51:26Z Toolkits development for high dimensional data analysis Lin, Si Jie. Chen Lihui School of Electrical and Electronic Engineering DRNTU::Engineering To keep pace with the rapid growth of the World Wide Web, data analysis has become a widely used and necessary part of web usage. Many toolkits for analysing web documents have been developed. These toolkits help users find relevant information on the internet more accurately and efficiently. This report consists of four main parts, each corresponding to a high-dimensional data analysis toolkit designed and developed for a particular purpose. In the first part, data analysis toolkits with different document representation models and clustering methods are developed. In the second part, evaluation toolkits are developed. In the third part, data extraction toolkits based on the MEAD system are developed. Finally, additional functions are added to iSEARCH, an existing search system that returns its results in clustered form. The report explains the design and implementation of each part with respect to its requirements. The performance of each system is evaluated using standard evaluation metrics. The report concludes with the objectives achieved and some recommendations for future development. Bachelor of Engineering 2010-06-23T07:24:05Z 2010-06-23T07:24:05Z 2010 2010 Final Year Project (FYP) http://hdl.handle.net/10356/40890 en Nanyang Technological University 90 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering |
spellingShingle |
DRNTU::Engineering Lin, Si Jie. Toolkits development for high dimensional data analysis |
description |
To keep pace with the rapid growth of the World Wide Web, data analysis has become a widely used and necessary part of web usage. Many toolkits for analysing web documents have been developed. These toolkits help users find relevant information on the internet more accurately and efficiently.
This report consists of four main parts, each corresponding to a high-dimensional data analysis toolkit designed and developed for a particular purpose. In the first part, data analysis toolkits with different document representation models and clustering methods are developed. In the second part, evaluation toolkits are developed. In the third part, data extraction toolkits based on the MEAD system are developed. Finally, additional functions are added to iSEARCH, an existing search system that returns its results in clustered form.
The report explains the design and implementation of each part with respect to its requirements. The performance of each system is evaluated using standard evaluation metrics. The report concludes with the objectives achieved and some recommendations for future development. |
author2 |
Chen Lihui |
author_facet |
Chen Lihui Lin, Si Jie. |
format |
Final Year Project |
author |
Lin, Si Jie. |
author_sort |
Lin, Si Jie. |
title |
Toolkits development for high dimensional data analysis |
title_short |
Toolkits development for high dimensional data analysis |
title_full |
Toolkits development for high dimensional data analysis |
title_fullStr |
Toolkits development for high dimensional data analysis |
title_full_unstemmed |
Toolkits development for high dimensional data analysis |
title_sort |
toolkits development for high dimensional data analysis |
publishDate |
2010 |
url |
http://hdl.handle.net/10356/40890 |
_version_ |
1772826655554797568 |