Tools for analysis of large-scale networks (I) algorithms, analytics and visualization
Rapid development in social network Internet applications has led to an unprecedented increase in the size and quality of datasets. Developing a tool for analysing large-scale networks can make our lives much easier and contribute to data analytics. This project aimed to continue...
Main Author: | Zhang, Xinyi |
---|---|
Other Authors: | Cong Gao |
Format: | Final Year Project |
Language: | English |
Published: | 2017 |
Subjects: | DRNTU::Engineering::Computer science and engineering |
Online Access: | http://hdl.handle.net/10356/72828 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-72828 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-72828 2023-03-03T20:48:29Z Tools for analysis of large-scale networks (I) algorithms, analytics and visualization Zhang, Xinyi Cong Gao School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering Rapid development in social network Internet applications has led to an unprecedented increase in the size and quality of datasets. Developing a tool for analysing large-scale networks can make our lives much easier and contribute to data analytics. This project aimed to continue work on the large-scale analysis tool developed by a previous student, Chua Chee Ann. In the previous tool, many basic analytical functions, such as data search, topic modelling on the retrieved data, and a graphical user interface, had been implemented successfully. Among all the social media sites, Twitter was chosen, and 16.5 GB of raw tweets was used as the dataset in this project. In this project, two approaches were taken to improve the overall performance of the tool. Firstly, an existing data structure, the grid file, was analysed and implemented on the dataset, which proved to effectively improve the query time of different types of queries. Secondly, multiprocessing was implemented to improve the data processing time, specifically for the topic modelling function. Bachelor of Engineering (Computer Science) 2017-11-23T09:58:35Z 2017-11-23T09:58:35Z 2017 Final Year Project (FYP) http://hdl.handle.net/10356/72828 en Nanyang Technological University 44 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering |
description |
Rapid development in social network Internet applications has led to an unprecedented increase in the size and quality of datasets. Developing a tool for analysing large-scale networks can make our lives much easier and contribute to data analytics.
This project aimed to continue work on the large-scale analysis tool developed by a previous student, Chua Chee Ann. In the previous tool, many basic analytical functions, such as data search, topic modelling on the retrieved data, and a graphical user interface, had been implemented successfully. Among all the social media sites, Twitter was chosen, and 16.5 GB of raw tweets was used as the dataset in this project.
In this project, two approaches were taken to improve the overall performance of the tool. Firstly, an existing data structure, the grid file, was analysed and implemented on the dataset, which proved to effectively improve the query time of different types of queries. Secondly, multiprocessing was implemented to improve the data processing time, specifically for the topic modelling function. |
author2 |
Cong Gao |
author_facet |
Cong Gao Zhang, Xinyi |
format |
Final Year Project |
author |
Zhang, Xinyi |
title |
Tools for analysis of large-scale networks (I) algorithms, analytics and visualization |
title_sort |
tools for analysis of large-scale networks (i) algorithms, analytics and visualization |
publishDate |
2017 |
url |
http://hdl.handle.net/10356/72828 |
_version_ |
1759858220036259840 |