Tools for analysis of large-scale networks (I) algorithms, analytics and visualization

Rapid development of social network Internet applications has led to an unprecedented increase in the size and quality of datasets. Developing a tool for analysing large-scale networks can make our lives much easier and contribute to data analytics. This project aimed to continue...

Bibliographic Details
Main Author: Zhang, Xinyi
Other Authors: Cong Gao
Format: Final Year Project
Language: English
Published: 2017
Subjects: DRNTU::Engineering::Computer science and engineering
Online Access: http://hdl.handle.net/10356/72828
Institution: Nanyang Technological University
id sg-ntu-dr.10356-72828
record_format dspace
spelling sg-ntu-dr.10356-72828 2023-03-03T20:48:29Z Tools for analysis of large-scale networks (I) algorithms, analytics and visualization Zhang, Xinyi Cong Gao School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering Rapid development of social network Internet applications has led to an unprecedented increase in the size and quality of datasets. Developing a tool for analysing large-scale networks can make our lives much easier and contribute to data analytics. This project aimed to continue work on the large-scale analysis tool developed by a previous student, Chua Chee Ann. In the previous tool, many basic analytical functions, such as data search, topic modelling on the retrieved data and a graphical user interface, had been implemented successfully. Among all the social media sites, Twitter was chosen, and 16.5 GB of raw tweets were used as the dataset in this project. Two approaches were taken to improve the overall performance of the tool. Firstly, an existing data structure, the grid file, was analysed and implemented on the dataset, which proved to effectively improve the query time of different types of queries. Secondly, multiprocessing was implemented to improve the data processing time, specifically for the Topic Modelling function. Bachelor of Engineering (Computer Science) 2017-11-23T09:58:35Z 2017-11-23T09:58:35Z 2017 Final Year Project (FYP) http://hdl.handle.net/10356/72828 en Nanyang Technological University 44 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Zhang, Xinyi
Tools for analysis of large-scale networks (I) algorithms, analytics and visualization
description Rapid development of social network Internet applications has led to an unprecedented increase in the size and quality of datasets. Developing a tool for analysing large-scale networks can make our lives much easier and contribute to data analytics. This project aimed to continue work on the large-scale analysis tool developed by a previous student, Chua Chee Ann. In the previous tool, many basic analytical functions, such as data search, topic modelling on the retrieved data and a graphical user interface, had been implemented successfully. Among all the social media sites, Twitter was chosen, and 16.5 GB of raw tweets were used as the dataset in this project. Two approaches were taken to improve the overall performance of the tool. Firstly, an existing data structure, the grid file, was analysed and implemented on the dataset, which proved to effectively improve the query time of different types of queries. Secondly, multiprocessing was implemented to improve the data processing time, specifically for the Topic Modelling function.
author2 Cong Gao
author_facet Cong Gao
Zhang, Xinyi
format Final Year Project
author Zhang, Xinyi
author_sort Zhang, Xinyi
title Tools for analysis of large-scale networks (I) algorithms, analytics and visualization
title_sort tools for analysis of large-scale networks (i) algorithms, analytics and visualization
publishDate 2017
url http://hdl.handle.net/10356/72828
_version_ 1759858220036259840