Cognitive data analysis

As theoretical research into various fields, such as in the computer science field continues to grow, there is a high chance that newer research done would be more interdisciplinary in nature or consist of multiple research fields within the same discipline. It therefore becomes necessary for a rese...

Full description

Saved in:
Bibliographic Details
Main Author: Chew, Jonathan Wei Liang
Other Authors: Tan Ah Hwee
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/139144
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-139144
record_format dspace
spelling sg-ntu-dr.10356-1391442020-05-16T07:27:19Z Cognitive data analysis Chew, Jonathan Wei Liang Tan Ah Hwee School of Computer Science and Engineering asahtan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence As theoretical research into various fields, such as in the computer science field continues to grow, there is a high chance that newer research done would be more interdisciplinary in nature or consist of multiple research fields within the same discipline. It therefore becomes necessary for a researcher to be cognizant of the latest research trends for their research to be relevant. This project on cognitive data analysis, aims to analyze relevant data on computer science research conducted by Singapore Institutions in order to ascertain if there exists a prominent trend or pattern in computer science research in Singapore. The Fusion ART algorithm is used to perform clustering on the obtained data. ART algorithms have been shown to be performant when only a small number of datasets are available for training and is the reason for its adoption within this project. Firstly, a suitable dataset is chosen from a variety of data sources. Next, the collected dataset is preprocessed, through cleaning and transforming it into a form that can be utilized by the Fusion ART. We also evaluate suitable keyword extraction algorithms which would be used to tag a research paper to an appropriate ACM Computer Classification System category. Lastly, we experiment with the Fusion ART by varying its input parameters and feed the preprocessed data to the Fusion ART and examine the clusters formed and its relative weights in order to observe any trends within the dataset. Future work could be done to identify a better keyword extraction algorithm for tagging a research paper, as well as look into global trends in computer science research to identify computer science research trends on a global scale. Bachelor of Engineering (Computer Science) 2020-05-16T07:27:18Z 2020-05-16T07:27:18Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/139144 en SCSE19-0549 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Chew, Jonathan Wei Liang
Cognitive data analysis
description As theoretical research into various fields, such as in the computer science field continues to grow, there is a high chance that newer research done would be more interdisciplinary in nature or consist of multiple research fields within the same discipline. It therefore becomes necessary for a researcher to be cognizant of the latest research trends for their research to be relevant. This project on cognitive data analysis, aims to analyze relevant data on computer science research conducted by Singapore Institutions in order to ascertain if there exists a prominent trend or pattern in computer science research in Singapore. The Fusion ART algorithm is used to perform clustering on the obtained data. ART algorithms have been shown to be performant when only a small number of datasets are available for training and is the reason for its adoption within this project. Firstly, a suitable dataset is chosen from a variety of data sources. Next, the collected dataset is preprocessed, through cleaning and transforming it into a form that can be utilized by the Fusion ART. We also evaluate suitable keyword extraction algorithms which would be used to tag a research paper to an appropriate ACM Computer Classification System category. Lastly, we experiment with the Fusion ART by varying its input parameters and feed the preprocessed data to the Fusion ART and examine the clusters formed and its relative weights in order to observe any trends within the dataset. Future work could be done to identify a better keyword extraction algorithm for tagging a research paper, as well as look into global trends in computer science research to identify computer science research trends on a global scale.
author2 Tan Ah Hwee
author_facet Tan Ah Hwee
Chew, Jonathan Wei Liang
format Final Year Project
author Chew, Jonathan Wei Liang
author_sort Chew, Jonathan Wei Liang
title Cognitive data analysis
title_short Cognitive data analysis
title_full Cognitive data analysis
title_fullStr Cognitive data analysis
title_full_unstemmed Cognitive data analysis
title_sort cognitive data analysis
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/139144
_version_ 1681057763601416192