Extracting similar technology comparisons from crowd discussion on Stack Overflow

Developers today can choose from many technologies when deciding which ones to adopt for their software projects. Technologies in the same category can provide similar functionality yet excel in different features when compared with each other. When making...

Bibliographic Details
Main Author: Lin, Tian
Other Authors: Jiang Xudong
Format: Final Year Project
Language: English
Published: 2018
Subjects: DRNTU::Engineering::Computer science and engineering
Online Access: http://hdl.handle.net/10356/75383
Institution: Nanyang Technological University
id sg-ntu-dr.10356-75383
record_format dspace
spelling sg-ntu-dr.10356-75383 2023-07-07T17:15:59Z Extracting similar technology comparisons from crowd discussion on stack overflow Lin, Tian Jiang Xudong Liu Yang School of Electrical and Electronic Engineering DRNTU::Engineering::Computer science and engineering Bachelor of Engineering 2018-05-31T02:59:43Z 2018-05-31T02:59:43Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/75383 en Nanyang Technological University 61 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
description Developers today can choose from many technologies when deciding which ones to adopt for their software projects. Technologies in the same category can provide similar functionality yet excel in different features when compared with each other. When choosing technologies to adopt, developers tend to turn to online resources such as assessment platforms or community reviews to understand the technology landscape. Reviews on these platforms are mostly opinion-based, and the information is sometimes out of date. In addition, reviews and comparisons are scattered across many sites, making it hard to get a centralized view of the aggregated opinions of the crowd. In this report, we will exploit the fact that posts on Stack Overflow are tagged by users with the most closely related technologies to increase the precision of search results. Generated by users when they post content online, tags classify content and categorize it into groups for better organization. A program based on the Word2Vec model is developed to learn the relations among these tags, identify similar tags that fall into the same comparable categories, and perform comparative opinion mining to locate sentences containing pairs of similar tags. Finally, a website (https://similartagsheroku.herokuapp.com) is developed so that the community can access, reference and evaluate our findings on similar technologies and the comparative opinions on particular technologies at any time. Based on the ideas above, this project will first gather, clean and organize tag data from Stack Overflow. Next, the data will be fed into the Word2Vec model to reconstruct the relationships between tags, so that similar tags can be filtered by comparable category and grouped by proximity.
Lastly, comparative opinion mining will be conducted on similar technology pairs to aggregate community opinions and reviews. In the final stage, a website will be deployed to host the findings of this project.
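The two core steps described above (grouping similar tags, then locating sentences that mention both tags of a pair) can be illustrated with a minimal sketch. This is not the project's implementation: to stay dependency-free it replaces Word2Vec with plain cosine similarity over tag co-occurrence counts, and the toy post data and all function names are hypothetical.

```python
import re
from collections import Counter
from itertools import combinations
from math import sqrt

# Hypothetical toy data: each inner list is the tag set of one post.
POSTS_TAGS = [
    ["python", "django", "orm"],
    ["python", "flask", "orm"],
    ["python", "django", "templates"],
    ["python", "flask", "templates"],
    ["java", "spring", "orm"],
    ["java", "spring", "maven"],
]


def cooccurrence_vectors(posts_tags):
    """Map each tag to a Counter of the tags it appears alongside."""
    vectors = {}
    for tags in posts_tags:
        for a, b in combinations(set(tags), 2):
            vectors.setdefault(a, Counter())[b] += 1
            vectors.setdefault(b, Counter())[a] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


def most_similar(tag, vectors, topn=3):
    """Rank the other tags by similarity to `tag` (a stand-in for Word2Vec neighbours)."""
    scores = [
        (other, cosine(vectors[tag], vec))
        for other, vec in vectors.items()
        if other != tag
    ]
    return sorted(scores, key=lambda pair: (-pair[1], pair[0]))[:topn]


def sentences_with_pair(text, tag_a, tag_b):
    """Return the sentences of `text` that mention both tags (case-insensitive)."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())

    def mentions(sentence, tag):
        pattern = r"(?<!\w)" + re.escape(tag) + r"(?!\w)"
        return re.search(pattern, sentence, re.IGNORECASE) is not None

    return [s for s in sentences if mentions(s, tag_a) and mentions(s, tag_b)]


if __name__ == "__main__":
    vectors = cooccurrence_vectors(POSTS_TAGS)
    print(most_similar("django", vectors))
    post = "I prefer Flask to Django for small APIs. Django ships with an ORM."
    print(sentences_with_pair(post, "django", "flask"))
```

In this toy data, django and flask share identical co-occurrence patterns, so they surface as the closest pair; a real pipeline would train embeddings on the full Stack Overflow tag data and apply further filtering by category before mining sentences.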
author2 Jiang Xudong
author_facet Jiang Xudong
Lin, Tian
format Final Year Project
author Lin, Tian
author_sort Lin, Tian
title Extracting similar technology comparisons from crowd discussion on Stack Overflow
publishDate 2018
url http://hdl.handle.net/10356/75383
_version_ 1772828444427550720