Extracting similar technology comparisons from crowd discussion on stack overflow
Nowadays there are many technologies available for developers to choose from when deciding which ones to adopt for their software projects. Technologies that fall into the same categories can provide similar functionalities yet excel in different features when comparing with each other. When making...
Main Author: | Lin, Tian |
---|---|
Other Authors: | Jiang Xudong |
Format: | Final Year Project |
Language: | English |
Published: | 2018 |
Subjects: | DRNTU::Engineering::Computer science and engineering |
Online Access: | http://hdl.handle.net/10356/75383 |
Institution: | Nanyang Technological University |
id | sg-ntu-dr.10356-75383 |
---|---|
record_format | dspace |
spelling | sg-ntu-dr.10356-75383 2023-07-07T17:15:59Z Extracting similar technology comparisons from crowd discussion on stack overflow Lin, Tian Jiang Xudong Liu Yang School of Electrical and Electronic Engineering DRNTU::Engineering::Computer science and engineering Developers today can choose from many technologies when deciding which to adopt for their software projects. Technologies in the same category can provide similar functionality yet excel in different features when compared with each other. When choosing technologies, developers tend to turn to online resources such as assessment platforms or community reviews to understand the technology landscape. Reviews on these platforms are mostly opinion-based, and the information is sometimes out of date. In addition, reviews and comparisons are scattered across many sites, making it hard to get a centralized view of the aggregated opinions of the crowd. This report exploits the fact that posts on Stack Overflow are tagged by users with the most closely related technologies, which increases the precision of search results. Generated by users when they post content online, tags classify content and categorize it into groups for better organization. A program based on the Word2Vec model is developed to learn the relations among these tags, extract similar tags that fall into the same comparable categories, and perform comparative opinion mining to locate sentences containing pairs of similar tags. Finally, a website (https://similartagsheroku.herokuapp.com) is developed so that the community can access, reference, and evaluate our findings on similar technologies and comparative opinion reviews for certain technologies at any time. Based on these ideas, the project first gathers, cleans, and organizes Stack Overflow tag data. Next, the data is fed into a Word2Vec model to reconstruct the relationships between tags, so that similar tags are filtered by comparable category and grouped by proximity. Lastly, comparative opinion mining is conducted on similar technology pairs to aggregate community opinions and reviews. In the final stage, a website is deployed to host the findings of this project. Bachelor of Engineering 2018-05-31T02:59:43Z 2018-05-31T02:59:43Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/75383 en Nanyang Technological University 61 p. application/pdf |
institution | Nanyang Technological University |
building | NTU Library |
continent | Asia |
country | Singapore Singapore |
content_provider | NTU Library |
collection | DR-NTU |
language | English |
topic | DRNTU::Engineering::Computer science and engineering |
spellingShingle | DRNTU::Engineering::Computer science and engineering Lin, Tian Extracting similar technology comparisons from crowd discussion on stack overflow |
description | Developers today can choose from many technologies when deciding which to adopt for their software projects. Technologies in the same category can provide similar functionality yet excel in different features when compared with each other. When choosing technologies, developers tend to turn to online resources such as assessment platforms or community reviews to understand the technology landscape. Reviews on these platforms are mostly opinion-based, and the information is sometimes out of date. In addition, reviews and comparisons are scattered across many sites, making it hard to get a centralized view of the aggregated opinions of the crowd. This report exploits the fact that posts on Stack Overflow are tagged by users with the most closely related technologies, which increases the precision of search results. Generated by users when they post content online, tags classify content and categorize it into groups for better organization. A program based on the Word2Vec model is developed to learn the relations among these tags, extract similar tags that fall into the same comparable categories, and perform comparative opinion mining to locate sentences containing pairs of similar tags. Finally, a website (https://similartagsheroku.herokuapp.com) is developed so that the community can access, reference, and evaluate our findings on similar technologies and comparative opinion reviews for certain technologies at any time. Based on these ideas, the project first gathers, cleans, and organizes Stack Overflow tag data. Next, the data is fed into a Word2Vec model to reconstruct the relationships between tags, so that similar tags are filtered by comparable category and grouped by proximity. Lastly, comparative opinion mining is conducted on similar technology pairs to aggregate community opinions and reviews. In the final stage, a website is deployed to host the findings of this project. |
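
The pipeline described in the abstract (relate tags, find similar tags, then locate sentences that compare them) can be illustrated with a small, dependency-free sketch. This is not the project's implementation: the report uses a Word2Vec model, whereas the sketch below substitutes plain cosine similarity over tag co-occurrence counts, which captures the same "similar context" intuition on toy data. The posts in `POSTS`, the cue pattern `CUE`, and all function names are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each post is (set of tags, body text).
POSTS = [
    ({"python", "django", "orm"}, "Django is heavier than Flask. I use the ORM daily."),
    ({"python", "flask", "orm"}, "Flask is simpler than Django for small APIs."),
    ({"java", "spring"}, "Spring needs a lot of configuration."),
]

# Assumed comparative cues; a real system would need a much richer pattern set.
CUE = re.compile(r"\b(?:\w+er|more|less)\s+than\b|\bversus\b", re.I)


def build_vectors(posts):
    """Map each tag to a Counter of the tags it co-occurs with."""
    vectors = {}
    for tags, _ in posts:
        for a, b in combinations(sorted(tags), 2):
            vectors.setdefault(a, Counter())[b] += 1
            vectors.setdefault(b, Counter())[a] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(
        sum(x * x for x in v.values())
    )
    return dot / norm if norm else 0.0


def most_similar(tag, vectors, topn=3):
    """Rank other tags by similarity of their co-occurrence context."""
    scores = [
        (other, cosine(vectors[tag], vec))
        for other, vec in vectors.items()
        if other != tag
    ]
    return sorted(scores, key=lambda p: (-p[1], p[0]))[:topn]


def mentions(tag, sentence):
    """True if the tag appears as a whole token (works for tags like 'c++')."""
    return re.search(r"(?<!\w)" + re.escape(tag) + r"(?!\w)", sentence, re.I) is not None


def comparative_sentences(tag_a, tag_b, posts):
    """Return sentences that mention both tags and contain a comparative cue."""
    found = []
    for _, body in posts:
        for sentence in re.split(r"(?<=[.!?])\s+", body.strip()):
            if mentions(tag_a, sentence) and mentions(tag_b, sentence) and CUE.search(sentence):
                found.append(sentence)
    return found


if __name__ == "__main__":
    vectors = build_vectors(POSTS)
    best_tag, _ = most_similar("django", vectors, topn=1)[0]
    for sentence in comparative_sentences("django", best_tag, POSTS):
        print(sentence)
```

In this toy data `django` and `flask` never appear on the same post, yet they rank as most similar because they share the same neighbouring tags, which is the property that embedding models such as Word2Vec exploit at scale.
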
author2 | Jiang Xudong |
author_facet | Jiang Xudong Lin, Tian |
format | Final Year Project |
author | Lin, Tian |
author_sort | Lin, Tian |
title | Extracting similar technology comparisons from crowd discussion on stack overflow |
title_short | Extracting similar technology comparisons from crowd discussion on stack overflow |
title_full | Extracting similar technology comparisons from crowd discussion on stack overflow |
title_fullStr | Extracting similar technology comparisons from crowd discussion on stack overflow |
title_full_unstemmed | Extracting similar technology comparisons from crowd discussion on stack overflow |
title_sort | extracting similar technology comparisons from crowd discussion on stack overflow |
publishDate | 2018 |
url | http://hdl.handle.net/10356/75383 |
_version_ | 1772828444427550720 |