SIEVE: Helping developers sift wheat from chaff via cross-platform analysis

Software developers have benefited from various sources of knowledge such as forums, question-and-answer sites, and social media platforms to help them in various tasks. Extracting software-related knowledge from different platforms involves many challenges. In this paper, we propose an approach to...

Full description

Saved in:
Bibliographic Details
Main Authors: SULISTYA, Agus, PRANA, Gede A. A. P., LO, David, TREUDE, Christoph
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4499
https://ink.library.smu.edu.sg/context/sis_research/article/5502/viewcontent/arXiv_1810.13144.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5502
record_format dspace
spelling sg-smu-ink.sis_research-55022021-05-12T01:50:11Z SIEVE: Helping developers sift wheat from chaff via cross-platform analysis SULISTYA, Agus PRANA, Gede A. A. P. LO, David TREUDE, Christoph Software developers have benefited from various sources of knowledge such as forums, question-and-answer sites, and social media platforms to help them in various tasks. Extracting software-related knowledge from different platforms involves many challenges. In this paper, we propose an approach to improve the effectiveness of knowledge extraction tasks by performing cross-platform analysis. Our approach is based on transfer representation learning and word embedding, leveraging information extracted from a source platform which contains rich domain-related content. The information extracted is then used to solve tasks in another platform (considered as target platform) with less domain-related content. We first build a word embedding model as a representation learned from the source platform, and use the model to improve the performance of knowledge extraction tasks in the target platform. We experiment with Software Engineering Stack Exchange and Stack Overflow as source platforms, and two different target platforms, i.e., Twitter and YouTube. Our experiments show that our approach improves performance of existing work for the tasks of identifying software-related tweets and helpful YouTube comments. 2019-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4499 info:doi/10.1007/s10664-019-09775-w https://ink.library.smu.edu.sg/context/sis_research/article/5502/viewcontent/arXiv_1810.13144.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Word Embeddings Transfer Representation Learning Software Engineering Programming Languages and Compilers Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Word Embeddings
Transfer Representation Learning
Software Engineering
Programming Languages and Compilers
Software Engineering
spellingShingle Word Embeddings
Transfer Representation Learning
Software Engineering
Programming Languages and Compilers
Software Engineering
SULISTYA, Agus
PRANA, Gede A. A. P.
LO, David
TREUDE, Christoph
SIEVE: Helping developers sift wheat from chaff via cross-platform analysis
description Software developers have benefited from various sources of knowledge such as forums, question-and-answer sites, and social media platforms to help them in various tasks. Extracting software-related knowledge from different platforms involves many challenges. In this paper, we propose an approach to improve the effectiveness of knowledge extraction tasks by performing cross-platform analysis. Our approach is based on transfer representation learning and word embedding, leveraging information extracted from a source platform which contains rich domain-related content. The information extracted is then used to solve tasks in another platform (considered as target platform) with less domain-related content. We first build a word embedding model as a representation learned from the source platform, and use the model to improve the performance of knowledge extraction tasks in the target platform. We experiment with Software Engineering Stack Exchange and Stack Overflow as source platforms, and two different target platforms, i.e., Twitter and YouTube. Our experiments show that our approach improves performance of existing work for the tasks of identifying software-related tweets and helpful YouTube comments.
format text
author SULISTYA, Agus
PRANA, Gede A. A. P.
LO, David
TREUDE, Christoph
author_facet SULISTYA, Agus
PRANA, Gede A. A. P.
LO, David
TREUDE, Christoph
author_sort SULISTYA, Agus
title SIEVE: Helping developers sift wheat from chaff via cross-platform analysis
title_short SIEVE: Helping developers sift wheat from chaff via cross-platform analysis
title_full SIEVE: Helping developers sift wheat from chaff via cross-platform analysis
title_fullStr SIEVE: Helping developers sift wheat from chaff via cross-platform analysis
title_full_unstemmed SIEVE: Helping developers sift wheat from chaff via cross-platform analysis
title_sort sieve: helping developers sift wheat from chaff via cross-platform analysis
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/4499
https://ink.library.smu.edu.sg/context/sis_research/article/5502/viewcontent/arXiv_1810.13144.pdf
_version_ 1770574876071952384