Characterization and prediction of questions without accepted answers on Stack Overflow
A fast and effective approach to obtain information regarding software development problems is to search them to find similar solved problems or post questions on community question answering (CQA) websites. Solving coding problems in a short time is important, so these CQAs have a considerable impa...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6059 https://ink.library.smu.edu.sg/context/sis_research/article/7062/viewcontent/Characterization_Prediction_QA_Stack_Overflow_2021_av.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7062 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-70622021-09-01T01:40:38Z Characterization and prediction of questions without accepted answers on Stack Overflow YAZDANINIA, Mohamad LO, David SAMI, Ashkan A fast and effective approach to obtain information regarding software development problems is to search them to find similar solved problems or post questions on community question answering (CQA) websites. Solving coding problems in a short time is important, so these CQAs have a considerable impact on the software development process. However, if developers do not get their expected answers, the websites will not be useful, and software development time will increase. Stack Overflow is the most popular CQA concerning programming problems. According to its rules, the only sign that shows a question poser has achieved the desired answer is the user's acceptance. In this paper, we investigate unresolved questions, without accepted answers, on Stack Overflow. The number of unresolved questions is increasing. As of August 2019, 47% of Stack Overflow questions were unresolved. In this study, we analyze the effectiveness of various features, including some novel features, to resolve a question. We do not use the features that contain information not present at the time of asking a question, such as answers. To evaluate our features, we deploy several predictive models trained on the features of 18 million questions to predict whether a question will get an accepted answer or not. The results of this study show a significant relationship between our proposed features and getting accepted answers. Finally, we introduce an online tool that predicts whether a question will get an accepted answer or not. Currently, Stack Overflow's users do not receive any feedback on their questions before asking them, so they could carelessly ask unclear, unreadable, or inappropriately tagged questions. By using this tool, they can modify their questions and tags to check the different results of the tool and deliberately improve their questions to get accepted answers. 2021-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6059 info:doi/10.1109/ICPC52881.2021.00015 https://ink.library.smu.edu.sg/context/sis_research/article/7062/viewcontent/Characterization_Prediction_QA_Stack_Overflow_2021_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University coding problems empirical software engineering Stack Overflow Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
coding problems empirical software engineering Stack Overflow Software Engineering |
spellingShingle |
coding problems empirical software engineering Stack Overflow Software Engineering YAZDANINIA, Mohamad LO, David SAMI, Ashkan Characterization and prediction of questions without accepted answers on Stack Overflow |
description |
A fast and effective approach to obtain information regarding software development problems is to search them to find similar solved problems or post questions on community question answering (CQA) websites. Solving coding problems in a short time is important, so these CQAs have a considerable impact on the software development process. However, if developers do not get their expected answers, the websites will not be useful, and software development time will increase. Stack Overflow is the most popular CQA concerning programming problems. According to its rules, the only sign that shows a question poser has achieved the desired answer is the user's acceptance. In this paper, we investigate unresolved questions, without accepted answers, on Stack Overflow. The number of unresolved questions is increasing. As of August 2019, 47% of Stack Overflow questions were unresolved. In this study, we analyze the effectiveness of various features, including some novel features, to resolve a question. We do not use the features that contain information not present at the time of asking a question, such as answers. To evaluate our features, we deploy several predictive models trained on the features of 18 million questions to predict whether a question will get an accepted answer or not. The results of this study show a significant relationship between our proposed features and getting accepted answers. Finally, we introduce an online tool that predicts whether a question will get an accepted answer or not. Currently, Stack Overflow's users do not receive any feedback on their questions before asking them, so they could carelessly ask unclear, unreadable, or inappropriately tagged questions. By using this tool, they can modify their questions and tags to check the different results of the tool and deliberately improve their questions to get accepted answers. |
format |
text |
author |
YAZDANINIA, Mohamad LO, David SAMI, Ashkan |
author_facet |
YAZDANINIA, Mohamad LO, David SAMI, Ashkan |
author_sort |
YAZDANINIA, Mohamad |
title |
Characterization and prediction of questions without accepted answers on Stack Overflow |
title_short |
Characterization and prediction of questions without accepted answers on Stack Overflow |
title_full |
Characterization and prediction of questions without accepted answers on Stack Overflow |
title_fullStr |
Characterization and prediction of questions without accepted answers on Stack Overflow |
title_full_unstemmed |
Characterization and prediction of questions without accepted answers on Stack Overflow |
title_sort |
characterization and prediction of questions without accepted answers on stack overflow |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2021 |
url |
https://ink.library.smu.edu.sg/sis_research/6059 https://ink.library.smu.edu.sg/context/sis_research/article/7062/viewcontent/Characterization_Prediction_QA_Stack_Overflow_2021_av.pdf |
_version_ |
1770575805046325248 |