Characterization and prediction of questions without accepted answers on Stack Overflow

A fast and effective approach to obtain information regarding software development problems is to search them to find similar solved problems or post questions on community question answering (CQA) websites. Solving coding problems in a short time is important, so these CQAs have a considerable impa...

Full description

Saved in:
Bibliographic Details
Main Authors: YAZDANINIA, Mohamad, LO, David, SAMI, Ashkan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6059
https://ink.library.smu.edu.sg/context/sis_research/article/7062/viewcontent/Characterization_Prediction_QA_Stack_Overflow_2021_av.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7062
record_format dspace
spelling sg-smu-ink.sis_research-70622021-09-01T01:40:38Z Characterization and prediction of questions without accepted answers on Stack Overflow YAZDANINIA, Mohamad LO, David SAMI, Ashkan A fast and effective approach to obtain information regarding software development problems is to search them to find similar solved problems or post questions on community question answering (CQA) websites. Solving coding problems in a short time is important, so these CQAs have a considerable impact on the software development process. However, if developers do not get their expected answers, the websites will not be useful, and software development time will increase. Stack Overflow is the most popular CQA concerning programming problems. According to its rules, the only sign that shows a question poser has achieved the desired answer is the user's acceptance. In this paper, we investigate unresolved questions, without accepted answers, on Stack Overflow. The number of unresolved questions is increasing. As of August 2019, 47% of Stack Overflow questions were unresolved. In this study, we analyze the effectiveness of various features, including some novel features, to resolve a question. We do not use the features that contain information not present at the time of asking a question, such as answers. To evaluate our features, we deploy several predictive models trained on the features of 18 million questions to predict whether a question will get an accepted answer or not. The results of this study show a significant relationship between our proposed features and getting accepted answers. Finally, we introduce an online tool that predicts whether a question will get an accepted answer or not. Currently, Stack Overflow's users do not receive any feedback on their questions before asking them, so they could carelessly ask unclear, unreadable, or inappropriately tagged questions. By using this tool, they can modify their questions and tags to check the different results of the tool and deliberately improve their questions to get accepted answers. 2021-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6059 info:doi/10.1109/ICPC52881.2021.00015 https://ink.library.smu.edu.sg/context/sis_research/article/7062/viewcontent/Characterization_Prediction_QA_Stack_Overflow_2021_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University coding problems empirical software engineering Stack Overflow Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic coding problems
empirical software engineering
Stack Overflow
Software Engineering
spellingShingle coding problems
empirical software engineering
Stack Overflow
Software Engineering
YAZDANINIA, Mohamad
LO, David
SAMI, Ashkan
Characterization and prediction of questions without accepted answers on Stack Overflow
description A fast and effective approach to obtain information regarding software development problems is to search them to find similar solved problems or post questions on community question answering (CQA) websites. Solving coding problems in a short time is important, so these CQAs have a considerable impact on the software development process. However, if developers do not get their expected answers, the websites will not be useful, and software development time will increase. Stack Overflow is the most popular CQA concerning programming problems. According to its rules, the only sign that shows a question poser has achieved the desired answer is the user's acceptance. In this paper, we investigate unresolved questions, without accepted answers, on Stack Overflow. The number of unresolved questions is increasing. As of August 2019, 47% of Stack Overflow questions were unresolved. In this study, we analyze the effectiveness of various features, including some novel features, to resolve a question. We do not use the features that contain information not present at the time of asking a question, such as answers. To evaluate our features, we deploy several predictive models trained on the features of 18 million questions to predict whether a question will get an accepted answer or not. The results of this study show a significant relationship between our proposed features and getting accepted answers. Finally, we introduce an online tool that predicts whether a question will get an accepted answer or not. Currently, Stack Overflow's users do not receive any feedback on their questions before asking them, so they could carelessly ask unclear, unreadable, or inappropriately tagged questions. By using this tool, they can modify their questions and tags to check the different results of the tool and deliberately improve their questions to get accepted answers.
format text
author YAZDANINIA, Mohamad
LO, David
SAMI, Ashkan
author_facet YAZDANINIA, Mohamad
LO, David
SAMI, Ashkan
author_sort YAZDANINIA, Mohamad
title Characterization and prediction of questions without accepted answers on Stack Overflow
title_short Characterization and prediction of questions without accepted answers on Stack Overflow
title_full Characterization and prediction of questions without accepted answers on Stack Overflow
title_fullStr Characterization and prediction of questions without accepted answers on Stack Overflow
title_full_unstemmed Characterization and prediction of questions without accepted answers on Stack Overflow
title_sort characterization and prediction of questions without accepted answers on stack overflow
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6059
https://ink.library.smu.edu.sg/context/sis_research/article/7062/viewcontent/Characterization_Prediction_QA_Stack_Overflow_2021_av.pdf
_version_ 1770575805046325248