Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners

Stack Overflow is a popular Q&A platform for developers to find solutions to programming problems. However, due to the varying quality of user-generated answers, there is a need for ways to help users find high-quality answers. While Stack Overflow's community-based approach can be effectiv...

Full description

Saved in:
Bibliographic Details
Main Authors: CHEN, Tianhao, OUH, Eng Lieh, TAN, Kar Way, LO, Siaw Ling
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8066
https://ink.library.smu.edu.sg/context/sis_research/article/9069/viewcontent/Machine_Learning_Approach_to_Automated_Doubt_Identification_on_Stack_Overflow_Comments_to_Guide_Programming_Learners.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9069
record_format dspace
spelling sg-smu-ink.sis_research-90692023-09-07T08:00:40Z Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners CHEN, Tianhao OUH, Eng Lieh TAN, Kar Way LO, Siaw Ling Stack Overflow is a popular Q&A platform for developers to find solutions to programming problems. However, due to the varying quality of user-generated answers, there is a need for ways to help users find high-quality answers. While Stack Overflow's community-based approach can be effective, important technical aspects of the answer need to be captured, and users’ comments might contain doubts regarding these aspects. In this paper, we showed the feasibility of using a machine learning model to identify doubts and conducted data analysis. We found that highly reputed users tend to raise more doubts; most answers have doubt in the first comment, and many answers have unsolved doubt in the last comment; high-score and low-score answers are equally likely to contain doubts in comments. Our classifier and findings can provide users with a new perspective on determining answers’ helpfulness and allow expert users to easily locate doubts to address. 2023-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8066 https://ink.library.smu.edu.sg/context/sis_research/article/9069/viewcontent/Machine_Learning_Approach_to_Automated_Doubt_Identification_on_Stack_Overflow_Comments_to_Guide_Programming_Learners.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Stack Overflow Doubt Identification Text Analytics Numerical Analysis and Scientific Computing Programming Languages and Compilers
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Stack Overflow
Doubt Identification
Text Analytics
Numerical Analysis and Scientific Computing
Programming Languages and Compilers
spellingShingle Stack Overflow
Doubt Identification
Text Analytics
Numerical Analysis and Scientific Computing
Programming Languages and Compilers
CHEN, Tianhao
OUH, Eng Lieh
TAN, Kar Way
LO, Siaw Ling
Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners
description Stack Overflow is a popular Q&A platform for developers to find solutions to programming problems. However, due to the varying quality of user-generated answers, there is a need for ways to help users find high-quality answers. While Stack Overflow's community-based approach can be effective, important technical aspects of the answer need to be captured, and users’ comments might contain doubts regarding these aspects. In this paper, we showed the feasibility of using a machine learning model to identify doubts and conducted data analysis. We found that highly reputed users tend to raise more doubts; most answers have doubt in the first comment, and many answers have unsolved doubt in the last comment; high-score and low-score answers are equally likely to contain doubts in comments. Our classifier and findings can provide users with a new perspective on determining answers’ helpfulness and allow expert users to easily locate doubts to address.
format text
author CHEN, Tianhao
OUH, Eng Lieh
TAN, Kar Way
LO, Siaw Ling
author_facet CHEN, Tianhao
OUH, Eng Lieh
TAN, Kar Way
LO, Siaw Ling
author_sort CHEN, Tianhao
title Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners
title_short Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners
title_full Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners
title_fullStr Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners
title_full_unstemmed Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners
title_sort machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8066
https://ink.library.smu.edu.sg/context/sis_research/article/9069/viewcontent/Machine_Learning_Approach_to_Automated_Doubt_Identification_on_Stack_Overflow_Comments_to_Guide_Programming_Learners.pdf
_version_ 1779157135348727808