Machine-learning approach to automated doubt identification on stack overflow comments to guide programming learners

Stack Overflow is a popular Q&A platform for developers to find solutions to programming problems. However, due to the varying quality of user-generated answers, there is a need for ways to help users find high-quality answers. While Stack Overflow's community-based approach can be effectiv...

Full description

Saved in:
Bibliographic Details
Main Authors: CHEN, Tianhao, OUH, Eng Lieh, TAN, Kar Way, LO, Siaw Ling
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8066
https://ink.library.smu.edu.sg/context/sis_research/article/9069/viewcontent/Machine_Learning_Approach_to_Automated_Doubt_Identification_on_Stack_Overflow_Comments_to_Guide_Programming_Learners.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:Stack Overflow is a popular Q&A platform for developers to find solutions to programming problems. However, due to the varying quality of user-generated answers, there is a need for ways to help users find high-quality answers. While Stack Overflow's community-based approach can be effective, important technical aspects of the answer need to be captured, and users’ comments might contain doubts regarding these aspects. In this paper, we showed the feasibility of using a machine learning model to identify doubts and conducted data analysis. We found that highly reputed users tend to raise more doubts; most answers have doubt in the first comment, and many answers have unsolved doubt in the last comment; high-score and low-score answers are equally likely to contain doubts in comments. Our classifier and findings can provide users with a new perspective on determining answers’ helpfulness and allow expert users to easily locate doubts to address.