Automated question title reformulation by mining modifcation logs from Stack Overflow
In Stack Overflow, developers may not clarify and summarize the critical problems in the question titles due to a lack of domain knowledge or poor writing skills. Previous studies mainly focused on automatically generating the question titles by analyzing the posts’ problem descriptions and code sni...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8225 https://ink.library.smu.edu.sg/context/sis_research/article/9228/viewcontent/Automated_question_title_reformulation_by_mining_modifcation_logs_av.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-9228 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-92282023-11-03T02:38:58Z Automated question title reformulation by mining modifcation logs from Stack Overflow LIU, Ke CHEN, Xiang CHEN, Chunyang XIE, Xiaofei CUI, Zhanqi In Stack Overflow, developers may not clarify and summarize the critical problems in the question titles due to a lack of domain knowledge or poor writing skills. Previous studies mainly focused on automatically generating the question titles by analyzing the posts’ problem descriptions and code snippets. In this study, we aim to improve title quality from the perspective of question title reformulation and propose a novel approach QETRA motivated by the findings of our formative study. Specifically, by mining modification logs from Stack Overflow, we first extract title reformulation pairs containing the original title and the reformulated title. Then we resort to multi-task learning by formalizing title reformulation for each programming language as separate but related tasks. Later we adopt a pre-trained model T5 to automatically learn the title reformulation patterns. Automated evaluation and human study both show the competitiveness of QETRA after compared with six state-of-the-art baselines. Moreover, our ablation study results also confirm that our studied question title reformulation task is more practical than the direct question title generation task for generating high-quality titles. Finally, we develop a browser plugin based on QETRA to facilitate the developers to perform title reformulation. Our study provides a new perspective for studying the quality of post titles and can further generate high-quality titles. 2023-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8225 info:doi/10.1109/TSE.2023.3292399 https://ink.library.smu.edu.sg/context/sis_research/article/9228/viewcontent/Automated_question_title_reformulation_by_mining_modifcation_logs_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Stack Overflow mining question post quality assurance question title reformulation modification logs deeplearning Artificial Intelligence and Robotics Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Stack Overflow mining question post quality assurance question title reformulation modification logs deeplearning Artificial Intelligence and Robotics Software Engineering |
spellingShingle |
Stack Overflow mining question post quality assurance question title reformulation modification logs deeplearning Artificial Intelligence and Robotics Software Engineering LIU, Ke CHEN, Xiang CHEN, Chunyang XIE, Xiaofei CUI, Zhanqi Automated question title reformulation by mining modifcation logs from Stack Overflow |
description |
In Stack Overflow, developers may not clarify and summarize the critical problems in the question titles due to a lack of domain knowledge or poor writing skills. Previous studies mainly focused on automatically generating the question titles by analyzing the posts’ problem descriptions and code snippets. In this study, we aim to improve title quality from the perspective of question title reformulation and propose a novel approach QETRA motivated by the findings of our formative study. Specifically, by mining modification logs from Stack Overflow, we first extract title reformulation pairs containing the original title and the reformulated title. Then we resort to multi-task learning by formalizing title reformulation for each programming language as separate but related tasks. Later we adopt a pre-trained model T5 to automatically learn the title reformulation patterns. Automated evaluation and human study both show the competitiveness of QETRA after compared with six state-of-the-art baselines. Moreover, our ablation study results also confirm that our studied question title reformulation task is more practical than the direct question title generation task for generating high-quality titles. Finally, we develop a browser plugin based on QETRA to facilitate the developers to perform title reformulation. Our study provides a new perspective for studying the quality of post titles and can further generate high-quality titles. |
format |
text |
author |
LIU, Ke CHEN, Xiang CHEN, Chunyang XIE, Xiaofei CUI, Zhanqi |
author_facet |
LIU, Ke CHEN, Xiang CHEN, Chunyang XIE, Xiaofei CUI, Zhanqi |
author_sort |
LIU, Ke |
title |
Automated question title reformulation by mining modifcation logs from Stack Overflow |
title_short |
Automated question title reformulation by mining modifcation logs from Stack Overflow |
title_full |
Automated question title reformulation by mining modifcation logs from Stack Overflow |
title_fullStr |
Automated question title reformulation by mining modifcation logs from Stack Overflow |
title_full_unstemmed |
Automated question title reformulation by mining modifcation logs from Stack Overflow |
title_sort |
automated question title reformulation by mining modifcation logs from stack overflow |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/8225 https://ink.library.smu.edu.sg/context/sis_research/article/9228/viewcontent/Automated_question_title_reformulation_by_mining_modifcation_logs_av.pdf |
_version_ |
1783955656781332480 |