Augmenting API documentation with insights from stack overflow

Software developers need access to different kinds of information which is often dispersed among different documentation sources, such as API documentation or Stack Overflow. We present an approach to automatically augment API documentation with “insight sentences” from Stack Overflow— sentences tha...

Full description

Saved in:
Bibliographic Details
Main Authors: TREUDE, Christoph, ROBILLARD, Martin P.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8938
https://ink.library.smu.edu.sg/context/sis_research/article/9941/viewcontent/icse16a.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9941
record_format dspace
spelling sg-smu-ink.sis_research-99412024-07-04T08:50:39Z Augmenting API documentation with insights from stack overflow TREUDE, Christoph ROBILLARD, Martin P. Software developers need access to different kinds of information which is often dispersed among different documentation sources, such as API documentation or Stack Overflow. We present an approach to automatically augment API documentation with “insight sentences” from Stack Overflow— sentences that are related to a particular API type and that provide insight not contained in the API documentation of that type. Based on a development set of 1,574 sentences, we compare the performance of two state-of-the-art summarization techniques as well as a pattern-based approach for insight sentence extraction. We then present SISE, a novel machine learning based approach that uses as features the sentences themselves, their formatting, their question, their answer, and their authors as well as part-of-speech tags and the similarity of a sentence to the corresponding API documentation. With SISE, we were able to achieve a precision of 0.64 and a coverage of 0.7 on the development set. In a comparative study with eight software developers, we found that SISE resulted in the highest number of sentences that were considered to add useful information not found in the API documentation. These results indicate that taking into account the meta data available on Stack Overflow as well as part-of-speech tags can significantly improve unsupervised extraction approaches when applied to Stack Overflow data. 2016-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8938 info:doi/10.1145/2884781.2884800 https://ink.library.smu.edu.sg/context/sis_research/article/9941/viewcontent/icse16a.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University API documentation Insight sentences Stack Overow Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic API documentation
Insight sentences
Stack Overow
Software Engineering
spellingShingle API documentation
Insight sentences
Stack Overow
Software Engineering
TREUDE, Christoph
ROBILLARD, Martin P.
Augmenting API documentation with insights from stack overflow
description Software developers need access to different kinds of information which is often dispersed among different documentation sources, such as API documentation or Stack Overflow. We present an approach to automatically augment API documentation with “insight sentences” from Stack Overflow— sentences that are related to a particular API type and that provide insight not contained in the API documentation of that type. Based on a development set of 1,574 sentences, we compare the performance of two state-of-the-art summarization techniques as well as a pattern-based approach for insight sentence extraction. We then present SISE, a novel machine learning based approach that uses as features the sentences themselves, their formatting, their question, their answer, and their authors as well as part-of-speech tags and the similarity of a sentence to the corresponding API documentation. With SISE, we were able to achieve a precision of 0.64 and a coverage of 0.7 on the development set. In a comparative study with eight software developers, we found that SISE resulted in the highest number of sentences that were considered to add useful information not found in the API documentation. These results indicate that taking into account the meta data available on Stack Overflow as well as part-of-speech tags can significantly improve unsupervised extraction approaches when applied to Stack Overflow data.
format text
author TREUDE, Christoph
ROBILLARD, Martin P.
author_facet TREUDE, Christoph
ROBILLARD, Martin P.
author_sort TREUDE, Christoph
title Augmenting API documentation with insights from stack overflow
title_short Augmenting API documentation with insights from stack overflow
title_full Augmenting API documentation with insights from stack overflow
title_fullStr Augmenting API documentation with insights from stack overflow
title_full_unstemmed Augmenting API documentation with insights from stack overflow
title_sort augmenting api documentation with insights from stack overflow
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/8938
https://ink.library.smu.edu.sg/context/sis_research/article/9941/viewcontent/icse16a.pdf
_version_ 1814047652174626816