Mining stack overflow for API class recommendation using DOC2VEC and LDA

To address the lexical gaps between natural language (NL) queries and Application Programming Interface (API) documentations, and between NL queries and programme code, this study developed a novel approach for recommending Java API classes that are relevant to the program ming tasks described in NL...

Full description

Saved in:
Bibliographic Details
Main Authors: Lee, Wai Keat, Su, Moon Ting
Format: Article
Published: 2021
Subjects:
Online Access:http://eprints.um.edu.my/26247/
https://doi.org/10.1049/sfw2.12023
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaya
id my.um.eprints.26247
record_format eprints
spelling my.um.eprints.262472022-02-22T05:05:54Z http://eprints.um.edu.my/26247/ Mining stack overflow for API class recommendation using DOC2VEC and LDA Lee, Wai Keat Su, Moon Ting QA75 Electronic computers. Computer science To address the lexical gaps between natural language (NL) queries and Application Programming Interface (API) documentations, and between NL queries and programme code, this study developed a novel approach for recommending Java API classes that are relevant to the program ming tasks described in NL queries. A Doc2Vec model was trained using question titles mined from Stack Overflow. The model was used to find question titles that are semantically similar to a query. Latent Dirichlet Allocation (LDA) topic modelling was applied on the Java API classes (extracted from code snippets found in the accepted answers of these similar questions) to extract a single topic comprising of the Top-10 Java API classes that are relevant to the query. The benchmarking of the proposed approach against state-of-the-art approaches, RACK and NLP2API, by using four performance metrics show that it is possible to produce comparable API recommendation results using a less complex approach that makes use of some basic machine learning models, in particular, Doc2Vec and LDA. The approach was implemented in a Java API class recommender with an Eclipse IDE's plug-in serving as the front-end. 2021-10 Article PeerReviewed Lee, Wai Keat and Su, Moon Ting (2021) Mining stack overflow for API class recommendation using DOC2VEC and LDA. IET Software, 15 (5). pp. 308-322. ISSN 1751-8806, DOI https://doi.org/10.1049/sfw2.12023 <https://doi.org/10.1049/sfw2.12023>. https://doi.org/10.1049/sfw2.12023 doi:10.1049/sfw2.12023
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Lee, Wai Keat
Su, Moon Ting
Mining stack overflow for API class recommendation using DOC2VEC and LDA
description To address the lexical gaps between natural language (NL) queries and Application Programming Interface (API) documentations, and between NL queries and programme code, this study developed a novel approach for recommending Java API classes that are relevant to the program ming tasks described in NL queries. A Doc2Vec model was trained using question titles mined from Stack Overflow. The model was used to find question titles that are semantically similar to a query. Latent Dirichlet Allocation (LDA) topic modelling was applied on the Java API classes (extracted from code snippets found in the accepted answers of these similar questions) to extract a single topic comprising of the Top-10 Java API classes that are relevant to the query. The benchmarking of the proposed approach against state-of-the-art approaches, RACK and NLP2API, by using four performance metrics show that it is possible to produce comparable API recommendation results using a less complex approach that makes use of some basic machine learning models, in particular, Doc2Vec and LDA. The approach was implemented in a Java API class recommender with an Eclipse IDE's plug-in serving as the front-end.
format Article
author Lee, Wai Keat
Su, Moon Ting
author_facet Lee, Wai Keat
Su, Moon Ting
author_sort Lee, Wai Keat
title Mining stack overflow for API class recommendation using DOC2VEC and LDA
title_short Mining stack overflow for API class recommendation using DOC2VEC and LDA
title_full Mining stack overflow for API class recommendation using DOC2VEC and LDA
title_fullStr Mining stack overflow for API class recommendation using DOC2VEC and LDA
title_full_unstemmed Mining stack overflow for API class recommendation using DOC2VEC and LDA
title_sort mining stack overflow for api class recommendation using doc2vec and lda
publishDate 2021
url http://eprints.um.edu.my/26247/
https://doi.org/10.1049/sfw2.12023
_version_ 1735409390848049152