ARSeek: identifying API resource using code and discussion on stack overflow

It is not a trivial problem to collect API-relevant examples, usages, and mentions on venues such as Stack Overflow. It requires efforts to correctly recognize whether the discussion refers to the API method that developers/tools are searching for. The content of the Stack Overflow thread, which con...

Full description

Saved in:
Bibliographic Details
Main Authors: LUONG, Gia Kien, HADI, Mohammad, Ferdian, Thung, FARD, Fatemeh H., LO, David
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7692
https://ink.library.smu.edu.sg/context/sis_research/article/8695/viewcontent/929800a331.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8695
record_format dspace
spelling sg-smu-ink.sis_research-86952023-01-10T03:13:45Z ARSeek: identifying API resource using code and discussion on stack overflow LUONG, Gia Kien HADI, Mohammad Ferdian, Thung FARD, Fatemeh H. LO, David It is not a trivial problem to collect API-relevant examples, usages, and mentions on venues such as Stack Overflow. It requires efforts to correctly recognize whether the discussion refers to the API method that developers/tools are searching for. The content of the Stack Overflow thread, which consists of both text paragraphs describing the involvement of the API method in the discussion and the code snippets containing the API invocation, may refer to the given API method. Leveraging this observation, we develop ARSeek, a context-specific algorithm to capture the semantic and syntactic information of the paragraphs and code snippets in a discussion. ARSeek combines a syntactic word-based score with a score from a predictive model fine-tuned from CodeBERT. In terms of F1-score, ARSeek achieves an average score of 0.8709 and beats the state-of-the-art approach by 14%. 2022-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7692 info:doi/10.1145/3524610.3527918 https://ink.library.smu.edu.sg/context/sis_research/article/8695/viewcontent/929800a331.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University API resource API embedding Content classification Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic API resource
API embedding
Content classification
Databases and Information Systems
spellingShingle API resource
API embedding
Content classification
Databases and Information Systems
LUONG, Gia Kien
HADI, Mohammad
Ferdian, Thung
FARD, Fatemeh H.
LO, David
ARSeek: identifying API resource using code and discussion on stack overflow
description It is not a trivial problem to collect API-relevant examples, usages, and mentions on venues such as Stack Overflow. It requires efforts to correctly recognize whether the discussion refers to the API method that developers/tools are searching for. The content of the Stack Overflow thread, which consists of both text paragraphs describing the involvement of the API method in the discussion and the code snippets containing the API invocation, may refer to the given API method. Leveraging this observation, we develop ARSeek, a context-specific algorithm to capture the semantic and syntactic information of the paragraphs and code snippets in a discussion. ARSeek combines a syntactic word-based score with a score from a predictive model fine-tuned from CodeBERT. In terms of F1-score, ARSeek achieves an average score of 0.8709 and beats the state-of-the-art approach by 14%.
format text
author LUONG, Gia Kien
HADI, Mohammad
Ferdian, Thung
FARD, Fatemeh H.
LO, David
author_facet LUONG, Gia Kien
HADI, Mohammad
Ferdian, Thung
FARD, Fatemeh H.
LO, David
author_sort LUONG, Gia Kien
title ARSeek: identifying API resource using code and discussion on stack overflow
title_short ARSeek: identifying API resource using code and discussion on stack overflow
title_full ARSeek: identifying API resource using code and discussion on stack overflow
title_fullStr ARSeek: identifying API resource using code and discussion on stack overflow
title_full_unstemmed ARSeek: identifying API resource using code and discussion on stack overflow
title_sort arseek: identifying api resource using code and discussion on stack overflow
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7692
https://ink.library.smu.edu.sg/context/sis_research/article/8695/viewcontent/929800a331.pdf
_version_ 1770576415223185408