Query Expansion via Wordnet for Effective Code Search

Bibliographic Details
Main Authors: LU, Meili; SUN, Xiaobing; WANG, Shaowei; LO, David; DUAN, Yucong
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2015
Subjects: Software Engineering
Online Access: https://ink.library.smu.edu.sg/sis_research/3080
Institution: Singapore Management University
id sg-smu-ink.sis_research-4080
record_format dspace
spelling sg-smu-ink.sis_research-4080 2016-02-05T06:30:05Z 2015-03-06T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/3080 info:doi/10.1109/SANER.2015.7081874 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Software Engineering
description Source code search plays an important role in software maintenance. The effectiveness of source code search depends not only on the search technique but also on the quality of the query. In practice, software systems are large, so it is difficult for a developer to formulate an accurate query that expresses what is really in her/his mind, especially when the maintainer and the original developer are not the same person. When a query performs poorly, it has to be reformulated. However, the words used in a query may differ from the words with similar semantics in the source code, i.e., their synonyms, which can reduce the accuracy of code search results. To address this issue, we propose an approach that extends a query with synonyms generated from WordNet. Our approach extracts natural language phrases from source code identifiers, matches expanded queries with these phrases, and sorts the search results. It allows developers to explore word usage in a piece of software and helps them quickly identify relevant program elements for investigation or quickly recognize alternative words for query reformulation. Our initial empirical study on search tasks performed on Rhino, a JavaScript/ECMAScript interpreter and compiler, shows that the synonyms used to expand the queries help recommend good alternative queries. Our approach also improves the precision and recall of Conquer, a state-of-the-art query expansion/reformulation technique, by 5% and 8% respectively.
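The pipeline outlined in the description (expand the query with synonyms, extract words from identifiers, match, then sort) can be illustrated with a minimal sketch. This is not the authors' implementation. The `SYNONYMS` table is a small hypothetical stand-in for WordNet lookups, the function names are invented for illustration, and ranking by the number of matched query words is an assumed scoring rule, not the one used in the paper.

```python
import re

# Hypothetical stand-in for WordNet: a tiny hand-written synonym table.
# A real setup would query WordNet for each query word instead.
SYNONYMS = {
    "remove": {"delete", "erase"},
    "find": {"search", "lookup", "locate"},
    "create": {"make", "build"},
}


def split_identifier(identifier):
    """Split a camelCase or snake_case identifier into lowercase words."""
    parts = re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", identifier)
    return [p.lower() for p in parts]


def expand_query(query):
    """Return one set per query word: the word itself plus its synonyms."""
    return [{w} | SYNONYMS.get(w, set()) for w in query.lower().split()]


def search(query, identifiers):
    """Rank identifiers by how many query words (or synonyms) they contain.

    Assumed scoring: one point per query word whose expansion set
    overlaps the identifier's words; ties are broken alphabetically.
    """
    expanded = expand_query(query)
    scored = []
    for ident in identifiers:
        words = set(split_identifier(ident))
        score = sum(1 for alternatives in expanded if alternatives & words)
        if score:
            scored.append((score, ident))
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [ident for _, ident in scored]


identifiers = ["deleteProperty", "lookupNode", "eraseNode", "parseXMLFile"]
print(search("remove node", identifiers))
# → ['eraseNode', 'deleteProperty', 'lookupNode']
```

In this sketch the query "remove node" retrieves `eraseNode` first because both query words match after expansion, while `deleteProperty` is found only through the synonym "delete". The matched synonyms also suggest alternative words a developer could use when reformulating the query.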
format text
author LU, Meili; SUN, Xiaobing; WANG, Shaowei; LO, David; DUAN, Yucong
title Query Expansion via Wordnet for Effective Code Search
publisher Institutional Knowledge at Singapore Management University
publishDate 2015
url https://ink.library.smu.edu.sg/sis_research/3080