Improving reusability of software libraries through usage pattern mining

Modern software systems are increasingly dependent on third-party libraries. It is widely recognized that using mature and well-tested third-party libraries can improve developers’ productivity, reduce time-to-market, and produce more reliable software. Today’s open-source repositories provide a wid...

Full description

Saved in:
Bibliographic Details
Main Authors: SAIED, Mohamed Aymen, OUNI, Ali, SAHRAOUI, Houari A., KULA, Raula Gaikovina, INOUE, Katsuro, LO, David
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2018
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4303
https://ink.library.smu.edu.sg/context/sis_research/article/5306/viewcontent/1_s20_S0164121218301699_main__1_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5306
record_format dspace
spelling sg-smu-ink.sis_research-53062020-01-27T10:10:01Z Improving reusability of software libraries through usage pattern mining SAIED, Mohamed Aymen OUNI, Ali SAHRAOUI, Houari A. KULA, Raula Gaikovina INOUE, Katsuro LO, David Modern software systems are increasingly dependent on third-party libraries. It is widely recognized that using mature and well-tested third-party libraries can improve developers’ productivity, reduce time-to-market, and produce more reliable software. Today’s open-source repositories provide a wide range of libraries that can be freely downloaded and used. However, as software libraries are documented separately but intended to be used together, developers are unlikely to fully take advantage of these reuse opportunities. In this paper, we present a novel approach to automatically identify third-party library usage patterns, i.e., collections of libraries that are commonly used together by developers. Our approach employs a hierarchical clustering technique to group together software libraries based on external client usage. To evaluate our approach, we mined a large set of over 6000 popular libraries from Maven Central Repository and investigated their usage by over 38,000 client systems from the Github repository. Our experiments show that our technique is able to detect the majority (77%) of highly consistent and cohesive library usage patterns across a considerable number of client systems. 2018-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4303 info:doi/10.1016/j.jss.2018.08.032 https://ink.library.smu.edu.sg/context/sis_research/article/5306/viewcontent/1_s20_S0164121218301699_main__1_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Software libraries Software reuse Clustering Usage patterns Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Software libraries
Software reuse
Clustering
Usage patterns
Software Engineering
spellingShingle Software libraries
Software reuse
Clustering
Usage patterns
Software Engineering
SAIED, Mohamed Aymen
OUNI, Ali
SAHRAOUI, Houari A.
KULA, Raula Gaikovina
INOUE, Katsuro
LO, David
Improving reusability of software libraries through usage pattern mining
description Modern software systems are increasingly dependent on third-party libraries. It is widely recognized that using mature and well-tested third-party libraries can improve developers’ productivity, reduce time-to-market, and produce more reliable software. Today’s open-source repositories provide a wide range of libraries that can be freely downloaded and used. However, as software libraries are documented separately but intended to be used together, developers are unlikely to fully take advantage of these reuse opportunities. In this paper, we present a novel approach to automatically identify third-party library usage patterns, i.e., collections of libraries that are commonly used together by developers. Our approach employs a hierarchical clustering technique to group together software libraries based on external client usage. To evaluate our approach, we mined a large set of over 6000 popular libraries from Maven Central Repository and investigated their usage by over 38,000 client systems from the Github repository. Our experiments show that our technique is able to detect the majority (77%) of highly consistent and cohesive library usage patterns across a considerable number of client systems.
format text
author SAIED, Mohamed Aymen
OUNI, Ali
SAHRAOUI, Houari A.
KULA, Raula Gaikovina
INOUE, Katsuro
LO, David
author_facet SAIED, Mohamed Aymen
OUNI, Ali
SAHRAOUI, Houari A.
KULA, Raula Gaikovina
INOUE, Katsuro
LO, David
author_sort SAIED, Mohamed Aymen
title Improving reusability of software libraries through usage pattern mining
title_short Improving reusability of software libraries through usage pattern mining
title_full Improving reusability of software libraries through usage pattern mining
title_fullStr Improving reusability of software libraries through usage pattern mining
title_full_unstemmed Improving reusability of software libraries through usage pattern mining
title_sort improving reusability of software libraries through usage pattern mining
publisher Institutional Knowledge at Singapore Management University
publishDate 2018
url https://ink.library.smu.edu.sg/sis_research/4303
https://ink.library.smu.edu.sg/context/sis_research/article/5306/viewcontent/1_s20_S0164121218301699_main__1_.pdf
_version_ 1770574604694192128