How are deep learning models similar? An empirical study on clone analysis of deep learning software

Deep learning (DL) has been successfully applied to many cutting-edge applications, e.g., image processing, speech recognition, and natural language processing. As more and more DL software is made open-sourced, publicly available, and organized in model repositories and stores (Model Zoo, ModelDepo...

Full description

Saved in:

Bibliographic Details
Main Authors:	WU, Xiongfei, QIN, Liangyu, YU, Bing, XIE, Xiaofei, MA, Lei, XUE, Yinxing, LIU, Yang, ZHAO, Jianjun
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2020
Subjects:	code clone detection deep learning model similarity Software Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/7111 https://ink.library.smu.edu.sg/context/sis_research/article/8114/viewcontent/3387904.3389254.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-8114
record_format	dspace
spelling	sg-smu-ink.sis_research-81142022-04-14T11:43:46Z How are deep learning models similar? An empirical study on clone analysis of deep learning software WU, Xiongfei QIN, Liangyu YU, Bing XIE, Xiaofei MA, Lei XUE, Yinxing LIU, Yang ZHAO, Jianjun Deep learning (DL) has been successfully applied to many cutting-edge applications, e.g., image processing, speech recognition, and natural language processing. As more and more DL software is made open-sourced, publicly available, and organized in model repositories and stores (Model Zoo, ModelDepot), there comes a need to understand the relationships of these DL models regarding their maintenance and evolution tasks. Although clone analysis has been extensively studied for traditional software, up to the present, clone analysis has not been investigated for DL software. Since DL software adopts the data-driven development paradigm, it is still not clear whether and to what extent the clone analysis techniques of traditional software could be adapted to DL software.In this paper, we initiate the first step on the clone analysis of DL software at three different levels, i.e., source code level, model structural level, and input/output (I/0)-semantic level, which would be a key in DL software management, maintenance and evolution. We intend to investigate the similarity between these DL models from clone analysis perspective. Several tools and metrics are selected to conduct clone analysis of DL software at three different levels. Our study on two popular datasets (i.e., MNIST and CIFAR-10) and eight DL models of five architectural families (i.e., LeNet, ResNet, DenseNet, AlexNet, and VGG) shows that: 1). the three levels of similarity analysis are generally adequate to find clones between DL models ranging from structural to semantic; 2). different measures for clone analysis used at each level yield similar results; 3) clone analysis of one single level may not render a complete picture of the similarity of DL models. Our findings open up several research opportunities worth further exploration towards better understanding and more effective clone analysis of DL software. 2020-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7111 info:doi/10.1145/3387904.3389254 https://ink.library.smu.edu.sg/context/sis_research/article/8114/viewcontent/3387904.3389254.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University code clone detection deep learning model similarity Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	code clone detection deep learning model similarity Software Engineering
spellingShingle	code clone detection deep learning model similarity Software Engineering WU, Xiongfei QIN, Liangyu YU, Bing XIE, Xiaofei MA, Lei XUE, Yinxing LIU, Yang ZHAO, Jianjun How are deep learning models similar? An empirical study on clone analysis of deep learning software
description	Deep learning (DL) has been successfully applied to many cutting-edge applications, e.g., image processing, speech recognition, and natural language processing. As more and more DL software is made open-sourced, publicly available, and organized in model repositories and stores (Model Zoo, ModelDepot), there comes a need to understand the relationships of these DL models regarding their maintenance and evolution tasks. Although clone analysis has been extensively studied for traditional software, up to the present, clone analysis has not been investigated for DL software. Since DL software adopts the data-driven development paradigm, it is still not clear whether and to what extent the clone analysis techniques of traditional software could be adapted to DL software.In this paper, we initiate the first step on the clone analysis of DL software at three different levels, i.e., source code level, model structural level, and input/output (I/0)-semantic level, which would be a key in DL software management, maintenance and evolution. We intend to investigate the similarity between these DL models from clone analysis perspective. Several tools and metrics are selected to conduct clone analysis of DL software at three different levels. Our study on two popular datasets (i.e., MNIST and CIFAR-10) and eight DL models of five architectural families (i.e., LeNet, ResNet, DenseNet, AlexNet, and VGG) shows that: 1). the three levels of similarity analysis are generally adequate to find clones between DL models ranging from structural to semantic; 2). different measures for clone analysis used at each level yield similar results; 3) clone analysis of one single level may not render a complete picture of the similarity of DL models. Our findings open up several research opportunities worth further exploration towards better understanding and more effective clone analysis of DL software.
format	text
author	WU, Xiongfei QIN, Liangyu YU, Bing XIE, Xiaofei MA, Lei XUE, Yinxing LIU, Yang ZHAO, Jianjun
author_facet	WU, Xiongfei QIN, Liangyu YU, Bing XIE, Xiaofei MA, Lei XUE, Yinxing LIU, Yang ZHAO, Jianjun
author_sort	WU, Xiongfei
title	How are deep learning models similar? An empirical study on clone analysis of deep learning software
title_short	How are deep learning models similar? An empirical study on clone analysis of deep learning software
title_full	How are deep learning models similar? An empirical study on clone analysis of deep learning software
title_fullStr	How are deep learning models similar? An empirical study on clone analysis of deep learning software
title_full_unstemmed	How are deep learning models similar? An empirical study on clone analysis of deep learning software
title_sort	how are deep learning models similar? an empirical study on clone analysis of deep learning software
publisher	Institutional Knowledge at Singapore Management University
publishDate	2020
url	https://ink.library.smu.edu.sg/sis_research/7111 https://ink.library.smu.edu.sg/context/sis_research/article/8114/viewcontent/3387904.3389254.pdf
_version_	1770576214505816064

How are deep learning models similar? An empirical study on clone analysis of deep learning software

Similar Items