Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study

The complexity and dynamism of microservice systems pose unique challenges to a variety of software engineering tasks such as fault analysis and debugging. In spite of the prevalence and importance of microservices in industry, there is limited research on the fault analysis and debugging of microse...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHOU, Xiang, PENG, Xin, XIE, Tao, SUN, Jun, JI, Chao, LI, Wenhai, DING, Dan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4845
https://ink.library.smu.edu.sg/context/sis_research/article/5848/viewcontent/fault_analysis__PV.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5848
record_format dspace
spelling sg-smu-ink.sis_research-58482022-07-26T08:00:22Z Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study ZHOU, Xiang PENG, Xin XIE, Tao SUN, Jun JI, Chao LI, Wenhai DING, Dan The complexity and dynamism of microservice systems pose unique challenges to a variety of software engineering tasks such as fault analysis and debugging. In spite of the prevalence and importance of microservices in industry, there is limited research on the fault analysis and debugging of microservice systems. To fill this gap, we conduct an industrial survey to learn typical faults of microservice systems, current practice of debugging, and the challenges faced by developers in practice. We then develop a medium-size benchmark microservice system (being the largest and most complex open source microservice system within our knowledge) and replicate 22 industrial fault cases on it. Based on the benchmark system and the replicated fault cases, we conduct an empirical study to investigate the effectiveness of existing industrial debugging practices and whether they can be further improved by introducing the state-of-the-art tracing and visualization techniques for distributed systems. The results show that the current industrial practices of microservice debugging can be improved by employing proper tracing and visualization techniques and strategies. Our findings also suggest that there is a strong need for more intelligent trace analysis and visualization, e.g., by combining trace visualization and improved fault localization, and employing data-driven and learning-based recommendation for guided visual exploration and comparison of traces. 2021-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4845 info:doi/10.1109/TSE.2018.2887384 https://ink.library.smu.edu.sg/context/sis_research/article/5848/viewcontent/fault_analysis__PV.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University microservices fault localization tracing visualization debugging Computer and Systems Architecture Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic microservices
fault localization
tracing
visualization
debugging
Computer and Systems Architecture
Software Engineering
spellingShingle microservices
fault localization
tracing
visualization
debugging
Computer and Systems Architecture
Software Engineering
ZHOU, Xiang
PENG, Xin
XIE, Tao
SUN, Jun
JI, Chao
LI, Wenhai
DING, Dan
Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study
description The complexity and dynamism of microservice systems pose unique challenges to a variety of software engineering tasks such as fault analysis and debugging. In spite of the prevalence and importance of microservices in industry, there is limited research on the fault analysis and debugging of microservice systems. To fill this gap, we conduct an industrial survey to learn typical faults of microservice systems, current practice of debugging, and the challenges faced by developers in practice. We then develop a medium-size benchmark microservice system (being the largest and most complex open source microservice system within our knowledge) and replicate 22 industrial fault cases on it. Based on the benchmark system and the replicated fault cases, we conduct an empirical study to investigate the effectiveness of existing industrial debugging practices and whether they can be further improved by introducing the state-of-the-art tracing and visualization techniques for distributed systems. The results show that the current industrial practices of microservice debugging can be improved by employing proper tracing and visualization techniques and strategies. Our findings also suggest that there is a strong need for more intelligent trace analysis and visualization, e.g., by combining trace visualization and improved fault localization, and employing data-driven and learning-based recommendation for guided visual exploration and comparison of traces.
format text
author ZHOU, Xiang
PENG, Xin
XIE, Tao
SUN, Jun
JI, Chao
LI, Wenhai
DING, Dan
author_facet ZHOU, Xiang
PENG, Xin
XIE, Tao
SUN, Jun
JI, Chao
LI, Wenhai
DING, Dan
author_sort ZHOU, Xiang
title Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study
title_short Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study
title_full Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study
title_fullStr Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study
title_full_unstemmed Fault analysis and debugging of microservice systems: Industrial survey, benchmark system, and empirical study
title_sort fault analysis and debugging of microservice systems: industrial survey, benchmark system, and empirical study
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/4845
https://ink.library.smu.edu.sg/context/sis_research/article/5848/viewcontent/fault_analysis__PV.pdf
_version_ 1770575061437120512