Core utility development for hysia performance optimization

To serve machine learning requests with trained models plays an increasingly important role with the advance and continuous commercialization of machine learning models. Model serving is also the dominant cost in production-scale machine learning systems such as versatile prediction pipelines, compl...

Full description

Saved in:

Bibliographic Details
Main Author:	Zhou, Shengsheng
Other Authors:	Wen Yonggang
Format:	Theses and Dissertations
Language:	English
Published:	2019
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/82835 http://hdl.handle.net/10220/50177
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-82835
record_format	dspace
spelling	sg-ntu-dr.10356-828352020-10-28T08:29:20Z Core utility development for hysia performance optimization Zhou, Shengsheng Wen Yonggang School of Computer Science and Engineering Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence To serve machine learning requests with trained models plays an increasingly important role with the advance and continuous commercialization of machine learning models. Model serving is also the dominant cost in production-scale machine learning systems such as versatile prediction pipelines, complex models, diverse machine learning frameworks and heterogeneous hardware like CPU, GPU and TPU. Serving machine learning pipelines with low latencies for better user experience is the key to the success for an e-commerce product. This becomes more challenging, due to the complex constitutions of model serving, i.e. models, frameworks and hardware accelerators, to serve interactive machine learning workloads. Accessibility, cost and latency are especially challenging to be addressed. Hysia is a multi-modal machine learning model serving framework developed by our team, to remedy such challenges introduced by the complex interactions between models and hardware. Hysia framework addresses acces- sibility, cost and latency issues by providing easy-to-use application interfaces and an intelligent controller which jointly optimizes performance to balance the trade-off between resource consumption and prediction accuracy. This thesis focuses on the design, implementation and benchmarking of the core utility for Hysia framework, i.e. to provide profile information about models and statuses about system resources in order to optimize machine learning pipelines. The core utility plays a significant role for the joint system performance optimization for Hysia. Model profiler and resource monitor form the core utility. The model profiler is designed to profile machine learning models to get their statistics like parameters, memory usage and inference latency. Our design for model profiler unifies the differences among various machine learning platforms and ensures extensibility. The resource monitor is used to monitor the system resource status like memory and GPU utilization. Our resource monitor is capable to retrieve rich system statuses. Both model profiler and resource monitor are designed in a distributed way to improve efficiency and support distributed computation. Master of Engineering 2019-10-16T05:58:13Z 2019-12-06T15:06:34Z 2019-10-16T05:58:13Z 2019-12-06T15:06:34Z 2019 Thesis Zhou, S. (2019). Core utility development for hysia performance optimization. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/82835 http://hdl.handle.net/10220/50177 10.32657/10356/82835 en 74 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Zhou, Shengsheng Core utility development for hysia performance optimization
description	To serve machine learning requests with trained models plays an increasingly important role with the advance and continuous commercialization of machine learning models. Model serving is also the dominant cost in production-scale machine learning systems such as versatile prediction pipelines, complex models, diverse machine learning frameworks and heterogeneous hardware like CPU, GPU and TPU. Serving machine learning pipelines with low latencies for better user experience is the key to the success for an e-commerce product. This becomes more challenging, due to the complex constitutions of model serving, i.e. models, frameworks and hardware accelerators, to serve interactive machine learning workloads. Accessibility, cost and latency are especially challenging to be addressed. Hysia is a multi-modal machine learning model serving framework developed by our team, to remedy such challenges introduced by the complex interactions between models and hardware. Hysia framework addresses acces- sibility, cost and latency issues by providing easy-to-use application interfaces and an intelligent controller which jointly optimizes performance to balance the trade-off between resource consumption and prediction accuracy. This thesis focuses on the design, implementation and benchmarking of the core utility for Hysia framework, i.e. to provide profile information about models and statuses about system resources in order to optimize machine learning pipelines. The core utility plays a significant role for the joint system performance optimization for Hysia. Model profiler and resource monitor form the core utility. The model profiler is designed to profile machine learning models to get their statistics like parameters, memory usage and inference latency. Our design for model profiler unifies the differences among various machine learning platforms and ensures extensibility. The resource monitor is used to monitor the system resource status like memory and GPU utilization. Our resource monitor is capable to retrieve rich system statuses. Both model profiler and resource monitor are designed in a distributed way to improve efficiency and support distributed computation.
author2	Wen Yonggang
author_facet	Wen Yonggang Zhou, Shengsheng
format	Theses and Dissertations
author	Zhou, Shengsheng
author_sort	Zhou, Shengsheng
title	Core utility development for hysia performance optimization
title_short	Core utility development for hysia performance optimization
title_full	Core utility development for hysia performance optimization
title_fullStr	Core utility development for hysia performance optimization
title_full_unstemmed	Core utility development for hysia performance optimization
title_sort	core utility development for hysia performance optimization
publishDate	2019
url	https://hdl.handle.net/10356/82835 http://hdl.handle.net/10220/50177
_version_	1683494210440790016

Core utility development for hysia performance optimization

Similar Items