Zeus: interpretable ML-based job scheduling in GPU datacentres
Hardware accelerators such as GPUs are essential for the development of Deep Learning (DL) models - as their training process is compute-intensive. A growing number of organisations have employed expensive multi-tenant GPU clusters to run distributed DL training jobs. Efficient job schedulers are re...
Saved in:
Main Author: | Amrita, Ravishankar |
---|---|
Other Authors: | Zhang Tianwei |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156566 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
Supporting agent-based simulations on GPU
by: Li, Xiaosong
Published: (2016) -
Volume graphics shaders for GPU
by: Muhammad Mobeen Movania
Published: (2012) -
Yard crane scheduling with job arrival time uncertainties
by: Huang, Yixuan
Published: (2015) -
Fast, interactive water wave simulation in gpu for games
by: Chen, Ming
Published: (2011) -
Benchmarking of the popular DL Frameworks over multiple GPU cards on state-of-the-art CNN architectures
by: Kow, Li Ren
Published: (2018)