Reinforcement learning based online request scheduling framework for workload-adaptive edge deep learning inference
The recent advances of deep learning in various mobile and Internet-of-Things applications, coupled with the emergence of edge computing, have led to a strong trend of performing deep learning inference on the edge servers located physically close to the end devices. This trend presents the challeng...
Saved in:
Main Authors: | TAN, Xinrui, LI, Hongjia, XIE, Xiaofei, GUO, Lu, ANSARI, Nirwan, HUANG, Xueqing, WANG, Liming, XU, Zhen, LIU, Yang |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9442 https://ink.library.smu.edu.sg/context/sis_research/article/10442/viewcontent/RL_OnlineRequest_av.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Cross-lingual transfer learning for statistical type inference
by: LI, Zhiming, et al.
Published: (2022) -
Pruning-aware merging for efficient multitask inference
by: GAO, Dawei, et al.
Published: (2021) -
Edge-cloud cooperation for DNN inference via reinforcement learning and supervised learning
by: Zhang, Tinghao, et al.
Published: (2024) -
Secure and verifiable inference in deep neural networks
by: XU, Guowen, et al.
Published: (2020) -
Goal modelling for deep reinforcement learning agents
by: Leung, Jonathan, et al.
Published: (2022)