Reinforcement learning based online request scheduling framework for workload-adaptive edge deep learning inference

Reinforcement learning based online request scheduling framework for workload-adaptive edge deep learning inference

The recent advances of deep learning in various mobile and Internet-of-Things applications, coupled with the emergence of edge computing, have led to a strong trend of performing deep learning inference on the edge servers located physically close to the end devices. This trend presents the challeng...

Full description

Saved in:

Bibliographic Details
Main Authors:	TAN, Xinrui, LI, Hongjia, XIE, Xiaofei, GUO, Lu, ANSARI, Nirwan, HUANG, Xueqing, WANG, Liming, XU, Zhen, LIU, Yang
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2024
Subjects:	Edge computing deep learning inference serving systems efficient deep learning inference reinforcement learning Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing
Online Access:	https://ink.library.smu.edu.sg/sis_research/9442 https://ink.library.smu.edu.sg/context/sis_research/article/10442/viewcontent/RL_OnlineRequest_av.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Similar Items

Cross-lingual transfer learning for statistical type inference
by: LI, Zhiming, et al.
Published: (2022)

Pruning-aware merging for efficient multitask inference
by: GAO, Dawei, et al.
Published: (2021)

Edge-cloud cooperation for DNN inference via reinforcement learning and supervised learning
by: Zhang, Tinghao, et al.
Published: (2024)

Secure and verifiable inference in deep neural networks
by: XU, Guowen, et al.
Published: (2020)

Goal modelling for deep reinforcement learning agents
by: Leung, Jonathan, et al.
Published: (2022)

Parameterized DNN design for identifying the resource limitations of edge deep learning hardware
by: Aung, Shin Thant
Published: (2024)

Applications of multi-agent reinforcement learning in future internet: a comprehensive survey
by: Li, Tianxu, et al.
Published: (2022)

LANGUAGE LEARNING OF INDUCTIVE INFERENCE MACHINES WITH MEMORY LIMITATION
by: MA JUNQI
Published: (2018)

EDLAB : a benchmark for edge deep learning accelerators
by: Kong, Hao, et al.
Published: (2022)

NEW ADVANCES IN BAYESIAN INFERENCE FOR GAUSSIAN PROCESS AND DEEP GAUSSIAN PROCESS MODELS
by: YU HAIBIN
Published: (2020)

DEVELOPMENT OF DEEP LEARNING METHODS FOR INFERRING TISSUE DYNAMICS AND FORCES
by: MURAT SHAGIROV
Published: (2023)

Variants of Partial Learning in Inductive Inference
by: GAO ZIYUAN
Published: (2012)

Learning and extending sublanguages
by: Jain, S., et al.
Published: (2013)

Learning to schedule joint radar-communication requests for optimal information freshness
by: Lee, Joash, et al.
Published: (2021)

Learning all subfunctions of a function
by: Jain, S., et al.
Published: (2013)

Knowledge Transfer for Deep Reinforcement Learning with Hierarchical Experience Replay
by: Yin, Haiyan, et al.
Published: (2017)

Which channel to ask my question? : personalized customer service request stream routing using deep reinforcement learning
by: Liu, Zining, et al.
Published: (2019)

Deep reinforcement learning for soft, flexible robots: Brief review with impending challenges
by: Bhagat, S., et al.
Published: (2021)

TravellingFL: communication efficient peer-to-peer federated learning
by: Gupta, Vansh, et al.
Published: (2024)

Stealing deep reinforcement learning models for fun and profit
by: CHEN, Kangjie, et al.
Published: (2021)

Action selection for composable modular deep reinforcement learning
by: GUPTA, Vaibhav, et al.
Published: (2021)

Learning languages from positive data and negative counterexamples
by: Jain, S., et al.
Published: (2013)

DEEP REINFORCEMENT LEARNING FOR SOLVING VEHICLE ROUTING PROBLEMS
by: LI JINGWEN
Published: (2022)

Edge accelerator for lifelong deep learning using streaming linear discriminant analysis
by: Piyasena, Duvindu, et al.
Published: (2024)

Practical learning synergies between pushing and grasping based on DRL
by: Huang, Yuanning
Published: (2024)

Device scheduling and assignment in hierarchical federated learning for Internet of Thing
by: Zhang, Tinghao, et al.
Published: (2024)

GENERALIZATION TECHNIQUES IN DEEP REINFORCEMENT LEARNING
by: MUHAMMAD RIZKI AULIA RAHMAN MAULANA
Published: (2023)

Bringing AI to edge : from deep learning's perspective
by: Liu, Di, et al.
Published: (2022)

Robust learning of automatic classes of languages
by: Jain, S., et al.
Published: (2013)

On the role of update constraints and text-types in iterative learning
by: Jain S., et al.
Published: (2020)

An empirical study towards characterizing deep learning development and deployment across different frameworks and platforms
by: GUO, Qianyu, et al.
Published: (2019)

TOWARDS EFFICIENT OBJECT DETECTION WITH DEEP LEARNING
by: WANG, TAO
Published: (2023)

Action selection for composable modular deep reinforcement learning
by: GUPTA, Vaibhav, et al.
Published: (2021)

NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
by: Zhang, Yuhang, et al.
Published: (2024)

Towards explaining sequences of actions in multi-agent deep reinforcement learning models
by: KHAING, Phyo Wai, et al.
Published: (2023)

Dynamically growing neural network architecture for lifelong deep learning on the edge
by: Piyasena, Duvindu, et al.
Published: (2021)

Prescribed learning of indexed families
by: Jain, S., et al.
Published: (2013)

VARIATIONAL DISTRIBUTION DESIGNS FOR APPROXIMATE THOMPSON SAMPLING IN DEEP REINFORCEMENT LEARNING
by: SIDDHARTH ARAVINDAN
Published: (2022)

Deep-attack over the deep reinforcement learning
by: Li, Yang, et al.
Published: (2022)

Deep reinforcement learning for autonomous cyber operation
by: Yong, Hou Zhong
Published: (2024)