CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference

Intelligent virtual agents are used to accomplish complex multi-modal tasks such as human instruction comprehension in mixed-reality environments by increasingly adopting richer, energy-intensive sensors and processing pipelines. In such applications, the context for activating sensors and processin...

Full description

Saved in:
Bibliographic Details
Main Authors: WEERAKOON, Dulanga, SUBBARAJU, Vigneshwaran, LIM, Joo Hwee, MISRA, Archan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9360
https://ink.library.smu.edu.sg/context/sis_research/article/10360/viewcontent/CAS_camready.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10360
record_format dspace
spelling sg-smu-ink.sis_research-103602024-10-30T06:04:25Z CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference WEERAKOON, Dulanga SUBBARAJU, Vigneshwaran LIM, Joo Hwee MISRA, Archan Intelligent virtual agents are used to accomplish complex multi-modal tasks such as human instruction comprehension in mixed-reality environments by increasingly adopting richer, energy-intensive sensors and processing pipelines. In such applications, the context for activating sensors and processing blocks required to accomplish a given task instance is usually manifested via multiple sensing modes. Based on this observation, we introduce a novel Commit-and-Switch ( CAS ) paradigm that simultaneously seeks to reduce both sensing and processing energy. In CAS , we first commit to a low-energy computational pipeline with a subset of available sensors. Then, the task context estimated by this pipeline is used to optionally switch to another energy-intensive DNN pipeline and activate additional sensors. We demonstrate how CAS's paradigm of interweaving DNN computation and sensor triggering can be instantiated principally by constructing multi-head DNN models and jointly optimizing the accuracy and sensing costs associated with different heads. We exemplify CAS via the development of the RealGIN-MH model for multi-modal target acquisition tasks, a core enabler of immersive human-agent interaction. RealGIN-MH achieves 12.9x reduction in energy overheads, while outperforming baseline dynamic model optimization approaches. 2024-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9360 info:doi/10.1109/LRA.2024.3469813 https://ink.library.smu.edu.sg/context/sis_research/article/10360/viewcontent/CAS_camready.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Deep Learning for Visual Perception Embedded Systems for Robotic and Automation Human-Robot Collaboration RGB-D Perception Vision and Sensor-Based Control Artificial Intelligence and Robotics
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Deep Learning for Visual Perception
Embedded Systems for Robotic and Automation
Human-Robot Collaboration
RGB-D Perception
Vision and Sensor-Based Control
Artificial Intelligence and Robotics
spellingShingle Deep Learning for Visual Perception
Embedded Systems for Robotic and Automation
Human-Robot Collaboration
RGB-D Perception
Vision and Sensor-Based Control
Artificial Intelligence and Robotics
WEERAKOON, Dulanga
SUBBARAJU, Vigneshwaran
LIM, Joo Hwee
MISRA, Archan
CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference
description Intelligent virtual agents are used to accomplish complex multi-modal tasks such as human instruction comprehension in mixed-reality environments by increasingly adopting richer, energy-intensive sensors and processing pipelines. In such applications, the context for activating sensors and processing blocks required to accomplish a given task instance is usually manifested via multiple sensing modes. Based on this observation, we introduce a novel Commit-and-Switch ( CAS ) paradigm that simultaneously seeks to reduce both sensing and processing energy. In CAS , we first commit to a low-energy computational pipeline with a subset of available sensors. Then, the task context estimated by this pipeline is used to optionally switch to another energy-intensive DNN pipeline and activate additional sensors. We demonstrate how CAS's paradigm of interweaving DNN computation and sensor triggering can be instantiated principally by constructing multi-head DNN models and jointly optimizing the accuracy and sensing costs associated with different heads. We exemplify CAS via the development of the RealGIN-MH model for multi-modal target acquisition tasks, a core enabler of immersive human-agent interaction. RealGIN-MH achieves 12.9x reduction in energy overheads, while outperforming baseline dynamic model optimization approaches.
format text
author WEERAKOON, Dulanga
SUBBARAJU, Vigneshwaran
LIM, Joo Hwee
MISRA, Archan
author_facet WEERAKOON, Dulanga
SUBBARAJU, Vigneshwaran
LIM, Joo Hwee
MISRA, Archan
author_sort WEERAKOON, Dulanga
title CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference
title_short CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference
title_full CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference
title_fullStr CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference
title_full_unstemmed CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference
title_sort cas: fusing dnn optimization & adaptive sensing for energy-efficient multi-modal inference
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9360
https://ink.library.smu.edu.sg/context/sis_research/article/10360/viewcontent/CAS_camready.pdf
_version_ 1814777825083785216