CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference
Intelligent virtual agents are used to accomplish complex multi-modal tasks such as human instruction comprehension in mixed-reality environments by increasingly adopting richer, energy-intensive sensors and processing pipelines. In such applications, the context for activating sensors and processin...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9360 https://ink.library.smu.edu.sg/context/sis_research/article/10360/viewcontent/CAS_camready.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10360 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-103602024-10-30T06:04:25Z CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference WEERAKOON, Dulanga SUBBARAJU, Vigneshwaran LIM, Joo Hwee MISRA, Archan Intelligent virtual agents are used to accomplish complex multi-modal tasks such as human instruction comprehension in mixed-reality environments by increasingly adopting richer, energy-intensive sensors and processing pipelines. In such applications, the context for activating sensors and processing blocks required to accomplish a given task instance is usually manifested via multiple sensing modes. Based on this observation, we introduce a novel Commit-and-Switch ( CAS ) paradigm that simultaneously seeks to reduce both sensing and processing energy. In CAS , we first commit to a low-energy computational pipeline with a subset of available sensors. Then, the task context estimated by this pipeline is used to optionally switch to another energy-intensive DNN pipeline and activate additional sensors. We demonstrate how CAS's paradigm of interweaving DNN computation and sensor triggering can be instantiated principally by constructing multi-head DNN models and jointly optimizing the accuracy and sensing costs associated with different heads. We exemplify CAS via the development of the RealGIN-MH model for multi-modal target acquisition tasks, a core enabler of immersive human-agent interaction. RealGIN-MH achieves 12.9x reduction in energy overheads, while outperforming baseline dynamic model optimization approaches. 2024-11-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9360 info:doi/10.1109/LRA.2024.3469813 https://ink.library.smu.edu.sg/context/sis_research/article/10360/viewcontent/CAS_camready.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Deep Learning for Visual Perception Embedded Systems for Robotic and Automation Human-Robot Collaboration RGB-D Perception Vision and Sensor-Based Control Artificial Intelligence and Robotics |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Deep Learning for Visual Perception Embedded Systems for Robotic and Automation Human-Robot Collaboration RGB-D Perception Vision and Sensor-Based Control Artificial Intelligence and Robotics |
spellingShingle |
Deep Learning for Visual Perception Embedded Systems for Robotic and Automation Human-Robot Collaboration RGB-D Perception Vision and Sensor-Based Control Artificial Intelligence and Robotics WEERAKOON, Dulanga SUBBARAJU, Vigneshwaran LIM, Joo Hwee MISRA, Archan CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference |
description |
Intelligent virtual agents are used to accomplish complex multi-modal tasks such as human instruction comprehension in mixed-reality environments by increasingly adopting richer, energy-intensive sensors and processing pipelines. In such applications, the context for activating sensors and processing blocks required to accomplish a given task instance is usually manifested via multiple sensing modes. Based on this observation, we introduce a novel Commit-and-Switch ( CAS ) paradigm that simultaneously seeks to reduce both sensing and processing energy. In CAS , we first commit to a low-energy computational pipeline with a subset of available sensors. Then, the task context estimated by this pipeline is used to optionally switch to another energy-intensive DNN pipeline and activate additional sensors. We demonstrate how CAS's paradigm of interweaving DNN computation and sensor triggering can be instantiated principally by constructing multi-head DNN models and jointly optimizing the accuracy and sensing costs associated with different heads. We exemplify CAS via the development of the RealGIN-MH model for multi-modal target acquisition tasks, a core enabler of immersive human-agent interaction. RealGIN-MH achieves 12.9x reduction in energy overheads, while outperforming baseline dynamic model optimization approaches. |
format |
text |
author |
WEERAKOON, Dulanga SUBBARAJU, Vigneshwaran LIM, Joo Hwee MISRA, Archan |
author_facet |
WEERAKOON, Dulanga SUBBARAJU, Vigneshwaran LIM, Joo Hwee MISRA, Archan |
author_sort |
WEERAKOON, Dulanga |
title |
CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference |
title_short |
CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference |
title_full |
CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference |
title_fullStr |
CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference |
title_full_unstemmed |
CAS: Fusing DNN optimization & adaptive sensing for energy-efficient multi-modal inference |
title_sort |
cas: fusing dnn optimization & adaptive sensing for energy-efficient multi-modal inference |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/9360 https://ink.library.smu.edu.sg/context/sis_research/article/10360/viewcontent/CAS_camready.pdf |
_version_ |
1814777825083785216 |