ImageInThat: Manipulating images to convey user instructions to robots

Foundation models are rapidly improving the capability of robots in performing everyday tasks autonomously such as meal preparation, yet robots will still need to be instructed by humans due to model performance, the difficulty of capturing user preferences, and the need for user agency. Robots can...

Full description

Saved in:
Bibliographic Details
Main Authors: MAHADEVAN, Karthik, LEWIS, Blaine, LI, Jiannan, MUTLU, Bilge, TANG, Anthony, GROSSMAN, Tovi
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2025
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/10133
https://ink.library.smu.edu.sg/context/sis_research/article/11133/viewcontent/HRI2025___PhotoManipulator.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English