Imitation learning from demonstration videos
Imitation learning is a challenging and meaningful task to encode prior knowl- edge to provide a motion control policy for guiding robot movement and trajec- tory autonomously to complete specified assignment with a given current state. However, effective translation from prior knowledge to control...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/177091 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Imitation learning is a challenging and meaningful task to encode prior knowl- edge to provide a motion control policy for guiding robot movement and trajec- tory autonomously to complete specified assignment with a given current state. However, effective translation from prior knowledge to control rules remains relatively unexplored. In this dissertation, we introduce an imitation learning method for encoding prior knowledge from dexterous manipulation demonstra- tion videos. Instead of adopting behavior cloning or pure RL algorithms, our model considers two online RL algorithms: 1) Demo Augmented Policy Gra- dients (DAPG) and 2) Generative Adversarial Imitation Learning (GAIL). With the requirements for encoding the finger action in demonstrations, we selected MANO as the baseline of hand pose estimation, and designed a SuperPoint- based module to optimize detection results. Quantitative experimental results show that our framework can exploit hand pose estimation on different dataset effectively and use imitation learning to achieve great overall performance on three defined tasks. Moreover, it has good generalization ability when deployed on unseen objects. Some visual results show that the proposed framework can be applied combining with prior knowledge from demonstration videos, which provides a possible solution for robot’s imitating human behaviors. |
---|