Adaptive optimal output tracking of continuous-time systems via output-feedback-based reinforcement learning

Bibliographic Details
Main Authors: Chen, Ci, Xie, Lihua, Xie, Kan, Lewis, Frank L., Xie, Shengli
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/163293
Institution: Nanyang Technological University
Description
Summary: Reinforcement learning provides a powerful tool for designing a satisfactory controller through interactions with the environment. Although off-policy learning algorithms have recently been designed for tracking problems, most of these results either rely on full-state feedback or permit only bounded control errors, which may be neither flexible nor desirable for real-world engineering problems. To address these problems, we propose an output-feedback-based reinforcement learning approach that finds the optimal control solution using input–output data and ensures asymptotic tracking control of continuous-time systems. More specifically, we first propose a dynamical controller, revised from standard output regulation theory, and use it to formulate an optimal output tracking problem. Then, a state observer is used to re-express the system state. Consequently, we address the rank issue of the parameterization matrix and analyze the state re-expression error, both of which are crucial for transforming the off-policy learning into an output-feedback form. A comprehensive simulation study demonstrates the effectiveness of the proposed approach.