Rollout approach to sensor scheduling for remote state estimation under integrity attack
We consider the sensor scheduling problem for remote state estimation under integrity attacks. We seek to optimize a trade-off between the energy consumption of communications and the state estimation error covariance when the acknowledgment (ACK) information, sent by the remote estimator to the loc...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/163294 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-163294 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1632942022-11-30T06:31:58Z Rollout approach to sensor scheduling for remote state estimation under integrity attack Liu, Hanxiao Li, Yuchao Johansson, Karl Henrik Mårtensson, Jonas Xie, Lihua School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Energy Consumption Error Covariances We consider the sensor scheduling problem for remote state estimation under integrity attacks. We seek to optimize a trade-off between the energy consumption of communications and the state estimation error covariance when the acknowledgment (ACK) information, sent by the remote estimator to the local sensor, is compromised. The sensor scheduling problem is formulated as an infinite horizon discounted optimal control problem with infinite states. We first analyze the underlying Markov decision process (MDP) and show that the optimal scheduling without ACK attack is of the threshold type. Thus, we can simplify the problem by replacing the original state space with a finite state space. For the simplified MDP, when the ACK is under attack, the problem is modeled as a partially observable Markov decision process (POMDP). We analyze the induced MDP that uses a belief vector as its state for the POMDP. We investigate the properties of the exact optimal solution via contractive models and show that the threshold type of solution for the POMDP cannot be readily obtained. A suboptimal solution is then obtained via a rollout approach, which is a prominent class of reinforcement learning (RL) methods based on approximation in value space. We present two variants of rollout and provide performance bounds of those variants. Finally, numerical examples are used to demonstrate the effectiveness of the proposed rollout methods by comparing them with a finite history window approach that is widely used in RL for POMDP. 2022-11-30T06:31:58Z 2022-11-30T06:31:58Z 2022 Journal Article Liu, H., Li, Y., Johansson, K. H., Mårtensson, J. & Xie, L. (2022). Rollout approach to sensor scheduling for remote state estimation under integrity attack. Automatica, 144, 110473-. https://dx.doi.org/10.1016/j.automatica.2022.110473 0005-1098 https://hdl.handle.net/10356/163294 10.1016/j.automatica.2022.110473 2-s2.0-85134186564 144 110473 en Automatica © 2022 Elsevier Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Energy Consumption Error Covariances |
spellingShingle |
Engineering::Electrical and electronic engineering Energy Consumption Error Covariances Liu, Hanxiao Li, Yuchao Johansson, Karl Henrik Mårtensson, Jonas Xie, Lihua Rollout approach to sensor scheduling for remote state estimation under integrity attack |
description |
We consider the sensor scheduling problem for remote state estimation under integrity attacks. We seek to optimize a trade-off between the energy consumption of communications and the state estimation error covariance when the acknowledgment (ACK) information, sent by the remote estimator to the local sensor, is compromised. The sensor scheduling problem is formulated as an infinite horizon discounted optimal control problem with infinite states. We first analyze the underlying Markov decision process (MDP) and show that the optimal scheduling without ACK attack is of the threshold type. Thus, we can simplify the problem by replacing the original state space with a finite state space. For the simplified MDP, when the ACK is under attack, the problem is modeled as a partially observable Markov decision process (POMDP). We analyze the induced MDP that uses a belief vector as its state for the POMDP. We investigate the properties of the exact optimal solution via contractive models and show that the threshold type of solution for the POMDP cannot be readily obtained. A suboptimal solution is then obtained via a rollout approach, which is a prominent class of reinforcement learning (RL) methods based on approximation in value space. We present two variants of rollout and provide performance bounds of those variants. Finally, numerical examples are used to demonstrate the effectiveness of the proposed rollout methods by comparing them with a finite history window approach that is widely used in RL for POMDP. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Liu, Hanxiao Li, Yuchao Johansson, Karl Henrik Mårtensson, Jonas Xie, Lihua |
format |
Article |
author |
Liu, Hanxiao Li, Yuchao Johansson, Karl Henrik Mårtensson, Jonas Xie, Lihua |
author_sort |
Liu, Hanxiao |
title |
Rollout approach to sensor scheduling for remote state estimation under integrity attack |
title_short |
Rollout approach to sensor scheduling for remote state estimation under integrity attack |
title_full |
Rollout approach to sensor scheduling for remote state estimation under integrity attack |
title_fullStr |
Rollout approach to sensor scheduling for remote state estimation under integrity attack |
title_full_unstemmed |
Rollout approach to sensor scheduling for remote state estimation under integrity attack |
title_sort |
rollout approach to sensor scheduling for remote state estimation under integrity attack |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/163294 |
_version_ |
1751548554527113216 |