A learner-verifier framework for neural network controllers and certificates of stochastic systems
Reinforcement learning has received much attention for learning controllers of deterministic systems. We consider a learner-verifer framework for stochastic control systems and survey recent methods that formally guarantee a conjunction of reachability and safety properties. Given a property and a l...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9058 https://ink.library.smu.edu.sg/context/sis_research/article/10061/viewcontent/978_3_031_30823_9.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10061 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-100612024-08-01T15:35:42Z A learner-verifier framework for neural network controllers and certificates of stochastic systems CHATTERJEE, Krishnendu HENZINGER, Thomas A. ZIKELIC, Dorde ZIKELIC, Dorde Reinforcement learning has received much attention for learning controllers of deterministic systems. We consider a learner-verifer framework for stochastic control systems and survey recent methods that formally guarantee a conjunction of reachability and safety properties. Given a property and a lower bound on the probability of the property being satisfied, our framework jointly learns a control policy and a formal certificate to ensure the satisfaction of the property with a desired probability threshold. Both the control policy and the formal certificate are continuous functions from states to reals, which are learned as parameterized neural networks. While in the deterministic case, the certificates are invariant and barrier functions for safety, or Lyapunov and ranking functions for liveness, in the stochastic case the certificates are supermartingales. For certificate verification, we use interval arithmetic abstract interpretation to bound the expected values of neural network functions. 2023-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9058 info:doi/10.1007/978-3-031-30823-9_1 https://ink.library.smu.edu.sg/context/sis_research/article/10061/viewcontent/978_3_031_30823_9.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Learning-based control Stochastic systems Martingales Formal verification Databases and Information Systems OS and Networks |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Learning-based control Stochastic systems Martingales Formal verification Databases and Information Systems OS and Networks |
spellingShingle |
Learning-based control Stochastic systems Martingales Formal verification Databases and Information Systems OS and Networks CHATTERJEE, Krishnendu HENZINGER, Thomas A. ZIKELIC, Dorde ZIKELIC, Dorde A learner-verifier framework for neural network controllers and certificates of stochastic systems |
description |
Reinforcement learning has received much attention for learning controllers of deterministic systems. We consider a learner-verifer framework for stochastic control systems and survey recent methods that formally guarantee a conjunction of reachability and safety properties. Given a property and a lower bound on the probability of the property being satisfied, our framework jointly learns a control policy and a formal certificate to ensure the satisfaction of the property with a desired probability threshold. Both the control policy and the formal certificate are continuous functions from states to reals, which are learned as parameterized neural networks. While in the deterministic case, the certificates are invariant and barrier functions for safety, or Lyapunov and ranking functions for liveness, in the stochastic case the certificates are supermartingales. For certificate verification, we use interval arithmetic abstract interpretation to bound the expected values of neural network functions. |
format |
text |
author |
CHATTERJEE, Krishnendu HENZINGER, Thomas A. ZIKELIC, Dorde ZIKELIC, Dorde |
author_facet |
CHATTERJEE, Krishnendu HENZINGER, Thomas A. ZIKELIC, Dorde ZIKELIC, Dorde |
author_sort |
CHATTERJEE, Krishnendu |
title |
A learner-verifier framework for neural network controllers and certificates of stochastic systems |
title_short |
A learner-verifier framework for neural network controllers and certificates of stochastic systems |
title_full |
A learner-verifier framework for neural network controllers and certificates of stochastic systems |
title_fullStr |
A learner-verifier framework for neural network controllers and certificates of stochastic systems |
title_full_unstemmed |
A learner-verifier framework for neural network controllers and certificates of stochastic systems |
title_sort |
learner-verifier framework for neural network controllers and certificates of stochastic systems |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2023 |
url |
https://ink.library.smu.edu.sg/sis_research/9058 https://ink.library.smu.edu.sg/context/sis_research/article/10061/viewcontent/978_3_031_30823_9.pdf |
_version_ |
1814047719785758720 |