A learner-verifier framework for neural network controllers and certificates of stochastic systems

Reinforcement learning has received much attention for learning controllers of deterministic systems. We consider a learner-verifer framework for stochastic control systems and survey recent methods that formally guarantee a conjunction of reachability and safety properties. Given a property and a l...

Full description

Saved in:
Bibliographic Details
Main Authors: CHATTERJEE, Krishnendu, HENZINGER, Thomas A., ZIKELIC, Dorde
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9058
https://ink.library.smu.edu.sg/context/sis_research/article/10061/viewcontent/978_3_031_30823_9.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10061
record_format dspace
spelling sg-smu-ink.sis_research-100612024-08-01T15:35:42Z A learner-verifier framework for neural network controllers and certificates of stochastic systems CHATTERJEE, Krishnendu HENZINGER, Thomas A. ZIKELIC, Dorde ZIKELIC, Dorde Reinforcement learning has received much attention for learning controllers of deterministic systems. We consider a learner-verifer framework for stochastic control systems and survey recent methods that formally guarantee a conjunction of reachability and safety properties. Given a property and a lower bound on the probability of the property being satisfied, our framework jointly learns a control policy and a formal certificate to ensure the satisfaction of the property with a desired probability threshold. Both the control policy and the formal certificate are continuous functions from states to reals, which are learned as parameterized neural networks. While in the deterministic case, the certificates are invariant and barrier functions for safety, or Lyapunov and ranking functions for liveness, in the stochastic case the certificates are supermartingales. For certificate verification, we use interval arithmetic abstract interpretation to bound the expected values of neural network functions. 2023-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9058 info:doi/10.1007/978-3-031-30823-9_1 https://ink.library.smu.edu.sg/context/sis_research/article/10061/viewcontent/978_3_031_30823_9.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Learning-based control Stochastic systems Martingales Formal verification Databases and Information Systems OS and Networks
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Learning-based control
Stochastic systems
Martingales
Formal verification
Databases and Information Systems
OS and Networks
spellingShingle Learning-based control
Stochastic systems
Martingales
Formal verification
Databases and Information Systems
OS and Networks
CHATTERJEE, Krishnendu
HENZINGER, Thomas A.
ZIKELIC, Dorde
ZIKELIC, Dorde
A learner-verifier framework for neural network controllers and certificates of stochastic systems
description Reinforcement learning has received much attention for learning controllers of deterministic systems. We consider a learner-verifer framework for stochastic control systems and survey recent methods that formally guarantee a conjunction of reachability and safety properties. Given a property and a lower bound on the probability of the property being satisfied, our framework jointly learns a control policy and a formal certificate to ensure the satisfaction of the property with a desired probability threshold. Both the control policy and the formal certificate are continuous functions from states to reals, which are learned as parameterized neural networks. While in the deterministic case, the certificates are invariant and barrier functions for safety, or Lyapunov and ranking functions for liveness, in the stochastic case the certificates are supermartingales. For certificate verification, we use interval arithmetic abstract interpretation to bound the expected values of neural network functions.
format text
author CHATTERJEE, Krishnendu
HENZINGER, Thomas A.
ZIKELIC, Dorde
ZIKELIC, Dorde
author_facet CHATTERJEE, Krishnendu
HENZINGER, Thomas A.
ZIKELIC, Dorde
ZIKELIC, Dorde
author_sort CHATTERJEE, Krishnendu
title A learner-verifier framework for neural network controllers and certificates of stochastic systems
title_short A learner-verifier framework for neural network controllers and certificates of stochastic systems
title_full A learner-verifier framework for neural network controllers and certificates of stochastic systems
title_fullStr A learner-verifier framework for neural network controllers and certificates of stochastic systems
title_full_unstemmed A learner-verifier framework for neural network controllers and certificates of stochastic systems
title_sort learner-verifier framework for neural network controllers and certificates of stochastic systems
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/9058
https://ink.library.smu.edu.sg/context/sis_research/article/10061/viewcontent/978_3_031_30823_9.pdf
_version_ 1814047719785758720