Watermarking deep reinforcement learning
Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of p...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/148747 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of proprietary data. The models produced are valuable Intellectual Property (IP) to the designer of the model and need to be secured to preserve the owner’s competitive edge. This report presents a watermarking scheme on DRL as well as evaluates how the model parameters of the Deep Q-Network (DQN) policy used in DRL can affect the watermark performance. The watermarking scheme involves embedding a unique identifier within the policy where a unique sequence of state transitions is produced, while having minimum influence on the policy performance. The digital watermark can help to detect unauthorized duplications of proprietary policies. The watermarking is done on DQN policy in DRL and is trained in the Cartpole environment. |
---|