Watermarking deep reinforcement learning
Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of p...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/148747 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-148747 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1487472021-05-17T02:05:29Z Watermarking deep reinforcement learning Sim, Ming Jie Zhang Tianwei School of Computer Science and Engineering Zhang Tianwei tianwei.zhang@ntu.edu.sg Engineering::Computer science and engineering::Software Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of proprietary data. The models produced are valuable Intellectual Property (IP) to the designer of the model and need to be secured to preserve the owner’s competitive edge. This report presents a watermarking scheme on DRL as well as evaluates how the model parameters of the Deep Q-Network (DQN) policy used in DRL can affect the watermark performance. The watermarking scheme involves embedding a unique identifier within the policy where a unique sequence of state transitions is produced, while having minimum influence on the policy performance. The digital watermark can help to detect unauthorized duplications of proprietary policies. The watermarking is done on DQN policy in DRL and is trained in the Cartpole environment. Bachelor of Engineering (Computer Engineering) 2021-05-17T02:05:29Z 2021-05-17T02:05:29Z 2021 Final Year Project (FYP) Sim, M. J. (2021). Watermarking deep reinforcement learning. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148747 https://hdl.handle.net/10356/148747 en SCSE20-0453 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Software |
spellingShingle |
Engineering::Computer science and engineering::Software Sim, Ming Jie Watermarking deep reinforcement learning |
description |
Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of proprietary data. The models produced are valuable Intellectual Property (IP) to the designer of the model and need to be secured to preserve the owner’s competitive edge. This report presents a watermarking scheme on DRL as well as evaluates how the model parameters of the Deep Q-Network (DQN) policy used in DRL can affect the watermark performance. The watermarking scheme involves embedding a unique identifier within the policy where a unique sequence of state transitions is produced, while having minimum influence on the policy performance. The digital watermark can help to detect unauthorized duplications of proprietary policies. The watermarking is done on DQN policy in DRL and is trained in the Cartpole environment. |
author2 |
Zhang Tianwei |
author_facet |
Zhang Tianwei Sim, Ming Jie |
format |
Final Year Project |
author |
Sim, Ming Jie |
author_sort |
Sim, Ming Jie |
title |
Watermarking deep reinforcement learning |
title_short |
Watermarking deep reinforcement learning |
title_full |
Watermarking deep reinforcement learning |
title_fullStr |
Watermarking deep reinforcement learning |
title_full_unstemmed |
Watermarking deep reinforcement learning |
title_sort |
watermarking deep reinforcement learning |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/148747 |
_version_ |
1701270599458881536 |