Watermarking deep reinforcement learning

Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of p...

Full description

Saved in:

Bibliographic Details
Main Author:	Sim, Ming Jie
Other Authors:	Zhang Tianwei
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2021
Subjects:	Engineering::Computer science and engineering::Software
Online Access:	https://hdl.handle.net/10356/148747
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-148747
record_format	dspace
spelling	sg-ntu-dr.10356-1487472021-05-17T02:05:29Z Watermarking deep reinforcement learning Sim, Ming Jie Zhang Tianwei School of Computer Science and Engineering Zhang Tianwei tianwei.zhang@ntu.edu.sg Engineering::Computer science and engineering::Software Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of proprietary data. The models produced are valuable Intellectual Property (IP) to the designer of the model and need to be secured to preserve the owner’s competitive edge. This report presents a watermarking scheme on DRL as well as evaluates how the model parameters of the Deep Q-Network (DQN) policy used in DRL can affect the watermark performance. The watermarking scheme involves embedding a unique identifier within the policy where a unique sequence of state transitions is produced, while having minimum influence on the policy performance. The digital watermark can help to detect unauthorized duplications of proprietary policies. The watermarking is done on DQN policy in DRL and is trained in the Cartpole environment. Bachelor of Engineering (Computer Engineering) 2021-05-17T02:05:29Z 2021-05-17T02:05:29Z 2021 Final Year Project (FYP) Sim, M. J. (2021). Watermarking deep reinforcement learning. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148747 https://hdl.handle.net/10356/148747 en SCSE20-0453 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Software
spellingShingle	Engineering::Computer science and engineering::Software Sim, Ming Jie Watermarking deep reinforcement learning
description	Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of proprietary data. The models produced are valuable Intellectual Property (IP) to the designer of the model and need to be secured to preserve the owner’s competitive edge. This report presents a watermarking scheme on DRL as well as evaluates how the model parameters of the Deep Q-Network (DQN) policy used in DRL can affect the watermark performance. The watermarking scheme involves embedding a unique identifier within the policy where a unique sequence of state transitions is produced, while having minimum influence on the policy performance. The digital watermark can help to detect unauthorized duplications of proprietary policies. The watermarking is done on DQN policy in DRL and is trained in the Cartpole environment.
author2	Zhang Tianwei
author_facet	Zhang Tianwei Sim, Ming Jie
format	Final Year Project
author	Sim, Ming Jie
author_sort	Sim, Ming Jie
title	Watermarking deep reinforcement learning
title_short	Watermarking deep reinforcement learning
title_full	Watermarking deep reinforcement learning
title_fullStr	Watermarking deep reinforcement learning
title_full_unstemmed	Watermarking deep reinforcement learning
title_sort	watermarking deep reinforcement learning
publisher	Nanyang Technological University
publishDate	2021
url	https://hdl.handle.net/10356/148747
_version_	1701270599458881536

Watermarking deep reinforcement learning

Similar Items