Watermarking deep reinforcement learning

Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of p...

Full description

Saved in:
Bibliographic Details
Main Author: Sim, Ming Jie
Other Authors: Zhang Tianwei
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148747
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-148747
record_format dspace
spelling sg-ntu-dr.10356-1487472021-05-17T02:05:29Z Watermarking deep reinforcement learning Sim, Ming Jie Zhang Tianwei School of Computer Science and Engineering Zhang Tianwei tianwei.zhang@ntu.edu.sg Engineering::Computer science and engineering::Software Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of proprietary data. The models produced are valuable Intellectual Property (IP) to the designer of the model and need to be secured to preserve the owner’s competitive edge. This report presents a watermarking scheme on DRL as well as evaluates how the model parameters of the Deep Q-Network (DQN) policy used in DRL can affect the watermark performance. The watermarking scheme involves embedding a unique identifier within the policy where a unique sequence of state transitions is produced, while having minimum influence on the policy performance. The digital watermark can help to detect unauthorized duplications of proprietary policies. The watermarking is done on DQN policy in DRL and is trained in the Cartpole environment. Bachelor of Engineering (Computer Engineering) 2021-05-17T02:05:29Z 2021-05-17T02:05:29Z 2021 Final Year Project (FYP) Sim, M. J. (2021). Watermarking deep reinforcement learning. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148747 https://hdl.handle.net/10356/148747 en SCSE20-0453 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Software
spellingShingle Engineering::Computer science and engineering::Software
Sim, Ming Jie
Watermarking deep reinforcement learning
description Deep Reinforcement Learning (DRL) is becoming more widely researched on as it is increasingly useful in solving several complicated problems, such as robotics control and autonomous driving. DRL models are usually built with the help of enormous computational resources that process large amount of proprietary data. The models produced are valuable Intellectual Property (IP) to the designer of the model and need to be secured to preserve the owner’s competitive edge. This report presents a watermarking scheme on DRL as well as evaluates how the model parameters of the Deep Q-Network (DQN) policy used in DRL can affect the watermark performance. The watermarking scheme involves embedding a unique identifier within the policy where a unique sequence of state transitions is produced, while having minimum influence on the policy performance. The digital watermark can help to detect unauthorized duplications of proprietary policies. The watermarking is done on DQN policy in DRL and is trained in the Cartpole environment.
author2 Zhang Tianwei
author_facet Zhang Tianwei
Sim, Ming Jie
format Final Year Project
author Sim, Ming Jie
author_sort Sim, Ming Jie
title Watermarking deep reinforcement learning
title_short Watermarking deep reinforcement learning
title_full Watermarking deep reinforcement learning
title_fullStr Watermarking deep reinforcement learning
title_full_unstemmed Watermarking deep reinforcement learning
title_sort watermarking deep reinforcement learning
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/148747
_version_ 1701270599458881536