Two can play that game: An adversarial evaluation of a cyber-alert inspection system

Cyber-security is an important societal concern. Cyber-attacks have increased both in number and in the extent of damage caused by each attack. Large organizations operate a Cyber Security Operation Center (CSOC), which forms the first line of cyber-defense. The inspection of cyber-alerts is a critical part of CSOC operations (the defender, or blue team). Recent work proposed a reinforcement learning (RL) based approach to the defender's decision-making that prevents the cyber-alert queue from growing large and overwhelming the defender. In this article, we perform a red-team (adversarial) evaluation of this approach. Given recent attacks on learning-based decision-making systems, it is all the more important to test the limits of the defender's RL approach. Toward that end, we learn several adversarial alert-generation policies and the best response against them for various defender inspection policies. Surprisingly, we find the defender's policies to be quite robust to the attacker's best response. To explain this observation, we extend the earlier defender's RL model to a game model with adversarial RL and show that there exist defender policies that are robust against any adversarial policy. We also derive a competitive baseline from the game-theoretic model and compare it to the defender's RL approach. However, when we go further and exploit the assumptions made in the Markov Decision Process (MDP) underlying the defender's RL model, we discover an attacker policy that overwhelms the defender. We use a double-oracle-like approach to retrain the defender with episodes from this discovered attacker policy. This made the defender robust to the discovered attacker policy, and no further harmful attacker policies were discovered. Overall, adversarial RL and the double-oracle approach in RL are general techniques that are applicable to other uses of RL in adversarial environments.
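At a high level, the double-oracle-style retraining described in the abstract alternates between (i) training the defender against a pool of known attacker policies and (ii) searching for a new attacker policy that still overwhelms the trained defender, stopping when no harmful attacker policy can be found. The code below is a minimal illustrative sketch of that loop only; the function names (train_defender_rl, attacker_best_response, backlog) and the scalar stand-ins for policies are hypothetical and do not reproduce the authors' MDP, alert model, or RL training.

# Minimal double-oracle-style loop (illustrative sketch; all functions and the
# scalar "policies" below are hypothetical stand-ins, not the paper's implementation).

def train_defender_rl(attacker_pool):
    """Hypothetical stand-in: the defender learns to cover the strongest attacker
    policy seen so far. A real implementation would run RL on episodes generated
    by the pooled attacker policies."""
    return max(attacker_pool)

def attacker_best_response(defender_policy):
    """Hypothetical stand-in: the attacker probes slightly beyond the defender's
    current coverage, up to a fixed budget of 1.0."""
    return min(1.0, defender_policy + 0.2)

def backlog(defender_policy, attacker_policy):
    """Hypothetical payoff: how much the attacker overwhelms the defender
    (e.g., growth of the uninspected-alert queue)."""
    return max(0.0, attacker_policy - defender_policy)

attacker_pool = [0.5]  # start from one known attacker policy
for iteration in range(20):
    defender = train_defender_rl(attacker_pool)       # oracle 1: defender response to the pool
    new_attacker = attacker_best_response(defender)   # oracle 2: attacker response to the defender
    if backlog(defender, new_attacker) <= 1e-6:       # no attacker policy harms the defender: stop
        break
    attacker_pool.append(new_attacker)                # otherwise, grow the pool and retrain

print(f"converged after {iteration + 1} iterations; attacker pool size {len(attacker_pool)}")

In the paper's setting, train_defender_rl would correspond to retraining the defender's RL policy on episodes generated by the pooled attacker policies, and attacker_best_response to learning an adversarial alert-generation policy against the current defender.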

Bibliographic Details
Main Authors: SHAH, Ankit; SINHA, Arunesh; GANESAN, Rajesh; JAJODIA, Sushil; CAM, Hasan
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University, 2020
DOI: 10.1145/3377554
License: CC BY-NC-ND 4.0 (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Subjects: Cyber-security operations center; adversarial reinforcement learning; game theory; Artificial Intelligence and Robotics; Computer and Systems Architecture
Online Access:https://ink.library.smu.edu.sg/sis_research/5544
https://ink.library.smu.edu.sg/context/sis_research/article/6547/viewcontent/3377554__1_.pdf
Institution: Singapore Management University
Collection: Research Collection School Of Computing and Information Systems (InK@SMU)