Adapting strategies to dynamic environments in controllable Stackelberg security games

Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

22 Scopus citations

Abstract

There is growing interest in applying Stackelberg games to model resource allocation in patrolling security problems, in which defenders must allocate limited security resources to protect targets from attack by adversaries. In the real world, adversaries are sophisticated and present dynamic strategies. Most existing approaches for computing defender strategies solve the game against fixed behavioral models of adversaries and cannot ensure success in the realization of the game. To address this shortcoming, this paper presents a novel approach for adapting preferred strategies in controlled Stackelberg security games using a reinforcement learning (RL) approach for attackers and defenders based on average rewards. We propose a common framework that combines prior knowledge and the temporal-difference method in reinforcement learning. The overall RL architecture involves two top-level components: the adaptive primary learning architecture and the actor-critic architecture. In this work we consider a Stackelberg security game in the case of a metric state space for a class of discrete-time ergodic controllable Markov chain games. To compute the equilibrium point we employ the extraproximal method. Finally, a game-theoretic example illustrates the main results and the effectiveness of the method.
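The abstract names the extraproximal method without giving its details, so the following is only a loosely related sketch and not the authors' algorithm. It uses the classical extragradient iteration, a simpler two-step scheme (a prediction step followed by a correction step), to approximate the equilibrium of a small zero-sum matrix game. The payoff matrix, step size `tau`, iteration count, starting strategies, and all function names are assumptions chosen for illustration; the paper's setting (Stackelberg security games over ergodic controllable Markov chains) is considerably richer.

```python
def project_simplex(v):
    """Euclidean projection of v onto the probability simplex (sort-based)."""
    u = sorted(v, reverse=True)
    cumulative, theta = 0.0, 0.0
    for j, u_j in enumerate(u, start=1):
        cumulative += u_j
        t = (cumulative - 1.0) / j
        if u_j - t > 0:  # holds for a prefix of j; the last hit gives the shift
            theta = t
    return [max(x - theta, 0.0) for x in v]


def mat_vec(A, y):
    """Return A @ y for a list-of-rows matrix."""
    return [sum(a * b for a, b in zip(row, y)) for row in A]


def mat_t_vec(A, x):
    """Return A.T @ x for a list-of-rows matrix."""
    return [sum(A[i][j] * x[i] for i in range(len(A))) for j in range(len(A[0]))]


def projected_step(x, y, grad_x, grad_y, tau):
    """Row player ascends x^T A y, column player descends; both stay on the simplex."""
    new_x = project_simplex([xi + tau * g for xi, g in zip(x, grad_x)])
    new_y = project_simplex([yi - tau * g for yi, g in zip(y, grad_y)])
    return new_x, new_y


def extragradient(A, x, y, tau=0.1, iterations=2000):
    """Extragradient iteration for the saddle point of x^T A y (illustrative only)."""
    for _ in range(iterations):
        # Prediction: a trial step using gradients at the current point.
        x_half, y_half = projected_step(x, y, mat_vec(A, y), mat_t_vec(A, x), tau)
        # Correction: restart from the current point, using gradients at the trial point.
        x, y = projected_step(x, y, mat_vec(A, y_half), mat_t_vec(A, x_half), tau)
    return x, y


# Hypothetical 2x2 zero-sum game (matching pennies); not taken from the paper.
A = [[1.0, -1.0], [-1.0, 1.0]]
x, y = extragradient(A, [0.9, 0.1], [0.2, 0.8])
print(x, y)
```

For this game both mixed strategies should approach the uniform distribution. The prediction step is what separates the scheme from plain simultaneous gradient steps, which are known to cycle or diverge on bilinear games of this kind.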

Original language: English
Title of host publication: 2016 IEEE 55th Conference on Decision and Control, CDC 2016
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 5484-5489
Number of pages: 6
ISBN (Electronic): 9781509018376
State: Published - 27 Dec 2016
Event: 55th IEEE Conference on Decision and Control, CDC 2016 - Las Vegas, United States
Duration: 12 Dec 2016 to 14 Dec 2016

Publication series

Name: 2016 IEEE 55th Conference on Decision and Control, CDC 2016

Conference

Conference: 55th IEEE Conference on Decision and Control, CDC 2016
Country/Territory: United States
City: Las Vegas
Period: 12/12/16 to 14/12/16
