TY - GEN
T1 - Adapting strategies to dynamic environments in controllable Stackelberg security games
AU - Trejo, Kristal K.
AU - Clempner, Julio B.
AU - Poznyak, Alexander S.
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/12/27
Y1 - 2016/12/27
N2 - There is growing interest in applying Stackelberg games to model resource allocation for patrolling security problems, in which defenders must allocate limited security resources to protect targets from attack by adversaries. Real-world adversaries are sophisticated and present dynamic strategies. Most existing approaches for computing defender strategies solve the game against fixed behavioral models of adversaries and cannot ensure success in the realization of the game. To address this shortcoming, this paper presents a novel approach for adapting preferred strategies in controlled Stackelberg security games using a reinforcement learning (RL) approach for attackers and defenders employing an average reward. We propose a common framework that combines prior knowledge and the temporal-difference method in reinforcement learning. The overall RL architecture involves two high-level components: the adaptive primary learning architecture and the actor-critic architecture. In this work we consider a Stackelberg security game with a metric state space for a class of time-discrete ergodic controllable Markov chain games. To compute the equilibrium point we employ the extraproximal method. Finally, a game-theoretic example illustrates the main results and the effectiveness of the method.
AB - There is growing interest in applying Stackelberg games to model resource allocation for patrolling security problems, in which defenders must allocate limited security resources to protect targets from attack by adversaries. Real-world adversaries are sophisticated and present dynamic strategies. Most existing approaches for computing defender strategies solve the game against fixed behavioral models of adversaries and cannot ensure success in the realization of the game. To address this shortcoming, this paper presents a novel approach for adapting preferred strategies in controlled Stackelberg security games using a reinforcement learning (RL) approach for attackers and defenders employing an average reward. We propose a common framework that combines prior knowledge and the temporal-difference method in reinforcement learning. The overall RL architecture involves two high-level components: the adaptive primary learning architecture and the actor-critic architecture. In this work we consider a Stackelberg security game with a metric state space for a class of time-discrete ergodic controllable Markov chain games. To compute the equilibrium point we employ the extraproximal method. Finally, a game-theoretic example illustrates the main results and the effectiveness of the method.
UR - http://www.scopus.com/inward/record.url?scp=85010748463&partnerID=8YFLogxK
U2 - 10.1109/CDC.2016.7799111
DO - 10.1109/CDC.2016.7799111
M3 - Conference contribution
AN - SCOPUS:85010748463
T3 - 2016 IEEE 55th Conference on Decision and Control, CDC 2016
SP - 5484
EP - 5489
BT - 2016 IEEE 55th Conference on Decision and Control, CDC 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 55th IEEE Conference on Decision and Control, CDC 2016
Y2 - 12 December 2016 through 14 December 2016
ER -