Handling a Kullback–Leibler divergence random walk for scheduling effective patrol strategies in Stackelberg security games

César U. Solis; Julio B. Clempner; Alexander S. Poznyak

doi:10.14736/kyb-2019-4-0618

Handling a Kullback–Leibler divergence random walk for scheduling effective patrol strategies in Stackelberg security games

César U. Solis, Julio B. Clempner, Alexander S. Poznyak

Escuela Superior de Física y Matemáticas (ESFM)

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

This paper presents a new model for computing optimal randomized security policies in non-cooperative Stackelberg Security Games (SSGs) for multiple players. Our framework rests upon the extraproximal method and its extension to Markov chains, within which we explicitly compute the unique Stackelberg/Nash equilibrium of the game by employing the Lagrange method and introducing the Tikhonov regularization method. We also consider a game-theory realization of the problem that involves defenders and attackers performing a discrete-time random walk over a finite state space. Following the Kullback–Leibler divergence the players’ actions are fixed and, then the next-state distribution is computed. The player’s goal at each time step is to specify the probability distribution for the next state. We present an explicit construction of a computationally efficient strategy under mild defenders and attackers conditions and demonstrate the performance of the proposed method on a simulated target tracking problem.

Original language	English
Pages (from-to)	618-640
Number of pages	23
Journal	Kybernetika
Volume	55
Issue number	4
DOIs	https://doi.org/10.14736/kyb-2019-4-0618
State	Published - 2019

Keywords

Markov chains
Patrolling
Security
Stackelberg games

Access to Document

10.14736/kyb-2019-4-0618

Cite this

@article{e4e04b92dd0c45679dbdd25802348721,

title = "Handling a Kullback–Leibler divergence random walk for scheduling effective patrol strategies in Stackelberg security games",

abstract = "This paper presents a new model for computing optimal randomized security policies in non-cooperative Stackelberg Security Games (SSGs) for multiple players. Our framework rests upon the extraproximal method and its extension to Markov chains, within which we explicitly compute the unique Stackelberg/Nash equilibrium of the game by employing the Lagrange method and introducing the Tikhonov regularization method. We also consider a game-theory realization of the problem that involves defenders and attackers performing a discrete-time random walk over a finite state space. Following the Kullback–Leibler divergence the players{\textquoteright} actions are fixed and, then the next-state distribution is computed. The player{\textquoteright}s goal at each time step is to specify the probability distribution for the next state. We present an explicit construction of a computationally efficient strategy under mild defenders and attackers conditions and demonstrate the performance of the proposed method on a simulated target tracking problem.",

keywords = "Markov chains, Patrolling, Security, Stackelberg games",

author = "Solis, {C{\'e}sar U.} and Clempner, {Julio B.} and Poznyak, {Alexander S.}",

year = "2019",

doi = "10.14736/kyb-2019-4-0618",

language = "Ingl{\'e}s",

volume = "55",

pages = "618--640",

journal = "Kybernetika",

issn = "0023-5954",

number = "4",

}

TY - JOUR

T1 - Handling a Kullback–Leibler divergence random walk for scheduling effective patrol strategies in Stackelberg security games

AU - Solis, César U.

AU - Clempner, Julio B.

AU - Poznyak, Alexander S.

PY - 2019

Y1 - 2019

N2 - This paper presents a new model for computing optimal randomized security policies in non-cooperative Stackelberg Security Games (SSGs) for multiple players. Our framework rests upon the extraproximal method and its extension to Markov chains, within which we explicitly compute the unique Stackelberg/Nash equilibrium of the game by employing the Lagrange method and introducing the Tikhonov regularization method. We also consider a game-theory realization of the problem that involves defenders and attackers performing a discrete-time random walk over a finite state space. Following the Kullback–Leibler divergence the players’ actions are fixed and, then the next-state distribution is computed. The player’s goal at each time step is to specify the probability distribution for the next state. We present an explicit construction of a computationally efficient strategy under mild defenders and attackers conditions and demonstrate the performance of the proposed method on a simulated target tracking problem.

AB - This paper presents a new model for computing optimal randomized security policies in non-cooperative Stackelberg Security Games (SSGs) for multiple players. Our framework rests upon the extraproximal method and its extension to Markov chains, within which we explicitly compute the unique Stackelberg/Nash equilibrium of the game by employing the Lagrange method and introducing the Tikhonov regularization method. We also consider a game-theory realization of the problem that involves defenders and attackers performing a discrete-time random walk over a finite state space. Following the Kullback–Leibler divergence the players’ actions are fixed and, then the next-state distribution is computed. The player’s goal at each time step is to specify the probability distribution for the next state. We present an explicit construction of a computationally efficient strategy under mild defenders and attackers conditions and demonstrate the performance of the proposed method on a simulated target tracking problem.

KW - Markov chains

KW - Patrolling

KW - Security

KW - Stackelberg games

UR - http://www.scopus.com/inward/record.url?scp=85073552702&partnerID=8YFLogxK

U2 - 10.14736/kyb-2019-4-0618

DO - 10.14736/kyb-2019-4-0618

M3 - Artículo

SN - 0023-5954

VL - 55

SP - 618

EP - 640

JO - Kybernetika

JF - Kybernetika

IS - 4

ER -

Handling a Kullback–Leibler divergence random walk for scheduling effective patrol strategies in Stackelberg security games

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this