Analysis of best-reply strategies in repeated finite Markov Chains Games

Julio Clempner; Alexander Poznyak

doi:10.1109/CDC.2013.6759942

Analysis of best-reply strategies in repeated finite Markov Chains Games

Julio Clempner, Alexander Poznyak

Escuela Superior de Física y Matemáticas (ESFM)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

19 Scopus citations

Abstract

The "best-reply strategy" is a natural and most commonly applied type of actions which players prefer to use during a repeated game. Usually, the behavior of an individual cost-function, when such strategies are applied, turns out to be non-monotonic, and, as the results, to make the conclusion that such strategies lead to some equilibrium point is a non-trivial and delicate task. Moreover, even in repeated games the convergence to a stationary equilibrium is not always guaranteed. Here we show that in the ergodic class of finite controllable Markov Chains Dynamic Games the best reply actions lead obligatory to one of Nash equilibrium points. This conclusion is done by the Lyapunov Games concept which is based on the designing of an individual Lyapunov function (related with an individual cost function) which monotonically decreases (non-increases) during the game. The suggested approach is illustrated by the repeated asynchronous "Prisoner's Dilemma" game with bestreply actions application.

Translated title of the contribution	Análisis de estrategias de mejor respuesta en juegos de cadenas de Markov finitos repetidos
Original language	English
Title of host publication	2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	568-573
Number of pages	6
ISBN (Print)	9781467357173
DOIs	https://doi.org/10.1109/CDC.2013.6759942
State	Published - 2013
Event	52nd IEEE Conference on Decision and Control, CDC 2013 - Florence, Italy Duration: 10 Dec 2013 → 13 Dec 2013

Publication series

Name	Proceedings of the IEEE Conference on Decision and Control
ISSN (Print)	0743-1546
ISSN (Electronic)	2576-2370

Conference

Conference	52nd IEEE Conference on Decision and Control, CDC 2013
Country/Territory	Italy
City	Florence
Period	10/12/13 → 13/12/13

Access to Document

10.1109/CDC.2013.6759942

Cite this

@inproceedings{6bd4189eef9e42888025e91c056af02d,

title = "Analysis of best-reply strategies in repeated finite Markov Chains Games",

abstract = "The {"}best-reply strategy{"} is a natural and most commonly applied type of actions which players prefer to use during a repeated game. Usually, the behavior of an individual cost-function, when such strategies are applied, turns out to be non-monotonic, and, as the results, to make the conclusion that such strategies lead to some equilibrium point is a non-trivial and delicate task. Moreover, even in repeated games the convergence to a stationary equilibrium is not always guaranteed. Here we show that in the ergodic class of finite controllable Markov Chains Dynamic Games the best reply actions lead obligatory to one of Nash equilibrium points. This conclusion is done by the Lyapunov Games concept which is based on the designing of an individual Lyapunov function (related with an individual cost function) which monotonically decreases (non-increases) during the game. The suggested approach is illustrated by the repeated asynchronous {"}Prisoner's Dilemma{"} game with bestreply actions application.",

author = "Julio Clempner and Alexander Poznyak",

year = "2013",

doi = "10.1109/CDC.2013.6759942",

language = "Ingl{\'e}s",

isbn = "9781467357173",

series = "Proceedings of the IEEE Conference on Decision and Control",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "568--573",

booktitle = "2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013",

address = "Estados Unidos",

note = "52nd IEEE Conference on Decision and Control, CDC 2013 ; Conference date: 10-12-2013 Through 13-12-2013",

}

Clempner, J & Poznyak, A 2013, Analysis of best-reply strategies in repeated finite Markov Chains Games. in 2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013., 6759942, Proceedings of the IEEE Conference on Decision and Control, Institute of Electrical and Electronics Engineers Inc., pp. 568-573, 52nd IEEE Conference on Decision and Control, CDC 2013, Florence, Italy, 10/12/13. https://doi.org/10.1109/CDC.2013.6759942

Analysis of best-reply strategies in repeated finite Markov Chains Games. / Clempner, Julio; Poznyak, Alexander.
2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013. Institute of Electrical and Electronics Engineers Inc., 2013. p. 568-573 6759942 (Proceedings of the IEEE Conference on Decision and Control).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Analysis of best-reply strategies in repeated finite Markov Chains Games

AU - Clempner, Julio

AU - Poznyak, Alexander

PY - 2013

Y1 - 2013

N2 - The "best-reply strategy" is a natural and most commonly applied type of actions which players prefer to use during a repeated game. Usually, the behavior of an individual cost-function, when such strategies are applied, turns out to be non-monotonic, and, as the results, to make the conclusion that such strategies lead to some equilibrium point is a non-trivial and delicate task. Moreover, even in repeated games the convergence to a stationary equilibrium is not always guaranteed. Here we show that in the ergodic class of finite controllable Markov Chains Dynamic Games the best reply actions lead obligatory to one of Nash equilibrium points. This conclusion is done by the Lyapunov Games concept which is based on the designing of an individual Lyapunov function (related with an individual cost function) which monotonically decreases (non-increases) during the game. The suggested approach is illustrated by the repeated asynchronous "Prisoner's Dilemma" game with bestreply actions application.

AB - The "best-reply strategy" is a natural and most commonly applied type of actions which players prefer to use during a repeated game. Usually, the behavior of an individual cost-function, when such strategies are applied, turns out to be non-monotonic, and, as the results, to make the conclusion that such strategies lead to some equilibrium point is a non-trivial and delicate task. Moreover, even in repeated games the convergence to a stationary equilibrium is not always guaranteed. Here we show that in the ergodic class of finite controllable Markov Chains Dynamic Games the best reply actions lead obligatory to one of Nash equilibrium points. This conclusion is done by the Lyapunov Games concept which is based on the designing of an individual Lyapunov function (related with an individual cost function) which monotonically decreases (non-increases) during the game. The suggested approach is illustrated by the repeated asynchronous "Prisoner's Dilemma" game with bestreply actions application.

UR - http://www.scopus.com/inward/record.url?scp=84902356207&partnerID=8YFLogxK

U2 - 10.1109/CDC.2013.6759942

DO - 10.1109/CDC.2013.6759942

M3 - Contribución a la conferencia

AN - SCOPUS:84902356207

SN - 9781467357173

T3 - Proceedings of the IEEE Conference on Decision and Control

SP - 568

EP - 573

BT - 2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 52nd IEEE Conference on Decision and Control, CDC 2013

Y2 - 10 December 2013 through 13 December 2013

ER -

Analysis of best-reply strategies in repeated finite Markov Chains Games

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this