Analysis of best-reply strategies in repeated finite Markov Chains Games

Julio Clempner, Alexander Poznyak

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

19 Scopus citations

Abstract

The "best-reply strategy" is a natural and most commonly applied type of actions which players prefer to use during a repeated game. Usually, the behavior of an individual cost-function, when such strategies are applied, turns out to be non-monotonic, and, as the results, to make the conclusion that such strategies lead to some equilibrium point is a non-trivial and delicate task. Moreover, even in repeated games the convergence to a stationary equilibrium is not always guaranteed. Here we show that in the ergodic class of finite controllable Markov Chains Dynamic Games the best reply actions lead obligatory to one of Nash equilibrium points. This conclusion is done by the Lyapunov Games concept which is based on the designing of an individual Lyapunov function (related with an individual cost function) which monotonically decreases (non-increases) during the game. The suggested approach is illustrated by the repeated asynchronous "Prisoner's Dilemma" game with bestreply actions application.

Translated title of the contributionAnálisis de estrategias de mejor respuesta en juegos de cadenas de Markov finitos repetidos
Original languageEnglish
Title of host publication2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages568-573
Number of pages6
ISBN (Print)9781467357173
DOIs
StatePublished - 2013
Event52nd IEEE Conference on Decision and Control, CDC 2013 - Florence, Italy
Duration: 10 Dec 201313 Dec 2013

Publication series

NameProceedings of the IEEE Conference on Decision and Control
ISSN (Print)0743-1546
ISSN (Electronic)2576-2370

Conference

Conference52nd IEEE Conference on Decision and Control, CDC 2013
Country/TerritoryItaly
CityFlorence
Period10/12/1313/12/13

Fingerprint

Dive into the research topics of 'Analysis of best-reply strategies in repeated finite Markov Chains Games'. Together they form a unique fingerprint.

Cite this