Analytical method for mechanism design in partially observable markov games

Julio B. Clempner; Alexander S. Poznyak

doi:10.3390/math9040321

Analytical method for mechanism design in partially observable markov games

Título traducido de la contribución: Método analítico para el diseño de mecanismos en juegos de markov parcialmente observables

Julio B. Clempner, Alexander S. Poznyak

Escuela Superior de Física y Matemáticas (ESFM)

Producción científica: Contribución a una revista › Artículo › revisión exhaustiva

10 Citas (Scopus)

Resumen

A theme that become common knowledge of the literature is the difficulty of developing a mechanism that is compatible with individual incentives that simultaneously result in efficient decisions that maximize the total reward. In this paper, we suggest an analytical method for computing a mechanism design. This problem is explored in the context of a framework, in which the players follow an average utility in a non-cooperative Markov game with incomplete state information. All of the Nash equilibria are approximated in a sequential process. We describe a method for the derivative of the player’s equilibrium that instruments the design of the mechanism. In addition, it showed the convergence and rate of convergence of the proposed method. For computing the mechanism, we consider an extension of the Markov model for which it is introduced a new variable that represents the product of the mechanism design and the joint strategy. We derive formulas to recover the variables of interest: mechanisms, strategy, and distribution vector. The mechanism design and equilibrium strategies computation differ from those in previous literature. A numerical example presents the usefulness and effectiveness of the proposed method.

Título traducido de la contribución	Método analítico para el diseño de mecanismos en juegos de markov parcialmente observables
Idioma original	Inglés
Número de artículo	321
Páginas (desde-hasta)	1-15
Número de páginas	15
Publicación	Mathematics
Volumen	9
N.º	4
DOI	https://doi.org/10.3390/math9040321
Estado	Publicada - 2 feb. 2021

Acceder al documento

10.3390/math9040321

Otros archivos y enlaces

Enlace a la publicación en Scopus

Citar esto

@article{66cd55ea1e2448ee80ad0e2e7bcbbe02,

title = "Analytical method for mechanism design in partially observable markov games",

abstract = "A theme that become common knowledge of the literature is the difficulty of developing a mechanism that is compatible with individual incentives that simultaneously result in efficient decisions that maximize the total reward. In this paper, we suggest an analytical method for computing a mechanism design. This problem is explored in the context of a framework, in which the players follow an average utility in a non-cooperative Markov game with incomplete state information. All of the Nash equilibria are approximated in a sequential process. We describe a method for the derivative of the player{\textquoteright}s equilibrium that instruments the design of the mechanism. In addition, it showed the convergence and rate of convergence of the proposed method. For computing the mechanism, we consider an extension of the Markov model for which it is introduced a new variable that represents the product of the mechanism design and the joint strategy. We derive formulas to recover the variables of interest: mechanisms, strategy, and distribution vector. The mechanism design and equilibrium strategies computation differ from those in previous literature. A numerical example presents the usefulness and effectiveness of the proposed method.",

keywords = "Dynamic mechanism design, Incentive-compatible mechanisms, Incomplete state information, Markov games with private information, Partially observable Markov chains",

author = "Clempner, {Julio B.} and Poznyak, {Alexander S.}",

note = "Publisher Copyright: {\textcopyright} 2021 by the authors. Licensee MDPI, Basel, Switzerland.",

year = "2021",

month = feb,

day = "2",

doi = "10.3390/math9040321",

language = "Ingl{\'e}s",

volume = "9",

pages = "1--15",

journal = "Mathematics",

issn = "2227-7390",

number = "4",

}

TY - JOUR

T1 - Analytical method for mechanism design in partially observable markov games

AU - Clempner, Julio B.

AU - Poznyak, Alexander S.

PY - 2021/2/2

Y1 - 2021/2/2

N2 - A theme that become common knowledge of the literature is the difficulty of developing a mechanism that is compatible with individual incentives that simultaneously result in efficient decisions that maximize the total reward. In this paper, we suggest an analytical method for computing a mechanism design. This problem is explored in the context of a framework, in which the players follow an average utility in a non-cooperative Markov game with incomplete state information. All of the Nash equilibria are approximated in a sequential process. We describe a method for the derivative of the player’s equilibrium that instruments the design of the mechanism. In addition, it showed the convergence and rate of convergence of the proposed method. For computing the mechanism, we consider an extension of the Markov model for which it is introduced a new variable that represents the product of the mechanism design and the joint strategy. We derive formulas to recover the variables of interest: mechanisms, strategy, and distribution vector. The mechanism design and equilibrium strategies computation differ from those in previous literature. A numerical example presents the usefulness and effectiveness of the proposed method.

AB - A theme that become common knowledge of the literature is the difficulty of developing a mechanism that is compatible with individual incentives that simultaneously result in efficient decisions that maximize the total reward. In this paper, we suggest an analytical method for computing a mechanism design. This problem is explored in the context of a framework, in which the players follow an average utility in a non-cooperative Markov game with incomplete state information. All of the Nash equilibria are approximated in a sequential process. We describe a method for the derivative of the player’s equilibrium that instruments the design of the mechanism. In addition, it showed the convergence and rate of convergence of the proposed method. For computing the mechanism, we consider an extension of the Markov model for which it is introduced a new variable that represents the product of the mechanism design and the joint strategy. We derive formulas to recover the variables of interest: mechanisms, strategy, and distribution vector. The mechanism design and equilibrium strategies computation differ from those in previous literature. A numerical example presents the usefulness and effectiveness of the proposed method.

KW - Dynamic mechanism design

KW - Incentive-compatible mechanisms

KW - Incomplete state information

KW - Markov games with private information

KW - Partially observable Markov chains

UR - http://www.scopus.com/inward/record.url?scp=85100956257&partnerID=8YFLogxK

U2 - 10.3390/math9040321

DO - 10.3390/math9040321

M3 - Artículo

AN - SCOPUS:85100956257

SN - 2227-7390

VL - 9

SP - 1

EP - 15

JO - Mathematics

JF - Mathematics

IS - 4

M1 - 321

ER -

Analytical method for mechanism design in partially observable markov games

Resumen

Acceder al documento

Otros archivos y enlaces

Huella

Citar esto