TY - JOUR
T1 - A Markovian Stackelberg game approach for computing an optimal dynamic mechanism
AU - Clempner, Julio B.
N1 - Publisher Copyright:
© 2021, SBMAC - Sociedade Brasileira de Matemática Aplicada e Computacional.
PY - 2021/9
Y1 - 2021/9
N2 - This paper presents a dynamic Bayesian–Stackelberg incentive-compatible mechanism, in which multiple agents observe private information and learn their behavior through a sequence of interactions in a repeated game, for a class of controllable homogeneous Markov games. We assume that the leaders can ex ante commit to their disclosure strategy and mechanism, and thereby affect the followers’ actions. Throughout the paper, leaders possess and benefit from commitment power, which is the distinctive feature of a Stackelberg game. In this dynamic, leaders and followers together play a Stackelberg game in which actions are taken sequentially across the two layers of the hierarchy, while, independently, the leaders and the followers each play a non-cooperative (Nash) game in which actions are taken simultaneously. The game considers an ex ante incentive-compatible mechanism that, in equilibrium, maximizes the reward while the agents learn their actions over a countable number of periods. The problem is formulated as a Bayesian–Stackelberg equilibrium in the context of reinforcement learning. We propose an algorithm based on the extraproximal method and show that it converges. Tikhonov’s regularization technique is employed to ensure the existence and uniqueness of the Bayesian–Stackelberg equilibrium, and we guarantee the convergence of the method to a single incentive-compatible mechanism. We derive the analytical expressions for computing the mechanism in a Stackelberg game, which is one of the main results of this work. We demonstrate the efficiency of the method with an experiment drawn from an electric power problem represented by an oligopolistic market structure dominated by a small number of large sellers (oligopolists).
KW - Bayesian equilibrium
KW - Dynamic mechanism design
KW - Incentive-compatible mechanisms
KW - Markov games
KW - Stackelberg games with private information
UR - http://www.scopus.com/inward/record.url?scp=85110350483&partnerID=8YFLogxK
DO - 10.1007/s40314-021-01578-4
M3 - Article
AN - SCOPUS:85110350483
SN - 2238-3603
VL - 40
JO - Computational and Applied Mathematics
JF - Computational and Applied Mathematics
IS - 6
M1 - 186
ER -