A Markovian Stackelberg game approach for computing an optimal dynamic mechanism

Título traducido de la contribución: Un enfoque de juego Markoviano Stackelberg para calcular un mecanismo dinámico óptimo

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

6 Citas (Scopus)

Resumen

This paper presents a dynamic Bayesian–Stackelberg incentive-compatible mechanism, in which multiple agents observe private information and learn their behavior through a sequence of interactions in a repeated game, for a class of controllable homogeneous Markov games. We assume that the leaders can ex ante commit to their disclosure strategy and mechanism, and affect followers’ actions. Along the paper, leaders possess and benefit from some commitment leadership, which describes the distinctive nature of a Stackelberg game. In this dynamics, leaders and followers together are in a Stackelberg game where actions are taken in a sequential way in the two layers of the hierarchy, but independently leaders and followers are involved non-cooperativelyin two (Nash) games where actions are taken simultaneously. This game considers an ex-ante incentive-compatible mechanism, which in equilibrium maximizes the reward while the agents are learning their actions over a countable number of periods. The formulation of the problem considers a Bayesian–Stackelberg equilibrium in the context of Reinforcement Learning. We propose an algorithm supported by the extraproximal method and show that it converges. The Tikhonov’s regularization technique is employed for ensuring the existence and uniqueness of the Bayesian–Stackelberg equilibrium. We show and guarantee the convergence of the method to a single incentive-compatible mechanism. We derive the analytical expressions for computing the mechanism in a Stackelberg game, which is one of the main results of this work. We demonstrate the efficiency of the method by an experiment drawn from an electric power problem represented by an oligopolistic market structure dominated by a small number of large sellers (oligopolists).

Título traducido de la contribuciónUn enfoque de juego Markoviano Stackelberg para calcular un mecanismo dinámico óptimo
Idioma originalInglés
Número de artículo186
PublicaciónComputational and Applied Mathematics
Volumen40
N.º6
DOI
EstadoPublicada - sep. 2021

Palabras clave

  • equilibrio bayesiano
  • Diseño de mecanismos dinámicos
  • Mecanismos compatibles con incentivos
  • juegos de Markov
  • Juegos Stackelberg con información privada

Huella

Profundice en los temas de investigación de 'Un enfoque de juego Markoviano Stackelberg para calcular un mecanismo dinámico óptimo'. En conjunto forman una huella única.

Citar esto