A Proximal/Gradient Approach for Computing the Nash Equilibrium in Controllable Markov Games

Título traducido de la contribución: Un enfoque proximal/gradiente para calcular el equilibrio de Nash en juegos controlables de Markov

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

5 Citas (Scopus)

Resumen

This paper proposes a new algorithm for computing the Nash equilibrium based on an iterative approach of both the proximal and the gradient method for homogeneous, finite, ergodic and controllable Markov chains. We conceptualize the problem as a poly-linear programming problem. Then, we regularize the poly-linear functional employing a regularization approach over the Lagrange functional for ensuring the method to converge to some of the Nash equilibria of the game. This paper presents two main contributions: The first theoretical result is the proposed iterative approach, which employs both the proximal and the gradient method for computing the Nash equilibria in Markov games. The method transforms the game theory problem in a system of equations, in which each equation itself is an independent optimization problem for which the necessary condition of a minimum is computed employing a nonlinear programming solver. The iterated approach provides a quick rate of convergence to the Nash equilibrium point. The second computational contribution focuses on the analysis of the convergence of the proposed method and computes the rate of convergence of the step-size parameter. These results are interesting within the context of computational and algorithmic game theory. A numerical example illustrates the proposed approach.

Título traducido de la contribuciónUn enfoque proximal/gradiente para calcular el equilibrio de Nash en juegos controlables de Markov
Idioma originalInglés
Páginas (desde-hasta)847-862
Número de páginas16
PublicaciónJournal of Optimization Theory and Applications
Volumen188
N.º3
DOI
EstadoPublicada - mar. 2021

Huella

Profundice en los temas de investigación de 'Un enfoque proximal/gradiente para calcular el equilibrio de Nash en juegos controlables de Markov'. En conjunto forman una huella única.

Citar esto