A Proximal/Gradient Approach for Computing the Nash Equilibrium in Controllable Markov Games

Clempner, Julio B.

doi:10.1007/s10957-021-01812-3

A Proximal/Gradient Approach for Computing the Nash Equilibrium in Controllable Markov Games

Published: 20 January 2021

Volume 188, pages 847–862, (2021)
Cite this article

Journal of Optimization Theory and Applications Aims and scope Submit manuscript

Julio B. Clempner ORCID: orcid.org/0000-0002-5918-4671¹

319 Accesses
5 Citations
Explore all metrics

Abstract

This paper proposes a new algorithm for computing the Nash equilibrium based on an iterative approach of both the proximal and the gradient method for homogeneous, finite, ergodic and controllable Markov chains. We conceptualize the problem as a poly-linear programming problem. Then, we regularize the poly-linear functional employing a regularization approach over the Lagrange functional for ensuring the method to converge to some of the Nash equilibria of the game. This paper presents two main contributions: The first theoretical result is the proposed iterative approach, which employs both the proximal and the gradient method for computing the Nash equilibria in Markov games. The method transforms the game theory problem in a system of equations, in which each equation itself is an independent optimization problem for which the necessary condition of a minimum is computed employing a nonlinear programming solver. The iterated approach provides a quick rate of convergence to the Nash equilibrium point. The second computational contribution focuses on the analysis of the convergence of the proposed method and computes the rate of convergence of the step-size parameter. These results are interesting within the context of computational and algorithmic game theory. A numerical example illustrates the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Global Convergence of ADMM in Nonconvex Nonsmooth Optimization

Article 07 June 2018

Yu Wang, Wotao Yin & Jinshan Zeng

SPIRAL: a superlinearly convergent incremental proximal algorithm for nonconvex finite sum minimization

Article 29 March 2024

Pourya Behmandpoor, Puya Latafat, … Panagiotis Patrinos

On the Replication of the Pre-kernel and Related Solutions

Article 19 September 2023

Holger I. Meinhardt

Notes

Utopia point can be used as an ideal standard for the criteria values to find the best solution(s) from the Pareto optimal set (see [22,23,24]).

References

Nash, J.F.: Equilibrium points in n-person games. In: Proc. of the National Academy of Sciences, vol. 36, pp. 48–49 (1950)
Facchinei, F., Pang, J.S.: Finite-Dimensional Variational Inequalities and Complementarity Problems. Springer, New York (2003)
MATH Google Scholar
Nabetani, K., Tseng, P., Fukushima, M.: Parametrized variational inequality approaches to generalized Nash equilibrium problems with shared constraints. Comput. Optim. Appl. 8(3), 423–452 (2011)
Article MathSciNet Google Scholar
Facchinei, F., Sagratella, S.: On the computation of all solutions of jointly convex generalized Nash equilibrium problems. Optim. Lett. 5(3), 531–547 (2011)
Article MathSciNet Google Scholar
Dreves, A., Kanzow, C., Stein, O.: Nonsmooth optimization reformulations of player convex generalized Nash equilibrium problems. J. Glob. Optim. 53(4), 587–614 (2012)
Article MathSciNet Google Scholar
Clempner, J.B., Poznyak, A.S.: Convergence method, properties and computational complexity for Lyapunov games. Int. J. Appl. Math. Comput. Sci. 21(2), 349–361 (2011)
Article MathSciNet Google Scholar
Gabriel, S.A., Siddiqui, S., Conejo, A.J., Ruiz, C.: Solving discretely-constrained Nash–Cournot games with an application to power markets. Netw. Spat. Econ. 13(3), 307–326 (2013)
Article MathSciNet Google Scholar
Facchinei, F., Kanzow, C., Sagratella, S.: Solving quasi-variational inequalities via their kkt conditions. Math. Progr. 144(1–2), 369–412 (2014)
Article MathSciNet Google Scholar
Clempner, J.B.: Setting Cournot vs. Lyapunov games stability conditions and equilibrium point properties. Int. Game Theory Rev. 17, 1–10 (2015)
Article Google Scholar
Clempner, J.B., Poznyak, A.S.: Computing the strong Nash equilibrium for Markov chains games. Appl. Math. Comput. 265, 911–927 (2015)
MathSciNet MATH Google Scholar
Trejo, K.K., Clempner, J.B., Poznyak, A.S.: An optimal strong equilibrium solution for cooperative multi-leader-follower Stackelberg Markov chains games. Kibernetika 52(2), 258–279 (2016)
MathSciNet MATH Google Scholar
Antipin, A.S.: An extraproximal method for solving equilibrium programming problems and games. Comput. Math. Math. Phys. 45(11), 1893–1914 (2005)
MathSciNet Google Scholar
Dreves, A.: Computing all solutions of linear generalized Nash equilibrium problems. Math. Methods Oper. Res. (2016). https://doi.org/10.1007/s00186-016-0562-0
Article MathSciNet MATH Google Scholar
Clempner, J.B., Poznyak, A.S.: Analysis of best-reply strategies in repeated finite Markov chains games. In: IEEE Conference on Decision and Control (2013)
Clempner, J.B., Poznyak, A.S.: Convergence analysis for pure and stationary strategies in repeated potential games: Nash, Lyapunov and correlated equilibria. Expert Syst. Appl. 46, 474–484 (2016)
Article Google Scholar
Trejo, K.K., Clempner, J.B., Poznyak, A.S.: Computing the Lp-strong Nash equilibrium for Markov chains games. Appl. Math. Model. 41, 399–418 (2017)
Article MathSciNet Google Scholar
Clempner, J.B., Poznyak, A.S.: Finding the strong Nash equilibrium: computation, existence and characterization for Markov games. J. Optim. Theory Appl. 186(3), 1029–1052 (2020)
Article MathSciNet Google Scholar
Clempner, J.B., Poznyak, A.S.: A Tikhonov regularization parameter approach for solving Lagrange constrained optimization problems. Eng. Optim. 50(11), 1996–2012 (2018). https://doi.org/10.1080/0305215X.2017.1418866
Article MathSciNet Google Scholar
Clempner, J.B., Poznyak, A.S.: A Tikhonov regularized penalty function approach for solving polylinear programming problems. J. Comput. Appl. Math. 328, 267–286 (2018)
Article MathSciNet Google Scholar
Solis, C.U., Clempner, J.B., Poznyak, A.S.: Modeling multi-leader-follower non-cooperative Stackelberg games. Cybern. Syst. 47(8), 650–673 (2016)
Article Google Scholar
Clempner, J.B., Poznyak, A.S.: Analyzing an optimistic attitude for the leader firm in duopoly models: a strong Stackelberg equilibrium based on a Lyapunov game theory approach. Econ. Comput. Econ. Cybern. Stud. Res. 4(50), 41–60 (2016)
Google Scholar
Tanaka, K., Yokoyama, K.: On \(\epsilon \)-equilibrium point in a noncooperative n-person game. J. Math. Anal. Appl. 160, 413–423 (1991)
Article MathSciNet Google Scholar
Tanaka, K.: The closest solution to the shadow minimum of a cooperative dynamic game. Comput. Math. Appl. 18(1–3), 181–188 (1989)
Article MathSciNet Google Scholar
Clempner, J.B.: Computing multiobjective Markov chains handled by the extraproximal method. Ann. Oper. Res. 271(2), 469–486 (2018)
Article MathSciNet Google Scholar
Vasilyev, F.P., Khoroshilova, E.V., Antipin, S.: An extragradient method for finding the saddle point in an optimal control problem. Moscow Univ. Comput. Math. Cybern. 34(3), 113–118 (2010)
Article Google Scholar
Trejo, K.K., Clempner, J.B., Poznyak, A.S.: Computing the Stackelberg/Nash equilibria using the extraproximal method: convergence analysis and implementation details for Markov chains games. Int. J. Appl. Math. Comput. Sci. 25(2), 337–351 (2015)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Escuela Superior de Fsica y Matemticas, Instituto Politcnico Nacional, School of Physics and Mathematics, National Polytechnic Institute, Edificio 9 U.P. Adolfo Lopez Mateos, Col. San Pedro Zacatenco, 07730, Mexico City, Mexico
Julio B. Clempner

Authors

Julio B. Clempner
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Julio B. Clempner.

Additional information

Communicated by Kyriakos G. Vamvoudakis.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Clempner, J.B. A Proximal/Gradient Approach for Computing the Nash Equilibrium in Controllable Markov Games. J Optim Theory Appl 188, 847–862 (2021). https://doi.org/10.1007/s10957-021-01812-3

Download citation

Received: 25 July 2020
Accepted: 03 January 2021
Published: 20 January 2021
Issue Date: March 2021
DOI: https://doi.org/10.1007/s10957-021-01812-3

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Proximal/Gradient Approach for Computing the Nash Equilibrium in Controllable Markov Games

Abstract

Access this article

Similar content being viewed by others

Global Convergence of ADMM in Nonconvex Nonsmooth Optimization

SPIRAL: a superlinearly convergent incremental proximal algorithm for nonconvex finite sum minimization

On the Replication of the Pre-kernel and Related Solutions

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Abstract

Access this article

Similar content being viewed by others

Global Convergence of ADMM in Nonconvex Nonsmooth Optimization

SPIRAL: a superlinearly convergent incremental proximal algorithm for nonconvex finite sum minimization

On the Replication of the Pre-kernel and Related Solutions

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation