Elsevier

Games and Economic Behavior

Volume 126, March 2021, Pages 355-373
Games and Economic Behavior

Positive feedback in coordination games: Stochastic evolutionary dynamics and the logit choice rule

https://doi.org/10.1016/j.geb.2021.01.003Get rights and content

Abstract

We study the problem of stochastic stability for evolutionary dynamics under the logit choice rule. We consider general classes of coordination games, symmetric or asymmetric, with an arbitrary number of strategies, which satisfies the marginal bandwagon property (i.e., there is positive feedback to coordinate). Our main result is that the most likely evolutionary escape paths from a status quo convention consist of a series of identical mistakes. As an application of our result, we show that the Nash bargaining solution arises as the long run convention for the evolutionary Nash demand game under the usual logit choice rule. We also obtain a new bargaining solution if the logit choice rule is combined with intentional idiosyncratic plays. The new bargaining solution is more egalitarian than the Nash bargaining solution, demonstrating that intentionality implies equality under the logit choice model.

Introduction

Conventions and customs are sometimes determining factors for formal contracts. For example, Young and Burke (2001) show that local custom is a driving force in setting up the crop sharing contract terms in the state of Illinois. Customary patterns of behaviors such as asymmetric norms between racial groups and genders can also produce a mechanism by which inequalities persist for a long period of time (Naidu et al., 2017). Changes in informal convention sometimes induce formal institutional changes which may contribute to long-run economic growth (Hwang et al., 2016; Acemoglu et al., 2005). Thus, understanding both the disruption and emergence of conventions can shed light on problems such as economic incentives, inequality, and long-run growth. Conventions or social norms that typically form over time and last for a long period of time are frequently modeled as long run equilibria in stochastic evolutionary dynamics, in which agents play myopic best responses subject to mistakes, errors, or idiosyncratic plays (Young, 1993a; Kandori et al., 1993; Bowles, 2004). Therefore identifying the most likely evolutionary paths escaping from an existing convention or transitioning between conventions is a key step in studying the disruption and emergence of conventions.

Recently, one of the behavioral rules of myopic agents, the logit choice model, has become popular among researchers because of its analytic convenience (see Section 2 for existing studies using the logit choice rule). Under the logit choice rule, the probability of an agent's mistake decreases log-linearly in the payoff losses incurred by such a mistake. Recent experimental literature also supports the hypothesis that mistake probabilities decrease in payoff losses (see Section 2). In the widely used uniform mistake model, in which all possible mistakes are equally likely, the more likely path can be easily determined by comparing the number of mistakes involved and, hence, the lengths of paths (e.g., Kandori et al. (1993)). However, under the logit choice rule, determining the most likely path is far from obvious, because the probability of a path depends on the kinds of mistakes involved as well as on the length of the path. For example, Hwang and Newton (2016) provide an example in which two different kinds of mistake plays are involved in the most likely escape path from a convention under a finite population logit model (see Example 1 in Hwang and Newton (2016)).

Because of the complexity of the logit choice rule, there has been, so far, no general way to analyze the optimal evolutionary paths from one convention to another. As a concrete example, it is unknown which kind of contract conventions will emerge and persist when agents play the evolutionary version of the familiar Nash bargaining game (Nash, 1953) under the logit rules(see Section 5). The goal of this paper is to fill in this gap in the literature for general classes of games. Two recent studies address similar questions. Hwang and Newton (2016) study two-population coordination evolutionary models with zero off-diagonal payoffs and an arbitrary number of strategies, for both finite and infinite populations. Sandholm and Staudigl (2016) study a one-population coordination evolutionary model with three strategies in the infinite population limit. These two papers are discussed in more detail in Section 2 but we note here that their results are limited to specific classes of games (either games with zero off-diagonal payoffs or games with three strategies).

One of the main novelties of this paper is a new method—we call it “comparison principles”—which can be applied to one- or two-population games with an arbitrary number of strategies and which allows greatly reducing the complexity of finding the most likely evolutionary paths under the logit choice rule. Specifically, we find that under the logit choice rule, the positive feedback of agents (to coordinate) plays a key role. Kandori and Rob (1998) introduce the “marginal bandwagon property” to capture the positive feedback aspect of network externality, requiring that the advantage of strategy i over j is greater when the other player is playing strategy i. In stochastic evolutionary game theory, the (un-)likeliness of a path is measured by a quantity called the “cost”: the less costly a path is, the more likely is the transition induced by that path. We show that for finite population models with the logit choice rule, (i) positive feedback (defined by the marginal bandwagon property) implies that along the minimum cost escape paths from a status quo convention, agents always deviate first from the status quo convention strategy before deviating from other strategies (Lemma 4.1 (i)), and (ii) the relative strength of the positive feedback effects implies that the transitions from the status quo convention to another convention must occur consecutively in the cost optimal escape paths (Lemma 4.1 (ii)).

We then apply these results to the exit problem—the problem of finding a cost minimum path escaping from a convention—under finite population models and characterize the candidates for the cost minimizing paths, as follows. The candidates consist of (possibly) different kinds of repeated identical mistakes deviating from the status quo convention strategy (Proposition 4.1). Finally, to pin down the exact minimum cost escaping path, we consider the infinite population limit as in Sandholm and Staudigl (2016) (Proposition 4.2) and show that the most likely escape paths from the status quo convention involve only one kind of repeated identical mistakes of agents (Proposition 4.3, Proposition 4.4, and Theorem 4.1). This result holds for any coordination games satisfying the marginal bandwagon properties and some regularity conditions with an arbitrary number of strategies, regardless of symmetric or asymmetric games (hence, one- or two-population models; Theorem 4.1, Theorem 5.1). To the best of our knowledge, this is a novel result.

As an application of our main results, we study the evolutionary bargaining convention for the Nash demand games under the logit choice rule and show that the Nash bargaining solution arises as the stochastically stable convention under the usual logit choice rule (called the unintentional logit dynamic). We also obtain a new bargaining convention when the logit choice rule is combined with intentional idiosyncratic (non-best response) plays (called the intentional logit dynamic). By intentional idiosyncratic plays, we mean that agents always experiment with strategies under which they would do better, should that strategy induce a convention (Naidu et al., 2010; Hwang et al., 2018). We show that the new solution under intentional logit dynamics is more egalitarian than the Nash bargaining solution, hence intentionality implies equality (Proposition 5.1). The reason for equality is as follows: under the unintentional logit rule, some transitions from the egalitarian convention (the equal division convention) to the Nash bargaining convention (the unequal division convention) are driven by a population who stands to lose by such transitions. Under the intentional logit rule, every transition is driven by the population who stands to benefit. Thus, some unfavorable transitions leading to the Nash bargaining convention are replaced by favorable transitions to the deviant population, leading to a more equal convention than the Nash bargaining convention. It can be easily seen that our comparison principle as well as our remaining arguments for the logit choice rule hold for the uniform mistake models under the assumption of the marginal bandwagon property. Thus, our comparison principles provide a unified framework for analyzing evolutionary dynamics, including the uniform mistake and logit models.

This paper is organized as follows. Section 2 discusses the related literature. Section 3 introduces the basic setup and discusses, in some detail, an example illustrating our methods. We present our main results for the exit problem for one population models in Section 4. In Section 5, we present results for two population models and analyze the Nash demand game. In the appendix, we show that our result for the exit problem can be used to study the stochastic stability problem. The appendix also provides the technical details and proofs of the paper's results.

Section snippets

Related literature

There are many recent contributions to the analysis of stochastic evolutionary dynamics.1

Basic setup: one population model

Consider a population of n agents who play a symmetric coordination game with strategy set S={1,2,, |S|} and payoff matrix A. The population state is described as a vector of fractions of agents using each strategy; that is, the state of the population is xΔ(n), where Δ(n) is the simplexΔ(n):={(x1,,x|S|)1nZ|S|:ix(i)=1,x(i)0for all}. The expected payoff to an agent who chooses strategy i at population state x is given by π(i,x):=jSAijx(j).

We consider a discrete time strategy updating

Exit from the basin of attraction of a convention: one population models

We now present our comparison principles for games with an arbitrary number of strategies for a finite population. Our first comparison principle shows that given two paths, xx(i,k)x(i,k)(m¯,l) and xx(m¯,k)x(i,k)(m¯,l), it always costs less (or the same) to first switch away from strategy m¯ and then to switch away from the other strategies, as already explained in Section 3 (Lemma 4.1). Our second comparison principle is based on the fact that the sizes of the two different positive

Two population models

For the application of our results to the Nash demand game, we briefly introduce the two population setup and state the result for two population models (Theorem 5.1), which is proved in a similar way to the one population model result (see Appendix B). Consider two populations denoted by α and β, consisting of the same number of agents n and a bimatrix game (Aα,Aβ), where Aκ is an |S|×|S| matrix for κ=α,β. An α-agent playing i against j obtains a payoff Aijα, while a β-agent playing j against i

Summary

Relying on positive feedback conditions and the relative strengths of these effects we developed methods to identify the most likely paths for evolutionary population dynamics under the logit rule. We identified two main factors determining the minimum cost path escaping from a convention: (1) the existence of positive feedback effects, and (2) the relative strengths of positive feedback effects. This leads us to simple but powerful comparison principles that drastically reduce the number of

References (44)

  • R. McKelvey et al.

    Quantal response equilibria for normal form games

    Games Econ. Behav.

    (1995)
  • D. Myatt et al.

    A multinomial probit model of stochastic evolution

    J. Econ. Theory

    (2003)
  • H. Nax et al.

    Risk attitudes and risk dominance in the long run

    Games Econ. Behav.

    (2019)
  • D. Okada et al.

    Log-linear dynamics and local potential

    J. Econ. Theory

    (2012)
  • M. Peski

    Generalized risk-dominance and asymmetric dynamics

    J. Econ. Theory

    (2010)
  • W. Sandholm

    Decompositions and potentials for normal form games

    Games Econ. Behav.

    (2010)
  • R. Sawa et al.

    Prospect dynamics and loss dominance

    Games Econ. Behav.

    (2018)
  • M. Staudigl

    Stochastic stability in asymmetric binary choice coordination games

    Games Econ. Behav.

    (2012)
  • H.P. Young

    An evolutionary model of bargaining

    J. Econ. Theory

    (1993)
  • D. Acemoglu et al.

    Institutions as a fundamental cause of long-run growth

  • M. Belloc et al.

    The persistence of inferior cultural-institutional conventions

    Am. Econ. Rev. Pap. Proc.

    (2013)
  • E. Bilancini et al.

    The evolution of conventions under condition-dependent mistakes

    Econ. Theory

    (2020)
  • Cited by (5)

    • Statistical inference in evolutionary dynamics

      2023, Games and Economic Behavior
      Citation Excerpt :

      The equilibrium selection problem in coordination games lies at the heart of evolutionary game theory starting from Foster and Young (1990), Young (1993) and Kandori et al. (1993). See more recent contributions by Maruta (2002), Blume (2003), Peski (2010), Sandholm (2010), Neary (2012), Staudigl (2012), Staudigl and Weidenholzer (2014), Kreindler and Young (2013) Sandholm and Staudigl (2016), Hwang and Newton (2017), Sawa and Wu (2018a,b), Nax and Newton (2019), Bilancini and Boncinelli (2020), Hwang and Rey-Bellet (2021), Newton (2021), and Rozzi (2021), among many others. We investigate if our Bayesian dynamic has any equilibrium selection property.

    • Expected utility versus cumulative prospect theory in an evolutionary model of bargaining

      2022, Journal of Economic Dynamics and Control
      Citation Excerpt :

      However, when the errors are intentional (as described earlier), or coalitions of individuals may form, then Naidu et al. (2010) and Newton (2012a), respectively, recover the Nash bargaining solution as the stochastically stable outcome in contract games. On the other hand, while Hwang et al. (2018) consider logit dynamics in the contract game and show that the emergence of the Egalitarian bargaining solution (Kalai, 1977) when errors are intentional, and a logit bargaining solution when the errors are uniform, Hwang and Rey-Bellet (2021) demonstrate the stochastic stability of the Nash bargaining solution in the bargaining game under logit dynamics with uniform errors but not under the logit dynamics with intentional errors. In the context of analysing the effect of (behavioural) biases on the outcome of bargaining, Brito et al. (1977), Thomson (1981), and Gupta and Livne (1988), analyse the effect of reference points in the axiomatic approach to bargaining by Nash (1950) while Gimpel (2007) explores how the preference of ‘negotiators’ change in multi-attribute negotiations.

    The research of S.-H. H. was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2019S1A5A8035341). The research of L. R.-B. was supported by the US National Science Foundation (DMS-1515712). We greatly appreciate comments by the advisory editor and two anonymous referees. Especially, we would like to express special thanks to the late Bill Sandholm who carefully read the early version (http://arxiv.org/abs/OOO) of this paper and generously offered us many helpful suggestions.

    View full text