-
A Multivariate Complexity Analysis of the Material Consumption Scheduling Problem arXiv.cs.GT Pub Date : 2021-02-26 Matthias Bentert; Robert Bredereck; Péter Györgyi; Andrzej Kaczmarczyk; Rolf Niedermeier
The NP-hard MATERIAL CONSUMPTION SCHEDULING Problem and closely related problems have been thoroughly studied since the 1980's. Roughly speaking, the problem deals with minimizing the makespan when scheduling jobs that consume non-renewable resources. We focus on the single-machine case without preemption: from time to time, the resources of the machine are (partially) replenished, thus allowing for
-
Revelation Gap for Pricing from Samples arXiv.cs.GT Pub Date : 2021-02-26 Yiding Feng; Jason D. Hartline; Yingkai Li
This paper considers prior-independent mechanism design, in which a single mechanism is designed to achieve approximately optimal performance on every prior distribution from a given class. Most results in this literature focus on mechanisms with truthtelling equilibria, a.k.a., truthful mechanisms. Feng and Hartline (2018) introduce the revelation gap to quantify the loss of the restriction to truthful
-
Classifying Convergence Complexity of Nash Equilibria in Graphical Games Using Distributed Computing Theory arXiv.cs.GT Pub Date : 2021-02-26 Juho Hirvonen; Laura Schmid; Krishnendu Chatterjee; Stefan Schmid
Graphical games are a useful framework for modeling the interactions of (selfish) agents who are connected via an underlying topology and whose behaviors influence each other. They have wide applications ranging from computer science to economics and biology. Yet, even though a player's payoff only depends on the actions of their direct neighbors in graphical games, computing the Nash equilibria and
-
Are Gross Substitutes a Substitute for Submodular Valuations? arXiv.cs.GT Pub Date : 2021-02-26 Shahar Dozinski; Uriel Feige; Michal Feldman
The class of gross substitutes (GS) set functions plays a central role in Economics and Computer Science. GS belongs to the hierarchy of {\em complement free} valuations introduced by Lehmann, Lehmann and Nisan, along with other prominent classes: $GS \subsetneq Submodular \subsetneq XOS \subsetneq Subadditive$. The GS class has always been more enigmatic than its counterpart classes, both in its definition
-
Optimal Pricing of Information arXiv.cs.GT Pub Date : 2021-02-26 Shuze Liu; Weiran Shen; Haifeng Xu
A decision maker looks to take an active action (e.g., purchase some goods or make an investment). The payoff of this active action depends on his own private type as well as a random and unknown state of nature. To decide between this active action and another passive action, which always leads to a safe constant utility, the decision maker may purchase information from an information seller. The
-
Evolution of collective fairness in complex networks through degree-based role assignment arXiv.cs.GT Pub Date : 2021-02-26 Andreia Sofia Teixeira; Francisco C. Santos; Alexandre P. Francisco; Fernando P. Santos
From social contracts to climate agreements, individuals engage in groups that must collectively reach decisions with varying levels of equality and fairness. These dilemmas also pervade Distributed Artificial Intelligence, in domains such as automated negotiation, conflict resolution or resource allocation. As evidenced by the well-known Ultimatum Game -- where a Proposer has to divide a resource
-
Discord and Harmony in Networks arXiv.cs.GT Pub Date : 2021-02-26 Andrea Galeotti; Benjamin Golub; Sanjeev Goyal; Rithvik Rao
Consider a coordination game played on a network, where agents prefer taking actions closer to those of their neighbors and to their own ideal points in action space. We explore how the welfare outcomes of a coordination game depend on network structure and the distribution of ideal points throughout the network. To this end, we imagine a benevolent or adversarial planner who intervenes, at a cost
-
An Online Learning Approach to Interpolation and Extrapolation in Domain Generalization arXiv.cs.GT Pub Date : 2021-02-25 Elan Rosenfeld; Pradeep Ravikumar; Andrej Risteski
A popular assumption for out-of-distribution generalization is that the training data comprises sub-datasets, each drawn from a distinct distribution; the goal is then to "interpolate" these distributions and "extrapolate" beyond them -- this objective is broadly known as domain generalization. A common belief is that ERM can interpolate but not extrapolate and that the latter is considerably more
-
A Fragile multi-CPR Game arXiv.cs.GT Pub Date : 2021-02-25 Christos Pelekis; Panagiotis Promponas; Juan Alvarado; Eirini-Eleni Tsiropoulou; Symeon Papavassiliou
A Fragile CPR Game is an instance of a resource sharing game where a common-pool resource, which is prone to failure due to overuse, is shared among several players. Each player has a fixed initial endowment and is faced with the task of investing in the common-pool resource without forcing it to fail. The return from the common-pool resource is subject to uncertainty and is perceived by the players
-
Phragmén's Voting Methods and Justified Representation arXiv.cs.GT Pub Date : 2021-02-24 Markus Brill; Rupert Freeman; Svante Janson; Martin Lackner
In the late 19th century, Swedish mathematician Lars Edvard Phragm\'{e}n proposed a load-balancing approach for selecting committees based on approval ballots. We consider three committee voting rules resulting from this approach: two optimization variants -- one minimizing the maximal load and one minimizing the variance of loads -- and a sequential variant. We study Phragm\'{e}n's methods from an
-
Equal Affection or Random Selection: the Quality of Subjective Feedback from a Group Perspective arXiv.cs.GT Pub Date : 2021-02-24 Jiale Chen; Yuqing Kong; Yuxuan Lu
In the setting where a group of agents is asked a single subjective multi-choice question (e.g. which one do you prefer? cat or dog?), we are interested in evaluating the quality of the collected feedback. However, the collected statistics are not sufficient to reflect how informative the feedback is since fully informative feedback (equal affection of the choices) and fully uninformative feedback
-
Convergence of Bayesian Nash Equilibrium in Infinite Bayesian Games under Discretization arXiv.cs.GT Pub Date : 2021-02-24 Linan Huang; Quanyan Zhu
We prove the existence of Bayesian Nash Equilibrium (BNE) of general-sum Bayesian games with continuous types and finite actions under the conditions that the utility functions and the prior type distributions are continuous concerning the players' types. Moreover, there exists a sequence of discretized Bayesian games whose BNE strategies converge weakly to a BNE strategy of the infinite Bayesian game
-
Dynamic Games among Teams with Delayed Intra-Team Information Sharing arXiv.cs.GT Pub Date : 2021-02-23 Dengwang Tang; Hamidreza Tavafoghi; Vijay Subramanian; Ashutosh Nayyar; Demosthenis Teneketzis
We analyze a class of stochastic dynamic games among teams with asymmetric information, where members of a team share their observations internally with a delay of $d$. Each team is associated with a controlled Markov Chain, whose dynamics are coupled through the players' actions. These games exhibit challenges in both theory and practice due to the presence of signaling and the increasing domain of
-
A Refined Complexity Analysis of Fair Districting over Graphs arXiv.cs.GT Pub Date : 2021-02-23 Niclas Boehmer; Tomohiro Koana; Rolf Niedermeier
We study the NP-hard Fair Connected Districting problem: Partition a vertex-colored graph into k connected components (subsequently referred to as districts) so that in each district the most frequent color occurs at most a given number of times more often than the second most frequent color. Fair Connected Districting is motivated by various real-world scenarios where agents of different types, which
-
Multi-Sided Matching Markets with Consistent Preferences and Cooperative Partners arXiv.cs.GT Pub Date : 2021-02-23 Maximilian Mordig; Riccardo Della Vecchia; Nicolò Cesa-Bianchi; Bernhard Schölkopf
We introduce a variant of the three-sided stable matching problem for a PhD market with students, advisors, and co-advisors. In our formalization, students have consistent (lexicographic) preferences over advisors and co-advisors, and the latter have preferences over students only (hence advisors and co-advisors are cooperative). A student must be matched to one advisor and one co-advisor, or not at
-
New Characterizations of Strategy-Proofness under Single-Peakedness arXiv.cs.GT Pub Date : 2021-02-23 Andrew Jennings; Rida Laraki; Clemens Puppe; Estelle Varloot
We provide novel simple representations of strategy-proof voting rules when voters have uni-dimensional single-peaked preferences (as well as multi-dimensional separable preferences). The analysis recovers, links and unifies existing results in the literature such as Moulin's classic characterization in terms of phantom voters and Barber\`a, Gul and Stacchetti's in terms of winning coalitions ("generalized
-
Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games arXiv.cs.GT Pub Date : 2021-02-23 Yu Bai; Chi Jin; Huan Wang; Caiming Xiong
Real world applications such as economics and policy making often involve solving multi-agent games with two unique features: (1) The agents are inherently asymmetric and partitioned into leaders and followers; (2) The agents have different reward functions, thus the game is general-sum. The majority of existing results in this field focuses on either symmetric solution concepts (e.g. Nash equilibrium)
-
A theory of Automated Market Makers in DeFi arXiv.cs.GT Pub Date : 2021-02-22 Massimo Bartoletti; James Hsin-yu Chiang; Alberto Lluch-Lafuente
Automated market makers (AMMs) are one of the most prominent decentralized finance (DeFi) applications. They allow users to exchange units of different types of crypto-assets, without the need to find a counter-party. There are several implementations and models for AMMs, featuring a variety of sophisticated economic mechanisms. We present a theory of AMMs. The core of our theory is an abstract operational
-
Mutual information-based group explainers with coalition structure for machine learning model explanations arXiv.cs.GT Pub Date : 2021-02-22 Alexey Miroshnikov; Konstandinos Kotsiopoulos; Arjun Ravi Kannan
In this article, we propose and investigate ML group explainers in a general game-theoretic setting with the focus on coalitional game values and games based on the conditional and marginal expectation of an ML model. The conditional game takes into account the joint distribution of the predictors, while the marginal game depends on the structure of the model. The objective of the article is to unify
-
An agile and distributed mechanism for inter-domain network slicing in next generation mobile networks arXiv.cs.GT Pub Date : 2021-02-21 Jalal Khamse-Ashari; Gamini Senarath; Irem Bor-Yaliniz; Halim Yanikomeroglu
Network slicing is emerging as a promising method to provide sought-after versatility and flexibility to cope with ever-increasing demands. To realize such potential advantages and to meet the challenging requirements of various network slices in an on-demand fashion, we need to develop an agile and distributed mechanism for resource provisioning to different network slices in a heterogeneous multi-resource
-
(Almost Full) EFX Exists for Four Agents (and Beyond) arXiv.cs.GT Pub Date : 2021-02-21 Ben Berger; Avi Cohen; Michal Feldman; Amos Fiat
The existence of EFX allocations is a major open problem in fair division, even for additive valuations. The current state of the art is that no setting where EFX allocations are impossible is known, and EFX is known to exist for ($i$) agents with identical valuations, ($ii$) 2 agents, ($iii$) 3 agents with additive valuations, ($iv$) agents with one of two additive valuations and ($v$) agents with
-
A Game-Theoretic Approach for Hierarchical Policy-Making arXiv.cs.GT Pub Date : 2021-02-21 Feiran Jia; Aditya Mate; Zun Li; Shahin Jabbari; Mithun Chakraborty; Milind Tambe; Michael Wellman; Yevgeniy Vorobeychik
We present the design and analysis of a multi-level game-theoretic model of hierarchical policy-making, inspired by policy responses to the COVID-19 pandemic. Our model captures the potentially mismatched priorities among a hierarchy of policy-makers (e.g., federal, state, and local governments) with respect to two main cost components that have opposite dependence on the policy strength, such as post-intervention
-
Dynamical Analysis of the EIP-1559 Ethereum Fee Market arXiv.cs.GT Pub Date : 2021-02-21 Stefanos Leonardos; Barnabé Monnot; Daniël Reijsbergen; Stratis Skoulakis; Georgios Piliouras
Participation in permissionless blockchains results in competition over system resources, which needs to be controlled with fees. Ethereum's current fee mechanism is implemented via a first-price auction that results in unpredictable fees as well as other inefficiencies. EIP-1559 is a recent, improved proposal that introduces a number of innovative features such as a dynamically adaptive base fee that
-
Contextual First-Price Auctions with Budgets arXiv.cs.GT Pub Date : 2021-02-20 Santiago Balseiro; Christian Kroer; Rachitesh Kumar
The internet advertising market is a multi-billion dollar industry, in which advertisers buy thousands of ad placements every day by repeatedly participating in auctions. In recent years, the industry has shifted to first-price auctions as the preferred paradigm for selling advertising slots. Another important and ubiquitous feature of these auctions is the presence of campaign budgets, which specify
-
Mechanism Design Powered by Social Interactions arXiv.cs.GT Pub Date : 2021-02-20 Dengji Zhao
Mechanism design has traditionally assumed that the set of participants are fixed and known to the mechanism (the market owner) in advance. However, in practice, the market owner can only directly reach a small number of participants (her neighbours). Hence the owner often needs costly promotions to recruit more participants in order to get desirable outcomes such as social welfare or revenue maximization
-
Learning to Persuade on the Fly: Robustness Against Ignorance arXiv.cs.GT Pub Date : 2021-02-19 You Zu; Krishnamurthy Iyer; Haifeng Xu
We study a repeated persuasion setting between a sender and a receiver, where at each time $t$, the sender observes a payoff-relevant state drawn independently and identically from an unknown prior distribution, and shares state information with the receiver, who then myopically chooses an action. As in the standard setting, the sender seeks to persuade the receiver into choosing actions that are aligned
-
Online Learning via Offline Greedy Algorithms: Applications in Market Design and Optimization arXiv.cs.GT Pub Date : 2021-02-18 Rad NiazadehChicago Booth School of Business, Operations Management; Negin GolrezaeiMIT Sloan School of Management, Operations Management; Joshua WangGoogle Research Mountain View; Fransisca SusanMIT Sloan School of Management, Operations Management; Ashwinkumar BadanidiyuruGoogle Research Mountain View
Motivated by online decision-making in time-varying combinatorial environments, we study the problem of transforming offline algorithms to their online counterparts. We focus on offline combinatorial problems that are amenable to a constant factor approximation using a greedy algorithm that is robust to local errors. For such problems, we provide a general framework that efficiently transforms offline
-
Dealing with Non-Stationarity in Multi-Agent Reinforcement Learning via Trust Region Decomposition arXiv.cs.GT Pub Date : 2021-02-21 Wenhao Li; Xiangfeng Wang; Bo Jin; Junjie Sheng; Hongyuan Zha
Non-stationarity is one thorny issue in multi-agent reinforcement learning, which is caused by the policy changes of agents during the learning procedure. Current works to solve this problem have their own limitations in effectiveness and scalability, such as centralized critic and decentralized actor (CCDA), population-based self-play, modeling of others and etc. In this paper, we novelly introduce
-
Mastering Terra Mystica: Applying Self-Play to Multi-agent Cooperative Board Games arXiv.cs.GT Pub Date : 2021-02-21 Luis Perez
In this paper, we explore and compare multiple algorithms for solving the complex strategy game of Terra Mystica, hereafter abbreviated as TM. Previous work in the area of super-human game-play using AI has proven effective, with recent break-through for generic algorithms in games such as Go, Chess, and Shogi \cite{AlphaZero}. We directly apply these breakthroughs to a novel state-representation of
-
Arena-Independent Finite-Memory Determinacy in Stochastic Games arXiv.cs.GT Pub Date : 2021-02-19 Patricia Bouyer; Youssouf Oualhadj; Mickael Randour; Pierre Vandenhove
We study stochastic zero-sum games on graphs, which are prevalent tools to model decision-making in presence of an antagonistic opponent in a random environment. In this setting, an important question is the one of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent
-
Gerrymandering on graphs: Computational complexity and parameterized algorithms arXiv.cs.GT Pub Date : 2021-02-19 Sushmita Gupta; Pallavi Jain; Fahad; Panolan; Sanjukta Roy; Saket Saurabh
Partitioning a region into districts to favor a particular candidate or a party is commonly known as gerrymandering. In this paper, we investigate the gerrymandering problem in graph theoretic setting as proposed by Cohen-Zemach et al. [AAMAS 2018]. Our contributions in this article are two-fold, conceptual and computational. We first resolve the open question posed by Ito et al. [AAMAS 2019] about
-
The Effectiveness of Subsidies and Tolls in Congestion Games arXiv.cs.GT Pub Date : 2021-02-18 Bryce L. Ferguson; Philip N. Brown; Jason R. Marden
Are rewards or penalties more effective in influencing user behavior? This work compares the effectiveness of subsidies and tolls in incentivizing user behavior in congestion games. The predominantly studied method of influencing user behavior in network routing problems is to institute taxes which alter users' observed costs in a manner that causes their self-interested choices to more closely align
-
The Flip Schelling Process on Random Geometric and Erdös-Rényi Graphs arXiv.cs.GT Pub Date : 2021-02-19 Thomas Bläsius; Tobias Friedrich; Martin S. Krejca; Louise Molitor
Schelling's classical segregation model gives a coherent explanation for the wide-spread phenomenon of residential segregation. We consider an agent-based saturated open-city variant, the Flip Schelling Process (FSP), in which agents, placed on a graph, have one out of two types and, based on the predominant type in their neighborhood, decide whether to changes their types; similar to a new agent arriving
-
Optimal Spectrum Partitioning and Licensing in Tiered Access under Stochastic Market Models arXiv.cs.GT Pub Date : 2021-02-18 Gourav Saha; Alhussein A. Abouzeid
We consider the problem of partitioning a spectrum band into M channels of equal bandwidth, and then further assigning these M channels into P licensed channels and M-P unlicensed channels. Licensed channels can be accessed both for licensed and opportunistic use following a tiered structure which has a higher priority for licensed use. Unlicensed channels can be accessed only for opportunistic use
-
On the Optimal Duration of Spectrum Leases in Exclusive License Markets with Stochastic Demand arXiv.cs.GT Pub Date : 2021-02-18 Gourav Saha; Alhussein A. Abouzeid; Zaheer Khan; Marja Matinmikko-Blue
This paper addresses the following question which is of interest in designing efficient exclusive-use spectrum licenses sold through spectrum auctions. Given a system model in which customer demand, revenue, and bids of wireless operators are characterized by stochastic processes and an operator is interested in joining the market only if its expected revenue is above a threshold and the lease duration
-
Efficient Carpooling and Toll Pricing for Autonomous Transportation arXiv.cs.GT Pub Date : 2021-02-18 Saurabh Amin; Patrick Jaillet; Manxi Wu
In this paper, we address the existence and computation of competitive equilibrium in the transportation market for autonomous carpooling first proposed by [Ostrovsky and Schwarz, 2019]. At equilibrium, the market organizes carpooled trips over a transportation network in a socially optimal manner and sets the corresponding payments for individual riders and toll prices on edges. The market outcome
-
Designing Approximately Optimal Search on Matching Platforms arXiv.cs.GT Pub Date : 2021-02-17 Nicole Immorlica; Brendan Lucier; Vahideh Manshadi; Alexander Wei
We study the design of a decentralized two-sided matching market in which agents' search is guided by the platform. There are finitely many agent types, each with (potentially random) preferences drawn from known type-specific distributions. Equipped with such distributional knowledge, the platform guides the search process by determining the meeting rate between each pair of types from the two sides
-
Linear Functions to the Extended Reals arXiv.cs.GT Pub Date : 2021-02-18 Bo Waggoner
This note investigates functions from $\mathbb{R}^d$ to $\mathbb{R} \cup \{\pm \infty\}$ that satisfy axioms of linearity wherever allowed by extended-value arithmetic. They have a nontrivial structure defined inductively on $d$, and unlike finite linear functions, they require $\Omega(d^2)$ parameters to uniquely identify. In particular they can capture vertical tangent planes to epigraphs: a function
-
Optimising Long-Term Outcomes using Real-World Fluent Objectives: An Application to Football arXiv.cs.GT Pub Date : 2021-02-18 Ryan Beal; Georgios Chalkiadakis; Timothy J. Norman; Sarvapali D. Ramchurn
In this paper, we present a novel approach for optimising long-term tactical and strategic decision-making in football (soccer) by encapsulating events in a league environment across a given time frame. We model the teams' objectives for a season and track how these evolve as games unfold to give a fluent objective that can aid in decision-making games. We develop Markov chain Monte Carlo and deep
-
Impartial selection with prior information arXiv.cs.GT Pub Date : 2021-02-17 Ioannis Caragiannis; George Christodoulou; Nicos Protopapas
We study the problem of {\em impartial selection}, a topic that lies at the intersection of computational social choice and mechanism design. The goal is to select the most popular individual among a set of community members. The input can be modeled as a directed graph, where each node represents an individual, and a directed edge indicates nomination or approval of a community member to another.
-
Gradient-Tracking over Directed Graphs for solving Leaderless Multi-Cluster Games arXiv.cs.GT Pub Date : 2021-02-18 Jan Zimmermann; Tatiana Tatarenko; Volker Willert; Jürgen Adamy
We are concerned with finding Nash Equilibria in agent-based multi-cluster games, where agents are separated into distinct clusters. While the agents inside each cluster collaborate to achieve a common goal, the clusters are considered to be virtual players that compete against each other in a non-cooperative game with respect to a coupled cost function. In such scenarios, the inner-cluster problem
-
L2E: Learning to Exploit Your Opponent arXiv.cs.GT Pub Date : 2021-02-18 Zhe Wu; Kai Li; Enmin Zhao; Hang Xu; Meng Zhang; Haobo Fu; Bo An; Junliang Xing
Opponent modeling is essential to exploit sub-optimal opponents in strategic interactions. Most previous works focus on building explicit models to directly predict the opponents' styles or strategies, which require a large amount of data to train the model and lack adaptability to unknown opponents. In this work, we propose a novel Learning to Exploit (L2E) framework for implicit opponent modeling
-
Understanding algorithmic collusion with experience replay arXiv.cs.GT Pub Date : 2021-02-18 Bingyan Han
In an infinitely repeated pricing game, pricing algorithms based on artificial intelligence (Q-learning) may consistently learn to charge supra-competitive prices even without communication. Although concerns on algorithmic collusion have arisen, little is known on underlying factors. In this work, we experimentally analyze the dynamics of algorithms with three variants of experience replay. Algorithmic
-
Stackelberg-Pareto Synthesis arXiv.cs.GT Pub Date : 2021-02-17 Véronique Bruyère; Jean-François Raskin; Clément Tamines
In this paper, we study the framework of two-player Stackelberg games played on graphs in which Player 0 announces a strategy and Player 1 responds rationally with a strategy that is an optimal response. While it is usually assumed that Player 1 has a single objective, we consider here the new setting where he has several. In this context, after responding with his strategy, Player 1 gets a payoff
-
Maximizing Social Welfare Subject to Network Externalities: A Unifying Submodular Optimization Approach arXiv.cs.GT Pub Date : 2021-02-17 S. Rasoul Etesami
We consider the problem of maximizing social welfare by allocating indivisible items to a set of agents subject to network externalities. We first provide a general formulation that captures some of the known models as a special case. We then show that the social welfare maximization problem benefits some nice sub-or supermodular properties. That allows us to devise simple polynomial-time approximation
-
Vote Delegation Favors Minority arXiv.cs.GT Pub Date : 2021-02-17 Hans Gersbach; Akaki Mamageishvili; Manvir Schneider
We examine vote delegation when delegators do not know the preferences of representatives. We show that free delegation favors minorities, that is, alternatives that have a lower chance of winning ex-ante. The same--but to a lesser degree--occurs if the number of voting rights actual voters can have is capped. However, when the fraction of delegators increases, the probability that the ex-ante minority
-
Vote Delegation and Misbehavior arXiv.cs.GT Pub Date : 2021-02-17 Hans Gersbach; Akaki Mamageishvili; Manvir Schneider
We study vote delegation with "well-behaving" and "misbehaving" agents and compare it with conventional voting. Typical examples are validation or governance tasks on blockchains. There is a majority of well-behaving agents, but since voting is costly, they may want to abstain or delegate their vote to other agents. Misbehaving agents always vote. We compare conventional voting allowing for abstention
-
A Game-theoretic Approach Towards Collaborative Coded Computation Offloading arXiv.cs.GT Pub Date : 2021-02-17 Jer Shyuan Ng; Wei Yang Bryan Lim; Zehui Xiong; Dusit Niyato; Cyril Leung; Dong In Kim; Junshan Zhang; Qiang Yang
Coded distributed computing (CDC) has emerged as a promising approach because it enables computation tasks to be carried out in a distributed manner while mitigating straggler effects, which often account for the long overall completion times. Specifically, by using polynomial codes, computed results from only a subset of edge servers can be used to reconstruct the final result. However, incentive
-
Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games arXiv.cs.GT Pub Date : 2021-02-17 Yulai Zhao; Yuandong Tian; Jason D. Lee; Simon S. Du
Policy gradient methods are widely used in solving two-player zero-sum games to achieve superhuman performance in practice. However, it remains elusive when they can provably find a near-optimal solution and how many samples and iterations are needed. The current paper studies natural extensions of Natural Policy Gradient algorithm for solving two-player zero-sum games where function approximation
-
A market-based approach for enabling inter-area reserve exchange arXiv.cs.GT Pub Date : 2021-02-17 Orcun Karaca; Stefanos Delikaraoglou; Maryam Kamgarpour
Considering the sequential clearing of energy and reserves in Europe, enabling inter-area reserve exchange requires optimally allocating inter-area transmission capacities between these two markets. To achieve this, we provide a market-based allocation framework and derive payments with desirable properties. The proposed min-max least core selecting payments achieve individual rationality, budget balance
-
Optimal intervention in traffic networks arXiv.cs.GT Pub Date : 2021-02-16 Leonardo Cianfanelli; Giacomo Como; Asuman Ozdaglar; Francesca Parise
We present an efficient algorithm to identify which edge should be improved in a traffic network to minimize the total travel time. Our main result is to show that it is possible to approximate the variation of total travel time obtained by changing the congestion coefficient of any given edge, by performing only local computations, without the need of recomputing the entire equilibrium flow. To obtain
-
DO-GAN: A Double Oracle Framework for Generative Adversarial Networks arXiv.cs.GT Pub Date : 2021-02-17 Aye Phyu Phyu Aung; Xinrun Wang; Runsheng Yu; Bo An; Senthilnath Jayavelu; Xiaoli Li
In this paper, we propose a new approach to train Generative Adversarial Networks (GANs) where we deploy a double-oracle framework using the generator and discriminator oracles. GAN is essentially a two-player zero-sum game between the generator and the discriminator. Training GANs is challenging as a pure Nash equilibrium may not exist and even finding the mixed Nash equilibrium is difficult as GANs
-
Fair division of graphs and of tangled cakes arXiv.cs.GT Pub Date : 2021-02-17 Ayumi Igarashi; William S. Zwicker
A tangle is a connected topological space constructed by gluing several copies of the unit interval $[0, 1]$. We explore which tangles guarantee envy-free allocations of connected shares for n agents, meaning that such allocations exist no matter which monotonic and continuous functions represent agents' valuations. Each single tangle $\mathcal{T}$ corresponds in a natural way to an infinite topological
-
Complex Momentum for Learning in Games arXiv.cs.GT Pub Date : 2021-02-16 Jonathan Lorraine; David Acuna; Paul Vicol; David Duvenaud
We generalize gradient descent with momentum for learning in differentiable games to have complex-valued momentum. We give theoretical motivation for our method by proving convergence on bilinear zero-sum games for simultaneous and alternating updates. Our method gives real-valued parameter updates, making it a drop-in replacement for standard optimizers. We empirically demonstrate that complex-valued
-
Follow-the-Regularizer-Leader Routes to Chaos in Routing Games arXiv.cs.GT Pub Date : 2021-02-16 Jakub Bielawski; Thiparat Chotibut; Fryderyk Falniowski; Grzegorz Kosiorowski; Michał Misiurewicz; Georgios Piliouras
We study the emergence of chaotic behavior of Follow-the-Regularized Leader (FoReL) dynamics in games. We focus on the effects of increasing the population size or the scale of costs in congestion games, and generalize recent results on unstable, chaotic behaviors in the Multiplicative Weights Update dynamics to a much larger class of FoReL dynamics. We establish that, even in simple linear non-atomic
-
Interim envy-freeness: A new fairness concept for random allocations arXiv.cs.GT Pub Date : 2021-02-15 Ioannis Caragiannis; Panagiotis Kanellopoulos; Maria Kyropoulou
With very few exceptions, research in fair division has mostly focused on deterministic allocations. Deviating from this trend, we define and study the novel notion of interim envy-freeness (iEF) for lotteries over allocations, which aims to serve as a sweet spot between the too stringent notion of ex-post envy-freeness and the very weak notion of ex-ante envy-freeness. Our new fairness notion is a
-
Best vs. All: Equity and Accuracy of Standardized Test Score Reporting arXiv.cs.GT Pub Date : 2021-02-15 Sampath Kannan; Mingzi Niu; Aaron Roth; Rakesh Vohra
We study a game theoretic model of standardized testing for college admissions. Students are of two types; High and Low. There is a college that would like to admit the High type students. Students take a potentially costly standardized exam which provides a noisy signal of their type. The students come from two populations, which are identical in talent (i.e. the type distribution is the same), but
-
Efficient Competitions and Online Learning with Strategic Forecasters arXiv.cs.GT Pub Date : 2021-02-16 Rafael Frongillo; Robert Gomez; Anish Thilagar; Bo Waggoner
Winner-take-all competitions in forecasting and machine-learning suffer from distorted incentives. Witkowskiet al. identified this problem and proposed ELF, a truthful mechanism to select a winner. We show that, from a pool of $n$ forecasters, ELF requires $\Theta(n\log n)$ events or test data points to select a near-optimal forecaster with high probability. We then show that standard online learning
-
The Dependent Chip Model (DCM): a simple and more realistic alternative to the Independent Chip Model (ICM) arXiv.cs.GT Pub Date : 2021-02-15 E. Besalú
The Dependent Chip Model (DCM) is proposed as an alternative to the Independent Chip Model (ICM) usually employed in poker tournament negotiations. DCM constitutes a recursive exploration of a multiplayer Texas hold'em poker game tree tracking. The DCM procedure considers all players as having exactly the same playing skills and probabilities to win a single poker hand, but submitted to their stacks
-
RPPLNS: Pay-per-last-N-shares with a Randomised Twist arXiv.cs.GT Pub Date : 2021-02-15 Jonathan Katz; Philip Lazos; Francisco J. Marmolejo-Cossío; Xinyu Zhou
"Pay-per-last-$N$-shares" (PPLNS) is one of the most common payout strategies used by mining pools in Proof-of-Work (PoW) cryptocurrencies. As with any payment scheme, it is imperative to study issues of incentive compatibility of miners within the pool. For PPLNS this question has only been partially answered; we know that reasonably-sized miners within a PPLNS pool prefer following the pool protocol
Contents have been reproduced by permission of the publishers.