There are two equivalent ways to understand the best response property of a Nash Equilibrium strategy. First, we can say that the player plays a mixed strategy whose expected payoff is maximal among all possible mixed strategies. Second, we can say that the player randomly chooses a pure strategy from the set of pure strategies whose expected payoff is maximal among all possible pure strategies.

So far so good, and every student of game theory is aware of this equivalence. What I think is less known is that the two perspectives are not identical for ${\epsilon}$-best response and ${\epsilon}$-equilibrium: A mixed strategy whose expected payoff is almost optimal might put some positive (though small) probability on a pure strategy which gives a horrible payoff. In this post I am going to explain why I used to think the difference between the two perspectives is inconsequential, and why, following a conversation with Ayala Mashiah-Yaakovi about her work on subgame perfect equilibrium in Borel games, I changed my mind.

A (two player) normal form game is given by Borel sets ${A,B}$ of pure strategies of Alice and Bob and Borel functions ${f^A,f^B:A\times B\rightarrow [0,1]}$, the payoff functions of Alice and Bob. Elements of ${\Delta(A)}$ and ${\Delta(B)}$ are called mixed strategies. As usual, we extend the domain of the payoff functions to mixed strategies and identify the pure strategy ${a\in A}$ with the mixed strategy that plays ${a}$ with probability ${1}$.

Let ${\beta\in\Delta(B)}$ be a mixed strategy of Bob and let ${\epsilon>0}$. A mixed strategy ${\alpha\in\Delta(A)}$ of Alice is an ${\epsilon}$-best response to ${\beta}$ if ${f^A(\alpha,\beta)+\epsilon\geq f^A(\alpha',\beta)}$ for every ${\alpha'\in \Delta(A)}$. A mixed strategy ${\alpha\in\Delta(A)}$ of Alice is a strong ${\epsilon}$-best response to ${\beta}$ if

$\displaystyle \alpha\bigl(\left\{a\in A \mid a\text{ is an }\epsilon\text{-best response to }\beta\right\}\bigr)=1.$

The first definition is standard, and the second is a definition that I invented for this post. We can now distinguish between ${\epsilon}$-equilibrium (a mixed strategy profile under which every player ${\epsilon}$-best responds to the opponent) and strong ${\epsilon}$-equilibrium (a mixed strategy profile under which every player strongly ${\epsilon}$-best responds to the opponent). Every strong ${\epsilon}$-equilibrium is in particular an ${\epsilon}$-equilibrium, but the converse is not true.
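To see the gap concretely, here is a minimal one-player sketch (the payoffs, names, and value of ${\epsilon}$ are my own toy choices, not from any paper): a mixed strategy that puts weight ${\epsilon}$ on a worthless pure strategy is still an ${\epsilon}$-best response, but it is not a strong one.

```python
from fractions import Fraction as F

# Bob's strategy is fixed; Alice has two pure strategies, one good, one bad.
# (Toy payoffs of my own choosing, just to separate the two definitions.)
eps = F(1, 100)
payoff = {"good": F(1), "bad": F(0)}
best = max(payoff.values())          # optimal pure payoff = 1

# A mixed strategy that puts a small weight eps on the bad strategy.
alpha = {"good": 1 - eps, "bad": eps}
mixed_payoff = sum(alpha[a] * payoff[a] for a in alpha)

# eps-best response: expected payoff within eps of the optimum.
# Holds, since (1 - eps) + eps = 1 >= best.
assert mixed_payoff + eps >= best

# Strong eps-best response: ALL mass on pure eps-best responses.
# Fails: "bad" is not an eps-best response, yet alpha gives it weight eps.
eps_best = {a for a, p in payoff.items() if p + eps >= best}
mass = sum(alpha[a] for a in eps_best)
assert eps_best == {"good"} and mass == 1 - eps < 1
```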

Personally, I strongly prefer strong ${\epsilon}$-equilibrium. As I explained in another post, I don’t care much for the idea that players toss coins to decide which action to play. The way I prefer to interpret mixed strategies in Nash equilibrium follows Harsanyi’s purification idea: the game is replaced with a game of incomplete information in which each type of Alice plays a pure best response. The mixture is not the product of a randomization performed by Alice, but of the fact that Nature randomly chooses some type of Alice. Now if we assume that each type ${\epsilon}$-best responds, we arrive at the concept of strong ${\epsilon}$-equilibrium.

How could I live in harmony with all the game theorists who use the standard definition of ${\epsilon}$-equilibrium in their papers? Well, I console myself with the following proposition:

Proposition 1 The following conditions are equivalent for every normal form game:

1. The game admits an ${\epsilon}$-equilibrium for every ${\epsilon>0}$.
2. The game admits a strong ${\epsilon}$-equilibrium for every ${\epsilon>0}$.

I never actually wrote down a proof of this proposition, so let’s agree that it is easy to verify. So up to now, when I read and wrote papers that prove the existence of ${\epsilon}$-equilibrium, I always interpreted them as papers that prove the existence of strong ${\epsilon}$-equilibrium. But as I said, I now think that things get more subtle when we look at extensive form games.
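One way to make the proposition plausible (for finite games, one player at a time; the full two-player bookkeeping is something I have not checked carefully either) is a Markov-type bound: a mixed strategy whose payoff is within ${\epsilon^2}$ of optimal can put mass at most ${\epsilon}$ on pure strategies that are not ${\epsilon}$-best responses, so conditioning it on the ${\epsilon}$-best responses costs little. A numerical sketch with toy payoffs of my own:

```python
from fractions import Fraction as F

# Alice's pure-strategy payoffs against Bob's fixed strategy (toy numbers).
payoff = [F(1), F(95, 100), F(9, 10), F(0)]   # the last strategy is terrible
v = max(payoff)                                # best pure payoff
eps = F(1, 10)

# A mixed strategy whose expected payoff is within eps^2 of optimal:
alpha = [F(99, 100), F(5, 1000), F(0), F(5, 1000)]
loss = v - sum(p * q for p, q in zip(alpha, payoff))
assert loss <= eps ** 2

# Markov-type bound: the mass on pure strategies that are NOT eps-best
# responses is at most (expected loss) / eps <= eps.
bad_mass = sum(p for p, q in zip(alpha, payoff) if q + eps < v)
assert bad_mass <= loss / eps <= eps

# Conditioning alpha on the eps-best responses yields a strong eps-best
# response, and its expected payoff is still within eps of optimal.
good = [(p, q) for p, q in zip(alpha, payoff) if q + eps >= v]
total = sum(p for p, _ in good)
strong_alpha_payoff = sum(p * q for p, q in good) / total
assert v - strong_alpha_payoff <= eps
```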

The following gem was discovered by Eilon Solan and Nicolas Vieille (pdf, Example 3): A perfect information game between two players is played in periods. At even periods ${n=0,2,\dots}$ Alice can quit the game or continue. At odd periods ${n=1,3,\dots}$ Bob can quit or continue. If at some period Alice quits, the game terminates with payoffs ${(-1,2)}$ (that is, ${-1}$ to Alice and ${2}$ to Bob). If at some period Bob quits, the game terminates with payoffs ${(-2,1)}$. If nobody ever quits, the final payoff is ${(0,0)}$.

A pure strategy of a player tells them at every period whether to quit or continue. The game has a Nash equilibrium according to which both players quit at every period. This Nash equilibrium is, however, not subgame perfect: If the unthinkable happens and Alice does not quit at period ${0}$, then Bob’s strategy tells him to quit at period ${1}$, but in fact it is better for him to continue and let Alice quit at period ${2}$. On the other hand, Bob must threaten to quit, or Alice will just continue forever.

What are we left with? This game has no pure strategy subgame perfect equilibrium, nor even a pure strategy subgame perfect ${\epsilon}$-equilibrium (again, easy to verify). However, Eilon and Nicolas prove that the game has a subgame perfect ${\epsilon}$-equilibrium in mixed strategies! Alice quits at every period and Bob quits with probability ${\epsilon}$ at every period.

If you are still reading and have managed to convince yourself that this is indeed a subgame perfect ${\epsilon}$-equilibrium, then you might feel a bit uncomfortable with the concept of subgame perfect ${\epsilon}$-equilibrium. If not, put yourself in Bob’s shoes in the game that starts at period 1: Eilon and Nicolas have just stopped by, recommending that both you and Alice follow their ${\epsilon}$-equilibrium strategy profile. So you are supposed to toss a coin with probability ${\epsilon}$ for Heads and quit if the outcome is Heads. As with Nash equilibrium, you ask yourself whether it makes sense to follow the recommendation given that Alice follows it. Ex ante, before you toss the coin, the recommendation seems reasonable. You don’t mind promising to quit if your coin comes up Heads, given that this is unlikely to happen anyway. But in the interim stage, after the coin comes up Heads, you are asked to quit even though Alice will quit at the next round, so by quitting you lose much more than ${\epsilon}$. I would shirk, but then again, I voted for strong ${\epsilon}$-equilibrium.
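A quick sanity check of the numbers (my own code; the payoffs are those of the example above): under the profile, Bob’s mixed recommendation is ex ante an ${\epsilon}$-best response, but the pure strategy he is asked to play after Heads loses a full unit of payoff.

```python
from fractions import Fraction as F

eps = F(1, 100)

# Profile from the Solan-Vieille example: Alice quits at every even
# period; Bob quits with probability eps at every odd period.
# Subgame at period 1 (Bob to move, Alice will quit at period 2):
#   Bob quits now  -> payoffs (-2, 1)
#   Bob continues  -> Alice quits at period 2 -> payoffs (-1, 2)

# Bob's expected payoff under the mixed recommendation:
bob_mixed = eps * 1 + (1 - eps) * 2   # = 2 - eps
bob_best = 2                          # continue for sure
assert bob_mixed + eps >= bob_best    # ex ante: an eps-best response

# Interim: once the coin shows Heads, "quit" yields 1 while deviating
# to "continue" yields 2 -- a loss of 1, far more than eps.
assert bob_best - 1 == 1 > eps

# On path, Alice's part checks out too: quitting at period 0 yields -1,
# continuing yields eps*(-2) + (1-eps)*(-1) = -1 - eps, which is worse.
alice_continue = eps * (-2) + (1 - eps) * (-1)
assert alice_continue < -1
```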

So what I would like to have is a subgame perfect strong ${\epsilon}$-equilibrium. I am not sure what the best way to define this concept is, but I am pretty sure the Solan-Vieille game doesn’t admit such a thing. In particular, I believe my beloved proposition has no good analogue for subgame perfect equilibrium.