Tag Archive

You are currently browsing the tag archive for the ‘zero-sum games’ tag.

A Lovely Proof (?) of Minimax

September 15, 2022 in linear programming, Operations research | Tags: Blackwell, minimax, zero-sum games | by rvohra | 4 comments

Peter Bickel taught me a nifty, little, literally, proof of the minimax theorem. Peter says it was inspired by his reading of David Blackwell‘s vector generalization of the same. Here goes.

Let $A$ be the $m \times n$ payoff matrix of zero-sum game. Here $a_{ij}$ is the payoff to the row player from playing $i$ when the column player plays $j$ . Let $\Delta^k$ be $k$-simplex.

For each $p \in \Delta^m$ , let $R(p) = \{ p^TAq: q \in \Delta^n \}.$ This is the set of payoffs that the row player can achieve when playing mixed strategy $p$ . By convexity, it must be an interval whose endpoints depend on $p$ . We will only be interested in the left most endpoint, call it $b(p)$ . Thus, we can assume that (strictly speaking should be subset) $R(p) = [b(p), + \infty).$ Similarly, for each $q \in \Delta^n$ , let $C(q) = \{ p^TAq: p \in \Delta^m \}$ . By the same observations we may suppose that $C(q)= (-\infty, d(q)].$

For any $p \in \Delta^m$ and $q \in \Delta^n$ we have $R(p) \cap C(q) \neq \emptyset.$ Now, choose $p^* = \arg \max_{p \in \Delta^m}b(p)$ which exists by continuity and compactness. Similarly, choose $q^* = \arg \min_{q \in \Delta^n}d(q).$ Then, $v = (p^{*})^{T}Aq^{*} \in R(p^{*}) \cap C(q^{*})$ is the value of the game.

No Farkas or separating hyperplane. Now, minimax is equivalent to duality. So, one might wonder whether such a proof could be used to obtain the duality theorem of linear programming. Assuming that the primal and the dual had optimal solutions, yes. However, the case where one is infeasible or unbounded would cause this kind of proof to break.

PS: Then, Olivier Gossner pointed out that perhaps because I wanted to believe so much that it was true, that I had overlooked something. See the comments. He was right. I tried to make the proof work but could not. A proof of minimax that avoids Farkas and/or the separating hyperlane is not outlandish. There is Guillermo Owen’s proof, via induction, for example. By the way this was not the first such elementary proof. There was a flurry elementary proofs of the minimax in the 40s and 50s in response to a challenge of von Neumann. Indeed, Peck (1958) writes in the introduction to his paper:

“There are so many proofs of this theorem in the literature, that an excuse is necessary before exhibiting another. Such may be found by examining the proof given below for the following: it uses no matrices, almost no topology and makes little use of the geometry of convex sets; it applies equally well to the case where only one of the pure strategy spaces is finite; also there is no assumption that the payoff function is bounded.”

That an induction proof should exist is unsurprising given David Gale’s inductive proof of the Farkas lemma. In fact, Gale’s proof can be seen as an instantiation of Fourier’s method for solving linear programs, again, no Farkas or separating hyperplane.

Lloyd Shapley

March 21, 2016 in Game Theory, Uncategorized | Tags: matching, open problems, stability, zero-sum games | by Eran | Comments closed

I am not the right person to write about Lloyd Shapley. I think I only saw him once, in the first stony brook conference I attended. He reminded me of Doc Brown from Back to The Future, but I am not really sure why. Here are links to posts in The Economist and NYT following his death.

Shapley got the Nobel in 2012 and according to Robert Aumann deserved to get it right with Nash. Shapley himself however was not completely on board: “I consider myself a mathematician and the award is for economics. I never, never in my life took a course in economics.” If you are wondering what he means by “a mathematician” read the following quote, from the last paragraph of his stable matching paper with David Gale

The argument is carried out not in mathematical symbols but in ordinary English; there are no obscure or technical terms. Knowledge of calculus is not presupposed. In fact, one hardly needs to know how to count. Yet any mathematician will immediately recognize the argument as mathematical…

What, then, to raise the old question once more, is mathematics? The answer, it appears, is that any argument which is carried out with sufficient precision is mathematical

In the paper Gale and Shapley considered a problem of matching (or assignment as they called it) of applicants to colleges, where each applicant has his own preference over colleges and each college has its preference over applicants. Moreover, each college has a quota. Here is the definition of stability, taken from the original paper

Definition: An assignment of applicants to colleges will be called unstable if there are two applicants ${\alpha}$ and ${\beta}$ who are assigned to colleges ${A}$ and ${B}$ , respectively, although ${\beta}$ prefers ${A}$ to ${B}$ and ${A}$ prefers ${\beta}$ to ${\alpha}$ .
According to the Gale-Shapley algorithm, applicants apply to colleges sequentially following their preferences. A college with quota ${q}$ maintains a `waiting list’ of size ${q}$ with the top ${q}$ applicants that has applied to it so far, and rejects all other applicants. When an applicant is rejected from a college he applies to his next favorite college. Gale and Shapley proved that the algorithm terminates with a stable assignment.

One reason that the paper was so successful is that the Gale Shapley method is actually used in practice. (A famous example is the national resident program that assigns budding physicians to hospitals). From theoretical perspective my favorite follow-up is a paper of Dubins and Freedman “Machiavelli and the Gale-Shapley Algorithm” (1981): Suppose that some applicant, Machiavelli, decides to `cheat’ and apply to colleges in different order than his true ranking. Can Machiavelli improves his position in the assignment produced by the algorithm ? Dubins and Freedman prove that the answer to this question is no.

Shapley’s contribution to game theory is too vast to mention in a single post. Since I mainly want to say something about his mathematics let me mention Shapley-Folkman-Starr Lemma, a kind of discrete analogue of Lyapunov’s theorem on the range of non-atomic vector measures, and KKMS Lemma which I still don’t understand its meaning but it has something to do with fixed points and Yaron and I have used it in our paper about rental harmony.

I am going to talk in more details about stochasic games, introduced by Shapley in 1953, since this area has been flourishing recently with some really big developments. A (two-player, zero-sum) stochastic game is given by a finite set ${Z}$ of states, finite set of actions ${A,B}$ for the players, a period payoff function ${r:Z\times A\times B\rightarrow [0,1]}$ , a distribution ${q(\cdot|z,a,b)}$ over ${Z}$ for every state ${z}$ and actions ${a,b}$ , and a discount factor ${0<\beta<1}$ . At every period the system is at some state ${z\in Z}$ , players choose actions ${a,b}$ simultaneously and independently. Then the column player pays ${r(z,a,b)}$ to the row player. The game then moves to a new state in the next period, randomized according to ${q(\cdot|z,a,b)}$ . Players evaluate their infinite stream of payoofs via the discount factor ${\beta}$ . The model is a generalization of the single player dynamic programming model which was studied by Blackwell and Bellman. Shapley proved that every zero-sum stochastic game admits a value, by imitating the familiar single player argument, which have been the joy and pride of macroeconomists ever since Lucas asset pricing model (think Bellman Equation and the contraction operators). Fink later proved using similar ideas that non-zero sum discounted stochastic games admit perfect markov equilibria.

A major question, following a similar question in the single player setup, is the limit behavior of the value and the optimal strategies when players become more patient (i.e., ${\beta}$ goes to ${1}$ ). Mertens and Neyman have proved that the limit exists, and moreover that for every ${\epsilon>0}$ there strategies which are ${\epsilon}$ -optimal for sufficiently large discount factor. Whether a similar result holds for Nash equilibrium in ${N}$ -player stochastic games is probably the most important open question in game theory. Another important question is whether the limit of the value exists for zero-sum games in which the state is not observed by both players. Bruno Zilloto has recently answered this question by providing a counter-example. I should probably warn that you need to know how to count and also some calculus to follow up this literature. Bruno Zilloto will give the Shapley Lecture in Games2016 in Maastricht. Congrats, Bruno ! and thanks to Shapley for leaving us with some much stuff to play with !

Who Was Colonel Blotto?

February 19, 2016 in Blotto, Uncategorized | Tags: Nash equilibrium, zero-sum games | by rvohra | Comments closed

Credit for the game that bears his name is due to to Borel. It appears in a 1921 paper in French. An English translation (by Leonard Savage) may be found in a 1953 Econometrica.

blottorace1

The first appearance in print of a version of the game with Colonel Blotto’s name attached is, I believe, in the The Weekend Puzzle Book by Caliban (June 1924). Caliban was the pen name of Hubert Phillips one time head of Economics at the University of Bristol and a puzzle contributor to The New Statesman.

Blotto itself is a slang word for inebriation. It does not, apparently, derive from the word `blot’, meaning to absorb liquid. One account credits a French manufacturer of delivery tricycles (Blotto Freres, see the picture) that were infamous for their instability. This inspired Laurel and Hardy to title one of their movies Blotto. In it they get blotto on cold tea, thinking it whiskey.

Over time, the Colonel has been promoted. In 2006 to General and to Field Marshall in 2011.

Expert testing part 3

March 29, 2011 in Uncategorized | Tags: axiom of choice, Blackwell, expert testing, infinite games, zero-sum games | by Eran | 1 comment

Department of self-promotion: sequential tests, Blackwell games and the axiom of determinacy.

Links to part 1 and part 2

Recap: At every day ${n}$ an outcome ${s_n\in \{0,1\}}$ is realized (` ${1}$ ‘ means a rainy day, ` ${0}$ ‘ a non-rainy day). An expert claims to know the distribution of the stochastic process that generates the sequence ${s_0,s_1,\dots}$ of outcomes. A sequential test is given by a Borel set ${R\subseteq \bigl([0,1]\times\{0,1\}\bigr)^{\mathbb N}}$ of infinite sequences of predictions and outcomes. An expert who delivers forecast ${f:\{0,1\}^{<{\mathbb N}}\rightarrow [0,1]}$ fails on a realization ${x=(s_0,s_1,\dots)\in X=\{0,1\}^{\mathbb N}}$ if ${\left(p_0,s_0,p_1,s_1,\dots\right)\in R}$ where ${p_k=f(s_0,\dots,s_{k-1})}$ is the prediction made by ${f}$ about ${s_k}$ given the previous outcomes ${s_0,\dots,s_{k-1}}$ .

Let me remind you what I mean by `a test that does not reject the truth with probability ${1-\epsilon}$ ‘. I am going to write an equivalent definition to the one I used up to now, this time using the language of stochastic processes.

Definition 1 The test does not reject the truth with probabality ${1-\epsilon}$ if for every ${\{0,1\}}$ -valued stochastic process ${S_0,S_1,\dots}$ one has

$\displaystyle \mathop{\mathbb P}\bigl(\left(P_0,S_0,P_1,S_1,\dots\right)\in R\bigr)<\epsilon\ \ \ \ \ (1)$

where

$\displaystyle P_n=\mathop{\mathbb P}\left(S_n=1\left|S_0,\dots,S_{n-1}\right.\right)\ \ \ \ \ (2)$

We are going to prove the following theorem:

Theorem 2 Consider a sequential test that does not reject the true expert with probability ${1-\epsilon}$ . Then for every ${\delta>0}$ there exists a ${\zeta\in\Delta({\mathcal F})}$ such that

$\displaystyle \zeta\bigl(\left\{\mu|f\text{ fails on }x\right\}\bigr)<\epsilon+\delta\ \ \ \ \ (3)$

for every ${x\in X}$ .

So a charlatan, who doesn’t know anything about the true distribution of the process, can randomize a forecast according to ${\zeta}$ and pass the test with high probability regardless of the actual distribution.

— Discussion —

Before we delve into the proof of the theorem, a couple of words about where we are. Recall that a forecast ${f:\{0,1\}^{<{\mathbb N}}\rightarrow [0,1]}$ specifies the Expert’s prediction about rain tomorrow after every possible history ${\sigma=(s_0,\dots,s_{n-1})}$ . We denote by ${{\mathcal F}}$ the set of all such forecasts. The most general tests are given by a function ${T:{\mathcal F}\rightarrow 2^X}$ , and specify for every such ${f}$ the set of realizations ${x\in X=\{0,1\}^{\mathbb N}}$ over which the forecast ${f}$ fails. Since ${X}$ is infinite we know that there exist tests that passes a true expert and are not manipulable by a strategic charlatan.

Sequential tests have the additional property that the test’s verdict depends only on predictions made by ${f}$ along the realized path: When deciding whether a forecast ${f}$ passes or fails when the realization is ${x=(s_0,s_1,\dots)}$ the test only considers the predictions ${p_k=f(s_0,\dots,s_{k-1})}$ made by ${f}$ along x. We also say that the test does not depend on counter-factual predictions, i.e. predictions about the probability of rainy day after histories that never happen. It seems that counter-factual predictions would be irrelevant to testing anyway, but, as the theorem shows, if the test does not use counter-factual prediction then it is manipulable.

One situation in which sequential tests are the only available tests is when, instead of providing his entire forecast ${f}$ before any outcome is realized, at every day ${n}$ the expert only provides his prediction ${p_n}$ about the outcome ${s_n}$ of day ${n}$ just before it is realized. At infinity, all the information available to the tester is the sequence ${(p_0,s_0,p_1,s_1,\dots)}$ of predictions and realized outcomes.

— Sketch of Proof —

We can transform the expert’s story to a two-player normal form zero-sum game as we did before: Nature chooses a realization ${x\in X}$ and Expert chooses a forecast ${f\in{\mathcal F}}$ . Then Expert pays Nature ${1}$ if ${f}$ fails on ${x}$ and ${0}$ otherwise. The fact that the test does not reject the true expert translates to the fact that the maximin of the game is small. If we knew that the minimax is also small then an optimal mixed strategy ${\zeta}$ for the Expert will satisfy (3). We only need to prove the existence of value, or as game theorists say, that the game is determined.

Unfortunately, this time we cannot use Fan’s Theorem since we made no topological assumption about the set ${R}$ , so there is no hope to get semi-continuity of the payoff function. Indeed, as we shall see in a moment, the Normal form representation misses an important part of the expert’s story. Instead of using a normal form game, we are going to write the game in extensive form. I will call this game ${\Gamma}$ .

The game is played in stages ${n=0,1,\dots}$ .
At stage ${n}$ Nature chooses an outcome ${s_n}$ and Expert chooses a prediction ${p_n\in [0,1]}$ simultaneously and independently.
Nature does not monitor past actions of Expert.
Expert monitors past actions ${s_0,\dots,s_{n-1}}$ of Nature.
At infinity, Expert pays Nature ${1}$ if ${\left(p_0,s_0,p_1,s_1,\dots\right)\in R}$ and ${0}$ otherwise.

Now I am going to assume that you are familiar with the concept of strategy in extensive form game, and are aware of Kuhn’s Theorem about the equivalence between behavioral strategies and mixtures of pure strategies (I will make implicit uses of both directions of Kuhn’s Theorem in what follows). We can then look at the normal form representation of this game, in which the players choose pure strategies. A moment’s thought will convince you that this is exactly the game from the previous paragraph: Nature’s set of pure strategies is ${X}$ , Expert’s set of pure strategies is ${{\mathcal F}}$ and the payoff for a strategy profile ${(x,f)}$ is ${\mathbf{1}_{T(f)}(x)}$ . So far no real gain. Extensive form games in which one of the players don’t monitor opponent’s actions need not be determined. In order to get a game with a value we are going to twist the game ${\Gamma}$ , and allow Nature to observe past actions of the Expert player. This makes life more difficult for the Expert. Up to a minor inaccuracy which I will comment about later, the resulting game is what’s called Blackwell game and it admits a value by (a seminal theorem of Donald Martin).

Here is the game after the twist. I call this game ${\Gamma^\ast}$ .

The game is played in stages ${n=0,1,\dots}$ .
At stage ${n}$ Nature chooses an outcome ${s_n}$ and Expert chooses a prediction ${p_n\in [0,1]}$ simultaneously and independently.
Each player monitors past actions of the opponent.
At infinity, Expert pays Nature ${1}$ if ${\left(p_0,s_0,p_1,s_1,\dots\right)\in R}$ and ${0}$ otherwise.

Now if you internalized the method of proving manipulability that I was advocating in the previous two episodes, you know what’s left to prove: that the maximin of ${\Gamma^\ast}$ is small, i.e. that, fore every strategy of Nature, Expert has a response that makes the payoff at most ${\epsilon}$ . We know this is true for the game ${\Gamma}$ but in ${\Gamma^\ast}$ Nature is more powerful.

Here is the most important insight of the proof: The fact that an expert who knows the distribution of the process can somehow pass the test ${T}$ implies that the maximin in ${\Gamma}$ is small, but this fact alone doesn’t say anything about the maximin of ${\Gamma^\ast}$ . To show that the maximin of ${\Gamma^\ast}$ is also small we will use the fact that the way such an expert passes the test ${T}$ is by providing the correct forecast. Until now the distinction was not really important to us. Now it comes into play.

Let ${g:\left([0,1]\times \{0,1\}\right)^{<{\mathbb N}}\rightarrow [0,1]}$ be a behavioral strategy of Nature in ${\Gamma^\ast}$ , i.e. a contingent plan that specifies the probability that Nature play ${1}$ after every. Let ${f:\{0,1\}^{<{\mathbb N}}\rightarrow [0,1]}$ be the pure strategy of Expert in ${\Gamma^\ast}$ that is given by

$\displaystyle \begin{array}{rcl} &&f(s_0,s_1,\dots,s_{n-1})=\\&&g\bigl(f(), s_0, f\left(s_0,s_1\right), s_1, \dots,f\left(s_0,s_1,\dots,s_{n-2}\right),s_{n-1}\bigr)\end{array}$

So the pure action taken by the Expert player at day ${n}$ is the mixed action that Nature is going to take at day ${n}$ according to her strategy ${g}$ . Now assume that Nature follows ${g}$ and Expert follows ${f}$ . Let ${P_n}$ and ${S_n}$ be the random variables representing the actions taken by Expert and Nature at day ${n}$ . Then the stochastic process ${P_0,S_0,P_1,\dots}$ satisfies (2). Therefore from (1) we get that the expected payoff when Nature plays ${g}$ and Expert plays ${f}$ is indeed smaller than ${\epsilon}$ .

Now for the minor inaccuracy that I mentioned: For Martin’s Theorem we need the set of actions at every stage to be finite. We can handle this obstacle by restricting Expert’s action at every stage to a grid and applying the coupling argument.

— Non-Borel Tests —

What about pathological sequential tests that are given by a non-Borel set ${R\subseteq \bigl([0,1]\times\{0,1\}\bigr)^{\mathbb N}}$ ? Well, if, like me, you find it preposterously impossible to choose one sock from each of infinitely many pairs of socks, then perhaps you live in the AD paradise. Here every set is measurable, Blackwell Games are determined even when the payoff function is not Borel, and Theorem 2 is true without the assumption that ${R}$ is Borel. See, the AD universe is a paradise for charlatans, since they can do as well as the true experts.

If, on the other hand, you subscribe to the axiom of choice, then you have a non-manipulable test:

Theorem 3 There exists a sequential test with a non-Borel set ${R}$ that does not reject the truth with probability ${1}$ and such that for every ${\zeta\in\Delta({\mathcal F})}$ there exists some realization ${x=(s_0,s_1,\dots)}$ such that

$\displaystyle \zeta\bigl(\left\{\mu|f\text{ fails on }x\right\}\bigr)=1.$

— Summary —

If you plan to remember one conclusion from my last three posts, I suggest you pick this: There exist non-manipulable tests, but they must rely on counter-factual predictions, or be extremely pathological.

Muchas gracias to everyone who read to this point. Did I mention that I have a paper about this stuff ?

Lecture notes on expert testing

March 14, 2011 in Uncategorized | Tags: expert testing, zero-sum games | by Eran | 4 comments

I am visiting the rationality center in the Hebrew University, and I am presenting some papers from the expert testing literature. Here are the lecture notes for the first talk. If you read this and find typos please let me know. The next paragraph contains the background story, and can be safely skipped.

A self-proclaimed expert opens a shop with a sign at the door that says `Here you can buy probabilities’. So the expert is a kind of a fortune-teller, he provides a service, or a product, and the product that the expert provides is a real number: the probability of some event or more generally the distribution of some random variable. You can ask for the probability of rain tomorrow, give the expert some green papers with a picture of George Washington and receive in return a paper with a real number between 0 and 1. The testing literature asks whether you can, after the fact, check the quality of the product you got from the expert, i.e. whether the expert gave you the correct probability or whether he just emptied your pocket for a worthless number.

So, let ${X}$ be a set of realizations. Nature randomizes an element from ${X}$ according to some distribution and an expert claims to know Nature’s distribution. A test is given by a function ${T:\Delta(X)\rightarrow 2^X}$ : the expert delivers a forecast ${\mu\in\Delta(X)}$ and fails if the realization ${x}$ turned out to be in ${T(\mu)}$ . A good test will be such that only `true’ experts, i.e. those who deliver the correct ${\mu}$ , will not fail.

— Manipulability —

I start with Sandroni’s paper (pdf). The following definition formalises the idea that the true expert is unlikely to fail the test

Definition 1 The test ${T}$ does not reject the truth with probability ${1-\epsilon}$ if ${\mu(T(\mu))<\epsilon}$ for every ${\mu\in \Delta(X)}$ .

If a test does not reject the truth with probability ${1-\epsilon}$ then a true expert, who knows how nature randomizes the realization, is unlikely to fail the test. Alas, a charlatan who acts strategically is also unlikely to fail

Theorem 2 (Sandroni (2003)) Let ${X}$ be finite. If ${T:\Delta(X)\rightarrow 2^X}$ is a test that does not reject the true expert with probability ${1-\epsilon}$ then there exists ${\zeta\in\Delta\bigl(\Delta(X)\bigr)}$ such that ${\zeta\bigl(\{\mu|x\in T(\mu)\}\bigr)<\epsilon}$ for every ${x\in X}$ .

So, a charlatan who knows nothing about how Nature chooses ${x}$ can randomize a forecast ${\mu}$ according to ${\zeta}$ , and is unlikely to fail, regardless of the realization. We say that the test is manipulable.

For Sandroni’s Theorem we do not need to assume any structure on ${X}$ . However the situation we have in mind is that ${X=S^n}$ for some finite set ${S}$ of outcome. So at every day ${0\leq k<n}$ Nature randomizes an outcome of ${s_k\in S}$ and the Expert claims to know the stochastic process that governs Nature’s choices.

Proof: Let ${Y\subseteq \Delta(X)}$ be a finite set such that ${T(Y)=T(\Delta(X))}$ .

Consider the following two-player zero-sum game with the players called Nature and Expert. Nature is the maximizer with pure strategies set ${X}$ , and Expert is the minimizer, with pure strategies set ${Y}$ . If Nature plays ${x\in X}$ and Expert plays ${y\in Y}$ then Expert pays Nature ${1}$ if ${x\in T(y)}$ and ${0}$ otherwise.

By von-Neumann’s Minimax Theorem the game admits a value ${v}$ in mixed strategies, so that the maximin equals the minimax. We claim that ${v<\epsilon}$ . Indeed, let ${\mu\in\Delta(X)}$ be a mixed strategy of Nature and let ${y\in Y}$ be such that ${T(y)=T(\mu)}$ . Then the expected payoff that Expert pays Nature if Nature plays ${\mu}$ and Expert plays ${y}$ is ${\mu(T(y))=\mu(T(\mu)<\epsilon}$ since the game does not reject the true expert with probability ${1-\epsilon}$ .

Now let ${\zeta\in\Delta(Y)}$ be a mixed optimal strategy of the Expert in the game. Then

$\displaystyle \zeta\bigl(\{\mu\in Y|x\in T(\mu)\}\bigr)\leq v<\epsilon$

for every pure strategy ${x}$ of Nature, since the left hand side is the expected payoff that the Expert pays Nature if he plays ${\zeta}$ and Nature plays ${x}$ . $\Box$

The argument in the proof of Sandroni’s Theorem captures the essence of all the manipulability theorems we will see. We define a zero-sum game between two players, Nature and Expert. The minimax theorem is the core of the proof: The fact that the maximin is smaller than ${\epsilon}$ follows from the assumption that the test is unlikely to reject the true expert. The fact that minimax is smaller than ${\epsilon}$ implies that the test is manipulable. In the middle, we use the most wonderful miracle that the minimax and maximin are equal.

— Fan’s Theorem —

Here is the more general minimax theorem which we will use later: In a zero-sum game, if one of the players have a compact set of pure strategies, and the payoff to that player is u.s.c. in his own strategy then the game admits a value in mixed strategies.

Proposition 3 (Ky Fan)Consider a two-player zero-sum game in normal form with pure strategy sets ${X,Y}$ and payoff function ${h:X\times Y\rightarrow [0,1]}$ . If ${X}$ is a compact metrizable space and ${h(\cdot,y)}$ is upper-semi continuous for every ${y\in Y}$ then the game has a value in mixed strategies, i.e.

$\displaystyle \sup_{\mu\in \Delta(X)}\inf_{y\in Y}\int h(x,y) \mu(\text{d} x)=\inf_{\zeta\in\Delta_f(Y)}\sup_{x\in X}\int h(x,y)\zeta(\text{d} y).$

and all suprema are attained.

The set ${Y}$ is not topologized and ${\Delta_f(Y)}$ is the set of distributions over ${Y}$ with finite support, so the integral on the right side is just a summation. The challenge in using Fan’s Theorem is to find a topology over ${X}$ which is sufficiently weak to make ${X}$ compact and sufficiently strong to make the functions ${h(\cdot,y)}$ upper semi-continuous.

— Non-manipulability —

Sandroni’s Theorem does not hold when ${X}$ is infinite, precisely because the game we defined in the proof will have no value. We now take a game without a value and translate it to a non-manipulable test for an infinite set ${X}$ of realizations. I only know about one game without a value (`I know a larger such game’ quipped Sergiu when I said it in the talk): Two players pick simultaneously and independently natural numbers and the player that chose the largest number wins. Here is the translation of the game to a test:

Example 1 Let ${X=\mathbb{N}}$ . Let ${T:\Delta(X)\rightarrow 2^X}$ be the test that is given by

$\displaystyle T(\mu)=\{x\in{\mathbb N} | \mu([x,\infty))<\epsilon\}.$

Then ${T}$ does not reject the true expert with probability ${1-\epsilon}$ , and for every ${\zeta\in\Delta(\Delta(X))}$ there exists a realization ${x\in X}$ such that

$\displaystyle \zeta\bigl(\{\mu\in\Delta(X)|x\in T(\mu)\}\bigr)>1-\epsilon.$

Not only the test is not manipulable, also for every ${\zeta\in \Delta(\Delta(X))}$ that a charlatan might employs to randomize his worthless forecast ${\mu}$ , there is some realization ${x}$ under which he fails with high probability.

Proof: For every ${\mu\in\Delta(X)}$ let ${t(\mu)}$ ( ${t}$ for tail) be given by ${t(\mu)=\min\{t\in {\mathbb N}|\mu\left([t,\infty)\right)<\epsilon\}}$ . Then ${T}$ can be equivalently written as ${T(\mu)=[t(\mu),\infty)}$ .

We first show that the test has small probability to reject a true expert. Indeed,

$\displaystyle \mu\left(T(\mu)\right)=\mu\left([t(\mu),\infty)\right)<\epsilon$

for every ${\mu\in\Delta(X)}$ , where the first equality follows from the definition of ${T}$ and the second from the definition of ${t}$ .

Now let ${\zeta\in\Delta(\Delta(X))}$ . Let ${\bar\zeta\in\Delta(\mathbb{N})}$ be the push-forward of ${\zeta}$ under ${t}$ and let ${x\geq t(\bar\zeta)}$ . Then

$\displaystyle \begin{array}{rcl} \lefteqn{\zeta\bigl(\{\mu\in\Delta(X)|x\in T(\mu)\}\bigr)=}\\&&\zeta\bigl(\{\mu\in\Delta(X)|x\geq t(\mu)\}\bigr)=\bar\zeta\left([0,x]\right)\geq 1-\epsilon.\end{array}$

The first equality follows from the definition of ${T}$ , the second from the definition of ${\bar\zeta}$ and the inequality from the choice of ${x}$ . $\Box$

— Sequential Tests —

So, Sandroni’s Theorem only holds for finite sets ${X}$ . We are now aiming at proving a manipulability theorem for infinite set ${X}$ by adding some structure on the set ${X}$ and some restrictions on the test. From now on we assume that ${X=\{0,1\}^{\mathbb N}}$ . The set ${S=\{0,1\}}$ is the set of outcomes. Everything works for arbitrary finite set of outcomes but I stick with two outcomes for notational convenience. Note that ${X}$ a compact metrizable space in the product topology.

There is an almost equivalent representation of and element ${\mu\in\Delta(X)}$ as the conditional probability that ${s_n=1}$ given ${s_0,\dots,s_{n-1}}$ for a ${\mu}$ -randomly chosen ${x=(s_0,s_1,\dots)}$ . Let ${\mathcal{F}}$ be the set of functions ${f:\{0,1\}^{<{\mathbb N}}\rightarrow [0,1]}$ , then this gives rise to natural map ${f\in\mathcal{F}\mapsto\mu_f\Delta(X)}$ , which is surjective and continuous when the domain is equipped with the product topology and the range with the weak ${^\ast}$ topology. It will be useful to identify ${\mathcal{F}}$ with ${\Delta(X)}$ , so I will think of a test as a function ${T:\mathcal{F}\rightarrow 2^X}$ .

For every ${f\in\mathcal{F}}$ and every realization ${x=(s_0,s_1,\dots)}$ , the sequence of forecasts of ${f}$ along ${x}$ is given by ${(p_0,p_1,\dots)}$ where ${p_k=f(s_0,\dots,s_{k-1})}$ .

Definition 4 A test ${T:\mathcal{F}\rightarrow 2^X}$ is sequential if ${x\in T(f_1)}$ implies ${x\in T(f_2)}$ for every ${f_1,f_2\in\mathcal{F}}$ and every realization ${x}$ such that the forecasts made by ${f_1}$ and ${f_2}$ along ${x}$ are the same.

Equivalently, a sequential test is given by a subset ${R\subseteq ([0,1]\times \{0,1\})^{\mathbb N}}$ ( ${R}$ for rejection): ${x\in T(f)}$ iff ${(p_0,s_0,p_1,s_1,\dots)\in R}$ for every realization ${x=(s_0,s_1,\dots)}$ and every forecast ${f\in\mathcal{F}}$ , where ${(p_0,p_1,\dots)}$ is the sequence of predictions of ${f}$ along ${x}$ .

— Next Episode (Thursday 12:00) —

I will talk about Olszewski and Sandroni’s paper `The manipulability of future independent tests’ and my paper `Many inspections are manipulable’ . Main goal is to prove that sequential tests are always manipulable.

A post on Rabin

April 19, 2010 in Game Theory | Tags: computability, michael rabin, zero-sum games | by Eran | 4 comments

I wrote some time ago about Michael Rabin’s example of how a backward induction argument is killed when you require players’ plans to be computable. Since Rabin’s paper is not available online, I intended to post the proof at some point, but never actually got to do it. Richard Lipton’s post about Emil Post is a good inspiration to pay that debt since the concept of simple set, which was introduced by Post, has a role in Rabin’s example.

A set ${S\subseteq\mathbb{N}}$ is simple if ${S}$ is recursively enumerable and its complement ${S^c}$ is infinite and contains no infinite recursively enumerable subset.

For Rabin’s purpose, what matters is that Post proved the existence of such a set. So Post provides us with a code of a computer program that enumerates over all the elements of ${S}$ and such that no computer program can produce infinitely many elements that are not in ${S}$ . By the way, Post introduces the concept of simple set while working on his eponymous problem which Richard mentions. In fact, if ${S}$ is simple set then ${S}$ is not many-one equivalent to a universal recursively enumerable set.

Now let ${S}$ be a simple set and consider the following three-round game with perfect information. At each round a player announces a natural number (an action): First Alice announces ${n}$ . Then Bob announces ${m}$ and then Alice announces ${t}$ . After the game is played, a winner is determined as follows: Bob wins if ${m>n}$ and ${m}$ is not the ${t}$ -th element in the enumeration of ${S}$ . Otherwise, Alice wins. Therefore, determining the winner is a task that can be effectively done using the code provided by Post.

Does any of the players have a strategy that guarantee winning regardless of how the opponent play ? The answer to this question depends on which mathematical creature you use to model the intuitive notion of `strategy’.

If by a strategy you mean just a function from histories to current action then Bob has a winning strategy: for every ${n}$ that Alice might chose in the first round there exists some ${m>n}$ such that ${m\notin S}$ , since the complement of ${S}$ is infinite.

But if you think of a strategy as a complete set of instructions, that is a plan or an algorithm that will tell you how to produce your next action given the past, then you want to model a strategy as a computable function. Now, for every computable function ${f:\mathbb{N}\rightarrow\mathbb{N}}$ that Bob can chose, there exists some ${m}$ such that either ${f(m)\leq m}$ or ${f(m)\in S}$ since otherwise ${f(0),f(f(0)),f(f(f(0))),\dots}$ would have been a recursive enumeration of infinitely many elements of ${S^c}$ . In particular, there exists some ${t}$ such that if Alice plays ${m}$ in the first stage and ${t}$ in the third stage she wins against ${f}$ . So, Bob has no winning strategy.

Computability in game theory

February 15, 2010 in Game Theory | Tags: computability, michael rabin, zero-sum games | by Eran | 15 comments

Since in the near future I will have to start advocating a computability argument, and since this position is somewhat in contrast to what was my view a couple of years ago when I first came across algorithmic game theory, I thought I should try to clarify my thoughts about the issue in a couple of posts.

The most beautiful example I know for the surprising outcomes we get when taking computability into account in game theory is Michael Rabin’s paper from 1957. Rabin considers a finite win-loss game with perfect information between two players, Alice and Bob. The game is played in three rounds, at each round a player announces a natural number (an action): First Alice announces ${n}$ . Then Bob announces ${m}$ and then Alice announces ${t}$ . After these declaration are made, the game coordinator determines the winner: If ${(n,m,t)\in W}$ Alice wins. If ${(n,m,t)\notin W}$ Bob wins. The set $W$ , Alice’s winning set, is part of the description of the games. This is a finite game, which can be solved by a background induction. By the so called Zermelo’s Theorem, the game is determined: One of the player has a winning strategy. The identity of the winner depends of course on ${W}$ .

Rabin gives an example for a game with the following propertie:

${W}$ is a decidable set: this means that there is an effective way to decide, for every triple ${(n,m,t)}$ whether or not ${(n,m,t)\in W}$ . So the job of the game coordinator in this game can be performed by a computer program. Games with effectively computable payoff are called actual games: These are the games that can actually be played.
Bob has a winning strategy ${f:\mathbb{N}\rightarrow\mathbb{N}}$ : whatever actions ${n}$ and ${t}$ Alice chooses in the first and third stages, Bob will win the game if in the second stage he chooses ${f(n)}$ : so that $(n,f(n),t)\notin W$ for any $n,t\in \mathbb{N}$
Bob has no computable winning strategy: For every function ${f:\mathbb{N}\rightarrow\mathbb{N}}$ that can be computed by a computer program there exist some actions ${n,t}$ for Alice such that ${(n,f(n),t)\in W}$ .

Feel the blow ? Bob finds himself in a strange predicament. On the one hand, after observing Alice’s action ${n}$ from the first round, Bob knows that there is some action ${m}$ he can choose that beats any possible response of Alice. However, if Bob tries to `plan ahead’ which action to take after every possible choice of ${n}$ , he cannot produce such a plan. Bob can beat all Alice’s strategies and Alice can beat all Bob’s (computable) strategies. von-Neumann and Morgenstern’s least controversial idea, solving a zero-sum game with perfect information by backward induction, trembles. The argument that in finite zero-sum games with perfect information a player doesn’t care if the opponent finds out his strategy falls apart.

(To be continued…)

Tag Archive

A Lovely Proof (?) of Minimax

Lloyd Shapley

Who Was Colonel Blotto?

Expert testing part 3

Lecture notes on expert testing

A post on Rabin

Computability in game theory

Email Subscription

Entries by:

Recent Posts

Recent Comments

Blogroll

Archives

Meta