Next: Bibliography Up: Models, Algorithms and Economics Previous: Game Theory

# Nash Equilibrium

Here is a pointer to a short biography of John F. Nash, Jr. from the Nobel-e-Museum.

First consider only pure strategies (no randomization or mixed strategies). A strategy profile constitutes a Nash Equilibrium if for every for all . Which strategy profile constitutes a Nash Equilibrium in the following game?

 y1 y2 y3 x1 1,4 2,2 2,3 x2 3,1 1,5 4,1 x3 2,0 3,4 1,2

If Player 2 plays y2, then the best response of player 1 is x3. If player 1 plays x3, then the best response of player 2 is y2. This implies that strategy profile (x3,y2) is a Nash Equilibrium. What if player 2 plays y1, then player 1's best response is x2, but for x2, player 2's best response is y2. Similarly if player 2 plays y3, then player 1's best response is x2, but for x2, player 2's best response is y2. Therefore is the only Nash Equilibrium strategy profile.

Consider the game of Matching Pennies.

 Heads Tails Heads -1, 1 1, -1 Tails 1, -1 -1, 1

Observe that there is no pure strategy equilibrium in the Matching Pennies.

When does (pure strategy) Nash Equilibrium exists?
(We are following the treatment in the book [#!OR94!#].)

First we need a few definitions and Kakutani's fixed point theorem. For any player i, define the best response correspondence such that for each , the set

Observe that the strategy profile is Nash Equilibrium if and only if each . Define the correspondence by

where .

A correspondence , where , the set is closed, has a closed graph if for any two converging sequences, and , where and for all k, then . The correspondence , where B is closed, is upper hemicontinuous if it has a closed graph and the images of the compact sets are bounded. A function defined on a convex set is quasi-concave if its upper contour set , for some value t is convex.

Here is the fixed point theorem due to Kakutani, and fir Illustration in one dimension see Figure 3.

Theorem 3 (Kakutani's fixed point theorem)   Let be a nonempty, convex, compact set and be a upper hemicontinuous correspondence from A to itself, such that is nonempty and convex for each . Then f has a fixed point, i.e. for some .

Lemma 2   If the sets are nonempty, Si is compact and convex, and the utility function is continuous, and quasi-concave in each of the variables si, then the function bi is nonempty, convex-valued, and upper hemicontinuous.

Proof: By definition the function bi(s-i) maximizes the continuous function ui(.,s-i) over a compact set Si. Therefore the set . The set bi is convex, since the set of maximizers of a quasi-concave function on a convex set are convex. For upper hemicontinuity, we need to show that for any sequence such that , . For all n, for all . By continuity of ui, . This implies that .

Theorem 4   A game has a Nash Equilibrium if for all , the sets Si are nonempty, convex and compact subset of Euclidean space, and the utility function is continuous and quasi-concave in each si.

Proof: Recall the definition of the correspondence given by where . From Lemma 2 we know that bi()'s are nonempty, convex-valued and upper hemicontinuous. Now all the conditions for Kakutani's fixed point theorem are satisfied by the function b. Therefore there exists a fixed point for this correspondence, say , such that . Clearly s is a Nash Equilibrium since each .

Of course this theorem applies when the sets Si are convex, but in our examples like Matching Pennies etc., the sets Si are finite! OOPS! Before we do the proofs for this scenario - lets introduce mixed strategy Nash Equilibrium.

A mixed strategy profile constitutes a Nash Equilibrium if for every , , for all . For example, consider the Matching Pennies game again. This game has a mixed strategy equilibrium if each player chooses Heads or Tails with equal probability. Although this game didn't have a pure strategy Nash Equilibrium, but has a mixed strategy Nash Equilibrium! Here is the celebrated Nash Theorem!

Theorem 5 (Nash Equilibrium Theorem)   Every finite strategy game has a mixed strategy Nash Equilibrium.

Proof: Let player i, has M pure strategies in the set Si, i.e., |Si|=M. Recall the simplex , representing the probability vectors

Clearly this simplex is nonempty, compact and convex. Notice that the expected utility is linear in the probabilities, implying that the utility function under mixed strategy is continuous as well as quasiconcave. Now we have all the properties required for Kakutani's theorem.

In the following lemma we state an important property of mixed strategy Nash Equilibrium. It means that each player, given the distribution of strategies played by all his opponents, is indifferent among all pure strategies that (s)he plays. This lemma can be used to find the mixed strategy equilibrium.

Lemma 3   Strategy profile is an equilibrium if and only if for every player every pure strategy in the support of (i.e. having non-zero probability of being selected) is a best response to .

Proof: Necessary: Suppose that there is a strategy in support of , but is not a best response to . Then by the linearity of the utility function ui, player i can increase h(is/er) utility by transferring the probability from si to a strategy, say s'i that is a best response (i.e. player i would have played s'i in place of si and increased h(is/er) utility).
Sufficiency: Suppose that is not a Nash Equilibrium. Then there is a player who has a strategy with . This implies that there is a pure strategy such that . But by assumption for all which are played with positive probabilities.

What are Nash Equilibriums for the following game?

 Col-1 Col-2 Row-1 2,2 3,4 Row-2 1,3 5,1

Suppose Player 1 selects Row-1 and Row-2 with probability 1-p and p respectively, i.e. . Then the utility of Player 2, under strategy Col-1 and Col-2 should be the same, i.e.,

Similarly, if Player 2 selects Col-1 and Col-2 with probability , then

Therefore Nash Equilibrium is .

Here is a graphical way of computing Nash Equilibrium of games. In general let the game be given as follows:

 1-q q 1-p a1,b1 a2,b2 p a3,b3 a4,b4

Expected utility of the row player (Player 1) will be

u1= (1-p)(1-q)a1+ (1-p)qa2+ p(1-q)a3+ pqa4, or

u1= (a1-a2-a3+a4)pq+ (a3-a1)p+ (a2-a1)q + a1.

Similarly, expected utility of the column player, Player 2, will be

u2= (1-p)(1-q)b1+ (1-p)qb2+ p(1-q)b3+ pqb4, or

u2= (b1-b2-b3+b4)pq+ (b3-b1)p+ (b2-b1)q + b1.

First consider Player 1 (the row player). We can rewrite the utility function u1 as a function of p, representing the equation of a line , as follows.

u1=[(a1-a2-a3+a4)q+ (a3-a1)]p + [(a2-a1)q+a1].

The aim of the Player 1 is to maximize u1. In our example

u1=[3q-1]p+[q+2].

For , p=0. For q=1/3, . For , p=1.

Similarly aim of the Player 2 is to maximize u2. In our example

u2=[(b1-b2-b3+b4)p+ (b2-b1)]q + [(b3-b1)p+b1],

or

u2=[-4p+2]q + [p+2].

For , q=1. For p=1/2, . For , q=0.

The plots are shown in Figure 4, and the intersection of the two graphs, where p=1/2 and q=1/3 is the Nash Equilibrium in this case. These graphs can be viewed as no regrets graphs'' of the two players. Given the strategy of a player, the best strategy of the other player is the corresponding point on the graph. For example, if Player 1 chooses p=.2, then the best strategy for player 2 is to choose q=1 (or Player 2 has no regrets in choosing q=1).

Next: Bibliography Up: Models, Algorithms and Economics Previous: Game Theory
Anil Maheshwari
2003-03-05