What is Game Theory?�
Game Theory Definition�
Theory of rational behavior for interactive decision problems. In a game, several agents strive to maximize their (expected) utility index by choosing particular courses of action, and each agent's final utility payoffs depend on the profile of courses of action chosen by all agents. The interactive situation, specified by the set of participants, the possible courses of action of each agent, and the set of all possible utility payoffs, is called a game; the agents 'playing' a game are called the players.
Definitions�
Definition: Zero-Sum Game – A game in which the payoffs for the players always adds up to zero is called a zero-sum game.
Definition: Maximin strategy – If we determine the least possible payoff for each strategy, and choose the strategy for which this minimum payoff is largest, we have the maximin strategy.
A Further Definition�
Definition: Constant-sum and nonconstant-sum game – If the payoffs to all players add up to the same constant, regardless which strategies they choose, then we have a constant-sum game. The constant may be zero or any other number, so zero-sum games are a class of constant-sum games. If the payoff does not add up to a constant, but varies depending on which strategies are chosen, then we have a non-constant sum game.
Game theory: assumptions�
Game theory: assumptions (Cont)�
(4) Each decision maker has perfect knowledge of the game and of his opposition.
(5) All decision makers are rational; that is, each player, given two alternatives, will select the one that yields him the greater payoff.
��Two-Person Zero-Sum and Constant-Sum Games �
Cont..
Cont...
Zero Sum Games�
Consider:
Player 1 with three strategies S1, S2, and S3 and Player 2 with four strategies OP1, OP2, OP3, and OP4.
Zero Sum Games (Cont)�
Cont...
�Two-person zero-sum game with a saddle point�
Player #2
Player#1
Payoff Matrix
to Player#1
Row Domination:
(2)>(1) and (3)>(1), so eliminate Row (1).
Column Domination: (A)>(C) and (B)>(C) so eliminate column A and B.
| A | B | C |
1 | 6 | 5 | -4 |
2 | 9 | 7 | -2 |
3 | 9 | 8 | -3 |
Reduced Matrix
| C |
2 | -2 |
3 | -3 |
Saddle point
| A | B | C |
1 | 6 | 5 | -4 |
2 | 9 | 7 | -2 |
3 | 9 | 8 | -3 |
The value shown is the smallest in its row - to player #2’s advantage - and the largest in its column - to player #1’s advantage.
Two-Person Non constant-Sum Games�
The Prisoner’s Dilemma�
The prisoner’s dilemma is a universal concept. Theorists now realize that prisoner’s dilemmas occur in biology, psychology, sociology, economics, and law. The prisoner’s dilemma is apt to turn up anywhere a conflict of interests exists -- and the conflict need not be among sentient beings. Study of the prisoner’s dilemma has great power for explaining why animal and human societies are organized as they are. It is one of the great ideas of the twentieth century, simple enough for anyone to grasp and of fundamental importance (...). The prisoner’s dilemma has become one of the premier philosophical and scientific issues of our time. It is tied to our very survival (W. Poundstone,1992, p. 9).
Prisoner’s Dilemma�
-What would you do if you were Prisoner A?�
| Prisoner B refuses deal | B turns state’s evidence |
Prisoner A refuses deal | 1 year; 1 year | 3 year; 0 year |
A turns state’s evidence | 0 year; 3 year | 2 year; 2 year |
In such games, one player’s gain is not equal to the other’s loss. Example: Prisoner’s dilemma�
e.g
�General questions in two-person games�
Linear Programming and Zero-Sum Games
The value of the game and the optimal strategies for the row and column players reward matrix may be found by solving the row player’s LP and the column player’s LP, respectively.
�Linear Programming and Zero-Sum Games (Cont)�
The dual of the row (column) player’s LP is the column (row) player’s LP. The optimal objective function value for either the row or the column player’s LP is the value of the game to the row player. If the row player departs from her/his optimal strategy, she/he may receive an expected reward that is less than the value of the game. If the column player departs from her/his optimal strategy, she/he may incur an expected loss that exceeds the value of the game. Complementary slackness may be used to simultaneously solve the row and the column player’s LP’s.
Minimax Rule�
Minimax procedure�