Perfect Bayesian equilibrium
From Wikipedia, the free encyclopedia
| Perfect Bayesian Equilibrium | |
|---|---|
| Solution concept in game theory | |
| Relationship | |
| Subset of | Bayesian Nash equilibrium |
| Significance | |
| Proposed by | Cho and Kreps[citation needed] |
| Used for | Dynamic Bayesian games |
| Example | signaling game |
In game theory, a Perfect Bayesian Equilibrium (PBE) is a solution with Bayesian probability to a turn-based game with incomplete information. More specifically, it is an equilibrium concept that uses Bayesian updating to describe player behavior in dynamic games with incomplete information. Perfect Bayesian equilibria are used to solve the outcome of games where players take turns but are unsure of the "type" of their opponent, which occurs when players don't know their opponent's preference between individual moves. A classic example of a dynamic game with types is a war game where the player is unsure whether their opponent is a risk-taking "hawk" type or a pacifistic "dove" type. Perfect Bayesian Equilibria are a refinement of Bayesian Nash equilibrium (BNE), which is a solution concept with Bayesian probability for non-turn-based games.
Any perfect Bayesian equilibrium has two components -- strategies and beliefs:
- The strategy of a player in a given information set specifies his choice of action in that information set, which may depend on the history (on actions taken previously in the game). This is similar to a sequential game.
- The belief of a player in a given information set determines what node in that information set he believes the game has reached. The belief may be a probability distribution over the nodes in the information set, and is typically a probability distribution over the possible types of the other players. Formally, a belief system is an assignment of probabilities to every node in the game such that the sum of probabilities in any information set is 1.
The strategies and beliefs also must satisfy the following conditions:
- Sequential rationality: each strategy should be optimal in expectation, given the beliefs.
- Consistency: each belief should be updated according to the equilibrium strategies, the observed actions, and Bayes' rule on every path reached in equilibrium with positive probability. On paths of zero probability, known as off-equilibrium paths, the beliefs must be specified but can be arbitrary.
A perfect Bayesian equilibrium is always a Nash equilibrium.
Gift game 1
Consider the following game:
- The sender has two possible types: either a "friend" (with probability ) or an "enemy" (with probability ). Each type has two strategies: either give a gift, or not give.
- The receiver has only one type, and two strategies: either accept the gift, or reject it.
- The sender's utility is 1 if his gift is accepted, -1 if his gift is rejected, and 0 if he does not give any gift.
- The receiver's utility depends on who gives the gift:
- If the sender is a friend, then the receiver's utility is 1 (if he accepts) or 0 (if he rejects).
- If the sender is an enemy, then the receiver's utility is -1 (if he accepts) or 0 (if he rejects).
For any value of Equilibrium 1 exists, a pooling equilibrium in which both types of sender choose the same action:
- Equilibrium 1. Sender: Not give, whether they are the friend type or the enemy type. Receiver: Do not accept, with the beliefs that Prob(Friend|Not Give) = p and Prob(Friend|Give) = x, choosing a value
The sender prefers the payoff of 0 from not giving to the payoff of -1 from sending and not being accepted. Thus, Give has zero probability in equilibrium and Bayes's Rule does not restrict the belief Prob(Friend|Give) at all. That belief must be pessimistic enough that the receiver prefers the payoff of 0 from rejecting a gift to the expected payoff of from accepting, so the requirement that the receiver's strategy maximize his expected payoff given his beliefs necessitates that Prob(Friend|Give) On the other hand, Prob(Friend|Not give) = p is required by Bayes's Rule, since both types take that action and it is uninformative about the sender's type.
If , a second pooling equilibrium exists as well as Equilibrium 1, based on different beliefs:
- Equilibrium 2. Sender: Give, whether they are the friend type or the enemy type. Receiver: Accept, with the beliefs that Prob(Friend|Give) = p and Prob(Friend|Not give) = x, choosing any value for
The sender prefers the payoff of 1 from giving to the payoff of 0 from not giving, expecting that his gift will be accepted. In equilibrium, Bayes's Rule requires the receiver to have the belief Prob(Friend|Give) = p, since both types take that action and it is uninformative about the sender's type in this equilibrium. The out-of-equilibrium belief does not matter, since the sender would not want to deviate to Not give no matter what response the receiver would have.
Equilibrium 1 is perverse if The game could have so the sender is very likely a friend, but the receiver still would refuse any gift because he thinks enemies are much more likely than friends to give gifts. This shows how pessimistic beliefs can result in an equilibrium bad for both players, one that is not Pareto efficient. These beliefs seem unrealistic, though, and game theorists are often willing to reject some perfect Bayesian equilibria as implausible.
Equilibria 1 and 2 are the only equilibria that might exist, but we can also check for the two potential separating equilibria, in which the two types of sender choose different actions, and see why they do not exist as perfect Bayesian equilibria:
- Suppose the sender's strategy is: Give if a friend, Do not give if an enemy. The receiver's beliefs are updated accordingly: if he receives a gift, he believes the sender is a friend; otherwise, he believes the sender is an enemy. Thus, the receiver will respond with Accept. If the receiver chooses Accept, though, the enemy sender will deviate to Give, to increase his payoff from 0 to 1, so this cannot be an equilibrium.
- Suppose the sender's strategy is: Do not give if a friend, Give if an enemy. The receiver's beliefs are updated accordingly: if he receives a gift, he believes the sender is an enemy; otherwise, he believes the sender is a friend. The receiver's best-response strategy is Reject. If the receiver chooses Reject, though, the enemy sender will deviate to Do not give, to increase his payoff from -1 to 0, so this cannot be an equilibrium.
We conclude that in this game, there is no separating equilibrium.
Gift game 2
In the following example,[1] the set of PBEs is strictly smaller than the set of SPEs and BNEs. It is a variant of the above gift-game, with the following change to the receiver's utility:
- If the sender is a friend, then the receiver's utility is 1 (if they accept) or 0 (if they reject).
- If the sender is an enemy, then the receiver's utility is 0 (if they accept) or -1 (if they reject).
Note that in this variant, accepting is a weakly dominant strategy for the receiver.
Similarly to example 1, there is no separating equilibrium. Let's look at the following potential pooling equilibria:
- The sender's strategy is: always give. The receiver's beliefs are not updated: they still believe in the a-priori probability, that the sender is a friend with probability and an enemy with probability . Their payoff from accepting is always higher than from rejecting, so they accept (regardless of the value of ). This is a PBE - it is a best-response for both sender and receiver.
- The sender's strategy is: never give. Suppose the receiver's beliefs when receiving a gift is that the sender is a friend with probability , where is any number in . Regardless of , the receiver's optimal strategy is: accept. This is NOT a PBE, since the sender can improve their payoff from 0 to 1 by giving a gift.
- The sender's strategy is: never give, and the receiver's strategy is: reject. This is NOT a PBE, since for any belief of the receiver, rejecting is not a best-response.
Note that option 3 is a Nash equilibrium. If we ignore beliefs, then rejecting can be considered a best-response for the receiver, since it does not affect their payoff (since there is no gift anyway). Moreover, option 3 is even a SPE, since the only subgame here is the entire game. Such implausible equilibria might arise also in games with complete information, but they may be eliminated by applying subgame perfect Nash equilibrium. However, Bayesian games often contain non-singleton information sets and since subgames must contain complete information sets, sometimes there is only one subgame—the entire game—and so every Nash equilibrium is trivially subgame perfect. Even if a game does have more than one subgame, the inability of subgame perfection to cut through information sets can result in implausible equilibria not being eliminated.
To summarize: in this variant of the gift game, there are two SPEs: either the sender always gives and the receiver always accepts, or the sender always does not give and the receiver always rejects. From these, only the first one is a PBE; the other is not a PBE since it cannot be supported by any belief-system.
More examples
For further examples, see signaling game#Examples. See also [2] for more examples. There is a recent application of this concept in Poker, by Loriente and Diez (2023).[3]