Game with a hierarchy structure

A model of a conflict situation with a fixed sequence of moves and of exchanges of information between the players. The main object of investigation in the theory of games with a hierarchy structure is the problem of finding the largest guaranteed result and an optimal strategy for a distinguished player. Suppose that players $I$ and $II$ seek to maximize their pay-off functions $f_{1} (x_{1}, x_{2})$ and $f_{2} (x_{1}, x_{2})$, respectively (cf. Gain function), which are continuous on the product of two compacta $X_{1}$, $X_{2}$; $x_{1} \in X_{1}$, $x_{2} \in X_{2}$. The following types of games can be formulated, depending on the nature of the information and the order of moves.

The game $Γ_{1}$ . Player $I$ chooses $x_{1} \in X_{1}$ and communicates his choice to player $II$ . Let

$P (x_{1}) = \{ x_{2} : f_{2} (x_{1}, x_{2}) = \max_{y \in X_{2}} f_{2} (x_{1}, y) \}$

be the set of optimal choices of player $II$ . Then the largest guaranteed result for player $I$ is

$G_{1} = \sup_{x_{1} \in X_{1}} \min_{x_{2} \in P (x_{1})} f_{1} (x_{1}, x_{2}) .$
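
For finite sets $X_{1}$ and $X_{2}$ the formula for $G_{1}$ can be evaluated by direct enumeration. The following is a minimal sketch under that assumption; the function name `stackelberg_guarantee` and the pay-off matrices are hypothetical and serve only as an illustration.

```python
def stackelberg_guarantee(f1, f2):
    """Largest guaranteed result G_1 of player I in a finite game.

    f1[i][j] and f2[i][j] are the pay-offs of players I and II when
    player I picks row i and player II picks column j.
    Returns (G_1, index of a row attaining it).
    """
    best_value, best_row = None, None
    for i, (row1, row2) in enumerate(zip(f1, f2)):
        top = max(row2)
        # P(x_1): all best responses of player II to row i.
        responses = [j for j, value in enumerate(row2) if value == top]
        # Player I can only count on the worst of these responses.
        guaranteed = min(row1[j] for j in responses)
        if best_value is None or guaranteed > best_value:
            best_value, best_row = guaranteed, i
    return best_value, best_row


# Illustrative pay-off matrices (assumed data, not from the article).
F1 = [[3, 0], [2, 1]]
F2 = [[1, 1], [0, 2]]
G1, row = stackelberg_guarantee(F1, F2)
```

In this example player $II$ is indifferent between both columns after row 0, so player $I$ can count only on 0 there, whereas row 1 has a unique response and guarantees 1.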

The game $Γ_{2}$. Player $I$ expects to have, and indeed will have, information on the choice of player $II$; he communicates to player $II$ his strategy, that is, a function $\tilde{x}_{1} = x_{1} (x_{2})$, where $\tilde{x}_{1} \in \tilde{X}_{1}$, the set of all mappings from $X_{2}$ to $X_{1}$. The largest guaranteed result for player $I$ is

$G_{2} = \sup_{\tilde{x}_{1} \in \tilde{X}_{1}} \inf_{x_{2} \in P_{2} (\tilde{x}_{1})} f_{1} (\tilde{x}_{1}, x_{2}),$

where the set of optimal choices of player $II$ is

$P_{2} (\tilde{x}_{1}) = \{ x_{2} : f_{2} (\tilde{x}_{1}, x_{2}) = \sup_{y \in X_{2}} f_{2} (x_{1} (y), y) - δ (\tilde{x}_{1}) \},$

where $δ (\tilde{x}_{1}) \geq 0$, and $δ (\tilde{x}_{1}) = 0$ if and only if $\max_{y \in X_{2}} f_{2} (x_{1} (y), y)$ is attained.

The game $Γ_{3}$. Player $I$ expects to have, and indeed will have, information on the choice of player $II$ in the form $\tilde{x}_{2} = x_{2} (x_{1})$, where $\tilde{x}_{2} \in \tilde{X}_{2}$, the set of all mappings from $X_{1}$ to $X_{2}$; he communicates to player $II$ his strategy $\tilde{\tilde{x}}_{1} = x_{1} (\tilde{x}_{2})$, where $\tilde{\tilde{x}}_{1} \in \tilde{\tilde{X}}_{1}$, the set of mappings from $\tilde{X}_{2}$ to $X_{1}$. The largest guaranteed result of player $I$ is

$G_{3} = \sup_{\tilde{\tilde{x}}_{1} \in \tilde{\tilde{X}}_{1}} \inf_{\tilde{x}_{2} \in P_{3} (\tilde{\tilde{x}}_{1})} f_{1} (\tilde{\tilde{x}}_{1}, \tilde{x}_{2}),$

where

$P_{3} (\tilde{\tilde{x}}_{1}) = \{ \tilde{x}_{2} : f_{2} (\tilde{\tilde{x}}_{1}, \tilde{x}_{2}) = \sup_{y \in \tilde{X}_{2}} f_{2} (x_{1} (y), y) - δ (\tilde{\tilde{x}}_{1}) \},$

$δ (\tilde{\tilde{x}}_{1}) \geq 0$, where now $δ (\tilde{\tilde{x}}_{1}) = 0$ if and only if $\max_{y \in \tilde{X}_{2}} f_{2} (x_{1} (y), y)$ is attained.

The relation between the results of these games shows the value to player $I$ of information about the actions of player $II$: $G_{1} \leq G_{3} \leq G_{2}$. Using the scheme indicated for constructing the players' strategies, games with arbitrarily deep recursion can be formulated. The following assertion holds: in the games $Γ_{2 m}$, $m > 1$, the largest guaranteed result for player $I$ is $G_{2}$; in the games $Γ_{2 m + 1}$, $m > 1$, the largest guaranteed result is $G_{3}$. The problem of determining $G_{1}$ is related to the class of minimax problems with coupled constraints.

Methods have been developed for solving $Γ_{1}$ using penalty functions, necessary optimality conditions and approximation of the original game by games with unique responses of player $II$. Complete solutions are known for special classes of games: games with close interests, bimatrix games, bilinear games, etc. The problem of determining $G_{1}$ is not well-posed with respect to perturbations of the function $f_{2} (x_{1}, x_{2})$ in the uniform metric and of the sets $X_{1}$ and $X_{2}$ in the Hausdorff metric. A general method has been proposed for regularizing the solution of the game $Γ_{1}$; regularization with respect to the pay-off function of player $II$ is achieved by introducing an artificial inaccuracy in the determination of $\max_{x_{2} \in X_{2}} f_{2} (x_{1}, x_{2})$. The determination of $G_{2}$ reduces to the solution of a set of problems in mathematical programming.

Suppose that for arbitrary $ϵ > 0$ the following functions, sets and numbers are defined:

$f_{2} (x_{1}^{H} (x_{2}), x_{2}) = \min_{x_{1} \in X_{1}} f_{2} (x_{1}, x_{2}),$

$L_{2} = \max_{x_{2} \in X_{2}} f_{2} (x_{1}^{H} (x_{2}), x_{2}) = \max_{x_{2} \in X_{2}} \min_{x_{1} \in X_{1}} f_{2} (x_{1}, x_{2}),$

$E_{2} = \{ x_{2} \in X_{2} : f_{2} (x_{1}^{H} (x_{2}), x_{2}) = L_{2} \},$

$D = \{ (x_{1}, x_{2}) : f_{2} (x_{1}, x_{2}) > L_{2} \},$

$K = \begin{cases} \sup_{(x_{1}, x_{2}) \in D} f_{1} (x_{1}, x_{2}) & \text{if } D \neq \emptyset, \\ - \infty & \text{if } D = \emptyset, \end{cases}$

$f_{1} (x_{1}^{ϵ}, x_{2}^{ϵ}) \geq K - ϵ, \quad (x_{1}^{ϵ}, x_{2}^{ϵ}) \in D \neq \emptyset,$

$M = \inf_{x_{2} \in E_{2}} \sup_{x_{1} \in X_{1}} f_{1} (x_{1}, x_{2}),$

$f_{1} (x_{1}^{a ϵ} (x_{2}), x_{2}) \geq \sup_{x_{1} \in X_{1}} f_{1} (x_{1}, x_{2}) - ϵ .$

Under the conditions stated, $G_{2} = \max (K, M)$ and the strategy

$\tilde{x}_{1}^{ϵ} = \begin{cases} x_{1}^{ϵ} & \text{if } x_{2} = x_{2}^{ϵ}, \ K > M, \\ x_{1}^{a ϵ} (x_{2}) & \text{if } x_{2} \in E_{2}, \ K \leq M, \\ x_{1}^{H} (x_{2}) & \text{otherwise}, \end{cases}$

guarantees that player $I$ receives $\max (K, M) - ϵ$ for sufficiently small $ϵ$. As is clear from the definitions, an optimal strategy consists of a certain number of stages, the last of which plays the role of a punishment strategy.
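
For finite $X_{1}$ and $X_{2}$ all suprema are attained, so the quantities $L_{2}$, $E_{2}$, $D$, $K$, $M$ and the strategy above can be computed by enumeration with $ϵ = 0$. The sketch below assumes this finite setting; the function name `gamma2_guarantee`, the dictionary keys and the matrices are hypothetical choices made for illustration.

```python
import math


def gamma2_guarantee(f1, f2):
    """Evaluate G_2 = max(K, M) for finite pay-off matrices f1, f2.

    Rows are choices of player I, columns are choices of player II.
    The returned strategy lists, for each column x_2, the row x_1(x_2).
    """
    rows, cols = range(len(f1)), range(len(f1[0]))
    # x_1^H(x_2): the row that punishes player II most in column j.
    punish = [min(rows, key=lambda i: f2[i][j]) for j in cols]
    punished = [f2[punish[j]][j] for j in cols]
    L2 = max(punished)
    E2 = [j for j in cols if punished[j] == L2]
    D = [(i, j) for i in rows for j in cols if f2[i][j] > L2]
    K = max((f1[i][j] for i, j in D), default=-math.inf)
    M = min(max(f1[i][j] for i in rows) for j in E2)
    if K > M:
        # Offer the best pair in D, punish every other choice.
        i_star, j_star = max(D, key=lambda p: f1[p[0]][p[1]])
        strategy = [i_star if j == j_star else punish[j] for j in cols]
    else:
        # Play the best row for player I on E_2, punish elsewhere.
        strategy = [
            max(rows, key=lambda i: f1[i][j]) if j in E2 else punish[j]
            for j in cols
        ]
    return {"L2": L2, "E2": E2, "D": D, "K": K, "M": M,
            "G2": max(K, M), "strategy": strategy}


# Illustrative pay-off matrices (assumed data, not from the article).
F1 = [[5, 0], [0, 1]]
F2 = [[2, 3], [0, 1]]
result = gamma2_guarantee(F1, F2)
```

With these matrices $L_{2} = 1$, $K = 5$ and $M = 1$: player $I$ answers column 0 with row 0 and punishes column 1 with row 1, so player $II$ prefers column 0 (pay-off 2 rather than 1) and player $I$ obtains 5. Announcing a fixed row, as in $Γ_{1}$, would guarantee only 1 in the same example.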

If $L_{2} < f_{2} (x_{1}, x_{2})$ and if $f_{2} (x_{1}, x_{2})$ has no local maxima with value $L_{2}$ on $X_{1} \times X_{2}$ , then $K \geq M$ and an optimal strategy has the simple form:

$\tilde{x}_{1}^{ϵ} = \begin{cases} x_{1}^{ϵ} & \text{if } x_{2} = x_{2}^{ϵ}, \\ x_{1}^{H} (x_{2}) & \text{if } x_{2} \neq x_{2}^{ϵ} . \end{cases}$

A solution can be found in a similar way for $Γ_{3}$ ; it also reduces to the solution of a sequence of problems in mathematical programming.

When side payments for player $I$ are introduced into a game with a hierarchy structure as functions of the choices of player $II$ , the expression for the largest guaranteed result for player $I$ is significantly simplified. In the game $Γ_{2}$ , where

$w_{1} = f_{1} (x_{1}, x_{2}) - z, w_{2} = f_{2} (x_{1}, x_{2}) + z,$

$x_{1} \in X_{1}$ , $x_{2} \in X_{2}$ , $0 \leq z \leq z^{0}$ , and player $I$ chooses strategies $x_{1} (x_{2})$ , $z (x_{2})$ , the determination of $G_{2}$ reduces to the solution of a problem in mathematical programming:

$G_{2} = \max_{x_{1}, x_{2}} \min [f_{1} (x_{1}, x_{2}), f_{1} (x_{1}, x_{2}) + f_{2} (x_{1}, x_{2}) - L_{2}],$

$f_{2} (x_{1}, x_{2}) \geq L_{2} - z^{0} .$
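
For finite $X_{1}$ and $X_{2}$ this programming problem can be evaluated by enumerating the pairs $(x_{1}, x_{2})$ that satisfy the constraint. The following is a minimal sketch under that assumption; the function name `gamma2_with_side_payments` and the matrices are hypothetical.

```python
def gamma2_with_side_payments(f1, f2, z0):
    """Evaluate max min[f1, f1 + f2 - L2] subject to f2 >= L2 - z0.

    f1, f2 are finite pay-off matrices (rows: player I, columns:
    player II) and z0 is the upper bound on the side payment z.
    """
    rows, cols = range(len(f1)), range(len(f1[0]))
    # L_2: what player II can secure against punishment.
    L2 = max(min(f2[i][j] for i in rows) for j in cols)
    values = [
        min(f1[i][j], f1[i][j] + f2[i][j] - L2)
        for i in rows
        for j in cols
        if f2[i][j] >= L2 - z0  # the payment needed does not exceed z0
    ]
    # The feasible set is never empty: a pair with f2 == L2 always exists.
    return max(values)


# Illustrative pay-off matrices (assumed data, not from the article).
F1 = [[6, 0], [0, 1]]
F2 = [[0, 3], [0, 1]]
without_payments = gamma2_with_side_payments(F1, F2, 0)
with_payments = gamma2_with_side_payments(F1, F2, 2)
```

Here $L_{2} = 1$. With $z^{0} = 0$ the value is 1, while with $z^{0} = 2$ player $I$ can compensate player $II$ for choosing column 0 and the value rises to $6 + 0 - 1 = 5$.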

In general, the use of arbitrarily small side payments $z (x_{2})$ in games with a hierarchy structure allows player $I$ to achieve the largest guaranteed result that is possible when counting on the benevolence of his partner.

The games formulated can be generalized to the case where information is received and used step by step in a dynamic setting. When the states of the players are described by differential or difference equations, an extensive class of problems arises, connected with the variety of forms of the players' information on the state and evolution of both the physical process and the decision-making process. Generalizations of the games $Γ_{1}$ and $Γ_{2}$ to the case of forbidden situations, that is, joint constraints on the players' choices, have also been considered.

The formulations above relate to the case where player $I$ has complete information on the pay-off function and the set of choices of player $II$. If player $I$ knows only that the continuous pay-off function of player $II$ satisfies the inequalities

$f_{2}^{-} (x_{1}, x_{2}) \leq f_{2} (x_{1}, x_{2}) \leq f_{2}^{+} (x_{1}, x_{2})$

for known continuous functions $f_{2}^{-} (x_{1}, x_{2})$ and $f_{2}^{+} (x_{1}, x_{2})$ , then the largest guaranteed result in $Γ_{2}$ is defined by maximizing conditions for a function of a single variable.

A more general version of the case where player $I$ has incomplete information about the interests of player $II$ is as follows. Player $I$ knows the function $f_{2} (x_{1}, x_{2}, α)$, $α \in A$, and knows that the true pay-off function satisfies $f_{2} (x_{1}, x_{2}) = f_{2} (x_{1}, x_{2}, α_{0})$ for some unknown value $α = α_{0}$. With such information, the solution of $Γ_{2}$ for finite sets $A$ reduces to maximizing functions of several variables; for infinite $A$ the problem is more complicated. The presence of uncertain factors in the formulation of $Γ_{1}$ does not complicate the problem significantly, since this case reduces to the case without uncertainty. For $Γ_{2}$ with uncertainty, a number of problems have been considered in which the concept of a player's strategy is extended by the hypothesis that player $I$ communicates his effectiveness criterion to player $II$, that is, some $\hat{a} \in A$, so that the final choice of $x_{1}$ can be made after obtaining information about $x_{2}$ and the effectiveness criterion of player $II$. If player $II$ is cautious in this case, that is, he adheres to the principle of the largest guaranteed result, and player $I$ communicates to him the parametrized strategy $x_{1 α} (x_{2}, \hat{a})$, $α \in A$, then it can be shown that the largest guaranteed result of player $I$ is $G_{2} = \inf_{α \in A} G_{2 α}$, where $G_{2 α}$ is the largest guaranteed result of player $I$ in the game $Γ_{2}$ for a given $α \in A$. A similar result holds without assuming that player $II$ is cautious, if player $I$ knows a parametric family of sets $X_{2} (α)$, $α \in A$, one of which is the true one.
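
For a finite parameter set $A$ and finite choice sets, the expression $G_{2} = \inf_{α \in A} G_{2 α}$ can be evaluated by computing each $G_{2 α}$ as $\max (K, M)$ and taking the smallest value. The sketch below only evaluates this expression and does not model the exchange of information itself; the function names, the labels of the parameter values and the matrices are hypothetical.

```python
import math


def gamma2_value(f1, f2):
    """G_2 = max(K, M) for finite pay-off matrices f1, f2."""
    rows, cols = range(len(f1)), range(len(f1[0]))
    # Pay-off of player II in each column under the harshest punishment.
    punished = [min(f2[i][j] for i in rows) for j in cols]
    L2 = max(punished)
    K = max(
        (f1[i][j] for i in rows for j in cols if f2[i][j] > L2),
        default=-math.inf,
    )
    M = min(max(f1[i][j] for i in rows) for j in cols if punished[j] == L2)
    return max(K, M)


def guaranteed_under_uncertainty(f1, f2_family):
    """Smallest G_2 over a finite family {alpha: f2(., ., alpha)}."""
    return min(gamma2_value(f1, f2) for f2 in f2_family.values())


# Illustrative data (assumed): two possible pay-off matrices of player II.
F1 = [[5, 0], [0, 1]]
F2_FAMILY = {
    "alpha_a": [[2, 3], [0, 1]],
    "alpha_b": [[0, 3], [0, 1]],
}
G2 = guaranteed_under_uncertainty(F1, F2_FAMILY)
```

In this example the two parameter values give $G_{2 α}$ equal to 5 and 1 respectively, so the guaranteed result under uncertainty is 1.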

Close to the problem just discussed is that of finding the largest guaranteed result of player $I$ in $Γ_{2}$ when the pay-off functions of the players contain a parameter $α$ characterizing environmental uncertainty, where player $II$ knows the actual value of $α$ when he makes his choice and player $I$ does not.

In the case where $Γ_{2}$ is repeated indefinitely, player $I$ can learn more about the interests and possibilities of player $II$ from the information contained in the responses of player $II$ to the actions of player $I$. Procedures have accordingly been constructed that allow player $I$, from some repetition onwards, to obtain a result arbitrarily close to the result guaranteed to him under complete information. Such results have also been obtained for the game $Γ_{1}$ with uncertainty. If the moments at which player $I$ obtains information on the uncertain factors $α$ are not fixed, then player $I$ can obtain in the remaining repetitions a result arbitrarily close to that guaranteed to him under complete information, under weaker assumptions on the pay-off functions of the participants. Moreover, player $I$ in $Γ_{1}$ can obtain a similar result simply by observing the values of his own pay-off function.

The formulations of the games under consideration carry over naturally to the case of many persons whose interactions, in the sense of priority of action and transfer of information, have a hierarchy structure. In analyzing these games it is necessary to stipulate a rule of interaction of the players on the same level. Thus, when three-person games are considered, where the pay-off functions of the players have the form

$w_{1} = f_{1} (x_{1}, x_{2}, x_{3}),$

$w_{2} = f_{2} (x_{1}, x_{2}, x_{3}), w_{3} = f_{3} (x_{1}, x_{2}, x_{3}),$

$x_{1} \in X_{1}$, $x_{2} \in X_{2}$, $x_{3} \in X_{3}$, then in order to describe the largest guaranteed result of a distinguished player $I$ who has priority of action, it is necessary to specify his information on the behaviour of players $II$ and $III$. If, to the knowledge of $I$, players $II$ and $III$ form a rigid coalition, that is, they formulate coalition criteria and determine their choices jointly, then as far as $I$ is concerned this case is equivalent to the two-person games above. Clear results have also been obtained in the case where players $II$ and $III$ either are in a coalition known to player $I$ or act individually if they can thereby obtain a better result than the coalition gives them; in this case neither player $II$ nor player $III$ has independent information on the moves of the other, and the order of these moves is fixed by player $I$. Games having a "fan" structure have been analyzed in detail: a distinguished player $Π_{0}$ (who controls the centre) and $n$ other players on the next level of the hierarchy (the producers of the output) seek to maximize the pay-off functions $f_{0} (x_{0}, x)$ and $f_{i} (x_{0}^{i}, x_{i})$, $i = 1, \dots, n$, respectively, where $x_{0} = \{ x_{0}^{1}, \dots, x_{0}^{n} \}$ is the choice of $Π_{0}$, $x_{0} \in X_{0}$, $x_{0}^{i} \in X_{0}^{i}$, and $x = \{ x_{1}, \dots, x_{n} \}$ is the set of choices of the players on the lower level of the hierarchy, who moreover act independently; the player with index $i$ makes the choice $x_{i} \in X_{i}$. All sets are assumed to be compact and all functions continuous. Player $Π_{0}$ expects to have (and will have) information on the choices $x_{i} \in X_{i}$ and informs every player $i$ of the corresponding strategy function $\tilde{x}_{0}^{i} = x_{0}^{i} (x_{i})$, defined on $X_{i}$ with values in $X_{0}^{i}$.
For $n$-person games with a hierarchy structure, expressions have been obtained for the largest guaranteed result of the distinguished player under various extensions of his class of strategies, obtained by transmitting to the players on lower levels information on the actions of their colleagues, as well as by introducing actions of their colleagues and elements of bluff. As with two-person games, the possibility of side payments to the distinguished player simplifies the determination of his guaranteed result considerably.

Using games with a hierarchy structure, a natural interpretation has been obtained of the various mechanisms of centralized control of active economic subsystems. The game $Γ_{1}$ describes the process of centralized control by means of prices; $Γ_{2}$ models the policy of penalties and encouragement via stimulation of production; and $Γ_{3}$ models the process of resource distribution as a function of the industrial methods of using these resources.

References

[1]	Yu.B. Germeier, "Non-antagonistic games", Reidel (1986) (Translated from Russian)

Comments

Game $Γ_{1}$ is often referred to as a Stackelberg game. In the formulation given, player $I$ is the leader, who conveys his decision to player $II$, the follower, who makes his decision afterwards. See [a1], Chapt. IV. In the economic literature, game $Γ_{2}$ is said to have an incentive structure. Player $I$, again the leader, announces to player $II$ not his action but his strategy. The decision of player $II$ then also determines the action (i.e. decision) of player $I$: player $II$'s decision is substituted into player $I$'s strategy, which yields player $I$'s decision [a2].

References

[a1]	T. Basar, G.J. Olsder, "Dynamic noncooperative game theory", Acad. Press (1982)
[a2]	P.B. Luh, Y.C. Ho, G.J. Olsder, "A control-theoretical view on incentives", Automatica, 18 (1982) pp. 167–179