A model of a conflict situation with a fixed sequence of moves and interchange of information between the players. The main object of investigation in the theory of games with a hierarchy structure is the problem of finding the largest guaranteed result and an optimal strategy for a selected player. Suppose that players
and ,
respectively, tend to an increase in the pay-off functions
and (
cf. Gain function), continuous on the product of two compacta ;
,
.
The following different types of games can be formulated according to the character of the information and the order of moves.
The game .
Player
chooses
and communicates his choice to player .
Let
be the set of optimal choices of player .
Then the largest guaranteed result for player
is
The game .
Player
expects to have and indeed will have information on the choices of player ;
he communicates his strategy, that is, a function ,
where —
the set of all mappings from
to ,
to player .
The largest guaranteed result for player
is
where the set of optimal choices of player
is
where ,
and
if and only if
is achieved.
The game .
Player
expects to have and indeed will have information on the choices of player
in the form ,
where —
the set of all mappings from
to ;
he communicates to player
his strategy ,
where —
the set of mappings from
to .
The largest guaranteed result of player
is
where
,
where now
if and only if
is achieved.
A relation between the results in these games determines for player
knowledge of the information concerning the actions of player II: .
Using the scheme indicated in the construction of the strategies of the players, games with arbitrarily deep recursion can be formulated. The following assertion holds: in the games ,
,
the largest guaranteed result for player
is ;
in the games ,
,
the largest guaranteed result is .
The problem of determining
is related to a class of problems of minimax type with related restrictions.
Methods have been developed for solving
using penalty functions, necessary optimality conditions and approximation to the original game by games with unique responses for player .
Complete solutions are known for special classes of games: games with close interests, bimatrix games, bilinear games, etc. The problem of determining
is not well-posed relative to changes in the function
in the uniform metric and of the sets
and
in the Hausdorff metric. A general method has been proposed for regularizing the solution of the game ;
regularization of the problem relative to the pay-off function of
is effected at the expense of introducing an artificial inaccuracy in the determination of .
The determination of the magnitude
reduces to the solution of a set of problems in mathematical programming.
Suppose that for arbitrary
the following functions, sets and numbers are defined:
Under the conditions stated,
and the strategy
guarantees that player
receives
for sufficiently small .
As is clear from the definitions, an optimal strategy consists of a certain number of stages, the last playing the part of a strategy by punishment.
If
and if
has no local maxima with value
on ,
then
and an optimal strategy has the simple form:
A solution can be found in a similar way for ;
it also reduces to the solution of a sequence of problems in mathematical programming.
When side payments for player
are introduced into a game with a hierarchy structure as functions of the choices of player ,
the expression for the largest guaranteed result for player
is significantly simplified. In the game ,
where
,
,
,
and player
chooses strategies ,
,
the determination of
reduces to the solution of a problem in mathematical programming:
In general, the application of arbitrarily small side payments
in games with a hierarchy structure allows player
to achieve the largest possible guaranteed result, reckoning on the generosity of his partner.
The games formulated can be generalized to the case of step-by-step receipt and use of information in a dynamical way. In the case where the states of the players are described by differential or difference equations there arises an extensive class of problems connected with the diversity of the forms of the players' information on the state and trend as a physical process as well as a process of making a decision. Generalizations of the games
and
are considered to the case of prohibited situations, that is, the presence of joint restrictions on the players' choices.
The formulations mentioned relate to the case where player
has complete information on the pay-off function and the set of his choices. If player
knows that the continuous pay-off function of
satisfies the inequalities
for known continuous functions
and ,
then the largest guaranteed result in
is defined by maximizing conditions for a function of a single variable.
A more general version of the case where player
has incomplete information of the interests of player
is as follows. Player
knows the function ,
,
and knows that the true pay-off function satisfies
for some unknown value .
With such information, the solution of
for finite sets
reduces to maximizing functions of several variables; for infinite
the problem is more complicated. The presence of indefinite factors in the formulation of
does not lead to a significant complication of the problem, since this case reduces to that of a case without indefiniteness. In the indefinite case of ,
a number of problems are considered, where the concept of a players' strategy is extended at the expense of the hypothesis that player
communicates his effectiveness criterion to player ,
that is, some ,
so that the final choice
can be performed by obtaining information about
and the effectiveness criterion of player .
If player
is cautious in the case, that is, he holds to the principle of the largest guaranteed result, and player
communicates to him the parametrized strategy ,
,
then it can be shown that the largest guaranteed result of player
is ,
where
is the largest guaranteed result of player
in the game
for a given .
A similar result holds without assuming that player
is cautious, if player
knows a parametric family of sets ,
,
one of which is the true one.
Close to the problem just discussed is that of finding the largest guaranteed result of player
in
in the presence of a parameter
in the pay-off functions of the players characterizing environmental uncertainty, where player
is informed by his choice of the concrete value of
and player
is not informed.
In the case where
is repeated indefinitely, the extent to which player
is informed about the interests and possibilities of player
can be increased because of the information contained in the responses of player
to the action of player .
Procedures are accordingly constructed that allow player ,
starting with some play, to obtain a result arbitrarily close to the result guaranteed to him by complete information. Such results are also obtained in a game
with indefiniteness. If the moments when player
obtains information on the indeterminate factors
are not fixed, then player
can obtain in the remaining repetitions a result that is arbitrarily close to that guaranteed to him by complete information, under weaker assumptions on the pay-off functions of the participants. Moreover, player
in
can obtain a similar result simply by observing the values of his own pay-off function.
The formulations of the games under consideration carry over naturally to the case of many persons whose interactions, in the sense of priority of action and transfer of information, have a hierarchy structure. In analyzing these games it is necessary to stipulate a rule of interaction of the players on the same level. Thus, when three-person games are considered, where the pay-off functions of the players have the form
,
,
,
then in order to describe the largest guaranteed result of a chosen player
who has priority of action, it is necessary to make concrete his information on the behaviour of the players
and .
If
and
form a rigid coalition to the knowledge of ,
that is, they formulate coalition criteria and determine their choices together, this case is equivalent to the previous two-person games as far as
is concerned. Clear results have been obtained also in the case where the players
and
either are in a coalition known to player
or act as individuals if they can then obtain a better result than is given by coalition; in this case neither player
nor player
has independent information on the moves of the other, and the order of these moves is given by player .
Games having a "fan" structure have been analyzed in detail: a distinguished player (
who controls the centre) and
other players on the next level in the hierarchy (the producers of the output) tend to an increase in the pay-off functions
and ,
,
respectively, where
is the choice of ,
,
,
and
is the set of choices of the players on the lower level of the hierarchy, who act moreover, independently, and the player with index
deals with the choice .
All sets are assumed to be compact and the functions to be continuous. Player
expects information (and will have it) on the choices
and informs every player
of the corresponding strategy function
defined on
with values in .
For -
person games with a hierarchy structure, expressions have been obtained for the largest guaranteed result of the distinguished player under various extensions of his class of strategies, at the expense of transmitting to the players on lower levels information on the actions of their colleagues, as well of of introducing actions of their colleagues and elements of bluff. As with games for two persons, the possibility of side payments to the distinguished player simplifies the determination of his guaranteed result considerably.
Using games with a hierarchy structure, a natural interpretation has been obtained of the various mechanisms of centralized control of active economic subsystems. The game
describes the process of centralized control by means of prices;
models the policy of penalties and encouragement via stimulation of production; and
models the process of resource distribution as a function of the industrial methods of using these resources.
References[edit]
[1] | Yu.B. Germeier, "Non-antagonistic games" , Reidel (1986) (Translated from Russian) |
Game
is often referred to as a Stackelberg game. In the formulation given, player
is the leader who conveys his decision to player ,
the follower, who makes his decision afterwards. See [a1], Chapt. IV. In the economic literature, game
is said to have an incentive structure. Player ,
the leader again, does not announce his action, but instead his strategy to player .
The decision of player
then also determines the action (i.e. decision) of player ;
player '
s decision is substituted into player '
s strategy, which results in player '
s decision [a2].
References[edit]
[a1] | T. Basar, G.J. Olsder, "Dynamic noncooperative game theory" , Acad. Press (1982) |
[a2] | P.B. Luk, Y.C. Ho, G.J. Olsder, "A control-theoretical view on incentives" Automatica , 18 (1982) pp. 167–179 |