CUBO A Mathematical Journal Vol.11, N o ¯ 02, (55–84). May 2009 Cournot Models: Dynamics, Uncertainty and Learning Ferenc Szidarovszky Systems & Industrial Engineering Department, The University of Arizona, Tucson, Arizona, 85721-0020, USA email: szidar@sie.arizona.edu and Vernon L. Smith and Steven Rassenti Interdisciplinary Center for Economic Science, George Mason University, 3330 Washington Boulevard Arlington, VA 22201 emails: vsmith2@gmu.edu, srassent@gmu.edu ABSTRACT This chapter gives an overview of the recent developments in the theory of dynamic oligopolies including some new results. We will discuss the Cournot classical model and its extensions to product differentiation, multiproduct models, price adjusting oligopolies, labor managed and rent seeking games. The dynamic process based on these models will be analyzed. From the theoretical point of view we will investigate models with and without full information, with partial cooperation among the firms, and under the assumption that the information about the production levels of the rivals has time delay. We will also introduce and discuss special learning procedures based on repeated price information. We will also briefly discuss investigations based on laboratory experiments, in which more realistic cases can be examined. RESUMEN Damos una descripción general de los recientes desarrollos de la teoria de dinámica de oligopolios incluyendo algunos resultados novos. Discutimos el modelo clásico de Cournot y sus extensiones 56 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) a la diferenciación producto, modelos multiproducto, ajuste de precios de oligopolios, labores dirigidas y juegos de rent seeking. La dinámica de procesos basados en estos modelos será anal- izada. Desde el punto de vista teórico investigaremos modelos con y sin información completa, con cooperación parcial entre las firmas, y bajo la suposición que la información al respecto de los niveles de producción de los rivales tiene un tiempo de retrazo. Introduzimos y discutimos proce- sos especiales de aprendizaje basados en información de precios repetidos. También brevemente discutimos investigaciones sobre experimentos de laboratórios, en los cuales casos mas realisticos son examinados. Key words and phrases: n-person game, oligopoly, dynamic systems, stability learning. Math. Subj. Class.: 91A20, 91A80. 1 Introduction Since the pioneering work of [1] oligopoly models are the most frequently discussed topics in the literature of mathematical economics. They describe the interaction of manufacturers and service suppliers through some market demand structure. Most of the authors consider this problem as a game in which the supplied quantities (or selected prices) are the strategies and the payoff functions are defined as the profits of the firms. In a competitive environment the Nash equilibrium is the solution of the game, in which none of the firms can increase its profit by changing its production level alone. In an N -person oligopoly there is no guarantee in general for the existence of such equilibrium, and even if equilibrium exists, there is the possibility of the existence of multiple equilibria ([5]). Several versions of the Cournot model has been developed in the literature including oligopolies without and with product differentiation, multiproduct oligopolies, labor managed and rent seeking models as well as oligopsonies, in which the firms also compete on the factor market. A comprehensive summary of the different model variants is presented in [6]. If the state of an oligopoly is a Nash equilibrium at a certain time, then no firm has the interest to move away from the equilibrium, therefore the state will remain at the equilibrium for all future times. However in a disequibilium state at least one of the firms is able to increase its profit by changing strategy, so the state of the system will change. If the new state is an equilibrium, then it will remain the state of the system for all future times. Otherwise another change of the state will occur, and so, a dynamic process will develop. The dynamics of this process depend on the desires of the firms, and also on the accuracy and timing of the available information. During this process the firms are able to monitor repeated information on the demand structure and the actions of the competitors, which rises the possibility of some learning mechanisms. The research on dynamic behavior of the firms can be done in two fundamentally different ways. The mathematical models are always based on certain assumptions on the objectives of the firms, on the types and analytical properties of the functions involved, and on the information structure. Those assumptions very often differ from the economic reality. In more realistic cases it is very often impossible to obtain nice mathematical results, so simulation is used. The applied simulation methodology still depends on the basic assumptions of the model, it simple generates experimental results in the absence of analytical tools. CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 57 Another methodology is based on actual laboratory experiments, when real people make the decisions under computer generated environment, and by monitoring their repeated actions we are able to gain a detailed understanding of their priorities, information usage, and decision making mechanism. In this paper we will give a brief overviews of the most important problems arising in examining dynamic oligopolies, and in addition we will offer some new results on this topic. This chapter is developed as follows. Static oligopoly models will be first introduced in Session 2 and the existence and uniqueness of the equilibrium will be discussed briefly. Dynamic models with full information on the demand structure will be then examined, and Session 4 will be devoted to partial cooperation among the firms. Session 5 will deal with the effect of uncertainty in the information on the demand structure. Information delays will be considered in Session 6 when we will discuss how the stability of the equilibrium is lost through Hopf bifurcation giving the possibility of the birth of limit cycles. Some special learning schemes will be introduced in Session 7. The fundamentals of experimental economics will be outlined in Session 8. Finally, some conclusions will be drawn. 2 The Cournot Model and Its Extensions Consider an economy of N firms that produce the same product or offer the same service to a homogeneous market. Let k = 1, 2, . . . , N denote the firms and xk the produced or offered quantity of firm k. The total output of the industry is Q = ∑N k=1 xk, and we assume that the market price f is a function of Q. If Ck(xk) is the cost of firm k, then its profit is given as ϕk(x1, . . . , xN ) = xkf (Q) − Ck(xk). (2.1) Let Lk denote the capacity limit of firm k. Then an N -person noncooperative game is defined, in which the firms are the players, interval [0, Lk] is the set of strategies of player k, and ϕk is its payoff function. This model is known as the classical Cournot model, or the single-product quantity adjusting oligopoly without product differentiation. In the literature of oligopoly theory it is usually assumed that functions f and Ck (k = 1, 2, . . . , N ) are twice continuously differentiable and for all xk ∈ [0, Lk] and Q ∈ [0, ∑N k=1 Lk], (A) f ′ (Q) < 0; (B) xkf ′′ (Q) + f ′ (Q) < 0; (C) f ′ (Q) − C′′k (xk) < 0. Notice that the payoff ϕk of player k does not depend explicitly on the outputs of each individual competitor, it depends on only the output Qk = ∑ l 6=k xl of the rest of the industry. For each feasible value of Qk we can easily determine the best strategy choice of firm k, which is called its best reply. Under assumptions (A)-(C), function ϕk is strictly concave in xk with any fixed value of Qk, and simple differentiation shows that 58 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) arg max xk {xkf (xk + Qk) − Ck(xk)} =            0, if f (Qk) − C ′ k(0) < 0; Lk, if f (Qk + Lk) + Lkf ′ (Qk + Lk) −C′k(Lk) > 0; zk, otherwise. (2.2) where zk is the unique solution of the strictly monotonic equation f (Qk + xk) + xkf ′ (Qk + xk) − C ′ k(xk) = 0 (2.3) in [0, Lk]. Since the value of zk depends on Qk, we may write zk = Rk(Qk). In our future analysis we will need the derivative of the best response function. In the neighborhood of an interior optimum, Rk is differentiable as the consequence of the implicit function theorem. By differentiating equation (2.3) implicitly with respect to Qk, we have f ′ · (1 + R′k) + R ′ k · f ′ + xkf ′′ · (1 + R′k) − C ′′ k · R ′ k = 0 implying that R ′ k(Qk) = − f ′ (Q) + xkf ′′ (Q) 2f ′(Q) + xkf ′′(Q) − C′′ k (xk) . Assumptions (B) and (C) imply that −1 < R′k(Qk) < 0. Clearly a strategy vector x ∗ = (x ∗ 1, . . . , x ∗ N ) is a Nash-equilibrium if and only if for all k, (i) x ∗ k ∈ [0, Lk]; (ii) x ∗ k = Rk ( ∑ l 6=k x ∗ l ) . It is well known (see for example, [5]) that under conditions (A)-(C) there is a unique Nash equilibrium. Assume next that the firms produce different but related goods. Let x1, . . . , xN denote again the pro- duced quantities and let fk(x1, . . . , xN ) denote the price of the product of firm k. Then the profit of this firm is given as ϕk(x1, . . . , xN ) = xkfk(x1, . . . , xN ) − Ck(xk). (2.4) This model is known as a single product oligopoly with product differentiation. In Bertrand (or price adjusting) oligopolies we consider a single product oligopoly with product differ- entiation in which each firm selects its price. If P1, . . . , PN are the selected prices and dk(P1, . . . , PN ) is the demand function of the product of firm k, then the profit of this firm can be obtained again by equation (2.4), where for all k, fk = Pk and CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 59 xk = dk(P1, . . . , PN ), so the profit functions now depend on only the price selections. Multiproduct oligopolies are obtained by assuming that the firms produce M different items. Let x (m) k denote the output of firm k of product m, then the firm’s production can be characterized by its production vector xk = (x (1) k , . . . , x (M) k ). The vector Q = ∑N k=1 xk shows the total production vectors of the industry. If Ck(xk) is the production cost of firm k, then its profit is given as ϕk(x1, . . . , xN ) = x T k f (Q) − Ck(xk). (2.5) Here f (Q) = ( f1(Q), . . . , fM (Q) ) with fm(Q) (1 ≤ m ≤ M ) being the price function of product m. Consider again the classical Cournot model (2.1) and let lk(xk) denote the labor force needed by firm k to produce output xk. Then the profit per labor of this firm can be given as Ψk(x1, . . . , xN ) = ϕk(x1, . . . , xN ) lk(xk) = xkfk(Q) − Ck(xk) lk(xk) . (2.6) If the firms maximize profits per labor instead of their total profits, then the oligopoly is said to be labor managed. Let N denote the number of agents involved in rent seeking activity. Let xk be the expenditure of agent k and fk(xk) its production function for lotteries, then the probability that agent k will win the rent is Pk = fk(xk) ∑N l=1 fl(xl) . If the rent is normalized to 1, then the expected net rent of agent k is given as Πk = fk(xk) ∑N l=1 fl(xl) − xk. (2.7) This model is known as a rent-seeking game. By introducing the new variables yk = fk(xk) and Ck = f −1 k it is clear that the payoff function (2.7) has the new form Πk = yk ∑N l=1 yl − Ck(yk). (2.8) Notice that function form (2.8) reduces to (2.1) by selecting f (Q) = 1/Q, so rent-seeking games are usually considered as special oligopolies. The equilibria of the above extensions can be defined in the same way as it has been presented for the classical Cournot model. Existence and uniqueness results are represented in [6]. 60 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) 3 Dynamic Models with Full Information In this session we consider only the classical Cournot model, and assume that the firms know the true price function and the simultaneous output values of all competitors. In this case for firm k, the best output choice is Rk(Qk) as it was shown in Session 2.2. If xk is the current output of the firm and Rk(Qk) is its desired output, then under continuous time scales the firm will change its output in the direction toward the desired output, since it cannot ”jump” with the output value instantaneously. Therefore the output change of the firms can be described by the system of ordinary differential equations ẋk(t) = Kk (Rk(Qk(t)) − xk(t)) (k = 1, 2, . . . , N ) (3.1) where Kk > 0 is a constant being called the speed of adjustment of firm k. Here we assume that the firms know their best response functions, and instantaneous information is available about the market price. If P (t) denotes the price at time period t, then P (t) = f (xk(t) + Qk(t)) from which firm k is able to calculate the output if the rest of the industry: Qk(t) = f −1 (P (t)) − xk(t). In this way the firms have all necessary information to proceed with output adjustments (3.1). An output vector x ∗ = (x ∗ 1, . . . , x ∗ N ) is a steady state of system (3.1) if and only if for all k, x ∗ k = Rk   ∑ l 6=k x ∗ l   , that is, when x ∗ is a Nash equilibrium. Example 1 Assume linear price and cost functions: f (Q) = B − AQ, Ck(xk) = akxk + bk (k = 1, 2, . . . , N ). In this case equation (2.3) can be written as B − A(Qk + xk) + xk(−A) − ak = 0 implying that xk = − Qk 2 + B − ak 2A , so the best response of firm k is CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 61 Rk(Qk) = − Qk 2 + B − ak 2A by assuming interior optimum. Therefore the dynamic system (3.1) can be rewritten as follows: ẋk = Kk  − 1 2 ∑ l 6=k xl − xk + B − ak 2A   (3.2) for k = 1, 2, . . . , N . ∇ Let now x ∗ = (x ∗ 1, . . . , x ∗ N ) denote an interior equilibrium of the classical Cournot model. We can examine the local asymptotic stability of this equilibrium with respect to the dynamic process (3.1). The Jacobian J C of the system at the equilibrium has the special structure        −K1 K1r1 · · · K1r1 K2r2 −K2 · · · K2r2 . . . . . . . . . . . . KN rN KN rN · · · −KN        (3.3) where rk = R ′ k(Q ∗ k) with Q ∗ k = ∑ l 6=k x ∗ l . We have seen earlier that −1 < rk < 0. The characteristic polynomial of J C can be written as ϕ(λ) = det(D + ab T − λI) with D = diag(−K1(1 + r1), . . . , −KN (1 + rN )), a = (K1r1, . . . , KN rN ) T , and b T = (1, . . . , 1). In obtaining a closed form representation of ϕ(λ) we can use the well-known fact that with any N -element vectors a and b, det(I + ab T ) = 1 + a T b. Therefore ϕ(λ) = det(D − λI) · det(I + (D − λI)−1abT ) = N ∏ k=1 (−Kk(1 + rk) − λ) · [ 1 + N ∑ k=1 Kkrk −Kk(1 + rk) − λ ] . (3.4) The main result of this session can be formulated as Theorem 1 All eigenvalues of J C have negative real parts implying the local asymptotic stability of the equilibrium. Proof The roots of function (3.4) are λ = −Kk(1 + rk), which are all negative, and the roots of the bracketed factor. This equation clearly can be rewritten as 62 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) s ∑ i=1 αi βi − λ + 1 = 0 (3.5) where αi < 0, βi < 0 and the βi values are different. This equation is equivalent to a polynomial equation of degree s, so there are s real or complex roots. Let g(λ) denote the left hand side, then lim λ→±∞ g(λ) = 1, lim λ→±βi g(λ) = ±∞, g ′ (λ) = s ∑ i=1 αi (βi − λ)2 < 0. The graph of function g is shown in Figure 1. Clearly there is a root before β1 and one root between βi and βi+1 for i = 1, 2, . . . , s − 1. Since βi < 0 for all i, we found s real negative roots. Hence all eigenvalues of J C are real and negative, which completes the proof. 2 1 2 1s s g 1 Figure 1: The graph of function g. Assume next that the time scales are discrete. If xk(t) denote the production level of firm k at time period t, then its best choice with given xl(t) (l 6= k) values is Rk(Qk(t)), where Qk(t) = ∑ l 6=k xl(t). In many industries the firms are unable to make large changes in their production levels during a single time period, therefore they select levels in the direction toward their best choices. This dynamism can be conveniently modelled as xk(t + 1) = αkxk(t) + (1 − αk)Rk(Qk(t)) (3.6) for k = 1, 2, . . . , N , where 0 ≤ αk < 1 is a given constant for all k. In order to examine the asymptotic behavior of the equilibrium notice first that the Jacobian of this system at the equilibrium can be given as follows: CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 63 J D =        α1 (1 − α1)r1 · · · (1 − α1)r1 (1 − α2)r2 α2 · · · (1 − α2)r2 . . . . . . . . . . . . (1 − αN )rN (1 − αN )rN · · · αN        . (3.7) We can rewrite this matrix similarly to the continuous case: J D = D + ab T with D = diag ((α1 − 1)r1 + α1, . . . , (αN − 1)rN + αN ) , a = ((1 − α1)r1, . . . , (1 − αN )rN ) T , and b T = (1, . . . , 1). Therefore the characteristic polynomial of J D is given as ϕ(λ) = det(D + ab T − λI) = det(D − λI) det(I + (D − λI)−1abT ) = N ∏ k=1 ((αk − 1)rk + αk − λ) · [ 1 + N ∑ k=1 (1 − αk)rk (αk − 1)rk + αk − λ ] . (3.8) In this case Theorem 1 can be modified as follows: Theorem 2 All eigenvalues of J D are inside the unit circle if and only if N ∑ k=1 (1 − αk)rk (αk − 1)rk + αk + 1 > −1. (3.9) In this case the equilibrium is locally asymptotically stable. If the left hand side is smaller than −1, then the equilibrium is unstable. Proof The eigenvalues are λ = (αk − 1)rk + αk, which are all positive, and the roots of the bracketed factor. First we show that the roots (αk − 1)rk + αk are inside the unit circle. Since they are positive it is sufficient to show that (αk − 1)rk + αk < 1, which is obviously true, since it can be rewritten as (αk − 1)(−rk − 1) > 0, where both factors are negative. The other roots are the solutions of equation (3.5) where αi < 0 and 0 < βi < 1 for all i. The graph of function g is the same as it is shown in Figure 1 with the only difference 64 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) that all βi values are between 0 and 1. All roots between the pairs βi and βi+1 are inside the unit circle, and the smallest root is also inside the unit circle if and only if g(−1) > 0. It is easy to see that this inequality is equivalent to condition (3.9). 2 If all firms select the best response, then αk = 0 for all k, and in this case condition (3.9) can be rewritten as N ∑ k=1 rk 1 − rk > −1, (3.10) which certainly holds if all values rk are sufficiently close to zero. In the case of symmetric oligopolies the Rk best responses are identical, so rk ≡ r. In this further special case relation (3.10) simplifies to the following: N r 1 − r > −1 which can be rewritten as r > −1 N − 1 . (3.11) Example 2 Consider again the linear case given in the previous example. Since R′k(Qk) = − 1 2 for all k, relation (3.11) holds only for N = 2, so the equilibrium is asymptotically stable for only duopolies with αk = 0 (k = 1, 2, . . . , N ). In this case relation(3.9) simplifies as N ∑ k=1 αk − 1 αk + 3 > −1 (3.12) which certainly holds if the αk values are sufficiently close to 1. ∇ In this session we have focused on the classical Cournot model, however its extensions and all model variants can be examined similarly to this case. 4 Models with Partial Cooperation A usual way how the firms might increase their profits is to seek certain cooperation with the rivals. A common way to model such ”partial” cooperation can be given as follows. Let ϕk denote the profit of firm k (1 ≤ k ≤ N ), and let γkl denote a nonnegative constant for all k and l, which shows the cooperation level of CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 65 firm k toward firm l. Then firm k maximizes ϕk + ∑ l 6=k γklϕl that is, it takes a certain portion of the profits of the competitors into account in addition to its own profit. Thus the payoff function of firm k becomes Ψk(x1, . . . , xN ) = (xkf (Q) − Ck(xk)) + ∑ l 6=k γkl(xlf (Q) − Cl(xl)) (4.1) in the case of the classical Cournot model. For the sake of mathematical convenience assume that γkl ≡ γk for all k and l, that is, each firm has identical cooperation levels toward its rivals. In this special case Ψk(x1, . . . , xN ) = (xk + γkQk)f (Q) − Ck(xk) − γk ∑ l 6=k Cl(xl). (4.2) With given value of Qk, the best response of firm k can be obtained by simple differentiation. Assuming interior optimum, then at the optimum f (xk + Qk) + (xk + γkQk)f ′ (xk + Qk) − C ′ k(xk) = 0. (4.3) The derivative of the left hand side with respect to xk is the following: 2f ′ (Q) + (xk + γkQk)f ′′ (Q) − C′′k (xk). Assume that conditions (A) and (C) introduced in Session 2.2 are satisfied with a modified version of condition (B): (B’) (xk + γkQk)f ′′ (Q) + f ′ (Q) < 0 for all xk ∈ [0, Lk], Qk ∈ [0, ∑ l 6=k Ll] and Q = xk + Qk. Then Ψk is strictly concave in xk, so the payoff maximizing xk value is unique. If it is interior, then it can be obtained as the unique solution of the monotonic equation (4.3). Let Rk(Qk) denote the solution as before. By differentiating equation (4.3) implicitly with respect to Qk it is easy to see that R ′ k(Qk) = − (1 + γk)f ′ (Q) + (xk + γkQk)f ′′ (Q) 2f ′(Q) + (xk + γkQk)f ′′(Q) − C′′ k (xk) . (4.4) Clearly both the numerator and denominator are negative, so R ′ k(Qk) is always negative. In addition, R ′ k(Qk) > −1 if the following stronger version of condition (C) is satisfied: (C’) (1 − γk)f ′ (Q) − C′′k (xk) < 0 for all 0 ≤ xk ≤ Lk and 0 ≤ Q ≤ ∑N l=1 Ll. Under conditions (A), (B’) and (C’) all results of the previous session remain true for this case with the only difference that R ′ k(Qk) is now given by equation (4.4). 66 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) 5 Models with Uncertain Price Function In this session we will examine again the classical Cournot model, however the methodology and the results to be discussed here can be extended to other model variants. Assume now that the firms do not know the true price function f , but they have certain estimates of it. Let fk denote the estimated price function by firm k. Then this firm believes that its profit is ϕk(x1, . . . , xN ) = xkf k(Q) − Ck(xk). (5.1) Assume that conditions (A)-(C) are satisfied with f being replaced by f k. Then with all fixed Qk, firm k has a believed profit maximizing output Rk(Qk), which is usually different than the ”true” best response of the firm. Consider first continuous time scales and assume that similarly to the full information case, each firm adjusts its production level into the direction toward its believed best reply. Then we have a modified version of system (3.1): ẋk(t) = Kk ( Rk ( Qk(t) ) − xk(t) ) (k = 1, 2, . . . , N ) (5.2) where Qk(t) is the estimate of Qk(t) by firm k. Notice that the true price function is not known by the firm, so it cannot compute the true value of Qk. Instead the following method is used. The true market price, which the firm observes is f (xk + Qk). On the other hand, firm k believes that it equals f k(xk + Qk), so in fact, firm k solves the equation f k(xk + Qk) = f (xk + Qk) to get its estimate Qk = (f −1 k ◦ f )(xk + Qk) − xk. (5.3) For the sake of simplicity introduce the function Gk = f −1 k ◦ f . Then system (5.2) can be rewritten as ẋk(t) = Kk ( Rk (Gk (xk(t) + Qk(t)) − xk(t)) − xk(t) ) (5.4) for k = 1, 2, . . . , N . Notice that the steady state of this system (if exists) is usually different than the Nash equilibrium, since Rk usually differs from the true best response function Rk. We can refer to this steady state as the ”believed” equilibrium, which will be denoted as x ∗ = (x ∗ 1, . . . , x ∗ N ). The Jacobian J C of the system has a similar form to matrix (3.3):        K1 (r1(g1 − 1) − 1) K1r1g1 · · · K1r1g1 K2r2g2 K2 (r2(g2 − 1) − 1) · · · K2r2g2 . . . . . . . . . . . . KN rN gN KN rN gN · · · KN (rN (gN − 1) − 1)        (5.5) CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 67 with gk = G ′ k(x ∗ k + Q ∗ k). In the full information case, fk ≡ f for all k, therefore Gk is the identity function with gk = 1. In this case J C reduces to matrix (3.3). Otherwise fk is only an estimate if f , and if this estimate is sufficiently good, then Gk is close to the identity function with gk ≈ 1. The location of the eigenvalues of J C can be similarly examined as it was shown in the full information case. Observe that J C = D + ab T where in this case D = diag (−K1(1 + r1), . . . , −KN (1 + rN )), a = (K1r1g1, . . . , KN rN gN ) T and b T = (1, . . . , 1). The characteristic polynomial of this matrix is also similar to (3.4): N ∏ k=1 (−Kk(1 + rk) − λ) · [ 1 + N ∑ k=1 Kkrkgk −Kk(1 + rk) − λ ] . (5.6) Since f is strictly decreasing and we may assume that all estimates fk are also strictly decreasing (otherwise the firms’ estimates are irrealistic), Gk is strictly increasing with nonnegative derivative gk. Therefore the proof of Theorem 1 can be used without any changes to show that the ”believed” equilibrium is locally asymptotically stable in this case as well. Assume next that the time scales are discrete. In this case the discrete dynamic system (3.6) is modified as xk(t + 1) = αkxk(t) + (1 − αk)Rk (Gk (xk(t) + Qk(t)) − xk(t)) (5.7) for k = 1, 2, . . . , N , where we used equation (5.3) again. The steady state of this system (if exists) is usually different than the Nash equilibrium, so we might refer to the steady state as a ”believed” equilibrium similarly to the continuous case. The Jacobian J C of system (5.7) has the special structure        α1 + (1 − α1)r1(g1 − 1) (1 − α1)r1g1 · · · (1 − α1)r1g1 (1 − α2)r2g2 α2 + (1 − α2)r2(g2 − 1) · · · (1 − α2)r2g2 . . . . . . . . . . . . (1 − αN )rN gN (1 − αN )rN gN · · · αN + (1 − αN )rN (gN − 1)        (5.8) where rk and gk are as before. In the full information case gk = 1, so J C reduces to matrix (3.7). Otherwise fk is only an estimate of f , and if this estimate is sufficiently good, then Gk is close to the identity function with gk ≈ 1. This matrix can also be rewritten as J D = D+ab T with D = diag ((α1 − 1)r1 + α1, . . . , (αN − 1)rN + αN ), a = ((1 − α1)r1g1, . . . , (1 − αN )rN gN ) T and b T = (1, . . . , 1). The characteristic polynomial of J D has a sim- ilar form to the previously discussed cases: N ∏ k=1 ((αk − 1)rk + αk − λ) · [ 1 + N ∑ k=1 (1 − αk)rkgk (αk − 1)rk + αk − λ ] . (5.9) 68 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) It is easy to see that Theorem 2 remains valid in this case with the only difference that condition (3.9) has to be modified as N ∑ k=1 (1 − αk)rkgk (αk − 1)rk + αk + 1 > −1. (5.10) 6 The Effect of Information Delay In this session we will examine how delayed information affects the asymptotic behavior of the equilibrium. For the sake of simplicity we will consider only the classical Cournot model. Assume that at time period t firm k obtains a delayed information on the production level of the rest of the industry, Qk(sk), where t− sk is the delay. If firm k uses this latest information to form its best response, then the dynamic system becomes ẋk(t) = Kk (Rk (Qk(sk)) − xk(t)) (k = 1, 2, . . . , N ). (6.1) If the delay is known and it is denoted by dk(t), then sk = t − dk(t), and so equation (6.1) becomes a difference-differential equation. However the delay is uncertain in real economies, therefore a convenient modelling way is offered by considering it as a random variable and replacing the random right hand sides of equation (6.1) by their expected values. In this way a Volterra-type integro-differential equation is obtained: ẋk(t) = Kk (∫ t 0 w(t − s, Tk, mk)Rk (Qk(s)) ds − xk(t) ) . (6.2) The weighting function w is defined as w(t − s, T, m) = { 1 T e − t−s T , if m = 0; 1 m! ( m T )m+1 (t − s)me− m(t−s) T , if m ≥ 1, (6.3) where T > 0 is a real and m ≥ 0 is an integer parameter. This weighting function has the following properties: (a) ∫ ∞ 0 w(s, T, m)ds = 1; (b) If m = 0, then weights are exponentially decreasing with the largest weight given to the most current data. If m ≥ 1, then the most current data has zero weight, the weight is increasing to a maximal value at t − s = T , and decreases thereafter. (c) With increasing value of m, the weighting function becomes more peaked around t − s = T , as m → ∞ the weighting function converges to the Dirac delta function centered at t − s = T . (d) If T → 0, then the weighting function converges to the Dirac delta function centered at zero. Theorem 3 System (6.2) is equivalent to a system of ordinary differential equations by introducing additional unknown functions. CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 69 Proof Assume first that m = 0 and let P be any function of time. Introduce the new function P0(t) = ∫ t 0 1 T e − t−s T P (s)ds. By simple differentiation, Ṗ0(t) = 1 T (P (t) − P0(t)) . Assume next that m ≥ 1, and for all l = 0, 1, . . . , m introduce the functions Pl(t) = ∫ t 0 1 l! ( m T )l+1 (t − s)le− m(t−s) T P (s)ds. Then by differentiation, Ṗl(t) = m T (Pl−1(t) − Pl(t)) and Ṗ0(t) = m T (P (t) − P0(t)) . By selecting Pk(s) = Rk (Qk(s)), the integro-differential equation system is equivalent to the following system of ordinary differential equations: ẋk(t) = Kk (Pkm(t) − xk(t)) (1 ≤ k ≤ N ) Ṗkl(t) = qk Tk (Pk,l−1(t) − Pkl(t)) (1 ≤ k ≤ N, 1 ≤ l ≤ mk) Ṗk0(t) = qk Tk (Pk(t) − Pk0(t)) (1 ≤ k ≤ N ) with qk = { 1, if mk = 0; mk, if mk ≥ 1. (6.4) 2 Linearizing equation (6.2) we have for all k, ẋkδ(t) = Kk  rk ∫ t 0 w(t − s, Tk, mk) · ∑ l 6=k xlδ(s)ds − xkδ (t)   (6.5) where rk = R ′ k(Q ∗ k), and xkδ (t) is the deviation of xk(t) from its equilibrium level. As it is usual in the theory of ordinary differential equations we look for the solution as xkδ (t) = vke λt . By substituting this form into (6.5) and letting t → ∞ we have (λ + Kk)vk − Kkrk ∫ t 0 w(s, Tk, mk)e −λs ds ∑ l 6=k vl = 0 70 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) and by using simple integration and the definition of the gamma function this equation simplifies as (λ + Kk)vk − Kkrk ( 1 + λTk qk )−(mk+1) ∑ l 6=k vl = 0. (6.6) Introduce function Ak(λ) = (λ + Kk) ( 1 + λTk qk )mk+1 to see that equations (6.6) are equivalent to a determinantal equation det        A1(λ) −K1r1 · · · −K1r1 −K2r2 A2(λ) · · · −K2r2 . . . . . . . . . . . . −KN rN −KN rN · · · AN (λ)        = 0. (6.7) Notice that by introducing D = diag (A1(λ) + K1r1, . . . , AN (λ) + KN rN ) , a = (−K1r1, . . . , −KN rN ) T and b T = (1, . . . , 1) this equation can be rewritten as det(D + ab T ) = det(D) · det(I + D−1abT ) = N ∏ k=1 (Ak(λ) + Kkrk) · [ 1 − N ∑ k=1 Kkrk Ak(λ) + Kkrk ] = 0. First we prove that all roots of equation Ak(λ) + Kkrk = 0 (6.8) have negative real parts. Clearly λ 6= 0, and assume that with some root λ, Reλ ≥ 0. Then |λ + Kk| > Kk and ∣ ∣ ∣ ∣ 1 + λTk qk ∣ ∣ ∣ ∣ > 1 implying that Kk > |Kkrk| = |Ak(λ)| > Kk which is an obvious contradiction. As the asymptotic behavior of the equilibrium is concerned we have to examine the locations of the solutions of equation CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 71 1 − N ∑ k=1 Kkrk Ak(λ) + Kkrk = 0. (6.9) Notice first that it is equivalent to a polynomial equation, so there are finitely many roots. In the general case computer methods are needed to locate the roots, however in special cases analytic results can be obtained. Assume now that the firms are identical and the equilibrium is symmetric. Then K1 = . . . = KN = K, T1 = . . . = TN = T , m1 = . . . = mN = m, q1 = . . . = qN = q, and r1 = . . . = rN = r showing that equation (6.9) reduces to the following: (λ + K) ( 1 + λT q )m+1 + (1 − N )Kr = 0. (6.10) Consider first the case of T = 0, which corresponds to models without information delay. In this case (6.10) has a unique root λ < 0, so the equilibrium is locally asymptotically stable. Note that this symmetric case is a special case of Theorem 1. Assume next that T > 0 and m = 0. Then (6.10) becomes quadratic: λ 2 T + λ(1 + KT ) + K (1 + (1 − N )r) = 0. Since all coefficients are positive, all roots have negative real parts implying the local asymptotic stability of the equilibrium (see for example, [8]). Consider next the case of T > 0 and m = 1. In this case (6.10) is a cubic equation: λ 3 T 2 + λ 2 (2T + T 2 K) + λ(1 + 2KT ) + K (1 + (1 − N )r) = 0. (6.11) All coefficients are positive and the Routh-Hurwitz criterion implies that all roots have negative real parts if and only if (2T + T 2 K)(1 + 2KT ) > T 2 K (1 + (1 − N )r) , (6.12) which can be rewritten as 2T 2 K 2 + T K (4 + r(N − 1)) + 2 > 0. (6.13) The discriminant of the left hand side is r(N − 1) [8 + r(N − 1)] , where r(N − 1) < 0. Therefore we have the following cases. Case 1. If 8 + r(N − 1) > 0, then the left hand side of (6.13) has no real root, so it always holds. Therefore the equilibrium is locally asymptotically stable. 72 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) Case 2. If 8 + r(N − 1) = 0, then there is a unique root T K = −4 − r(N − 1) 4 = − (8 + r(N − 1)) + 4 4 = 1 and if T K 6= 1, then the equilibrium is locally asymptotically stable. Case 3. If 8 + r(N − 1) < 0, then there are two real roots (T K) ∗ 1,2 = −4 − r(N − 1) ± √ (4 + r(N − 1)) 2 − 16 4T 2 . Notice that −4 − r(N − 1) = −(8 + r(N − 1)) + 4 > 0, so both roots are positive. Hence the equilibrium is locally asymptotically stable if T K < (T K) ∗ 1 or T K > (T K) ∗ 2, where (T K) ∗ 1 < (T K) ∗ 2. The equilibrium is unstable if (T K) ∗ 1 < T K < (T K) ∗ 2. Figure 2 shows the stability region of the equilibrium. 1 8 N 0-1 r KT 1 Figure 2: Stability region in the (r, KT ) space From the above analysis we can draw the following interesting conclusions. If N < 9, then −8/(N − 1) < −1, so Case 1. occurs regardless of the value of r, so the equilibrium is always locally asymptotically stable. That is, we need at least 9 firms to have instability. If N = 9, then Case 1. occurs for r > −1 and Case 2. occurs for r = −1. Assume next that N ≥ 10. If r > − 8 N−1 , then the equilibrium is always locally asymptotically stable, if r = − 8 N−1 , then it occurs if KT 6= 1 (that is, always except a particular value of KT ), and if r < − 8 N−1 , then KT has to be sufficiently small or sufficiently large to guarantee local asymptotical stability. It is also interesting to note that the stability conditions depend on only the product KT and not on the individual values of these parameters. It shows a certain compensation between the speed of adjustment and average information delay. CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 73 We also know from the above analysis that by crossing the critical values (T K) ∗ 1 and (T K) ∗ 2 with the value of T K, stability is lost or gained. It is interesting to examine what happens at these critical values. We will show that Hopf-bifurcation occurs (see for example, [4]) implying the possibility of limit cycles around the equilibrium. At the critical values inequality (6.13) as well as (6.12) become equality implying that equation (6.11) can be rewritten as 0 = λ 3 T 2 + λ 2 (2T + T 2 K) + λ T 2 K (1 + (1 − N )r) 2T + T 2K + K(1 + (1 − N )r) = ( λT 2 + (2T + T 2 K) ) ( λ 2 + K(1 + (1 − N )r) 2T + T 2K ) showing that one eigenvalue is negative, λ1 = − 2+T K T , and the other two are pure complex. Differentiating equation (6.11) implicitly with respect to T we have 3λ 2 λ̇T 2 + λ 3 2T + 2λλ̇(2T + T 2 K) + λ 2 (2 + 2T K) + λ̇(1 + 2T K) + λ2K = 0 implying that λ̇ = −2T λ3 − λ2(2 + 2T K) − 2Kλ 3λ2T 2 + 2λ(2T + T 2K) + (1 + 2T K) . (6.14) By letting α 2 = K(1 + (1 − N )r) 2T + T 2K clearly λ̇ = 2T α 3 i + α 2 (2 + 2T K) − 2Kαi −3α2T 2 + 2αi(2T + T 2K) + (1 + 2T K) with real part Reλ̇ = −2α4T 2(2 + 2T K) + 4α2T (T α2 − K)(2 + T K) (−2α2T 2)2 + 4α2(2T + T 2K)2 where we used the simple fact that 1 + 2T K = T 2 α 2 . The numerator is 4α 2 T (α 2 T − T K2 − 2K). The firs factor is positive, and the second factor can be rewritten as 74 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) K(1 + (1 − N )r) 2 + T K − T K2 − 2K = K 2 + T K · ( (1 + (1 − N )r) − (2 + T K)2 ) = K 2 + T K ( 1 + 2(T K + 1) 2 T K − (2 + T K)2 ) = 1 (2 + T K)T ( −(T K)3 − 2(T K)2 + (T K) + 2 ) = 1 − (T K)2 T 6= 0 (6.15) where we used the equality form of inequality (6.13). Since Reλ̇ 6= 0, there is the possibility of a limit cycle around the equilibrium. The discrete version of this model and its asymptotical properties can be discussed similarly. 7 Learning in Oligopoly Models For the sake of simplicity we will discuss the classical Cournot model again and assume linear cost and price functions. Therefore assume that the cost function of firm k is Ck(xk) = αkxk + βk, and the true price function is f (Q) = B − AQ, where Q = ∑N k=1 xk as before. Assume that the firms have only limited knowledge on the price function, and during the dynamic process they repeatedly update their beliefs of the price function giving rise of a learning process. In this session we will examine three cases. Case 1. Assume that the firms know the value of Q, where the price becomes zero. In this case firm k believes that the price function is fk(Q) = εk ( B A − Q ) , but does not know that εk = A is the true value. Case 2. If the firms know only the slope of the price function, then firm k believes that the price function is fk(Q) = εk − AQ, but does not know that εk = B is the true value. Case 3. If the firms know the price at Q = 0 but they are uncertain about the slope, then firm k believes that the price function is fk(Q) = B − εkQ, but does not know the true value εk = A. As we will see, the dynamic learning processes will be significantly different in the above cases. Starting with Case 1. we examine the game first from the viewpoint of firm k. It believes that the profit of each firm (including itself) is ϕ (k) l (x1, . . . , xN ) = xlεk ( B A − Q ) − (αlxl + βl) (l = 1, 2, . . . N ). (7.1) By assuming interior optima, the believed best response of firm l is xl = B A − αl εk − Q. (7.2) By adding these equations Q = N B A − 1 εk N ∑ l=1 αl − N Q CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 75 implying that firm k believes that at the equilibrium the total production of the industry is Q (k) = 1 N + 1 ( N B A − ∑N l=1 αl εk ) and the corresponding equilibrium price is fk(Q (k) ) = εk ( B A − Q(k) ) = 1 N + 1 ( Bεk A + N ∑ l=1 αl ) . (7.3) Firm k also produces the corresponding believed equilibrium level xk = B A − αk εk − Q(k) = B (N + 1)A − αk εk + ∑N l=1 αl εk(N + 1) . (7.4) Therefore in reality, the total production level of the industry becomes Q = N ∑ k=1 xk = N B (N + 1)A − N ∑ k=1 αk εk + ( N ∑ l=1 αl )( N ∑ k=1 1 εk ) 1 N + 1 (7.5) with the corresponding equilibrium price P = B − AQ = B N + 1 + A N ∑ k=1 αk εk − A N + 1 ( N ∑ l=1 αl )( N ∑ k=1 1 εk ) . (7.6) The actual price is usually different than the believed prices by the firms. For firm k, the discrepancy between the actual and believed price is D (k) = P − fk(Q (k) ) = B N + 1 ( 1 − εk A ) + A N ∑ k=1 αk εk − 1 N + 1 ( N ∑ l=1 αl )[ A N ∑ k=1 1 εk + 1 ] . (7.7) Based on this price discrepancy firm k thinks as follows. If D (k) = 0, then the believed price equals the actual price, so the believed price is considered correct. If D (k) > 0, then the believed price is too low, so firm k wants to increase its estimate on the price function by increasing εk. If D (k) < 0, then its price estimate was too high, so the firm wants to decrease it by decreasing the value of εk. By assuming continuous time scales this adjustment process can be modelled as ε̇k = KkD (k) (k = 1, 2, . . . , N ), (7.8) and in the discrete case as εk(t + 1) = εk(t) + KkD (k) (k = 1, 2, . . . , N ). (7.9) Here Kk > 0 is the speed of adjustment of firm k. 76 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) First we prove that systems (7.8) and (7.9) have only one steady state εk = A (k = 1, 2, . . . , N ) which is the full information case. Notice first that D (k) = 0 for all k may occur only if the εk values are identical. Let ε denote this common value, then 0 = B N + 1 ( 1 − ε A ) + A ε N ∑ k=1 αk − 1 N + 1 ( N ∑ k=1 αk ) [ AN ε + 1 ] = B N + 1 ( 1 − ε A ) + ( N ∑ k=1 αk ) ( A ε − 1 ) 1 N + 1 . If ε > A, then both terms are negative; if ε < A, then both are positive, and if ε = A, then they are zero. Hence the only steady state is εk = A for all k. In order to analyze the asymptotical stability of systems (7.8) and (7.9) notice first that ∂D (k) ∂εk = − B (N + 1)A + A ε 2 k ( −αk + 1 N + 1 N ∑ l=1 αl ) and for l 6= k, ∂D (k) ∂εl = A ε 2 l ( −αl + 1 N + 1 N ∑ k=1 αk ) . For the sake of simple notation introduce the variable γk = 1 N + 1 N ∑ l=1 αl − αk. (7.10) The Jacobian of the continuous system (7.8) has the form J C =         K1 ( − B (N +1)A + Aγ1 ε2 1 ) K1Aγ1 ε2 1 · · · K1Aγ1 ε2 1 K2Aγ2 ε2 2 K2 ( − B (N +1)A + Aγ2 ε2 2 ) · · · K2Aγ2 ε2 2 . . . . . . . . . . . . KN AγN ε2 N KN AγN ε2 N · · · KN ( − B (N +1)A + AγN ε2 N )         (7.11) = D + ab T with D = diag ( −K1B (N + 1)A , . . . , −KN B (N + 1)A ) , a = ( K1Aγ1 ε 2 1 , . . . , KN AγN ε 2 N )T and b T = (1, . . . , 1). Therefore the characteristic polynomial of this matrix is the following: CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 77 ϕ(λ) = N ∏ k=1 ( −KkB (N + 1)A − λ )  1 + N ∑ k=1 KkAγk ε2 k −KkB (N +1)A − λ   . (7.12) The eigenvalues are λ = −KkB (N +1)A < 0 and the solutions of equation N ∑ k=1 KkAγk ε2 k − KkB (N +1)A − λ + 1 = 0 which has the same form as equation (3.5) by assuming that γk ≤ 0 for all k. Notice that this condition holds if the marginal costs αk are close to each other. Then by repeating the proof of Theorem 1 we can show that all eigenvalues have negative real parts implying the local asymptotical stability of the equilibrium. The Jacobian of the discrete system (7.9) is similarly J D = I + J C , which has the same structure as J C , but the identity matrix has to be added to the diagonal matrix D. Therefore its characteristic polynomial has the form N ∏ k=1 ( 1 − KkB (N + 1)A − λ )  1 + N ∑ k=1 KkAγk ε2 k 1 − KkB (N +1)A − λ   . (7.13) By repeating the proof of Theorem 2 we can show that all eigenvalue of J D are inside the unit circle at the equilibrium εk = A if γk ≤ 0 and KkB (N +1)A < 2 for all k, furthermore N ∑ k=1 Kkγk(N + 1) 2(N + 1)A − KkB > −1. (7.14) In Case 2. we assume that firm k believes that the price function is fk(Q) = εk − AQ, and the firms learn about the value of εk. Then firm k believes that the profit of any firm l (including itself) is ϕ (k) l (x1, . . . xN ) = xl(εk − AQ) − (αlxl + βl) (7.15) so the believed best response of firm l is xl = εk − αl A − Q, (7.16) the total output of the industry is believed to be Q (k) = N εk − ∑N l=1 αl (N + 1)A (7.17) with price fk(Q (k) ) = εk + ∑N l=1 αl N + 1 . (7.18) 78 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) Based on this belief firm k produces the amount xk = εk − αk A − Q(k) = εk − (N + 1)αk + ∑N l=1 αl (N + 1)A . (7.19) In reality however each firm thinks in the same way but believes in its own εl value in the price function, so the actual total production level of the industry becomes Q = N ∑ k=1 xk = 1 (N + 1)A ( N ∑ k=1 εk − N ∑ l=1 αl ) (7.20) with actual equilibrium price P = B − AQ = B − 1 N + 1 ( N ∑ k=1 εk − N ∑ l=1 αl ) . (7.21) Based on the discrepancy D (k) = P − fk(Q (k) ) = 1 N + 1 ( (N + 1)B − N ∑ l=1 εl − εk ) (7.22) the dynamic process become similar to (7.8) and (7.9) with the only difference that in this case D (k) is given in equation (7.22). Similarly to the previous case we can prove that there is a unique steady state εk = B for all k, which corresponds to the full information case. Clearly D (k) = 0 for all k, if the εk values are identical. Let ε denote this common value, then (N + 1)B − N ε − ε = 0 implying that ε = B. Notice that systems (7.8) and (7.9) are both linear in this case, so local asymptotical stability implies global asymptotical stability. The coefficient matrix in the continuous case is J C = 1 N + 1        −2K1 −K1 · · · −K1 −K2 −2K2 · · · −K2 . . . . . . . . . . . . −KN −KN · · · −2KN        (7.23) and in the discrete case J D = I + J C . Similarly to Theorems 1 and 2 we can easily prove that the continuous system is always asymptotically stable and the discrete system is asymptotically stable if and only if for all k, Kk < 2(N + 1), and N ∑ k=1 Kk 2(N + 1) − Kk < 1. CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 79 We turn our attention next to Case 3, when the firms learn about the slope of the price function. In this case we assume the firm k believes that the price function is fk(Q) = B − εkQ, the profit of firm l (l = 1, 2, . . . , N ) is believed by firm k to be ϕ (k) l (x1, . . . , xN ) = xl(B − εkQ) − (αlxl + βl), (7.24) so the best believed output choice is xl = B − αl εk − Q, and the believed total production of the industry is Q (k) = N B − ∑N l=1 αl (N + 1)εk . (7.25) The believed equilibrium price, fk(Q (k) ) = B + ∑N l=1 αl N + 1 , (7.26) is the same for all firms. Firm k also believes that its equilibrium output is xk = B − (N + 1)αk + ∑N l=1 αl (N + 1)εk . (7.27) Therefore in reality the total production of the industry becomes Q = N ∑ k=1 xk = 1 N + 1 (( B + N ∑ l=1 αl ) N ∑ k=1 1 εk − (N + 1) N ∑ k=1 αk εk ) (7.28) with actual equilibrium price P = B − AQ = B − A N + 1 (( B + N ∑ l=1 αl ) N ∑ k=1 1 εk − (N + 1) N ∑ k=1 αk εk ) . (7.29) Based on the discrepancy between the actual and believed price D (k) = 1 N + 1 ( N B − A ( B + N ∑ l=1 αl ) N ∑ k=1 1 εk + A(N + 1) N ∑ k=1 αk εk − N ∑ l=1 αl ) (7.30) the dynamic process is similar to the previous cases (7.8) and (7.9). Note that D (k) is the same for all firms, and therefore a set of εk (k = 1, 2, . . . , N ) values is a steady state of the system if and only if B ( N − A N ∑ k=1 l εk ) + (N + 1)A N ∑ k=1 αk εk − ( N ∑ l=1 αl )( A N ∑ k=1 1 εk + 1 ) = 0. 80 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) Since this is a single equation for N unknowns with a feasible solution εk = A, there are infinitely many steady states. That is, there is the possibility that all firms believe in wrong price functions but none of them notices it since believed and actual prices still coincide. In this case no learning is possible in this way. In the above discussed cases we always assumed that the firms use instantaneous information about prices, however in reality price informations are always delayed. The effect of information lag in the learning process can be similarly examined to the cases being demonstrated in Session 2.6. 8 Laboratory Experiments In November 2002 the Nobel Committee awarded Vernon Smith the prize in Economics for a body of work spanning a half-century that demonstrated that controlled laboratory experiments could be used to study economic behavior, exactly as experiments in the hard sciences study physical phenomena. Smith summa- rized his methodological breakthrough in a highly referenced 1982 paper, ”Microeconomic Systems as an Experimental Science.” Today, economic experiments are widely undertaken to explore three main avenues of research: to inform economic theory, to test-bed newly designed institutions under stressful environmental conditions, and to understand how brain activity leads to economic behavior. As economic theorists we should be very concerned with rescuing our theories from the doldrums of mathematical curiosity by sub- jecting them to the rigors of laboratory scrutiny. This chapter has so far presented a review and some new developments in oligopoly theory that are enmeshed with an abundant theoretical literature in this area, but have we not discussed how to assess whether those results relate to what people really do. We can greatly increase the value of our theories to the society that invests in them by becoming concerned with how people really organize themselves to form, sustain, and adapt rules of order in order to generate beneficial outcomes for themselves. Each laboratory that conducts experiments with human subjects approaches its research with different auxiliary hypotheses but uses the common analytic framework that has three main components: the envi- ronment, the institution, and the behavior of the human subjects. The environment includes the subjects’ preferences or incentives for achieving various allocations in the exchange system, subjects’ productive ca- pabilities and system technical constraints on achieving those allocations, and knowledge about the initial conditions and allocation in the exchange system in which subjects will participate. The experimenter controls the environment using induced values, a mapping of outcomes to different monetary earnings, and carefully worded instructions. For example, in a simple oligopoly environment the experimenter may privately inform each experimental subject i through computerized instructions that he will be playing the role of a producer of a fictitious good, and that he will be able to, in any given period during the upcoming experiment, produce up to ui units of the good at a cost of $ci per unit, and that if he can sell those units he will earn a cash profit, paid to him by the experimenter, which will be the difference between the market price, p, he sells each unit for, and his cost of production. The institution consists of a set of rules that completely specify (1) what messages subjects are allowed to send and when they can send them, (2) how these messages are translated into reallocations of environ- mental conditions, and (3) what feedback the subject receives about the messages that were sent and the reallocations they produced. The experimenter typically controls for the institution with a computer network that instantiates these rules. For example, in one simple oligopoly environment a ’producer’ may only be able to make only one offer of a given total quantity that he is willing to sell at a particular per unit price in CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 81 a given period, and the institution may gather those offers, order them from lowest to highest price, and sell as many of them as possible at a uniform price to buyers whose bids to buy have been ordered from highest to lowest. The institution in this simple example may decide not to reveal publicly what offers and bids are submitted or what was the volume traded in the period. The behavior is what the experimenter can then observe in the form of messages that are actually sent by subjects as they participate in the experiment. By controlling the environment, E, and the institution, I, and creating experimental ’treatments’ which can selectively alter either of them, the experimenter can estimate the behavioral response function bi(E, I) for each subject, i. Institutions where there is repeated interaction among subjects precipitate learning and the behavioral response functions themselves become functions of time, b t i(E, I), that depend upon the sequence of information delivered to the subject by the institution and the perceived reallocations in the environment. The experimental economics laboratory typically uses economic theory to predict the form of the behav- ioral functions bi(·, ·), and thus predicts the environmental reallocations that will be produced as the subjects interact. It becomes possible to compare predicted outcomes to actual outcomes and predicted behavioral functions to estimated behavioral functions to ask how well does the theory perform in the laboratory? Con- sider the following example of a two-player game that has been extensively studied in the laboratory. In this case the messages are very simple. Subject One must move first and decide whether to stop the game immediately, in which case he receives a payment of $10 and Subject Two also receives a payment of $10, or pass the decision on to Subject Two. If Subject One chooses to pass, Subject Two now must choose between an allocation which pays Subject One $0 and himself $40, or an allocation which pays Subject One $ 15 and himself $25. Using game theory, Nash equilibrium predicts that when this game is played once by anony- mous traders who have complete information, Subject 2 would always choose the (0, 40), and that Subject 1, realizing what Subject 2 would do, will always to stop the game immediately for a ($10, $10) allocation. In fact, when this experiment is run typically only 50% of the Subject 1’s elect to stop, and, conditional on passing to Subject 2, 75% of the Subject 2’s choose the allocation ($15, $25). Thus in the typical population the expected payoff of Subject 1 if he passes is .75 × 15 = 11.25, greater than 10. This simple experiment demonstrates the failure of a theory that does not fully account for the evolved tendency of human subjects to divine opportunity through understanding the history and intentions of those with whom they interact. A greater irony reveals itself in this scenario when we relax the Nash assumptions by repeating the game and giving subjects incomplete information in the form of their only their own payoffs: then the Nash prediction is robust! Fouraker and Siegel [3] conducted the first extensive experimental study of basic oligopoly theory more than 40 years ago. They hypothesized that although the original quantity adjusting Cournot model does not directly discuss the information conditions of the agents, those conditions as well as the number of agents might have important consequences during repeated economic interactions. They were right: In Cournot quantity adjusting experiments where subjects either knew (complete info.) or didn’t know (incomplete info.) their rivals payoff function, the distribution of outcomes suffered more variability under complete information, while the Cournot prediction was more robust with less information. There are many experiments in various other environments that verify that when provided information subjects attempt to use it in not always an entirely predictable manner. Further analysis provided by the Fouraker/Siegel data showed that both rivalistic (the tendency to increase your output when you observe others are producing more than you) and cooperative (the tendency to decrease your output when you observe others are producing more than you) signaling behaviors are more prevalent when information is complete. This result was most prevalent in 82 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) duopolies and transferred strongly to a Bertrand price-setting environment. Since the original oligopoly experiments, there has been an exponential growth of published oligopoly theory that deals with a multitude of mathematical nuances and assumptions, but comparatively few experi- mental tests that reign in the relevance of all those theories to human behavior. An interesting experimental study executed by [2] points out an environmental condition, low profitability under competition, that may confound the predictions of extant theory in a price setting scenario. They create a simple single product multi-period environment where 5 independent producers have identical cost structures such that they can each produce 100 units at cost c per unit, 100 units at cost c + d per unit, and 100 additional units at cost c + d + e per unit. The demand function is linear and represented by more than a hundreds robot buyers who simply reveal their value for consuming the product during the experiment. The demand function intersects the aggregate supply function at 1200 units, where the marginal cost of production is c + d + e. At the competitive price the sellers’ share of surplus is an order of magnitude less than the buyers’. Each period during the experiment the oligopolists must post a single price at which they are willing to sell their product and the infinitesimal buyers queue up electronically and buy in order of lowest price available. Buyers are rationed uniformly amongst producers tied in price. A competitive Nash equilibrium exists that has each producer offering his 300 units at price c + d + e and earning 100d + 200e, since a higher price would exclude the producer completely and a lower price would produce a loss on his higher cost units. But this is not what the average group of oligopolists does in this environment. They engage in an offering strategy that creates price cycles that tend to rise rapidly and fall slowly. The highest price producer (perhaps 2 or 3) is always excluded in a given period, but the remaining active sellers are rewarded with supra-competitive profits. These subjects, under much more complicated environmental and institutional conditions than in the simple two person game discussed above, and without direct communication, create a continuous tension between competitive (gradual undercutting) and cooperative (raise prices) behavior that serves them well in the long run. Our experimental philosophy has been shaped largely by our interpretation of Friedrich Von Hayek’s view that economics is the study of the co-emergent order of the brain and our social institutions. This philosophical framework leads us to look for ecologically rational explanations of both individual and insti- tutional behavior. In particular we assume that individuals are adapted to specific functional needs that arose over evolutionary time, but that are now expressed by our modern brains through interactions with our current institutions and environments. However, as our brains learn to cope with their modern scenario they encounter cognitive opportunity costs that we attempt to overcome by the development of institutions that can perform the requisite computations and reallocations cheaply and efficiently. Economic experiments are crucial in testing the theory used to prescribe institutional innovation and in test-bedding the prescription before its implementation. 9 Conclusions In this paper dynamic oligopoly models were examined. After introducing the classical Cournot model and its extensions, the stability of single product oligopoly was discussed with the assumption that full and instantaneous information was available to all firms concerning the market price. The equilibrium is always locally asymptotically stable with continuous time scales, however in order to preserve stability in the discrete case we have to assume that either the derivatives of the best response functions of all firms are CUBO 11, 2 (2009) Cournot Models: Dynamics, Uncertainty and Learning 83 sufficiently small, or the firms do not change much their output levels. Then we assumed that there was partial cooperation among the firms when each firm took a certain portion of the profits of the rivals into account in its payoff function. Under realistic conditions we could obtain the same stability results as in the previous case. The firms are usually uncertain in the market as they use only an estimate of the price function in their decisions on the best choices on production levels. We could show that similar stability conditions hold for this case as well, however the production level might converge to a steady state which differs from the Nash equilibrium. We also investigated the effect of information delay in market price, and showed that stability can be lost. We have derived conditions for stability and instability of the equilibrium and showed that at the critical value of the bifurcation parameter Hopf bifurcation occurs giving the possibility of the birth of limit cycles. Three particular learning processes were then introduced and examined when the firms had only limited information on the linear price function. It was interesting to see that the number of steady states (that is, the possibility of learning) and the asymptotic properties of the learning process were different for different types of uncertainty. All theoretical results discussed in these sessions were based on certain specific assumptions on cost and market demand structure as well as on particular assumptions on the behavior of the decision makers. Such special conditions are not always satisfied in real economies, and in addition, the decision making managers do not think and decide always as we expect them to do. The most appropriate methodology in examining human decision making and economic processes without special conditions is based on laboratory experiments with realistic environment. Experimental economy is this very important procedure in which the actual decisions of the participants are repeated, observed and analyzed and hence we are able to gain the right insight into the minds of the decision making humans. Received: March 14, 2008. Revised: May 15, 2008. References [1] Cournot, A., Recherches sur les Principes Mathèmatiques de la Thèorie de Richesses, Hachette, Paris (English translation (1960): Researches into the Mathematical Principles of the Theory of Wealth. Kelley, New York.), 1838. [2] Durham, Y., McCabe, K., Olson, M., Rassenti, S., and Smith, V., Oligopoly Competition in Fixed Cost Environments, International Journal of Industrial Organization, Fall, 2003. [3] Fouraker, L.E., and Siegel, S., Bargaining Behavior, McGraw-Hill Book Company/New York, 1963. [4] Guckenheimer, J. and Holmes, P., Nonlinear Oscillations, Dynamic Systems and Bifurcations of Vector Fields, Springer-Verlag, Berlin/Heidelberg/New York, 1983. [5] Okuguchi, K., Expectations and Stability in Oligopoly Models, Springer-Verlag, Berlin/Heidelberg/New York, 1976. [6] Okuguchi, K. and Szidarovszky, F., The Theory of Oligopoly with Multi-Product Firms, Springer- Verlag, Berlin/Heidelberg/New York (2nd edition), 1999. 84 Ferenc Szidarovszky, Vernon L. Smith and Steven Rassenti CUBO 11, 2 (2009) [7] Smith, V.L., Microeconomic Systems as an Experimental Science, American Economic Review, Decem- ber, 1982. [8] Szidarovszky, F. and Bahill, T.A., Linear System Theory, CRC Press, Boca Raton/London (2nd edition), 1998. N05-Cournot