1 Introduction

Cap and trade mechanisms are one of the centrally important policy tools to achieve ambitious greenhouse gas reduction targets required in the context of recent climate agreements (e.g., the 2015 Paris agreement, UNO 2015). In the present article, we analyze the design of cap and trade mechanisms and their impact on firms’ investment decisions in different technologies and on their final production decisions. We then derive the optimal design for ideal market conditions, but also for non-ideal situations where firms either exercise market power or competition authorities’ decisions are partially constrained by requirements of political or legislative processes.

Cap and trade mechanisms designed to internalize social cost of pollution enjoy increasing importance in environmental legislation worldwide. Well-known examples are given by the European Union Emission Trading System (ETS), the California Cap-and-Trade programme or the Korea ETS. “As of April 2020, there were 23 emissions trading systems covering around 9% of global emissions” (see IEA 2020). A very prominent example of a very large planned system is the China national ETS which is likely to start in 2021.Footnote 1 An important aspect when introducing cap and trade mechanisms is the possibility of competition authorities to grant emission permits for free (see, e.g., ICAP 2020). This apparently allows to crucially facilitate the political processes leading to the introduction of cap and trade systems. As Convery (2009) in an early survey on the origins and the development of the EU ETS observes: “The key quid pro quos to secure industry support in Germany and across the EU were agreements that allocation would take place at Member State level [...], and that the allowances would be free.” Very similar observations can also be found in many other contributions to that issue.Footnote 2

A one and for all lump sum allocation of permits which is entirely independent of firms’ actions has a purely distributive impact in case firms take the market price of emission permits as given. This fundamental insight in principle dates back to Coase (1960). The design of free allocations in currently active cap and trade systems typically does not have such lump sum property, but includes explicit or implicit features of updating (see IEA 2020). Updating of free allocation schemes designed to consistently adapt to an industry’s dynamic development has an impact on firms’ behavior, however.Footnote 3 First, it leads to a distortion of the operation of existing production facilities since for existing cap and trade systems current output and emissions do have an impact on allocations granted to those facilities in the future. Second, it also has an impact on firms’ incentives to modify their production facilities through upgrading, retiring and building of new facilities since typically free allocations to some extent are granted contingent on newly installed or retired facilities.Footnote 4

The literature which analyzes the impact of updating on firms’ behavior up to now has focused on the first aspect. That is, those contributions provide very rich insight on the impact of updating based on past output or emissions on firms’ production and emission decisions, and they abstract from changed free allocations to a firm due to changed installed facilities, however. The most prominent contributions in this context include Moledina et al. (2003), Böhringer and Lange (2005), Rosendahl (2008), Mackenzie et al. (2008), Harstad and Eskeland (2010)), Böhringer et al. (2017), Qiu et al. (2017), Meunier et al. (2017) or Meunier et al. (2018). Reviewing current and planned emission trading schemes worldwide reveals, however, that legislations which provide free emission permits also include elements which update based on newly installed or retired production facilities (Narassimhan et al. 2018; IEA 2020).Footnote 5 It is the purpose of the present article to focus on the above-mentioned second aspect and its impact on the technology mix chosen by firms. To the best of our knowledge, this is the first article that formally analyzes the impact of updating on firms’ investment incentives in an analytical framework.

Usually, production facilities in most industries allow for production during longer horizons of time. Since demand typically varies over time, it is optimal for firms to invest in a portfolio of different technologies. Technologies with high investment cost which allow for production at low marginal cost are installed to run most of the time, whereas technologies with low investment cost which allow for production at higher marginal cost cover more infrequent demand periods. Prominent examples of capital-intensive and long-lasting investments in production facilities and fluctuating demand due to limited storage possibility are given by, e.g., the production of cement, steel or electricity (with electricity being the most prominent and well-studied example in this context). We thus analyze a framework with fluctuating demand where (strategic) firms first determine their technology mix by investing in different production technologies with different emission intensities. Production facilities allow firms to produce for a continuum of spot markets subject to fluctuating demand. Production at each spot market causes emissions, and total emissions at all spot markets are capped by an emission permit market. We consider the impact of a cap and trade system where different amounts of free allocations are granted for newly built facilities. After having established the market equilibrium both for firms’ investment and production decisions and for the emission permit market, we first determine the first best solution as a benchmark. Analogous to most of the previous literature, if distributional concerns do not matter, in an ideal market with perfectly competitive firms updating is not optimal (i.e., no free allocations should be granted to any investment), and the total emission cap should be set such that the permit price equals to marginal social cost of pollution.

In the main part of the paper, we then analyze the optimal design of a cap and trade system if the market is not ideal. First, we consider the case that firms behave imperfectly competitively when making their investment and their production decisions. In this case, investment and production are inefficiently low. As we show, measures undertaken to stimulate investment incentives (such as capacity paymentsFootnote 6) have a very close relationship to free allocations made for newly installed production facilities. In this case, it is optimal to either grant capacity payments or free allocations in order to stimulate inefficiently low investment incentives. As we show, however, in a closed system with endogenous permit market, it is not optimal to implement total investment at first best levels since this would imply an inefficiently high permit price which excessively depresses spot market output. Our results do thus have direct implications for the design of capacity markets in the presence of a cap and trade system.

Second, we analyze the case where the design of the cap and trade mechanism is subject to political constraints (compare the second paragraph of this section) and the competition authority has to determine the optimal market design given those constraints.Footnote 7 We first analyze how the optimal target on total emissions should be set if free allocations in all technologies are exogenously fixed. As we find, for moderate levels of free allocations the target on total emissions should be set such that the equilibrium permit price is above marginal social cost of pollution. For high levels of free allocation, for example, in case of full allocation where all permits used by a certain technology during a compliance period are freely allocated, the total cap on emissions should be set such that the equilibrium permit price is below marginal social cost of pollution.

We then analyze the case that free allocation only for a specific technology is exogenously fixed and determine the optimal level of free allocation for the remaining technology. In order to avoid excessive distortions of the resulting technology mix, it is typically optimal to grant free allocation for the remaining technology. That is, the insights obtained from the first best benchmark that free allocations are never optimal, are no longer true if allocation to one of the technologies is exogenously fixed. Moreover, if this technology is relatively dirty (as compared to the technology with exogenously fixed allocation), the level of free allocation should remain below the exogenously fixed allocation. If on the contrary the remaining technology is relatively clean, the level of free allocation should even be above the exogenously fixed allocation. Observe that often observed practices of full allocation (see IEA 2020; ICAP 2020) induce a pattern of free allocation which is completely opposed to those findings.

Let us finally mention further related strands of the literature. First, several articles analyze firms’ incentives to adopt cleaner technologies. Those include, for example, Requate and Unold (2001), Montero (2002) or Requate and Unold (2003), for a survey of this literature, compare Requate (2005). Whereas all those contribution focus on the comparison of emission taxes and cap and trade systems, our contribution is the first one to consider the phenomenon of updating in the context of technology choice.

Second, from a modeling perspective the present paper also contributes to the literature which analyzes investment decisions in several technologies. For a survey on this literature, see Crew et al. (1995). Further contributions include Zöttl (2010), or more recently Grimm et al. (2017). Our article is (to the best of our knowledge) the first one to introduce an endogenous emission permit market in such a framework, and to derive the optimal design of a cap and trade mechanism with technology-specific free allocations both for ideal and imperfect market conditions.

The remainder of the article is structured as follows: Sect. 2 states the model analyzed throughout this article, and Sect. 3 derives the market equilibrium for a given cap and trade mechanism. We determine the optimal market design for the benchmark case of perfect competition in Sect. 4, for the case of imperfect competition in Sect. 5 and for partially constrained cap and trade mechanisms in Sect. 6. Section 7 concludes.

2 The model

We consider n symmetric firms. Each firm i first has to choose investment in two different types of production facilities prior to producing output \(q_i(\theta )\) at many consecutive spot markets with fluctuating demand. We denote total investment of firm i in both technologies by \(x_{1i}\) and investment in technology 2 by \(x_{2i}\). Observe that in the framework analyzed, where demand fluctuates over time, it is optimal for firms to invest into a mix of both technologies. We will consider the case that technology 2 allows cheaper production, but exhibits higher investment cost. Those units have to run most of the time in order to recover their high investment cost. (This is typically denoted “baseload–technology.”) Technology 1 has relatively low investment cost, but produces at high marginal cost. Those units are built in order to serve during periods of high demand (this is typically denoted “peakload–technology”), but run idle if demand is low. Inverse demand for the quantity \(Q(\theta )\) at spot market \(\theta \in [{\underline{\theta }},{\overline{\theta }}]\) is given by \(P(Q(\theta ),\theta )\). Output produced by the different facilities causes emissions, and all emissions have to be covered by emission permits. Permits are issued in the context of a cap and trade mechanism and can be traded at a permit market. The total amount of permits available is limited, and this induces a market clearing price e for permits. Firms receive free allocation of permits based on installed capacities, i.e., each firm i receives the technology-specific amount \(\left( A_2 x_{2i}+A_1 \left( x_{1i}-x_{2i}\right) \right) \) of permits for free, all remaining permits required for production have to be purchased (can be sold) at the permit market.Footnote 8 For the overall timing of our setup, see Fig. 1. Subsequently, we provide all definitions and assumptions for our setup in detail.

Fig. 1
figure 1

Illustration of timing

Investment and production decisions Firms have to choose investment in capacities for two different technologies, \(t=1,2\). Each technology t has constant marginal cost of investment \(k_t\), constant marginal cost of production \(c_t\) and an emission factor \(w_t\) which measures the amount of the pollutant emitted per unit of output. We denote total investment of firm i in both technologies by \(x_{1i}\) and investment in technology 2 by \(x_{2i}\). For aggregate values, we define \(X_t=\sum _{i=1}^{n} x_{ti}\).Footnote 9 For an illustration of the resulting marginal cost function, see Fig. 2. We denote by \({q(\theta )}=({q}_1(\theta ), \ldots , {q}_{n}(\theta ))\) the vector of spot market outputs of the n firms in demand scenario \(\theta \), and by \({Q(\theta )}=\sum _{i=1}^n {q}_i\) total spot market output in scenario \(\theta \).

Demand Inverse demand for some quantity Q at spot market \(\theta \in [{\underline{\theta }},{\overline{\theta }}]\) is given by \(P(Q,\theta )\). Each spot market has weight \(f(\theta )\) with corresponding distribution \(F(\theta )=\int _{{\underline{\theta }}}^{\theta }f(\theta )\mathrm{d} \theta \).Footnote 10 Inverse demand satisfies \(P_{q}({Q}, \theta ) < 0\), \(P_{\theta }(Q,\theta ) \ge 0\), \(P_{q\theta }(Q,\theta ) \ge 0\) and \(P_{q}({Q},\theta ) + P_{{q}{q}}({Q},\theta )\frac{Q}{n}<0\) for all \(Q,\theta \in {\mathbb {R}}\).Footnote 11

Cap and trade mechanism and social cost of pollution Total emissions produced at all spot markets \(\theta \in [{\underline{\theta }},{\bar{\theta }}]\) are given by \({\mathcal {T}}\). The social cost associated with emissions is denoted by \(D({\mathcal {T}})\). We assume \(D_T({\mathcal {T}})\ge 0\) and \(D_{TT}({\mathcal {T}})\ge 0\). A cap and trade mechanism limits total emissions at some level T such that \({\mathcal {T}}\le T\). We denote the resulting market price for emission permits by e. We make the following assumptions regarding trade at the permit market, i.e., (i) Emission permit trading is arbitrage-free, and storage of permits is costless, and (ii) firms are price takers at the permit market.Footnote 12 Notice, finally, that our formulation allows for technology-specific free allocations (\(A_1\), \(A_2\)). Free allocations might, for example, depend on the emission intensities \(w_1,w_2\) and (or) the usage rates of each technology.

3 The market equilibrium

In this section, we derive the market equilibrium with a cap and trade mechanism, for the case of perfect and imperfect competition on the product market. To characterize the market equilibrium for a given cap and trade mechanism \((T,A_1,A_2)\), we first determine firms’ profits \(\pi _i\), given investments \(x_{1},x_{2}\) and given spot market output \(q(\theta )\).

$$\begin{aligned} \pi _i (x_{1i}, x_{2i})= & {} \int \limits _{{\underline{\theta }}}^{\theta _{{\overline{B}}}} \left( P(Q,\theta )-c_2-w_2 e \right) q_i(\theta , x) {d} F(\theta )+\int \limits _{\theta _{{\overline{B}}}}^{\theta _{{\underline{P}}}} \left( P(X_{2},\theta )-c_2-w_2 e \right) x_{2i} {d} F(\theta )\nonumber \\&+ \int \limits _{\theta _{{\underline{P}}}}^{\theta _{{\overline{P}}}} \left( P(Q,\theta )-c_1-w_1 e \right) q_{i}(\theta , x) {d} F(\theta )+ \int \limits _{\theta _{{\overline{P}}}}^{{\overline{\theta }}} \left( P(X_{1},\theta )-c_1-w_1 e \right) x_{1i} {d} F(\theta ) \nonumber \\&- \int \limits _{\theta _{{\underline{P}}}}^{{\overline{\theta }}} \left( (c_1+w_1 e)-(c_2+w_2 e) \right) x_{2i} { d} F(\theta )- (k_2-A_2 e)x_{2i} - (k_1-A_1 e)(x_{1i}-x_{2i}).\qquad \end{aligned}$$
(1)

Note that the permit market affects both the firms’ marginal production cost and their investment cost. No matter whether permits have been allocated for free or have to be bought at the permit market, firms face opportunity cost of \(w_t e\) when deciding to produce one unit of output with technology \(t=1,2\). This opportunity cost increases their marginal production cost to \(c_t + w_te\), \(t=1,2\). Investment cost is affected by the firms’ anticipation of a free allocation of permits. A free allocation is equivalent to a subsidy paid upon investment: If each unit of capacity invested is assigned \(A_t\) permits, investment cost \(k_t\) is reduced by their value, that is by \(A_t e\) for \(t=1,2\).

The critical spot market scenariosFootnote 13\(\theta _{{\overline{B}}},\theta _{{\underline{P}}},\theta _{{\overline{P}}}\) indicate whether firms produce either at the capacity bounds \(x_2,x_1\) (that is, at the vertical pieces of their marginal cost curves), or on the flat (i.e., unconstrained) parts of their marginal cost curves. They depend on the intensity of competition at the spot market and are illustrated in Fig. 2, both for the case of perfect and imperfect competition. For \(\theta \in [{\underline{\theta }},\theta _{{\overline{B}}}]\), firms produce the output at marginal cost \(c_2\). For \(\theta \in [\theta _{{\overline{B}}},\theta _{{\underline{P}}}]\), firms are constrained by their investment in the base load technology and produce \(X_{2}\), still at marginal cost \(c_2\), and prices are driven by the demand function. At those demand levels, using the peak load technology 1 is not yet profitable. Observe that \(F(\theta _{{\underline{P}}})-F(\theta _{{\overline{B}}})\) measures the fraction of time where investment in the base load technology is binding which we will refer to as constrained base duration. For \(\theta \in [\theta _{{\underline{P}}},\theta _{{\overline{P}}}]\), firms produce output at marginal cost \(c_1\), and we denote \(1-F(\theta _{{\underline{P}}})\) as peak duration.Footnote 14 Finally, for all realizations above \(\theta _{{\overline{P}}}\), firms are constrained by their total capacity choice \(X_1\), and prices are driven exclusively by the demand function; we denote \(1-F(\theta _{{\bar{P}}})\) as constrained peak duration. In the subsequent lemma, we characterize the market equilibrium when firms invest in the base load and in the peak load technology.

Fig. 2
figure 2

Illustration of the critical spot market scenarios. Left: The case of a perfectly competitive market, right: the general case with imperfect competition. In the figure, we denote marginal revenue by \(MR(Q,\theta ):=P(Q,\theta )+ P_q(Q,\theta )\frac{Q}{n}\)

Lemma 1

For a given cap and trade mechanism \((T,A_1,A_2)\), define the total investment condition \(\Psi _{I}\), the base investment condition \(\Psi _{II}\) and the permit pricing conditionFootnote 15\(\Psi _E\) as follows:

$$\begin{aligned} \Psi _{I}:= & {} \int _{\theta _{{\overline{P}}}}^{{\overline{\theta }}}\left[ P(X_{1}^{*},\theta )+P_q(X_{1}^{*},\theta )\frac{X_{1}^{*}}{n} -(c_1+w_1 e^*)\right] \mathrm{d} F(\theta ) - \left( k_1- A_1e^*\right) \end{aligned}$$
(2)
$$\begin{aligned} \Psi _{II}:= & {} \int _{\theta _{{\overline{B}}}}^{\theta _{{\underline{P}}}}\left[ P(X_{2}^{*},\theta )+P_q(X_{2}^{*},\theta )\frac{X_{2}^{*}}{n} -(c_2+w_2 e^*)\right] \mathrm{d} F(\theta ) \nonumber \\&+\int _{\theta _{{\underline{P}}}}^{{\overline{\theta }}} (c_1 -c_2) + (w_1-w_2) e^* \mathrm{d} F(\theta ) - \left( k_2-k_1\right) + (A_2 - A_1)e^* \end{aligned}$$
(3)
$$\begin{aligned} \Psi _{E}:= & {} \int _{{\underline{\theta }}}^{\theta _{{\overline{B}}}} w_2 Q(e^*,\theta )\mathrm{d} F(\theta ) + \int _{\theta _{{\overline{B}}}}^{\theta _{{\underline{P}}}} w_2 X_{2}^{*} \mathrm{d} F(\theta ) + \int _{\theta _{{\underline{P}}}}^{\theta _{{\overline{P}}}}w_1 Q(e^*,\theta ) \mathrm{d} F(\theta ) \nonumber \\&+\int _{\theta _{{\overline{P}}}}^{{\overline{\theta }}} w_1 X_{1}^{*} \mathrm{d} F(\theta )-\int _{\theta _{{\underline{P}}}}^{{\overline{\theta }}} (w_1-w_2) X_{2}^{*} \mathrm{d} F(\theta ) - T \end{aligned}$$
(4)

Equilibrium investment \(X_1^{*},X_2^{*}\) and the equilibrium permit price \(e^*\) simultaneously solve \(\Psi _{I}=\Psi _{II}=\Psi _E=0\).

Proof

See Appendix A. \(\square \)

In the lemma, (2) is the first-order condition that determines total investment. Firms choose their total investment \(\frac{X_{1}^{*}}{n}\) as to equal marginal profits generated by their last running unit (running at total marginal cost \(c_1+ w_1e^*\)) to the investment cost of that unit (given by \(k_1 - A_1e^*\)). As already mentioned above, under a cap and trade mechanism, the value of the permits required for production at the spot market is part of the firms’ marginal production cost, and the value of free allocations is of firms’ marginal cost of investment.

Now, let us provide some intuition on the determinants of the optimal base load investment. Since total investment \(X_1\) has already been fixed [it is determined by (2)], the firms’ decision when choosing \(X_2\) has to be interpreted as a decision of virtually replacing units of technology 1 by technology 2. The cost of such virtual replacement of the marginal unit (given by \(k_2-k_1 - (A_2 - A_1)e^*\)) has to equal the extra profits generated by that unit due to lower marginal production cost. Lower production cost of one additional unit has two effects: First, for all demand realizations \(\theta \in [\theta _{{\overline{B}}},\theta _{{\underline{P}}}]\) one more unit is produced (that would not have been produced without the replacement); for \(\theta \in [\theta _{{\underline{P}}},{\overline{\theta }}]\), one more unit can be produced at lower marginal cost \(c_2+w_2e^*\) (instead of \(c_1+w_1e^*\)) due to the replacement. (Compare also Fig. 2.)

The market price for permits, \(e^*\), depends on the emission target T set by the market designer as well as the technology mix installed by the firms. At the equilibrium permit price, the market exactly clears, allowing for total emission of T units of the pollutant. Notice that in expression (4), total emissions are determined by multiplying total production at all spot markets with the relevant emission factors of the respective technologies (\(w_1 , w_2\)).

Finally, observe that Lemma 1 characterizes the market solution when firms decide to invest in both technologies, that is, when indeed \(0<X_2^*<X_1^*\) is obtained. First, whenever the base load technology \((k_2,c_2)\) is very unattractive,Footnote 16 then only the peak load technology \((k_1,c_1)\) is active. Second, if the base load technology \((k_2,c_2)\) is always more attractiveFootnote 17 than the peak load technology \((k_1,c_1)\), then only technology \((k_2,c_2)\) is active in the market equilibrium. Notice that in principle, the case of investment in a single technology is covered by our framework, it is obtained by eliminating the possibility to invest in technology 2, and expression (2) then determines investment in the single technology. To keep the notational burden limited, however, we do not explicitly include those corner solution in the exposition of the paper, but opted to focus on all those cases when firms indeed choose to investment in both technologies.

To conclude the discussion of Lemma 1, let us already at this point mention the relevance of endogenously modeling the emission permit market as compared to the case which assumes an exogenously fixed price for pollution. Observe that for a constant emission price, equilibrium investment under imperfect competition differs from that obtained under perfect competition by the terms \(\int _{\theta _{{\overline{P}}}}^{{\overline{\theta }}}P_q(X_{1},\theta )\frac{X_{1}}{n}\) and \(\int _{\theta _{{\overline{B}}}}^{\theta _{{\underline{P}}}}P_q(X_{2},\theta )\frac{X_{2}}{n} \mathrm{d} F(\theta )\), respectively. This corresponds to the difference between scarcity prices and marginal scarcity profits. Since those terms are negative (and profits concave given our assumptions), investment incentives under imperfect competition are lower than under perfect competition. That is, in the absence of an explicit market for emission permits (when pollution is, for example, taxed at some fixed level \(e^0\)) subsidies for investment (for example, by granting free tax vouchers \(A_1>0\) and \(A_2>A_1\), respectively) which exactly compensate for those differences would induce optimal investment incentives. Since the emission price is endogenous in our framework, however, we will obtain a different result (compare Theorem 2).

Before we now discuss existence of the market equilibrium, we introduce the following definitions which will simplify the subsequent analysis and allow for a more intuitive discussion of our results:

Definition 1

 

  1. (i)

    We denote the impact of increased total investment on total emissions (for fixed e) by \(A_1^E:=\frac{\partial \Psi _{E}}{\partial X_1^*}=\left( 1-F(\theta _{{\bar{P}}})\right) w_1\), observe \(A_1^E>0\). This allows to state the impact of changed emission price \(e^*\) on the equilibrium condition \(\Psi _{I}\) as follows \(\frac{\partial \Psi _{I}}{\partial e^*}=A_1-A_1^E\).

  2. (ii)

    We denote the impact of increased base load investment on total emissions (for fixed e) by \(A_2^E:=\frac{\partial \Psi _{E}}{\partial X_2^*}=\left( 1-F(\theta _{{\bar{B}}})\right) w_2 - \left( 1-F(\theta _{{\underline{P}}})\right) w_1 \). This allows to state the impact of changed emission price \(e^*\) on the equilibrium condition \(\Psi _{II}\) as follows \(\frac{\partial \Psi _{II}}{\partial e^*}=A_2-A_1-A_2^E\). We furthermore denote \(w_2^E:=\frac{1-F(\theta _{{\underline{P}}})}{1-F(\theta _{\bar{B}})}w_1\) (which implies \(A_2^E>0\) if and only if \(w_2 > w_2^E\)) and \(w_2^L:=\frac{F(\theta _{{\overline{P}}})-F(\theta _{{\underline{P}}})}{1-F(\theta _{\bar{B}})}w_1\) (which implies \(A_1^E+A_2^E>0\) if and only if \(w_2 > w_2^L\)).

  3. (iii)

    We denote the impact of changed \(X_1\) on the equilibrium condition \(\Psi _{I}\) by \(\Psi _{I1}:=\frac{\partial \Psi _{I}}{\partial X_1^*}\), the impact of changed \(X_2\) on the equilibrium condition \(\Psi _{II}\) by \(\Psi _{II2}:=\frac{\partial \Psi _{II}}{\partial X_2^*}\), and the impact of changed e on the equilibrium condition \(\Psi _{E}\) by \(\Psi _{\mathrm{Ee}}:=\frac{\partial \Psi _{E}}{\partial e^*}\). Observe that those three expressions are negative.

ObserveFootnote 18 that \(A_1^E=\left( 1-F(\theta _{{\bar{P}}})\right) w_1\) determines the total amount of additionally necessary permits resulting from an additionally invested unit of total capacity (formally given by the partial derivative of total emission with respect to \(X_1\), i.e., \(\frac{\partial \Psi _E}{\partial X_1}\)). An increase of the permit price e now has two opposing effects on total investment incentives: On the one hand, investment incentives are reduced by the amount \(A_1^E\); on the other hand, they increase by \(A_1\) due to the increased value of free allocations.

A similar reasoning is obtained for investment incentives in the base load technology. \(A_2^E\) determines the total amount of additionally necessary permits resulting from the replacement of one unit of the peak technology with one unit of the base technology (formally given by the partial derivative of total emission with respect to \(X_2\), i.e., \(\frac{\partial \Psi _E}{\partial X_2}\)). An increase of the permit price e has two opposing effects on total investment incentives: On the one hand, they are reduced by the amount \(A_2^E\); on the other hand, they increase by \((A_2-A_1)\) due to the increased value of free allocations.

Notice that \(A_1^E\ge 0\), whereas \(A_2^E\) can also become negative. That is, an increased level of total investment \(X_1^*\) always implies additionally necessary emission permits. An increased level of base investment \(X_2^*\) does only imply additionally necessary emission permits if the base technology is “dirtier” than the peak technology. Interestingly, the cutoff point is obtained for \(w_2=w_2^E<w_1\) since an increased level of \(X_2^*\) leads to increased emissions for \(\theta \in [\theta _{{\underline{P}}},{\overline{\theta }}]\) if \(w_2>w_2\), but also leads to one unit of additional output for the demand levels \(\theta \in [\theta _{{\overline{B}} },\theta _{{\underline{P}}}]\).

As already argued, Lemma 1 only characterizes the market equilibrium by establishing necessary conditions. In the subsequent lemma, we now want to establish second-order conditions for the existence of the market equilibrium.

Lemma 2

(Second-order conditions)

  • Lemma 1 characterizes the market equilibrium if

    $$\begin{aligned}&(a) \,\, \left( A_1-A_1^E \right) A_1^E -\Psi _{I1} \Psi _{\mathrm{Ee}}<0, \quad (b) \,\, \left( A_2-A_1-A_2^E \right) A_2^E -\Psi _{II2} \Psi _{\mathrm{Ee}}<0\\&(c)\,\, \left( \left( A_1-A_1^E \right) A_1^E -\Psi _{I1} \Psi _{\mathrm{Ee}}\right) \left( \left( A_2-A_1-A_2^E \right) A_2^E -\Psi _{II2} \Psi _{\mathrm{Ee}} \right) \\&\quad > \left( A_1-A_1^E\right) A_1^E \left( A_2-A_1-A_2^E \right) A_2^E \end{aligned}$$
  • If the levels of free allocation satisfy \(\left( A_1 - A_1^E\right) A_1^E\le 0\) and \(\left( A_2 - A_1 - A_2^E\right) A_2^E\le 0\), then condition (i) is satisfied.

  • Define by \(A_1^{\mathrm{lim}}\) the highest \(A_1\) yielding \( \left( A_1-A_1^E\right) A_1^E - \Psi _{I1}\Psi _{\mathrm{Ee}} \le 0\); define by \(A_2^{\mathrm{lim}}\) the highest \(A_2\) yielding \( \left( A_2-A_1^E-A_2^E\right) \left( A_1^E+A_2^E\right) - \left( \Psi _{I1}+ \Psi _{II2}\right) \Psi _{\mathrm{Ee}}\le 0\). The second-order conditions (i) cannot be satisfied if either \(A_1 \ge A_1^{\mathrm{lim}}\) or \(A_2 \ge A_1^{\mathrm{lim}}\).

Proof

See Appendix B. \(\square \)

Part (i) of the lemma establishes the standard second-order conditions which guarantee negative semi-definiteness of the Hessian matrix of firms’ optimization problems. It allows the usual application of the implicit function theorem in order to conduct an analysis of comparative statics for the equilibrium characterized in Lemma 1. In part (ii), we establish conditions when those second-order conditions are satisfied and part (iii) provides an upper bound on the levels of free allocation such that higher allocations always violate those second-order conditions.

Notice that for the case of a monopolistic or a perfectly competitive market, the second-order conditions established in Lemma 2(i) guarantee that there exists a unique market equilibrium which is characterized by Lemma 1. For the case of oligopoly, when firms behave strategically, also asymmetric equilibria might arise, however. Let us explicitly mention at this point that in this case, our analysis focuses on symmetric investment decisions only.

After having established the market equilibrium, we now determine the impact of changing the parameters of the cap and trade mechanism \((A_1,A_2,T)\) in an analysis of comparative statics. If the second-order conditions specified in Lemma 2 (i) are satisfied, we obtain the following results:

Lemma 3

(Comparative statics of the market equilibrium) 

  1. (i)

    Higher free allocation for the base load technology \(A_2\) always yields higher investment in the base load technology (i.e., \(\frac{\mathrm{d} X_2}{\mathrm{d} A_2}>0\)). We furthermore obtain \(\frac{\mathrm{d} X_1}{\mathrm{d} A_2}<0\) if and only if \(\left( A_1-A_1^E\right) A_2^E< 0\). Define \(A_1^{\mathrm{cross}}\) as the highest \(A_1\) yielding \( \left( A_1-A_1^E\right) A_2^E \le \Psi _{\mathrm{Ee}} \Psi _{I1}- \left( A_1- A_1^E\right) A_1^E \), we obtain \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_2}<\frac{\mathrm{d} X_1^*}{\mathrm{d} A_2}\) if and only if \(\left( w_2>w_2^E\right) \) and \(A_1\in (A_1^{\mathrm{cross}},A_1^{\mathrm{lim}})\).

  2. (ii)

    Higher free allocation for the peak load technology \(A_1\) always yields higher investment in the peak load technology (i.e., \(\frac{\mathrm{d} X_1}{\mathrm{d} A_1}>\frac{\mathrm{d} X_2}{\mathrm{d} A_1}\)). Define by \(A_2^{\mathrm{total}}\) the highest \(A_2\) which yields \(\left( A_2-A_1^E-A_2^E\right) A_2^E- \Psi _{II2} \Psi _{\mathrm{Ee}} \le 0\) and by \(A_2^{\mathrm{cross}}\) the highest \(A_2\) which yields \( \left( A_2-A_1^E-A_2^E\right) A_1^E - \Psi _{I1}\Psi _{\mathrm{Ee}} \le 0\). There exists a unique \(w_2^S\) with \(w_2^E < w_2^S \le w_1\) such that \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}<0\) if and only if \(w_2 > w_2^S\) and \(A_2\in (A_2^{\mathrm{total}},A_2^{\mathrm{lim}})\). Furthermore, we obtain \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}>0\) if and only if \(w_2 < w_2^S\) and \(A_2\in (A_2^{\mathrm{cross}},A_2^{\mathrm{lim}})\).

  3. (iii)

    For a change of the total emission cap T, we obtain \(\frac{\mathrm{d} X_1^*}{\mathrm{d} T}>0\) if and only if \(\left( A_1<A_1^E\right) \), and we furthermore obtain \(\frac{\mathrm{d} X_2^*}{\mathrm{d} T}>0\) if and only if \((A_2-A_1<A_2^E)\).

Proof

See Appendix C. \(\square \)

Fig. 3
figure 3

Results of comparative statics in the degree of free allocation. Left: For the degree of free allocation to the base load technology \(A_2\), right: for the degree of free allocation to the peak load technology \(A_1\). For the case of linear demand, we obtain, left: \(A_1^{\mathrm{cross}}(w_1=0,w_2=w)=\frac{1-F(\theta _{\overline{P}})}{1-F(\theta _{{{\overline{B}}}})}F(\theta _{{{\overline{B}}}}) w\), \(A_1^{\mathrm{cross}}(w_1=w-w_2^E, w_2=w_2^E)=w_2^E\), \(A_1^{\mathrm{lim}}(w_1=w,w_2=0)=\left( 1-F(\theta _{{\underline{P}}})\right) w\) and right: \(A_2^{\mathrm{total}}(w_1=0,w_2=w)=\left( 1-\frac{1-F(\theta _{\overline{P}})}{1-F(\theta _{{{\overline{B}}}})}F(\theta _{{{\overline{B}}}})\right) w\), \(A_2^{\mathrm{cross}}(w_1=w-w_2^S,w_2=w_2^S)=w_2^E\)

As we establish in the lemma, an increase of the free allocation \(A_2\) in the base load technology always leads to increased base load investment [i.e., \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_2}>0\), see point (i)], and an increase of the free allocation \(A_1\) in the peak load technology always leads to increased investment in the peak load technology [i.e., \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}>\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}\), see point (ii)]. The impact of such changes on the remaining investment decisions is more ambiguous. In the subsequent paragraphs, we briefly sketch the central trade-offs, and a complete proof is only provided in the appendix, however. First, consider a variation of the free allocation \(A_2\) and determine its impact on the system of equilibrium conditions established in Lemma 1. The total differential yields:Footnote 19

$$\begin{aligned} \frac{\mathrm{d} \Psi _{I}}{\mathrm{d} A_2}= & {} \Psi _{I1} \frac{\mathrm{d} X_1^*}{\mathrm{d} A_2} +\Psi _{\mathrm{Ie}} \frac{\mathrm{d} e^*}{\mathrm{d} A_2} =0 \end{aligned}$$
(5)
$$\begin{aligned} \frac{\mathrm{d} \Psi _{II}}{\mathrm{d} A_2}= & {} \Psi _{II2} \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2} +\Psi _{\mathrm{IIe}} \frac{\mathrm{d} e^*}{\mathrm{d} A_2} + \frac{\partial \Psi _{II}}{\partial A_2} = 0 \end{aligned}$$
(6)
$$\begin{aligned} \frac{\mathrm{d} \Psi _{E}}{\mathrm{d} A_2}= & {} \Psi _{E1} \frac{\mathrm{d} X_1^*}{\mathrm{d} A_2} +\Psi _{E2} \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2} +\Psi _{\mathrm{Ee}} \frac{\mathrm{d} e^*}{\mathrm{d} A_2} = 0 \end{aligned}$$
(7)

In order to directly evaluate the impact of the changed emission price \(\frac{\mathrm{d} e^*}{\mathrm{d} A_2}\) on the equilibrium conditions for total investment and investment in the base load technology, we solve expression (7) for \(\frac{\mathrm{d} e^*}{\mathrm{d} A_2}= \frac{ \Psi _{E1}}{-\Psi _{\mathrm{Ee}}} \frac{\mathrm{d} X_1^*}{\mathrm{d} A_2} + \frac{ \Psi _{E2}}{- \Psi _{\mathrm{Ee}}} \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2}\) and plug into expression (5) which yields:

$$\begin{aligned} \frac{\mathrm{d} \Psi _{I}}{\mathrm{d} A_2}= & {} \left( \Psi _{I1}+\Psi _{\mathrm{Ie}} \frac{ \Psi _{E1}}{-\Psi _{\mathrm{Ee}}} \right) \frac{\mathrm{d} X_1^*}{\mathrm{d} A_2} +\left( \Psi _{\mathrm{Ie}} \frac{ \Psi _{E2}}{-\Psi _{\mathrm{Ee}}} \right) \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2} = 0\nonumber \\\Leftrightarrow & {} \left( -\Psi _{\mathrm{Ee}} \Psi _{I1} + \left( A_1 - A_1^E \right) A_1^E\right) \frac{\mathrm{d} X_1^*}{\mathrm{d} A_2} +\left( \left( A_1 - A_1^E \right) A_2^E\right) \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2} = 0 \end{aligned}$$
(8)

Observe that the coefficient on the expression \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_2}\) determines the total impact of changed \(X_1^*\) on the equilibrium condition \(\Psi _{I}\). This is given by the direct impact (i.e., \(\Psi _{I1}\)) and the indirect impact which takes into account the impact of changed \(X_1^*\) on the emission price and its feedback on the equilibrium condition \(\Psi _{I}\) (i.e., \(\Psi _{\mathrm{Ie}} \frac{ \Psi _{E1}}{-\Psi _{\mathrm{Ee}}}=\frac{1}{-\Psi _{\mathrm{Ee}}}\left( A_1-A_1^E \right) A_1^E\)). Observe that the total impact of changed \(X_1^*\) on the equilibrium condition \(\Psi _{I}\) is negative if the second-order conditions established in Lemma 2(i) are to be satisfied. This directly illustrates why \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_2}\) cannot drop to zero.

Furthermore, observe that the total impact of changed \(X_2^*\) on the equilibrium condition \(\Psi _{I}\) is only indirect, since \(\Psi _{I}\) does not directly depend on \(X_2\). That is, we only have to take into account the impact of increased \(X_2^*\) on the emission price \(e^*\) and its feedback on the equilibrium condition \(\Psi _{I}\). According to Definition 1, an increase of \(X_2\) leads to an increased equilibrium emission price if \(A_2^E>0\) (i.e., for \(w_2>w_2^E\), we obtain a decreased equilibrium emission price if \(A_2^E<0\), i.e., for \(w_2<w_2^E\)). The impact of an increased emission price on the equilibrium condition \(\Psi _{I}\) depends on the degree of free allocation \(A_1\). Whenever \(A_1<A_1^E\) (i.e., \(\frac{\partial \Psi _{I}}{\partial e^*}<0\), compare Definition 1), an increased emission price leads to a decrease of firms’ total investment activity \(X_1^*\). In this case, the reduction of scarcity rents (obtained when total capacity is binding) caused by the increased emission price dominates the increased value associated with the permits granted for free. The reverse holds true for a high level of free allocation, i.e., \(A_1>A_1^E\) where an increased emission price leads to increased total investment \(X_1^*\). Whenever the impact of increased investment \(X_2^*\) yields a decreased emission price which is obtained for cleaner base load technologies (for \(A_2^E<0\), i.e., \(w_2<w_2^E\)), we obtain the opposite results. In sum, \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_2}>0\) if and only if \(\left( A_1-A_1^E\right) A_2^E>0\), as stated in the theorem.

Finally, expression (8) also provides the intuition when \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_2}\ge \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2}\) is obtained (i.e., also investment in the peak load technology increases). To this end, observe that \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_2}= \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2}\) if and only if in expression (8), the total impact of changed \(X_1^*\) is precisely of the same size as the total impact of changed \(X_2^*\), but of opposite sign. As shown in the theorem, this is only obtained if the increase of investment in the base technology leads to an increase of the emission price (for \(A_2^E>0\), i.e., \(w_2>w_2^E\)) and if this increase has a sufficiently positive impact on the equilibrium condition \(\Psi _{I}\), i.e., for allocation \(A_1\) sufficiently big (for \(A_1>A_1^{\mathrm{cross}}>A_1^E\)). All those results are illustrated in the left graph of Fig. 3.

Likewise, we can analyze the impact of changing \(A_1\), as established in Theorem 3(ii). Analogous to expressions (19) (20) and (21), we can determine the total derivative and solve for \(\frac{\mathrm{d} e^*}{\mathrm{d} A_1}\). After plugging in, we obtain for \(\frac{\mathrm{d} \Psi _{I}}{\mathrm{d} A_1}+\frac{\mathrm{d} \Psi _{II}}{\mathrm{d} A_1}\) (observe \(\frac{\partial \Psi _{I}}{\partial A_1}=-\frac{\partial \Psi _{II}}{\partial A_1}=e\)):

$$\begin{aligned} \frac{\mathrm{d} \Psi _{I}}{\mathrm{d} A_1}+\frac{\mathrm{d} \Psi _{II}}{\mathrm{d} A_1}= & {} \left( \Psi _{I1}+\Psi _{\mathrm{Ie}} \frac{ \Psi _{E1}}{-\Psi _{\mathrm{Ee}}}+\Psi _{\mathrm{IIe}} \frac{ \Psi _{E1}}{-\Psi _{\mathrm{Ee}}} \right) \frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}\nonumber \\&+\left( \Psi _{II2}+\Psi _{\mathrm{IIe}} \frac{ \Psi _{E2}}{-\Psi _{\mathrm{Ee}}} +\Psi _{\mathrm{Ie}} \frac{ \Psi _{E2}}{-\Psi _{\mathrm{Ee}}} \right) \frac{\mathrm{d} X_2^*}{\mathrm{d} A_1} = 0 \nonumber \\\Leftrightarrow & {} \left( -\Psi _{\mathrm{Ee}} \Psi _{I1} + \left( A_2 - A_1^E - A_2^E \right) A_1^E\right) \frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}\nonumber \\&+\left( -\Psi _{\mathrm{Ee}} \Psi _{II2}+\left( A_2 - A_1^E - A_2^E \right) A_2^E\right) \frac{\mathrm{d} X_2^*}{\mathrm{d} A_1} = 0 \end{aligned}$$
(9)

Analogous to above the coefficients on the expressions \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}\) and \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}\), determine the impact of changed investment \(X_1^*\) or \(X_2^*\) on both equilibrium conditions. The sum of both coefficients is strictly negative if second-order conditions are not to be violated [compare Lemma 2(iii)]. This directly illustrates why \(\frac{\mathrm{d} X_2}{\mathrm{d} A_1}\) cannot reach the level of \(\frac{\mathrm{d} X_1}{\mathrm{d} A_1}\). (In other words, increased free allocation \(A_1\) cannot leave investment in the peak load technology unchanged.)

Furthermore, as we show, for small \(A_2\) both coefficients are negative (thus, \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}\) and \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}\) have opposite signs); since \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}>\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}\), this implies \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}<0\). Observe that the coefficient of the expression \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}\) is increasing in \(A_2\), and the coefficient of expression \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}\) is increasing in \(A_2\) if \(A_2^E>0\) (i.e., \(w_2>w_2^E\)). That is, for \(A_2\) high enough the coefficients become nonnegative, leading to altered monotonicity behavior. As we show in the theorem, we can establish a relative level of dirtiness \(w_2^S\) (with \(w_2^S\ge w_2^E\) and \(w_2^S=w_2^E\) in the case of linear demand) which separates the cases when either of the coefficients becomes zero for higher levels of \(A_2\). (Remember the sum of both coefficients has to be negative in order to satisfy the second-order conditions, see above.) Whenever the coefficient of \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}\) is equal to zero, expression (9) directly implies \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}=0\) and vice versa, as stated in the theorem.

Finally, in Theorem 3(iii) we provide the results of comparative statics with respect to the parameter T. For an intuition of those results, observe first of all that an increase of the total emission cap T leads to a reduction of the equilibrium permit price. This in turn induces increased total investment \(X_1^*\) if [similar to the intuition for part (i)] the increase of scarcity rents (which is obtained due to lower emission price) dominates the decreased value of the emission permits granted for free, i.e., \(A_1<A_1^E\). The opposite result is obtained for \(A_1>A_1^E\). Similarly, the reduced emission price induces increased investment in the base load technology \(X_2^*\) if the total impact of reduced emission price on the base load investment condition is negative, i.e., if and only if \(A_2<A_1+A_2^E\) (i.e., \(\Psi _{\mathrm{IIe}}<0\)). If we denote total emissions which obtain in the absence of any environmental policy by \({{\bar{T}}}\), then lowering the cap on total emissions T below \(\bar{T}\) corresponds to the introduction of a cap and trade mechanism.

Discussion and policy implications: Let us conclude this section by briefly discussing policy implications of our results. Many existing and planned cap and trade systems worldwide do grant free allocations; in most cases, we observe updating which also considers market entry of new capacities or retirements (IEA 2020). In those cases, investment in production facilities receives free allocations based on their expected technology-specific needs (see IEA 2020; ICAP 2020). Moreover, however, updating in many cases additionally includes components which grant free allocations based on past output choices. This induces additional distortions, both on investment and output choices, since firms’ decisions anticipate free allocations received in later compliance periods. Unlike previous contributions (compare, for example, Böhringer and Lange 2005; Mackenzie et al. 2008, or Damon et al. 2019), we are able to explicitly analyze the impact of free allocations on investment incentives. To keep our analysis tractable, however, we refrain from considering several compliance periods in a dynamic setup. For an interpretation of our analytical results in the context of cap and trade systems which grant free allocations based on past output and additionally grant free allocations to newly built facilities, a careful calibration of the parameters \((A_1, A_2)\) of our setup is required to reasonably approximate real-world incentives for our model.Footnote 20 An interesting reference case is obtained when free allocations are granted such as to cover all emissions caused by a certain installation. In our framework, we define this by \(A_1^{\mathrm{full}}\) and \(A_2^{\mathrm{full}}\). Observe \(A_1^{\mathrm{full}}\in [1-(F(\theta _{{{\overline{P}}}}))w_1,1-(F(\theta _{{\underline{P}}}))w_1]\) and \(A_2^{\mathrm{full}} \in [1-(F(\theta _{{{\overline{B}}}}))w_2, w_2 ]\).

In the context of our formal results, for an allocation scheme \((A_1^{\mathrm{full}},A_2^{\mathrm{full}})\) we obtain increased total investment and increased base load investment when introducing the trading system (i.e., the emission cap is lowered below \({{\bar{T}}}\) in our framework). To see this, first, observe that \(A_1^{\mathrm{full}}>A_1^E\) which according to Lemma 3 (iii) leads to an increase of \(X_1^*\). Second, observe that \(A_2^{\mathrm{full}}-A_1^{\mathrm{full}}>A_2^E\) (since \(A_2^{\mathrm{full}}>(1-F(\theta _{{{\overline{B}}}})) w_2\) and \(A_1^{\mathrm{full}}<(1-F(\theta _{{\underline{P}}})) w_1\)) which according to Lemma 3 (iii) leads to increased investment in the base load technology.

To apply the findings of Lemma 3(i), consider the example of an electricity market with lignite or coal-fired plants as a representative base load technology and open cycle gas turbines as a representative peak load technology. Since open cycle gas turbines have lower emission factors, we obtain \(w_2>w_1\), which directly implies \(w_2>w_2^E\) (compare Definition 1). Since furthermore \(A_1^{\mathrm{full}}>A_1^E\), as established above, we can directly conclude that an increase (decrease) of the free allocation \(A_2\) not only would yield increased base load investment but also an increased (decreased) emission price and increased (decreased) total investment.

After having analyzed the market equilibrium which is obtained in the presence of an emission trading system and derived its properties of comparative statics, we now proceed to the main part of this article and analyze the optimal design of a cap and trade mechanism.

4 Optimal market design under perfect competition

In this section, we determine the optimal cap and trade mechanism. We first determine the first best solution as a benchmark, which is obtained for the case of a perfectly competitive market when a regulator can freely choose all parameters \((A_1,A_2,T)\) of the cap and trade mechanism (see Theorem 1). We then analyze several market imperfections and solve for the corresponding second best solutions. We first determine the optimal cap and trade mechanism which should be chosen for an imperfectly competitive market (see Sect. 5). We then analyze the case when competition authorities cannot freely choose all parameters \((A_1,A_2,T)\) of the cap and trade mechanism but only a subset of them (see Sect. 6). In order to answer all those questions, we first determine total welfare generated in a market with some cap and trade mechanism \((A_1,A_2,T)\):

$$\begin{aligned} W(A_1,A_2,T)= & {} \int \limits _{{\underline{\theta }}}^{\theta _{{\overline{B}}}} \left[ \int _0^{Q^*} \left( P(Y,\theta )-c_2 \right) Y \mathrm{d} Y \right] {d} F(\theta )+\int \limits _{\theta _{{\overline{B}}}}^{\theta _{{\underline{P}}}} \left[ \int _0^{X_2^*} \left( P(X_2^*,\theta )-c_2 \right) Y \mathrm{d} Y \right] {d} F(\theta ) \nonumber \\&\quad \int \limits _{\theta _{{\underline{P}}}}^{\theta _{{\overline{P}}}} \left[ \int _0^{Q^*} \left( P(Y,\theta )-c_1 \right) Y \mathrm{d} Y \right] {d} F(\theta )+ \int \limits _{\theta _{{\overline{P}}}}^{{\overline{\theta }}} \left[ \int _0^{X_1^*} \left( P(X_1^*,\theta )-c_1 \right) Y \mathrm{d} Y \right] {d} F(\theta ) \nonumber \\&- \int \limits _{\theta _{{\underline{P}}}}^{{\overline{\theta }}} (c_1-c_2) X_2^* {d} F(\theta )- k_2 X_2^* - k_1(X_1^*-X_2^*)-D(T). \end{aligned}$$
(10)

Observe that welfare does not directly depend on the parameters \((A_1,A_2,T)\) chosen for the cap and trade mechanism, but only indirectly through the implied investment and production decisions \(X_1^*,X_2^*\) and \(Q^*\). In order to maintain presentability of the results, we relegated all computations to appendix and directly characterize the optimal cap and trade mechanism in the subsequent lemma.

Lemma 4

The optimal cap and trade mechanism solves the following conditions:

$$\begin{aligned} (i)&W_{A_1}:=\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1} \Omega _{I} + \frac{\mathrm{d} X_2^*}{\mathrm{d} A_1} \Omega _{II} =0 \\ (ii)&W_{A_2}:=\frac{\mathrm{d} X_1^*}{\mathrm{d} A_2} \Omega _{I} + \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2} \Omega _{II} =0 \\ (iii)&W_{T}:=\frac{\mathrm{d} X_1^*}{\mathrm{d} T}\Omega _{I} + \frac{\mathrm{d} X_2^*}{\mathrm{d} T}\Omega _{II}-D_T(T)+ e^*+\frac{\Delta }{ n}=0. \end{aligned}$$

The expressions \(\Omega _{I}\) and \(\Omega _{II}\) determine the total impact of changed \(X_1^*\) and changed \(X_2^*\), respectively, on total welfare. They are defined as follows:

$$\begin{aligned}&\Omega _{I}:=\int _{\theta _{{{\overline{P}}}}}^{{\overline{\theta }}} \frac{-P_q X_1^*}{n} \mathrm{d} F(\theta ) - A_1 e^* -\frac{\Delta }{n} A_1^E \\&\Omega _{II}:=\int _{\theta _{\overline{B}}}^{\theta _{{\underline{P}}}} \frac{-P_q X_2^*}{n}\mathrm{d} F(\theta ) - (A_2 - A_1)e^* - \frac{\Delta }{n} A_2^E . \end{aligned}$$

The term \(\frac{\Delta }{n}:= \frac{ \int _{{\underline{\theta }}}^{\theta _{{{\overline{B}}}}} \frac{\mathrm{d} Q^*}{d e^*} \left( - P_q \frac{Q^*}{n} \right) \mathrm{d} F(\theta ) + \int _{\theta _{{\underline{P}}}}^{\theta _{{{\overline{P}}}}}\frac{\mathrm{d} Q^*}{d e^*}\left( - P_q \frac{Q^*}{n} \right) \mathrm{d} F(\theta )}{ \int _{{\underline{\theta }}}^{\theta _{{{\overline{B}}}}} \left( \frac{\mathrm{d} Q^*}{d e^*} w_2 \right) \mathrm{d} F(\theta ) + \int _{\theta _{{\underline{P}}}}^{\theta _{{{\overline{P}}}}}\left( \frac{\mathrm{d} Q^*}{d e^*} w_1 \right) \mathrm{d} F(\theta )}>0\) determines the impact of changed emissions on welfare for those spot markets where investment is not binding.

Proof

See Appendix D. \(\square \)

We now provide some intuition for the conditions which characterize an optimal cap and trade mechanism. We first consider the optimal choice of the free allocation to the peak load technology given by \(A_1\). Observe that the optimality conditions (i) and (ii) express the impact of changed free allocation on total welfare exclusively through the channel of changed investment in the base load technology \(X_2^*\) and changed total investment \(X_1^*\). The total impact of changed investment on total welfare is denoted by \(\Omega _{I}\) and \(\Omega _{II}\), and this total impact can be broken down into three components corresponding to the three summands of \(\Omega _{I}\) and \(\Omega _{II}\), respectively.

First, observe that at all those spot markets where total investment is binding (i.e., for \(\theta \in [\theta _{{{\overline{B}}}}, \theta _{{\underline{P}}}]\) and \(\theta \in [\theta _{{{\overline{P}}}}, {{\overline{\theta }}}]\), respectively), imperfectly competitive investment behavior induces too low investment incentives, and an increase of investment \(X_2^*\) or \(X_1^*\) leads to increased welfare given by the markup \(-P_q\frac{X}{n}\). Second, free allocation \(A_1>0\) or \((A_2-A_1)>0\) induces too high investment incentives, and thus, an increase of investment would lead to a reduction of welfare given by the monetary value of the free allocation (i.e., \(A_1 e^*\) and \((A_2-A_1)e^*\)). Notice that in a world with exogenously fixed emission price \(e^*\), the optimal level of free allocation should be chosen such as to balance those two effects.Footnote 21 Since the emission price is endogenous in our analysis, an additional term is obtained. An increase of investment \(\mathrm{d} X_1^*\) or \(\mathrm{d} X_2^*\) leads to increased emissions of \(\mathrm{d} X_1^* A_1^E\) and \(\mathrm{d} X_2^* A_2^E\) at those spot markets where investment is binding. Since total emissions are capped by T, however, this necessarily has to imply an equivalent reduction of emissions at those spot markets where investment is not binding (i.e., for \(\theta \in [{\underline{\theta }}, \theta _{{{\overline{B}}}}]\) or \(\theta \in [\theta _{{\underline{P}}}, \theta _{{{\overline{P}}}}]\)). Since production decisions are also imperfectly competitive, a reduction of output leads to reduced welfare generated at those spot markets. This impact is quantified by the term \(\frac{\Delta }{n}\) defined in the lemma. That is, taking into account the endogenous nature of the emission price leads to a lower degree of optimal free allocation \(A_1\) than suggested by an analysis with exogenously fixed emission price.

The impact of a changed emission cap T on total welfare has a similar structure than the impact of changed free allocations. Analogous to above, a changed emission cap leads to changed investment incentives, and the impact of changed investment incentives on welfare is given by the terms \(\Omega _{I}\) and \(\Omega _{II}\) which have already been discussed above. As we will see later on in Theorems 1 and 2, if the levels of free allocation are chosen optimally such as to obtain \(\Omega _{I}=\Omega _{II}=0\), those terms will not be relevant for the optimal choice of the emission cap. If the levels of free allocation are not chosen optimally, however, they have to be considered when determining the optimal level of the emission cap T (compare Theorems 3, 4, 5 and 6).

Apart from having an impact on investment incentives, a changed emission cap T leads to changed welfare also through several other channels. First, most apparently an increased emission cap leads to increased emissions which reduce welfare by the marginal social cost of pollution \(D_T\). Second, observe that on the other hand, an increased emission cap leads to a welfare increase since it implies a reduced emission price which allows for increased output. The welfare increase at each spot market is given by the changed output multiplied by the difference between marginal cost as perceived by the firms and true marginal cost, i.e., \(dQ (w_i e^*)\), for \(i=1,2\). Put differently, however, this corresponds to the changed pollution at each spot market multiplied by the emission price \(e^*\), and the change in welfare at all spot markets then is simply given by the total change of emissions multiplied by the emission price, i.e., \(\mathrm{d} T e^*\). As we will see in the subsequent Theorem 1, for a perfectly competitive market the optimal cap and trade mechanism only balances those two effects and equates the marginal social cost of pollution to the emission price (i.e., \(e^*=D_T\)).Footnote 22 Third, observe that an increased emission cap T leads to a reduced emission price. This allows to reduce the welfare loss obtained due to imperfect competition at those spot markets where investment is not binding and output too low. Notice that the impact of changed emissions on welfare at those spot markets where investment is not binding has already been discussed above, it is given by \(\frac{\Delta }{n}\).

Based on the findings of Lemma 4 as the first best benchmark, we can now directly establish the optimal cap and trade mechanism which is obtained for a perfectly competitive market

Theorem 1

(Optimal market design, first best benchmark) Under perfect competition, the optimal market design satisfies

$$\begin{aligned} (i) \,\, A_1^* = 0 \qquad \qquad (ii) \,\, A_2^* = 0 \qquad \qquad (iii) \,\, T^*: \, \,\, e^* =D_{T}(T). \end{aligned}$$

Proof

See Appendix E. \(\square \)

Discussion and policy implications: The theorem demonstrates that in a competitive market (i.e., \(n \rightarrow \infty \)), where all aspects of the cap and trade system (here: parameters \(A_1\), \(A_2\) and T) can be freely chosen by authorities, full auctioning is unambiguously optimal (i.e.,  no free allocations should be granted) and the emission target T should be set such that the equilibrium permit price equals marginal social cost of environmental damage. That is, as already discussed above, the optimal cap and trade mechanism balances welfare losses due to foregone production at all spot markets given by \(e^*\) with the marginal social cost of pollution given by \(D_T\). In case of the EU-ETS, for example, in the early phases I and II (2005–2012) free allocations have been granted to all sectors including the electricity sector. One important reason has apparently been of political nature allowing for industry support when introducing the cap and trade system (see, e.g., Convery 2009). For the electricity sector, this has been changed since the start of phase III (2013), since then no free allocations have granted to electricity producersFootnote 23. This seems to be perfectly in line with the results obtained in Theorem 1. Observe, however, that the theorem requires all aspects of the cap and trade mechanism to be set optimally, including the total amount T of permits issued by authorities. Likely, also this total amount T of emissions is subject to political constraints which require the emission price e to remain within certain limits (likely below the socially ideal level). Let us mention already at this point, however, that we are able to derive the optimal design also in case of an exogenously fixed overall emissions cap T which is not optimally chosen. As we show later on in Theorem 3, also in this case it is optimal to fully auction permits without granting any free allocations. Thus, the elimination of free allocations in the electricity sector in the EU-ETS since 2013 is perfectly in line with our results, provided we can assume that producing firms interacts in a perfectly competitive way (for the relaxation of this assumption, see the subsequent section).

In the subsequent two sections, we now consider market imperfections which make an attainment of the first best outcome impossible. First, we analyze the case of imperfectly competitive markets (see Sect. 5). Apart from imperfect competition, another source of market imperfection arises when authorities cannot freely choose all parameters \((A_1,A_2,T)\) of the cap and trade mechanism, but only a subset. Such situations arise, for example, when the level of free allocation for (some of) the different technologies or the total emission cap is exogenously fixed due to political arrangements or lobbing of firms and the competition authority can only determine the remaining parameters (see Sect. 6).

5 Optimal market design under imperfect competition

After having determined the first best benchmark (Theorem 1), we now analyze the case of an imperfectly competitive market. Under imperfect competition, investment incentives are generally too low (compare Sect. 3). One of the usually proposed measures to overcome problems of under-investment is given by capacity payments which are assigned in capacity markets.Footnote 24 Notice that the fundamental impact of a capacity payment is equivalent to that of free allocations in our framework. That is, in our framework it is equivalent if a monetary payment \(S_t\) or free allocations of value \(A_t e^*\) for \(t=1,2\) are granted to a firm per unit of investment. In total, those measures lead to a reduction of marginal cost of investment by \(A_t e^* + S_t\). Thus, the optimal design of both instruments is closely related. In the present section, we want to shed light on this interdependence and determine the optimal design of capacity payments and of the cap and trade system.

Theorem 2

(Optimal Market Design under Imperfect Competition) Under imperfect competition, the optimal market design satisfies

$$\begin{aligned} (i)&A_1^* e^* + S_1^*= \int _{\theta _{{\overline{P}}}}^{{\overline{\theta }}}\left( \frac{-P_q X_1^*}{n } \right) \mathrm{d} F(\theta )- \frac{\Delta }{n }A_1^E \\ (ii)&A_2^* e^* + S_2^*= \int _{\theta _{{\overline{B}}}}^{\theta _{{\underline{P}}}}\left( \frac{-P_q X_2^*}{n }\right) \mathrm{d} F(\theta )+\int _{\theta _{{\overline{P}}}}^{{\overline{\theta }}}\left( \frac{-P_q X_1^*}{n } \right) \mathrm{d} F(\theta )- \frac{\Delta }{n }\left( A_1^E+A_2^E\right) \\ (iii)&T^*: \, \,\, e^* =D_{T}(T) - \frac{\Delta }{n }. \end{aligned}$$

Now, assume that \(P_{q\theta }=0\). We then obtain \(A_1^* e^* + S_1^* > 0\). For \(w_2\le w_2^E\), we obtain \(A_2^*e^* + S_2^*>A_1^* e^* + S_1^*\); for \(w_2 > w_2^E\), we can obtain \(A_2^* e^* + S_2^*=0\).

Proof

See Appendix E. \(\square \)

The optimal levels of total free allocations and capacity payments \(A_t^* e^*+S_t^*\) under imperfect competition are thus typically different from zero, a striking difference to the result obtained under perfect competition (see Theorem 1). The fundamental reason why this is the case follows directly from the insights provided by Lemma 1 and the subsequent discussion of the results: Imperfectly competitive firms not only exercise market power at the spot markets, but also choose their capacity to optimally benefit from scarcity prices, implying reduced investment incentives.

As already discussed in the text following Lemma 1 (compare the last paragraph which discusses Lemma 1), for an exogenously fixed price for pollution (e.g., a Pigouvian tax at some fixed level \(e^*\)) optimal investment incentives are obtained by subsidizing investment such as to precisely compensate for the difference between scarcity rents and marginal scarcity profits. To stick as close as possible to our notation, such subsidy could be made by assigning the amounts \(A_1\) and \(A_2\) of free tax vouchers to each unit invested in either of the technologies. Equivalently, such subsidy could also be realized by appropriately chosen levels of capacity payments \(S_t\), for \(t=1,2\). Formally, for an exogenous emission price the total amount of subsidies \(A_t e^* + S_t\) is given by expressions (i) and (ii) of Theorem 2 by setting \(\Delta =0\). (Notice that for exogenously fixed permit price, we have \(\Delta =0\).) Those would be the levels of capacity payments granted by an optimal capacity market which disregards the endogenous nature of the emission permit price.

However, if the emission permit price is not exogenous but changes with firms’ investment incentives (indeed \(\Delta >0\)), those levels of capacity payments are not optimal, as we show. Positive free allocation leads to increased investment incentives, which (through an increased emission price) can lead to reduced output (and thus pollution) at those spot market where investment is not binding. The terms including the expression \(\frac{\Delta }{n}\) take this welfare loss into account. This leads to a reduced level of the optimal degree of free allocation. As we show in the theorem, under imperfect competition the degree of free allocation for the peak load technology is always positive, i.e., \(A_1^* e^* + S_1^*>0\). For the optimal allocation for the base load technology, ambiguous results are obtained. If the base load technology is less emission intensive than the peak load technology (i.e., \(w_2 \le w_1\)), increased investment in the base load technology leads to reduced emissions and thus allows for more output at spot markets where investment is not binding. As we show, this always implies that the base load technology should receive more total subsidies than the peak load technology, i.e., \(A_2^* e^* + S_2^*>A_1^* e^* + S_1^*\). On the other hand, if the base load technology is more emission intensive than the peak load technology (\(w_2>w_1\), i.e., an increase of base load investment leads to increased emission price), then it might be optimal to grant less, i.e., \(A_2^*e^* + S_2^* < A_1^* e^* + S_1^*\) or even \(A_2^* e^* + S_2^*<0\), as we show.

Finally, consider the optimal choice of the total emission cap T for the case of imperfect competition. A brief look at the optimality condition (iii) established in Lemma 4 reveals that the impact of a changed emission cap on investment decisions can be neglected since the levels of free allocation are determined optimally (such as to obtain \(\Omega _I=\Omega _{II}=0\)). What matters, however, is the fact that an increased emission cap leads to a reduced emission price which in turn allows to reduce the welfare loss induced by imperfectly competitive production decisions at those spot markets where investment is not binding (given by \(\frac{\Delta }{n}\)). As a result, the optimal cap on total emissions is chosen such as to yield an emission price below the marginal social cost of pollution.

Discussion and policy implications: Based on our results, we can draw several important conclusions for the debate on the design of cap and trade mechanisms. First of all, policy makers should be very aware of the close relationship of capacity markets and cap and trade systems which update based on installed production facilities. Most importantly, as we show, the optimal capacity payments have to be fundamentally different for markets with exogenous emission price (e.g., for an emission tax) than for markets with endogenous price (as for a cap and trade system). Disregarding those aspects when designing capacity payments for a market with a cap and trade system will lead to flawed market outcomes.

Furthermore, policy makers might be able to exploit this strongly interdependent nature of capacity payments and free allocations, especially if there are rigidities present in either of the instruments. On the one hand, those rigidities might be present for the cap and trade system when free allocations have to be granted to enhance political support (compare our discussion in Sect. 1). In this case, optimal capacity payments should be designed such as to grant a total level of subsidies specified by our optimality conditions. On the other hand, the introduction of capacity markets to address problems of under-investment might be desirable, but politically infeasible for certain markets or industries. In such case, appropriately designed levels of free allocations might at least partially overcome this inefficiency. Our results established in Theorem 2 characterize the optimal policy choice for all those cases.

6 Optimal design of a partially constrained cap and trade mechanism

In Theorems 1 and 2, we determined the optimal design of a cap and trade mechanism when all its parameters \((A_1,A_2,T)\) can be freely chosen by the competition authority. We first analyzed the case of a perfectly competitive market, which yields the first best benchmark (Theorem 1) and then the case of imperfect competition (Theorem 2). Another source of market imperfection, apart from imperfect competition, arises when the competition authorities cannot freely choose all parameters \((A_1, A_2, T)\) of the cap and trade mechanism. Such rigidities might be due to political constraints and arrangements or due to lobbing of firms. As already discussed extensively in the introduction of this article, free allocations have been key to guarantee the political support necessary to introduce cap and trade systems, compare Convery (2009), Tietenberg (2006), Bovenberg et al. (2008) or, for example, Grubb and Neuhoff (2006)Footnote 25. It is the purpose of the present section to analyze how a competition authority should optimally design a cap and trade mechanism if it can determine only a subset of the parameters of the cap and trade mechanism, whereas the remaining parameters are exogenously fixed due to the above discussed problems.

Theorem 3 determines the optimal degree of free allocations for the case of exogenously fixed level of the total emission cap T. In Theorems 4, 5 and 6, we determine the optimal degree of free allocation to the remaining technologies and the corresponding level of the optimal total emission cap T. Observe that our results obtained in Lemma 4 in principle would allow for a detailed analysis of those questions both for the cases of perfect and imperfect competition. In order to limit the notational burden in the present paper, we restrict ourselves to the case of perfect competition, however. In this case, the optimality conditions determined in Lemma 4 read as follows

$$\begin{aligned} W_{A_1}:= & {} \frac{\mathrm{d} X_1^*}{\mathrm{d} A_1} \left( -A_1\right) e^* + \frac{\mathrm{d} X_2^*}{\mathrm{d} A_1} \left( A_1-A_2\right) e^* =0 \end{aligned}$$
(11)
$$\begin{aligned} W_{A_2}:= & {} \frac{\mathrm{d} X_1^*}{\mathrm{d} A_2} \left( -A_1\right) e^* + \frac{\mathrm{d} X_2^*}{\mathrm{d} A_2} \left( A_1-A_2\right) e^* =0 \end{aligned}$$
(12)
$$\begin{aligned} W_{T}:= & {} \frac{\mathrm{d} X_1^*}{\mathrm{d} T}\left( -A_1\right) e^* + \frac{\mathrm{d} X_2^*}{\mathrm{d} T}\left( A_1-A_2\right) e^* -D_T(T)+ e^*=0. \end{aligned}$$
(13)

We first analyze the case of an exogenously fixed level of the cap on total emissions T, and an example might be a situation where politicians are willing to introduce a cap and trade mechanism, but are reluctant to induce too severe (even though optimal from an overall welfare point of view) distortions on the economy. The above optimality conditions directly reveal that in a perfectly competitive market, no free allocations should be granted to firms, independently of the level of the emission cap.Footnote 26 This is summarized in Theorem 3.

Theorem 3

(Optimal design for fixed emission cap T) For any exogenously fixed total emission cap T, it is optimal to choose the levels of free allocation \(A_1^* =A_2^* = 0\).

That is, the result obtained in the first best benchmark (Theorem 1) where no free allocation has been found to be optimal is also obtained if the total emission cap is not set at an optimal level. This seems to be perfectly in line with the abandonment of free allocations for the electricity sector in most member states in the context of the EU-ETS since 2013. For a more detailed discussion of the policy implications, see the discussion following Theorem 2. Observe that the reverse does not hold as we show in the subsequent theorem, however.

Theorem 4

(Optimal design for fixed allocations \(A_1\) and \(A_2\)) Suppose the initial allocations \(A_1\) and \(A_2\) are fixed exogenously. Define

$$\begin{aligned} \Gamma _0(A_1,A_2):= (A_1-A_1^E)A_1 \Psi _{I1}+ (A_2-A_1-A_2^E)(A_2-A_1) \Psi _{II2} . \end{aligned}$$
(14)

The optimal emission cap \(T^*\) has to be set such as to satisfy \(e^*=D_T(T^*)\) for \(\Gamma _0(A_1,A_2)=0\), \(e^*>D_T(T^*)\) for \(\Gamma _0(A_1,A_2)>0\), and \(e^*<D_T(T^*)\) for \(\Gamma _0(A_1,A_2)<0\).

Proof

See Appendix F. \(\square \)

Fig. 4
figure 4

Choosing the optimal \(T^*\) for exogenously fixed initial allocations \(A_1\) and \(A_2\). Left: for relatively dirty base technology, i.e., \(w_2>w_2^E\), right: for relatively clean base technology, i.e., \(w_2<w_2^E\)

That is, for levels of free allocation \(A_1,A_2\) which are not set optimally the optimal cap on emissions T typically does not implement an emission price \(e^*\) equal to the social cost of pollution \(D_T\). To get an intuition for the result, note first that the cap T on total emissions governs the price for emission certificates \(e^*\) which in turn influences both investment decisions and unconstrained production decisions at those spot markets where investment is not binding. Optimal production decisions are induced by an emission price equal to the social cost of pollution. This is only overall optimal in case of optimal investment incentives.

Now, first observe that in case of positive free allocations (as considered in the theorem), investment incentives are distorted, however. That is, for \(A_1>0\) investment incentives in the peak load technology are too high; for \(A_2>A_1\) (\(A_2<A_1\)), investment incentives in the base load technology are too high (low). A distortion of the emission price can then be suited to at least partially adjust investment incentives.

Second, observe that the impact of a changed emission cap T on investment incentives already has been derived in Lemma 3 (iii) and was discussed in the subsequent text. As established there, a higher emission cap T (implying a lower emission price \(e^*\)) leads to increased investment in the peak load technology \(X_1^*\) if and only if \(A_1<A_1^E\), and it leads to increased investment in the base load technology \(X_2^*\) if and only if \(A_2<A_1+A_2^E\).

Intuitively, Theorem 4 formally joins those two effects; that is, whenever the levels of free allocation \(A_1,A_2\) are such as to induce over investment, the total cap on emissions should be set such as to induce an emission price which leads to a reduction of investment incentives and vice versa. All those findings are illustrated graphically in Fig. 4.

Consider the case \(A_2=A_1>0\), where all technologies get the same amount of free allocations (the 45-degree line of Fig. 4). In the light of the above discussion, this implies first of all that investment incentives in the base load technology are undistorted (since \(A_2=A_1\)) and investment incentives in the peak load technology are too high. For \(A_1<A_1^E\), investment incentives are reduced for a higher emission price; for \(A_1>A_1^E\), they are reduced for a lower emission price. Next, consider the case \(A_2=A_1+A_2^E\). In this case, a changed emission price \(e^*\) has no impact on investment in the base load technology, and analogous to above, the optimal cap T is thus designed exclusively such as to reduce the too high investment incentives in the peak load technology (i.e., for \(A_1<A_1^E\), we have \(e^*>D_T\) and vice versa).Footnote 27

Discussion and policy implications: We conclude the discussion of Theorem 4 by applying our findings to the above discussed policy of full allocation \((A_1^{\mathrm{full}},A_2^{\mathrm{full}})\) as already introduced at the end of Sect. 3. Remember that we derived the following properties for the levels of full allocation: \(A_1^{\mathrm{full}}>A_1^E\) and \(A_2^{\mathrm{full}}>A_1^{\mathrm{full}}+A_2^E\). As already discussed, if we consider either lignite or coal-fired plants as the representative base load technology and open cycle gas turbines as the representative peak load technology, we also obtain \(A_2^{\mathrm{full}}>A_1^{\mathrm{full}}\). For our framework, we thus obtain that the optimal cap on total emissions has to be set such that the equilibrium permit price is lower than the social cost of pollution, i.e., \(e^*<D_T\).

In the subsequent Theorem 5, we consider the case that only the allocation for the peak load technology \(A_1\) is exogenously fixed, and allocation for the base load technology \(A_2\) and the total emission cap T can be determined optimally, however.

Theorem 5

(Optimal design for fixed allocation \(A_1\)) Suppose the allocation for the peak technology \(A_1\) is exogenously fixed. The optimal allocation for the base technology then solves \(A_2^*= \frac{\mathrm{d} X_2^*/\mathrm{d} A_2 - \mathrm{d} X_1^*/\mathrm{d} A_2}{\mathrm{d} X_2^* /\mathrm{d}A_2}A_1\). More specifically, we obtain (see left graph of Fig. 5)

$$ \begin{aligned} \left\{ \begin{array}{l@{\quad }l@{\quad }l} A_2^*=0 &{} if &{} (A_1^{\mathrm{cross}}\le A_1< A_1^{\mathrm{lim}}) \\ 0<A_2^*<A_1 &{} if &{} \left( (0< A_1< A_1^E)\, \& \, (w_2<w_2^E)\right) OR \left( (A_1^E< A_1< A_1^{\mathrm{cross}})\, \& \, (w_2>w_2^E) \right) \\ A_1<A_2^* &{} if &{} \left( (A_1^E< A_1< A_1^{\mathrm{cross}})\, \& \, (w_2<w_2^E)\right) OR \left( (0< A_1 < A_1^E)\, \& \, (w_2>w_2^E) \right) . \end{array}\right. \end{aligned}$$

The optimal cap \(T^*\) is such that \(e^*>D_T\) if \(A_1 < A_1^E\) and \(e^*<D_T\) if \(A_1 > A_1^E\).

Proof

See Appendix G. \(\square \)

Fig. 5
figure 5

Left: Choosing the optimal \(A_2^*\) for exogenously fixed initial allocation \(A_1\). Right: Choosing the optimal \(A_1^*\) for exogenously fixed initial allocation \(A_2\)

Observe that the optimality condition for \(A_2\) as stated in the theorem is obtained directly by rearranging expression (12). To derive the properties of the optimal degree of allocation for the base load technology \(A_2^*\) as stated in the theorem, we can now make use of the properties of comparative statics derived in Lemma 3(i). Most importantly, as established there, we always obtain \(\frac{\mathrm{d} X_2^*}{\mathrm{d}A_2}>0\). We thus obtain \(A_2^*>A_1\) if and only if \(\frac{\mathrm{d} X_1^*}{\mathrm{d}A_2}<0\). Since we only consider nonnegative levels of free allocation, we furthermore obtain \(A_2^*=0\) whenever \(0<\frac{\mathrm{d} X_2^*}{\mathrm{d}A_2}<\frac{\mathrm{d} X_1^*}{\mathrm{d}A_2}\). All those results of comparative statics have been derived in Lemma 3 and have been discussed subsequently in Sect. 3. Figures 3 and 5 do thus in principle look identical; observe, however, that Fig. 3 exclusively states results of comparative statics, whereas Fig. 5 illustrates the properties of the optimal allocation \(A_2^*\) by making use of the previously obtained findings.

The intuition for the optimal level of free allocation \(T^*\) in principle goes along the same lines as the one provided for the findings of Theorem F. As compared to the first best benchmark, for positive allocation \(A_1\) investment incentives in the peak load technology are too high. Whenever \(A_1<A_1^E\), we obtain \(\Psi _{\mathrm{Ie}}<0\) which implies that a higher emission price \(e^*\) allows to reduce investment incentives in the peak load technology. Observe furthermore that investment incentives in the base load technology as induced by \(A_2^*\) are either too high or too low (i.e., \(A_2^*>A_1\) for \(w_2>w_2^E\) and vice versa). As we show in the theorem, it is optimal to set the total cap such as to obtain an emission price \(e^*>D_T\) which induce reduced investment incentives in the peak load technology whenever \(A_1<A_1^C\). Observe that for \(A_1=A_1^C\), we obtain \(A_2^*=A_1=A_1^C\) which implies undistorted investment incentives in the base load technology; in this case, we thus obtain \(e^*=D_T\). The reverse holds true for the case \(A_1>A_1^C\) where the optimal emission cap \(T^*\) has to be set to obtain \(e^*<D_T\) which induces reduced investment incentives in the peak load technology.

In the subsequent Theorem 6, we consider the case that allocation for the peak load technology \(A_2\) is exogenously fixed, and allocation for the base load technology \(A_2\) and the total emission cap T is determined optimally.

Theorem 6

(Optimal design for fixed allocation \(A_2\)) Suppose the allocation for the base technology \(A_2\) is exogenously fixed. The optimal allocation for the peak technology then solves \(A_1^*= \frac{\mathrm{d} X_2^* /\mathrm{d}A_1}{\mathrm{d} X_2^*/\mathrm{d} A_1 - \mathrm{d} X_1^*/\mathrm{d} A_1}A_2\). More specifically (see right graph of Fig. 5)

$$ \begin{aligned} \left\{ \begin{array}{l@{\quad }l@{\quad }l} A_1^*=0 &{} if &{}(A_2^{\mathrm{cross}}\le A_2< A_2^{\mathrm{lim}}) \\ 0<A_1^*<A_2 &{} if&{} \left( (0< A_2< A_2^{\mathrm{cross}})\, \& \, (w_2<w_2^S)\right) OR \left( (0< A_2< A_2^{\mathrm{total}})\, \& \, (w_2>w_2^S) \right) \\ A_2<A_1^* &{} if&{} (A_2^{\mathrm{total}}< A_2 < A_2^{\mathrm{lim}}). \end{array}\right. \end{aligned}$$

Define \(A_2^{\mathrm{em}}:=A_1^E+A_2^E\). We obtain \(A_2^{\mathrm{em}}< A_2^{\mathrm{total}}\) and \(A_2^{\mathrm{em}}< A_2^{\mathrm{cross}}\). The optimal cap \(T^*\) is such that \(e^*<D_T\) if \(A_2 < A_2^{\mathrm{em}}\) and \(e^*>D_T\) if \(A_2 > A_2^{\mathrm{em}}\).

Proof

See Appendix H. \(\square \)

Observe that the optimality condition for \(A_1\) as stated in the theorem is obtained directly by rearranging expression (11). To derive the properties of the optimal degree of allocation for the peak load technology \(A_1^*\) as stated in the theorem, we can now make use of the properties of comparative statics derived in Lemma 3(ii). Most importantly, as established there, we always obtain \(\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}>\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}\). Since we only consider nonnegative levels of free allocation, we obtain \(A_1^*>0\) whenever \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}<0\). Furthermore, we obtain \(A_1^*>A_2\) whenever \(\frac{\mathrm{d} X_2^*}{\mathrm{d} A_1}<\frac{\mathrm{d} X_1^*}{\mathrm{d} A_1}<0\).

Let us finally provide some intuition for the optimal cap on total emissions T. First of all, observe that for \(A_2<A_2^{\mathrm{total}}\) and \(A_2<A_2^{\mathrm{cross}}\), we always obtain \(0<A_1^*<A_2\) which implies that investment incentives both in the base load and the peak load technology are too high as compared to the first best benchmark. For low levels of allocation to the base load technology (i.e., \(A_2<A_1^*(A_2)+A_2^E\)), we obtain \(\Psi _{\mathrm{IIe}}<0\) which makes it optimal to induce an emission price \(e^*>D_T\) to lower investment incentives for both technologies.Footnote 28 Observe that for \(A_2=A_1^*(A_2)+A_2^E\), we obtain \(\Psi _{\mathrm{IIe}}=0\), and a distortion of the emission price above (or below) social cost of pollution has no impact on investment incentives in the base load technology. However, investment incentives in the peak load technology are too high (since \(A_1^*>0\)). Since \(\Psi _{\mathrm{Ie}}<0\), the distortion of the emission price above social cost of pollution is thus still suited to reduce investment incentives in the peak load technology. In total, the theorem thus balances increased investment incentives in the base load technology with reduced investment incentives in the peak load technology. The cutoff is reached where \(\Psi _{\mathrm{IIe}}+\Psi _{\mathrm{Ie}}=0\) which implies \(A_2=A_1^E+A_2^E=A_2^{\mathrm{em}}\). That is for \(A_2<A_2^{\mathrm{em}}\), it is optimal to set an emission cap \(T^*\) which induces \(e^*>D_T\) and for \(A_2>A_2^{\mathrm{em}}\), the optimal cap \(T^*\) induces \(e^*<D_T\).Footnote 29 All those results are illustrated in Fig. 5.

Discussion and policy implications: We conclude the discussion of Theorems 5 and 6 by applying our findings to the above discussed policy of full allocation \((A_1^{\mathrm{full}},A_2^{\mathrm{full}})\) which served as the main illustrating example throughout this article:

First, consider the case of Theorem 5, where the allocation \(A_1\) for the peak technology is exogenously fixed. As already shown, under full allocation we obtain \(A_1^{\mathrm{full}}\in [A_1^E, (1-F(\theta _{{\underline{P}}}))w_1]\). For the case of a completely clean base load technology (that is \(w_2=0\), in the context of electricity markets this would be the case for nuclear power plants, for example) under the current rules such technology would not obtain any permits. As our results directly show, however, such technology should be granted more free allocations than the peak technology, i.e., \(A_2^*>A_1^{\mathrm{full}}\). Moreover, the total emission cap \(T^*\) should be chosen such as to implement \(e^*<D_T\) in order to dampen excessive investment incentives induced by those levels of free allocation.

Next, consider the case of Theorem 6 where the allocation \(A_2\) for the base technology is exogenously fixed. The optimal level of free allocation for the peak technology has to be strictly positive if the peak technology is less emission intense than the base technology. In particular, if the peak technology is completely clean (that is \(w_1=0\), in the context of electricity markets this would be the case for small biogas-fired engines or turbines for example), under full free allocation this technology would not receive any free permits. As our results directly show, however, such technology should be granted a positive amount of free permits. Unlike in the case discussed in the preceding paragraph, however, the level of free allocation for the peak load technology should remain below the exogenously fixed level of free allocation for the base load technology. The reason for this difference lies in the fact that free allocations \(A_2\) for the base load technology only have an impact on the resulting technology mix, and free allocations \(A_1\) to the peak load technology have an impact on the resulting technology mix and on firms’ total investment activity. For exogenously fixed \(A_2\), the optimal level of \(A_1^*\) is thus more moderate since it also leads to distorted total investment decisions. Finally, notice for the optimal cap on total emissions: Since \(A_2^{\mathrm{em}}<(1-F(\theta _{{{\overline{B}}}}))w_2<A_2^{\mathrm{full}}\) (compare Definition 1 and the last paragraph of Sect. 3), we can directly conclude that the total cap on emission has to be chosen such as to implement \(e^*<D_T\) in order to dampen excessive investment incentives induced by those levels of free allocation.

In sum, if one of the technologies is granted an exogenously fixed level of free allocation (e.g., due to lobbying), then the optimal pattern of allocations to the remaining technology is completely different from the one that is obtained under full allocation. Furthermore, for high levels of free allocation, the cap on total emissions should be chosen such as to induce an emission permit price which is below marginal social cost of pollution in order to reduce the distortions on the resulting technology mix.

7 Conclusion

Tradeable pollution permits are an increasingly important policy tool in environmental legislation worldwide. The possibility to freely allocate permits provides an important possibility to share the regulatory burden. This seems to significantly enhance the political support for recently introduced legislations (see, for example, Tietenberg 2006; Bovenberg et al. 2008, or Convery (2009)). Since free allocations typically are subject to implicit or explicit updating, the allocation of permits has an impact on firms’ decisions: First, updating of free allocations on the one hand has an impact on firms’ operation of existing production facilities if they believe that current output or emissions do have an impact on allowances granted in future periods. Second, updating will also have an impact on firms’ incentives to modify their production facilities. The first aspect has already been intensively analyzed in the literature (compare, for example, Böhringer and Lange 2005; Mackenzie et al. 2008; Böhringer et al. 2017, or Meunier et al. 2018).

All those contributions focus on the impact of free allocations with updating on output decisions and abstract from investment decisions. However, free allocations with updating also have a direct impact on firms’ investment incentives which determine production capacities and the technology mix in the long run. It has been the purpose of the present article to analyze its impact on firms’ investment decisions and to derive the optimal cap and trade mechanism in such an environment.

In the present article, we have thus analyzed an analytical framework with tradeable permits and a cap on total emissions. Potentially strategically acting firms have been able to invest into production facilities (with different emission intensities) which allow for production for a longer horizon of time. After establishing the market equilibrium and the resulting technology mix, we have analyzed the optimal design of the cap and trade mechanism.

As a benchmark, we established the first best solution which is obtained for an ideal market. We then have derived the optimal design of the cap and trade system for a series of market imperfections. First, we have analyzed the case of strategic investment and production decisions in an imperfectly competitive market. This allowed to highlight the close interdependency of mechanisms to overcome low investment incentives (such as, for example, capacity markets, compare Fabra 2018) and cap and trade systems: If the endogenous nature of emission prices in the presence of a cap and trade system is disregarded, too high investment incentives are induced by such mechanisms.

We then have analyzed the case that the competition authority cannot freely choose all parameters of the cap and trade system due to restrictions imposed by the political processes. The optimal choice of the remaining parameters differs substantially from that observed for the first best benchmark. Our result showed, for example, that if a certain technology receives free allocations, it is typically optimal to grant free allocations also to the other technology. Interestingly, those free allocations granted to the other technology should be higher in case this technology is less emission intensive.