1 Introduction

Reducing economic disparities across the region and assisting vulnerable states have long been on the policy agenda of the European Union.Footnote 1 The debt crisis of 2008 highlighted the importance of this goal and provided the impetus for deeper fiscal integration in the EU. An ambitious interpretation of deeper fiscal integration involves implementing a common tax and transfer system (see, e.g, Casella 2005; Fuest and Peichl 2012). Nevertheless, shifting tax and transfer authorities to the EU level is not free of classical equity-efficiency considerations. To the best of my knowledge, a normative treatment of the issue in a Mirrleesian framework does not yet exist in the literature.

This study simulates three optimal income tax scenarios in a Mirrleesian framework for 24 EU countries and quantifies the implications of implementing these tax scenarios. I use gross hourly earnings data from 2014 wave of the EU Structure of Earnings Survey (SES) in order to calibrate the model. In the benchmark scenario (scenario 1, S1), every country maximizes its own welfare over separate budget constraints. In the case of a European tax and transfer system (scenarios 2 and 3), the social planner maximizes the total welfare in the EU with respect to a common budget constraint. For this scenario, two different possibilities are considered. In scenario 2 (S2), the EU is treated as a single country and, hence, the same tax scheme is applied everywhere. In scenario 3 (S3), the social planner is able to condition the taxes to the country of residence (“tagging”).Footnote 2

The analysis begins by investigating the outcomes at the EU level. I find that both scenarios involving a common tax and transfer system yields welfare improvement compared to the benchmark. On the other hand, more than two-thirds of the agents become worse off. Whereas S2 leads to a contraction in total output up to 2.65%, efficiency implications of S3 are sensitive to the assumption regarding the income effects on labor supply. Specifications with and without income effects produce an expansion by 1.53% and a contraction by 0.62% respectively.

As a next step, the implications of three scenarios on the economic outcomes by country are explored. In both S2 and S3, countries exhibiting relatively higher mean in the wage rate distribution (all of the Western and some of the Northern European countries) transfer resources to the other countries and become worse off. Transfers are higher in an S3 scenario. In relative terms, Denmark transfers the most resources in both S2 and S3 (around 13% and 24% of its total gross income respectively). S2 implies higher labor distortions for almost every country, explaining the aggregate decline in total output. In S3, compared to the benchmark, high-mean countries are distorted more and low-mean countries are distorted less. Yet, the social planner is able to extract higher labor effort from high-mean countries when the utility function incorporates income effects. When the utility function does not allow for income effects, higher distortion in high-mean countries directly translates into lower total output. As a result, implication of S3 on aggregate output is dependent on the specific form of the utility function.

The national tax schedules are analyzed in order to understand the changes in welfare distributions within individual countries. In low-mean countries, those with the lowest income benefit the most from cross-country transfers of S2 and S3. In countries with a high mean and low standard deviation (e.g., Sweden), distortion and average taxes on the rich rise sharply in S2 compared to the benchmark, making them the worst off in terms of indirect utility. On the other hand, countries with high mean and high standard deviation (e.g., Germany) exhibit higher overall marginal taxes in the benchmark. Therefore, S2 does not lead to a significant shift in the marginal tax rate schedule. In such countries, the poor are the worst off in S2 because they must share the tax revenues with the poor in low-mean countries. In S3, the latter effect dominates and leads to a disproportional decline in the indirect utility of the poor in all high-mean countries.

S2 and S3 demonstrates the significant losses that high mean countries have to bear for a European tax and transfer system. In order the limit the extent of the direct losses, I investigate a more restrictive scenario. Social planners of S2 and S3 solve their problem with an additional constraint, that is, linearly aggregated social welfare must not be lower than the benchmark in any country. Together with the new constraint, the social planners ensure that a certain fraction of the individuals in transferring countries is better off as a result of the integrated European tax system. In this scenario, S2 does not yield an improvement in social welfare and, hence, it is not investigated in detail. S3, different than the full optimum, implies lower optimal marginal income taxes for every country. While about half of all Europe is better off and there is sizable output expansion in S3 compared to benchmark, increase in total welfare is restricted to 1%.

The remainder of this paper is organized as follows. Section 2 briefly discusses the related studies and the contribution of this study. Section 3 formalizes the social planner’s problems in the three scenarios. Section 4 introduces the two specifications used for the utility function. Subsequently, the calibration procedure is explained. The EU-level, country-level and within-country results are presented. Finally, Sect. 5 concludes.

2 Related literature

A vast literature explores and analyzes the different possibilities of deeper fiscal integration in Europe.Footnote 3 This study is the first to estimate optimal Mirrleesian schemes for a European tax and transfer system. The analysis of “tagging” (S3), which considers 24 EU countries as subgroups, and the “linear country welfare improving scenario” have not yet been pursued within the discussion on a European tax and transfer system. A number of studies that investigate the implications of a potential EU-wide labor income taxation system, albeit via different methodologies, are discussed in detail below.

Recently, a growing literature evaluates the distributional outcomes of different policies across the EU via micro-simulation approach using EUROMOD. Paulus et al. (2017), for example, analyzes the consequences of fiscal consolidation measures implemented following the financial crisis of 2008. See Figari et al. (2015) for a detailed discussion of the micro-simulation approach for policy analysis.Footnote 4

Closest to the purpose of this paper among the micro-simulation literature, Bargain et al. (2013b) and Dolls et al. (2013) study the redistributive and stabilizing effects of implementing a common tax and transfer scheme in the euro area. By using household level data from EUROMOD, the authors construct a European tax and transfer system as the weighted average of the observed national tax and transfer systems. In a next step, they compare the results of the hypothetical scenario to the observed scenario. In this study, I abstract from the stabilizing effects and take a normative approach to evaluate equity-efficiency implications of different scenarios at the optimum. Conceptually, S2 of this study is equivalent to the European Tax and Transfer System discussed in Bargain et al. (2013b) and Dolls et al. (2013). I find that the optimal tax scheme of S2 is more distortive than almost every nationally optimal tax scheme. Thus, resulting implications on (in particular) efficiency for individual countries are different than Bargain et al. (2013b) and Dolls et al. (2013), where a centralized tax system is assumed to be the weighted average of existing national tax schemes. In S2 of this study, for example, labor supply falls in most of the countries, both with and without income effects.

Kopczuk et al. (2005) studies optimal world redistribution. For a Cobb–Douglas utility function with given parameters and an average flat tax rate, they estimate the ability distributions of 118 countries that matches empirical Gini coefficients and mean incomes. In a next step, they adjust the parameters of the utility function to minimize the squared errors between empirical PPPs and to match the average labor supply. Finally, they estimate a flat tax rate and a constant demogrant to approximate the optimal world redistribution.

More relatedly, Seelkopf and Yang (2018), relying on the methodology in Kopczuk et al. (2005), constructs an optimal EU-wide income taxation system. I estimate fully non-parametric optimal European tax schemes, contrary to the flat tax rate combined with a demogrant in Seelkopf and Yang (2018), whose EU-wide tax system corresponds to S2 with income effects in this study. Solving for non-parametric schemes allows me to thoroughly investigate the changes in the within-country distributions of welfare (see Sect. 4.3.3). Notable insights are as follows. In S2, richer are the worst off in high-productivity countries with low standard deviation (e.g., Sweden), whereas poorer are the worst off in high-productivity countries with high standard deviation (e.g., Germany). Redistributive benefits of an integrated tax system for poor countries, however, always come at the expense of the poor in high-productivity countries in S3. Moreover, in Seelkopf and Yang (2018), poorer countries experience a decline in labor distortions and EU-level labor supply remains the same. With the non-parametric tax schemes, I find that marginal income taxes increase especially for the lower part of the distribution in poorer countries, where a high fraction of the population is located, as a result of S2. Consequently, income weighted marginal taxes increase for majority of the countries in S2, including the poorer ones, and there is a sizable contraction in total output.

Additionally, different from Seelkopf and Yang (2018), this study recovers the ability distributions over the micro data (SES) and applies empirical PPPs directly to the observed wage rates. Because the construction of PPP-adjusted wage rates in this study does not require any prior assumptions, results can be obtained via different preference specifications. Section 4.3.2 showcases that there are considerable differences between results for utility specifications with and without income effects for S3.

In the optimal income tax literature initiated by Mirrlees (1971) and Akerlof (1978) was the first to argue that conditioning taxes on observable characteristics (tags) that are correlated with earnings ability would improve the performance of taxation.Footnote 5 As emphasized above, this study is the first to consider “tagging” for the discussion of a centralized European tax and transfer system. The common practice in the literature of “tagging” is to compare the outcome of the tagged scenario (S3) to the pooled scenario (S2). See, for example, Cremer et al. (2010) and Bastani et al. (2013) among others. Evaluating an EU-wide tax system with tagging by country of residence provides a natural framework to compare the tagged scenario (S3) to the separate maximization (S1). I find that the optimal marginal income taxes increase with “tagging” compared to separate maximization for the groups with higher mean.

Relatedly, Kessing et al. (2020) split the districts of the US into two groups (large metropolitan and other regions) in order to study the trade-off between enhanced redistribution and efficiency enhancing migration. Their model incorporates an extensive margin migration decision and, therefore, the region of residence functions as an endogenous tag. This study assumes immobile workers and considers country of residence as an exogenous tag.Footnote 6

A longstanding argument in the theory of fiscal federalism is that redistributive taxation should be carried out at the central level due to the mobility of the taxpayers (see, e.g., Oates 1999). On the other hand, a connected body of the literature embeds multiple layers of governments into the Mirrleesian optimal income tax model and, more relevant for the EU, analyzes optimal tax policies and public goods provision when workers are immobile.Footnote 7 See Boadway and Keen (1993), Aronsson and Blomquist (2008) and Aronsson (2010). The analysis in this paper does not consider public goods provision, but adds to this literature by quantifying the welfare and equity-efficiency implications of centralized vs. decentralized solutions in an applied example.

3 Conceptual framework

There is a set K of N countries in the economy. Each country k constitutes a fraction \( s_{k} \), that satisfies \( \sum _{k \in K} s_{k} = 1 \), of the total population. Countries are populated by an immobile unit continuum of agents who differ with respect to their wage rates (abilities), w. In each country k, wage rates are distributed with density \( f_{k}(w) \) over the same support \( [{\underline{w}}, {\overline{w}}] \).Footnote 8

Agents derive utility over the same separable preferences that are a function of consumption, c(w) , and labor supply, \( l(w) = \frac{z(w)}{w} \), where z(w) corresponds to gross income:

$$\begin{aligned} U(w) = u(c(w)) - v\Big (\frac{z(w)}{w}\Big ) \end{aligned}$$
(1)

Disutility of labor supply satisfies usual convexity conditions, that is \( v^{\prime }(.)>0 \), \( v^{\prime \prime }(.)>0 \). Properties of u(.) differ across specifications.

In each of the three tax scenarios, the social planners employ a social welfare function G(.) . Note that specific form of G(.) varies according to the choice of u(.) in order to satisfy the diminishing marginal utility of consumption principle. The social planners have access to a non-linear income tax instrument, T(z(w)) , that satisfies: \( z(w) - c(w) = T(z(w)) \). The information structure of the model is standard. When choosing the optimal tax schedule, the social planners are able to observe z(w) but not w. Hence, incentive compatibility constraints must be employed in order to ensure that each agent reveals his or her true ability type. In what follows, by using the first-order approach, incentive compatibility constraints are replaced by a law-of-motion that describes how the utilities of different ability types change at the optimum.Footnote 9

Scenario 1

(Benchmark) Social planner’s problem \( \forall k \in K \):

$$\begin{aligned} \underset{\{c(w), z(w)\}_{w \in W}}{\max } \int _{w \in W} G(U(w))f_{k}(w)dw \end{aligned}$$
(2)

subject to government budget constraint

$$\begin{aligned} \int _{w \in W} (z(w) - c(w))f_{k}(w)dw = 0 \end{aligned}$$
(3)

and incentive compatibility constraint

$$\begin{aligned} \dfrac{dU(w)}{dw} = \dfrac{z(w)}{w^2}v^{\prime }\Big (\frac{z(w)}{w}\Big ) \end{aligned}$$
(4)

In S1, the social planner solves a standard Mirrlees problem for each country separately according to its individual budget constraint. Therefore, this benchmark scenario excludes the possibility of cross-border transfers. At the same time, the social planner observes the differences in the probability distributions of the agents in each k (\( f_{k}(w) \)) and can differentiate the tax schemes across countries.Footnote 10

Scenario 2

(Pooled) Social planner’s problem:

$$\begin{aligned} \underset{\{c(w), z(w)\}_{w \in W}}{\max } \int _{w \in W} G(U(w))f_{P}(w)dw \end{aligned}$$
(5)

subject to government budget constraint

$$\begin{aligned} \int _{w \in W} (z(w) - c(w))f_{P}(w)dw = 0 \end{aligned}$$
(6)

and incentive compatibility constraint

$$\begin{aligned} \dfrac{dU(w)}{dw} = \dfrac{z(w)}{w^2}v^{\prime }\Big (\frac{z(w)}{w}\Big ) \end{aligned}$$

where

$$\begin{aligned} f_{P}(w) = \sum _{k \in K}s_{k}f_{k}(w) \end{aligned}$$
(7)

The problem of the social planner in S2 is technically identical to S1. The only difference is that the probability distribution function used in S2 (\( f_{P}(w) \)) is obtained by pooling the populations of different countries. In other words, the social planner treats the entire population as a single country in S2. Note that, by construction of the problem, budget constraints of the countries are also pooled. Hence, transfers between countries are possible. However, the social planner does not exploit the information on differing probability distributions across countries and, therefore, applies the same tax scheme to every country.

In order to construct the probability distribution of the pooled population, the individual probability distributions of various countries are scaled by their population shares and then the resulting probability distributions across countries is summed (see Eq. (7)). As a result, a single wage rate distribution (that integrates to one) is obtained in which any wage bin represents the probability interval of the pooled population.Footnote 11

Scenario 3

(Tagging) Social planner’s problem:

$$\begin{aligned} \underset{\{c(w), z(w)\}_{w \in W}}{\max } \sum _{k \in K} \int _{w \in W} G(U_{k}(w))f_{k^{\prime }}(w)dw \end{aligned}$$
(8)

subject to government budget constraint

$$\begin{aligned} \sum _{k \in K} \int _{w \in W} (z_{k}(w) - c_{k}(w))f_{k^{\prime }}(w)dw = 0 \end{aligned}$$
(9)

and incentive compatibility constraint

$$\begin{aligned} \dfrac{dU_{k}(w)}{dw} = \dfrac{z_{k}(w)}{w^2}v^{\prime }\Big (\frac{z_{k}(w)}{w}\Big ) \end{aligned}$$
(10)

where

$$\begin{aligned} f_{k^{\prime }}(w) = s_{k}f_{k}(w) \end{aligned}$$
(11)

S3 is conceptually similar to S2. The social planner maximizes the weighted sum of the welfares of all countries over a common budget constraint. This implies that cross-border transfers are possible. The difference in the notations of S2 and S3 stems from the notion that the social planner recognizes the differences in the probability distributions of the agents across countries while optimally choosing the tax schedules.Footnote 12 Therefore, the social planner is able to condition the law-of-motion between indirect utilities of the agents to an observable tag: the country of residence (see Eq. (10)). As a result, the final optimal tax schemes may differ across countries.

The countries’ individual probability distributions in S3 (\( f_{k}^{\prime }(w) \)) are different than those of S1 (\( f_{k}(w) \)). This reflects the social planner’s need of taking the population shares into account while optimally designing the country-specific tax schedules. To construct \( f_{k}^{\prime }(w) \), it suffices to scale countries’ probability distributions by their population shares. Note that, integration of the resulting probability distributions yields countries’ population shares, that is \( \int _{w \in W}f_{k}^{\prime }(w) = s_{k} \). Hence, the new probability of any wage bin in a given country incorporates both the probability of drawing that particular country and wage bin.Footnote 13 Probabilities of wage bins with the densities of S1 (\( f_{k}(w) \)), on the other hand, represents only the conditional probabilities (e.g., conditional on being in a country).

4 Calibration and simulations

In this section, I simulate the three scenarios introduced in Sect. 3 for 24 EU member states. The difficulty of obtaining analytical results for Mirrleesian optimal income tax problems is repeatedly mentioned in the literature. However, interpreting the results of simulations for a large set of countries would be challenging without a basic understanding of the role played by the wage rate distributions. Therefore, Appendix B.1 provides examples for two imaginary countries which are intended to build intuition by utilizing a simple applied framework. Main lessons are as follows. If a country, ceteris paribus, exhibits higher mean or standard deviation, this results in higher nationally optimal marginal income taxes. As a result of “tagging”, S3, social planner finds it optimal to increase marginal income taxes of such countries in order to raise tax revenue that can be redistributed to the other countries. Even when the two countries’ taxes are nationally optimal, pooling the two countries may not necessarily lead the population weighted average of the national tax schemes.

In the rest of this section, first, the specifications used in the simulations are presented. Second, the calibration procedure for the main simulations is introduced. Third, EU-level, country-level and within-country results are respectively presented. Finally, following a brief discussion on political feasibility, a more restrictive centralized taxation scenario is pursued.

4.1 Simulation specifications

All the simulations are performed with two different specifications:

$$\begin{aligned} U(c, z)= & {} c - \dfrac{(z/w)^{1+1/\epsilon }}{1+ 1/\epsilon }, \quad G(U) = log(U) \end{aligned}$$
(12)
$$\begin{aligned} U(c, z)= & {} ln(c) - \dfrac{(z/w)^{1+1/\epsilon }}{1+ 1/\epsilon }, \quad G(U) = U \end{aligned}$$
(13)

Specifications (12) and (13) differ with respect to their assumptions regarding the income effects on labor supply. Unlike in specification (12), labor supply is sensitive to the income effects in (13). As mentioned in Sect. 3, social welfare function is chosen such that the objective functions of the resulting maximization problems are concave on consumption. Hereafter, I consider (12) as the main specification and report the results for (13) only when the qualitative conclusions differ.Footnote 14

Considering a generalized CRRA social welfare function, the logarithmic specification corresponds to the case where inequality aversion (denoted by \(\rho \)) equals to one. In Appendix B.6, I consider social planners with different aversions to inequality, that are \( \rho = 0.5 \) and \( \rho = 2 \) for the main specification that does not exhibit income effects on the labor supply.

4.2 Calibration to Europe

Simulations require identifying the densities (\( f_k(w) \)) of the national wage rate distributions. This section recovers the parameters of the distributions of 24 EU countries for which data are available in the 2014 wave of Structure of Earnings Survey (SES).Footnote 15

The universe consists of individuals between 20 and 60 years old who work full-time. I convert earnings of the agents in all countries into a comparable currency unit, purchasing power standard (PPS). See Appendix A.1 for a description and further details about the preparation of the dataset. In Appendix B.4, I consider the case in which the social planners do not take differences in purchasing powers into account and set optimal tax schemes over nominal Euro units.Footnote 16

Next, the hourly wage rate distributions of the four regions of Europe (east, west, north and south) are descriptively characterized in order to gain a rough understanding into the differences in the skill levels across regions. This exercise is also helpful in identifying appropriate supports for the wage rate distributions (more details on this are below). The presentation of those graphs appears in Appendix A.2.

Assuming gross hourly wage rates follow a log-normal distribution in each country, I separately estimate the means and standard deviations of national wage rate distributions.Footnote 17 Subsequently, populations of the countries are recovered by summing the frequency weights of the individuals in the survey.Footnote 18 Table 1 presents the resulting parameters of the wage rate distributions and population shares used in the simulations. Note that, in line with the recent literature, I perform a robustness check where underlying ability distributions in each country are assumed log-normal appended with a Pareto tail. See Appendix B.7.

Table 1 Parameters of log-normal wage rate distributions and population shares

Figure 1 illustrates the parameters of countries’ log-normal wage rate distributions and population shares in a bubble chart. Given the two-country examples, this visual representation is helpful to gain insights into the outcomes of the simulations. In S1, optimal marginal taxes (and redistribution) increases for the countries higher up and more to the right in the graph. Therefore, the countries in the upper right corner (e.g., the United Kingdom and Germany) are likely to be the least affected ones by an increase in distortion that can be brought about by S2. In S3, on the other hand, countries in the upper part of the graph are expected to be more distorted; whereas those in the lower portion would be expected to be less distorted. In the scenarios involving a common tax and transfer system (S2 and S3), we may presume that countries at the top of the graph lose the most due to the transfers to the countries with low mean wage rates.

Fig. 1
figure 1

Bubble chart of wage rate distribution parameters. See Table 1 for a guide to country abbreviations. Axes x and y respectively correspond to the means (\(\mu \)) and the standard deviations (\(\sigma \)) of the estimated log-normal wage rate distributions. Bubble sizes represent population shares (\( s_k \)) of the the corresponding countries within the 24 countries in the sample

The supports of the wage rate distributions are chosen as [1, 50]. The right panel in Fig. 8 is helpful for the discussion of this choice. Setting lower support to one is essential to capture the poor households residing mostly in Eastern Europe. On the other hand, only a small portion of individuals earn more than 50 PPS per hour compared to the rest of the population, irrespective of the region of residence. Hence, 50 was selected as the upper support and only very high wages earned by a miniscule portion of the population are excluded.

Wage rate distributions have to be discretized for the simulations. I split the interval [1, 50] into 20 equal-sized wage bins and follow Mankiw et al. (2009) to compute the probabilities of the bins. See Appendix B.2 for a brief description of the discretization procedure utilized. Finally, I set the elasticity of labor supply (\( \epsilon \)) to 0.5, a commonly employed value in the literature. Appendix B.5 provides robustness checks for the main results by using two different values for \(\epsilon \), 0.33 and 0.75. Employing the elasticity estimates for different countries, provided in Bargain et al. (2014), Appendix B.9 discusses the potential implications of accounting for (potentially) heterogeneous labor supply elasticities.

4.3 Results

Results are presented in three layers. First, the implications of the three scenarios for overall Europe are investigated. Next, changes in economic outcomes are examined by country. Finally, national tax schemes are explored in order to analyze the effect of three scenarios on within country distributions of welfare. Note that interpretation of most results require a point of comparison. In what follows, S1 is considered as the benchmark for the purpose of comparing the scenarios involving a common tax and transfer system to the discrete maximization.

While calculating the aggregate welfares in the EU or country level (in Sects. 4.3.1 and 4.3.2 respectively), final measures of the numerical values should be comparable across scenarios. For clarity, I compute the EU and country level welfares respectively as of measure N (that equals 24, the number of countries in the database) and one in all three scenarios. In order to calculate welfare by country, it suffices to integrate the utilities of the agents within countries with the densities used in the problem of S1, that is \( SWF^{i}_{k} = \int _{w \in W}U(w)f_k(w)dw \) where \( i = S1, S2, S3 \). After computing welfare levels of each country as of measure one, total welfare in the EU can be calculated as \( SWF_{EU}^{i} = N\sum _{k \in K}s_kSWF_{k}^{i} \), \( i = S1, S2, S3 \). Notice that, due to scaling by N, \( SWF_{EU}^{i} \) is of measure 24.

4.3.1 EU-Level

When assessing the welfare implications of three scenarios, it is useful to recall the discussions found in Sect. 3. Two aspects are of particular importance: cross-border transfers and the possibility of conditioning tax schemes to the country of residence. In S1, the social planner can differentiate the tax schemes across countries but cross-border transfers are not possible. In S2, the picture is reversed: there can be transfers across borders but the social planner applies the same tax scheme to every country. In S3, both cross-border transfers and differentiating the tax schemes across countries are possible. Note that the instruments of the social planners in S1 and S2 are a subset of the third social planner’s instruments. As a result, S3 is welfare improving compared to S1 and S2.

The relationship between the total welfare of S2 and S1 is ambiguous. The possibility of cross-border transfers in S2 undoubtedly generates redistributive benefits and contributes to an improvement in social welfare. At the same time, inability of tailoring tax schemes for different ability distributions in different countries has efficiency costs. For example, a country with low level of inequality would not require high labor distortions in order to meet its redistributive requirements. However, if an integrated tax system requires high labor distortions, that country would also have to bear the burden of high marginal income taxes.

The EU-level results with the main specification (given in Eq. (12)) are presented in Table 2. The second row shows the percentage changes in total welfare compared to the benchmark. Improvement of welfare in S2 suggests that benefits of cross-border transfers outweighs the losses arising from inability of differentiating the tax schemes across countries. As expected, S3 exhibits the highest level of social welfare.

Table 2 Results at the EU-level (without Income Effects)

In spite of the increase in total welfare, the third row of Table 2 indicates that approximately 68% of all the agents in the EU are worse off in the scenarios of EU-wide tax and transfer system. This is due to the fact that some of high-mean countries, such as Germany, France and the UK, exhibit very large population shares (see Fig. 1). Because these countries transfer resources in S2 and S3, they become worse off. As a result, a large fraction of the agents within Europe lose in terms of indirect utility.Footnote 19

As shown in the last row of Table 2, S2 and S3 lead to a contraction in total output respectively by 2.65% and 0.62% compared to S1. The next section provides the discussion of the underlying reasons for this outcome.

Table 3 Results at the EU-level (with income effects)

Table 3 presents the same set of results for the specification with the income effects (given in Eq. (13)). Notice that most of the results are both qualitatively and quantitatively similar, except for the efficiency implications of S3. In this case, S3 yields an expansion in total output by 1.53% due to the social planner’s ability to exploit the income effects on labor supply. More is written on this topic in the next section.

4.3.2 Country-level

For the purpose of general illustration, Fig. 2 depicts transfers by country in S2 and S3. The left panel consists of countries that transfer resources after the implementation of a common tax and transfer system; countries on the right panel are receivers of those transfers. Note that sorting countries according to the percentage welfare loss (compared to benchmark) yields exactly the same ranking. Hence, it is not reported here.

Fig. 2
figure 2

Transfers (as a fraction of total gross income) in European tax and transfer systems. See Table 1 for a guide to country abbreviations. Notice the difference between x-axis scales of left and right panels. The left panel and the right panel respectively incorporate the countries that transfer resources and the receivers of the transfers in S2 (blue) and S3 (red) (colour figure online)

Ranking of the countries in Fig. 2 is almost entirely consistent with the ranking of the means of national wage rate distributions. Therefore, it can be concluded that mean is the most important parameter in determining the winners versus the losers as well as their respective rankings. Two exceptions, however, are the UK and Estonia. Recall from the two-country examples that countries with a higher standard deviation transfer resources. Although the UK has a lower mean compared to Sweden and Finland (see Table 1), it transfers more resources because it has a much higher standard deviation. Analogous reasoning holds for Estonia.

In Fig. 2, it is interesting to observe that results of S3 is only an amplified version of those in S2. Countries that win (lose) in S2 continue winning (losing), but to a greater extent. In essence, ability of conditioning tax schemes to the country of residence allows the social planner to exploit the tax base of high-mean countries more effectively in order to increase redistribution to the low-mean countries.

Overall, ex-ante losers of fiscal integration are mostly located in northern (Denmark, Finland, Sweden and the UK) and western (Belgium, Denmark, France, the Netherlands and Luxembourg) Europe. Outwards transfers relative to total gross income are highest in Denmark, around 13% and 24% respectively in S2 and S3. Southern countries (Cyprus, Spain, Malta, Portugal and Slovenia) would receive transfers; however Italy would become a provider of transfers by a small margin. All of eastern Europe wins with Bulgaria and Romania being at the top of the list.Footnote 20

Hereafter, results are presented for four countries that represent different segments of Fig. 1. Sweden represents the high mean-low standard deviation segment; Germany represents the high mean-high standard deviation segment; Slovakia represents the low mean-(relatively) low standard deviation segment; and Poland represents the low mean-high standard deviation segment. Table 4 shows more detailed results with the main specification for these four countries. Results of all 24 countries are provided in Appendix B.3.

Table 4 Results by country (without income effects) for the three scenarios

The first panel in Table 4 presents welfare and transfer levels by country in the three scenarios. Consistent with the size of transfers in S2 and S3, the welfare levels of the four diverse countries progressively approach each other. Germany transfers more resources because it exhibits a higher mean wage rate than Sweden and, therefore, loses a greater fraction of its welfare. Analogously, the welfare gain of Slovakia is higher compared to Poland.Footnote 21

Panel 2 of Table 4 showcases the income weighted marginal tax rates and percent changes in total output compared to S1. Among the four countries of interest, income weighted average marginal tax rates increase for all in S2 compared to the benchmark, except for Germany. Table 8 further suggests that Germany is the only country with an, albeit very slight, declining distortion among all 24 countries. The reasons for Germany’s uniqueness is discussed in the next section. The rise in marginal tax rates for almost all countries is consistent with the intuition built with the example of a pooled tax scheme demonstrated in Fig. 11. Pooling countries leads to a highly dispersed gross earnings distribution which requires higher marginal tax rates at the optimum compared optimal marginal taxes of S1. As a result, total output declines in all of the countries except for Germany, explaining the overall contraction presented in Tables 2 and 3.

Finally, in line with the two-country examples, income weighted average distortions increase for the high-mean countries in S3, and decrease for the low-mean countries. As a result, high-mean countries contract and low-mean countries expand. Because total income generated in the high-mean countries is greater, S3 still leads to a contraction in total output (see the last column of the last row in Table 2).

Table 5 Results by country (with income effects) for the three scenarios

Simulation results with specification (13) are shown in Table 5. Because the qualitative nature of the results for S1 and S2 do not change, they are not discussed further. On the other hand, there are notable differences in the results for S3. The rest of this section therefore focuses on the results for S3 in Table 5. Note that transfer receiving countries enjoy higher utility levels than countries providing transfers. This is consistent with the previous findings in the literature of tagging (see, e.g., Mankiw and Weinzierl 2010). The main motive to implement tagging is the assumption of wage rates being correlated with the tagged groups. Hence, tagging is a motion from no information end to the perfect information end. In this case, the social planner finds it optimal to extract higher labor effort from more productive agents because marginal disutility of an extra unit of output is lower for them. This causes total welfare of high-mean countries to be lower at the optimum. Moreover, it reverses the sign of the change in total outputs across countries. In contrast to the main specification, high-mean countries expand whereas low-mean countries contract.

The social planner is only able to accomplish this when the utility function incorporates the income effects on labor effort. Equations (14) and (15) are the first-order-conditions of the agents with respect to labor supply respectively for specifications (12) and (13).

$$\begin{aligned} T'(z)= & {} 1 - \Big (\dfrac{1}{w}\Big )\Big (\dfrac{z}{w}\Big )^{1/\epsilon } \end{aligned}$$
(14)
$$\begin{aligned} T'(z)= & {} 1 - \Big (\dfrac{1}{w}\Big )\Big (\dfrac{z}{w}\Big )^{1/\epsilon }c \end{aligned}$$
(15)

Because income effects are assumed away in (14), higher marginal tax rates directly translate into a lower labor supply. On the other hand, in (15), an increase in the marginal tax rate together with an increase in the labor supply can be justified if net income (c) declines sufficiently. Hence, the social planner is able to extract higher labor effort from high mean countries such as Germany and Sweden in spite of increasing income weighted marginal tax rates compared to the benchmark S1. The analogous holds for Poland and Slovakia. In contrast to the total output implications of S3 with specification (12), specification (13) yields an expansion due to the higher labor supply from more productive countries (see the last column of the last row in Table 3).

For the convenience of interpersonal utility comparisons, heterogeneities in preferences are assumed away in the simulations, similar to the bulk of the normative policy analysis literature.Footnote 22 On the other hand, Bargain et al. (2013a), using the data for married women, shows that there are considerable differences over preference for work across 11 European countries and the US. They show that the ranking of individuals in welfare distribution across countries can be altered by the choice of welfare metric, once heterogeneities in preferences are accounted for. Thus, there might be ramifications for cross-country redistribution.

According to the findings of Bargain et al. (2013a), Nordic countries together with Portugal and Belgium exhibit higher willingness to work, whereas Austria, Germany, Ireland and Netherlands are more work averse. A welfare metric that evaluates individuals with higher willingness to work as better off, e.g., “rent metric” of Bargain et al. (2013a), would favor work averse countries in terms of redistribution. Hence, in such a scenario, one might expect that part of the outwards transfers from countries such as Germany and Netherlands might be taken over by, for example, Nordic countries and Belgium. Furthermore, if low-productivity countries exhibit higher willingness to work on average compared to the high-productivity countries, the overall extent of cross-country transfers might decline.Footnote 23 Analogous of the interpretations would hold for a welfare metric that considers work averse countries as better off.

4.3.3 Within-country

In this section, I examine national tax schedules in order to gain insights into the changes in distributions of welfare within countries. Note that the tax schedules in this section are presented over PPS units of income. But, it would suffice to multiply PPS units with PPPs presented in Table 7 in order to translate the schedules into national currency units.

The upper panel of Fig. 3 depicts the marginal and average tax rate schedules of Germany implied by the three scenarios. As emphasized in the previous section, the difference in the optimal marginal tax rate schemes in S1 and S2 is hardly noticeable. This is because Germany exhibits both a high mean and a high standard deviation in the wage rate distribution and hence, requires a higher marginal tax rate schedule in S1 compared to other countries.Footnote 24 As a result, Germany is one of the least affected countries from the increase in distortion brought about by S2. Consistent with the two-country examples, marginal tax rate scheme shifts upwards in S3. Average tax rate schemes, on the other hand, gradually shift upwards and to the left, indicating that redistribution to the poor declines despite increasing taxes on the rich.

Fig. 3
figure 3

Resulting tax schedules of the three scenarios (Germany and Sweden). Marginal tax rates are calculated according to Eq. (14). Average tax rates are given by \( (z(w)-c(w))/z(w) \) where z(w) and c(w) respectively denote the gross and net income for an agent of type w.

Results for Sweden are demonstrated in the lower panels of Fig. 3. Because of exhibiting a low standard deviation, optimal marginal tax rates decline sharply after low incomes in S1. Therefore, in contrast with Germany, middle and high incomes are exposed to increasing marginal taxes in S2. In S3, the marginal tax rate schedule shifts upwards, just as in all other countries with a relatively higher mean. Movement in the average tax rate scheme across scenarios is similar to Germany. The main difference is that taxes paid by the rich are the highest in S2 due to the sharp increase in marginal taxes compared to the benchmark.

Figure 4 illustrates the percentage of changes in the indirect utilities of 20 wage bins in Germany and Sweden for S2 and S3 compared to benchmark. In the countries that transfer resources, the poor might be the worst off because of sharing the benefits of redistribution with other countries. Moreover, the rich might be the worst off due to the increased average taxes. Figure 4 suggests that the former dominates in both countries in S3. In S2, the poorest are the worst off in Germany, whereas the richest lose more, on average, in Sweden. The different result in Sweden is caused by the sharp increase in the average taxes paid by the rich in S2. Although not presented here, this result holds for all high-mean countries that exhibit relatively low standard deviation. As explained above, due to having a low standard deviation, high incomes are exposed to a sharp increase in distortion and average taxes in S2.

Fig. 4
figure 4

Changes in indirect utilites relative to S1 (Germany and Sweden). Left (right) panel depicts the changes in the indirect utilites of the agents in S2 (S3) compared to S1

Figure 5 presents the tax schemes for the low-mean countries, Poland and Slovakia. Similar to the case of Sweden, the increase in distortion brought about by S2 is higher for Slovakia which exhibits relatively low standard deviation. In S3, consistent with two-country examples, marginal tax rate scheme shifts downwards for both of the countries. In contrast to high-mean countries, the average tax rate schedule gradually moves downwards and to the right suggesting an increased redistribution towards the poor and lower taxes for the rich. Only exception is the very high incomes of SK who pays the highest taxes in S2 as is the case in Sweden.

Fig. 5
figure 5

Resulting tax schedules of the three scenarios (Poland and Slovakia). Marginal tax rates are calculated according to Eq. (14). Average tax rates are given by \( (z(w)-c(w))/z(w) \) where z(w) and c(w) respectively denote the gross and net income for an agent of type w

Analogous to the case of high-mean countries, poorer households in low-mean countries are better off due to the increased transfers (flowing from the high-mean countries), whereas a reduction in average taxes leads to utility gains for richer households. Figure 6 indicates that transfers to the bottom of the distribution dominates the latter, leading poorer in both Poland and Slovakia to gain the most in the scenarios involving a European tax and transfer system.

Overall, it can be concluded from this section that the biggest winners of implementing a common tax and transfer scheme are the poor in low-mean countries. In S2, relative changes in indirect utilities within high mean countries are dependent on the standard deviation. If the wage rate distribution of a country exhibits relatively low standard deviation, richer households in that country lose the most in S2 as a result of sharply increased distortion and average taxes. Otherwise, the poor lose the most because they have to share the tax revenues of their country with the low-mean countries. In S3, on the other hand, utility gains of the poor in low-mean countries come at the expense of the poor in high-mean countries irrespective of the standard deviation.

Fig. 6
figure 6

Changes in indirect utilites relative to S1 (Poland and Slovakia). Left (right) panel depicts the changes in the indirect utilites of the agents in S2 (S3) compared to S1

Finally, it is worth noting that changes in income taxes might trigger imperfect movement of labor and capital. This is less concerning for S2 in which the tax schemes are the same across countries. On the other hand, implementing S3 might lead labor and capital to move from transferring countries, which experience increased labor distortions compared to the benchmark, towards transfer receiving countries. Movement of (productive) labor would likely contribute to the redistribution of tax revenues from high-tax to low-tax countries.Footnote 25 Furthermore, if labor and capital are complements in the production function, capital mobility might induce productivity redistribution in favor of low-tax countries. As a result, the optimal cross-country transfers of S3 might be achieved via smaller increases in the marginal income taxes of high-productivity countries compared to the findings of this section.

4.3.4 Political feasibility and a more restrictive tax and transfer system

A centralized tax and transfer scheme in Europe clearly has major redistributive benefits. This is consistent with the long-standing agenda of Europe that aims at mitigating economic disparities in the region and maintaining a reasonable standard of living for all of its citizens. Additionally, while it is not the focus of this study, Bargain et al. (2013b) argues that a European tax and transfer system would help providing macroeconomic stability for credit-constrained countries. They find that, after replacing one-third of national tax systems with an EU-wide tax scheme, 10–15% of a macroeconomic income shock would be absorbed. On the other hand, for a given ability pool in Europe, an integrated taxation system must generate direct costs that are needed to be borne by high-productivity countries. This study can also be seen as an attempt to clarify the extent of such costs at the optimum.

It is highly probable that high-productivity countries would not volunteer for a centralized taxation system in order to avoid such costs. Indeed, both in S2 and S3, almost every individual in high-productivity countries become worse off compared to the benchmark. Is there a possible scenario where a European tax and transfer system is still beneficial but extreme costs to the highly productive countries are avoided? As an attempt to address this question, I investigate a more restrictive case of a European tax and transfer system in the rest of this section.

Social planners of S2 and S3 solve their optimization problem by considering an additional constraint. That is weighted sum of utilities should not be less than S1 in any country. Formally, I add the following constraint to S2 and S3:

$$\begin{aligned} \int _{w\in W} U_{k}^{r}(w)f_k(w) \ge \int _{w\in W} U_{k}^{S1}(w)f_k(w) \quad \forall k \in K \quad where \quad r \in \{S2,S3\} \end{aligned}$$
(16)

In this scenario, the social planners still have concavity in their objective function and, hence, concerned about redistribution. On the other hand, they impose the restriction that linearly aggregated social well-being should improve in every country. This still does not guarantee that countries that transfer resources to the others would voluntarily enter to a centralized tax system. If a country aggregates individual well-being with a concave social welfare function, it might still be worse off compared to the benchmark under the new scenario.Footnote 26 Nevertheless, with the new restriction, the social planners of S2 and S3 ensure that a certain fraction of the workers become better off in the countries that transfer resources.

Table 6 Results at the EU-level with linear country welfare improving system (without income effects)

EU-level results are presented in Table 6. With the new restriction, Europe becomes worse off as a whole in S2. When the government cannot differentiate taxes by country of residence, the social planner prioritizes efficiency and it is optimal to set marginal income taxes very low (not reported here) to ensure that at least richer households are going to be better off in a centralized tax system. As a result there is substantial expansion in the total output but very little redistribution to the needy. Hence, social welfare in the EU, which is aggregated by a logarithmic function, declines. Thus, I do not pursue S2 in the rest of this section.

S3 still improves social welfare, albeit by a lower magnitude compared to the fully optimal taxation system. Under this scenario, there is still a sizable expansion in total output while almost half of the population in Europe is better off. Indirect utility of every individual in transfer receiving countries improve. In addition, a considerable fraction of the individuals residing in the transferring countries, from 27% in Cyprus to almost 50% in Sweden, do not experience a decline in their well-being. By country fractions of individuals that are better of in S3 are reported in Appendix B.8 (Fig. 19).

Fig. 7
figure 7

Transfers (as a fraction of total gross income) with linear country welfare improving system in S3. See Table 1 for a guide to country abbreviations. Notice the difference between x-axis scales of left and right panels. The left panel and the right panel respectively incorporate the countries that transfer resources and the receivers of the transfers at the full optimum (see Fig. 2)

Figure 7 presents the country-level transfers for S3. In order to enable an easy comparison to the full optimum, placement of countries across the panels are the same as Fig. 2. Notice that Portugal, Malta, Cyprus, Slovenia, Poland and Portugal, if anything, becomes slight losers of the new system. While outwards transfers do not exceed 2% of the total output of the transferring countries, these transfers still correspond to a sizable fraction of the total output of the transfer receiving countries. Finally, in contrast to the full optimum case, optimal marginal taxes decrease under the new scenario for transferring countries compared to the benchmark. By lowering the tax marginal tax rates, the social planner is still able to extract resources from high productivity countries without making the rich worse off. Country specific tax schemes and distribution of within-country welfare implied by S3 can be found in Appendix B.8 (respectively Figs. 20, 21).

Finally, in order to structure the discussion of political feasibility further, I implement the test proposed by Bierbrauer et al. (2020). According to Bierbrauer et al. (2020), the following statements are equivalent—as long as the tax reforms are monotonicFootnote 27: “The median voter benefits from a reform”, “There is a majority of voters who benefit from a reform”. In the setting of this study, the single-crossing condition is satisfied in all scenarios and, thus, the tax reforms are monotonic. It is more convenient to explore whether the latter statement holds within each country. I find that if a country transfers (receives) resources, majority of that country is worse (better) off compared to the benchmark scenario. Thus, departing from Fig. 2, it can be concluded the median voter of 14 out of 24 countries is better off with S2 or S3 studied in the previous section. S3 of this section, on the other hand, performs worse based on the criteria of Bierbrauer et al. (2020). In this scenario, only 8 out of 24 countries receive transfers for the sake of avoiding high costs to highly productive countries (see Fig. 7). Thus, the median voter benefits from the reform in 8 countries.

5 Conclusion

This study analyzes the equity-efficiency implications of a European tax and transfer system in a Mirrleesian setting. Using the Structure of Earnings Survey 2014 wave, three different income taxation scenarios are calibrated and simulated for 24 member states of the EU. In S1, every state discretely solves its own maximization problem. In S2 and S3, welfare in the EU is maximized with respect to a common budget constraint. In S3, the social planner can condition taxes to the country of residence whereas the social planner of S2 is restricted to apply the same tax scheme to every country. For the purpose of comparing the common tax and transfer systems to discrete maximization, S1 is considered as the benchmark while discussing the numerical results.

In spite of the increase in total welfare, more than two-thirds of the households become worse off in S2 and S3 compared to the benchmark. When the social planner is unable to differentiate the tax schemes across countries (S2), total output contracts by 2.65%. Implications of S3 on the total output is sensitive to the assumption regarding the income effects on labor supply. When the income effects are assumed away, S3 leads to a contraction by 0.62% while the specification with the income effects yields an expansion of 1.53%.

Countries with high mean transfer resources to low-mean countries. Because the size of the transfers are higher in S3 compared to S2, total welfare decreases (increases) gradually in high- (low-) mean countries when moving from S1 to S3. Pooling the populations in S2 results in a highly dispersed gross earnings distribution, leading to increased distortion (and reduced output) in almost all of the countries. In S3, on the other hand, high-mean countries contract, whereas low-mean countries expand when the income effects are assumed away. When the utility function allows for the income effects, the conclusion is reversed, explaining the contrast in the total output implications of S3.

Poorer households in low-mean countries gain the most from both scenarios involving a common tax and transfer system. Within high-mean countries, standard deviation of the wage rate distribution determines the segment of the population that experiences the highest decline in indirect utility. In countries with a low standard deviation (e.g., Sweden), the richest lose the most due to much higher taxes and distortion compared to the benchmark scenario. The poorest lose the most in countries with a high standard deviation (e.g., Germany) because of sharing the redistributive taxes with the poorest in low-mean countries. In S3, the latter effect dominates in all high-mean countries, rendering the problem as a trade-off between the poor in high- and low-mean countries.

The outcomes of S2 and S3 lays out the potential costs of an integrated tax system to the high productivity countries. It is possible to limit the costs by imposing the restriction that linearly aggregated social welfare must not decline in any country compared to the benchmark. Inferring from the size of the transfers and the fractions of individuals that become better off in every country, such a scenario seems more politically feasible compared to the full optimum case. On the other hand, with the new constraint, S2 yields a reduction in aggregated well-being, whereas S3 improves social welfare by about 1%.

The analysis in this study can be extended in multiple directions. First, it would be interesting to examine the effect of cross-country heterogeneities in redistributive preferences and labor supply elasticities on the model outcomes. Second, the assumption of immobile workers can be removed to investigate the optimal tax schedules under the trade-off between enhanced redistribution and productivity-enhancing migration as in Kessing et al. (2020). Finally, incorporating extensive margin labor supply decision to the model seems promising for future research.