Introduction

In the framework of spatial economics, a theme of paramount relevance is represented by analyzing how firms and employees decide to locate in a specific region.

Understanding the settlement process of individuals is also essential for planners that should define appropriate policies able to attract new employees in their jurisdictions. Obviously, where the new location will occur and how much employment will be generated are strictly intermingled. Still, in the literature, the crucial link between location choices and the firm's size has not been investigated in depth. Few exceptions can be found. The most influential for the subsequent literature on location choices is the work of Carlton (1983), who simultaneously models (via duality theory) both the decisions of where to locate and how many employees to hire. Recently, in the context of transport and land-use modelling, Hensher et al. (2019) present a formal model in which the number of firms influences the number of jobs at a specific location. De Palma et al. (2019) model the origin of cities and urban hierarchy within a framework with two types of agents characterized by heterogeneous preferences.

On the theoretical side, different strands of the literature have clearly recognized that location processes are related to the interplay between, on one side, various sources of external economies that contribute to agglomeration, and on the other side, congestion costs, competition and the spatial distribution demand that tend to facilitate dispersion. Besides, within the evolutionary perspective, it is asserted that economic systems are pervaded by endemic uncertainty and that regional entrepreneurship tends to be a path-dependent process, being also the outcome of heterogeneous behaviours of agents characterized by bounded rationality. There is overwhelming evidence that new firms do not opt for optimal locations in terms of cost minimization. Still, they are affected by local structures that originated in the past, social interactions, and the regional knowledge base (new firms typically exploit local knowledge and skills) (Boschma and Frenken 2018).

In the empirical literature, given the discrete nature of location choices, they are typically modelled through the McFadden–Carlton conditional logit approach (McFadden 1974; Carlton 1979, 1983). This approach provides sound theoretical underpinnings of empirical models, even if they refer to a theoretical objective function of a representative profit-maximizing agent. Moreover, as Carlton (1983) suggested, the idiosyncratic reasons behind each choice of the preferred location are in the error term, and the reasons behind the labour demanded and firm size.

Starting from these premises, this paper aims to provide a more general framework that allows deriving spatial patterns of firms and employees from individual site selections based on bounded rationality. We consider that decision-makers are different in their attitude regarding centripetal and centrifugal forces, and this heterogeneity within the same group of economic agents can lead to different spatial structures. In particular, we here adopt an evolutionary location approach by considering that the selection of a region by firms and employees can be modelled, in a probabilistic context, as a ball extraction from urns.

Urn models are particularly versatile for the modelling of preferential attachment problems, as in our case (for an overview, see Johnson and Kotz 1977). Hence, in the proposed setting, we adopt such a perspective; specifically, we follow a Polya urn approach (see Mahmoud 2008). Polya urns are constructed as standard boxes with coloured balls, with the relevant characteristics that additional balls of some colours are added or removed before the phase of extraction. In doing so, such probabilistic models provide a time-dependent probability of taking a given color from the urn—hence, describing the paradigmatic cases of the changing of the conditions in the attachment process. The versatility of the Polya urn models explains why they are so popular in economic theory, and why they are suitable for applications in the field of economic geography with special reference to path-dependence that is considered a typical property of industrial dynamics (see e.g. Arthur 1994; Bottazzi et al. 2007; Martin 2012). Interestingly, Polya urns intervene also in the theoretical explanation of the rank-size laws when the position of the sizes at given ranks is viewed as a discrete attachment process (see e.g. Ausloos and Cerqueti 2016; Cerqueti and Ausloos 2015).

As we will see, extraction procedure and urns composition will be suitably conceptualized to describe the location decision criteria of the actors. In so doing, urn theory allows modelling agents' preferences in a flexible way, accounting for differences in how heterogeneous agents put a different relative to positive externalities due to spatial interaction instead of local congestion costs, competition and the spatial distribution of demand.

This general framework generates either:

Equally spaced firms, so that positive externalities and natural advantages do not influence location patterns and firms are distributed over the regions accordingly to a uniform distribution; or, differently, uneven distribution of firms across regions, hence pointing to the presence of aggregation effects.

Equally sized firms, so that the distribution of firms and employees are overlapping; or, differently, disproportionality between employees' distribution and firms' distributions for each sector. In this second case, we may infer that firms' size heterogeneity within the same sector affects this scenario.

Thus, our setting allows us to account for a rich set of location patterns despite its great simplicity.

Equally important, it provides a background for two main features usually observed in the empirical distributions of firms and employees. First, the divergence of firms from the regional uniform distribution in a specific industry. Second, the divergence between the spatial patterns of firms and employees in a specific industry.

The rest of the paper is organized as follows. Section Related Literature. How to Place our Contribution in the Field of Economic Geography? reviews the related literature, Section The Model is devoted to the conceptualization and development of the model. The cases of firms and employees are conveniently distinguished. Section A Discussion of Advantages and Limitations of Urn Theory with Reference to the Literature discusses the benefits of the model concerning the research gap we are addressing, its limitations and possible extensions. Section Conclusions and future research offers some concluding remarks, along with the identification of future lines of research.

Related Literature. How to plAce Our Contribution in the Field of Economic Geography?

The present contribution can be placed at the interface between the traditional New Economic Geography and the institutional and evolutionary approaches on the theoretical side. The urn model provides microeconomic foundations to the spatial distribution of firms and employees based on the interplay between, on one side, different sources of external economies that contribute to agglomeration, and on the other side, to congestion costs and competition that facilitate dispersion. In the well-known synthesis of the New Economic Geography, location patterns are modelled as the results of factors such as demand distribution in space, transport costs and agglomerative forces such as increasing returns to scale (internal to the firm). Among these factors, the complex role of agglomeration economies has received much interest, even in the early contribution of the traditional location theory (Weber 1929; Hoover 1948; Marshall 1964), in theories of central place systems (Christaller 1933, Lösch 1954) and in the less-known von Thünen's work (von Thünen 1826) on the mechanism of industrial agglomeration and city formation (see Fujita 2012). For recent surveys of the various agglomeration and dispersion forces considered in the literature, see Duranton and Puga (2004), Fujita and Thisse (2013), and Behrens and Robert-Nicoud (2015).

Besides, our modelling approach borrows several insights from the study of the location of new firms from a micro-evolutionary perspective based on the hypothesis of bounded rationality (Pred 1967) and the Behavioural Location Theory (Townroe 1991). The evolutionary economic perspective provides a rich view for understanding the spatial distribution of firms and employees and especially its dynamics (Boschma and Frenken 2006, 2007, 2018). Entrepreneurship not only tends to be a geographically localized phenomenon, but it is also a spatially uneven process that tends to persist over time. This implies regional entrepreneurship tends to be a path-dependent process (see Boschma and Martin 2012; Martin and Sunley 2012). The intention is not to deny that location-specific characteristics may matter (see, e.g. Boschma and Lambooy 1999; Martin and Sunley 2012; Boschma and Martin 2012). However, instead of saying that locations matter in a deterministic manner, an evolutionary approach to spatial clustering accounts for dynamic processes and pre-existing structures in regions are expected to condition but not determine their spatial outcome. Our approach also falls under evolutionary economic geography since it accounts for path dependence in which each event changes the probability of a subsequent event to occur (David 2000; Arthur 1989). In this view, path dependence does not simply arise from the assumption of increasing returns, as is the case in NEG models.

On the empirical ground, alongside advancements in theory, empirical studies addressing the role of several location determinants continue to increase. In the empirical literature on regional and urban economics, the determinants of industrial location decisions have been investigated following two main methods: discrete choice modelling and count data modelling. The former method includes both the characteristics of the decider (firms, employees, etc.) and those of the available regions and evaluates them as distinct factors; the latter deals with the territories' features. Specifically, discrete choice models are used when the unit of analysis is the economic agent, while counting data models are adopted when micro-geographical data are not available and, therefore, the units of study are regions (see Arauzo-Carod et al. 2010 and Bhat et al. 2014 for a critical assessment of the methods and results). Although mainly developed in the context of standard theories, empirical modelling of location decisions has gone beyond fully deterministic approaches (e. g. conditional logit). It now allows for the incorporation of taste variation in decision-makers (“mixed logit” or “random parameter multinomial logit” models). Still, a systematic approach to industrial location modelling has not been found. Discrete microeconomic data were typically analyzed following the random utility maximization (RUM) approach put forward by McFadden's (1974) seminal article. Carlton (1979, 1983) applied the conditional logit model (CLM) and first demonstrated that location choices of branch plants can be modelled in a random utility maximization setting, as suggested by McFadden. Therefore, the main advantage of the McFadden–Carlton CLM approach is that the empirical setting is explicitly embedded in the theoretical random utility maximization framework.

The McFadden–Carlton CLM approach has become commonly adopted in the literature to investigate location choices of firms and households (for reviews, see Schmidheiny and Brülhart 2011 and Arauzo-Carod et al. 2010).

Although, as suggested by Carlton (1983) and more recently by Guimaraes et al. (2004) and Arauzo-Carod et al. 2010, the Independence of Irrelevant Alternatives (IIA) assumption of the error term constitute a limitation. In other words, once controlling for the observable characteristics of regions and individuals included in the model, the error term captures all the rest, that is the unobservable characteristics. It may lead to infer that all locations are similar to the decision-makers. If this assumption is violated, estimated coefficients can be biased. The proposed urn theory approach overcomes this severe limitation.

Recent studies have also resorted to the Poisson (count) models to tackle the problem of modelling complex choice scenarios with many small spatial units. Its theoretical justification based on the random utility maximization approach was provided by Guimaraes et al. (2004).

Yet, there are still open questions. For example, Schmidheiny and Brülhart (2011) show that the CLM and the Poisson models differ starkly in terms of their implied predictions and that all intermediate cases can be represented as a nested logit model. Moreover, differently from the Poisson model, the conditional logit does not appear appropriate if one wants to model the location process as a dynamic one since it implies.

"that the aggregate number of agents is fixed and that differences across locations affect only the distribution of those agents across those locations" (Schmidheiny and Brülhart 2011). Nevertheless, as pointed out by Bartik (1985, p. 16).

"the conditional logit approach remains attractive because of its computational feasibility compared with other alternative approaches to the discrete choice problem.".

The Model

Motivation: In Search of a Theoretical Foundation for Well-known Empirical Regularities

The present location model unifies the settlement process of firms and employees in a stochastic setting. In the real world, entrepreneurs may have limited information about the actual conditions of regional attributes driving the choice of where to open a new branch. Moreover, the firm generally is active for many years during which all the relevant location factors change over time. Accordingly, the actual location choice should be thought at least as an optimization process subject to uncertainty, or even a decision-making process following satisfying principles.

The implications of uncertainty on individual location choices have been well recognized in regional science and urban studies. Nevertheless, while stochastic models of location are not completely new, they were adopted to investigate the decisions of various parts of the population, separately—e.g. entrepreneurs, migrants, criminals—like in the study of the spatial variations of crime (e.g. Deutsch and Epstein 1998), or in the migration decisions to relocate or not (e.g. O'Connell 1997; Schmidt 2014). In the context of the logistics literature, stochastic models of location were first applied to facilities location problems (see, among many others, Jucker and Carlson 1976, Hodder and Dincer 1986; see also Krarup and Pruzan 1983, Snyder 2006 for surveys). These modeling approaches of location choices, while considering the plant location decisions, do not consider the settlement of employees.

This paper represents an attempt to provide a theoretical ground of the spatial location of firms and employees under uncertainty, considering their interdependency.

Particularly, we start from two empirical regularities: the deviation between the actual distribution of firms and the uniform distribution and the discrepancy between the actual distribution of employees and the one of the existing firms. As for the first empirical regularity, a “no localization” state—wherein even “first nature” location advantages are irrelevant—can be associated to the uniform distribution of firms across a homogeneous space.

To give an example of the first empirical regularity, Fig. 1 compares the interregional settlement of firms in the office machinery and computers industry across 133 EU regions (blu line) with a randomly generated distribution based on a uniform law over [1,400], which is the variation range of the considered phenomenon (red line).

Fig. 1
figure 1

Firms in the office machinery and computers industry in Europe—Regional shares. The real distribution (blue line) is compared with a randomly generated distribution on the basis of the uniform law on [1,400] (color figure online)

We perform a Kolmogorov–Smirnov test of the equality of distributions with a null distribution of this statistic calculated under the hypothesis that the sample of data is drawn from the reference uniform distribution. Results confirm that the factual distribution is far from following a uniform law (Table 1), as the graphical evidence clearly suggest (Fig. 1).

Table 1 One-sample Kolmogorov–Smirnov test against theoretical uniform distribution on [1,400]

Several factors beyond “first nature advantages”, that concur to such a deviation have been identified in the literature. Papageorgiou and Smith (1983) claims that the uniform distribution is unstable if subjective attitudes regarding human interactions are accounted for. Moreover, within the evolutionary economic literature, further reasons can be identified, including the imitative behaviour of a newly located firm in respect of the existing firms (Arthur 1994; David 2000). We, in fact, have assumed that clustering and establishments’ location is path-dependent (for a similar view, see Boschma and Lambooy 1999).

The second empirical regularities that the urn model refers to is the divergence between the spatial patterns of firms and employees in a specific industry. Figure 2 serves as an example, and compares the interregional settlement of employees in the office machinery and computers industry across 133 EU regions (blu line) with the spatial distribution of plants (red line).

Fig. 2
figure 2

Employees and establishments in the office machinery and computers industry in Europe—Regional shares

The visual representation indicates that the distribution of employees does not mimic the one of firms, and the results of the Kolmogorov–Smirnov test of the equality of distributions clearly suggest that they cannot be considered as drawn from the same distribution.

The null distribution of this statistic is calculated under the hypothesis that the sample of data is drawn from the reference, i.e. the distribution of establishments across space. Results clearly imply that the null hypothesis has to be rejected (see Table 2).

Table 2 One-sample Kolmogorov–Smirnov test of employment against the distribution of firms

We can summarize the motivations behind our study. In the context of well-established theories, such as the New Economic Geography and even the Evolutional Economic Geography, the issue of uncertainty in location choices by firms and employees have been only marginally considered. Stochastic location models are not entirely new in the literature, but they do not consider the interdependency of the location choices of firms and employees.

More importantly, we aimed to provide a theoretical framework able to reconcile the empirical regularities with a more realistic hypothesis of bounded rationality of entrepreneurs and workers.

In this perspective, location choices are viewed as an optimization process subject to uncertainty.

The offered theoretical perspective is related to the preferential attachment problem, where deciders select one of the available alternatives based on a prefixed criterion. Specifically, as we will see in detail below, we describe the process of the placement of employees and firms over a set of regions by modelling employees and firms as coloured balls and regions as urns. The probability that a given employee (or firm) places in a specific region is the same as extracting a particularly coloured ball from the urn related to the region. The complexity of the environment lets the available opportunities be not identical so that the randomness of the extractions is biased by special events and external factors. We admit balls additions and removals at each step of the process to capture this property of the location dynamics, hence letting our approach be particularly appropriate in our context.

A General View and Basic Assumptions of the Model

We suggest that the evidence highlighted in Sect. Motivation: In Search of a Theoretical Foundation for Well-known Empirical Regularities can be explained by the interplay between agglomeration economies and countervailing factors acting toward dispersion.

In particular, accordingly to the arguments developed above, we assume that natural advantages do not play any role in driving the regional selection procedure or firms location process. This assumption has a great impact mainly at the beginning of the location process—i.e., when regions are empty and the selection of a region obeys a uniform distribution. However, as we will see below, deviations from the uniform distribution in successive steps occur, reasonably due to the presence of scale economies, intra-regional competition and other sources of economic factors.

In the context of the distribution of the employees, following Carlton (1983), we consider that it should be tailored on the one related to the existing firms. In fact, the plant location choice is usually taken in a long-run perspective, as a part of a more complex investment decision making process, involving, among other key strategic choices-like technology, financing, marketing and distribution etc., the plant firm size (see Hayter 1997 for a similar viewpoint). Hence, the spatial distribution of workers in one industry is certainly linked to the decisions taken by entrepreneurs about the number of employees to hire.

We basically assume that employees choose their location as soon as firms have settled in space. This is reasonable in a world where mobility of labour across sectors and space is allowed and firms are the basic source of economic growth and thus of jobs.Footnote 1

In particular, under the assumption that firms belonging to the same sector are equivalent in terms of their sizes, an employee will select the locating area on the basis of the relative number of firms belonging to the available regions. The size of the firm is predetermined at the beginning of the process of site selection by firms and employees but it can change over time. Thus, for what concerns the location process related to the employees, we here take for us the evidence that it is driven by the size of the firms within the same sector. In so doing, we move from the reasonable intuition that employees spread over the available regions proportionally to the typical dimension of firms of the industrial sector—which is measured as the average firm size of the sector. This stated, we consider as grounding assumption that the counterfactual regional distribution of the employees is the one of the firms—hence, implicitly assuming that all the firms belonging to a given sector share the same size. In saying this, we are pointing out that the divergence between firms and employees depends only on the heterogeneity in firm size.

The Location Choice/Urn Extraction Procedure

We assume that the economic system is composed by a finite number of regions, Z. and R is the set of possible alternatives that firms consider as feasible for establishing a plantFootnote 2 (\(R\subset Z)\).

We consider an urn containing a finite number of coloured balls. We denote such colours as\({C}_{1}, {C}_{2},\dots ,{C}_{R}\). The \(R\) regions can be viewed as the colors of the balls in the urn.

For both cases of employees and firms, the process of region selection is modelled by a sequential extraction of a ball from a suitably defined urn, and the colour of the extracted ball represents the selected region.

We consider \(S\) industrial sectors forming the entire economic system.Footnote 3 We aim at modelling and discussing different types of relationships between the distribution—at a regional level—of the employees and of the firms, both for each sector and for the overall economic activity. The distribution of employees and firms is described as a dynamic settlement process through an urn model. The final outcome of the preference optimization of each economic agent is a probability law over the available regions, and it is strongly dependent on the agent taking the decision. In the following section we enter the details and explain how the urn model can be applied for modelling the rational location choices of firms and employees, respectively.

Firms

We consider a set of \(K\) firms.

The probability that the \(k\)-th firm locates in the \(r\)-th region coincides with the probability that the \(k\)-th individual extracts from the urn \(U\) a ball with color \({C}_{r}\).

In general, we assume that the rational location choice of the \(k\)-th firm is affected by the following characteristics of each region \(r\):

  1. (1)

    presence in the region \(r\) of firms of the same sector to take advantage of industry-specific knowledge spillovers, skilled workers in the local labour market, and within-industry input–output linkages;

  2. (2)

    presence in the region \(r\) of firms operating in ancillary activities to exploit complementarities with upstream and downstream firms along the value chain and knowledge spillovers across sectors;

  3. (3)

    presence in region \(r\) of infrastructures and services leading to a more profitable business environment, included public supports for firm formation or relocation (urbanization economies).

  4. (4)

    Intra-regional competition

  5. (5)

    Congestion costs

  6. (6)

    the spatial distribution of demand for the product of the firm to save on transport costs

The first three factors are related to the advantages of agglomeration economies, and they favour the spatial concentration of economic activities, while factors (4) and (5) and (6) work in the opposite direction, and they are intended to capture the dispersion forces usually considered in the literature. Accordingly, to each region r it is associated a possible score, which is a state defined by the combined effect of the above-mentioned regional attributes. As we will see, the scores of the regions lead to a probability distribution which gives information on how likely a firm will select a given region when locating. In particular, the single firm is endowed with a cardinal utility function uk, representing the “value” or “profit” placed by the entrepreneur to the envisaged score associated to each region—hence, to the probability distribution induced by the scores. The higher the score, the higher the worthiness of the region for the individual firm.

The probability that the \(k\)-th firm locates in the \(r\)-th region increases as the ratio between the number of balls with colour \({C}_{r}\) and the total balls contained in \(U\) does. Moreover, the heterogeneity among the firms and their preferences based on the interplay between agglomerative effects and dispersion forces is modelled through the modification of the urn configurations at each firm’s location decision. In particular, we consider a multistage procedure, in which the \(k\)-th stage is associated to the selection of the region by the \(k\)-th firm.

At the initial stage, the colours are assumed as being identically distributed in the urn. This means that the first extraction from the urn (i.e.: the location choice of the first firm) is implemented according to a uniform distribution over the available regions. The uniform hypothesis of the initial configuration of the urn stands for the irrelevance of natural advantages, which are the only mean to let an empty region be preferable than another empty one and are not considered here.

In the dynamic of this site selection model, the economic agents choose to be situated in a particular location because of its specific combination of the relevant characteristics highlighted above. Whatever the specific preference structure of each firm, what is particularly relevant for the firms is the distribution of the firms itself, in the sense that the \(k\)-th firm’s choice are affected by the location of the previous \(k-1\) firms.

So, in general, while the role of “historical accidents” is accounted for in the first step of the selection process, the uniform distribution of the colors in the urn is an invalid condition in each step after the first one.

At the first stage, the firm labeled with \(1\) implements the selection of the region. In the context of colored balls in the urn, the individual labeled with \(1\) extracts a ball from \(U\).

The drawn color corresponds to the region where the 1-st firm locates. So, the number of the firms in the \(R\) regions after the 1-st drawn from the urn is a vector \(({n}_{1}^{\left(1\right)},{n}_{2}^{\left(1\right)},\dots ,{n}_{R}^{\left(1\right)})\), where the subscript indicates the color/region. Of course, if the extracted ball has color \({C}_{\overline{r} }\), then it results \({n}_{\overline{r} }^{\left(1\right)}=1\) and \({n}_{r}^{\left(1\right)}=0\) for each \(r\ne \overline{r }\).

After the extraction, the drawn ball is reinserted in the urn. At the same time, some other colored balls are inserted in or removed from the urn. At the end of this procedure, the number of the balls with colours \({C}_{1}, {C}_{2},\dots ,{C}_{R}\) is\({C}_{1}^{(2)}, {C}_{2}^{(2)},\dots ,{C}_{R}^{\left(2\right)}\), respectively. The resulting new configuration of the balls in the urn changes—in general—the probability of extracting a ball with a specific color in the next step, according to the preference structure of the decision maker (see below the details on this for the general case of the \(k\)-th step). This new urn is the one used by the firm labeled with 2 for the ball drawn procedure.

Now, the firm labelled with 2 takes a ball from the urn. The number of the firms in the regions after the 2-st drawn from the urn is \(({n}_{1}^{\left(2\right)},{n}_{2}^{\left(2\right)},\dots ,{n}_{R}^{\left(2\right)})\), where the subscript indicates the color/region. In accord to the previous step, the nonempty regions are all the ones selected in the 1-st and 2-nd step, and we have \({n}_{1}^{\left(2\right)}+{n}_{2}^{\left(2\right)}+\dots +{n}_{R}^{\left(2\right)}=2\).

The procedure goes on according to this rule: at the \(k\)-th stage, the firm extracts from a urn such that the number of balls with colors \({C}_{1}, {C}_{2},\dots ,{C}_{R}\) is\({C}_{1}^{(k)}, {C}_{2}^{(k)},\dots ,{C}_{R}^{\left(k\right)}\), respectively.

As already mentioned above, the probability \({p}_{\overline{r} }^{(k)}\) of taking a specific color/region \({C}_{\overline{r} }\) from the urn is the relative number of the balls with color \({C}_{\overline{r} }^{(k)}\) in the urn, so that

$${p}_{\overline{r} }^{(k)}=\frac{{C}_{\overline{r} }^{(k)}}{\sum_{r=1}^{R}{C}_{r}^{\left(k\right)}} , \forall k=\mathrm{1,2},\dots ,K and \overline{r }=\mathrm{1,2},...,R.$$
(1)

In general,\({p}_{\overline{r} }^{(k)}\ne {p}_{\overline{r} }^{(k-1)}\), for each \(k=\mathrm{1,2},\dots ,K\) and \(\overline{r }=\mathrm{1,2},...,R.\)

After the \(k\)-th drawn, the number of firms in the regions is \(({n}_{1}^{\left(k\right)},{n}_{2}^{\left(k\right)},\dots ,{n}_{R}^{\left(k\right)})\).

The balls addition/removal procedure implemented at each stage –and the associated probability distribution \({p}^{(k)}=({p}_{1}^{\left(k\right)},{p}_{2}^{\left(k\right)},\dots ,{p}_{R}^{(k)})\)- has economic reasoning.

In fact, the \(k\)-th firm takes its rational choice on the basis of its utility \({u}_{k}:{\varvec{D}}\to {\mathbb{R}}\), where \({\varvec{D}}\) is the set collecting all the discrete probability distributions of the type \(p=({p}_{1},{p}_{2}, \dots , {p}_{R})\) over the set \(\{\mathrm{1,2}, \dots , R\}\), where

$${p}_{\overline{r} }\in \left[\mathrm{0,1}\right]; \forall \overline{r }=\mathrm{1,2},\dots ,R and \sum_{r=1}^{R}{p}_{r}=1.$$

The meaning of such probabilities in our settlement process is

$${p}_{\overline{r} }=Prob\left(to draw a ball of color {C}_{\overline{r} }\right).$$

At each stage, the probability distribution reveals the preference structure of the k-th firm, according to the specific cardinal utility function \({u}_{k}\) which places a value to each region taking into consideration the interplay between the perceived regional attributes (1)–(6) described above. The value of the cardinal utility would be different for different firms. For example, a firm that displays a strong attitude towards interaction and put a relatively lower weight to congestion costs will make a different location choice compared to a firm that displays a strong aversion to competition relative to the advantages it perceives in terms of positive externalities related to the presence of other firms in the same region. Example 1 below is suitably designed to show how the location choices may differ for agents with heterogenous preferences with respect to the interplay between agglomeration and dispersion forces.

Hence, the probability to extract a region r can be conceived as the result of a procedure of rational choice by the firm (See Simon 1955). The key assumption here is that the k-th firm has incomplete information about the final state s of the region in which it will end up operating, in terms of the presence of other firms.

This is one main source of uncertainty that we consider in the present model: the k-th firm can observe the spatial distribution of existing firms but it cannot know how many other firms will locate in the same region or elsewhere.

After the rational choice procedure, the colored balls are added (removed) in (from) the urn, so that the distribution of the balls in the urn is in accord to the rational choice probability distribution, which is exactly \({p}^{(k)}=({p}_{1}^{\left(k\right)},{p}_{2}^{\left(k\right)},\dots ,{p}_{R}^{(k)})\) as in formula (1).

The specific shape of the utility function depends also on the sectors of activities of the previously located firms and on the locating one. We assume that firms are shared among \(S\) sectors of activity and denote the sectors by \(s=1, 2,\ldots ,S\).

To better explain how the settlement process works, an illustrative example is needed. We refer here to the manufacturing industry and a rational location choice by firms only driven by the possibility to exploit upstream and downstream complementarities across industries and industry-specific Marshallian economies.

Example 1

Consider five regions (\(R=5)\) and suppose that eleven firms have already chosen their regional location. So, the 12-th firm must now extract a ball from the urn and locate (i.e.: \(k=12\)).

Let us also suppose that the located firms belong to four sectors (i.e.: \(S=4)\). Specifically, three of them are in the Textiles, two of them are in the Motor vehicles, six of them are in the Machinery and one in the Basic metals.

The regional distribution of the firms is as in Fig. 3.

Fig. 3
figure 3

Situation at the beginning of the 12-th step, describing the localization of the eleven firms among the five regions. Different symbols stand for different sectors of activities. Circles represent the Textiles, stars are associated to the Motor vehicles, square is Machinery while the rectangle indicates the Basic metals

Suppose now that the 12-th firm belongs to the Textiles and it is a firm with a strong attitude towards interaction within the same sector, in other words, for the entrepreneur the most relevant positive feature for choosing a location is given by the possibility to take advantage of localization economies. He also values the possibility to activate backward and forward linkages across sectors, but to a lesser extent. Moreover, local congestion costs and competition are of negligible relevance in his preference structure. The firm is aware about the location configuration presented in Fig. 3. Then, it ranks regions in accord to its preferences. An example of plausible reasoning could be the following: regions \(r=\mathrm{1,2},3\) are free of firms coming from the Textiles sectors. Then, placing in one of such regions could not be a good choice. However, \(r=1\) is better than the other two, since the presence of a higher number of firms is for sure associated to a higher level of infrastructures and urbanization economies, hence facilitating the business activity. Besides, the existing firms are operating in a sector, which may be important for the firm, because of the possible complementarities with an upstream sector producing specialized machinery for textile products. Nevertheless, region \(r=4\) should be a better choice for the possibility to exploit Marshallian economies. Region \(r=5\) has also some attractive features due to the presence of one firm from the Textiles. In fact, the new established firm can also take advantage of the already established business environment in the Textiles sector fostered by the existing firm (e.g. presence of skilled labor, and/or specialized services for the textile industry). Said this, \(r=5\) is perceived to be better than \(r=\mathrm{1,2},3\).

To conclude, the 12-th firm decides to locate in a region different from \(r=\mathrm{1,2},3\). In a preference scale, it decides to assign mark 10 to the preferred region \(r=4\) and, by comparison, mark 7 to \(r=5\) and mark 5 to \(r=1\). Regions \(r=2\) and \(r=3\) have mark 3. Such scores lead to a probability distribution \({p}^{(12)}=\left({p}_{1}^{\left(12\right)},{p}_{2}^{\left(12\right)},{p}_{3}^{\left(12\right)},{p}_{4}^{\left(12\right)},{p}_{5}^{\left(12\right)}\right)=(\frac{5}{28},\frac{3}{28}, \frac{3}{28}, \frac{10}{28},\frac{7}{28})\).

The probability distribution \({p}^{(12)}\) may be viewed as the solution of a utility-based constrained optimization problem. The utility function is suitably defined as \({u}_{12}:{\varvec{D}}\to {\mathbb{R}}\), so that

$${u}_{12}\left({p}^{\left(12\right)}\right)={u}_{12}\left(\frac{5}{28},\frac{3}{28}, \frac{3}{28}, \frac{10}{28},\frac{7}{28}\right)=\underset{p\in {\varvec{D}}}{\mathrm{max}}{u}_{12}\left(p\right),$$

and \({\varvec{D}}\) is the admissible region.

The removal/addition procedure is then in accord to the optimizing probability \({p}^{(12)}\). Specifically, if we suppose that, for example, the number of the colored balls in the urn at the 11-th step was configurated as \({(C}_{1}^{(11)}, {C}_{2}^{(11)},{C}_{3}^{(11)},{C}_{4}^{(11)},{C}_{5}^{(11)})=(\mathrm{8,3},\mathrm{2,14,25})\), then the (not unique) urn for the 12-th extraction could be created by: adding two balls of color \({C}_{1}\), adding three balls of color \({C}_{2}\), adding four balls of color \({C}_{3}\), adding six balls of color \({C}_{4}\) and removing eleven balls of color \({C}_{5}\). The resulting configuration of the colors is \({(C}_{1}^{(12)}, {C}_{2}^{(12)},{C}_{3}^{(12)},{C}_{4}^{(12)},{C}_{5}^{(12)})=(\mathrm{10,6},\mathrm{6,20,14})\), and the probability distribution associated to the extraction of a colored ball from the urn is exactly \({p}^{\left(12\right)}\).

At this point, the 12-th extraction takes place. Consistently with the above example, consider that the drawn ball is of color \({C}_{4}\). Then, we have \({(n}_{1}^{(12)}, {n}_{2}^{(12)},{n}_{3}^{(12)},{n}_{4}^{(12)},{n}_{5}^{(12)})=(\mathrm{3,1},2,4,3)\), where the bold indicates the change with respect to the location of Fig. 3.

The regional distribution of the firms at the end of the 12-th step is illustrated in Fig. 4.

Fig. 4
figure 4

Situation at the end of the 12-th step. The location of the 12-th firm is highlighted. Symbols are those of Fig. 3

Notice that the location outcome would have been different if the 12-th firm were a textile firm placing a relatively higher value to the variety of the local environment, in terms of the presence of firms operating in different sectors, compared to the possibility to exploit sector-specific external economies. In this case, the entrepreneur would have chosen region 5 as the preferred site to open a new branch. Higher scores would have been attributed to regions with a greater variety of sectors, and an associated probability distribution \({p}^{(12)}\) could have been as follow:

$${u}_{12}\left({p}^{\left(12\right)}\right)={u}_{12}\left(\frac{3}{28},\frac{2}{28}, \frac{6}{28}, \frac{7}{28},\frac{10}{28}\right)=\underset{p\in {\varvec{A}}}{\mathrm{max}}{u}_{12}\left(p\right),$$

The process of location of firms in the regions stops at the \(K\)-th stage.

The final distribution of the \(K\) firms in the \(R\) regions will be denoted by \(\tilde{p }=\left({\tilde{p }}_{1},{\tilde{p }}_{2},\dots , {\tilde{p }}_{R}\right)\), and it is the outcome of how firms located during the \(K\) steps. Hence, it can be derived in a natural way from the number of firms in the regions at the end of the \(K\)-th drawn from the urn, namely \(({n}_{1}^{\left(K\right)},{n}_{2}^{\left(K\right)},\dots ,{n}_{R}^{\left(K\right)})\), as follows:

$${\tilde{p }}_{\overline{r} }=\frac{{n}_{\overline{r} }^{(K)}}{K}, \forall \overline{r }=\mathrm{1,2},\dots ,R.$$

It is possible to derive the final regional distribution of the firms belonging to a specific sector by the settlement procedure implemented above.

At each step of the location procedure, we identify the sector of the located firm so that \(({n}_{1}^{\left(s,k\right)},{n}_{2}^{\left(s,k\right)},\dots ,{n}_{R}^{\left(s,k\right)})\) is the vector collecting the number of the firms of sector \(s\) in the \(R\) regions. The final distribution of the firms belonging to sector \(s\) is then a vector \({\tilde{p }}^{(s)}=\left({\tilde{p }}_{1}^{(s)},{\tilde{p }}_{2}^{(s)},\dots , {\tilde{p }}_{R}^{(s)}\right)\),

where

$${\tilde{p }}_{\overline{r} }^{(s)}=\frac{{n}_{\overline{r} }^{(K,s)}}{\sum_{r=1}^{H}{n}_{r}^{(K,s)}}, \forall \overline{r }=\mathrm{1,2},\dots ,R and s=\mathrm{1,2},\dots ,S.$$

The final distribution of firms normally does not follow a uniform law because of path dependence and the location factors described above (i.e. industry-specific Marshallian economies, complementarities between upstream and downstream firms across sectors along the value chain, urbanization economies, the distribution of demand).

Employees

We now deal with the employees’ distribution, and assume an economic system populated by \(L\) employees.

Also in this case, we assume that an employee locates in the \(r\)-th region with the same probability of extracting a ball with color \({C}_{r}\) from a urn.

As explained in Sect. Motivation: In Search of A Theoretical Foundation for Well-known Empirical Regularities the settlement procedure of employees is conditioned to the spatial distribution of firms’ establishments. As in the Ellison and Glaeser (1997)’s approach, we consider that, under the initial random location scenario, firms locate independently from each other but employees do not. Particularly, the lumpiness of establishments is incorporated in the model by assuming that, in the theoretical case (homogeneous regions), firms are identical in terms of firm size, an expression of the strength of internal scale economies typical of the sector.

Hence, our grounding assumption is that firms belonging to the same sector \(s\) hire the same number of employees, say \({L}_{s}\). So, the number of employees of sector \(s\) in region \(r\) is proportional to the number of firms of sector \(s\) in region \(r\), and the proportionality factor is exactly \({L}_{s}\). This means that, under the initial random location scenario, the spatial distribution of the firms and the spatial distribution of employees of sector \(s\) should coincide.

Deviations from this configuration is the result of two main forces: (a) the heterogeneity of firms’ size, which could emerge dynamically, leading to several firms departing from the typical industrial structure of the sector, and (b) a different site selection process of economic agents (firms vs employees) since the structure of preferences of workers is usually different from those of entrepreneurs. For examples workers may assign higher attractiveness to some regions (e.g. metropolitan regions for the easier access to urban amenities, regions with higher wages).

However, the settlement process is of sequential type: the first employee is assumed to select a region in accord to the initial assumption, since the industrial system is not affected at the beginning by the effects (a)–(b). After the first step, the economical-geographical activity proceeds, and effects (a)–(b) need to be taken in full consideration. We want to stress, that the identification of the sector is of paramount relevance at the beginning of the location process.

There are \(S\) urns, one for each sector. We denote them as \({U}_{1},{U}_{2}, \dots ,{U}_{S}\).

At the beginning of the location process, each urn contains balls of colors \({C}_{1}, {C}_{2},\dots ,{C}_{R}\), so that the distribution of the colored balls in \({U}_{s}\) coincides with the final distribution of the firms of sector\(s\), i.e.\({\tilde{p }}^{(s)}=\left({\tilde{p }}_{1}^{(s)},{\tilde{p }}_{2}^{(s)},\dots , {\tilde{p }}_{R}^{(s)}\right)\), for each\(s=\mathrm{1,2},\dots ,S\).

Suppose that the first employee (first extraction from the urn) belongs to the sector \({s}_{1}\in \{1, 2,\dots ,S\}\). Then, she/he extracts her/his region by drawing a ball from the sector-specific urn \({U}_{{s}_{1}}\). Then, the employee locates, according to the color of the extracted ball. The number of the employees belonging to sector \(s\) in the \(R\) regions after the 1-st drawn is a vector \(({m}_{1}^{\left(1,s\right)},{m}_{2}^{\left(1,s\right)},\dots ,{m}_{R}^{\left(1,s\right)})\), for \(s=\mathrm{1,2},\dots ,S\).

After the extraction, coloured balls are added/removed from each urn (and not only from \({U}_{{s}_{1}}\)). This hypothesis captures the dependence structure among different sectors, which actually may interact.

By adopting the same notation of the previous section, at the end of the addition/removal procedure the number of the balls with colours \({C}_{1}, {C}_{2},\dots ,{C}_{R}\) in urn \({U}_{s}\) is\({C}_{1}^{(2,s)}, {C}_{2}^{(2,s)},\dots ,{C}_{R}^{\left(2,s\right)}\), respectively, for each\(s=\mathrm{1,2},\dots ,S\). These urns are ready for the second extraction (i.e., the second employee), even if only one of the urns will play an active role (the one associated to the sector of activity of the employee labeled with 2).

Indeed, the second employee is assumed to belong to the sector \({s}_{2}\in \{1, 2,\dots ,S\}\). Then, she/he extracts her/his region by drawing a ball from \({U}_{{s}_{2}}\), while the remaining urns are not considered.

Recursively: at the \(l\)-th stage, the number of the balls with colors \({C}_{1}, {C}_{2},\dots ,{C}_{R}\) in urn \({U}_{s}\) is\({C}_{1}^{(l,s)}, {C}_{2}^{(l,s)},\dots ,{C}_{R}^{\left(l,s\right)}\), respectively, for each\(s=\mathrm{1,2},\dots ,S\). The number of the employees belonging to sector \(s\) in the \(R\) regions after the \(l\)-th drawn is\(({m}_{1}^{\left(l,s\right)},{m}_{2}^{\left(l,s\right)},\dots ,{m}_{R}^{\left(l,s\right)})\), for \(s=\mathrm{1,2},\dots ,S\) and\(l=\mathrm{1,2},\dots ,L\).

The procedure stops at the \(L\)-th stage, when the last employee extracts and locates in a region.

Also in this case, the addition and removal of coloured balls from the urn is due to a rational choice over the set \({\varvec{D}}\) of the discrete probability distributions on \(\left\{\mathrm{1,2},\dots ,L\right\}.\)

Specifically, the \(l\)-th employee, belonging to sector \({s}_{l}\), is assumed to have utility function \({u}_{l{s}_{l}}:{\varvec{D}}\to {\mathbb{R}}\). The utility maximization over \({\varvec{D}}\) leads to a removal/addition of coloured balls from each urn, and not only from \({U}_{{s}_{l}}\), to obtain a distribution of colored balls identical to the preferred probability distribution \({\overline{p} }^{(l,s)}=({\overline{p} }_{1}^{\left(l,s\right)},{\overline{p} }_{2}^{\left(l,s\right)}, \dots , {\overline{p} }_{R}^{(l,s)})\), i.e.

$${\overline{p} }_{\overline{r} }^{(l,s)}=\frac{{C}_{\overline{r} }^{(l,s)}}{\sum_{r=1}^{R}{C}_{r}^{\left(l,s\right)}} , \forall l=\mathrm{1,2},\dots ,L; s=\mathrm{1,2},\dots ,S and \overline{r }=\mathrm{1,2},\dots ,R.$$

As already stated above, all the urns change their configurations because the action of a sector might change the equilibria also in different sectors, and let a region be more or less attractive than another one also for employees belonging to sectors different from \({s}_{l}\).

At each stage, the probability distribution reveals the preference structure of the l-th worker, similarly as in the case of the site selection process by firms. As for their specific selection criteria, we may suggest that, given the spatial distribution of firms that mainly delineate job opportunities, workers are assumed to be interested in locating in urban areas or more populated areas. Thus, the second main source of uncertainty that we consider in the present model relates to the incomplete information by the l-th worker about the final population density of the region, while he can observe the existing distribution of firms across space and the regions already chosen by workers at earlier stages.

The final distribution of the employees belonging to sector \(s\) over the \(R\) regions is a vector \({\tilde{q }}^{(s)}=\left({\tilde{q }}_{1}^{(s)},{\tilde{q }}_{2}^{(s)},\dots , {\tilde{q }}_{R}^{(s)}\right)\), where

$${\tilde{q }}_{\overline{r} }^{(s)}=\frac{{m}_{\overline{r} }^{(L,s)}}{{L}_{s}} , \forall \overline{r }=\mathrm{1,2},\dots ,R and s=\mathrm{1,2},\dots ,S.$$

By the sectorial analysis we can also infer the distribution of the employees in the overall manufacturing activity. Such a distribution is denoted as \(\tilde{q }=\left({\tilde{q }}_{1},{\tilde{q }}_{2},\dots , {\tilde{q }}_{R}\right)\), where

$${\tilde{q }}_{\overline{r} }=\frac{\sum_{s=1}^{S}{m}_{\overline{r} }^{(L,s)}}{L} , \forall \overline{r }=\mathrm{1,2},\dots ,R$$

The deviation between \({\tilde{q }}^{(s)}=\left({\tilde{q }}_{1}^{(s)},{\tilde{q }}_{2}^{(s)},\dots , {\tilde{q }}_{R}^{(s)}\right)\) and \(\tilde{q }=\left({\tilde{q }}_{1},{\tilde{q }}_{2},\dots , {\tilde{q }}_{R}\right)\) can be viewed as the final result of a cumulative process in the location of firms and employees.

The dissimilarity between the spatial distribution of firms and employees is essentially due to the heterogeneity of firms in term of plant size, according to the arguments proposed above on the correspondence between equal size and uniform distribution.

A Discussion of Advantages and Limitations of Urn Theory with Reference to the Literature

The objective of this section is to show the usefulness of urn theory for the study of location processes of firms and employees under uncertainty. The discussion will be carried out by taking also a close look at the reference literature.

First, the modelling approach allows the researcher to portray the complexity of spatial clustering without radically simplifying it. Particularly, the treatment of space is midway between the formalized schematic and the more realistic narrative. Hence, it can be adopted as a background theoretical framework to rationalize the empirical evidence of the geographic concentration of economic activities in the context of spatial economics. In the dartboard approach, the measurement of localization explicitly involves a comparison between the distributions of employment and firms and it is anchored to a theoretical model of location (Ellison and Glaeser 1997). While other approaches over the discrete space, such as those based on relative entropy measures (e.g. Brülhart and Träeger 2005; Cutrini 2009, 2010, among others), make use only of lattice employment data, with the underlying location choice of firms completely on the shadow. Given these premises, we believe that the present framework can be adopted for the development of a testing procedure able to identify a significant departure from a random location scenario.

Second, location choices of agents are considered under more general assumptions than optimization criteria and they are modelled in a flexible way, so that a wide range of spatial patterns of firms and employees can be rationalized. In this respect, it is worth stressing that we do not explain the location choices of businesses and employees on the basis of deterministic factors that guide the choices as it is the standard approach in the literature. We have moved from a rather different idea. In fact, we have assumed that individual preferences can be viewed as perturbations of the random configuration, and the urn composition at each stage is a proxy of the preference structure of the single firm that is taking the location choice. In this respect, we build on Papageorgiu and Smith (1983) who maintained that even the uniform distribution of identical individuals across space is unstable if one considers that individuals may vary in their attitude regarding spatial interaction.

Third, we have started from the view that the chosen location of a firm and the chosen number of people employed are two source of information that are not independent from each other. Our approach is able to consider both location choices and firm size, as suggested by Carlton (1983). We have assumed that, at the beginning of the site selection process, all firms are equally sized within each sector, and this hypothesis is justified by the need to account for the heterogeneity in minimal optimal scale and industrial structure across industries. Moreover, in the successive steps, the model allows for heterogeneity of firm sizes even within each industry.

Fourth, the modelling approach features a role for history and 'historical accidents' that determine path-dependence and persistence, and it is basically consistent with the evolutionary perspective in economic geography (David 2000; Arthur 1989; Boschma and Lambooy 1999; Boschma and Martin 2012; Martin and Sunley 2012). This is one of the main advantages of the model, that is, it considers the dynamics of the localization process. In fact, agglomeration is modelled not only as the result of increasing returns at the plant level, as is the case in NEG models, but it is also a product of history. The current choices of firms depend on the past choices of previously settled firms. In this respect, it is worth mentioning the Independence of Irrelevant Alternatives (IIA) assumption of the error term and how it constitutes a limitation to consider cumulative processes in empirical studies. In other words, once controlling for the observable characteristics of regions and those of individuals, the error term captures all the rest, that is the unobservable characteristics not included in the model and it may lead to infer that all locations are similar to the decision makers. If this assumption is violated, estimated coefficients can be biased. The proposed urn theory approach overcomes this severe limitation.

Finally, the interaction between positive externalities and congestion costs (competition) is explicitly considered in the model through heterogenous preferences of the decision makers and the resulting rational choices (see the example provided in Sect. A General View and Basic Assumptions of the Model).

For all these reasons, we claim that the urn model can be a useful device in the context of location choice modelling. Nevertheless, we do not contend that the approach presented here is free from limitations and, in this respect, we can suggest that its outstanding flexibility allows for possible variations and several refinements that may overcome some of them.

To give an example, we have selected firms and employees as the relevant economic actors in our model. Our choice was driven by the goal of providing a theoretical background for the two empirical regularities highlighted in Sect. 3.1.1. It goes without saying that, thanks to the high flexibility of the model, it is possible to consider other actors, depending on the aim of the study. To give an example, to conceptualize the role of demand in the location choice of firms, it would be more appropriate to considers firms and consumers as the relevant economic agents.

Conclusions and Future Research

This paper develops a stochastic model for describing the location process of firms and employees among a set of regions. We postulate spatial structures as emerging from the micro-behaviours of heterogeneous economic agents. As a first step, we consider two different key economic agents: entrepreneurs and workers, each individual within each subset of the population is endowed with a specific preference structure about the desirability to locate in a particular region.

The model is grounded on urn theory and it is able to capture the wide number of aspects driving the location choice. Particularly, it allows conceptualizing the process of localization under more general assumptions than the traditional non-random patterns driven by optimization of economic agents. Uncertainty is implicitly assumed because the decision maker is not certain that a locational choice would be profitable or satisfying until regional attributes completely unfold. A key insight from our model is that the location divergence between firms and employees is due to the divergences in terms of sizes of the firms belonging to the same sector.

The framework presented here considers the problem of translating the theory of location choice behaviour into concrete models suitable for the empirical analysis of the spatial distribution of firms and workers. It can be seen as a first step to the development of a systematic theorization of the location processes under uncertainty. In this respect, our setting can be further explored in several directions, and we give here only two noticeable suggestions: first, an explicit reference to the decision time and its effects on the location process could be included. Specifically, it seems to be reasonable to assume that firms can take the opportunity of selecting the appropriate time for locating. As an example, think at a firm producing a patented product and at how the patent expiration date influences the managerial decisions. In this context, the extraction from the urn occurs at (random) times whose identification is a complex task (for instance, it might come out from an optimal stopping problem); second, the presence of an exogenous factor driving the location process could be an interesting aspect to be discussed. Suppose that there are some rumours on a new regulation which incentives investments in a specific region. The presence of uncertainty leads to a further addition/removal of coloured balls in the urn, whose number and distribution is unknown. In this case, a Bayesian-type analysis or scenarios simulations could be of some usefulness.