Department of Economics School of Business, Economics and Law at University of Gothenburg Vasagatan 1, PO Box 640, SE 405 30 Göteborg, Sweden +46 31 786 0000, +46 31 786 1326 (fax) www.handels.gu.se info@handels.gu.se WORKING PAPERS IN ECONOMICS No 662 On-the-job search and city structure Aico van Vuuren June 2016 ISSN 1403-2473 (print) ISSN 1403-2465 (online) On-the-job search and city structure∗ Aico van Vuuren † Version: February 2016 Abstract This paper investigates an equilibrium search model in which search frictions are increas- ing with the distance to a city’s central business district, allowing for on-the-job search and endogenous wage formation and land allocation. The findings suggest that the decentralized market results in a more segregated outcome than may be socially desirable. The externality comes from the misguided incentives for the low-paid workers, who have a high preference for central locations in order to climb up the job ladder. Policies reducing the rental costs of un- employed workers for locations close to the central business district may potentially increase welfare. Keywords: Search, city structure, urban economics JEL-codes: J00, J64, R14 ∗We would like to thank Yves Zenou, Florian Sniekers, Jos van Ommeren as well as participants of the Search and Matching Conference in Edinburgh and the workshop on labor mobility in Louvain-la-Neuve in particular Paul Beaudry and Etienne Wasmer for useful comments. †University of Gothenburg. 1 Introduction This paper investigates the structure of cities, extending the mono-centric city model of Was- mer and Zenou (2002) by including on-the-job search and endogenous wage formation. Wasmer and Zenou (2002) found that there are two mutually exclusive equilibrium city structures in the absence of on-the-job search and assuming ex-ante homogeneous workers. In the first city struc- ture, unemployed workers live as close as possible to the central business district (CBD), while employed workers live farther away. This equilibrium, labeled as the “integrated equilibrium”, exists whenever the level of search frictions is low. Another city structure, labeled as the “segre- gated equilibrium”, occurs in the case of high search frictions. In that case, the unemployed live as far away as possible from the CBD (and hence pay low rents), while the employed workers are centrally located. This paper finds that these two equilibria are special cases of a large range of equilibria characterized by the location of the unemployed. Our findings suggest that neither an integrated nor a segregated city is formed when the distance-dependent search efficiency of em- ployed workers is identical to that of unemployed workers (while their commuting costs are low). Instead, we obtain an internal solution of the decentralized market in which low-paid workers are located the closest to the CBD, while high-paid workers are located the farthest away. The unemployed are located in between these two groups of workers. The decentralized market outcome is found to be inefficient under very general circumstances. The externality comes from the misguided incentives given to the low-paid workers. These workers prefer to locate themselves close to the CBD in order to obtain a job in which they receive a higher wage. However, since our basic analysis assumes homogeneity in worker productivity and firm matches, social welfare is not affected by a job-to-job transition, but only by a transition of a worker from unemployment into a paid job. The social planner thus prefers a situation in 1 which unemployed workers are located at a closer distance to the CBD than the market would allocate. In terms of the location decision, worker incentives are aligned with the maximization of social welfare only when wages are equal to worker productivity. In this case, the monopsony power of the firms is the single determinant for the externality. However, since this monopsony power is also necessary in the first place for firms to open vacancies and hire workers, the market can never result in an efficient outcome. Our model features a robustness check whereby the assumption of homogeneneity in the productivity of job matches is shown not to be essential to obtain this result. We close our model by assuming that firms post wages as in Burdett and Mortensen (1998). Most of our results are not dependent on this assumption, while the remaining results can be adapted to other wage-setting mechanisms, conditional on the assumption that monopsony wages are paid. Gautier et al. (2010) identified one good reason to focus on wage-posting models. Ab- stracting from endogenous land allocation, they look at many different wage-setting mechanisms (such as wage-posting and wage-bargaining models), and conclude that wage posting is the only framework in which the market outcome can be constrained efficient. As we show in this paper that even wage posting can never result in an efficient allocation, our inefficiencies must be due to endogenous land allocation. Our paper makes a case for the presence of subsidized housing for the unemployed work- ers. Although rewarding unemployed workers for living closer to the CBD may increase social welfare, such subsidies should be based on employment status and not on earned income, and should terminate directly after the acceptance of a job. We also find that the potential gains for such a policy are limited: our calibration exercises indicate that the inefficiencies that result from endogenous land allocation are only around 1 percent of the total loss of production due 2 to search frictions. Nevertheless, the potential regional consequences of such a policy are enor- mous. We find, in some cases, that the decentralized market chooses a city structure that is almost completely segregated, while the optimal city structure turns out to be an integrated city structure. Like the original paper by Wasmer and Zenou (2002), our paper is highly dependent on the assumption that workers face lower access to jobs when residing at a greater distance from those jobs. There is ample evidence that even during the age of the internet, workers are reluctant to apply for jobs that are farther away from them. For example, Marinescu and Rathelot (2015) use data from CareerBuilder.com to show that the vast majority of job seekers send applications for jobs that are no farther than ten miles from their present residence. In their study of the United Kingdom, Manning and Petrongolo (2015) find an even higher bias towards the present residence. Our paper is closely related to Kawata and Sato (2012), who also extend the model of Wasmer and Zenou (2002) by including on-the-job search. There are two important differences. First, we use other assumptions on how wages are offered to workers.1 Second, they assume search effort to be independent of distance to the CBD, which implies that unemployed workers are always located the farthest away from the CBD (due to the higher commuting costs of the employed workers). This also implies that our welfare results concerning the location of the workers do not exist in their model. Our paper is also related to the work of Smith and Zenou (2003), who extend the model of Wasmer and Zenou (2002) by endogenizing search effort. They show that this extension can result in an equilibrium in which the unemployed live in both central areas as well as areas close to the city border. However, their analysis does not include on-the-job search, 1In particular, they use the competitive search framework of Garibaldi and Moen (2010) and the directed search framework of Menzio and Shi (2011). 3 and features exogenous wage formation. The remainder of this paper is as follows. Section 2 sets up the model and Section 3 looks at the partial equilibria of the labor and housing market. Section 4 analyzes the general equilibrium. Section 5 looks at the social planner, while Section 6 calibrates the model. Section 7 discusses our results and looks at potential extensions; Section 8 concludes. 2 The Model 2.1 General notation and assumptions We assume that the total number of workers equals unity and that workers are uniformly located along a linear, closed and mono-centric city.2 Time is continuous and land is owned by absentee landlords. We define µ as the total number of matches per unit of labor supply; u is the unem- ployment rate and v is the total number of vacancies. Moreover, we define d, with 0 ≤ d ≤ 1 as the distance to the CBD. Let s(d) be the search efficiency of an unemployed worker at distance d. As in Wasmer and Zenou (2002), we assume the following function: s(d) = s0 − ad, (1) where s0 and a are relative efficiency parameters and s0 ≥ a, since otherwise some workers would have negative search efficiency. Moreover, we assume that the search efficiency of the employed workers at distance d equals ψs(d), 0 ≤ ψ ≤ 1. The parameter ψ is relative search efficiency of employed workers. Workers and jobs arrive according to a Poisson process. We assume a general expression for the contact technology between job seekers and vacancies (Gautier et al., 2010): µ = α [ s(du)u+ (1− u)ψs(de) ]ν vξ, (2) 2see Zenou (2009) for an explanation of these terms. 4 where du and de are the average distance to the CBD of, respectively, the unemployed and employed workers. The parameters ν with 0 ≤ ν ≤ 1, and ξ with 0 ≤ ξ ≤ 1, measure the relative contributions of job seekers and vacancies to the contact technology. The parameter α measures the overall efficiency of the matching process. Let λ := µ/ [ s ( du ) u+ (1− u)ψs ( de )] . The implied contact rates for unemployed and employed workers are λs(d) and λψs(d). Vacancies can be opened by firms with a per-period cost equal to γ. We assume τ ≥ 0 to be the commuting costs at distance d from the CBD for employed individuals. Without loss of generality, we set the transportation costs of the unemployed equal to zero. In addition, we denote the land rent at distance d as R(d) and denote R(1) = RA as the exogenous rental costs of agricultural land. We assume that workers produce b in case of unemployment and p in case of being matched to an employer. We assume that p− b > τ , which allows us to obtain sensible equilibria in which unemployed workers accept jobs. Wages are denoted by w. The wage-offer distribution of firms is denoted by F . We denote the distribution of wages among employed workers by G and we assume that matches are destroyed with an exogenous job destruction rate equal to δ. 2.2 Workers Define Vu(d) as the lifetime discounted value of a worker who is unemployed and living at distance d from the CBD and denote Ve(d,w) as the lifetime discounted value of an employed worker also living at distance d from the CBD and working at a wage equal to w. We obtain the following Bellman equations: ρVu(d) = b−R(d) + λs(d) ∫ w+ ϕ ( max d′ Ve(d′, w) − Vu(d) ) dF (w), (3) 5 and ρVe(d,w) =w − τd−R(d) + λψs(d) ∫ w+ w ( max d′ Ve(d′, x)− Ve(d,w) ) dF (x) + δ(max d′ Vu(d′)− Ve(d,w)), (4) where ϕ is the reservation wage of the unemployed and w+ is the upper bound of the support of the wage-offer distribution. Every period, unemployed workers receive their home production minus their rental costs. In addition, they have the possibility of receiving a job offer (with a rate equal to λs(d)); that job offer is accepted whenever the value of accepting that job offer is larger than the value of rejecting it. Employed workers receive their wages minus the sum of the transport and rental costs. They also have the possibility of receiving a job offer and – if a job offer is received, they accept it when the value is higher. Finally, they have the possibility of losing their jobs; in that case, they receive the value of an unemployed worker. 2.3 Firms We define Vv(w) as the value of a firm that opens a vacancy that pays w in the case of a match; Vj(w) is the value of a match that pays wage w. We obtain ρVv(w) = λ v ( us ( du ) + (1− u)ψ ∫ w ϕ s(de(x))dG(x) ) Vj(w)− γ, (5) where de(w) is defined as the average distance to the CBD of an employed worker that earns w. The right-hand side of equation (5) can be explained as follows: the value of a vacancy equals the rate at which a match is formed multiplied by the value of a match, i.e. Vj(w). In addition, we have to subtract the costs of keeping a vacancy open, i.e. γ. A match is formed when the firm connects with either an unemployed worker or an employed worker who earns less than w. Hence, the rate at which a match is formed is the sum of the rates of these two events. Taking into account that matching is random, these rates can be shown to equal λus(du)/v and 6 λ(1 − u) ∫ w ϕ de(x)dG(x)/v for unemployed and employed workers. The value of a match can be calculated by its Bellman equation; after rewriting, we obtain Vj(w) = p− w ρ+ δ + λψs(de(w)))(1 − F (w)) . Hence, the value of a match equals the properly discounted flow of profits, taking into account that a match can be ended by either exogenous reasons (occurring at a rate equal to δ), or because the worker finds a better match (occurring at a rate equal to λψs(de(w))(1 − F (w))). 2.4 Wage-setting mechanism In line with Burdett and Mortensen (1998), we assume that firms post wages to workers, implying that in the case of homogeneous firms, all firms should have the same expected pay-off at the moment that they open a vacancy (i.e. V j(w) should be independent from the wage in equilib- rium). This setup implies that the wage-offer distribution has three important properties: (i) it has no mass points, (ii) the lowest offered wage equals the reservation wage of the unemployed workers, (iii) w+ < p. The proofs of these claims are standard and can be found in other papers (Burdett and Mortensen, 1998; Gautier et al., 2010, among others). 3 Partial equilibrium 3.1 Partial equilibrium at the housing market As in Wasmer and Zenou (2002), we can use the condition that, in the absence of relocation costs, unemployed workers should have the same value independent of the distance to the CBD. Otherwise, some unemployed workers might be able to relocate themselves and gain in terms of utility – and this situation cannot be an equilibrium. Hence, we stipulate that Vu(d) = V u for 7 the value of the unemployed. Likewise, we stipulate that employed workers with a wage w should receive the same value as well (i.e. Ve(d,w) = V e(w)). Moreover, define the bid-rent function, Ψu(d), as the maximum rent that an unemployed worker is able to pay for residing at distance d from the CBD in order to obtain the value V u. Likewise, we define the bid-rent function Ψe(d,w) for the employed workers.3 As an equilibrium condition for the housing market, we stipulate that R(d) = maxw∈[ϕ,w+]{Ψu(d),Ψe(d,w)}, that is, the rent equals the maximum bid of the workers. Based on equation (3), Ψu(d) = b+ λs(d) ∫ w+ ϕ ( V e(w) − V u ) dF (w)− ρV u. Likewise, define the bid-rent function Ψe(d,w). Based on equation (4), we obtain Ψe(d,w) =w − τd+ λψs(d) ∫ w+ w ( V e(x)− V e(w) ) dF (x) + δ(V u − V e(w)) − ρV e(w). (6) Note that the bid-rent functions are not all that interesting on their own, because the maximum amount a person is willing to pay depends on the amount that she has to pay for alternative locations. In addition to that, the bid rents depend on the unknown equilibrium values V u and V e(w). Therefore, the derivatives of the bid rents with respect to d are more valuable for the analysis. First, we find that the higher the absolute values of these derivatives, the more centrally located are the individuals. Second, it turns out that we are able to write these derivatives of the bid-rent functions as independent of the unknown values of V u and V e(w). Therefore, we concentrate on the derivatives of the bid-rent functions rather than on the bid-rent functions themselves. Note that the bid-rent function can be derived from its derivative as the solution to a differential equation. 3Since the concept of bid rents is standard in urban economics, we do not elaborate further on it. See Fujita (1989) and Zenou (2009). Also note that we use somewhat abusive notation, since Ψu also depends on V u and V e(w);ϕ ≤ w ≤ w+. 8 The (partial) derivatives of the bid rents, Ψu and Ψe with respect to d, can be obtained by taking derivatives of these values, i.e. Ψ′u(d) = −aλ ∫ w+ ϕ ( V e(w) − V u ) dF (w), (7) which is negative. It is easier to interpret this first-order derivative by looking at its absolute value, which equals the amount that an unemployed worker is willing to pay for a more central location. Looking at the absolute value of the right-hand side, this willingness equals the increase in the rate at which a worker receives additional job offers when moving to a more central location (i.e. aλ) times the average gain received from such an offer. Likewise, we can construct the first- order derivative of Ψe(d,w) ∂Ψe(d,w) ∂d = −τ − aλψ ∫ w+ w ( V e(x)− V e(w) ) dF (x), (8) which is also negative. The absolute value of the right-hand side of this equation equals the reduction in commuting costs plus the increase in the rate at which a worker receives additional job offers (i.e. aλψ) times the average gain received from such an offer. As in Wasmer and Zenou (2002), the second-order derivatives with respect to d equal zero. The cross-partial derivative equals ∂2Ψe(d,w) ∂d∂w = aλψ ∂V e(w) ∂w (1− F (w)), (9) which is positive for all w < w+. Workers with higher wages have less to gain by moving closer to the CBD, since the likelihood of obtaining even better job offers declines with the wage level. The distribution of land is now determined by the workers who have the highest bid rent. Let W(d) be the set of wages paid to workers with distance d, or W(d) = { w|∀w′ ∈ [ϕ,w+] : Ψe(d,w) ≥ Ψe(d,w′) ∧Ψe(d,w) ≥ Ψu(d) } . 9 Note that this set can be empty in the case that the willingness to pay for the unemployed is strictly larger. In addition, W(d) should have measure zero, since d has measure zero. Hence, W(d) cannot be an interval. The following lemma states that the sets must be strictly increasing in d. Lemma 1 Suppose that we have d1, d2 ∈ [0, 1] and d1 < d2. Then, for any w1 ∈ W(d1) and w2 ∈ W(d2) we have w1 < w2. Proof: See Appendix A.  A direct result of Lemma 1, together with the non-interval restriction of W(d), is that W(d) is single valued. We therefore denote w(d) = W(d), using d(w) = de(w) for its inverse. After partially integrating (7) and (8) and taking the derivatives of the left- and right-hand sides of equation (4), we obtain the following first-order derivative of the bid-rent function for unemployed and employed workers:4 Ψ′u(d) = −aλ ∫ w+ ϕ [ 1− F (w) ρ+ δ + λψs(d(w))(1 − F (w)) ] dw, (10) and ∂Ψe(d,w) ∂d = −τ − aλψ ∫ w+ w [ 1− F (x) ρ+ δ + λψs(d(x))(1 − F (x)) ] dx. (11) We define the set of distances at which the unemployed workers live from the CBD by Du. The introduction of Du has an implication for w(d), because it is not defined for any d ∈ Du. Therefore, we define w(d) = suppd′ aλ ∫ w+ ϕ [ 1− F (x) ρ+ δ + λψs(d(x))(1 − F (x)) ] dx. (17) In that case, all unemployed workers allocate themselves farther away from the CBD than all employed workers, and du = 1 − u. This case was called the “segregated city” by Wasmer and Zenou (2002). The final step is to determine the equilibrium land rents. We can use that R′(d) = ∂Ψe(d,w(d)) /∂d for all d /∈ Du. Hence, the derivation of R(d) between du + u and 1 results from solving the differential equation with initial condition R(1) = RA. The solution of R(d) between du and du + u is in line with Wasmer and Zenou and simply equals R(du + u) + Ψ′u(·)(d − du − u). The differential equation for values between 0 and du is identical to the one for values between du + u and 1, but now with an initial condition for R(du). Further details of the derivation of the equilibrium land rents can be found in Appendix C. Unlike the situation of Wasmer and Zenou (2002), our rent function is not linear for all d /∈ Du, since a decrease in distance implies not only an improvement of search efficiency of the workers and a decrease in commuting costs but it also implies that the wage of the worker is lower when located closer to the CBD. Hence, such a worker has a higher likelihood of obtaining wage offers higher than the present wage than does the worker who lives a little farther away 6This inequality can be derived after equating (10) and (11) and using w = w˜ and w˜ > ϕ and ψ < 1. 14 Distance to CBD10 RA Rent u du du + u Figure 2: Illustration of the partial equilibrium at the housing market from the CBD, even if both workers had been living at equal distance. Figure 2 illustrates the partial equilibrium at the housing market. Rent equals RA at distance 1 from the CBD and gradually increases when moving closer to the CBD. There is an increasing rate at which the rent increases when moving left in the figure, because ∂Ψe(d,w(d))/∂d increases with distance. The unemployed live in between du and du +u from the CBD. Rents are linear in this interval. Workers with low wages live somewhere between 0 and du and these workers pay the highest rents. Comparing our results with those of Wasmer and Zenou (2002), we conclude that the intro- duction of on-the-job search makes it possible to explain many more equilibrium city structures than only the segregated and integrated city structures in a model without on-the-job search and exogenous search effort. Obviously, many cities, especially in the mainland of Europe, cannot be classified as either segregated or integrated and are more in line with the donut-shaped city outcome that we described in this section with employed workers at a short and high distance from the CBD and the unemployed in between these employed workers.7 Of course, the cities in 7There is some empirical evidence for this statement: Gobillon et al. (2010) investigate the inner city of Paris 15 mainland Europe are also those with a high level of (historical) amenities, while their city struc- tures are also highly influenced by social housing policies. Still, our paper makes one important prediction, and that is that when ψ is high, then it is more likely to obtain a donut-shaped city than when ψ is very low (in which case, only a segregated or integrated city can occur). We are aware of only one paper, Ridder and Van den Berg (2003), that tries to estimate ψ for a number of different countries using macro-data. They find ψ to be substantially lower in the United States and the United Kingdom, than in the Netherlands, Germany and France. 3.2 Partial equilibrium at the labor market Equalizing in- and outflow of the unemployed results in the following steady-state level of unem- ployment: (1− u)δ = uλs ( du + u 2 ) . (18) Similarly, equalizing the in- and outflow of workers with a wage at least paying w results in δ(1 −G(w))(1 − u) = λ(1 − F (w)) { us ( du + u 2 ) + (1− u)ψ ∫ w ϕ s(d(x))dG(x) } , (19) where the left-hand side is the number of workers dismissed from a job that pays at least w. The right-hand side is the number of unemployed workers who find a job paying at least w plus the number of employed workers paid less than w who find such a job. Appendix D shows that conditional on the wage-offer distribution F , the function G can be solved as a root of a second-order polynomial. and conclude that the highest unemployment rates are located close to the CBD. However, unemployment rates in the CBD itself are lower than the metropolitan average. Finally, A˚slund et al. (2010) find that for Stockholm the highest job density can be found in the CBD and that unemployment rates are especially high in the suburban areas around that district. 16 The reservation wage is determined by ρVu(du) = ρVe(d(ϕ), ϕ). (20) Lemma 1 states that d(w) is decreasing. From the second assumption of Subsection 2.4 we know that d(ϕ) can only take two values: either it equals zero when du > 0 or it equals u when du = 0. Appendix E calculates these two cases. 3.3 Wage posting Using Lemma 1 and equations (18) and (19), we see that the value of a vacancy that posts a wage w equals ρVv(w) = 1− u v δ 1 −G(w) 1 − F (w) p− w ρ+ δ + λψs(d(w))(1 − F (w)) − γ. (21) Since all firms are homogeneous, it is necessary that Vv(ϕ) = Vv(w), for all wages in between ϕ and w1 < w˜. Note that it can never be optimal to post a wage just below w˜. This is because there is a discontinuity in the quit rate of workers at this wage level. Workers who are paid marginally below this wage live closer to the CBD than do unemployed workers, while workers who are paid exactly this wage or more live farther away from the CBD than unemployed workers do. This implies that the number of workers who quit their jobs because they find a better wage offer drops sharply at this wage level. This does not occur because of a mass point in the wage-offer distribution, but merely because the workers have a lower search intensity. It also implies that the value of a vacancy (i.e. Vj(w)) jumps upward at the wage level of w˜ and that no firms will pay slightly below this wage. Therefore, a firm only considers paying below w˜ when its profit margin per worker is sufficiently higher than the firms that pay above the threshold. The condition to be satisfied is Vj(w1) = Vj(w˜). The wage distribution above w˜ can be obtained by using Vj(w˜) = Vj(w) for every w between w˜ and w+. 17 4 General equilibrium We close our model by assuming that opened vacancies must have an expected profit equal to zero. Using this assumption and substituting w = ϕ into equation (21), we obtain 1− u v δ p− ϕ ρ+ δ + λψs(d(ϕ)) = γ. (22) For the general equilibrium, we need to determine the variables ϕ, u, du and v, the distributions F and G and the rent function R. The restrictions for this equilibrium are equations (15), (18), (19), (20) and (22), as well as the solutions for the land rent derived in Appendix C. Unless otherwise stated, we assume for this section and the next that µ = ξ = ψ = 1. Our main reason is that general results are extremely hard to obtain. There is also another reason to focus on this case: Gautier et al. (2010) show in a model without endogenous land allocation that among a large set of wage-setting mechanisms, only wage posting can result in an efficient outcome, as long as ν = ξ = ψ = 1. The issue here is whether this result still holds for our extended model. Note that the literature on job search models without on-the-job search has not found a lot of evidence for an increasing-returns-to-scale matching technology (Petrongolo and Pissarides, 2001). Therefore, we also look at the constant-returns-to-scale matching technology in the calibration section. Theorem 3 An equilibrium with a positive number of vacancies exists whenever p− b > 2γ(ρ + δ)α 1 2s0 − a . (23) Proof: See Appendix F. Condition (23) is standard and states that we can expect a positive number of vacancies if there is a sufficient difference between production at the workplace and at home. Unfortunately, a 18 general proof of uniqueness is difficult for this model, but we can prove uniqueness under the conditions of Lemma 4. Lemma 4 Suppose that ρ/δ ↓ 0 and (23) applies. Then, the equilibrium is unique. Proof: See Appendix G.  The condition that ρ/δ ↓ 0 is standard in job search models, but obviously debatable. It is possible to obtain the following very conservative sufficient condition ρ δ < adu s0 − adu . Note that the right-hand side of this restriction is the percentile loss in the search efficiency of the most efficient unemployed worker in comparison to her search efficiency at distance zero from the CBD. To check whether this restriction is satisfied for reasonable values of ρ, s0, a and δ, we look at estimates obtained from Ridder and Van den Berg (2003). They estimate δ to be in between 5 percent for Germany and 42 percent for the United States. Recent estimates of the discount rates indicate figures as low as 2 percent (see Laibson et al., 2007, and Gautier and Van Vuuren, 2015). It implies that in the United States, workers located in the CBD should be at least about 2 percent more efficient in their search as the unemployed for the condition to be satisfied. In Germany, unemployed workers should have at least a 40 percent lower search efficiency than the workers located in the CBD. Note that the restriction is conservative, since it is unlikely to hold whenever a is very small – whereas our model with a = 0 has been proven to have a unique equilibrium under very general conditions. 19 5 Welfare analysis In line with Zenou (2009), we define the social welfare function as Ω(v, du) = ∫ ∞ 0 ( (1− u(t))p + u(t)b− γv − (1− u(t))τde ) e−ρtdt ≡ ∫ ∞ 0 ωt(v, du)e−ρtdt. (24) The first term between brackets is production of the employed workers, the second term is production of the unemployed homeworkers, the third term is total costs for vacancies and the last term is the average commuting costs of the employed workers. We consider the situation in which the social planner faces the same search frictions as the decentralized market, and investigate the outcome when the social planer is able to maximize welfare by choosing v and du. This implies that the social planner has as state variable u with du dt = δ(1− u(t))− λu(t)s ( du + u(t) 2 ) . (25) We show in Appendix H that the first-order derivative of Ω with respect to du is strictly positive whenever τ > λa p− bρ+ δ + λs0 , (26) assuming that either ν in (2) or ψ equals one, while it is strictly negative in the case when the >-sign is replaced by a <-sign. This implies that, with the exception of equality of the left- and the right-hand sides, the social planner prefers either du = 0 or du = 1−u. This can be explained as follows: suppose that the social planner is able to move the unemployed workers somewhat closer to the CBD – say, by distance ∆. The social planner benefits from this move, because of the reduction in the equilibrium unemployment rate (i.e. u′(du)∆) times the difference in production between employed and unemployed workers (p − b). Second, these newly employed workers increase total commuting costs (i.e. u′(du)τte∆). Third, the commuting costs further 20 increase by τu∆, because the existing employed workers have to commute a greater distance (since there are u unemployed workers that move a distance ∆ closer to the CBD and hence also u employed workers should move the same distance from the CBD). It can be shown that the first term is proportional to (p−b)aλ, while the last two terms are proportional to (ρ+δ+λs0)τ , resulting in (26). From the discussion following equation (16), we know that when ψ = 1, the decentralized market always implies that du > 0, even when the employed have very low commuting costs. This result automatically implies the inefficiency of the decentralized market outcome, including wage posting. Still, the question remains whether the social planner would prefer a more integrated city structure than that obtained by the market. We arrive at the following result. Lemma 5 Suppose that ρ < (a(1− u)/(s0 − a(1−u)))δ. Suppose that the social planner prefers the segregated city to be the preferred outcome. Then the segregated city is also the outcome of the decentralized market. Proof: see Appendix I. The condition in Lemma 5 is conservative. Our simulation results indicate that the results presented in this lemma are true even when the condition is not satisfied. The proof behind the lemma is based on the fact that for a fixed level of λ (and hence the number of vacancies), the location incentives of the workers are only in line with the policy maker when workers are paid their marginal productivity. In contrast, unemployed workers are less willing to pay for closer locations if the wages are lower than their marginal productivity, while the willingness of employed workers to pay for these locations is higher. Hence, in the market equilibrium, employed workers live too close to the CBD. This result is intuitive and implies that workers do not make 21 socially optimal location decisions when not fully compensated for their search activities. It also implies that any model in which workers are paid monopsony wages can be expected to make this prediction. Among these are wage-bargaining models and models in which firms are allowed to make counteroffers (such as in Postel-Vinay and Robin, 2002). The only reason why we still need the assumption of wage posting is to allow for endogenous vacancy creation as well. We can show in our case of wage posting that the unemployment rate is higher under the market than under the social planner.8 We show in Appendix I that this difference further enlarges the gap between the outcomes of the social planner and the market. The fact that unemployment is higher under the market outcome is in line with previous results. Gautier et al. (2010) find that the unemployment rate under wage posting and ξ = ν = ψ = 1 is equal for the market and the planner. This result implies that the wages have exactly the level under wage posting to obtain the right amount of vacancies. In our case, however, wages are higher and hence unemployment is higher, since firms have an additional reason to offer higher wages. In the standard wage-posting framework, firms offering higher wages reduce the likelihood for their workers to receive better outside offers. In our case, firms that offer higher wages also reduce the number of these outside offers, since better-paid workers live farther away from the CBD. A couple of instruments may be used to restore the housing decisions of the decentralized market. We propose here an instrument that reduces the costs of unemployed workers to live closer to the CBD. The introduction of such an instrument implies that condition (15) becomes τ − χ = aλ ∫ w˜ ϕ [ 1− F (x) ρ+ δ + λs(d(x))(1 − F (x)) ] dx for the case that ψ = 1, where χ is the rent reduction received by the unemployed worker when 8This statement is formalized in Lemma 6. 22 she lives one unit closer to the CBD. It implies that she receives (1 − d)χ when she lives at distance d from the CBD. This policy instrument can restore the location decisions made by the decentralized market. Note that this policy instrument is exactly the same instrument as the single effective instrument considered by Smith and Zenou (2003). 6 Simulations of the model We simulate our model using the same set of parameter values as Wasmer and Zenou (2002), who use a bargaining model with the bargaining power equal to 0.5 and assume that there is no on-the-job search (i.e. ψ = 0). Hence, δ = 0.1, ρ = 0.05, τ = b = 0.3, α = p = 1, ξ = ν = 0.5 and γ = 0.3. In addition, we set ψ = 1 for the model with on-the-job search. The simulation results are listed in panel A of Table 1. For the sake of completeness, we replicate the results of Wasmer and Zenou and report the optimal welfare for both models.9 Although welfare in the Wasmer and Zenou (2002) model is optimal, given both the city structure and model parameters, a suboptimal city structure can be chosen by the market. In fact, equation (26) is also the condition for the social planner to choose the segregated city structure, while Wasmer and Zenou (2002) show that the market chooses this city structure whenever τ > γavus(du + 1/2) . Our simulations show that the market chooses a suboptimal city structure only when a = 0.5 and the loss in welfare is small. 9See also page 531 of Wasmer and Zenou, 2002. The only difference in our table is welfare, since we assume absent commuting costs for the unemployed. 23 Wasmer and Zenou On-the-job search a u v du s(du) ωt ω∗t a u v du s(du) ωt ω∗t a A. ν = ξ = 0.5, α = 1 1 0.0686 0.131 0 0.966 0.763 0.763 0.139 0.246 0.049 0.881 0.684 0.693 0.75 0.0685 0.130 0 0.974 0.764 0.764 0.153 0.258 0.106 0.864 0.674 0.684 0.6 0.0685 0.129 0 0.979 0.764 0.764 0.230 0.247 0.612 0.564 0.665 0.680 0.55 0.0685 0.129 0 0.981 0.764 0.764 0.252 0.238 0.748 0.519 0.668 0.679 0.5 0.121 0.120 0.879 0.530 0.763 0.764 0.236 0.299 0.759 0.560 0.674 0.680 0.25 0.100 0.106 0.894 0.763 0.776 0.776 0.202 0.226 0.798 0.775 0.695 0.702 0.1 0.0915 0.100 0.900 0.905 0.782 0.782 0.186 0.219 0.813 0.909 0.704 0.712 B. ν = ξ = 1, α = 8 1 0.0806 0.149 0 0.959 0.750 0.750 0.0786 0.157 0.027 0.934 0.749 0.750 0.75 0.0805 0.147 0 0.970 0.750 0.751 0.0786 0.157 0.047 0.935 0.750 0.751 0.6 0.0804 0.146 0 0.976 0.750 0.751 0.0799 0.156 0.091 0.922 0.750 0.751 0.55 0.0804 0.146 0 0.977 0.751 0.751 0.0820 0.156 0.142 0.899 0.750 0.751 0.5 0.1458 0.137 0.854 0.536 0.710 0.751 0.1057 0.137 0.543 0.702 0.749 0.751 0.25 0.1188 0.121 0.881 0.765 0.733 0.764 0.1098 0.133 0.890 0.763 0.764 0.764 0.1 0.1082 0.114 0.891 0.905 0.742 0.771 0.0996 0.125 0.900 0.905 0.771 0.771 aNumbers in bold imply that the planner chooses a segregated city. Table 1: Calibration results. The variable ω∗t is the per period optimal welfare. 24 Comparing both models, it becomes immediately clear that the unemployment rates are higher in our model with on-the-job search. This result is not surprising, given the fact that we use a constant-returns-to-scale contact technology, which implies that the search efficiency of the unemployed is frustrated by the search activities of the employed workers. Since the vacancy rate is higher under on-the-job search, it also implies that welfare should be lower. More surprising is the result that welfare increases for values of a between 0.5 and 1. This result is also related to the choice of a constant-returns-to-scale contact technology – since even the social planner is not able to obtain higher welfare levels when a becomes smaller. Thus, a lower level of a also makes search more efficient for the employed workers, thereby reducing the opportunities of the unemployed in such a way that it is even possible that more vacancies are needed in order to obtain the same level of unemployment. Note that for the model of on-the-job search only the simulations with a < 0.6 result in a completely segregated labor market (where du = 1 − u), while all other cases result in a mixed city in which the unemployed are living in between the low-paid and high-paid employed workers. Comparison between the two models is complicated by the constant-returns-to-scale matching function, since employed workers congest the labor market for unemployed workers, thereby considerably reducing their contact rate. Therefore, we also look at a model with an increasing- returns-to-scale matching function, where ν = ξ = 1. All other parameters are the same – apart from α, which is set to 8 here in order to obtain reasonable unemployment rates. The results are presented in panel B of Table 1. The calibration results of the two models are very similar for a up to 0.5. For smaller values, the unemployment rate for the model with on-the-job search is somewhat lower and welfare is somewhat higher than under the model without on-the- job search,10 implying that the high unemployment rates in Panel A were due completely to 10Note that the Hosios condition of the model without on-the-job search no longer holds, implying that welfare 25 congestion. This result is also true for the difference between the optimal and market welfare. Welfare can be interpreted here as the realized percentage of total potential output in the case without any search frictions and commuting costs. This interpretation implies that the market loses about 25 percent of that potential production when a = 1 and that the social planner is able to reduce that loss only by less than half a percent (i.e. 0.001 / 0.25). The welfare loss due to suboptimal location choices of the workers is therefore quite small. Hence, a policy to reduce the loss of efficiency due to suboptimal locations (such as the rent subsidy for the unemployed discussed in the previous section) cannot have a large impact. 7 Discussion and extensions The main contribution of this paper is the observation that a social planner (weakly) prefers a more integrated city than the city structure that is formed by the decentralized market. This result is easily explained in a model in which there are homogeneous jobs, since in such a model on-the-job search does not add any value to social welfare. The social planner is therefore only interested in the commuting costs of employed workers and the job-finding probabilities of unemployed workers. Hence, one of the necessary restrictions to obtain this outcome in the market is that employed workers also only consider their commuting costs when making decisions about their location. However, since there are differences in wages between identical jobs, employed workers also look at the potential to receive a higher wage offer – and this externality explains our inefficiency result. The question arises whether this result is robust when we loosen the assumption of homo- geneity in jobs. We therefore look in this subsection at a stochastic job-matching model in the tradition of Pissarides (2000, chapter 6). In order to ease notation, we assume that p follows a within a city structure may no longer be optimal. 26 standard uniform distribution (although our results can easily be extended to any distribution). It has been shown in earlier work that wages are ranked according to their productivity; hence, workers can only have different wages if they also have different levels of productivity (Gautier et al. (2010) provide a formal proof). We denote the productivity distribution of employed workers by H. In addition, we denote K(p) for the wage that is paid for a match that yields productivity level p. Note that this function is strictly increasing, since wages are ordered based on their productivity. Hence, F (K(p)) = p and G(K(p)) = H(p), where F and G still follow the definitions of the earlier sections. Using the same techniques as in the standard version of our model, we can use bid rents to determine the location of unemployed workers. We thus rewrite equation (15) to obtain τ = aλ(1− ψ) ∫ 1 p K ′(x) [ 1− x ρ+ δ + λψs(d(x))(1 − x) ] dx + aλ ∫ p˜ p K ′(x) [ 1− x ρ+ δ + λψs(d(x))(1 − x) ] dx, (27) where p˜ = H−1(du/(1 − u)) and p is the lowest productivity level accepted by unemployed workers. The welfare function of the social planner becomes Ω(v, du, p) = ∫ ∞ 0 { (1− u(t))EHp+ u(t)b− v(t)γ − deτ } e−ρtdt. Solving the planner’s problem is complicated by its dependence on the equilibrium distribution H. Therefore, we follow an alternative: in Appendix J we show that for any given value of λ, we have Ω(v, du, p) = ρV ∗u (du + u/2) = b+ λs(du + u/2) ∫ 1 p 1− p ρ+ δ + λψs(d(p))(1 − p)dp, (28) whenever ρ/δ ↓ 0 and where V ∗u is the value of the unemployed workers when everybody is paid according to their productivity level. In addition, we prove in Appendix K that Ω(v, du, p) = ρV ∗e (p, d(p)) = p− τd(p) + λψs(d(p)) ∫ 1 p 1− p ρ+ δ + λψs(d(p))(1 − p)dp, (29) 27 whenever ρ/δ ↓ 0 and where V ∗e is the value of the employed workers when everybody is paid according to their productivity level. The combination of the last two equations implies that when everybody is paid according to their productivity, then the lifetime expected utility is independent of the labor market state and equal to the welfare function of the planner. This result comes from the fact that when ρ is small, then the social planner is interested only in the steady-state distribution of the workers and not in the state of the worker at the particular moment that the social planner has to make her decision. This result also implies that by placing an unemployed worker closer to the CBD, the gains must outweigh the costs of placing employed workers farther away from the CBD. Hence, the condition for Ω to be maximized with respect to du is identical to the condition ∂V ∗e (p, d(p)) ∂d(p) = ∂V ∗u (du) ∂du . Moreover, by equating the derivatives of (28) and (29), we obtain τ = aλ(1− ψ) ∫ 1 p [ 1− x ρ+ δ + λψs(d(x))(1 − x) ] dx + aλ ∫ p˜ p [ 1− x ρ+ δ + λψs(d(x))(1 − x) ] dx. (30) This differs from (27) only with regard to the disappearance of K ′(·). Since a monopsony model in general implies that K ′(·) 6= 1, the outcome of the social planner differs from the market equilibrium. Table 2 reports the results of a calibration exercise based on the model with heterogeneous firms. We use exactly the same calibrated parameters as in Panel B of Table 1 except for the commuting costs, which are set to 0.15. Setting these to their original level would have resulted in a segregated market equilibrium for any level of a (which is not such an interesting case to investigate). In line with the results of this section, we find a difference between the market and 28 Planner Market a u v du s(du) ωt p u v du s(du) ωt p ν = ξ = 1, α = 8 1 0.057 0.270 0 0.971 0.7063 0.290 0.060 0.291 0 0.970 0.706 0.300 0.75 0.056 0.257 0 0.979 0.708 0.290 0.070 0.286 0.152 0.859 0.702 0.324 0.6 0.055 0.249 0 0.983 0.708 0.290 0.074 0.281 0.240 0.857 0.702 0.358 0.55 0.053 0.214 0.060 0.952 0.707 0.290 0.076 0.280 0.274 0.829 0.702 0.343 0.5 0.055 0.213 0.065 0.954 0.708 0.293 0.079 0.278 0.329 0.816 0.702 0.353 0.25 0.056 0.247 0.102 0.967 0.710 0.288 0.098 0.261 0.856 0.773 0.707 0.430 0.1 0.070 0.329 0.390 0.903 0.716 0.276 0.084 0.261 0.915 0.904 0.715 0.431 Table 2: Calibration results for the model with heterogeneous firms. the planner; surprisingly, the differences in welfare are somewhat higher than those reported in Table 1. Note that our model is also restrictive in the linearity of the search efficiency function. Using a slight variation of the derivations in Appendix H, we obtain the following first-order condition of du for an interior solution of the social planner: τ = λs ′(du + u/2) ρ+ δ + λs(du + u) + λs′(du + u/2) . Hence, interior solutions are possible even in the case of homogeneous workers, but with a non- linear search efficiency function. However, this is not likely to change the conclusion that the market yields a more segregated outcome than the social planner does. 8 Concluding remarks This paper investigated the impact of the location decision of workers when they are allowed to search on the job. Two conclusions from our model differ markedly from the present state of the literature (i.e. Wasmer and Zenou, 2002). First, it is possible to explain a large set of city structures rather than merely the extreme types (i.e. completely integrated or segregated). We realize that we are not the first to obtain such a result, but we show that this result can 29 be derived even without assuming house consumption to be a normal good for any income level (as in, for example Fujita, 1989). Second, it is possible to show that the city structure that results from the market is inefficient under very general conditions. Hence, although there is a role for governmental intervention, we have also shown that the benefits from such a policy are potentially small. Even though some of our results are based on wage posting, many results can be shown to translate automatically to other models, such as wage bargaining models. Still, our model does not allow for all types of wage-setting mechanisms – such as models in which firms make counteroffers (as in Postel-Vinay and Robin, 2002). For this model, the only potential equilibrium results in a completely segregated city structure. This result is obtained from the fact that unemployed workers are offered their value of unemployment when accepting a job; their values therefore do not depend on distance other than the rents. This also implies that their bid- rent function is always equal to zero. Hence, location decisions are obviously inefficient in this wage-setting mechanism as well. In future work, we can extend our analysis taking account of endogenous search effort such as in Smith and Zenou (2003). References A˚slund, O., J. O¨sth and Y. Zenou (2010), “How important is access to jobs? Old question improved answer”, Journal of Economic Geography, 10, 389–422. Burdet, K. and D.T. Mortensen (1998), “Wage differentials, employer size and unemploy- ment”, International Economic Review, 39, 257–73. 30 Fujita, M. (1989), “Urban economic theory: land use and city size”, Cambridge University Press, Cambridge. Garibaldi, P. and E.R. Moen (2010), “Job to job movements in a simple search model”, American Economic Review, 100, 343–47. Gautier, P.A. and A.P. van Vuuren (2015), “The estimation of present bias and time preferences using land-lease contracts ”, working paper, VU Amsterdam. Gautier, P.A. C.N. Teulings and A.P. van Vuuren (2010), “On-the-job search, mismatch and efficiency”, Review of Economic Studies, 77, 245–72. Gobillon, L. T. Magnac and H. Selod (2010), “The effect of location and finding a job in the Paris region”, Journal of Applied Econometrics, 26, 1079–1112. Kawata, K. and Y. Sato (2012), “On-the-job search in urban areas”, Regional Science and Urban Economics, 42, 715–26. Laibson, D. A. Repetto and J. Tobacman (2007), “Estimating discount functions with consumption choices over the lifecycle”, NBER Working Paper No. 13314, Cambridge (MA). Manning, A. and B. Petrongolo (2015), “How local are labor markets? Evidence from a spatial job search mode”, working paper, London School of Economics. Marinescu, I. and R. Rathelot (2015), “Mismatch unemployment and the geography of job search”, working paper, University of Chicago. Menzio, G. and S. Shi (2011), “Efficient search on the job and the business cycle”, Journal of Political Economy, 119, 468–510. Petrongolo, B. and C.A. Pissarides (2001), “Looking into the black box: a survey of the 31 matching function”, Journal of Economic Literature, 39, 390–431. Pissarides, C.A. (2000), “Equilibrium unemployment theory ”, The MIT Press, Cambridge (MA). Postel-Vinay, F. and J.M. Robin (2002), “Equilibrium wage dispersion with worker and employer heterogeneity”, Econometrica, 70, 2295–2350. Ridder, G. and G.J. van den Berg (2003), “Measuring labor market frictions: a cross- country comparison”, Journal of the European Economic Association, 1, 224–44. Smith, T.E. and Y. Zenou (2003), “Spatial mismatch, search effort, and urban spatial struc- ture”, Journal of Urban Economics, 129–56. Wasmer, E. and Y. Zenou (2002), “Does city structure affect job search and welfare”, Journal of Urban Economics, 51, 515–41. Zenou, Y. (2009), “Urban labor economics”, Cambridge University Press, Cambridge. Appendix A Proof of Lemma 1 We have that Ψe(d1, w1) ≥ Ψe(d1, w2) and Ψe(d2, w2) ≥ Ψe(d2, w1). Hence for any pair of d1 and d2 we have Ψe(d1, w1)−Ψe(d2, w1) =Ψe(d1, w1)−Ψe(d1, w2) + Ψe(d1, w2)−Ψe(d2, w2) + Ψe(d2, w2)−Ψe(d2, w1) ≥ Ψe(d1, w2)−Ψe(d2, w2). Since the second-order derivatives equal zero, we have ∂Ψe(d, w1) ∂d = Ψe(d2, w1)−Ψe(d1, w1) d2 − d1 , and ∂Ψe(d, w2) ∂d = Ψe(d2, w2)−Ψe(d1, w2) d2 − d1 . Using the inequality above and d2 > d1, we obtain ∂Ψe(d, w1) ∂d ≤ ∂Ψe(d, w2) ∂d . 32 But then we obtain that w1 ≤ w2 since we know that the cross-partial derivative of Ψ is positive. Now suppose that we have equality. This implies that also the values in between d1 and d2 should be occupied by workers with wage w1.11 Hence there is a mass point at w1. This is ruled out by the wage posting, i.e. see Section 2.4.  B Proof of Lemma 2 Suppose that Du is not convex. It implies that there is an interval (d1, d2) that is not occupied by unemployed workers while d1, d2 ∈ Du. Since the value of unemployed workers should be constant on Du, we have Vu(d1) = Vu(d2). From the main text (proven below lemma 12), we also know that R′(d) = ∂Ψe(d, w(d)/∂d for all d ∈ (d1, d2). Taking derivatives of (3) and using the same steps to obtain equation (10), we obtain ρV ′u(d) = Ψ′u(d)−R′(d) = Ψ′u(d)− ∂Ψe(d, w(d)) ∂d for all d ∈ (d1, d2). Note that since by assumption d1 is a local maximum of Vu(d) we must have that ρV ′u(d1) < 0 and hence Ψ′u(d) < ∂Ψe(d, w(d))/∂d. Since we know that for all d ∈ (d1, d2), the second-order derivative of R(d) equals ∂Ψe(d, w(d)) ∂d∂w w ′(d) > 0 where the inequality comes from Lemma 1 and (9). Since we know that Ψ′u(d) is constant, it implies that ρV ′(d) < ρV ′(d1) < 0 for all d ∈ (d1, d2). But then Vu(d2) < Vu(d1) which is a contradiction to the assumption that both should be equal.  C Derivation of the land rents As stated in the main text, we have that R′(d) = ∂Ψe(d, w(d))∂d . (31) Using the fact that R(1) = RA and substitution of (31) results in R(d) = RA − ∫ 1 d ∂Ψe(d, w(x)) ∂d dx. (32) Using (11) and changing the order of the integrals, the second term at the right-hand side of (32) equals ∫ 1 d ∂Ψe(d, w(x)) ∂d dx = τ(d − 1)− aλψ ∫ w+ w(d) (1− F (w)) ρ+ δ + λψs(d(w))(1 − F (w)) ∫ d(w) d dxdw. Solving for the integrals and substitution of the result into (32) gives R(d) = RA − τ(d − 1) + aλψ ∫ w+ G−1( d−u1−u ) (1− F (x))(d(x) − d) ρ+ δ + λψs(d(x))(1 − F (x))dx. For du < d ≤ du + u, we have R(d) = R (du + u)− ∫ du+u d R′(x)dx, (33) 11Suppose that w1 = w2 and suppose that there is a c ∈ (0, 1) for which there is a w∗ 6= w1 with Ψ(d(c), w1) < Ψ(d(c), w); d(c) = cd1 + (1 − c)d2. Suppose that w∗ > w1, then since d(c) < d2, we have from the above that w∗ ≤ w2 = w1 which contradicts that w∗ > w1. Suppose that w∗ < w1, then since d(c) > d1 we must have w∗ ≥ w1. 33 and R(d) = Ψu(d). Hence R′(d) = Ψ′u(d) = −aλ ∫ w+ ϕ [ (1− F (w)) ρ+ δ + λψs(d(w))(1 − F (w)) ] dw. (34) Substitution of (34) into (33) results in R(d) = R(du + u)− aλ ∫ w+ ϕ (1− F (x)) ρ+ δ + λψs(d(x))(1 − F (x))dx(d− du − u). Finally, when d < du, we can use (31) to obtain R(d) = R (du)− ∫ du d ∂Ψe(d, w(x)) ∂d dx. (35) Using (11) and changing the order of the integrals, we obtain ∫ du d ∂Ψe(d, w(x)) ∂d dx = −τ (du − d)− aλψ ∫ w+ w(du) (1 − F (w)) ρ+ δ + λψs(d(w))(1 − F (w)) ∫ du d dxdw + ∫ w(du) w(d) (1 − F (w)) ρ+ δ + λψs(d(w))(1 − F (w)) ∫ d(w) d dxdw. Solving for the integrals and substitution of the result into (35) results in R(d) = R(du)− ( τ + aλψ ∫ w+ w˜ (1− F (x)) ρ+ δ + λψs(d(x))(1 − F (x))dx ) (d− du) +aλψ ∫ w˜ G−1( d1−u ) (1− F (x))(d(x) − d) ρ+ δ + λψs(d(x))(1 − F (x))dx. (36) D Derivation of G(w) Based on the definition of s(·) from equation (1) and the definition of d(·) from equation (13), we obtain ∫ w ϕ s(d(x))dG(x) = { G(w)s0 − (1− u)a 12G(w)2 if w ≤ w˜ G(w) (s0 − au)− (1− u)a 12G(w)2 + du 1−uau otherwise . It implies that G(w) is the root of a second-order polynomial. E Derivation of the reservation wages Using equation (20) and the values of Vu and Ve in equations (3) and (4) and then using equation (36), we obtain that ϕ = b+ τdu + λ [1− ψ] s(du) ∫ w+ ϕ (1− F (w)) ρ+ δ + λψs(d(w))(1 − F (w))dw − aλψ ∫ w˜ ϕ (1− F (w)) (du − d(w)) ρ+ δ + λψs(d(w))(1 − F (w))dw. (37) in the case that du > 0. For the case that du = 0, we obtain ϕ = b+ τu + λs(u) [1− ψ] ∫ w+ ϕ (1 − F (w)) ρ+ δ + λψs(d(x))(1 − F (w))dw. (38) 34 F Proof of Theorem 3 For the moment suppose that du > 0. Using ν = ξ = 1, the zero-profit condition (22) can be written as u2s2 (u 2 + du ) p− ϕ us (u 2 + du ) ρ+δ δ + (1− u)ψs0 = γδα . (39) For u = 0, the left-hand side equals zero. For u = 1, the left-hand side equals s (1 2 + du ) (p− ϕ) δρ+ δ . Since u = 1, we have that ϕ = b + tdu. From equation (17), we obtain that du = 1 − u = 0 and hence ϕ = b. Substitution of this result into the left-hand side of (39) and using the definition of s(.), we obtain that the left-hand side of (39) at u = 1 equals ( s0 − a 2 ) (p− b) δρ+ δ . Hence equation (39) has a root under the assumptions made in the proposition. The reservation wage equation (37) has a root, because if ϕ = 0, then the left-hand side of that equation equals zero, while the right-hand side is positive. When ϕ = p, then the left-hand side equals p, while the right-hand side equals b+ tdu < b+ t. Since p− b is assumed to be larger than τ , it immediately implies that this equation has a root. In the case that du = 0, equation (39) must be replaced by u2s2 (u 2 + du ) p− ϕ us (u 2 + du ) ρ+δ δ + (1− u)ψs(u) = γδα . The arguments above still apply for this case.  G Proof of Lemma 4 For the situation of ψ = 1 and ρ/δ ↓ 0 we can rewrite equation (39) as u2s2 (u 2 + du ) p− ϕ us (u 2 + du ) + (1− u)s0 = γδα . (40) The derivative of the numerator with respect to u is obviously positive. The denominator can be rewritten as s0 − au (u 2 + du ) . This has a first-order derivative equal to −a ( u+ du + u ∂du ∂u ) . (41) When du < 1− u, we obtain from the flow condition (18) that ∂du ∂u = auλ δ + λs (du + u) > 0. In the case that du = 1− u it equals one and hence ∂du/∂u in (41) is larger than -1. Hence the first-order derivative of the denominator of (40) is smaller than −adu which is strictly negative in the case that ψ = 1. Hence the left-hand side of (40) is strictly increasing and hence has a single root.  35 H Derivation of equation (26) Using the definition of de and (13) and solving for the integrals results in de = ∫ w+ w d(x)dG(x) = (1 − u) ∫ w+ w G(x)dG(x) + u ∫ w+ w˜ dG(x) = 12 (1 + u)− du 1− uu. (42) After substitution of (42), the Hamiltonian for the maximization of Ω with respect to equation (25) equals H = e−ρt [ (1 − u)p+ ub− γv − 12(1− u)(1 + u)τ + duuτ ] + ζ [ δ(1− u)− λus ( du + u 2 )] , where ζ is the costate variable. Note that we have λ = ( s0 − a 2 − (1− ψ) [s0(1 − u) + au(du + u/2)] )ν−1 vξ. The partial derivatives of λ with respect to u and du equal 0 whenever either ν = 1 or ψ = 1. Based on this, the Euler conditions with respect to v and u equal ∂H ∂v = γe −ρt − ζ (ξ − 1)λus ( du + u2 ) v = 0, (43) and ∂H ∂u = −e −ρt [p− b− (u+ du) τ ]− ζ [δ + λs (du + u)] = −ζ˙, (44) while the first-order derivative with respect to du equals ∂H ∂du = e−ρtuτ + ζuλa. (45) Taking derivatives of equation (43) with respect to t, we can derive that ζ˙ = −ρζ and substitution of this result into (44), we obtain ∂H ∂u = −e −ρt [p− b − (u+ du) τ ]− ζ [ρ+ δ + λs (du + u)] = 0. (46) Solving for ζ from this equation and substitution into (45) and rewriting results in ∂H ∂du = e −ρtu ρ+ δ + λs (du + u) [(ρ+ δ + λs0)τ − aλ(p− b)] , which is larger than zero, because the term between brackets was assumed to be positive. I Proof of Lemma 5 Define τM and τP as the levels of τ that equalize the left- and the right-hand side of (17) respectively (26) and define uM (τM ) and uP (τP ) as their corresponding unemployment rates. Moreover, we denote λP = λ(uP ) and λM = λ(uM ). We have to prove that τP > τM . We use an intuitive proof for this. Take into account that the condition that we have a completely segregated market equilibrium can also be written as Ψu(1 − u)−Ψu(1) < Ψe(1− u)−Ψe(1), (47) i.e. the worker who is at the city edge would benefit less from moving to the location of the highest paid worker (i.e. 1-u) then the best paid worker loses from moving to the city edge. Now, fix for the moment λM and suppose that we have a situation in which all workers are paid their marginal productivity p. 36 Then, using standard Bellman techniques it is possible to obtain that the value of an unemployed worker V U∗ equals ρV U∗(d) = b−R(d) + λs(d) ( V E ∗ (p)− V U∗(d) ) , (48) where V E ∗ (p) = maxd V E ∗(p, d) and where V E∗(p, d) is the value of an unemployed worker eanring p at distance d. In addition, we have V E∗(p, d) = p− τd −R(d) + δV U∗ ρ+ δ . (49) Substitution of (49) into (48) and solving for V U∗ results in ρV U∗(d) = b−R(d) + λs(d)ρ+ δ + λs(d) (p− b− τ +R(d)). The bid rents of the unemployed workers can now be found by solving for R(d) in the equality V U∗(d) = V U∗(1). It equals Ψ∗u(d) = RA + λ(p− b− τ) a(1− d) ρ+ δ + λs(1) . The bid-rent of the employed workers can be found likewise. It equals Ψe(d) = τ(1 − d) +RA. We have Ψu(1− u)−Ψu(1) < Ψ∗u(1− u)−Ψ∗u(1), and Ψe(1− u)−Ψe(1) > Ψ∗e(1− u)−Ψ∗e(1). Moreover Ψ∗u(1− u)−Ψ∗u(1) = λ(p− b− τ) au ρ+ δ + λs(1) , and Ψ∗e(1− u)−Ψ∗e(1) = uτ. Hence a sufficient condition for (47) is (p− b− τ) λaρ+ δ + λMs(1) < τ. Solving for τ this results in τ > (p− b)λMaρ+ δ + λM s0 , and hence τM should be smaller than the left-hand side of this equality. Now, we still have to prove that this also holds for endogenous λ. In Lemma 6 we prove that uM (τM ) > uP (τP ) and hence λM ≡ λ(uM (τM )) < λ(uP (τP )) ≡ λP . Hence τP = aλP p− b ρ+ δ + λP s0 > aλM p− b ρ+ δ + λMs0 = τM .  Lemma 6 Suppose that ψ = ξ = ν = 1. We have that uM (τM ) > uP (τP ). 37 Proof: Using the first-order condition of v of the planner in (43) and (46) to solve for ζ results in ∂H ∂v = γ − p− b− (uP + du) τ ρ+ δ + λs (du + uP ) αuP s ( du + uP 2 ) = 0. Substitution of du = 1− uP , we obtain: ∂H ∂v = γ − p− b− τ ρ+ δ + λs (1)αuP s ( 1− uP2 ) = 0, (50) and substitution of τp from (26) into (50) and rewriting results in uP s ( 1− uP2 ) p− b ρ+ δ + λs0 = γα. Substitution of the flow condition for unemployment (18) results in Q(uP (τP )) ≡ u2P ( s0 − a ( 1− uP2 ))2 (δ + ρuP )s0 − uPa ( 1− uP2 ) (ρ+ δ) = γ α(p− b) . For the decentralized market we assume that τ = τM and therefore substitute du = 1 − u into equation (39) to obtain Q(uM (τM ))(p− ϕ) = γδ α , Since ϕ > b, we obtain that Q(uM (τM )) > γδ α(p− b) , which implies that Q(uM (τM )) > Q(uP (τP )). Note that the numerator of Q is increasing in u, while the denominator is decreasing in u. Hence Q is an increasing function and therefore uP (τP ) < um(τm)  J Derivation of equation (28) Define V˜ ∗e (p, d) as V˜ ∗e (p, d) = V ∗e (p, d)− δ ρ+ δ V ∗ u (du). (51) The standard Bellman equation of V ∗u (du) equals ρV ∗u (du) = b+ λs(du) ∫ 1 p ( V ∗e (x, d(x)) − V ∗u (du) ) dx. (52) Substitution of (51) into (52) and rewriting, we obtain ρV ∗u (du) = ub+ λus(du) ∫ 1 p V˜ ∗e (p, d(p))dp, (53) while using the same techniques we can also derive that [ρ+ δ + λψs(d(p))(1 − p)] V˜ ∗e (p, d(p)) = p− d(p)τ + λψs(d(p)) ∫ 1 p V˜ ∗e (x, d(x))dx. (54) We prove (28) by using the intermediate step that if we prove ρV ∗u (du) = ub+ (1− u) ∫ p p (x− d(x)τ) dH(x) + {(ρ+ δ + λψ(1− p)s(d(p))) (1− u)H ′(p)} ∫ 1 p V˜ ∗e (x, d(x))dx. (55) 38 then we are done by substitution of p = 1 in (55). We prove this by induction. The base is easily obtained by substitution of p = p in equation (55) and substitute the steady-state condition (δ + λψ(1 − p)s(d(p)))(1 − u)H ′(p) = λs(du)u in (53). For the induction step we need to prove ρV ∗u (du) = ub+ (1− u) ∫ p+∆ p (x− d(x)τ) dH(x) + (1− u)H ′(p+ ∆)× {(ρ+ δ + λψ(1 − (p+ ∆))s(d(p+ ∆)))} ∫ 1 p+∆ V˜ ∗e (x, d(x))dx. (56) Based on the assumption that (55) is correct. From (55), we have ρV ∗u (du) = ub+ (1 − u) ∫ p p (x− d(x)τ) dH(x)+ (ρ+ δ + λψ(1 − p)s(d(p))) (1− u)H ′(p)V˜ ∗e (p, d(p))∆+ (ρ+ δ + λψ(1 − p)s(d(p))) (1− u)H ′(p) ∫ 1 p+∆ V˜ ∗e (x, d(x))dx. (57) Substitution of (54) into (57) and rewriting results in ρV ∗u (du) = ub+ (1 − u) ∫ p+∆ p (x− d(x)τ) dH(x) + (1− u)H ′(p)× {λψs(d(p))∆ + (ρ+ δ + λψ(1 − p)s(d(p)))} ∫ 1 p+∆ V˜ ∗e (x, d(x))dx. (58) Since the in- and outflow of workers with a productivity level exactly equal to p should be equal to each other in steady-state, we have (δ + λψ(1 − p)s(d(p))) (1 − u)H ′(p) = λus(du) + λψ(1 − u) ∫ p p s(d(x))dH(x), (59) while also the in- and outflow of workers with a productivity level exactly equal to p+∆ should be equal to each other in steady-state or (δ + λψ(1− (p+ ∆))s(d(p+ ∆))) (1 − u)H ′(p+ ∆) = λus(du) + λψ(1− u) ∫ p+∆ p s(d(x))dH(x). (60) Combining equations (59) and (60), we obtain (δ + λψ(1 − (p+ ∆))s(d(p+ ∆))) (1− u)H ′(p+ ∆) = (1 − u)H ′(p) {λψs(d(p))∆ + (ρ+ δ + λψ(1− p)s(d(p)))} (61) Substitution of (61) into (58) results in (56).  K Derivation of equation (29) This is a direct result of (54) and the definition of V˜ ∗e in (51) ρV˜e ∗ (p, d(p)) = ρ/δ ρ/δ + δ+λs(d(p))(1−p)δ { p− d(p)τ + λs(d(p)) ∫ 1 p V˜ ∗e (x, d(x))dx } + 11 + ρ/δV ∗ u (du). Now, using ρ/δ ↓ 0 gives the desired result.  39