Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 187 Journal of Applied Economics and Business Studies (JAEBS) Journal homepage: https://pepri.edu.pk/jaebs ISSN (Print): 2523-2614 ISSN (Online) 2663-693X Contribution of Spatial Concentration in Exaggerating Inequalities: Across Various Urban Regions of Pakistan Uzma Tabassum1*, Shaista Alam2 and Ambreen Fatima3 1 PhD Scholar and Research Assistant, Applied Economics Research Centre (AERC), University of Karachi 2 Senior Research Economist, Applied Economics Research Centre (AERC), University of Karachi 3Senior Research Economist, Applied Economics Research Centre (AERC), University of Karachi ABSTRACT Industrial agglomeration and inequalities are much of the concerns in recent literature. Pakistan, being a developing country, is also restricted by resource availability to treat all regions equally with respect to investment and development. As a result, regions with growing agglomeration experience higher income levels relative to other regions. To investigate this empirically this study employed propensity score matching (PSM) across urban regions in Pakistan using Labour Force Survey data 2017-18. For agglomeration regional herfindahl indices were estimated and regions with above average index value along with having positive index growth were considered as treated or agglomerated regions. The positive and significant coefficient of regions with treatment signifies that regions with agglomeration were found to have higher relatively income. Hence introducing industrial concentration in untreated regions would be effective in reducing inequalities rather tackling them by reducing agglomeration in agglomerated regions. Keywords Inequality, Agglomeration, Propensity Score Matching, Urban Regions, Herfindahl Index. JEL Classification C31; D30; D3; O14; O15; 1. Introduction Over the last few epochs world trends draw attention to two apparent traits in economic growth. One is rising inequalities and the other is increasing geographical concentration of economic activities across regions. Various strands of the theory suggest that openness of region and removal of barriers in free mobility of factor will probably result in a higher level of concentration of industries and as a result greater specialization of the region. This might have repercussions in growing income disparities across regions. Such growth is also observable in Pakistan where unbalanced * uzma_tbsm@hotmail.com The work is the part of PhD thesis under process. Uzma Tabassum, Shaista Alam and Ambreen Fatima 188 growth strategy is pervaded because of resource constraints (Tabassum, 2016).Therefore investments and economic activities concentrate in some regions of the country that amplifies inequalities across regions, as the exogenous characteristics of regions, skill composition of the workforce and agglomeration economies attributed to productivity differences that in turn assumed to be reflected in income distribution. Given that the distributions of industries are not uniform across regions in an effort to specifically elicit the effect of agglomeration on spatial disparities it is important to know that how the concentration of industries affect the growth process of a particular region. What is the role of urbanization and localization economies in provoking spatial disparities? How these spatial disparities are connected with inequalities in the income distribution of individuals living in that region? The various regions would be different in sizes with respect to their physical and economic structure, depending on the agglomeration behaviour. The swiftness and assortment of industrial agglomeration in a region describe the economic conditions of that region. On the theoretical side, the concept of Agglomeration economies define the process of clustering of firms in close proximity to one another to exploit the benefits of being together within intension to minimize per-unit cost of production or achieved economies of scale (UN-HABITAT, 2011). The concept was first acknowledged by Weber (1909) while the detailed explanation of the sources of these economies was given by Marshall (1890). Marshall (1890) provided three bases through which economies of scale can be achieved. According to him agglomeration enables firms to economize on their unit cost by sharing knowledge regarding efficient production technologies and management strategies. Further Firms may agglomerate to share supply of intermediate inputs. The demand for intermediate input is greater when there are many firms located at the same place the average cost of providing such intermediate inputs to each firm will be low as high demand enable the producer of intermediate input to achieve scale economies. Lastly Sharing labour pool benefit firm when the variation in firm product demand is greater than the variation in industry level demand. The labour Fired from the firm that is facing low product demand can be hired by a firm that is facing high product demand. For doing so firm have to bear no cost for searching and training labour thus declining unit cost for two reasons: one by firing unwanted labours by a firm facing low product demand. And secondly by reducing search and training cost by high product demand firm. Duranton & Puga (2004) also describe all these as a process of learning, sharing and matching. In a nutshell productivity and income growth of region positively affect by agglomeration but these positive effects may come from urbanization economies or localization economies. In localization economies firms that belong to a single industry cluster together to exploit agglomeration benefits thus they specialize in that particular industry production. While in urbanization economies firms that belong to different industries cluster together to exploit agglomeration benefits that results diversified Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 189 production. In viewpoint of efficiency spatial inequalities may have positive and negative effects. Inequalities resulting from regional specialization based on localization economies may be beneficial as it increases productivity but if these are caused by external economies or urbanization economies then they may not be optimal (Kim, 2008). Spatial inequalities caused by unrestrained population concentration in large primate cities may inflict many social ills in society. Thus spatial inequalities possibly are undesirable for society if it provoke social inequalities across the region. This research seeks to shed some light on the process of agglomeration and its effect on the income of region that experiencing these agglomerations. The literature focusing spatial disparities between urban regions based on agglomeration are rather scarce in the context of Pakistan. A recent set of empirical literature both nationally and internationally majorly investigated the determinants of income inequalities. These studies point out socio-economic, political and open economy variables to contribute in influencing inequalities. Disaggregating these determinants includes individual and region specific variables such as education, health, gender etc. for the former and industrial clustering, policies and public service provision, agglomeration patterns etc. for the later (Sial et al, 2018., Fambon, 2017., Naseer & Ahmed, 2016., Burki & Khan, 2012., Davtyan; 2014., Jamal, 2006). Extensive research is already available domestically on the aspects explaining prevalence of regional inequalities but it largely focuses on socio-economic determinants of inequalities while the impact of location specific industrial clustering on embellishing inequalities has mainly overlooked. This research will evaluate the contribution of spatial concentration in inflating inequalities across various regions by exploring the likelihood of prosperous region become more prosperous, as heterogeneity across industrial expansion and in labour demand frames inequalities among individuals. The empirical assessment is based on survey data conducted by the Pakistan Bureau of Statistics (PBS). In the study large cities and other urban areas have taken as spatial units. Because some urban regions were not included in past years survey so they left intentionally from recent sets of survey to avoid inconsistency in data overtime. For estimating the agglomeration impact, this study used the Labour Force Survey (LFS) for the period of 2017-18. Further to identify the urban regions that are experiencing persistent inter-temporal spatial concentration in the case of Pakistan, the Herfindahl indices are computed for the years 2010 and 2018. The regions that have above average level of agglomeration in a most recent period and positive growth from the base year marked as treated regions. The study used non-parametric Propensity Score Matching (PSM) techniques to overcome the problem of selection bias arise due to the selection of regions who received treatment-spatial agglomeration, particularly in non- experimental designs. This research is a value addition to existing literature of Pakistan as it is the first study that employed propensity score matching to study agglomeration linkages with spatial inequalities arises from the unbalanced growth of regions. Further, it also contributes in empirical literature by estimating Herfindahl index for spatial Uzma Tabassum, Shaista Alam and Ambreen Fatima 190 concentration at regional level over time which is used as a base for identification of treated and untreated urban regions. The rest of the paper is proceed by highlighting empirical literature done nationally and internationally, followed by the review of theoretical model, section four sheds light on the estimation technique and data employed, section five discusses the results and final section wrap up finding and expected policy implications. 2. Review of Literature Spatial concentration of industries is placed among the highest feature of economic activity in both developed and seeking to be developed nations. A vast existing empirical literature suggest that industries are unevenly distributed across spatial units because of difference in transportation cost, immediacy to market, presence of skilled and cheap worked force and natural locational advantage etc. Thus industries agglomerate in a few regions more than others that are comparatively less attractive (Ellison & Glaser; 1997, Maurel & Sedilot, 1999 and Abid & Mushtaq, 2011). Geographic concentration and regional specialization of industries were seen by many researchers as to be two sides of the same coin. Specialization of region and concentration of industries in specific location were defined using the analogous production structures as base so as to reflect the same reality which is evident from various studies in the existing literature of regional economics (Aiginger, 1999). It was since 19th and 20th century that researchers were modeling regional specialization and geographical concentration rooting from trade theory and location theory respectively and conducting empirical studies on them. The literature about regional concentration probably commence with Krugman (1991). Krugman (1991) was the one which estimated Gini coefficient to capture concentration in various regions by comparing the regional market shares for a single industry with manufacturing’s employment structure. Setting a traditionally localized industry (automotive industry) as a benchmark he found that large numbers of industries were localized though most of these were not cutting-edge and that industries related to high technology sectors such as textile, were highly clustered. However estimated concentration of such industries was not solely depends on technological spillover and what he found was biased in terms of the data he got. Aiginger & Rossi-Hansberg (2006) deliberated regional specialization and geographical concentration as the two sides of the coin and theses two could be calculated from the same matrix showing industries in rows and geographical regions in columns. For the estimation of specialization columns are to be measured while for that of concentration rows are to be considered. Aiginger & Davies (2004) using a mathematical model pointed out that if inequalities across columns vary so does it varies across rows. The pragmatic evidence on the relationship between agglomeration economies and regional inequalities in developed nations may have immense importance. Geppert et al Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 191 (2008) explored this connection in the context of West Germany using data from 1980 to 2000 and suggest that industrial agglomeration can be a considerable contributor to regional income differentiation. They find that the number of regions that are already rich with large agglomeration of industries has succeeded in further distancing themselves from the others. Thus, concentration of industries has sizeable contribution to spatial income differentiation. Larger agglomerated regions such as Munich, Hamburg and Frankfurt are leading among the richer regions that are becoming richer and showing a sign of urbanization economies. Though, the analysis does not ascertain the particular nature of causal mechanism that could be underlying this relationship. Kim (2008) surveyed the theoretical and empirical literature about spatial and regional inequalities. The survey of literature underscores that endowment of resources, proximity to port and increasing returns attributable to concentration of human interaction increases spatial inequalities. All these are beneficial for region to enhance their productivity therefore these regions grow relatively with faster rate of growth. However congestion cost and excessive urban concentration may be harmful. Thus, in equilibrium there is an optimal level of spatial disparities to combat with inefficient level of agglomeration. Midelfart (2004) examined the role played by agglomeration of economic activities, as measured by density of employment, and composition of labour force skills in explaining regional income difference in the Norwegian economy especially focusing on Norway's manufacturing sector. Using data for the sub-regional NUTS 4 spatial units of Norway for two time period 1989 &1999 the study concluded that high income seems to be an upshot of agglomerated economic activities in contrast to differences in skill composition of labor force. Two different measures of income are considered as a proxy to measure Income disparities, per hour GDP of employees and mean hourly earnings. Considering GDP per hour the regional inequalities are found significant and much more distinct relative to earnings per hour. Further disparities shows widening trend while considering GDP per hour in contrast to the result of average hourly earnings which shows reduction in regional disparities for same time span. Regional specialization and the geographic concentration patterns of manufacturing industries as measured by dissimilarity index in Bulgaria, Estonia, Hungary, Romania and Slovenia had changed during 1990-1999 (Traistaru et al., 2002). They found that in Bulgaria and Romania regional specialization had increased with no significant changes in Estonia, Hungary and Slovenia. Further they found that large economies of scale, high technology and high wages were associated with highly concentrated industries therefore agglomeration of these industries foster the growth of these region in which they clustered. On the other hand industries with low technology and low wages were more likely to be dispersed spatially. Glaeser et al (1992) and Henderson et al (1995) have made decisive contributions to the empirical literature focus on urban area-city. They estimate the effect of city Uzma Tabassum, Shaista Alam and Ambreen Fatima 192 employment growth on specialization and diversification index. The specialization index is measured as a proportion of industry employment to the total employment of city. The diversity index computed by Glaeser et al (1992) is calculate by taking the employment share of five largest industries in the city while Henderson et al (1995) measured diversity index as employment share of all industry in the city aggregate employment. Glaeser et al (1992) also include the inverse of plant average size as a competition index. The result of Gleaeser's model shows that city growth is positively associated with diversity index and negatively by specialization and competition index. Along with specialization and diversification index Henderson et al (1995) also used city employment in all other industries as an additional urbanization variable and found positive impact of both specialization and overall employment on the growth of employment. Focusing on the role of agglomeration of industries in specific regions as pedestal of productivity effects and developmental impulses Fan & Scott (2003) investigates the interconnections between economic growth and geography in the developing countries of East Asia. Findings indicate that agglomeration and productivity have strong association in Chinese industries, especially electronics, computer and garments industries since the public policies in china follows economic liberalization. The scrutiny of empirical literature done in Pakistan indicates that, the literature on agglomeration is usually focuses on the extent with which industries agglomerate in a particular region and the effect of agglomeration on the efficiency of similar and dissimilar firms in a local system. Burki & Khan (2011) measure agglomeration of industries using data taken from Census of Manufacturing Industries (CMI). As per results the agglomeration is pervasive in Pakistani industries, the 35.3% industries are profoundly agglomerated. While, 38.2% industries are somewhat less agglomerated and 26.5% Pakistani industries are dispersed. District population, increase in road density, skilled worked force are found significant contributor in promoting spatial concentration of industries. Chaudhry & Haroon (2015) evaluate the economic impact of new firm in the market using Punjab’s Directory of Industries for the year 2002, 2006 and 2010. The finding indicates that new firm entry has noteworthy effect on socio- economic outcome. Some of these outcomes are immediate while some outcomes become visible with lag. They recommended that policy makers should be familiar with nature of firm as impacts of new firm differ with the type of firms that affirms the need of specific industrial policy. Azhar & Adil (2019) explore the effect of agglomeration on social inclusion & firm efficiency for Punjab districts. Social inclusion is made of mortality rate, antenatal care, improved water & sanitation which then compressed into a single variable using factor component analysis. The results demonstrate that agglomeration is positively associated with firm efficiency and also it has significant association with social-inclusion. The districts with higher agglomeration also experience higher social inclusion. In developed nations the empirical evidence on spatial inequalities and agglomeration may Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 193 contain immense importance, the pattern of agglomeration and its impact on spatial inequalities in developing nations like Pakistan is crucial to study. But despite the obvious policy concerns, there has been no empirical work found that explores the role of agglomeration in defining the income level of that region where these agglomerations take place. This study tries to fill the gap that is evident in the literature. 3. The Review of Selected Theoretical Model In the theoretical literature the most cited work is the "New Economic Geography (NEG)" model put forward by Krugman (1991). Researchers have based their empirical findings on the theoretical arguments placed by New Economic Geography (NEG) model. According to NEG model spatial inequality is the net effect of two contrasting forces, one is centripetal force and the other is centrifugal force. Centripetal forces determine agglomeration or concentration of economic activities that is caused by imperfect competition, increasing returns to scale and mobility of factor. On the other side centrifugal forces such as congestion, high transportation cost, restricted mobility of factor encourages dispersion of economic activities. When the cost of supplying goods at far off places would is high then firm would be dispersed around markets that are immobile. In contrast when transportation cost is low immobile market can easily get access to distinct places that encourage firms to agglomerate at particular place gaining advantages of economies of scale and externalities. The region that experiencing increasing agglomeration also come up with higher economic activities which results in spatial inequalities showing the positive inter relation (Curve AA). At certain level higher agglomeration will results in increase congestion and negative externalities offsetting the positive gain of agglomeration. Consequently a new relationship established between agglomeration and spatial inequality that is opposite to the previous one (Curve RR). Figure-1: Nexus between income growth and regional inequalities Source: Gardiner et al. 2010. Spatial agglomeration of economic activities positively contributes in the economic growth of nation through localized positive spillovers. Agglomeration increase productivities and real output via labour pooling and knowledge spill over as shown Uzma Tabassum, Shaista Alam and Ambreen Fatima 194 from the curve SS. The equilibrium level of agglomeration, inequalities and growth comes from the intersection of two curves RR and AA. There is a tradeoff between national growth and spatial inequalities produce from agglomeration. The effective intervention of government that break up agglomeration shift RR curve to RR1 and reduces spatial inequality by the transfer of resources to poorer regions but also erase the spillover effects that results in decreasing national growth. The income of region in which economic activities clustered grow relatively at a faster rate therefore a similar trade-off became visible within regions. Therefore on the basis of NEG model it is expected that the regions that are experiencing agglomeration may undergo high income level relative to other regions. 4. Estimation Techniques First the Herfindahl indices will be computed to measure the inter-temporal spatial concentration of sector. Using these indices values study will identifies regions that experience higher than average initial level of concentration and positive growth of spatial concentration. 4.1 Measurement of Agglomeration Herfindahl index of spatial concentration is calculated as the sum of the regional shares in national employment in the particular industry both by industry and regions. Symbolically, 𝐒𝐂𝐑𝐣 𝐜 = βˆ‘π’π‚π‘π’π£ 𝐜𝟐 Where: π‘ͺπ‘Ήπ’Šπ’‹ 𝒄 = The concentration ratio: The weight of employment in the sector i from the city j in the overall employment of the sector π‘ͺπ‘Ήπ’Šπ’‹ 𝒄 = π‘¬π’Šπ’‹ π‘¬π’Š = π‘¬π’Šπ’‹ βˆ‘ π‘¬π’Šπ’‹π’Š The above index ranges from 0 to 1. The more a region’s Herfindahl index is closer to 0 the less spatially concentrated the region is, and the more it is away from 0 or closer to 1 the more it is said to be spatially concentrated. 4.2 Measuring the linkage between spatial concentration and income To estimate causal treatment effects of agglomeration in exaggerating income inequalities across various regions Propensity-Score Matching (PSM) technique is used. The PSM technique was initially presented in the pioneering work of Rosenbaum and Robin (1983a, 1983b, 1985). Propensity score matching was the first time applied in the field of macroeconomics, to find the effects of currency unions on trade flow. Afterward, many economists used this technique to evaluate the causal effects in non experimental research. Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 195 Propensity Score Matching (PSM) In an experimental research randomization is assumed as there is an equal distribution of group members' characteristics for both treated and control group via random sampling and random assignments. Thus randomization limits potential selection bias therefore the only interest is to estimate cause of an effect. The observational studies lack randomization as conclusions are drawn based on data that is collected without random assignment therefore the results become inconsistent and biased. In such condition Propensity Score Matching provides the most accurate estimates of the impact. The study employed Propensity Score Matching (PSM) to analyze the effect of agglomeration of industries on the regional income in Pakistan. For the analysis purpose regions that are experiencing increasing growth of agglomeration are treated as β€œTreatment Group” while the regions where agglomeration is not persistent are considered as β€œControl group or Counterfactuals.” As we have N units (subjects or Regions). C denotes a treatment condition and Y as a potential outcome. For each unit i (i=1.......41), Ci =1 indicates that the subject unit is belongs to the treatment group with subsequent potential outcome O1i and Ci =0 shows that the subject i is in counterfactual group with potential outcome O0i. To analyze causal effects in counterfactual framework the core interest is to find treatment effect for each unit i defined as βˆ†i= O1i - O0i 1 For each unit i, Y1i and Y0i are not discernible at the same time because the same unit cannot simultaneously be in both the treated and untreated groups. Alternatively, Average Treatment Effects (ATE) can be estimated ATE = E (O1 – O0) = E (O1) –E (O0) 2 The second frequently estimated effects are Average Treatment on Treated (ATT). It is the average effect of treatment on the units which are treated defined as ATT = (O1 - O0 | C=1) = E (O1/C = 1)] - E (O0/C = 1) 3 The ATE is the difference between average outcomes of treated group and the average outcome of untreated group, no one of which is fully observe. The estimation of O1 requires average over the outcome of treatment for treated group (O1|C = 1) which are observed as well as average over the outcome of treatment for the untreated group (O1|C = 0), the unobserved counterfactual. In a similar way to estimate O0 average over the outcome of non-treatment for untreated group (O0|C = 0) and average over the outcome of non-treatment for treated group (O0|C = 1) required, from which (O0|C = 1) is also an unobserved counterfactual. ATE= E (O1 - O0) = E (O1) - E (O0) 4 Uzma Tabassum, Shaista Alam and Ambreen Fatima 196 = [Pr (C =1) E (O1|C = 1)] +[ Pr (C =0) E(O1|C = 0)] - [Pr (C =1) E(O0|C = 1)] + [ Pr (C =0) E(O0|C = 0)] 5 = Pr(C=1) [E (O1|C = 1) - (E (O0|C = 1)] + Pr (C = 0) [E (O1|C = 0) - E (O0|C = 0)] 6 Here Pr (C=1) is the probability that unit is treated and Pr (C = 0) is the probability that unit is not treated. The first part of the equation-6 is just the proportion of units receiving the treatment multiply by the Average Treatment Effect on treatment. It is very likely in the absence of randomization that subjects' background characteristics in the two groups are significantly different which may also influence potential outcome so the present study cannot directly assess the effect of treatment without controlling for other influential factors. The unbalanced distribution influential factor between two groups may create selection which creates biased ATE. Hence, to solve the selection bias problem and evaluate the treatment effect matching adopts two postulations. Conditional Independence Assumption (CIA) Conditional on influential factors X, C and (O1 & O0) are independent. X is the vector of observed cofounders. To estimate ATT the Conditional Independence Assumption (CIA) states O0βŠ₯ C | X or O0βŠ₯ C | Pr(X) 7 Assumption states that there is no association among treatment and the potential outcome that arise without treatment, conditional on observable characteristics or the propensity score (Rosenbaum and Rubin; 1983). Further for estimating ATE the CIA states that conditional on observable characteristic or propensity score, treatment is independent of both treated and untreated outcomes because for the estimation of ATE treated units are required to serve as a counterfactuals for untreated units. O1, O0 C | X or O1, O0 C | Pr(X) 8 Common Support Assumption (CSA) A second important assumption besides independence required in matching method is the common support or overlap condition. It precludes the occurrence of perfect predictability of C conditional on X. It ensures that units with the similar values of X possess a positive probability of being both in treated and untreated (Heckman et al; 1999). Furthermore, the common support assumption makes sure that these propensity scores must fall between 0 and 1. 0 chi2 Mean Bias Med Bias Unmatched 0.215 3143.96 0.000 18.8 9.2 Matched 0.003 43.05 0.555 1.6 1.2 Source: Author estimation using LFS-2017-18 The graph indicates how individual variables balance after matching. The x-axis displays the standardized bias, which is the percentage difference of the sample means in the treated and non-treated as a percentage of the square root of the average of the sample variances in the treated and non-treated groups (Rosenbaum and Rubin; 1985) Figure-6 Standardized bias across covariates Source: Author estimation using LFS-2017-18 It can be visualized from figure that the unmatched sample exhibit large imbalances with standardized bias being present across many of covariates but once the matching is done the standardized differences diminished significantly. Further the same results can also be seen through the histogram. It can be observed that the standardized -100 -50 0 50 Standardized % bias across covariates Unmatched Matched Uzma Tabassum, Shaista Alam and Ambreen Fatima 208 percentage bias has reduced significantly after the matching. Comparing the upper panel which shows the percentage bias in unmatched (non-treated group) with the second panel reveals significant reduction in percentage bias after the matching. Figure-7 Histogram of Standardized % Bias Source: Author estimation using LFS-2017-18 6. Conclusion and Policy Implications The purpose of this research is to explore the variation of spatial concentration /agglomeration across urban districts of Pakistan and analyses the effect of this concentration on the prevalence of income inequalities. The basic view is that spatial concentration amplifies inequalities across regions, because of the differences in productivity. The empirical literatures on the spatial disparities between urban regions based on agglomeration are rather scarce. This study attempts to start bridging this gap by using survey data conducted by Pakistan Bureau of Statistics (PBS). The empirical results and the evidences provided clearly indicate that distribution of income is significantly affected by spatial concentration supports the viewpoint of New Economic Geography (NEG) model. As per New Economic Geography (NEG) model the economic activities will be higher in the region of increased agglomeration which results in spatial inequalities showing the positive inter relation. To reveal results this study used two steps methodology. First the Herfindahl indices have computed, using Labour Force Survey (LFS) for the year 2010 to 2018, to measure the inter-temporal spatial concentration of industries. Using the values of indices urban regions are identified that have positive and above average level of concentration. Four large cities, Gujranwala, Sialkot, Lahore, Multan and four other urban areas Rawalpindi, Faisalabad, Gujranwala 0 .0 5 .1 .1 5 D en si ty -80 -60 -40 -20 0 20 40 60 80 Standardized % bias across covariates Unmatched 0 .0 5 .1 .1 5 D en si ty -80 -60 -40 -20 0 20 40 60 80 Standardized % bias across covariates Matched Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 209 and DG Khan fulfilled both criteria. After identifying the agglomerated regions the second step was to estimate causal treatment effects of agglomeration on regions income that responsible to exaggerate inequalities across agglomerated and non agglomerated regions. Propensity-Score Matching (PSM) was applied for this purpose. For the analysis eight regions that are experiencing increasing growth of agglomeration was considered as "Treated Group" while the regions where agglomeration appeared non persistent was considered as β€œControl Group.” The statistically significant coefficient of average treatment effect on treated (ATT) with positive sign indicated that the one percent increase in spatial concentration increase income level of the households that reside in these urban districts of Pakistan by 346854. Further no significant statistical differences have been found after matching and most of the covariates found balanced as per ps-test results. After performing PSM absolute percentage bias also decrease significantly. The absolute mean bias has reduced from 18.8 to 1.6 percent and absolute median bias reduced from 9.2 to 1.2. In a nut shell, it can be concluded that agglomeration is important determinant of regional variations in income. The reported findings have significant implications for regional policy. It means that obstructing the agglomeration of industries with the intention of encouraging equality may hurts productivity. In order to lessen spatial income inequalities focusing on the development of regional agglomeration of activities, rather than dispersion of resources to every single region, seems to be the correct way to carry on. The effective policies to promote agglomeration are needed for growing urban population that has the potential to turn out to be an effective part of economic growth. Pakistan fronting many challenges; Key to this is the provision of employment to growing labour force and create vibrant industrial sector that can provide variety of jobs requiring skills of different intensity. In the absences of such opportunities the standard of living for those that are affected abated which may lead social unrest. There is also need to improve infrastructure in order to promote agglomeration and greater inclusion and equity. References Abid A. Burki& Mushtaq A. Khan (2011). "Agglomeration Economies and their Effects on Technical Inefficiency of Manufacturing Firms: Evidence from Pakistan", International Growth Centre Working paper Aiginger, K. &Rossi-Hansberg, E. (2006). β€œSpecialization and Concentration: a note on theory and evidence” Empirica 33, pp. 255–266. Aiginger, K. (1999) β€œDo Industrial Structures Converge? A survey on the empirical literature on specialization and concentration of industries”, WIFO-Working papers. Aiginger, K., Davies, D. (2004). β€œIndustrial Specialization and Geographical Concentration; Two sides of the same coin?” Journal of Applied Economics 7(2), pp. 231-248. Uzma Tabassum, Shaista Alam and Ambreen Fatima 210 Austin P.C. (2011) "A tutorial and case study in propensity score analysis: An application to estimating the effect of in-hospital smoking cessation counseling on mortality". Multivariate Behavioral Research, 46 pp. 119–151. Azhar, A. & Adil, S. (2016). " Effect of Agglomeration on Socio-Economic Outcomes: A District Level Panel study of Punjab" The Pakistan Development Review Papers and Proceedings: The 32nd Conference of the Pakistan Society of Development Economists, December 13 - 15, 2016, pp. 155-170 Burki, A. A & Khan, A. M. (2012). "Agglomeration Economies and their Effects on Technical Inefficiency of Manufacturing Firms: Evidence from Pakistan", International Growth Centre Working paper Chaudhry, A., & Haroon, M. (2015). "The Economic Impact of New Firms in Punjab". The LahoreJournal of Economics 20, pp. 143–182 Cochran, W. G., & Rubin D. B. (1973). β€œControlling Bias in Observational Studies: A Review,” Sankhya, The Indian Journal of Statistics, 35(4), pp.417–446. Cochrane, W., and S. Chambers (1965). β€œThe Planning of Observational Studies of Human Populations,” Journal of the Royal Statistical Society, Series A, 128, pp. 234–266. Daytyan, K. (2016). "Interrelation among Economic Growth, Income Inequality, and Fiscal Performance: Evidence from Anglo-Saxon Countries," Review of Public Economics, vol 217(2), pp 37-66. Dehejia, R. H., and S. Wahba (2002), β€œPropensity Score Matching Methods for Nonexperimental Causal Studies,” Review of Economics and Statistics, 84, pp. 151–161. Duranton, G., Puga, D. (2004). β€œMicro-foundations of urban agglomeration economies.” In: Henderson, J.V., Thisse, J.-F. (Eds.), Handbook of Urban and Regional Economics, Vol. 4, North-Holland, Amsterdam, pp. 2063-117. Ellison, G., E. Glaeser (1997). "Geographic Concentration in U.S. Manufacturing Industries: A Dartboard Approach", Journal of Political Economy, 105, pp. 889– 927. Fambon, S. (2017) "The Determinants of Inequality and Income Gap between Urban and Rural Areas in Cameroon: Evidence from the ECAM3 Household Survey". Advances in Economics and Business, 5(7), pp.394 - 410. Fan, C., and A. Scott. (2003). Industrial Agglomeration and Development: A Survey of Spatial Economic Issues in East Asia and a Statistical Analysis of Chinese Regions. Economic Geography 79 (3), pp. 295–319. Flury, B. K., Riedwyl, H. (1986) "Standard distance in univariate and multivariate analysis". The American Statistician. 40 pp. 249–251. Gardiner, B., Martin, R., & Tyler, P. (2010). "Does spatial agglomeration increase national growth? Some evidence from Europe". Journal of EconomicGeography, 11(6), pp. 979-1006 Geppert, K., Marti, G & Axel, W. (2008). "Economic Growth of Agglomerations and Geographic Concentration of Industries: Evidence for West Germany", Regional Studies, 42(3), pp. 413-421. Glaeser, E., H. Kallal, J. Sheinkman, and A. Schleifer (1992), β€œGrowth in Cities”, Journal of Political Economy, 100, pp. 1126-1152. https://ideas.repec.org/a/hpe/journl/y2016v217i2p37-66.html https://ideas.repec.org/a/hpe/journl/y2016v217i2p37-66.html https://ideas.repec.org/s/hpe/journl.html https://ideas.repec.org/s/hpe/journl.html Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 211 Heckman, J., Ichimura, H. & Todd, P. (1997), "Matching as an Econometric Evaluation Estimator: Evidence from Evaluating a Job Training Program", Review of Economic Studies, 64(4), pp. 605-654. Heckman, J., Ichimura, H., Smith, J. & Todd, P. (1998), "Characterization of Selection Bias Using Experimental Data", Economelrica, 66(5), pp. 1017–98. Heckman, J., Robert, L & Smith, J. (1999)."The economics and econometrics of active labor market programs". In Handbook of labor economics, 3(1), pp. 1865–209. Henderson, V., A. Kuncoro, and M. Turner (1995), β€œIndustrial Development in Cities”, Journal of Political Economy, 103, pp.1067-1152. Imbens, G. (2004): β€œNonparametric Estimation of Average Treatment Effects under Exogeneity: A Review,” The Review of Economics and Statistics, 86(1), pp. 4–29. Jamal, H (2006). "Does inequality matter for poverty reduction? Evidence from Pakistan's poverty trends" The Pakistan Development Review 45(3), pp. 439-459 Khan, S., and E. Tamer (2010), β€œIrregular Identification, Support Conditions and Inverse Weight Estimation,” Econometrica, 78 (6), pp. 2021-2042. Kim, S. (2008) "Spatial Inequality and Economic Development: Theories, Facts, and Policies". Commission on Growth and Development, working paper No.16 Krugman, P. (1991) "Geography and Trade" Cambridge: M.I.T. Press, Marshall, A. (1890) β€œPrinciples of Economics” London. Maurel, F., B. Sedillot (1999) "A Measure of the Geographic Concentration in French Manufacturing Industries", Regional Science and Urban Economics, 29, pp. 575– 604. Midelfart, K. H. (2004) "Does agglomeration explain regional income inequalities?" Norwegian School of Economics and Business Administration and CEPR Naseer, S. & Ahmed, A. M. (2016) "Determinants of Income Inequality among the Earners in Pakistan" S3H Working Paper Series, National University of Sciences and Technology (NUST). Normand S.L. T., Landrum M.B., Guadagnoli E., Ayanian J.Z., Ryan T.J., Cleary P.D., McNeil B.J.(2001) "Validating recommendations for coronary angiography following an acute myocardial infarction in the elderly: A matched analysis using propensity scores". Journal of Clinical Epidemiology. pp. 387–398. Rosenbaum, P. R., &. Rubin, D. B. (1983a) "The central role of the propensity scores in observational studies for causal effects." Biometrika., 70, pp. 41–55 Rosenbaum, P. R., &. Rubin, D. B. (1983b). "Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome." Journal of the Royal Statistical Society, Series B. 45, pp.212–218. Rosenbaum, P. R., &. Rubin, D. B. (1985) "Constructing a control group using multivariate matched sampling methods that incorporate the propensity score." The American Statistician 39(1), pp. 33-38 Sial, M.S., Zheng, C., Khuong, N.V., Khan, T., Usman, M. (2018), "Does firm performance influence corporate social responsibility reporting of chinese listed companies?" Sustainability, 10(7), pp. 1-12. Tabassum, U. (2016)"Regional Specialization Trade and economic size of cities" Thesis " University of Karachi. Uzma Tabassum, Shaista Alam and Ambreen Fatima 212 Weber A, (1909) Über den Standort der Industrien (Mohr, TΓΌbingen). Translated by C. J. Friedrich, 1929, as Alfred Weber’s Theory of the Location of Industries (University of Chicago Press, Chicago) Journal of Applied Economics and Business Studies, Volume. 4, Issue 4 (2020) 187-214 https://doi.org/10.34260/jaebs.449 213 Annexure Table A1: Estimates of Probit Model (If Agglomeration=1) Standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 Source: Author's estimations based on LFS 2017-18 Variables Coefficient Average schooling 0.075*** (0.00517) Age of highest earner 0.009*** (0.00092) Education of highest earner -0.012*** (0.0044) Age of head of household 0.005*** (0.00093) Dependency Ratio 0.0184** (0.00802) Migrant household 0.394*** (0.0440) Technical/Vocational training 0.266*** (0.0410) Working age members in household 0.0224*** (0.00589) Gender of HOH 0.298*** (0.0445) Work hours 0.0099*** (0.00118) Female Participation Rate -0.282*** (0.0351) Constant -1.617*** (0.246) Occupational dummies Most of the dummies appears statistically significant Provincial dummies Observations 18,722 LR chi2(46) 3143.96*** Pseudo R2 0.215 Uzma Tabassum, Shaista Alam and Ambreen Fatima 214 Table A-2: Distribution of Estimated Propensity Score-Livelihood Percentiles Smallest 1% 0.007168 0.002032 5% 0.014106 0.0023592 10% 0.021964 0.0025063 Obs 18722 25% 0.106504 0.0025063 Sum of Wgt. 18722 50% 0.251298 Mean 0.25002 Largest Std. Dev. 0.167869 75% 0.36499 0.8400924 90% 0.466755 0.8439572 Variance 0.02818 95% 0.536866 0.8474239 Skewness 0.349559 99% 0.678517 0.8526713 Kurtosis 2.591341 95% 0.536866 0.8474239 Skewness 0.349559 99% 0.678517 0.8526713 Kurtosis 2.591341 Note: The region of common support is [.002032, .8526713] Source: Author's estimations based on LFS 2017-18