“I don't mind stealing bread from the mouths of decadents, but I can't feed on the powerless when my cup's already overfilled …”

~Chris Cornell, Hunger Strike (1991)

1 Introduction

Since 1971, the United Nations (UN) has actively monitored a number of Least Developed Countries (LDCs), or countries having structural frailties that make them especially susceptible to economic downturns, natural disasters, human conflicts, contagious diseases, and other existential threats.Footnote 1 As a set, LDCs represent the poorest of the world’s poor, with more than 75% of their people living in poverty and bearing all the deprivation and other cruelties that poverty inflicts. While countries worldwide face challenges, a combination of three factors formally defines LDCs: (i) per capita gross national income (GNI), composed of the gross national product adjusted for the value of investment needed to maintain a constant level of production, measuring prosperity; (ii) a human assessment index (HAI), composed of health and education indicators measuring human capital; and (iii) an economic vulnerability index (EVI), composed of exposure and shock indicators, measuring economic risk. The current list of LDCs has 46 countries—with a combined population of about 12% of the world’s 7.8 billion—33 of which are located in Sub-Saharan Africa.

This concentration of poverty is so entrenched that it is understood as common knowledge. Less understood, is the geography of the toll that Sub-Saharan poverty takes on the people caught up in it—and, even less, how this geography has shifted relative to the massive changes in settlement that have reshaped Africa over the past few decades. (Examples of recent progress on this front include Jean et al. 2016, Xie et al. 2016, and Osgood-Zimmerman et al. 2018). As shown in Table 1, which lists data from the last 20 years summarizing the world at large, the 46 LDCs, and the whole of Sub-Saharan Africa, the rate of urbanization across these groups registered annual averages of 2.15%, 4.08%, and 4.12%, respectively. A closer look is provided in Table 2, which ranks the top 20 countries in the world by their urbanization rates in 2000, 2005, 2010, 2015, and 2019. At the beginning of the timeframe, countries located in Sub-Saharan Africa accounted for 11 of the top 20—by the end, they accounted for 17 of the top 20. In all cases shown on the table, the annual rate of urbanization exceeds 4.00%, a pace that yields a doubling in under two decades. Though Africa is the least urbanized of the settled continents, it is experiencing the greatest urban growth of all—mostly in its Sub-Saharan countries, where 15.75% of the region’s 1.1 billion people now live in agglomerations of a million or more (World Bank and World Development Indicators 2020).

Table 1 Annual rate of urbanization, 2000–2019
Table 2 Top 20 countries by annual rate of urbanization, 2000–2019

The rapid transformation of Africa raises a fundamental question, which this paper begins to investigate: has urbanization there been accompanied by improvements in personal wellbeing? While economists, geographers, regional scientists, and others have good reason to think that that urbanization, a process having both spatial and temporal dimensions, makes people in the developing world better off, evidence remains sparse (Glaeser 2013; Young 2012; Glaeser and Ghani 2015; Bryan et al. 2020; OECD 2020). This paper addresses the gap by examining the relationship between urbanization and a key dimension of wellbeing across Sub-Saharan Africa: child growth failure, including (i) stunting; (ii) wasting; and (iii) underweight (Osgood-Zimmerman et al. 2018). Child growth failure is a fundamental indicator of wellbeing because, globally, about 45% of deaths among children under the age of five are associated with malnutrition and most of this death occurs in the LDCs of Africa and Asia (WHO 2021).

Like all projects focused on Sub-Saharan Africa, this one faces a tall barrier: namely, that quality data on urbanization and people there is extremely limited, a problem compounded by the fact that what data does exist comes from a variety of sources and was never meant to work together. A main contribution, therefore, is to reconcile an array of data—collected for different purposes, across different spatial scales, and over different timeframes—in a manner that enables some insight into the question set out above. Specifically, the paper combines remotely sensed data on urbanization, collected at continuous time intervals, with microdata on the health of people, collected at discrete time intervals, in a way that enables exploration of the relationship between the two. This reconciliation yields a rich database, covering all of Sub-Saharan Africa, that is used to estimate a series of econometric models, including spatial autoregressive specifications, that systematically connect the large-scale process of urbanization to the small-scale experience of child health outcomes. By estimating models at multiple spatial scales, the analysis is able to evaluate both the compatibility of the data involved and the stability of the relationships it uncovers. Overall, the results suggest that the wave of urbanization breaking across Sub-Saharan Africa is likely to the benefit of its people—but clearly indicate the need for further geographical research.

2 Theory and stylized facts

The first organizing idea of this work is that urbanization makes people better off, so much so that it matters to economic growth at the country level. This idea is converted into a stylized fact.Footnote 2 by formally testing it via a model of convergence in per capita gross domestic product (GDP) within a panel of countries from all around the world. The results are presented in Table 9, in the appendix; further discussion of the model is beyond the scope of this paper, but details are available from the authors upon request.Footnote 3 The model contains two variables on urbanization—the percent of total population living in agglomerations of greater than one million and urban population growth—both of which positively influence economic growth.Footnote 4 The second organizing idea of this work is that urbanization rearranges not only people but, also, the conditions impacting them—meaning that the deprivations of poverty move along with people and may be mediated, for better or for worse, by new locational circumstances. Moreover, urbanization follows the arrow of time so the urbanization happening throughout Sub-Saharan Africa is reconfiguring the population in a way that is more-or-less permanent, or at least not likely to reverse. This permanence—or irreversibility, regardless of the scale from which it is viewed (Hopkins 2001; Lai 2021)—brings further weight to the question of personal wellbeing that motivates this work.

Putting these two ideas together, the lens of time geography is helpful because it exposes what Hägerstrand (1989) called the bare skeleton that connects the large-scale outcomes of urbanization to the small-scale experiences of individuals.Footnote 5 In a nutshell, time geography is concerned with the movement of people through space under constraints tied to location that wax and wane with the passage of time (Hägerstrand 1970, 1989). It is common in neoclassical economics to assume that relocation decisions are rational and based on complete information; time geography holds that, these assumptions notwithstanding, the wellbeing of individuals is inextricably tied to place, plus the circumstances of the moment. What’s past is past, no matter its bearing on the future, and constraints on the human condition—so-called capability, coupling, and authority constraints—tighten and relax accordingly. This lens is useful for understanding the behavioral relationships of geographic systems across multiple spatial scales because it explicitly connects the spatial behavior of people to their environment (Golledge and Stimson 1997; Miller 2017). Not only that, the constraints-oriented approach of time geography squares neatly with Sen’s (1985, 1999) capability approach to development economics, which has been translated directly into standards for urban planning in developing countries throughout the world (Frediani and Hansen 2015).

Urbanization presents in time geography because of how it restructures capability constraints and coupling constraints: in terms of the former, people are better able to care for themselves and their dependents; in terms of the latter, people are brought into contact with organizations, including markets, medical facilities, social networks, and more, that may enhance their wellbeing.Footnote 6 This is explained by the fact that cities serve as both centers of production, offering increased opportunity, and centers of consumption, offering increased access to goods and services—meaning that moves to them hold the possibility of welfare gains in both dimensions. The idea of the consumer city was popularized in an eponymous paper by Glaeser et al. (2001) but was first used by Weber (1921) in a taxonomy that distinguished between consumer and producer cities by economic base (Batty 2018); decades ago, Harris and Ullman (1945) and Ullman (1962) explored similar themes from the perspective of urban geography. Even earlier, location theorists identified cities as being, above all else, market centers that serve as focal points for the trade of agricultural commodities and other products—among them rare and specialized services like medical care (von Thünen 1826; Christaller 1933; Lösch 1938). The present analysis works from the premise that cities are beneficial to people, even as it acknowledges that there is much to learn about cities in LDCs, in general, and those in Sub-Saharan Africa, in particular (Ashraf et al. 2016; Glaeser and Ghani 2015; Gollin et al. 2017; Glaeser 2014, 2020; Bryan et al. 2020). For example, sanitation is a critical public good without which urbanization can be dangerous (Glaeser 2013).

The discussion now turns to stylized facts, which are two: urbanization promotes economic growth; and, as Sub-Saharan Africa has urbanized over the past decades, child growth failure has moderated. As discussed, the first of these is established by helpful pair of hypothesis tests in a macroeconomic model of convergence that is contained in the appendix. The second of these is documented in Table 3, which reports on urbanization and child growth failure by country. By both measures—impervious surface area and light intensity—urbanization has increased regionwide (Cameroon being the exception) between the two years of observation, t– and t. At the same time, in most cases, the measures of child growth failure, expressed as z-scores, have also registered improvements, moving closer to the mean (zero) from their negative starting levels. Note here that, as footnoted in the table, time t– and time t differ by country, due to when the rounds of survey data on child growth failure were conducted; impervious surface area and light intensity are collected continuously from space and are keyed in time to the survey data. It is readily apparent from the table that, as urbanization has increased, child growth failure has moderated—but appearances are not evidence, so the question remains: has urbanization in Sub-Saharan Africa been accompanied by improvements in personal wellbeing?

Words convey the cruelties of poverty far better than z-scores. A stunted child has a height-for-age z-score below minus two standard deviations from the median of the reference population. Stunted children, who are too short for their age, often suffer from cognitive and physical impairment, leading to poor educational performance and productivity, plus higher risk of various forms of mortality and morbidity. A wasted child has a weight-for-height z-score that is below minus two standard deviations from the median of the reference population. While wasting is less prevalent than stunting, it is more severe in the sense that wasted children are more likely to die than stunted children (Black et al. 2008). Wasted children, who are too thin for their height, suffer acutely due to recent and drastic weight loss. Wasting results from a sudden reduction in food intake and/or frequent or prolonged illness in the absence of medical care. Last, an underweight child has a weight-for-age z-score that is below minus two standard deviations from the median of the reference population. Underweight children, who are too light for their age, may also be wasted and/or stunted (Dop 2016; Goudet et al. 2015). Worldwide, about 20.3% and 6.9% of children under the age of five years old were, respectively, stunted and wasted in 2019 (UNICEF, WHO, World Bank 2020). The upcoming sections on database development and econometric analysis provide a closer look.

3 Data and geospatial methodology

The data at the core of this analysis is of two different forms: (i) data on urbanization collected at continuous time intervals via satellites; and (ii) microdata on the health of children collected at discrete time intervals via USAID’s Demographic and Health Surveys (DHS) and harmonized by the Advancing Research on Nutrition and Agriculture (AReNA) project (IFPRI 2020). The following paragraphs describe the data and explain how it has been reconciled for the purpose at hand. The centerpiece of the discussion is a spatial sampling strategy that enables the DHS survey data, collected on different individuals across different timeframes, to be used in an econometric analysis to examine change over time.

Getting consistent statistical data on urbanization, measured via developed land, across multiple countries over long intervals anywhere in the world is difficult. The DHS data utilized here does contain urban/rural information, but the definition of an urban location is country-specific and differs across countries. For example: some countries define urban areas based on population; others define urban areas based on administrative boundaries; and still others define urban areas based on combination of the two (United Nations 2019). For that reason, this analysis turns to two remotely sensed databases, briefly introduced in Table 3, that cover continent-spanning areas longitudinally.

The first of these is a land cover measure of impermeable surface area, a key indicator of urban land use, obtained from the Global Artificial Impervious Areas (GAIA) 1985–2018 database created by Gong et al. (2020). The GAIA database is derived from Landsat imagery at a 30 m \(\times \) 30 m resolution and is capable of capturing even subtle changes over its 33-year-long timeframe. The accuracy of GAIA at the global scale is 90%, based on tests done for the years 1985, 1990, 1995, 2000, 2005, 2010, and 2015. The second of these is a luminosity measure of nightlights—a key indicator of human settlements (Doll et al. 2006; Li and Zhou 2017; Pinkovskiy 2017; Düben and Krause 2021) and, as well, economic activity (Henderson and Storygard (2012)—developed by Li et al. (2020). This harmonized global nighttime light database consists of luminosity data from the Defense Meteorological Satellite Program / Operational Linescan System, from 1992–2013, and the Visible Infrared Imaging Radiometer Suite, from 2012–2018. It has been radiometrically corrected to remove light from the aurora borealis, wildfires, and other noise and it enables consistent monitoring of nightlights at a 1 km \(\times \) 1 km resolution over its 27-year timeframe. Unlike the GAIA data, there is no measure of accuracy for the luminosity data, because Li et al. (2020) do not have a comprehensive means of ground-truthing it. The two remotely sensed databases complement each other and, together, form a portrait of urbanization across Sub-Saharan Africa.

The DHS data covers Sub-Saharan Africa over a 30-year timeframe, but it presents challenges when compiling it for a cross-sectional, longitudinal analysis. In particular, the data is collected over multiple rounds at different times in different countries—see time t– and time t, as identified at the bottom of Table 3—with each round surveying different households. So, while the DHS data is exceptionally rich, the challenge is to use it longitudinally because it is not a panel survey. Given the structure of the DHS, the goal here is to turn its information into an approximation of a panel survey through a spatial sampling strategy and combine it with the data on landcover and nighttime light. This process involves four steps, which are illustrated in Fig. 1 using the case of Uganda:

  • In the first step, illustrated in Fig. 1a, child-level z-scores were assigned to their geocoded households; points were offset randomly in the DHS (by up to two kilometers in urban areas and by up to five kilometers in rural areas) to preserve the privacy of respondents. The points in the map are colored by the rounds in which the DHS was conducted and, as noted Table 3, in the case of Uganda, time t– = 2001 and time t = 2016. The silhouettes, used to characterize child growth failure, follow those commonly used by UNICEF and are reprinted from Osgood-Zimmerman et al. (2018).

  • In the second step, illustrated in Fig. 1b, the three z-scores were interpolated from the locations where they were recorded into a 1 km \(\times \) 1 km grid using an empirical Bayesian kriging (EBK) approach developed by Krivoruchko (2012) and Gribov and Krivoruchko (2020). Kriging is a theoretically sound approach that yields errors as part of its output (Longley et al. 2015) and the resulting trend surfaces seamlessly cover all of Sub-Saharan Africa. In each country, the earliest survey round is used to construct the base-line trend surface, and the latest round is used to construct the end-line trend surface. As shown below, it does not matter to the analysis that the earliest and latest rounds are in different years across different countries, because first-year effects are controlled for in the econometric models. The EBK approach was selected for this step because it integrates Bayesian probabilities that account for the uncertainties in choosing parameters to build variograms. It automatically subsets data and builds localized semi-variograms through iterative simulations; in this way, it yields more accurate estimates of standard errors than other forms of kriging.Footnote 7

  • In the third step, illustrated in Fig. 1c, a 10 km \(\times \) 10 km hexagonal grid, composed of hexagons, denoted \(\eta \), was created and placed over the trend surfaces of the three measures of child growth failure. The 10-km size was selected because it best captures the DHS sample points and reconciles well with the 1 km \(\times \) 1 km grids of the urbanization and child growth failure variables. Zone-level mean values were then calculated for the hexagons and assigned to first-round households depending on their location within the grid. In this way, the trend surfaces, which cover the entirety of all surveyed countries, act as proxies that enable measurement between time t– and time t. A hexagonal grid was adopted instead of a square grid because hexagons have more locational precision (Birch et al. 2007). This is what enables change over time, measured as the difference between the average value within the hexagons at time t and the average value at time t–, to be regressed on child-level data from time t–.

  • In the fourth step, illustrated in Fig. 1d, the urbanization indicators were calculated by creating buffer zones around households and summing up the values within those buffer zones using Google Earth Engine. The buffer radius is two kilometers for households in urban clusters and five kilometers in rural clusters, corresponding to the GPS offsets used by DHS to protect the privacy of respondents.

Fig. 1
figure 1

Step-by-step geoprocessing

Fig. 2
figure 2

Spatial sampling in Ethiopia

Fig. 3
figure 3

Spatial sampling in Namibia

Fig. 4
figure 4

Spatial sampling in Nigeria

Fig. 5
figure 5

Spatial sampling in Zambia

The steps just outlined form a straightforward—and easily replicable—spatial sampling approach that enables data on different individuals collected across different timeframes to be used to approximate change over time. Although admittedly subject to a degree of imprecision, introduced by privacy-preserving offsets in geocoding and the use of the two survey rounds, it yields a spatially and temporally reconciled database (from sources never meant to work together) covering all of Sub-Saharan Africa.

Maps speak volumes to spatial scientists—economists, geographers, and regional scientists alike—so the analytical results of the process just described are summarized in Figs. 2, 3, 4, 5, which display Ethiopia, Namibia, Nigeria, and Zambia, respectively. In each figure: (a), (b), and (c) cover stunting at time t–, time t, and t minus t– (d), (e), and (f) cover wasting at time t–, time t, and t minus t–; and (g), (h), and (i) cover underweight at time t–, time t, and t minus t–. Note that the color ramps flip in the last map in each row—i.e.: the column containing (c), (f), and (i)— because those maps register change in, not levels of, child growth failure. In all cases, red represents worse, or worsening, conditions, as measured by the z-scores and changes in the z-scores. To be clear, the difference is the average value within a given hexagon, \(\eta \), at time t minus the average value at time t– (that is: \({\eta }_{t}\)\({\eta }_{t-}\)) and these values are used as dependent variables in the child-level regressions below; the independent variables include urbanization, as calculated in the last step of the geoprocessing procedure, plus micro data captured by the initial DHS round. The strength of this methodology lies in the fact that it is straightforward: it relies on tried and true tools of cartography, quantitative geography, and geographic information science (Fotheringham et al. 2000; Longley et al. 2015); is easily replicable; and computationally inexpensive. The result is a database that covers all of Sub-Saharan Africa and includes over 105,000 observations at the child level.

4 Econometric analysis

As stated in the introduction, the models estimated in this section systematically connect the large-scale process of urbanization to the small-scale experience of child health outcomes. They do this by progressively narrowing in terms of spatial scale and expanding in terms of complexity. At this stage, the analysis makes no claim to precision, but it does, broadly, establish the relationship it set out to explore.

The DHS is conducted in multiple rounds across Sub-Saharan Africa, meaning that the county- and region-level models can be estimated as unbalanced panels, leveraging all rounds of the survey data to expand the sample size. Additional control variables at these levels come from the World Bank’s World Development Indicators (WDI) database.Footnote 8 At the child level, the models account for the error introduced via the kriging process—maps of the EBK errors are available from the authors upon request—and spatial dependence is addressed via an autoregressive specification (see Anselin 1988). Across all scales, the dependent variables are the three forms of child growth failure: (i) stunting; (ii) wasting; and (iii) underweight. Each of these is initially measured in the form of z-scores, but they are operationalized somewhat differently at different spatial scales. Specifically, the dependent variables are expressed as prevalence in the country-level models; levels in the region-level models; and changes within 10 km \(\times \) 10 km hexagons in the child-level models. Descriptive statistics for all variables engaged in the analysis are provided in Table 4 and variable descriptions are provided in Tables 10 and 11, in the appendix.

Table 3 Urbanization and child health metrics in Sub-Saharan Africa
Table 4 Summary statistics

To start, the country- and region-level models take the general form of:

$${\varvec{\zeta}}= \left({\zeta }_{it}^{s}, { \zeta }_{it}^{w}, {\zeta }_{it}^{u}\right)= f \left({{\varvec{u}}}_{it},{{\varvec{x}}}_{it}, {\varvec{\phi}}\right),$$
(1)

where the zetas [\({\zeta }_{it}^{s}\), \({\zeta }_{it}^{w}\), \({\zeta }_{it}^{u}\)] \(\in{\varvec{\zeta}}\) represent values derived from the z-scores for stunting, wasting, and underweight in country/region i at time t; \({{\varvec{u}}}_{it}\) represents the two urbanization variables; \({{\varvec{x}}}_{it}\) represents a vector of explanatory variables measuring economic conditions; and \({\varvec{\phi}}\) represents a vector of locational (country- or region-level) fixed effects. In these models, child growth failure is a function of the urbanization hypothesized to impact it and basic economic controls obtained from the WDI database. Equation (1) is implemented as:

$${\varvec{\zeta}}={\beta }_{0}+{{{\varvec{\beta}}}_{1}\cdot {\varvec{u}}}_{it}+{{{\varvec{\beta}}}_{2}\cdot {\varvec{x}}}_{it}+{\varvec{\phi}}+{\varepsilon }_{it}.$$
(2)

Here, i indexes the country (n = 33) or region (n = 257), as the case may be; the vector of dependent variables, \({\varvec{\zeta}}\), is used to indicate that the specifications are identical across \({\zeta }_{it}^{s}\), \({\zeta }_{it}^{w}\), and \({\zeta }_{it}^{u}\); and the index t indicates that all variables are measured contemporaneously. The \(\beta \) s are estimable parameters expressed in single and vector (bold) form and \({\varepsilon }_{it}\) ~N (0, \({\sigma }^{2}\)) is the stochastic error term. This basic set-up extends to the individual level:

$$\Delta {{\varvec{\zeta}}}_{\eta }={\beta }_{0}+{{{\varvec{\beta}}}_{1}\cdot {\varvec{u}}}_{it-}+{{{\varvec{\beta}}}_{2}\cdot {\varvec{x}}}_{it-}+{{\varvec{\beta}}}_{3} \cdot {{\varvec{\kappa}}}_{i}+{{\varvec{\beta}}}_{4} \cdot {{\varvec{\tau}}}_{it-}+{\varvec{\phi}}+{\varepsilon }_{i}.$$
(3)

In Eq. (3), \(\Delta {{\varvec{\zeta}}}_{\eta }\) is change in child growth failure, as captured by differencing mean z-scores in hexagons \(\eta \) at time t and time t–, where the years, t and t–, are as recorded in Table 3; \(\eta \) indexes unique hexagons (n = 4,626) in the sampling net and i (n = 105,467) indexes individual children. (Although a 10 km \(\times \) 10 km sampling net may seem large, it is not given that the net covers all of Sub-Saharan Africa: on average, each hexagon captures 21 children.) At this level, the vector \({{\varvec{x}}}_{it-}\) contains household conditions related to women’s education and standing, plus household assets. As additional controls, given the exploratory nature of the analysis, the equation contains: \({{\varvec{\kappa}}}_{i}\), a location-specific measurement error from the EBK procedure at both the beginning and end of the time period, and, \({{\varvec{\tau}}}_{it-}\), a set of time controls that captures the base year and number of years elapsed between t and t–.

Finally, spatial autoregressive specifications of (3) are implemented by introducing a spatial lag of the dependent variables. The spatial lags were created using an n \(\times \) n (105,467 \(\times \) 105,467) row-standardized spatial weights matrix (\({W}_{ij}, \forall i \ne j\)) where neighboring observations i and j are defined as children living in households located within 15 km of each other. The lag is then calculated by multiplying the three dependent variables by this weights matrix. The spatial lag complicates estimation because it is endogenous: surrounding values depend on each other, so the model can no longer be properly estimated via ordinary least squares, or OLS. Among the many possible interventions available to correct for the problem is Kelejian and Prucha’s (1998) spatial two-stage least squares (S2SLS) strategy, which provides a solution that yields unbiased and efficient estimates, even if spatial error dependence persists (Das et al. 2003). This is a ~ mild intervention that relies on a set of instrumental variables composed of all explanatory variables plus spatial lags of those same variables. It is a least squares estimator plenty appropriate for this stage of an analysis, when the focus is on establishing broad relationships. The resulting model is as follows:

$$\Delta {{\varvec{\zeta}}}_{\eta }={\beta }_{0}+\lambda \cdot {W}_{ij}\cdot\Delta {{\varvec{\zeta}}}_{\eta }+{{{\varvec{\beta}}}_{1}\cdot {\varvec{u}}}_{it-}+{{{\varvec{\beta}}}_{2}\cdot {\varvec{x}}}_{it-}+{{\varvec{\beta}}}_{3}\cdot {{\varvec{\kappa}}}_{i}+{{\varvec{\beta}}}_{4}\cdot{\varvec{\tau}}+{\varvec{\phi}}+{\varepsilon }_{i},$$
(4)

where \({W}_{ij}\cdot\Delta {{\varvec{\zeta}}}_{\eta }\) is the endogenous spatial lag and \(\lambda \) is the autoregressive parameter measuring spatial dependence.

The OLS estimation results are reported in Tables 5, 6, 7 for the country-, region-, and child-level models, respectively, and the (child-level) S2SLS estimates are reported in Table 8; all fixed effects have been suppressed in order to conserve space, but are available upon request. To be clear, at in child-level models, the dependent variable is the difference in the average value of hexagon \(\eta \) between time t and time t– where the child is located, so it is a proxy for individual change—taking on one of 4,626 possible values—but the explanatory variables are all child specific and vary accordingly across the 105,467 observations. These models progressively increase the set of control variables listed above to check the sensitivity of the urbanization parameter estimates. For each nutrition indicator, three model specifications are estimated in the country and region-level analyses and four model specifications are estimated in the child-level analysis, for both aspatial (OLS) and spatial (S2SLS) cases. For ease of reference, the equations are numbered, including all specifications, from (1) to (42). The next paragraphs summarize the estimation results, which, together, validate the data processing methodology detailed above and form a holistic picture of the relationship between urbanization and personal wellbeing in Sub-Saharan Africa.

Table 5 OLS estimates of country-level models
Table 6 OLS estimates of region–level models
Table 7 OLS estimates of child-level models, aspatial
Table 8 S2SLS estimates of child-level models, spatial

The summary—which focuses on the urbanization variables and only touches on the controls—begins with the sets of country and region-level estimates. The country-level models, reported in Table 5, which take the prevalence of stunting, wasting, and underweight (measured as national percentages) as their dependent variables, show consistent evidence of a negative relationship between urbanization and child growth failure. Importantly, these models are panels, so they account for the evolution of urbanization, measured via impervious surface area and luminosity, over time. The exception is luminosity in the three wasting models: (4), (5), and (6). These results are consistent, as registered by the estimated parameters, \(\widehat{\beta }\), and robust, as shown by the t-statistics, across the various specifications, which introduce the two economic indicators one-by-one. As expected, higher GDP per capita and a business climate that is friendly to women also reduce the prevalence of child growth failure. Next, the region-level models, reported in Table 6, which take of levels of stunting, wasting, and underweight (measured as regional mean values of z-scores) as their dependent variables, show additional evidence that, overall, urbanization reduces child growth failure. These models are also panels, so they, too, account for the evolution of urbanization. Note that the relationships here are positive, not negative, because negative z-scores indicate growth failure—the more negative the worse. The impervious surface area variable comes in insignificant in just one stunting equation, (12) when both economic indicators are introduced and, like the national-level models, luminosity does not register in the wasting equations, (13), (14), and (15), or in the final equation, (18). Also like the national models, the estimated influence of urbanization is consistent and holds across specifications. In models (11) and (12) women’s education and the asset-based wealth index have a positive influence. These country and region-level models begin suggest that urbanization does matter to personal wellbeing.

Moving on, the child-level models, reported in Tables 7 and 8, which take change in child growth failure, captured by differencing mean z-scores in hexagons \(\eta \) at time t and time t–, as their dependent variables, deliver direct evidence of a relationship between urbanization, \({{\varvec{u}}}_{it-}\), and child growth failure. Table 8 also shows consistent evidence of spatial dependence in the models, and should be considered the more accurate of the two sets. In these models, all of the explanatory variables are initial conditions—that is, measured at time t–, as listed in the footnote of the table—and most are measured at the level of the child or household that the child belongs to. (Note that all models control for error introduced by the interpolation / kriging process at both time t and time t– but that, as measurement error, there is to particular expectation for these variables.) Impervious surface area and luminosity, measured as change at the child’s location, are statistically significant and positive, meaning that they are associated with improvement, in virtually all cases—in both the spatial and aspatial models. In the OLS estimates, impervious surface area is statistically significant in 12/16 equations and luminosity is significant in 15/16 equations. In the S2SLS estimates, impervious surface area is statistically significant in 13/16 equations and luminosity is significant in 12/16 equations. While not perfectly consistent, these results meet the broad expectation that urbanization matters to personal wellbeing. Looking past the urbanization variables, household characteristics also matter: years of women’s education, a female head of household, and indicator variables (medium and high tercile) derived from the asset-based wealth index, are all positive and, for the most part, statistically significant. The initial levels of the models’ own variables, lagged to time t–, are consistently significant and negative, meaning that poor starting levels of stunting, wasting, and underweight lead to worsening of those maladies over time. The outstanding elements of the models, \({{\varvec{\kappa}}}_{i}\), \({\varvec{\tau}}\), and \({\varvec{\phi}}\), are, respectively, the measurement errors from kriging; temporal effects tracing the timing of the DHS survey; and country fixed effects. All are included as controls due to the exploratory nature of the models and have no direct interpretation.

What can be said, based on these regressions, exploratory as they are, is that urbanization matters to wellbeing: children living in urbanized areas are less likely to be stunted, wasted, and/or underweight. The modeling results connect the stylized facts, as set out above, but more refined econometric analysis is needed to say anything with precision—or about cause and effect (Angrist and Pischke 2015).

5 Summary and conclusion

This paper opened by explaining that Sub-Saharan Africa, home to 33 of the 46 UN-designated LDCs, is experiencing the greatest urban growth in the world and posed a question fundamental to the region: has urbanization there been accompanied by improvements in personal wellbeing? While a decisive answer to this question is not available, the investigation opened here suggests a reason for optimism: urbanization, measured across spatial scales and over time, is associated with reduced child growth failure, in the form of stunting, wasting, and underweight. This signals that the wave of urbanization breaking across Sub-Saharan Africa is likely to the benefit of its people. The few remaining comments are general observations derived from the evidence and implications for further research.

The two organizing ideas behind this work—that urbanization: (i) makes people better off; and (ii) is a spatial process that rearranges not only people but, also, the conditions impacting them—appear to hold up under empirical scrutiny. The constraints-oriented approach of time geography adopted as a theoretical lens is helpful because it exposes the behavioral relationships of geographic systems across multiple spatial scales. And, it squares neatly with the capability approach to development economics. Future work on this project will look further into these dynamics, particularly by exploring opportunities for policy intervention. Such interventions may come in a variety of forms, including aid from the international community focused on enhancing the benefits of urbanization, plus technical and/or financial assistance with industrial reorganization. For example, a key element of the urbanization process is an economic reorientation, away from agriculture toward industry. United Nations designated LDC status brings, among other things, benefits related to development financing, including eligibility for special grants and loans from donor countries and financial institutions; multilateral trading, including preferential market access and favorable terms of trade; and, importantly, assistance, including access to the Enhanced Integrated Framework, a program supported by 24 countriesFootnote 9 and eight international agencies.Footnote 10 Clearly, given the trajectory of Sub-Saharan Africa, to be successful, these programs should be implemented with cities and their powerful agglomeration economies in mind.

From a policy perspective, it is important to understand not just relationships, as explored here, but also the various pathways through which urbanization contributes to improvements in personal wellbeing. Moreover, while child growth failure is a critical measure, it is but one among many dimensions of health. It is a measure that can be built upon through the monitoring of indicators, especially incidence of communicable disease and other problems that manifest through the density of cities. To this point, wellbeing implies a holistic view of quality of life—and quality of life in cities can readily be enhanced by paying careful attention to the negative externalities of urbanization in the urban planning process. Finally, it must be acknowledged that the findings presented here are the preliminary results of a larger process of compiling and reconciling remotely sensed and DHS data across the entirely of Sub-Saharan Africa; the purpose of this particular paper has been to document the methodology involved. The empirical findings, while sound, are only preliminary. They are findings that must be refined via further development of the data, plus more precisely tuned econometric models. Like many projects, this one concludes by qualifying its findings and noting the need for not only more, but more expansive research. The matter is one of critical importance, given the sheer number of lives being transformed by urbanization in Sub-Saharan Africa and all of the complexity involved in the transition. The futures of many people, including generations to come, depend on making the transition successfully.