Introduction

Childhood obesity is a major public health challenge in the USA [1]. The prevalence of obesity among children aged 2–19 varies by race/ethnicity, and the number of Latinx children and adolescents who are presenting with obesity is increasing [2,3,4]. Latinx youth represent one-quarter of the entire population of children under the age of 18 in the USA and are disproportionally burdened by obesity [5, 6]. A continuing rise in the prevalence of obesity among Latinx children is a serious concern because an excess of body fat is associated with adverse effects on health that may persist into adulthood such as hypertension, prediabetes and diabetes mellitus, sleep apnea, asthma, and depression [1, 6,7,8]. Although the prevalence varies among Latinx Americans of different ethnic backgrounds regardless of origin group, Latinx adults are more likely to be diagnosed with these medical conditions, as compared to their non-Latinx White counterparts [9,10,11,12,13].

Unhealthy weight gain develops from a chronic, positive energy balance through an interplay of genetic, biological, behavioral, socioeconomic, and environmental factors [14,15,16]. For example, underlying medical causes such as poisoning, infections, respiratory or heart disease, and immune system, central nervous system, or endocrine system dysfunction may lead to obesity [17,18,19,20,21,22,23]. Also, children from low socioeconomic status (SES) families are more likely to be diagnosed with obesity, and, predictably, this is true among Latinx youth in general and in California, specifically [24, 25]. Relatedly, the consumption of high-calorie foods with little-to-no nutritional value and the chronic psychosocial stress brought on by having limited economic resources influence weight trajectories among Latinx children [26,27,28]. Moreover, the neighborhood socioeconomic environment where Latinx children live influences body weight status, and neighborhoods with predominately racial/ethnic minority residents have higher levels of poverty and lower levels of education, which are independently associated with childhood obesity [29,30,31,32,33,34,35]. Notably, data indicate that living in either a low-SES household or a low-SES neighborhood is enough to increase the risk of child obesity [36]. Multiple domains of the social neighborhood environment also are relevant to the development of overweight and obesity [37]. For example, social cohesion and collective socialization and trust in Latinx communities are some of the more pertinent factors because they can impact norms regarding diet and physical activity [38]. Regarding the home and family environment, the parents of Latinx youth tend to underestimate their child’s weight status and perceive their child as “normal” weight even when the child is classified as overweight or obese according to body mass index (BMI) categories [39]. Taken together, these multiple influences enhance the risk for obesity among children in general, as well as in Latinx children specifically, a population where prevalence is increasing especially rapidly [2,3,4,5,6].

Obesity is a multifactorial disease, and thus, we need to appreciate fully the relevance of how socioenvironmental factors like residential context combine to associate with increased body weight. As but a single example, findings from the Moving to Opportunities Study conducted by the Department of Housing and Urban Development indicate that relocation to a low-poverty census tract led to lower obesity prevalence [40]. One way to address the significance of multifactorial neighborhood-level determinants of obesity is to use novel methods on unique populations to draw out the extent to which distinctive residential contexts give rise to social patterning that produces variation in obesity prevalence [31, 32]. Indeed, our data source allows us to overcome some of the methodological challenges plaguing the current literature, including an objective collection of height/weight measures, access to medical diagnoses, adequate sample size and number of children per residential tract, and racial/ethnic diversity [41]. Specifically, we geocoded pediatric medical record data for residents in Southern California, a geographical area with the largest Latinx population in the USA [42]. Moreover, as a geographic setting, Los Angeles County has the nation’s largest Latinx population, while Orange County ranks tenth among the counties with the largest Latinx population in the USA [42]. We took advantage of this heterogeneity to estimate the differential impacts of neighborhood context on children’s BMI z-scores. First, we explored the relationship between neighborhood context and BMI z-scores. Given past work demonstrating an association between neighborhood socioeconomic status and children’s body weight, we expected that neighborhood disadvantage would associate with heightened BMI z-scores. In addition, we expected race/ethnicity would moderate the association between neighborhood context and BMI z-scores, and that Latinx children will have higher body weight relative to other race/ethnicities irrespective of contextual factors.

Methods

Data Sources

We use a compilation of data from multiple sources. We extracted electronic medical records (EMR) data from children (n = 53,735) between the ages of 0 and 18 admitted between June 2013 and June 2018 as inpatients across a variety of care settings. EMR data were excluded for pediatric cancer patients (n = 1319), those without complete body mass index (BMI) measures (n = 3921), and outlier patients for the length of stay (n = 529) [40]. Patient addresses from the remaining EMR data were geocoded and linked to the corresponding residential Census tract. To capture the appropriate time period for analysis, neighborhood measures were assigned to children temporally by first taking the child’s address from the electronic medical record at the time of their height and weight measurement. Then, we used 5-year ACS estimates for the census tracts which “surround” the timing of the child’s records. In this way, the 5-year ACS estimates characterize the child’s area within that 5-year period. As previously done, we used Census tracts to represent neighborhoods [43]. Social and economic indicators were extracted from the 2014–2018 American Community Survey (ACS) data. We excluded observations unmatched to Federal Information Processing Series (FIPS) Codes (n = 7610), those living in Census tracts with fewer than 20 children per tract (n = 32,234), and children living outside of California (n = 90), resulting in a total sample size of 8092 children nested within 672 census tracts or neighborhoods.

Variables

The key outcome is age- and sex-specific BMI z-score (i.e., the number of standard deviation (SD) units that the child’s BMI deviates from the age- and sex-normed mean reference value, based on the 2000 Center for Disease Control (CDC) Growth Charts: USA) [44, 45]. We included all covariates available to us from the EMR to represent the child and familial characteristics. Sociodemographic characteristics include age at the time of visit, sex (1, girl; ref.; 2, boy), race/ethnicity (1, Latinx, ref.; 2, non-Latinx White; 3, non-Latinx Black; 4, Asian; 5, Other Race), and insurance type (1, Private, ref.; 2, Public (e.g., MediCare or MediCal)) as a proxy for socioeconomic status (SES) [31, 46,47,48]. Other measures include admission type (1, elective, ref.; 2, emergency; 3, other (i.e., urgent care or accident)); and diagnosis based on ICD9/10 codes (0, absence of diagnosis; 1, presence of diagnosis), including bacterial and viral infection (A00-A99), diseases of the blood and blood-forming organs, and disorders involving immune mechanisms (D50-D89), endocrine, nutritional and metabolic diseases (E00-E89), ref., diseases of the circulatory system (I00-I99), nervous system diseases (G00-G99), congenital malformations, deformations and chromosomal abnormalities (Q00-Q99), diseases of the digestive system (K00-K95), diseases of the genitourinary system (N00-N99), diseases of the musculoskeletal system (M00-M99), diseases of the respiratory system (J00-J99), injury, poisoning, and certain other external consequences (S00-T88), and other diagnoses (H00-H59, L00-L99, O00-O9, P00-P96, Z00-Z99). We also included length of stay (mean = 3.82; Std. Dev. = 5.95) and patient maximum pain assessed by healthcare providers throughout the child’s hospital stay using several developmentally and situationally appropriate measurement tools, including faces, legs, activity, cry, and consolability scale, faces pain scale, numeric rating scale, and neonatal pain, agitation and sedation scale (range from 0, no pain to 10, severe pain). Social and economic indicators of the child’s neighborhood of residence come from the ACS and include community-level education (% adults with < 12 years of education; % adults with 12 years of education; % adults with > 12 years and < 16 years of education; % adults with 16 years of education; % adults with 18 years of education; % adults with > 18 years and < 21 years of education), median community-level income, percent of female-headed households, percent receiving public assistance, percent in poverty, percent of homes in the tract that are rented, and racial/ethnic composition measured by percent of major racialized categories (% non-Latino White; % non-Latino Black; % Latinx; % Asian; % Other Race). After testing the cross-level assumption that the random effect of the intercept is correlated with a level 1 measure, using a Hausman test, we uncovered that some unobservable neighborhood characteristics relegated to the error term are correlated with some observable patient characteristics. We tested for correlations between neighborhoods and patient-level characteristics and found that the offending explanatory measures were race/ethnicity and insurance type. Then, we computed neighborhood aggregates of these patient-level characteristics (i.e., means of race/ethnicity and insurance type) to control for neighbor effects and included these measures at level 2 in our fully specified models [49].

Statistical Analysis

We first used latent profile analysis (LPA) to group neighborhoods into clusters based on the above-listed socioeconomic indicators extracted from the ACS, which are often used to define a child’s neighborhood of residence [50, 51]. Using MPlus 8.8 software, we estimated a 1-profile model in a type I covariance structure and fit successive models with an increasing number of profiles to characterize neighborhoods [52, 53]. We used entropy and theoretically driven evidence to select the most appropriate number of profiles and further evaluated model fit with AIC, BIC, a-BIC, and loglikelihood values to identify the most parsimonious model. Specifically, our substantive neighborhood cluster interpretations are based on theoretical neighborhood stratification observed across the USA (i.e., upper class, middle class, working class, and disadvantaged communities). The stability of our 4-profile solution was verified by model fit statistics wherein we stopped adding profiles once the model fit indicators began to increase and the entropy levels began to decrease (Table 1). Given our data, analyses indicated that neighborhoods are most appropriately captured by a 4-profile solution (Table 1). Figure 1 displays the patterning of neighborhood-level characteristics by LPA-generated neighborhood profiles. We standardized all measures for this figure by converting median household income to the ratio of income to poverty using 2016-inflation adjusted dollars for a 3-person household.

Table 1 Model fit & usefulness information for LPAs with 1−5 latent profiles
Fig. 1
figure 1

Descriptive neighborhood-level characteristics by LPA-generated neighborhood profiles

We then test the influence of distinct neighborhoods on children’s BMI z-scores using multilevel linear regression modeling with Stata 16.1 software [54]. This technique treats level-1 children as nested within level-2 neighborhoods and neutralizes the lack of independence of data within higher groups. Our modeling approach uses adaptive quadrature to adjust for problems that otherwise downwardly bias estimated standard errors, including different sample sizes for level-1 and level-2 units, clustering within neighborhoods, variable numbers of cases within level-2 units, and heteroscedastic error terms [55]. We performed a series of conditional models that first included the covariates of a child- and family-level predictors (age, sex, race/ethnicity, insurance type, admission type, diagnosis, length of stay, patient-reported pain levels) to test the influence of child and family factors on children’s BMI z-scores. In the next set of models, we included the LPA-constructed neighborhoods and neighborhood averages of race/ethnicity and insurance status at the neighborhood level (and a neighborhood-level error component) along with the child- and family-level predictors and an individual error term. In our last set of models, we included interactions between the LPA-constructed neighborhoods and race/ethnicity at the child/family level to determine whether race/ethnicity moderates the association between neighborhood disadvantage and the expected heightened BMI z-scores for Latinx children.

Results

We display how the four neighborhood profiles cluster across the area of study in Fig. 2. We assigned the following labels: Advantaged, Middle-Class, Working-Class, and Disadvantaged based on the descriptive characteristics. The advantaged neighborhood profile is largely concentrated in east Orange County, north-west Riverside County, and north-west San Diego County regions. As shown in Table 2, Advantaged neighborhoods have the highest median household income ($130,461), the highest overall levels of education (31% of residents have at least 16 years of education), and the lowest percentage of people living in poverty (8%). The Disadvantaged communities make up the areas around north-west Orange County and central Riverside County regions and score the worst on nearly every indicator. These communities have the lowest median household income ($41,470), the lowest education levels (48% of adult residents lack a high school degree), and the highest proportion of the population in poverty (29%).

Fig. 2
figure 2

Neighborhood profiles by census tracts, Southern California

Table 2 Descriptive neighborhood-level characteristics by LPA-generated neighborhood profiles

We show descriptive information for the child and family characteristics overall, and by neighborhood context in Table 3. The average overall BMI is 0.38 SDs above the national reference. However, the majority of children fall at or below the 67th percentile, or in the “Healthy Weight” category. Children living in Advantaged neighborhoods have 0.02 SDs below the mean while those residing in Disadvantaged neighborhoods have 0.65 SDs above the mean. The mean age for the entire sample was 9.80 years, with children in Advantaged neighborhoods slightly older than those in other neighborhood types. In Fig. 3, we show that, although there is some clustering of children of specific race/ethnicity across neighborhoods, each racial/ethnic group is represented within each neighborhood. As displayed in Table 3, children admitted to the emergency department and presenting with diseases of the digestive system are slightly over-represented in Disadvantaged neighborhoods compared to those residing in other neighborhoods.

Table 3 Means and standard deviations (SD) for independent and dependent variables overall and by neighborhood profile
Fig. 3
figure 3

Distribution of children across LPA neighborhood contexts by race/ethnicity

Results from our multilevel linear regression models predicting BMI z-scores are shown in Table 4. Model 1 partitions the total variation in BMI z-scores into within- and between-neighborhood variance. Model 2 only included the child/family-level characteristics (age, sex, race/ethnicity, insurance type, admission type, diagnosis, length of stay, patient-reported pain levels). Model 3 includes the LPA-generated neighborhoods. Fully specified Model 4 includes the interaction terms by race/ethnicity and neighborhood profiles. The random effects estimated across all models indicate significant variation in pediatric BMI z-scores across neighborhoods.

Table 4 Multilevel linear regression models predicting BMI z-scores; N = 8011 in 672 census tracts

In Model 1, intraclass correlation from an unconditional model reveals that around 20% of the variation in children’s BMI z-scores is attributed between neighborhoods. In Model 2, we see that non-Latinx white (− 0.32; 95% CI: − 0.44 −  − 0.19), non-Latinx black (− 0.34; 95%CI: − 0.58 −  − 0.09), and Asian children (− 0.56; 95%CI: − 0.72 −  − 0.39), relative to Latinx, have lower BMI z-scores. Each additional year in age significantly increased BMI z-scores by 0.03 SD units (95% CI: 0.02 − 0.03). Publicly insured children (0.15; 95% CI: 0.05 − 0.26) and those with diseases of the genitourinary system (0.19; 95% CI: 0.04 − 0.35) and higher pain levels (0.04; 95% CI: 0.03 − 0.05) have higher BMI z-scores, whereas children with longer lengths of stay (− 0.01; 95% CI: − 0.02 −  − 0.01), congenital malformations, deformations and chromosomal abnormalities (− 0.33; 95% CI: − 0.46 −  − 0.20), and diseases of the musculoskeletal system and connective tissues (− 0.15; 95% CI: − 0.28 −  − 0.01) have lower BMI z-scores. In Model 3, accounting for neighborhood profiles slightly attenuates the racial/ethnic and socioeconomic differences in BMI z-scores indicating that some of the lower BMI z-scores for non-Latinx white, non-Latinx black, and Asian children, relative to Latinx, and higher BMI z-scores for publicly insured children, are due to the neighborhood environment. Model 3 largely mirrors Model 2 in other child/family-level factors and further shows that higher levels of neighborhood disadvantage independently associate with higher BMI z-scores. More specifically, children housed in Disadvantaged neighborhoods have 0.19 SDs higher BMI (95% CI: 0.02 − 0.36) compared to those living in Middle-Class communities. Finally, we examined whether race/ethnicity moderates the influence of neighborhood context on BMI z-scores. Model 4 indicates an interaction between neighborhoods and race/ethnicity. Specifically, we find that Latinx children living in Middle-Class neighborhoods have higher BMI z-scores than Asian (− 0.79; 95% CI: − 1.47 − 0.12) and Other Race (− 0.47; 95% CI: − 0.89 −  − 0.05) children living in the most disadvantaged communities.

Discussion

This is the first study to date that employed LPA to detail how neighborhood context matters differently for children’s body weight status in a predominantly Latinx sample. We extracted EMR data from pediatric patients living in Southern California, a geographic setting with the largest Latinx population in the USA [42]. Importantly, we controlled for comorbid conditions like endocrine, nutritional, and metabolic diseases, as well as chromosomal, cardiovascular, pulmonary, neurological, musculoskeletal, and digestive system disorders that otherwise would bias estimates [7, 14,15,16,17,18,19,20,21,22,23]. Specifically, we generated neighborhood typologies based on social and economic indicators extracted from the ACS and found that the residential socioeconomic environment is associated with heightened BMI z-score. Our finding is consistent with published studies, but other researchers failed to account for underlying diagnoses that could introduce bias. Thus, in line with our expectations, using a diverse sample and accounting for important comorbid conditions, we show that a child’s neighborhood of residence independently associates with higher BMI z-scores. This could be driven by the overall higher incidence of overweight and obesity in the USA among Latinx youth, combined with compounding neighborhood-level determinants such as the absence of a health-promoting infrastructure that led to the greater prevalence of higher BMI [2,3,4, 31,32,33,34,35,36,37].

Our findings support previous reports that race/ethnicity moderates the association between LPA-constructed neighborhoods and children’s body weight. Specifically, here, we found that Latinx children living in more advantaged neighborhoods had higher BMI z-scores than Asian and Other children residing in the most disadvantaged communities. These results are troublesome given that a wealth of literature supports the notion that with affluence a child’s risk of presenting with obesity declines. Vis-à-vis individual-level indicators, the variation in children’s body weight associates with age, sex, race/ethnicity, SES, physical activity level, screen-time viewing, and sleep duration/bedtime [56,57,58,59]. Family-level, obesity-relevant factors include family physical activity, family mealtimes, food insecurity, and the income-to-need ratio [60,61,62]. The relationship among individual characteristics, in-home practices, and child obesity, however, is complicated by the influence of neighborhood-level descriptors [31,32,33,34,35,36,37]. For example, neighborhood median income modifies the association between a child’s BMI z-score and proximity to fast-food restaurants [63]. More specifically, the proximity to “unhealthy” food establishments has a stronger adverse effect on body weight status in lower-income neighborhoods, compared to the effect on residents of more affluent communities [63]. Thus, we need to optimize and implement a range of long-term, multicomponent intervention programs that focus on the early prevention of obesity. For example, a 3-year-long multilevel approach to reduce obesity among low-income, primarily Latinx children in Northern California, did show some promise for reducing weight gain [64]. Also, it is noteworthy that combined nutrition–physical activity initiatives, like The California Endowment’s Healthy Eating, Active Communities program, can be effective at preventing and reducing childhood obesity in low-income communities [65].

Published studies do offer insight into community-level factors that enhance susceptibility to obesity among children from socially vulnerable groups. For example, neighborhood safety influences the amount of physical activity and screen time [66, 67]. Also, poor adherence to physical activity recommendations among low-income Latinx women is a predictor of excessive gestational weight gain, which is independently associated with infant birthweight percentile [68, 69]. With regard to children, racial/ethnic differences exist across different groups in terms of physical activity and sedentary behaviors overall, and Latinx, compared to non-Latinx Black and non-Latinx White, children were found to participate in the least physical activity [70,71,72,73]. Crucially, the neighborhood context does impact girls and boys in a different way in terms of health benefits [31]. Relatedly, similar findings have been reported with regard to race/ethnicity. In Los Angeles County, for example, WIC-participating children across levels of neighborhood poverty are less likely to develop obesity, yet the prevalence of obesity remains higher among Latinx WIC-enrolled children compared to those who are non-Latinx White, non-Latinx Black, or Asian [74, 75]. Additionally, the neighborhood’s social and cultural fabric is yet another obesogenic factor. Indeed, the interplay of social group norms and networks like peers, school teachers, and digital mass media exposure informs acceptable ranges of body size and contributes to weight gain among Latinx youth [76,77,78]. Overall, data show that there is an immense disparity in the rates of pediatric overweight and obesity among racial/ethnic groups [4, 79]. Hence, intervention programs and public policies should include a multicomponent approach that targets parental behaviors, family patterns, household conditions, and the built and social infrastructure of communities [64, 65].

Our study, though informative, is not without limitations. We are constrained by the limited individual- and family-level variables available in the EMR data that are typically used for administrative purposes. For example, prior work indicates that children of immigrant parents present with higher levels of overweight, but we do not have information on immigration status nor origin group [80, 81]. Still, we included all covariates available to us in the EMR data known to associate with children’s body weight. Moreover, we use a public–private insurance dichotomization as a proxy measure for SES. We acknowledge that it is not ideal to use insurance type as a proxy for SES, but publicly provided health care coverage like Medicaid is only obtainable by children who meet stringent income criteria, with the exception of those with certain medical conditions [82]. Still, insurance coverage does have reasonable validity and reliability when used as a marker for individual-level SES [31, 32, 46,47,48]. Other scholars should test the reliability of our findings by using a more comprehensive set of individual- and family-level covariates known to associate with children’s body weight, including but not limited to nativity status, cultural differences, or parental education and income. Similarly, the cross-sectional nature of our data that are restricted to Los Angeles County and Orange County in Southern California limits the scope of our analyses. Nonetheless, this geographic setting has the largest Latinx population in the USA, which provides a unique opportunity to estimate the differential impacts of comprehensive neighborhood profiles on BMI z-scores in a diverse group of children [42]. Also, we used census tracts to represent neighborhoods. Census tracts are by no means a perfect operationalization of residential environments, but they remain a useful spatial entity available to us in the approximation of a neighborhood [30,31,32, 43]. Certainly, using addresses instead of zip codes provides a more robust spatial unit of analysis [37]. Finally, we examined the influence of distinct neighborhoods on children’s BMI z-scores. Although BMI is a simple measure, BMI categories are reasonably good for diagnosing pediatric obesity, especially when height and weight measures are collected objectively [83, 84]. Indeed, for our current study, height and weight were measured by trained healthcare workers.

Here, we sought to understand the pathways that underly the development of increased body weight in children by applying novel methods to a rich data source. Specifically, we used geocoded EMR data from pediatric patients to identify potential strategic targets for obesity prevention among Latinx youth residing in neighborhoods located in Los Angeles County and Orange County of Southern California. Our findings underscore the complex relationship between community racial/ethnic composition and body weight status during childhood. Indeed, here, we confirmed the significance of the place of residence on children’s body weight by focusing on the differential influence of race/ethnicity and neighborhood socioeconomic context. The results of our analyses raise critical questions for future research and represent opportunities for policy to help children maintain a healthy weight.