Background

The burdens of COVID-19 pandemic are not economically and socially equal across different populations. Socioeconomically disadvantaged populations are reported to be at greater risk during the pandemic, with higher rates of infection recorded in impoverished communities, ethnic minorities, the homeless, and people with pre-existing stigmatized infectious disease (e.g., HIV) [1,2,3]. Prior evidence shows that populations with socioeconomic disadvantages may be more vulnerable to the COVID-19 pandemic and bear a higher risk of morbidity and mortality. For example, it is difficult for poor people to comply with strict sanitation requirements and adopt hygiene tips to protect themselves from the virus, since low-income families are more likely to have less access to clean drinking water and a sufficient supply of sanitizers during the pandemic [4]. Ethnic minority migrants are more likely to live in more crowded, multigenerational housing and thus are less able to keep social distancing [4]. Fewer than a fifth of Black American workers and roughly a sixth of Hispanic workers have the opportunity of telework, compared with nearly 30 percent of non-Latino whites [5]. Black Americans are therefore more likely to work in basic service jobs that require frequent human contact (such as food, delivery, transportation) [5].

These conditions placed ethnic minorities with socioeconomical disadvantages at higher risk of COVID-19 infection, morbidity, and mortality. Long-standing historical discrimination, inadequate access to quality health care, and the economic constraints force these populations to make decisions based on economic reasons rather than giving priority to overall health [6]. Besides, socioeconomic inequities are also associated with population inequities in terms of pre-existing diseases (e.g., diabetes, heart disease, respiratory diseases), which may lead to higher mortality rates among people with pre-existing conditions after contracting COVID-19 [7, 8]. More importantly, inadequate health awareness about COVID-19 also plays a vital role in increasing the risk in these poor communities [9]. Previous study shows a disproportionately high rate of positive tests among blacks and low-income people without health insurance [10]. The disparity in health risks between ethnicities may not be limited to infection. A recent investigation shows that ethnic minorities who have ever been faced with racial discrimination are more likely to be hesitant about vaccination [11]. Long-standing medical mistrust based on racial/ethnic groups may also explain the lower rates of vaccination intake among ethnic minorities [12]. It is reported that non-Hispanic blacks are 11% less likely to be vaccinated than non-Hispanic whites, while Asian Americans are 50% more likely to be vaccinated [13].

Although some efforts have been made by prior studies on this issue, there are still some research gaps. First, most previous studies have focused on individual-level explanations for the risk of COVID-19 infection in specific populations, with using hospitalization data. This study aims to depict, at a macro-level, the associations between population distribution by ethnicity and vulnerability to this pandemic in specific regions. Second, compared with prior studies that show the relationships between population distribution of ethnicities and infection risk at a given timepoint, this study provides the dynamics over time by examining their associations at multiple equidistant timepoints. The examination at a given timepoint lacks sufficient reliability, since the frequency mismatch between population distribution by ethnicity (which is usually stable over several years) and infection/vaccination (which is often volatile and varies from day to day). Third, prior studies have not addressed whether the disproportionate risk of COVID-19 infection faced by ethnic minorities would extend to vaccination coverage. This study fills this research gap by providing an empirical examination.

Ethnic disparities in infection risk and health care utilization in the pandemic: a review

There are growing concerns that racial/ethnic minorities are at the disproportionate risk of mortality and morbidity of the pandemic. A national cohort study of the veterans in the US finds that the additional risks and burdens experienced by racial minorities could not be entirely explained by healthcare conditions, residence place, or the hospitalized facilities where they access care [14]. Similar results are observed in another study which uses ambulatory care data from Bronx, New York, and reports that blacks have a 1.6 times higher risk of mortality than whites [15]. Massachusetts General Hospital records a rise in Hispanic patients from 9% before the outbreak of pandemic to 35–40% after [16]. A report from Boston Medical Center shows that about 44.7% outpatients with novel coronavirus disease symptoms are non-Hispanic blacks and 14.3% are non-Hispanic whites [14]. Non-Hispanic blacks and non-Hispanic whites account for 42% and 13.4% of severe cases in ICU with mechanical ventilation [17].

The elevated risk for colored people can also be related to the differences in pre-existing comorbidities and testing rates [18]. Ethnic minorities have higher odds of obesity, diabetes, and cardiovascular diseases, which could easily induce comorbidities of novel coronavirus disease, leading to a higher mortality risk [19]. There are also significant racial differences in the awareness, knowledge, and response to novel coronavirus disease, with non-Hispanic whites having the highest level of relevant health and hygiene knowledge, subsequently followed by Asian Americans, Hispanics, and non-Hispanic blacks [20]. Such differences may prevent ethnic minorities from taking appropriate countermeasures and may further exacerbate ethnic disparities in mortality and morbidity risk during the pandemic [20]. Besides, ethnic minorities are more likely to be faced with socioeconomic disadvantages, such as living in overcrowded places where it is difficult to keep physical distancing and holding essential jobs in social care and public transport where the risk of exposure is higher, which could also increase their mortality and morbidity [21, 22]. A recent study shows that African Americans have at least 50% higher odds than whites of working in essential industry sectors that are often exposed to infection, such as hospitals, healthcare and social assistance, and animal slaughtering and processing [23]. Inefficiencies and inadequacies in the health care system are an important reason why the poor are kept away from health care. Support for universal health care and primary health care remains inadequate [24], especially for immigrants dominated by ethnic minorities. More than 46.7 million immigrants now live in the US, but 11 million of them are undocumented, so they are not entitled to public health care and funding for public services [25]. In Texas, 32% of undocumented immigrants live below the poverty line and 64% do not have health insurance [2]. Such evidence may highlight the potentially disproportionate risks that ethnicities and minorities face over a long period of time prior to the outbreak of the pandemic.

Method

Setting

The advantage of using the US sample is that the US is a multi-ethnic country. According to the latest statistics on the ethnic profile of the US [26], the most prevalent racial or ethnic group is the White alone non-Hispanic population at 57.8%. The Hispanic population is the second-largest racial or ethnic group, comprising 18.7% of the total population. The non-Hispanic black population total 44.78 million people (13.5%) in 2020. Asian Americans are now the fastest-growing major racial or ethnic group in the US, accounting for 6.1% of the total population in the same year.

Data description

Open data from multiple sources are combined and matched for regression analysis in this study. Specifically, novel coronavirus disease data (confirmed cases) come from the New York Times that continuously tracks the daily real-time cumulative counts of coronavirus cases in the US at the county level.

Data of vaccination at the county-level come from the Center for Disease Control and Prevention (CDC) in the US, which discloses the vaccination coverage not only in the total population, but also among those aged > 12, > 18 and > 65 years.

Population composition data of ethnic minority and socioeconomic factors at the county level are collected from the County Health Rankings & Roadmap Program (CHRRP, 2020), which is initiated and conducted in collaboration by Robert Wood Johnson Foundation and University of Wisconsin and aims to improve health equity and promote evidence-based policy and practical approaches [27, 28]. The CHRRP synthesizes the multi-source national health surveillance data with the time span ranging between 2016–2019 from various governmental agencies.

The availability of data from well-equipped statistical and survey institutions above provides an opportunity to examine the association between population distribution and health risks during the COVID-19 pandemic.

Variables

Dependent variables are the confirmed cases of COVID-19 (standardized by population, taken in natural log) at the county level, and the vaccination coverage in total population and among those aged > 12, > 18, and > 65 years.

Independent variables are population distribution by ethnicities, including population percentage of non-Hispanic black (Mean = 9.47, S.D. = 14.55), population percentage of Asian American (Mean = 1.59, S.D. = 2.84), and population percentage of Hispanics (Mean = 9.66, S.D. = 13.74).

Covariates include the following variables. Primary care physician is measured by the number of primary care physicians per 100,000 population (Mean = 52.78, S.D. = 34.52). Life expectancy at the county level is measured by the average life expectancy of a county (Mean = 76.83, S.D. = 7.43). Housing problem is measured by the percentage of households with at least one of following problems including overcrowding, high housing costs, lack of kitchen, or plumbing facilities (Mean = 13.95, S.D. = 4.34). Regions are classified into four categories according to two aspects (1) the median household income, and (2) the rich-poor polarization (measured by the income ratio of 80th and 20th percentile in a certain county). These four categories include Category-1, if the rich-poor polarization is above the average level and the median household income is above the average level; Category-2, if the rich-poor polarization is below the average level and the median household income is above the average level; Category-3, if the rich-poor polarization is above the average level and the median household income is below the average level; and Category-4, if the rich-poor polarization is below the average level and the median household income is below the average level.

Statistical data analysis

Multiple linear regressions adjusted with robust standard errors are conducted to examine the relationships between variables in this study. To capture the dynamics over time, multiple linear regressions at equidistant time points (i.e., May 28th 2020—Jan 28th 2021, with one-month interval between each timepoint) are used to examine the associations between population distribution by ethnicities and the confirmed cases of COVID-19. Still, multiple linear regressions at equidistant time points (i.e., Jan 25th—Aug 25th, 2021, with one-month interval between each timepoint) are used to examine the associations between population distribution by ethnicities and vaccination coverage during this pandemic.

Given that vaccination began in January, 2021, multiple linear regressions of confirmed cases at equidistant time points are set to end on Jan 28th, 2021. By this way, this study seeks to isolate the impact of vaccination on confirmed cases when estimating the influence of population distribution on confirmed cases. Stata 16.0 (Stata Corp. LLC., College Station, TX, USA) is applied in the analysis. The regressions are shown as below.

$$Confirmed\;cases\;of\;COVID-19\;at\;the\;county\;level\;(standardized\;by\;population,\;taken\;in\;natural\;log)\;t=\beta0+\beta1\;Population\;percentage\;of\;non-Hispanic\;black\;+\;\beta2\;Population\;percentage\;of\;Asian\;American\;+\;\beta3\;Population\;percentage\;of\;Hispanics\;+\;\beta4\;Primary\;care\;physician\;per\;100,000\;population\;+\;\beta5\;Life\;expectancy\;at\;the\;county\;level\;+\beta6\;Percentage\;of\;households\;with\;housing\;problem\;+\;\beta7\;Category\;of\;regions\;+\;\varepsilon t$$

Where the subscript t indicates the t-th time point, including May 28th 2020, Jun 28th, Jul 28th, Aug 28th, Sep 28th, Oct 28th, Nov 28th, Dec 28th 2020, Jan. 28th 2021.

$$Vaccination\;coverage\;at\;the\;county\;level\;(for\;different\;age\;groups\;including\;total\;population\;and\;>12,\;>18,\;and\;>65years)\;t=\;\beta0\;+\;\beta1\;Population\;percentage\;of\;non-Hispanicblack\;+\;\beta2\;Population\;percentage\;of\;Asian\;American+\;\beta3\;Population\;percentage\;of\;Hispanics\;+\;\beta4\;Primary\;care\;physician\;per\;100,000population\;+\;\beta5\;Life\;expectancy\;at\;the\;county\;level\;+\;\beta6\;Percentage\;of\;households\;with\;housing\;problem\;+\;\beta7\;Category\;of\;regions\;+\;\varepsilon t$$

Where the subscript t indicates the t-th time point, including Jan 25th 2021, Feb 25th, Mar 25th, Apr 25th, May 25th, Jun 25th, Jul 25th, Aug 28th 2021.

Empirical results

The spatial distributions of confirmed cases (standardized by population, and the dynamics with time) are illustrated (see Fig. 1 in Appendix). Results of Table 1 demonstrate that the population percentage of ethnic minorities in a county (i.e., the non-Hispanic blacks and Hispanics) was positively associated with the confirmed cases (for the non-Hispanic blacks, 0.004–0.041, p < 0.01 from May 2020 to Jan 2021; for the Hispanics, 0.006–0.023, p < 0.01 from May 2020 to Jan 2021).

Table 1 The influences of population distribution by ethnic minority on confirmed cases of novel coronavirus disease in the US (county level)

For the population percentage of Asian Americans, its association with confirmed cases was significantly positive in the early phase (0.042 in May 2020, and 0.027 in Jun 2020, p < 0.01), and insignificant in the mid-phase (0.010 in Jul 2020, 0.009 in Aug 2020, 0.006 in Sep 2020, -0.004 in Oct 2020, p > 0.10), and then significantly negative in the later phase (-0.018 in Nov 2020, -0.018 in Dec 2020, -0.018 in Jan 2021, p < 0.01). These results provide evidence for ethnic differences in the infection risk exposure during the pandemic.

Besides, the spatial distributions of vaccination coverage (population percentage, and the dynamics with time, in total population and among those aged > 12, > 18, and > 65 years) are illustrated (see Figs. 2, 3, 4 and 5 in Appendix). As depicted in these figures, the population aged > 65 years outperform other age groups in vaccination coverage. Tables 2, 3, 4 and 5 show the associations between population distribution by ethnicities and vaccination coverage in total population and among those aged > 12, > 18, and > 65 years respectively. As shown in Table 2, there are significant negative associations between population distribution by non-Hispanic Black/Hispanics and vaccination coverage at all timepoints in the total population, which indicates that counties with a larger proportion of non-Hispanic Black/Hispanics in the total population have a lower level of vaccination coverage. Similar support is found for the population distribution by non-Hispanic Black/Hispanics across different age groups. In Tables 3, 4 and 5, counties with a larger proportion of non-Hispanic Black/Hispanics in age groups > 12, > 18, and > 65 years also show lower vaccination coverage at most timepoints. Further, as shown in Table 2, 3, 4 and 5, the associations between population distribution by Asian Americans and vaccination coverage at most timepoints and in most age groups are insignificant. These results indicate that there are no associations between population distribution by Asian Americans and vaccination coverage.

Table 2 The influences of population distribution by ethnic minority on COVID-19 vaccination coverage (total population) in the US (county level)
Table 3 The influences of population distribution by ethnic minority on COVID-19 vaccination coverage (> 12 years of age) in the US (county level)
Table 4 The influences of population distribution by ethnic minority on COVID-19 vaccination coverage (> 18 years of age) in the US (county level)
Table 5 The influences of population distribution by ethnic minority on COVID-19 vaccination coverage (> 65 years of age) in the US (county level)

Discussion

This study provided an empirical examination on the relationships between population distribution by ethnicity and infection risk/vaccination coverage during the pandemic from the population composition perspective. In practice, it could help policymakers formulate effective plans for the geographical distribution of healthcare resources to improve the health care access of vulnerable groups in health crises.

Results of this study provided macro-level evidence for previous studies that reported the inequity in health risks for ethnic minorities during this pandemic. This study revealed that the population distributions of non-Hispanic blacks and Hispanics are shown stably and positively associated with the confirmed cases in all cross-sections from May, 2020 to Jan, 2021, which could confirm previous findings reporting the disproportionate infection risk to ethnic minorities such as the non-Hispanic Black during the pandemic [14,15,16,17]. In addition, results of this study suggested that the negative association between population distribution of ethnic minorities and health disparities can extend from infection risk to health care utilization/access during the pandemic. Areas with higher population distribution of non-Hispanic Black/Hispanics displayed a lower level of vaccination coverage in this study. This finding worked not just for total population, but also for subgroups aged > 12, > 18 and > 65 years. Some potential causes may lead to a higher risk of infection and less access to vaccination for ethnic minorities. To be specific, this groups of people is often faced with poorer socioeconomic status [21, 22, 29, 30]. They have more chance of having essential jobs with higher exposure risks [5, 23, 31,32,33], but have less access to health services [4, 6, 12]. They are also more likely to suffer from inequity in pre-existing diseases and stigmatization [18, 19, 34], and to lack adequate health awareness, information, and knowledge about the novel coronavirus disease pandemic [9, 20, 35].

The population aged > 65 years are shown to outperform other age groups in vaccination coverage (see Figures). This higher vaccination coverage is consistent with the prioritization of this age group early in the vaccination program. Following recommendations by the Advisory Committee on Immunization Practices to prioritize COVID-19 vaccination for specific groups of the US population at highest risk for COVID-19 hospitalization and death, many states included older adults among the first groups eligible for vaccination [36]. However, vaccination coverage for those aged > 65 years still varies by race and ethnicity. Among recipients aged > 65 years of at least 1 dose of COVID-19 vaccine, 70.7% were White, 7.6% were non-Hispanic Black, 6.7% were Hispanic, and 3.5% were Asian American [37].

In contrast, the association between the population distribution of Asian Americans and the confirmed cases showed a transition from positive to negative in this study. The association between population distribution of Asian Americans and vaccination coverage did not appear significant as well. Both results may indicate that Asian Americans are better adapted to the requirements of health care practice in the pandemic, though a higher level of confirmed cases among Asian Americans during the early stage of pandemic outbreak because of social vulnerability in racial/ethnic minority groups. These results confirmed prior findings that Asian Americans performed better on COVID-19 knowledge, attitudes and health literacy in the US [38], and had lower hospitalization rates than other minorities in a survey of 12 states [39].

This study has some policy implications. First, the characteristics of population distribution should be considered in epidemic prevention and control, which is of great importance for public health management in a region or even a country. The regions whose population distribution reflects a high risk of infection (and low vaccination acceptance) should be focused. Additional assistance, such as the increase in welfare and food supplies and the reduction in racial segregation, is particularly important for these regions during the pandemic. Second, the improvement of medical and testing conditions is the key practices to reduce inequity in healthcare resources in these regions. Government agencies should increase the necessary investments in medical resources, expand the coverage of health insurance, establish more medical aid centers, equip more professional medical personnel, and provide timely and free nucleic acid testing, etc. besides non-pharmaceutical intervention (e.g., lockdown [40, 41], social distancing). Third, the disparities in vulnerability to this pandemic between regions with different population distribution structures highlight deep-rooted social problems that need to be addressed for the long-term development of society. Forth, continued efforts are needed to improve vaccination coverage among persons in the group aged under 65 years. Community-specific messaging could engage this age group by using trusted sources to explain the value of vaccination to communities and individuals and to address concerns about vaccine safety. In addition, this age group might be reached by setting up strategically-located mobile and walk-in clinics with flexible hours, providing vaccinations in the workplace, and encouraging employers to provide paid leave for employees to receive the vaccine and treat any vaccine-related side effects.

There are still some limitations in this study. First, this study treated the population distribution as static, since it is stable in a short period of time. Due to unavailability of data, this study could not include the dynamic characteristics of population distribution on a county level in daily/monthly frequency to investigate their associations with the pandemic development. With relevant data available, future research need to provide a comprehensive investigation. Moreover, in the absence of data on the population distribution and the pandemic development at a lower regional level, this study focused mainly on the county-level relationships. With data at a lower regional level available, future research can conduct a fine-grained investigation, such as at the community level, to make research findings of more practical significance. Furthermore, the availability of data limited the investigation in other multi-ethnic countries outside the US. Future research can make comparative analysis when other data sources are available.

Conclusions

The population distribution by racial/ethnic minority is shown, on county level, associated with the infection risk and vaccination coverage in the US during the COVID-19 pandemic. The findings provide a macro-level identifier of health risks for epidemic prevention and control, and provide scientific support for reducing inequity in health risks of vulnerable populations and promoting regional public health management. It is therefore essential to pay close attention to the regions whose population distribution reflects a higher health risk and take measures to reduce inequity faced by vulnerable populations in terms of health resources, social security, racial and social issues. Further research needs to make a more fine-grained exploration of the relationships between population distribution and regional health risks at lower regional levels and in the contexts of other multi-ethnic countries with richer characteristics of population distribution, in order to provide more operational and universal guidance.