Introduction

figure a

The prevalence of diabetes in China has increased markedly in the past few decades. The proportion of Chinese adults estimated to have diabetes was 0.9% in 1980, 2.5% in 1994 and 10.9% in 2013 [1,2,3]. The increase is thought to be the most rapid worldwide, and it is related to China’s recent rapid economic development and urbanisation, which are contributing to socioeconomic and epidemiological transition [4].

Evidence from developed countries that have completed the epidemiological transition shows that non-communicable diseases (NCDs) are initially more common in population subgroups of high socioeconomic status (SES) and then, with increasing development, become more common in lower SES groups [5]. However, the evidence from low- and middle-income countries (LMIC) is limited [6]. Previous studies have reported inconsistent associations between SES and diabetes prevalence in mainland China, and findings were not obviously influenced by study year or the level of economic development of the study area [7, 8]. However, in Hong Kong and Taiwan, where economic development and epidemiological transition are at a more advanced stage than in mainland China, an inverse association between SES and diabetes prevalence has been described [7]. Reliable assessment of the association between SES and diabetes in different parts of China is needed to plan and evaluate health services and diabetes prevention strategies. In order to examine the recent socioeconomic pattern of diabetes risk in China, we describe the associations between SES and both prevalent and incident diabetes, using data from the China Kadoorie Biobank (CKB), a large prospective cohort study of about 0.5 million Chinese adults.

Methods

Study population

Detailed information about the study design, survey methods and population of the CKB has been reported previously [9]. Briefly, the baseline survey took place between June 2004 and July 2008 in ten geographically defined areas (five urban and five rural) of China. The areas were selected according to local disease patterns, exposure to certain risk factors, population stability, levels of SES, quality of death and disease registries, local commitment and capacity. Overall, 512,891 adults aged 30–79 years were enrolled, with a response rate of about 30%.

Ethical approval for this study was obtained from the University of Oxford, the Chinese Centre for Disease Control and Prevention and the local Centres for Disease Control and Prevention in the ten study regions. All participants provided written informed consent.

Assessment of socioeconomic status

We assessed participants’ SES status in two ways: (1) the highest level of school education an individual attained (no formal school, primary school, middle or high school, and college or above); and (2) total household income in the previous year (<10,000, 10,000–19,999, 20,000–34,999, and ≥35,000 Chinese yuan).

Assessment of covariates

We obtained covariates from the baseline questionnaire, including demographic characteristics (age, sex, urban and rural residence, and study regions), health-related behaviours (regular active and passive smoking, regular consumption of alcohol, fresh fruit, fresh vegetables and fish), personal and family medical history (history of CHD, stroke or transient ischaemic attack [TIA], cancer and diabetes). Regular active smoking was defined as current smoking on most days or more. Regular passive smoking was defined as exposure to other people’s tobacco smoke for 4 or more days per week. Regular consumption of alcohol was defined as drinking alcohol monthly or more frequently. Regular consumption of fresh fruit, vegetables and fish was defined as consumption of these foods on four or more days per week. A range of physical measurements were undertaken by trained technicians using a standard protocol and calibrated instruments, including BMI, waist and hip circumference, fat percentage and systolic and diastolic BP. Physical activity was estimated by summing the metabolic equivalent task (MET) h per day spent on work, commuting, housework and non-sedentary recreational activities [10]. Economic–geographic areas of China were categorised into four groups according to the National Bureau of Statistics of China and ranked from high to low per capita disposable income of households as Eastern, Northeastern, Central and Western, which reflected different levels of economic development in China [11]. For BMI, we used cut-off points for Chinese populations to define overweight (≥24 kg/m2 and <28 kg/m2) and obesity (≥28 kg/m2) [12].

Baseline prevalent diabetes, follow-up and ascertainment of incident diabetes

A 10 ml non-fasting blood sample was collected from study participants at baseline and fasting time was recorded. Plasma glucose levels were tested on-site using the SureStep Plus meter (LifeScan, Shanghai, China). Participants with plasma glucose levels from 7.8 mmol/l to less than 11.1 mmol/l were invited to return for a fasting plasma glucose test on the following day. Participants were asked at baseline: ‘Has a doctor ever told you that you had diabetes?’, with those reporting ‘yes’ defined as having self-reported previously diagnosed diabetes. Screen-detected diabetes was defined among participants without self-reported diabetes on the basis of any of: (1) random plasma glucose level ≥7.0 mmol/l and a fasting time ≥8 h; (2) random plasma glucose level ≥11.1 mmol/l and a fasting time <8 h; and (3) fasting plasma glucose level ≥7.0 mmol/l. Prevalent diabetes includes both self-reported previously diagnosed diabetes and screen-detected diabetes.

The vital status of participants was obtained periodically from local death registries based at China’s Disease Surveillance Points system, checked annually against local residential records and health insurance records and confirmed with street committees or village administrators. Incident diabetes was identified from linkage to disease and death registries and national health insurance databases, collecting details of diagnoses resulting in, or during, any hospital admission via individuals’ unique national ID. We defined incident diabetes using the E10-E14 codes from the Tenth Revision of the International Classification of Diseases (http://apps.who.int/classifications/icd10/browse/2016/en).

Statistical analysis

We excluded 2672 (0.5%) participants with missing, implausible or extreme values for all variables, leaving 510,219 participants for cross-sectional analyses of prevalent diabetes (electronic supplementary material [ESM] Fig. 1). We excluded 30,066 (5.9%) participants with previously diagnosed diabetes (3.1%) or screen-detected diabetes (2.8%) at baseline, leaving 480,153 participants for prospective analyses of incident diabetes. All analyses were stratified by sex. Logistic regression was used to estimate ORs and 95% CIs for the association between SES and baseline prevalent diabetes. For educational level, we adjusted for confounding variables including age at baseline (continuous), ten study regions, family history of diabetes, and household income in model 1. To further assess potential mediating variables for the association of SES with diabetes, we added BMI as a continuous variable to model 1 to create model 2. In model 3, we added other potential mediating variables, including waist circumference, fat percentage, physical activity, regular alcohol consumption, regular active smoking, regular passive smoking, consumption of fresh fruit, fresh vegetables and fish, history of CHD, stroke or TIA and cancer, systolic BP and diastolic BP. The potential mediating variables were selected based on prior knowledge of underlying mechanisms linking SES and diabetes [13, 14]. For household income, we built similar models but further adjusted for educational level and household size in all models. There was no evidence of serious multicollinearity in any model based on values of variance inflation factors.

To estimate incidence, we calculated person-years from baseline until the date of first record of incident diabetes, death, loss to follow-up or 31 December 2013, whichever came first. We used stratified Cox regression models to estimate the HRs and 95% CIs for the association between SES and incident diabetes, with various models constructed in similar ways to those used for analysing prevalent diabetes. We checked the Cox proportional hazards assumption using Schoenfeld residuals and found the assumption was not violated for any model. We performed likelihood ratio tests to investigate potential interactions between SES and variables of interest, by comparing models with and without interaction terms, and investigated potential linear trend effects of SES on incident diabetes by comparing models including and excluding a linear effect term.

We performed further analyses to investigate the impact of a composite measure of educational level and household income on incident diabetes. We categorised participants into four groups: (1) low educational level and low income; (2) low educational level and high income; (3) high educational level and low income; and (4) high educational level and high income. Low educational level was defined as having education at primary school level or below; low household income was defined as having annual household income <20,000 Chinese yuan. Additional sensitivity analyses were done after excluding individuals with baseline CHD, stroke or TIA, or cancer, to reduce potential reverse causality. R software (version 3.3.3; www.R-project.org, Vienna, Austria) was used to perform the analyses.

Results

Characteristics of participants and patterns of covariates with SES

Among 510,219 participants included for cross-sectional analyses, 59.0% were women and 44.1% were from urban areas (Table 1). The mean (SD) age was 52.3 (10.9) years for men and 50.9 (10.5) years for women. Higher proportions of men than women received college-level education or above (7.9% vs 4.5%) and were in the highest household income group (20.3% vs 16.5%), and lower proportions of men than women received no formal education (8.9% vs 25.2%) and were in the lowest household income group (26.0% vs 29.7%). Men with higher educational level and household income were more likely to be overweight or obese and had higher BMI, waist circumference and percentage fat than those in lower SES groups (ESM Tables 1 and 2). The absolute differences in these covariates by SES in women were very small, with values slightly higher in both the middle educational level and the middle household income groups compared with those at either extreme (ESM Tables 1 and 2).

Table 1 Sex-specific characteristics of CKB participants included in cross-sectional analyses

Associations between SES and prevalent diabetes

Age-standardised diabetes prevalence increased with higher educational level and household income in men, but the patterns were less clear in women (Table 2). After adjustment for age, study region, family history of diabetes and household income, there was a positive association between educational level and prevalent diabetes in men and an inverse association in women (pinteraction < 0.0001 between sex) (Fig. 1). The ORs (95% CIs) derived from model 1, comparing highest vs lowest educational level, were 1.21 (1.09, 1.35) in men and 0.69 (0.63, 0.76) in women. There was a positive association between household income and prevalent diabetes in both men and women, with the association more pronounced in men than in women (pinteraction < 0.0001 between sex) (Fig. 1). The ORs (95% CIs), comparing highest vs lowest household income category from model 1, were 1.45 (1.34, 1.56) in men and 1.26 (1.19, 1.34) in women. These associations were attenuated after additional adjustment for BMI but remained statistically significant, changing slightly after further adjustment for other potential mediating variables (Fig. 1, ESM Tables 3 and 4).

Table 2 Sex-specific diabetes prevalence among CKB participants included in cross-sectional analyses according to educational level and household income
Fig. 1
figure 1

Adjusted ORs and 95% CIs for prevalent diabetes associated with educational level (a, b) and household income (c, d) in men (a, c) and women (b, d), plotted on a logarithmic scale. For educational level, model 1 was adjusted for age at baseline (continuous), study region, family history of diabetes and household income. Model 2 was further adjusted for BMI based on model 1. For household income, model 1 was adjusted for age at baseline (continuous), study region, family history of diabetes, educational level and household size. Model 2 was further adjusted for BMI based on model 1. See ESM Tables 3 and 4 for data

Associations between SES and incident diabetes

The characteristics of participants without diabetes at baseline (n = 480,153) by SES were similar to those of people included for cross-sectional analyses (ESM Tables 57). During a median of 7 years (3.4 million person-years total) of follow-up, 2032 (0.4%) participants were lost to follow-up and 22,103 (4.6%) participants died. In total, 9544 new cases of diabetes were identified, 5937 among women (Table 3). Age-standardised diabetes incidence increased with higher household income in both men and women. A U-shaped association was observed between educational level and diabetes incidence because the lowest incidence of diabetes occurred in both men and women who had received a maximum of middle or high school education (Table 3). The associations of SES with incident diabetes were similar to those with prevalent diabetes. In model 1, educational level was positively associated with incident diabetes in men, but an inverse association was found in women (pinteraction < 0.0001 between sex) (Fig. 2). The adjusted HRs (95% CIs), comparing highest vs lowest educational level, were 1.27 (1.07, 1.51) in men and 0.80 (0.67, 0.95) in women. Household income was also positively associated with incident diabetes in men, but there was a non-significant positive association between household income and incident diabetes in women (pinteraction < 0.0001 between sexes) (Fig. 2). The adjusted HRs (95% CIs), comparing highest vs lowest household income category, were 1.36 (1.19, 1.55) in men and 1.06 (0.95, 1.17) in women. There was no evidence of a departure from linear trend for any analysis (p ≥ 0.05), except for educational level and incident diabetes in women (p = 0.047). After additional adjustment for BMI, the associations between SES and incident diabetes were markedly attenuated and no longer statistically significant at the 5% level for any comparison between the highest and lowest SES groups (Fig. 2, ESM Tables 8 and 9). Further adjustment for other potential mediating factors had little effect on the results (ESM Tables 8 and 9).

Table 3 Sex-specific diabetes incidence among CKB participants included in prospective analyses according to educational level and household income
Fig. 2
figure 2

Adjusted HRs and 95% CIs for incident diabetes associated with educational level (a, b) and household income (c, d) in men (a, c) and women (b, d), plotted on a logarithmic scale. For educational level, model 1 was stratified by age at baseline (5 year age group) and study region, and adjusted for age at baseline (continuous), family history of diabetes and household income. Model 2 was further adjusted for BMI based on model 1. For household income, model 1 was stratified by age at baseline (5 year age group) and study region, and adjusted for age at baseline (continuous), family history of diabetes, educational level and household size. Model 2 was further adjusted for BMI based on model 1. See ESM Tables 8 and 9 for data

The association between educational level and incident diabetes did not appear to be modified by age, urban or rural residence, economic–geographic areas or household income in men or women (ESM Fig. 2). However, the positive association of household income with incident diabetes was more pronounced in urban areas than in rural areas in men (pinteraction = 0.035) but not in women (pinteraction = 0.99) and was more pronounced in less economically developed geographic areas (Northeastern, Central and Western) than in more developed areas (Eastern) in both sexes (pinteraction = 0.017 in men and pinteraction = 0.010 in women) (ESM Fig. 3). Absolute risk of incident diabetes was lowest in men with low household income regardless of educational level and highest in women with low educational level and high household income (ESM Table 10). The relative risk of developing diabetes compared with the groups with low educational level and low household income was highest in men who had both high educational level and high household income and in women who had low educational level but high household income (ESM Table 10). Excluding people with a history of any of CHD, stroke, TIA or cancer at baseline from the study population had no effect on the associations of SES with prevalent or incident diabetes (ESM Tables 11 and 12).

Discussion

This large nationwide prospective study in about 0.5 million Chinese adults showed that the associations between SES and diabetes prevalence and incidence differ between men and women. Among men, both educational level and household income were positively associated with risk of diabetes. Among women, there was an inverse association between educational level and risk of diabetes, whereas for household income the positive associations with diabetes prevalence and incidence were weaker than in men. The associations of educational level and household income with diabetes appeared to be partly mediated by BMI.

A previous systematic review reported inconsistent associations of educational level and income with prevalent type 2 diabetes in China [7]. Most of the studies included in the systematic review had small sample sizes, were restricted to a particular geographic area and the association between SES and diabetes prevalence was rarely the primary research question. Few previous studies explored the association separately in men and women.

To our knowledge, our study is the first nationwide prospective study to describe the association between SES and diabetes incidence in China. The positive association between SES and incident diabetes in men in our study contrasts with findings from high-income countries [15] and Taiwan [16], but they are consistent with those from Thailand [17]. Evidence for an association between SES and incident diabetes from other LMIC is sparse [15]. A study of 10,704 Chinese adults living in Qingdao, a coastal city in China, reported an inverse association between educational level and incident diabetes in both sexes during 2001–11 [18]. In that study, a high proportion of both men and women had college education or higher (16.8% and 24.0%, respectively) as opposed to 7.9% and 4.5% in the present study and 8.7% of men and 6.1% of women in similar age groups in the 2010 Chinese census [19]. It is possible that the Qingdao findings represent the patterns of association in Chinese populations at a more advanced stage of epidemiological transition than other parts of China. The proportions of people in the highest education category in our study were similar to those of participants in the 2010 Chinese census. However, it is important to recognise that the patterns we observed may not reflect those in the whole population of China; the CKB sample was never intended to be nationally representative and the response rate was approximately 30% (comparing favourably with other large nationwide biobank studies, such as the UK Biobank).

SES has profound effects on health through complex processes, such as access to healthcare, health behaviours and environmental exposures [20]. The mechanisms through which SES influences the development of diabetes are not fully understood. The association between SES and diabetes may be partially explained by the distribution of conventional risk factors for diabetes such as being overweight or obese, which are strongly patterned by SES in many populations [13, 14]. In this study, the associations between SES and both diabetes prevalence and incidence were attenuated after adjustment for BMI, suggesting that BMI was likely to be a key mediator in the pathway linking SES and diabetes prevalence and incidence in the Chinese population. This finding was consistent with that from western countries [21].

We observed sex differences in the associations between educational level and diabetes prevalence and incidence. These sex differences can be at least partly attributed to differences in the relationship between education and BMI in men and women. Previous cross-sectional studies in China have reported that men with higher educational level were more likely to be overweight or obese than those of lower educational attainment, while the converse was true in Chinese women [22,23,24]. A review of studies from LMIC reported that the burden of obesity tends to shift from groups with high to low educational level during economic development, and this transition occurs earlier for women than men [25]. This evidence may explain the inconsistent findings between men and women in our study. Further exploration of the association between SES and BMI in China would be interesting but was beyond the scope of this study.

Interestingly, in women, we observed an inverse association between educational level and incident diabetes, while the association between household income and incident diabetes was positive (although not statistically significant). Compared with income, educational level is probably more strongly linked to an individual’s cognitive functioning and health-promoting behaviours, and plays a greater role in the onset of diseases, while income is more strongly related to the progression of diseases [26]. This suggests that educational level may be a more sensitive predictor of disease development than income during the epidemiological transition, at least in women. We found some evidence that the positive association between income and incident diabetes was weaker in both sexes in more economically developed geographic areas than in those at earlier stages of development. This may indicate that the transition in the association between SES and risk of diabetes from being positive to inverse is already occurring in more economically developed areas in China. However, more evidence is required to support this hypothesis.

Strengths and limitations

A key strength of this study is that it is the first nationwide study to describe contemporary associations between SES and diabetes incidence in China. The large and diverse population sample permits the investigation of potential differences in the association between SES and diabetes in settings where economic development and epidemiological transition may be at different stages. Reviews of medical records in about 1000 incident cases identified a positive predictive value of 97% based on ADA diagnostic criteria [27] and medication use. In addition, extremely low loss to follow-up limits the potential for biased risk estimates.

One of the key study limitations is potential under-ascertainment of both prevalent and incident diabetes. The diabetes prevalence in the CKB was slightly lower than in contemporaneous nationally representative surveys in which diabetes diagnosis was based on a combination of self-report, fasting blood glucose, glycated haemoglobin measurement and oral glucose tolerance tests [2, 28]. It is not clear whether under-ascertainment of prevalent diabetes differed across SES groups. For incident diabetes, only hospitalised diabetes events were identified in the CKB. Hospital admission rates may be greatly affected by people’s ability to pay and vary by SES in China. If Chinese people of higher SES are more likely to be admitted to hospital, as reported by a previous study [29], then under-ascertainment of diabetes might be higher in lower SES groups; this would bias the association between SES and incident diabetes. We established that the proportion of prevalent diabetes that was self-reported increased with higher SES in both men and women in the CKB population. Our findings would therefore potentially have exaggerated the positive association between SES and incident diabetes in men, and the inverse association with educational level in women is likely to have been underestimated. Future research is required with more complete diabetes case ascertainment, such as from repeated cross-sectional measures of diabetes status using reliable laboratory tests. Furthermore, the statistical power of detecting a 6% relative difference in hazard of incident diabetes between the highest and lowest household income in women is low (power = 0.31), which may contribute to the non-significant positive association between household income and incident diabetes. We were not able to identify type of diabetes, but most diabetes cases were likely to be type 2 diabetes given the age of the study population.

In summary, we found that among Chinese adults in mainland China, educational level was positively associated with prevalent and incident diabetes in men, but inversely associated with diabetes in women. Household income was positively associated with prevalent and incident diabetes in both sexes. The findings of this study provide useful information for identifying priority groups for prevention of diabetes and to allow evaluation of the effect of policies that influence health inequalities in China. Health and social policies that aim to reduce socioeconomic and geographic inequalities in diabetes burden should give priority to primary and secondary prevention of overweight and obesity in Chinese adults and take account of potentially different associations with SES in men and women. Future research is needed to identify effective approaches to reducing inequalities by SES in diabetes incidence and to investigate whether SES also affects the risk of developing diabetic complications in China.