Introduction

figure a

Worldwide, cerebrovascular disease (CBD) and type 2 diabetes mellitus are common disorders that are among the top ten leading causes of death, killing approximately eight million people in 2016 [1]. CBD includes a variety of medical conditions that affect the blood vessels of the brain and cerebral circulation. There are two main classifications of CBD: ischaemic and haemorrhagic. Thus far, accumulating evidence suggests that type 2 diabetes is independently associated with the risk of CBD, especially ischaemic CBD [2,3,4,5,6,7,8]. In a collaborative meta-analysis of 102 prospective studies including 698,782 participants, the adjusted HR was 2.27 (CI 1.95, 2.65) for ischaemic stroke in people with vs those without type 2 diabetes [9]. Type 2 diabetes may cause atherosclerotic changes in the cerebral arteries, and lead to ischaemic stroke [10]. Several studies have examined the association between type 2 diabetes and haemorrhagic CBD, but with inconsistent results [3, 5, 8, 9, 11,12,13,14,15,16,17,18,19]. Furthermore, only a few cohort studies have examined the relationship between type 2 diabetes and intracerebral haemorrhage and subarachnoid haemorrhage individually [12, 20]. To our knowledge, no studies have focused on the association between midlife type 2 diabetes and CBD in late life.

Both type 2 diabetes and CBD are complex genetic and lifestyle-related disorders. Genetic and familial environmental factors (such as fetal environment, maternal smoking or childhood socioeconomic status) are involved in the development of type 2 diabetes [21] and CBD [22, 23], but their contribution to the association between type 2 diabetes and CBD is unknown. In population-based studies that address the association between type 2 diabetes and CBD, some major confounders can be controlled for in data analysis, but not familial factors (such as genetic background and early-life environmental factors shared by family members). Twin studies provide an opportunity to minimise potential confounding due to unmeasured genetic predisposition and shared familial environment in childhood or adolescence. Twins are generally raised together and share the same genetic background as well as intrauterine, childhood and adolescent environments. As naturally matched pairs, the confounding effects of genetics and familial environments are removed when comparisons are made within the pair. Thus, co-twin case−control analyses can be more informative than unrelated case−control analyses [24, 25].

In the current study we aimed to: (1) examine the associations between midlife type 2 diabetes (i.e. type 2 diabetes with an onset between 40 and 59 years of age) and risk for CBD in late life; and (2) explore whether genetic and familial environmental factors could explain the type 2 diabetes–CBD association using data from a nationwide cohort of Swedish twins.

Methods

Study population

This prospective, nested case−control study included twins from the nationwide Swedish Twin Registry (STR), which was started in the 1960s [26]. In 1998–2002, all living twins (≥40 years of age, i.e. born in 1958 or earlier) in the registry were invited to participate in the Screening Across the Lifespan Twin study (SALT), a full-scale screening that gathered data on an extended set of variables via computer-assisted telephone interview. Out of 44,919 twin individuals eligible for the interview, 5009 who had not reached the age of 60 by the end of available follow-up on 31 December 2014 (born after 1954), 417 who died by 31 December 2014, 773 who had CBD before the age of 60 years, 4210 who had type 2 diabetes with onset age <40 years or ≥60 years, 310 who had type 1 diabetes, and 1114 who had transient ischaemic attack were excluded. Finally, 33,086 individuals remained for the current analyses (Fig. 1).

Fig. 1
figure 1

Flow chart of the study population. T1DM, type 1 diabetes mellitus; T2DM, type 2 diabetes mellitus

Data collection

Information on demographics (age, sex and educational attainment), lifestyle (smoking, alcohol consumption), anthropometric measures (weight and height), zygosity and medication use (including treatment for type 2 diabetes) was obtained from the SALT survey [26]. Education was defined by the maximum years of formal schooling attained, and dichotomised into <8 vs ≥8 years. BMI was calculated as weight (kg) divided by height squared (m2), and categorised as <20 kg/m2 (underweight), 20–24.9 kg/m2 (normal weight), 25–29.9 kg/m2 (overweight), and ≥30 kg/m2 (obese). Smoking status was categorised as never smoked, former smoker and current smoker. Alcohol consumption was dichotomised as never consumed alcohol vs past/current consumer of alcohol. Marital status was defined as married/cohabitating vs single (including divorced).

Information on history of type 2 diabetes, CBD, hypertension and heart disease (including coronary heart disease, cardiac arrhythmias and heart failure) was derived from the National Patient Registry (NPR), which covers all inpatient care in Sweden since the 1960s up to the end of 2014. Each medical record in the NPR included up to eight discharge diagnoses recorded as International Classification of Disease (ICD) codes. The seventh revision (ICD-7) was used through to 1968, the eighth revision (ICD-8) from 1969 through to 1986, the ninth revision (ICD-9 [www.icd9data.com/2007/Volume1]) from 1987 through to 1996 and the tenth revision (ICD-10 [http://apps.who.int/classifications/icd10/browse/2016/en]) since 1997 through to the end of available follow-up in 2014.

Informed consent was required from all participants. The data collection procedures were approved by the Regional Ethics Committee at Karolinska Institutet, Stockholm, Sweden, and the Institutional Review Board of the University of Southern California, USA.

Ascertainment of diabetes

Both type 1 and type 2 diabetes were ascertained based on self- and informant-reported history of type 2 diabetes, glucose-lowering medication use, or NPR (ICD-7 code 260, ICD-8 and -9 code 250, and ICD-10 codes E10–E14). The age at type 2 diabetes onset was estimated according to the earliest recorded date of type 2 diabetes in the NPR or the self-reported date of type 2 diabetes onset available in SALT. Midlife type 2 diabetes was defined as type 2 diabetes onset at age 40–59 years. Information on treatment of type 2 diabetes was collected from the SALT survey and the Swedish Drug Registry (The Anatomical Therapeutic Chemical code A10) since 2006. Treatment of type 2 diabetes was categorised as diet, oral glucose-lowering drugs, insulin and combined oral glucose-lowering drugs and insulin.

Assessment of CBD

Information on CBD diagnosis (ICD-7 codes 330–334, ICD-8 codes 430–438, ICD-9 codes 430–437, ICD-10 codes I60–I68) was derived from NPR records between 1960 and 2014. The CBD subtypes in the current study included: (1) cerebral infarction; (2) occlusion and stenosis of precerebral or cerebral arteries not resulting in cerebral infarction (termed as occlusion of cerebral arteries below); (3) subarachnoid haemorrhage; (4) intracerebral haemorrhage; and (5) unspecified CBD [27]. The age of CBD onset was estimated according to the earliest recorded date of the CBD diagnosis in NPR.

Statistical analyses

The characteristics of the study population by midlife type 2 diabetes or CBD were compared using χ2 tests for categorical variables, unpaired Student’s t test for continuous variables with normal distribution, and Mann–Whitney U test for continuous variables with non-normal distribution. Generalised estimating equation (GEE) models were used in unmatched case−control analyses, controlling for the clustering of twins within a pair. In unmatched case−control analysis, the basic-adjusted models were adjusted for age, sex and education. The multi-adjusted models were further adjusted for BMI, smoking, alcohol consumption, marital status, hypertension and heart disease. Data for the co-twin matched case−control study were analysed using conditional logistic regression, in which twin pairs who were discordant for outcome. In the co-twin control design, the disease-free co-twin (in both monozygotic and dizygotic twin pairs) is used as a control for the diseased twin. Using discordant twin pairs is more informative than using unrelated case−control samples, as cases and controls matched for genetic background and familial environmental factors such as fetal environment, maternal smoking or childhood socioeconomic status [28, 29]. In the co-twin control analysis, the basic-adjusted models were adjusted for sex and education. The multi-adjusted models were further adjusted for BMI, smoking, alcohol consumption, marital status, hypertension and heart disease.

Logistic regression was used to test the difference in ORs from the GEE model and conditional logistic regression by examining the difference in midlife type 2 diabetes between unmatched and co-twin controls [30]. If an OR for the observed association in the unmatched case−control analysis becomes strengthened or attenuated (or even disappears) in co-twin control analyses, and the difference in ORs from the GEE and conditional logistic regression was significant, genetic and/or shared familial environmental factors are likely to play a role in the association [29, 31]. In contrast, if the OR in conditional logistic regression remains similar to that from the GEE, and the difference in ORs was not significant, then the confounding by genetic or shared familial environmental factors in the observed association is small or null [24, 30, 32].

Age, sex, education, BMI, smoking, alcohol consumption and marital status were considered as potential confounders. Missing values on education (n = 1222), smoking (n = 1182), alcohol consumption (n = 1253), BMI (n = 1801) and marital status (n = 757) were imputed using Rubin’s rule for pooling estimates to obtain valid statistical inferences [30]. All statistical analyses were performed using SAS statistical software version 9.4 (SAS Institute, Cary, NC, USA) and IBM SPSS Statistics 20.0 (IBM, New York, NY, USA).

Results

Characteristics of the study population

The 33,086 twin individuals included 14,969 (45.2%) men and 18,117 (54.8%) women (χ2 = 86.45, p < 0.001). Among all participants, 1248 (3.8%) had midlife type 2 diabetes and 3121 (9.4%) had CBD in late life. Compared with type 2 diabetes-free individuals, those with midlife type 2 diabetes were more likely to be younger, male, smokers and alcohol drinkers, and have heart disease, hypertension, and higher BMI (Table 1). Among individuals with midlife type 2 diabetes, those with CBD were older than those without CBD, and there were no significant differences between the two groups in terms of type 2 diabetes treatment (Table 2).

Table 1 Characteristics of the study participants by midlife type 2 diabetes mellitus (onset age 40–59 years) (n = 33,086)
Table 2 Clinical features of participants with midlife type 2 diabetes mellitus by CBD (n = 1248)

Association between midlife type 2 diabetes and late-life CBD in unmatched case−control analysis

In multi-adjusted GEE models, those with midlife type 2 diabetes had significantly higher ORs of cerebral infarction (OR 1.29 [95% CI 1.03, 1.61]) and occlusion of cerebral arteries (OR 2.03 [95% CI 1.20, 3.44]) in late life than type 2 diabetes-free participants. The associations of midlife type 2 diabetes with subarachnoid haemorrhage (OR 0.52 [95% CI 0.12, 2.21]) and intracerebral haemorrhage (OR 0.78 [95% CI 0.45, 1.36]) were not significant (Table 3).

Table 3 ORs and 95% CIs of midlife T2DM in relation to different types of cerebrovascular disease in late life from GEE (T2DM-free as the reference)

Association between midlife type 2 diabetes and late-life cerebral infarction in co-twin matched case−control analysis

In the co-twin matched case−control analysis, the basic- and multi-adjusted ORs for the association between midlife type 2 diabetes and cerebral infarction in late life were 1.60 (95% CI 0.90, 2.84) and 0.96 (95% CI 0.51, 1.80), respectively (Table 4).

Table 4 ORs and 95% CIs for the association between midlife T2DM and late-life cerebral infarction in co-twin control analyses using cerebral infarction discordant twin pairs from conditional logistic regression

The difference in ORs from the GEE model based on unmatched case−control analysis vs co-twin control analysis was not statistically significant (basic-adjusted: OR 1.03 [95% CI 0.67, 1.50; p = 0.897]; multi-adjusted: OR 0.94 [95% CI 0.62, 1.39; p = 0.780]). Because the estimates from the GEE vs matched co-twin control analysis did not differ significantly, genetic and early-life familial environmental factors appeared not to play a role as confounders in the association between midlife type 2 diabetes and cerebral infarction in late life.

Supplementary analysis

Considering possible sex differences in CBD development [2, 33], we performed stratified analyses. The associations between midlife type 2 diabetes and CBD risk and its subtypes did not vary by sex (electronic supplementary material [ESM] Table 1). As midlife type 2 diabetes was associated with mortality (OR 2.70 [95% CI 2.34, 3.11]), we repeated the analysis with an additional adjustment for survival status. The results of this analysis were similar to those of the initial GEE models. Finally, we repeated the analyses excluding data with missing values for covariates, and the results were not significantly altered (ESM Table 2).

Discussion

In this large-scale, nationwide, population-based study of Swedish twins, we found that midlife type 2 diabetes was significantly associated with increased risk of cerebral infarction and occlusion of cerebral arteries, but not intracerebral haemorrhage or subarachnoid haemorrhage in late life. Genetic and early-life familial environmental factors appeared not to account for the association between midlife type 2 diabetes and late-life cerebral infarction.

Type 2 diabetes may lead to various micro- and macro-vascular changes often culminating in major clinical complications, such as CBD. Epidemiologic studies have shown that type 2 diabetes is associated with a two- to fivefold increased risk of ischaemic CBD, especially cerebral infarction [3, 5, 8, 9, 11,12,13,14,15,16,17,18,19, 34, 35]. So far, few studies have particularly focused on the association between midlife type 2 diabetes and CBD in late life. In our study, we found that midlife type 2 diabetes conferred a moderate risk for cerebral infarction and occlusion of cerebral arteries in late life.

Several cohort studies have investigated the association of type 2 diabetes with risk of haemorrhagic CBD, and some of them found no such association [3, 5, 7, 14], but others indicated an increased [9] or a decreased [12, 36] risk of haemorrhagic CBD in diabetics. In a longitudinal study examining the relation between type 2 diabetes and haemorrhagic CBD, type 2 diabetes was associated with a decreased subarachnoid haemorrhage risk and an increased intracerebral haemorrhage risk [12]. However, in another two population-based cohort studies, type 2 diabetes was not associated with intracerebral haemorrhage or subarachnoid haemorrhage [3, 18]. The only study addressing the association of midlife type 2 diabetes and haemorrhagic CBD (including intracerebral and subarachnoid haemorrhage) also showed negative results in women [5]. Consistent with these studies, we found that midlife type 2 diabetes was not associated with intracerebral and subarachnoid haemorrhage in late life.

The mechanisms underlying the association of type 2 diabetes with cerebral infarction and occlusion of cerebral arteries are complex and not completely understood [37]. Individuals with diabetes develop dyslipidaemia characterised by small dense low-density lipoproteins, reduced high-density lipoproteins and increased triacylglycerol levels, as well as accelerated atherogenesis [38, 39]. Metabolic disturbances, such as insulin resistance, compensatory hyperinsulinaemia, inflammation, hyperlipidaemia and hyperglycaemia may also contribute to cerebrovascular events in people with type 2 diabetes [37]. The negative association between type 2 diabetes and haemorrhagic CBD may reflect endothelial proliferation and thickening of the basement membrane in cerebral vessels induced by type 2 diabetes that lead to increased risk of infarction but not rupture [40].

Although evidence has shown that adverse socioeconomic circumstances in early life are associated with type 2 diabetes as well as CBD [41,42,43], the potential contribution of genetic susceptibility to the type 2 diabetes–CBD association is still unknown. In the present study, the lack of significant difference in ORs from the GEE and co-twin control analysis indicates that genetic and unmeasured familial environmental factors (such as maternal nutrition status and childhood socioeconomic situation) appeared not to account for the association between type 2 diabetes and cerebral infarction. However, because of the limited number of cerebral infarction discordant twin pairs in the co-twin analysis, the role of genetic and early-life familial environmental factors in the type 2 diabetes–CBD association needs further investigation.

There are several strengths and limitations in the current study. First, the large nationwide population-based twin cohort provided us with a unique opportunity to examine further the effect of midlife type 2 diabetes on late-life CBD risk, controlling for some unmeasured confounders such as genetic and early-life familial factors. Second, by including only midlife type 2 diabetes and late-life CBD, we could make the temporality clear and thus minimise the possibility of reverse causation. Nonetheless, the limitations in our study need to be pointed out. First, despite the large cohort, the sample size in co-twin matched case−control analysis was limited. Although the association between type 2 diabetes and cerebral infarction in unmatched case−control analysis was attenuated in co-twin matched analysis, the difference was not statistically significant. Thus, caution is needed with regard to the role of genetic and early-life familial environmental factors in the type 2 diabetes–cerebral infarction association. Second, blood glucose levels were not available in the STR or SALT. Consequently, given the higher prevalence of undiagnosed type 2 diabetes in elderly people [44], individuals with undiagnosed type 2 diabetes might have been misclassified as type 2 diabetes-free, which could have led to an underestimation of the association. In addition, participants with midlife type 2 diabetes were about 2 years younger than type 2 diabetes-free individuals. The possible explanations could be that participants with midlife type 2 diabetes could develop CBD before age 60 years and were excluded from the study population, and that patients with midlife diabetes might die before 60. Thus, this might have also led to underestimation for the given association. Third, as both monozygotic twins (who share 100% of their genetic background) and dizygotic twins (who share only 50% of their genetic background) were included in co-twin matched case−control analyses, we could not completely control for the genetic background. Finally, although some lifestyle-related factors (such as smoking and alcohol consumption) and diseases (such as hypertension and heart disease) in midlife were taken into account in the analyses, dietary intake, physical activity and hyperlipidaemia could not be controlled for due to the unavailability of the data among all participants.

In conclusion, our study provides evidence that midlife type 2 diabetes is associated with the risk of cerebral infarction and occlusion of cerebral arteries, but not intracerebral haemorrhage or subarachnoid haemorrhage in late life. Our findings highlight the need to control midlife type 2 diabetes to help prevent cerebral infarction and occlusion of cerebral arteries in late life. Genetic and early-life familial environmental factors appear not to contribute to midlife type 2 diabetes–cerebral infarction association. However, due to the limited sample size in co-twin control analysis, large longitudinal twin studies are warranted for further clarification.