Background

Stroke is not only the second leading cause of death worldwide but also one of the main causes of adult-acquired disabilities [1, 2]. In certain countries, the incidence of stroke has increased over time. For example, the incidence of stroke in the city of Frederiksberg in Denmark increased from 618/100,000 during the period from 1972–1974 to 1,190 per 100,000 during the period from 1989–1990 [3], and the incidence of stroke among males in Gothenburg, Sweden increased by 35% per year from 1985 to 1990 [4]. In recent years, the incidence of stroke has gradually increased among younger populations. In a region of Western Norway, the incidence of ischemic stroke among young adults was 11.4/100,000 between 1988 and 1997, with women accounting for the majority of patients who experienced a stroke before reaching 30 years of age and males accounting for the majority of patients who experienced a stroke after 30 years of age [5]. In India, the prevalence of stroke is 1.54, with a death rate of 0.6 per 1,000 population; the age-adjusted stroke mortality is believed to be 60% higher in South-East Asia than in European populations [6]. The same situation exists in Iran; a systematic review reported that the stroke prevalence for various ages in Iran ranges from 23 to 103 per 100,000 populations [7].

There have been numerous recent research reports regarding the risk factors for stroke, which include not only unhealthy lifestyle habits such as smoking and drinking but also hypertension, diabetes, and a family history of diseases, among other traits [818]. However, different sets of risk factors and risk intensities have been identified by various studies, and no comprehensive systematic review has compiled and summarized the findings of these studies. In this investigation, based on a large number of prospective cohort studies from around the world, a systematic review and a meta-analysis were conducted to summarize the risk factors for stroke in Western and Asian populations.

Methods

Our investigation was conducted as suggested by the Meta-analysis of Observational Studies in Epidemiology (MOOSE) group [19]. A detailed query strategy and inclusion and exclusion criteria were developed, and the relevant literature was screened for inclusion using these standards. Two research groups were established. The first research group was responsible for searching for and reviewing relevant studies and extracting the original data from these investigations. The second group evaluated the quality of the included studies. Finally, pooled statistics describing various effects were calculated.

Systematic literature search

The following electronic literature databases were searched: PubMed, EMBASE (the Excerpta Medica Database), Web of Science, Google Scholar, the Chinese Biomedical Literature Database (CBM), the integrated Chinese Medical Citation Index/Chinese Medical Current Contents (CMCI/CMCC) database, and the Cochrane Central Register of Controlled Trials (CENTRAL) database of the Cochrane Library. The following search terms were utilized: ((((((((stroke) OR ischemic) OR hemorrhagic) OR observational study) OR cohort study) AND risk factors) OR incidence) OR mortality) AND prospective study. In addition, the library of the Third Military Medical University was manually searched for relevant information. The period specified in these searches ranged from the creation date of each database to May 2013.

Inclusion and exclusion criteria

Studies were included if they met the following criteria: ① The included studies were prospective, population-based cohort studies; ② The included studies involved research participants who were volunteers from a community or hospital, with no limitations placed on sex, age, or type of stroke, although the participants were required to be free from any complicating infectious or traumatic conditions; ③ The included studies were required to provide HRs for the risk factors; ④ The observation endpoint of the included studies was the onset of stroke; ⑤ The included studies were required to base the determination of disease on internationally adopted professional diagnostic criteria[20, 21]; ⑥ Comparability of the cohorts was required on the basis of the design or analysis; ⑦ The included studies were required to provide a full contingency table and/or adjusted HR/RR (relative risk) values.

The exclusion criteria included the following: ① Studies examining populations that might be affected by interactions between genes and the environment, such as the Asian-American population, were excluded; ② Case control studies and other retrospective studies were excluded from consideration; ③ Duplicate publications, animal experiments, reviews, and systematic reviews were excluded from consideration.

Selection and data extraction

The retrieved literature was carefully reviewed, and various information was extracted for each examined study, including the study’s first author, publication year, sample size, study type, population characteristics, study endpoint, and HR/RR values and 95% confidence intervals (CIs) (or a contingency table). The data were independently extracted in a blinded manner by two researchers (LZ and XC) from the first research group. A third researcher (LL) from this research group was consulted to resolve discrepancies in the extracted data, and a consensus was reached through discussion.

Quality assessment and statistical analyses

The included observational studies were subjected to a comprehensive quality assessment using the Newcastle-Ottawa Scale as a guide [22]. This quality evaluation was performed in a blinded manner by two researchers (YW and YZ) from the second research team, who assigned quality scores to the included studies. The studies that were given different quality scores by the two researchers were referred to a third researcher (DY) from this research team for evaluation, and a final quality score was obtained.

Q tests were conducted to determine the heterogeneity of the included studies. If I2 ≥ 50% and P < 0.1, a random-effects model was used to combine the studies; by contrast, if I2 ≤ 50% and P > 0.1, a fixed-effects model was used to combine the studies. The adjusted HR was used to measure the effect sizes for the pooled prospective cohort studies, and 95% CIs were determined for the pooled studies. Egger’s regression test was used to verify the absence of publication bias, and by model transformation between the fixed effects model and random effects model for the sensitivity analysis. The fundamental approach utilized in this non-parametric method involves first trimming (removing) small sample studies that cause funnel plot asymmetries. The trimmed symmetric remainder is then used to estimate the center value of the funnel plot, and the removed studies and their corresponding estimated missing values are then entered into the plot on both sides of this center value [23], enabling an assessment of the stability of the entire systematic review. All of the statistical analyses were conducted using the Stata 11.0 software package.

Results

Included studies

The database search resulted in the identification of 1,288 articles; in addition, 2 conference-related publications were identified from other sources. In total, 224 of these articles were duplicates that had been identified in multiple electronic databases, and 941 articles did not address the desired research topic; thus, 125 studies were included in the initial screening (including 123 articles and two conference-related publications). The initial screening of the abstracts of these studies resulted in the exclusion of 51 non-prospective cohort studies, 15 reviews or systematic reviews, and 2 animal studies. The full-text versions of the remaining 56 articles were then assessed, and 34 articles that did not meet the inclusion criteria were excluded from consideration. Ultimately, 22 prospective cohort studies were included in this systematic review and meta-analysis (Figure 1 and Table 1).

Figure 1
figure 1

Flow diagram.

Table 1 Characteristics of the prospective studies on stroke

Study characteristics

This study examined data reported by researchers from Western (North America, Europe, and Oceania) and Asian (East Asia, South Asia, and Southeast Asia) countries. In the 22 prospective cohort studies that were included in this investigation, the sample sizes ranged from a minimum of 39 cases to a maximum of 7,444 cases; 30,406 cases were examined in these studies. Eleven of the included studies involved Western populations, and the remaining 11 studies involved Asian populations. In 20 of the included studies, the outcomes included both ischemic stroke and hemorrhagic stroke, whereas the remaining 2 studies only examined ischemic stroke (Table 1).

Synthesis of results

This study examined 7 risk factors for stroke in human populations, including biochemical indices, lifestyle choices, and medical history. A meta-analysis produced pooled HRs with 95% CI values for each of these factors.

The combined results indicated that 6 factors affected the incidence of stroke in Western populations. These factors included BMI, SBP, hypertension, diabetes, cardiac causes, and smoking. The following pooled HR values with 95% CIs were obtained for these factors: 0.96 (0.93 ~ 0.99) for a BMI of 18.5-21.9 kg/m2; 2.39 (1.44 ~ 3.97) for an SBP ≥140 mm Hg; 1.79 (1.39-2.30) for hypertension; 1.85 (1.43 ~ 2.38) for diabetes; 1.74 (1.47 ~ 2.04) for cardiac causes; and 2.05 (1.68 ~ 2.49) for smoking; However, 0.89 (0.76 ~ 1.04) for alcohol consumption (Table 2 and Additional files 1, 2, 3, 4, 5, 6, 7, 8).

Table 2 Meta-analysis outcomes

In addition, in a subgroup analysis of smoking and cardiac causes, there was statistical significance in the 4 subgroups, including atrial fibrillation, current smoking, smoking ≤20 cigarettes/d and smoking >20 cigarettes/d. The pooled HRs with 95% CI values were as follows: 1.59 (1.33 ~ 1.90), 2.27 (1.76 ~ 2.93), 2.31 (1.80 ~ 2.96) and 2.99 (2.31 ~ 3.86) (Table 2 and Additional files 5, 7).

The included studies of Asian populations indicated that 7 factors affected the incidence of stroke. These factors included BMI, SBP, hypertension, diabetes, cardiac causes, smoking and alcohol. The following pooled HR values with 95% CIs were obtained for these factors: 1.28 (1.15 ~ 1.46) for a BMI of ≥25.0 kg/m2; 2.29 (1.04 ~ 5.09) for an SBP of 120–139 mm Hg; 6.22 (2.27 ~ 17.08) for an SBP ≥140 mm Hg; 2.84 (2.10-3.82) for hypertension; 1.95 (1.42 ~ 2.67) for diabetes; 1.87 (1.45 ~ 2.42) for cardiac causes; 1.27 (1.04 ~ 1.55) for smoking; and 1.28 (1.07 ~ 1.53) for alcohol consumption (Table 2 and Additional files 9, 10, 11, 12, 13, 14, 15, 16).

In the subgroup analysis of cardiac causes, there was statistical significance for atrial fibrillation. The pooled HR with the 95% CI was 2.30 (1.50 ~ 3.53) (Table 2 and Additional file 12).

Quality of the included studies

The Newcastle-Ottawa scale results revealed that 95.45% of the included studies earned above 2 stars for the NOS selection item, 77.27% of the included studies earned above 1 star for the NOS comparability item, and 36.36% of the included studies earned above 2 stars for the NOS exposure item (Table 3).

Table 3 The results of the Newcastle-Ottawa scale (NOS)

Egger's test was performed to assess the influential factors for each of the two populations that were examined (Table 4). Publication bias affected 2 factors in studies of Western populations: cardiac causes (t = 6.32, P = 0.000) and smoking (t = 6.21, P = 0.000). This analysis revealed that publication bias affected 4 factors in the studies of Asian populations: BMI = 22.0-24.9 kg/m2 (t = -9.52, P = 0.000), BMI ≥25.0 kg/m2 (t = 3.36, P = 0.012), diabetes (t = 4.50, P = 0.003), smoking (t = 11.39, P = 0.000) and alcohol (t = 1.06, P = 0.008).

Table 4 Egger's test & Sensitivity analysis

By model transformation between the fixed effects model and random effects model for the sensitivity analyses of the included factors (Table 4). The sensitivity analysis results were consistent (Additional files 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16).

Discussion

This systematic review of prospective cohort studies on the incidence of stroke summarized 7 risk factors for stroke that have been reported with relatively high frequency in prior studies. The prospective cohort studies incorporated into this review examined 867,782 Western or Asian participants and included investigations of large cohorts and studies of small community groups. To avoid biases attributable to differences in ethnicity and geography, the statistical analyses of the final combined results were stratified by Western or Asian population types.

Six factors affected the incidence of stroke in both Western and Asian populations. These factors included hypertension and diabetes, which are well-known risk factors for the incidence of stroke; thus, these findings are consistent with the results of earlier reports [66].

Studies have reported that a BMI ≥25 kg/m2 is associated with increased mortality among middle-aged populations, particularly from cardiovascular and cerebrovascular diseases, which are the leading causes of death [67]. Previous studies have reported similar findings for obesity [68]; in our study, obesity was one of the risk factors for the onset of disease in Asian populations, and an elevated BMI was associated with greater risk (Figure 2). In addition, the result of the risks associated with a BMI ≥25 kg/m2 was greater among Asian populations than among Western populations.

Figure 2
figure 2

Risk factors in Western and Asian populations. (**Former, Current, ≤20 Cigarettes/d and >20 Cigarettes/d; #Cardiac causes: Coronary heart disease, Left Atrial hypertrophy, Angina, Atrial fibrillation; HR Hazard risk; CI, Confidence interval; BMI, Body mass index; SBP, Systolic blood pressure).

Stroke-related studies in the community of Framingham in the United States and the North Karelia region of Finland, as well as investigations of interventions for stroke-related risk factors in 7 Chinese cities, have confirmed the existence of a correlation between blood pressure and stroke [6974]. SBP, a direct indicator of hypertension, is also a factor that directly influences the incidence of stroke. However, the risk of stroke was elevated in hypertensive patients relative to patients exhibiting high blood pressure without previously diagnosed hypertension [75]. The results of this study indicate that an increasing SBP is associated with an increased risk of stroke. In fact, the risk of stroke was greater among the participants with SBP ≥140 mm Hg than among the patients with diagnosed hypertension in both Asian and Western populations (Figure 2). A possible reason for this phenomenon is that most individuals diagnosed with hypertension have received appropriate anti-hypertensive interventions. Previous studies have shown that patients diagnosed with stroke often receive antihypertensive medication [15, 16]. From the results of our investigation, the risks associated with SBP ≥140 mm Hg is greater among Asian populations than among Western populations. For example, the cohort studies of a workforce in China [42],which examined steelworkers in Beijing, China, found an age-adjusted RR of 5.6 among these individuals, which was higher than the RR of 3.5 obtained in the Framingham study [76]. In our previous retrospective study [77], the Rothman and Keller [78] model was utilized to establish a multivariate model for predicting the incidence of stroke in human populations. Consistent with previous reports, this model represented the occurrence of stroke resulting from the combined long-term effects of multiple risk factors [79]. Our prior results also explored the effects of the interactions between hypertension and other risk factors on the incidence of stroke. However, the question of whether changes in other risk factors could alter the intensity of the risk of hypertension with respect to the incidence of stroke in a particular population (Western or Asian) merits additional study and discussion.

The Busselton Study reported that atrial fibrillation is a risk factor for stroke, with an RR of 5.9 [80], and other studies have also consistently indicated that atrial fibrillation is a risk factor for stroke [81]. In our investigation, atrial fibrillation was again identified as a risk factor; the result for this factor in Western populations was 2.30 (1.50 ~ 3.53), and there was no difference in Asian populations by interval estimation (Table 2). Consistent with our results, cardiovascular-related diseases (such as angina and CHD) were identified as risk factors for stroke as early as the Framingham Study and related reports [82]. Previous studies have suggested that left ventricular hypertrophy (LVH) is not correlated with the occurrence of stroke [13]; however, our investigation indicated that LVH is a risk factor for stroke in Western populations. Therefore, effective measures for preventing stroke in Western populations include lifestyle improvements, appropriate treatments, and reductions in the morbidities associated with chronic cardiovascular and cerebrovascular diseases.

Smoking is a risk factor for stroke, and the HR values for smoking-related behaviors were relatively high in Western populations. After stratifying the results based on the periods during which smoking occurred, it was determined that former smoking had no effect on the incidence of stroke in Western populations. Certain prior studies have suggested that being a former smoker can affect the incidence of stroke [8385], whereas other studies have found no significant effects associated with this characteristic [86]. We observed a dose–response association for the number of cigarettes smoked per day in Western populations: the HR value associated with smoking more than 20 cigarettes per day was much greater than the HR values of other risk factors by point estimation (Figure 2); this result is consistent with the findings of previous studies [86, 87]. Therefore, reducing daily cigarette consumption is an effective preventive measure for reducing the incidence of stroke.

The results from the studies of Asian populations indicated that long-term alcohol consumption was also a risk factor for stroke, although this factor had no effect on the incidence of stroke in Western populations (Table 3). Prior studies have also produced controversial results with respect to the significance of this factor: certain studies have determined that heavy long-term alcohol consumption is a risk factor for stroke [88], but other studies have reached the opposite conclusion [89]. However, heavy long-term alcohol consumption is a risk factor for many chronic diseases, and therefore, limiting alcohol consumption may play an indirect role in preventing the incidence of stroke.

Limitations

This systematic review and meta-analysis used the Newcastle-Ottawa scale to score all of the included studies. Overall, the quality of the studies on Chinese populations was significantly lower than the quality of the studies on other populations (T-Test: Z = 3.015, P = 0.007). The primary reason for this finding was that the populations examined in the former studies were mostly large, long-term, community-based cohorts. For example, the studies by the British regional heart study [24] and the PROCAM study [29, 30], among others, were based on data from large European cohorts [29, 30, 32]. By contrast, most of the included Asian studies examined small community-based cohorts or even hospital cases. For example, the research participants assessed by the cadre's sanitarium of Xi’an, China [37] were recruited from a sanatorium for retired military cadres in Xi'an, China. Result biases may be relatively low for large community cohorts but higher for smaller samples. In addition, bias may also be related to the dropout rate. For instance, the study in Shanghai, China [53] reported a dropout rate of 3.09%, whereas the CMCS study [44] reported a dropout rate of up to 22.5%. Although this investigation involved the examination of a large number of literature databases, there was less investigation of the unpublished literature and conference-related information, potentially creating publication bias.

Furthermore, only a limited number of studies were included in this investigation; thus, a large amount of data regarding risk factors was missing. In addition, few studies were pooled to obtain certain data points, particularly for Asian populations, and the number of studies finally included in the meta-analysis was very low for individual risk factors (usually in the range of 1 to 10), and this issue might have produced bias in the results. Therefore, these limitations are related to the absence of studies from South Asia and West Asia, and we believe this might be one reason for the publication bias. Consequently, to obtain more reliable data and results, additional large-sample, multi-center, randomized controlled studies and/or long-term cohort studies must be conducted.

Conclusions

This meta-analysis presents the data from large-scale prospective cohort studies to summarize and evaluate the risk factors for stroke in Western and Asian populations. The results of this investigation are credible, and the pooled results obtained in this review are more reliable than the findings from prior small-scale cohort studies and case–control studies. The analyses conducted in this investigation revealed that Western and Asian populations differ with respect to the factors that affect the incidence of stroke. Therefore, different interventional approaches should be implemented to reduce the risk of stroke among high-risk individuals from different regions and different ethnic groups.