Background

Alzheimer's disease (AD) and related forms of dementia affect 13% of all adults over age 65 and the number of individuals with dementia in the United States is expected to increase from a current prevalence of 5.1 million to over 7.7 million individuals by 2030[1]. The effects of AD on society are substantial. In the United States the annual direct costs associated with AD are $148 billion and caregivers provide 8.5 billion hours of care annually[1]. Alzheimer's disease is also a leading reason for admission to long-term care facilities.

The pathophysiology of AD involves the accumulation of protein plaques composed of β-amyloid and the formation of neurofibrillary tangles related to hyperphosphorylation of tau proteins[2]. The cause of AD is multi-factorial with some of the strongest risk factors for AD include advanced age, female gender, educational achievement, family history and specific genetic mutations[3]. There are few modifiable risk factors which have been identified for AD, which include a history of head trauma and most cardiovascular risk factors[3].

Short-term cognitive dysfunction lasting days to weeks has been commonly observed following surgery[4] and is often referred to as postoperative cognitive dysfunction (POCD). Recently, some potential mechanisms linking inhaled volatile anesthetics to AD pathology have been proposed to provide a link between exposure to general anesthesia (GA) and the subsequent development of POCD or AD following surgery[5]. Commonly utilized inhaled anesthetics have been demonstrated to increase the formation of AD precursors including β-amyloid plaques[6, 7], and neurofibrillary tangles[8, 9] in animal models and in vitro studies. However, randomized controlled trials have failed to demonstrate an increased risk of persistent cognitive impairment in the 1 - 2 years following exposure to GA when compared to regional anesthesia[10]. To date there have been no randomized controlled trials that have examined the risk of developing cognitive impairment meeting clinical criteria for AD or related forms of dementia associated with exposure to GA. There have been previous narrative reviews on the potential relationship between anesthesia and POCD[11, 12] and systematic reviews which have examined the relationship between GA and postoperative delirium or POCD[1317]. Most reviews found that there was limited evidence to suggest any difference between GA anesthesia and regional anesthesia on the incidence of post-operative delirium or POCD [13, 1517]. One exception was a review on anesthesia for hip fracture surgery which found a reduction in acute postoperative confusion for regional anesthesia compared to GA [14]. To date there are no reviews that have examined whether exposure to GA is a risk factor for the development of AD. Therefore, the objective of our study is to systematically review the literature on observational studies examining the association between exposure to GA and subsequent development of AD. Understanding the risk of AD associated with exposure to GA would help in determining the relationship between GA and AD and inform strategies to prevent or minimize the risk of AD following surgical procedures.

Methods

Search strategy

We searched MEDLINE, EMBASE, and Google Scholar using key words and medical subject headings to identify relevant articles (see Additional File 1, Document 1). The reference lists of retrieved articles were hand-searched for additional references. There were no language restrictions and data from both published and unpublished studies were included provided that sufficient information was available for data extraction. Two authors (DS and NS) were involved in the selection of studies for the review and discrepancies were resolved by discussion after retrieving the full text of articles in question.

Types of studies

We followed the MOOSE[18] and PRISMA[19] guidelines for the reporting of meta-analysis of observational studies. Observational studies examining exposure to GA and risk of AD including both case-control and cohort studies were eligible for the review. We included human studies where the mean age of study population was ≥ 50 years of age. We excluded studies that examined POCD, postoperative delirium, or abnormalities on neuropsychological testing without a diagnosis of dementia from the review to focus only on the outcome of development of AD.

Definition of exposure

Exposure to GA was defined as any reported history of surgery under GA when compared to no history of surgery under GA. Where information was available, the following information was also recorded: history of exposure to GA when compared to regional anesthesia (RA); the number of previous GAs; cumulative exposure to GA as measured in minutes of GA exposure; the type of surgery performed under GA (cardiovascular, non-cardiovascular, or neurological); the type of agent utilized for GA; and, duration of time between exposure to GA and assessment for AD.

Definition of Outcomes

The primary outcome was a diagnosis of AD of any severity according to clinical impression or standard AD diagnostic criteria. These AD criteria included: National Institutes of Neurological and Communicative Disorders and Stroke - Alzheimer's Disease and Related Disorders (NINDS-ADRDA)[20]; Diagnostic and Statistical Manual of Mental Disorders (DSM)[21]; or the International Classification of Diseases. Secondary outcomes included time to development of AD, and the development of early-onset (<65 years) or late-onset AD (>65 years).

Data extraction

Two reviewers (DS and NS) extracted data from included studies using a standard data extraction form. Information extracted from studies included: age of participants, gender distribution, source population and description of sample selection methods. We also recorded potential confounders including medical morbidity and indication for surgery. For case-control studies, we recorded the number of cases and controls with exposure to GA and the summary odds ratio (OR) and 95% confidence intervals (CI) reported in studies from matched or adjusted analyses.

Assessment of Study Quality

Primary studies were reviewed according to the STROBE criteria[22] and study quality checklist based on the Newcastle-Ottawa criteria for case-control and cohort studies was developed[23] to describe potential risk of bias according to seven aspects of study quality (see Additional File 2, Document 2). We classified case-control studies as being at overall high or low risk of bias. A case-control study was considered to be at low risk of bias if all of the following study quality items were recorded as low risk of bias: definition of cases, sample used for selection of controls, and matching or adjusting for a minimum of age and gender.

Data Synthesis and Meta-Analysis

We included a qualitative description of studies meeting inclusion criteria using text and tables. Studies were first assessed qualitatively for homogeneity and homogeneous studies were combined in meta-analysis to arrive at (OR) and 95% confidence intervals (CI) for the association between GA and AD. We combined case-control studies that controlled for a minimum of both age and gender either through matching or statistical adjustment. When multiple control groups were used in a study we only included the control group that was most representative of the source population for meta-analysis. Random effects models were utilized for all meta-analyses given the expected heterogeneity between different studies. Study weights were assigned using the inverse of the study variance. The software package Comprehensive Meta-Analysis (version 2.2) was utilized for all analyses.

Heterogeneity

We assessed study heterogeneity qualitatively by assessing study populations and study designs. We assessed statistical heterogeneity quantitatively using the Cochran Q statistic and used p-values of ≤0.1 as our threshold for heterogeneity. We used the I2 statistic to quantify the degree of statistical heterogeneity.

Subgroup and Sensitivity Analysis

We planned to analyse case-control and cohort studies were separately. We planned to undertake the following subgroup analyses: by control group (no surgery control groups or regional anesthesia control groups); high and low risk of bias studies; AD diagnosis (standard AD criteria compared to clinical criteria or other methods of diagnosis); and, type of surgery (non-cardiac surgery, neurological, or cardiac surgery). We also examined the summary OR for exposure to any GA after excluding each study sequentially to evaluate for studies that may have had a large influence on the results of the meta-analysis. We also conducted a meta-regression of the association between the reported OR for GA exposure and risk of AD using year of publication as an independent variable as older anesthestic agents may be associated with a greater risk of AD when compared to newer anesthetics[6].

Publication Bias

We assessed the potential publication bias by visual inspection of the funnel plot produced by plotting the standard error against the log OR of studies.

Results

Study Selection

The flow of studies through the review process is outlined in Figure 1. A total of 15 case-control studies were identified that met inclusion criteria[2438]. Fourteen of the included studies were published as full manuscripts and one was presented in poster form at a conference[37]. There were no cohort studies that met our inclusion criteria.

Figure 1
figure 1

Flow of Studies through Review Process.

Description of Included Studies

The characteristics of the included studies are described in (see Additional File 3, Table S1). A total of 1,752 cases and 5,261 controls were included in the respective studies. Cases and controls in most studies were comparable in terms of age and gender distribution. Ten studies used matching to control for potential confounding related to age and gender with the remaining 5 studies using statistical adjustment for these factors. Eight studies determined exposure to GA through interviews with proxies of cases, and 5 studies used medical records to determine exposure. Two studies used incident cases of dementia from cohort studies and exposure to GA was determined through interviews of individuals prior to them developing AD.

Assessment of Risk of Bias

The potential risk of bias associated with various aspects of study design is described in Additional File 4, Table S2. Four studies were rated as being at overall low risk of bias[32, 33, 36, 37] with the remaining studies being at higher potential risk of bias due to some aspect of study design.

Meta-Analysis

Association between Exposure to Any Surgery with General Anesthetic and Alzheimer's Disease

All of the 15 studies included in the systematic review controlled for age and gender and were subsequently suitable to combine in meta-analysis. The association between exposure to GA and risk of AD for each of the studies is summarized in Additional File 5, Table S3. Meta-analysis of the 15 studies revealed no statistically significant association between previous exposure to GA and development of AD (pooled OR = 1.05; 95% CI: 0.93 - 1.19: Z = 0.80, p = 0.43) (Figure 2). There was little evidence of statistical heterogeneity between studies as assessed by the Q statistic (Q = 9.76, df = 14, p = 0.78) and the magnitude of statistical heterogeneity was minimal (I2 = 0.0%).

Figure 2
figure 2

Forest Plot of Any Exposure to General Anesthesia and Alzheimer's Disease.

Subgroup Analysis

Exposure to General Anesthesia Compared to Regional Anesthesia

A total of two studies reported on the risk of AD associated with GA and regional anesthesia[26, 30]. The OR for AD associated with exposure to GA was 1.06 (95% CI: 0.64 -1.74, p = 0.83) in these two studies (Figure 3). The OR for AD associated with previous exposure to regional anesthesia was 0.68 (95% CI: 0.4 - 1.14, p = 0.14). The difference in the odds ratios was not statistically significant (Q = 0.55, df = 1, p = 0.46).

Figure 3
figure 3

Forest Plot of General Anesthesia Compared to Regional Anesthesia and Risk of Alzheimer's Disease.

Number of Prior Anesthetics or Cumulative Exposure to General Anesthesia

One study reported on the association between number of anesthetics and the cumulative duration of anesthesia and AD[31]. There was no statistically significant association between the number of surgeries involving GA or cumulative exposure to GA. The OR for exposure to six or more surgeries under general compared to fewer than six was 1.44 (95% CI: 0.77 - 2.71) and the association between a cumulative duration of greater than 10 hours of exposure to GA when compared to less than 10 hours was 1.63 (95% CI: 0.53 - 5.04).

Studies with Low Risk of Bias Compared to High Risk of Bias

A total of four studies [32, 33, 36, 37] were rated as being at overall low risk of bias according to our risk of bias assessment. The OR for exposure to GA and risk of AD in these low risk of bias studies was 1.09 (95% CI: 0.93 - 1.28; Z = 1.05, p = 0.29) while the OR in the high risk of bias studies was 1.00 (95%CI: 0.82 - 1.21; Z = 0.03, p = 0.98). There was no statistically significant difference in the OR between the high and low risk of bias studies (Q = 0.83, df = 1, p = 0.36).

Criteria for Defining Dementia

Eight studies used standardized criteria for diagnosing AD [27, 28, 3035] and seven studies used clinical criteria or other methods for diagnosing AD [2426, 29, 36, 38, 39]. The OR for exposure to GA and development of AD in studies that used standard criteria was 1.06 (95% CI: 0.84 - 1.33, p = 0.63) and the OR for GA and AD in the remaining studies was 1.05 (95% CI: 0.90 - 1.22, p = 0.53). There was no significant difference in the OR for these subgroups (Q = 0.13, df = 1, p = 0.72.).

Type of Surgery

There were no studies included in our review that specified the type of surgery that was performed under general anesthesia so no subgroup analyses could be undertaken.

Time Between Exposure to General Anesthesia and Assessment of Outcome

Only one study [25] specified a minimum duration of 5 years between exposure to GA and AD with an OR of 0.61 (0.22 - 1.63) while the remaining studies did not report the duration of time elapsed between GA exposure and assessment of dementia.

Sensitivity Analysis and Meta-Regression

The OR for exposure to any GA and AD was not statistically significant when calculated after each study was sequentially excluded. In addition, the summary OR did not change after excluding the two studies that used stepwise regression to adjust for confounders[35, 38] when compared to studies that explicitly matched or adjusted for a minimum of age and gender.

There was no significant association observed between the log OR and year of publication in meta-regression. To investigate the potential effect that additional studies would have had on our results, we calculated the Orwin's failsafe N to determine the number of studies of a given effect size that would be required to change the observed OR by a given amount. We selected a mean observed OR for a hypothetical group of missing studies to be 1.3 based on the observed upper range of the ORs in the studies identified in our review. Assuming a minimal clinically significant OR of 1.2 for the association between GA and AD, an additional 25 studies, each with a mean observed OR of 1.3, would be needed to arrive at an statistically significant OR of 1.2.

Publication Bias

There was no evidence of potential publication bias as assessed by visual inspection of the funnel plot produced by plotting the standard error against the log OR of studies.

Discussion

This is the first systematic review and meta-analysis of observational studies examining the association between exposure to GA and risk of developing AD. We found a number of case-control studies and no cohort studies that examined the association between GA and development of AD. Meta-analysis did not reveal any association between previous exposure to GA and development of AD. This finding was supported by several subgroup and sensitivity analyses. However, many of the studies included in our review were at some potential risk of bias from certain aspects of study design and the potential biases inherent in all observational studies may have contributed to the observed findings.

The findings from our review are consistent with the existing clinical literature on the lack of long-term cognitive sequelae associated with exposure to GA. Although cognitive dysfunction is common following surgery, much of the observed changes in cognition may be attributed to the stress of surgery and recovery[12, 39] rather than any specific effects of anesthetic technique. Large-scale studies have reported that the prevalence of POCD in the week following non-cardiac surgery is 25% with 9.9% of individuals continuing to display cognitive dysfunction at 3 months following surgery[4]. However, a randomized controlled trial comparing GA to regional anesthesia in non-cardiac surgery found little difference in persistent POCD associated with different anesthetic techniques. In this study, regional anesthesia was associated with a non-statistically significant difference in early POCD at day 7 (19.7% vs 12.5%), although the proportions of individuals with POCD in the regional and GA groups were similar at 3 months (14.3% vs. 13.9%)[40]. This same study reported that most individuals recovered cognitive function by 1 - 2 years following surgery[10], indicating that there is little evidence of long-term cognitive consequences associated with GA when compared to regional techniques. Similar results have been reported in other randomized controlled trials comparing GA to regional anesthesia in the elderly[41] as well as observational studies[39]. Systematic reviews of POCD following non-cardiac surgery identified limited evidence for long-term cognitive effects of GA although there were relatively few well-designed studies with follow-up times of sufficient duration to allow for the development of AD following exposure to anesthesia[13, 17]. POCD appears to be more common following cardiac surgery when compared to non-cardiac surgery likely due to factors such as cardiopulmonary bypass, intraoperative hypotension, and risk factors which are common for both cardiovascular disease and cerebrovascular disease[42, 43]. The relative contribution of GA to POCD in cardiac surgery setting is difficult to ascertain as many cardiac procedures cannot be undertaken using regional anesthetic techniques.

In contrast to our review and previous prospective studies, evidence from animal models and in vitro studies suggesting that exposure to GA may promote AD processes. Animal studies have suggested that exposure to the volatile anesthetic gas halothane may increase amyloid deposition in transgenic mice while isoflurane had little effect on amyloid deposition[6]. Another animal study found that sevoflurane was also associated with increases in beta-amyloid in mouse models[44]. Increases in tau hyperphosphorylation have also been observed with exposure to ether anesthesia[8]. Additional human tissue culture studies report that isoflurane[7, 45, 46], desflurane[47], and sevoflurane[7] may also be associated with increases in beta-amyloid related pathology. In our review we were not able to ascertain possible differential effects of individual anesthetic agents. However, halothane is an older medication that has largely been replaced by newer anesthetic agents and our study did not find that there was any association between year of publication and association between GA and AD. To date, there have been no studies that have examined the effects of GA on AD pathology in humans following typical exposure to GA through either biomarkers or neuroimaging and the results observed in these basic studies require further confirmation from human studies to determine the clinical significance of these findings.

The main strengths of our study include the rigorous methods employed to identify studies and assessment of potential risk of bias. The lack of association between GA and risk of AD was robust to a number of subgroup and sensitivity analyses. We did not observe any evidence of publication bias in the studies included in our review, which decreases the likelihood that our findings were related to our method of selecting articles. Finally, the observed lack of association between exposure to GA and AD is not likely to be affected by confounding related to factors associated with requiring a surgical procedure as this would have been expected to show an elevated risk of AD associated with GA which was not observed.

There are potential limitations to our study. Recall and information bias are problematic in all case-control studies and are potentially of greater importance in case-control studies of conditions such as AD where individuals cannot provide an accurate record of past exposures. Other studies have shown that proxy reporters can be used to determine exposure status in case-control studies of neurological diseases although the accuracy of reporting is dependent on the nature of the exposure[48]. All of the studies included in our review attempted to address this source of bias by using proxy reporters for exposure history or using medical records. In three of the studies included in this review, the accuracy of proxy reporters for determining exposure history was assessed through agreement between the history provided by controls with normal cognition and proxy reporters of these controls[24, 28, 32] with agreement between controls and proxies of controls on history of exposure to GA in the moderate range of agreement indicating that proxy reports of exposure may be prone to bias. Other important factors such as potential dose-response relationship were only available from one study and no studies included information on the type of surgery which may have an important impact on the subsequent risk of AD. Finally, some individuals with early cognitive impairment or dementia may be less likely to be offered surgical procedures which may have reduced the apparent observed risk of AD associated with GA in our review.

Conclusions

At the present time there we found no evidence to support an association between exposure to GA and increased risk of developing AD based on available observational studies. Patients should be warned about the potential long-term cognitive sequelae of undergoing surgical procedures and efforts should be made to optimize the perioperative care of older adults who are at risk of delirium and postoperative cognitive decline. The decision to use GA over other anesthetic techniques should be made on the basis of the surgical procedure and other clinical factors related to anesthetic choice. Further long-term prospective cohort studies or randomized controlled trials using biomarkers or neuroimaging modalities are required to further understand the associations between GA and AD.