Introduction

Primary aldosteronism (PA) is the most common surgically treatable cause of secondary hypertension with an estimated prevalence of 5–20% within the hypertensive population [1,2,3,4,5,6,7]. In the vast majority of cases, PA is either caused by bilateral adrenal hyperplasia or by a unilateral aldosterone-producing adenoma (APA). While bilateral hyperplasia is generally treated with a mineralocorticoid receptor agonist, adrenalectomy is the preferred treatment for patients with APA [8, 9].

In 2008, The Endocrine Society published a clinical practice guideline on PA with the goal of improving screening, work-up and treatment of PA worldwide [8]. The guideline recommended the use of the aldosterone-to-renin ratio (ARR) to detect cases of PA among hypertensive patients. Due to the risk of false-positive ARRs, case confirmation with a confirmatory test was recommended in all patients with a positive ARR. Computed tomography (CT) was recommended to exclude adrenocortical carcinoma and in case surgery for PA was indicated, and adrenal venous sampling (AVS) was recommended in all patients to distinguish APA from bilateral hyperplasia [8]. In 2016, an update of the Endocrine Society Guideline was published [9]. This revised guideline suggested that a specific subgroup of patients potentially do not have to undergo confirmatory testing or AVS. However, these recommendations were based on a relatively low level of evidence [9].

Within the work-up to adrenalectomy in patients with PA, however, a large variety of diagnostic tests and imaging modalities are available to establish the PA diagnosis and to determine laterality of disease. This is reflected in the numerous controversies in the literature and between experts in the field regarding the different preoperative work-up strategies. Currently, the most important topic of discussion is whether all patients should undergo confirmatory testing and AVS [10,11,12]. Moreover, these diagnostics could be considered expensive, laborious or technically demanding. Based on the above, we hypothesized that clinicians might deviate from the Endocrine Society guideline within current daily clinical practice.

In the past, complete cure of hypertension after the operation was estimated in approximately 50% of patients [13, 14]. However, recently the Primary Aldosteronism Surgery Outcome (PASO) study group and our own study group showed the less optimistic results by presenting a 27–37% cure rate within large, international and well-executed studies [12, 15, 16]. This stresses the need to evaluate current practice with the goal to improve the benefits of surgery. Since the present literature lacks data on how the work-up to surgery is performed in daily practice, we set out to evaluate and describe the performed work-up within a worldwide cohort of patients who underwent unilateral adrenalectomy for PA between 2010 and 2016.

Methods

Study population

We performed an international retrospective cohort study across 16 referral medical centers in The USA, Europe, Canada and Australia. The study cohort was established by the International CONNsortium study group and the derivation of the cohort and the blood pressure-related outcomes has been described in previous publications [15, 16]. In brief, all consecutive patients who underwent unilateral total adrenalectomy for APA between 2010 and 2016 were included retrospectively. We did not include or exclude patients based on the performed work-up strategy and, therefore, the diagnosis of PA and indication for surgery were based on the treating physicians’ assessment and discretion. In the majority of cases, biochemical evidence of PA was based on the ARR (or in some cases single aldosterone) measurements. Confirmatory testing was performed according to the physicians’ preference and/or availability of this test within each medical center. Unilateral disease was diagnosed based on CT and/or magnetic resonance imaging (MRI) and/or AVS according to the preference of the physician and/or availability of these modalities within each medical center. Since the cohort was initiated for a different study aim (i.e., to describe the reduction in blood pressure and antihypertensive medications after adrenalectomy), patients with missing preoperative or follow-up data regarding systolic blood pressure (SBP), diastolic blood pressure (DBP) or corresponding number of antihypertensive medications were not included in the cohort [15]. Data collection was performed separately within each center with the use of a standardized data-entry manual. Patient demographics, disease characteristics, laboratory data (e.g., measurements of ARR and confirmatory testing), results of CT/MRI/AVS, operative characteristics, pathology diagnosis and timing of follow-up were collected. Institutional review board approval was obtained in all participating centers.

Outcomes

The primary aim of this study was to evaluate the adherence to The Endocrine Society guideline for the work-up of patients treated for PA [8]. As our cohort consisted of who had an adrenalectomy before (or around) publication of the most recent 2016 version of the guideline, we chose to primarily compare our results to the guideline which was published in 2008 [8]. Within the new 2016 guideline, only two recommendations regarding work-up were introduced:

  1. I.

    In case of hypokalemia, plasma renin levels below detection levels and aldosterone above >20 ng/dL (550 pmol/L), no confirmatory testing may be needed;

  2. II.

    In case of age <35 years old, hypokalemia, marked aldosterone excess and unilateral cortical adenoma on CT, no AVS may be needed [9].

We additionally aimed to evaluate the potential influence of these new recommendations on clinical practice by examining the proportions of patients fulfilling/meeting these criteria within our cohort.

The secondary aim of this study was to identify potential disease or patient characteristics which encouraged clinicians to distinguish APA from bilateral hyperplasia and to determine laterality of disease based on CT alone without performing AVS.

Definitions

Due to the different assays and reference values within the participating centers, we were not able to analyze absolute values of biochemical measurements. To compare laboratory data between the centers, measurements were classified as elevated or suppressed when they were above the upper or below the lower limit of the center’s local reference ranges, respectively. Marked aldosterone excess was defined as an elevated aldosterone level, and hypokalemia was defined as either a potassium level below the local reference range or the use of potassium supplementation. When the results of biochemical measurements (e.g., ARR or confirmatory test) were not known within the operating centers, the measurement was reported as not performed. When these measurements were performed in other medical centers before referral, the results were reported within the database. Complete cure of hypertension was defined according to the PASO consensus criteria [12].

Statistical analysis

Normally and not normally distributed continuous data are shown as mean (± standard deviation) and median (range). To compare continuous variables between groups, the Mann–Whitney U Test was used for not normally distributed data and independent samples t tests for normally distributed data. The Chi-Square test and Fisher’s exact test were used to analyze group differences for categorical variables. To analyze potential variables associated with not making use of AVS, we performed multivariable logistic regression with backward stepwise selection including variables with p < 0.25 in univariable analysis. Only patients who underwent CT were included in this analysis. Multiple potential prognostic variables had missing values. These variables were imputed using multiple imputation generating 20 imputed datasets [17]. Outcomes were not imputed. Pooled odds ratios with 95% confidence intervals were obtained from multivariable logistic regression. All tests were two-sided and p-values < 0.05 were considered significant. Statistical analysis was performed using SPSS version 23.0 (Chicago, Illinois, USA), and figures were constructed using Graphpad Prism version 7.02 (GraphPad Software Inc, California, USA) and Draw.io version 10.5-1 (JGraph Ltd, Northamptonshire, UK).

Results

Four hundred and thirty-five (85%) patients were eligible for analysis [15]. Baseline characteristics of these patients are presented in Table 1. Most patients were men (57.2%). The mean age and mean BMI were 50.7 ± 11.4 years and 29.7 ±6.0 kg/m2, respectively. Hypokalemia was present in 73.9% of patients, and most patients had grade 1 hypertension (41.4%). Preoperative work-up data of these patients are presented in Table 2. In 82.9% of patients, a complete measurement of the ARR was performed and in 94.5% of these patients, the ARR was elevated indicating PA. A confirmatory test was performed in 32.9% of all patients, indicating PA in 89.5%. CT, MRI and AVS were performed in 86.9%, 17.0% and 65.3% of the cohort, respectively. Almost half of the patients (49.9%) underwent both CT and AVS for subtype testing. CT only, MRI only and AVS only were used in 28.5%, 5.1% and 3.7% of patients, respectively. Furthermore, CT combined with MRI was used in 3.9% and MRI combined with AVS in 4.4% of patients. All three modalities were used in 4.6% of patients (Fig. 1). As indicated in Fig. 2, large variability in work-up strategies was observed between the different medical centers. Depending on the medical center, the use of a confirmatory test, CT and AVS ranged from 0.0 to 94.6%, 66.7 to 100.0% and 9.1 to 100.0% of patients, respectively. All centers used AVS in some cases, and only one center performed AVS in all cases. Furthermore, MRI was used in all medical centers except one.

Table 1 Baseline characteristics of 435 patients
Table 2 Preoperative work-up
Fig. 1
figure 1

Imaging modalities used for subtype testing. CT = computerized tomography, MRI = magnetic resonance imaging; AVS = adrenal venous sampling

Fig. 2
figure 2

Large heterogeneity in the use of confirmatory testing, MRI, CT and AVS. CT = computerized tomography, MRI = magnetic resonance imaging, AVS = adrenal venous sampling. Legend: This figure shows the use of different diagnostic modalities in the work-up of primary aldosteronism specified by continent and medical center. As presented, a large variability in work-up strategies was used in daily clinical practice and, in contrast to the guideline, confirmatory testing and AVS were not regularly performed

Adherence to the 2008 endocrine society guideline

Out of the 435 patients who underwent surgery for PA, screening was performed by a complete ARR in 361 patients (83.0%) and in 341 patients (78.4%), this ARR was elevated suggesting PA (Fig. 3). Of the patients without a preoperative ARR, a preoperative aldosterone measurement was performed in 63.5% of patients showing elevated aldosterone levels in 72.3% of these patients. A confirmatory test was performed in 114 of the 341 patients with an elevated ARR (33.4%), and in 102 patients (29.9%) the test indicated PA. Ninety-one of these 102 patients (89.2%) underwent CT, and in 11 patients (10.8%) no CT was performed. These 11 patients underwent MRI and/or AVS. Sixty out of 91 patients (65.9%) also underwent AVS, and in 57 (62.6%) patients the AVS indicated unilateral disease. When combining these results, 57 out of the 435 (13.1%) patients who had surgery within this cohort underwent the complete work-up as recommended by the 2008 Endocrine Society Guideline [8]. All other patients did not undergo all recommended diagnostic modalities or, for instance, had an ARR or confirmatory test not compatible with PA (Fig. 3). Complete cure of hypertension after the operation was comparable between the 13.1% of patients with the complete work-up and all other patients, 30% and 27%, respectively.

Fig. 3
figure 3

Work-up in current clinical practice in contrast to the Endocrine Society Guideline 2008. *2 patients had failure of AVS procedure and in 1 patient AVS showed no lateralization. PA = primary aldosteronism, ARR = aldosterone-to-renin ratio, CT = computerized tomography, MRI = magnetic resonance imaging, AVS = adrenal venous sampling. Legend: This figure describes the work-up to surgery for primary aldosteronism in daily clinical practice as it was performed within this study cohort. The performed work-up was compared to the 2008 Endocrine Society Guideline as this was accurate during the inclusion period of the study. The figure shows low guideline adherence since 13.1% underwent the complete work-up according to the guideline. Patients were excluded when: (1) they did not undergo one of the recommended diagnostic test, (2) the results of the aldosterone-to-renin ratio and confirmatory test did not correlate with primary aldosteronism, (3) the results of adrenal venous sampling did not indicate unilateral aldosterone hypersecretion (all indicated by the horizontal arrows to the right)

Evaluation of the 2016 endocrine society guideline

Within the complete cohort, 177 patients (40.7%) preoperatively had known elevated aldosterone, suppressed renin and spontaneous hypokalemia omitting the need for confirmatory testing according to the revised 2016 guideline. Only 49 (18.9%) of the 258 patients that did not meet these criteria underwent confirmatory testing. Among the 242 patients in whom a preoperative CT was performed, only 30 (12.4%) patients were younger than 35 years of age and only 14 (5.8%) patients also had an elevated aldosterone, spontaneous hypokalemia and a unilateral nodule on CT. According to the 2016 guideline, these 5.8% of patients did not have to undergo AVS [9].

Variables associated with performing AVS

Univariable analysis showed that AVS was more frequently performed in case of older age, male gender, longer duration of hypertension, presence of hypokalemia, CT indicating bilateral disease or normal adrenal anatomy and a smaller tumor on CT (Table 3). After multivariable regression analysis, bilateral disease or normal adrenal anatomy on CT (OR 16.19; CI 3.50–74.99) (p < 0.001), smaller tumor size on CT (mm) (OR 0.06; CI 0.04–0.08) (p < 0.001) and hypokalemia (OR 2.00; CI 1.19–3.32) (p = 0.008) remained independently associated with performing AVS.

Table 3 Variables associated with performing AVS for subtype testing

Discussion

This study evaluated the work-up to adrenalectomy for PA within current daily practice in an international retrospective cohort of surgical patients. The results displayed a large variability in work-up strategies between the participating centers worldwide. During the inclusion period of this study, the 2008 Endocrine Society Guideline was applicable [8]. Only 13.1% of the operated patients underwent a complete work-up as was recommended by this guideline. Although almost all patients underwent CT (or MRI), confirmatory testing and AVS were performed in only one-third and two-third of the operated patients, respectively. The results also showed a significant variability in work-up within the majority of centers. Moreover, this variability in work-up was also shown between countries, between the centers within these countries and within the individual centers. Therefore, this study illustrates that clinicians most likely chose a particular work-up strategy, such as the selective use of AVS, based on their preferences or guided by case specifics instead of following the Endocrine Society or local guideline.

Currently, the ARR is the most reliable test for screening for PA [8, 9, 18, 19]. Confirmatory testing is recommended for all patients with a positive ARR to exclude the false-positive ARR results. However, this study shows the large variability in diagnostic work-up worldwide in which a confirmatory test was performed in only 32.9% of patients. The relatively low proportion of patients who underwent a confirmatory test and the large variability between the centers could be due to the fact that all confirmatory tests have some limitations and no universally accepted “gold standard’’ confirmatory test for PA is identified in the current literature [20,21,22,23,24,25,26]. Additionally, confirmatory tests are relatively expensive and frequently difficult to perform in outpatient settings [8, 9]. This may have contributed to the changes in the 2016 guideline. Our data show that a relatively large proportion of patients (40.7%) fulfills the triad of marked aldosterone, suppressed renin and hypokalemia. Omitting confirmatory testing in these patients would have been in agreement with the revised guideline of 2016. Therefore, this change in the guideline could induce a substantial reduction in confirmatory testing. It should be noted that this revised recommendation was based on a relatively low level of evidence, and therefore, not performing a confirmatory test is not without risks, especially because a patient with primary hypertension could be incorrectly diagnosed with PA and potentially undergo surgery based on the false-positive ARR results. Nevertheless, the recently published study by Umakasi et al. [27] supports the recommendation of omitting confirmatory testing in case of aldosterone excess > 20 ng/dL (550 pmol/L), suppressed renin and hypokalemia by presenting a PA diagnosis in 100% these cases.

There seems to be no consensus between clinicians on the use of AVS, as evidenced by the 34.7% of patients in this study who did not undergo AVS. Proponents argue that AVS should be considered as the “gold standard’’ for subtype testing, because multiple studies have shown its superiority over CT in determining disease lateralization. In these studies, the results of CT were compared to AVS as reference standard [11, 28, 29]. Opponents of AVS argue on the practical difficulties such as higher costs and the need of an interventional radiologist. This limits the wide availability of AVS, because some centers do not have the financial resources or expertise to perform AVS. In addition, AVS is an invasive procedure and also has failure and complication rates [10, 30, 31]. Furthermore, they argue that no significant differences in outcomes, such as antihypertensive medications or quality of life, were observed between CT and AVS within a randomized trial [32].

In this study, all participating medical centers used AVS in at least some patients. This suggests that AVS was available for all medical centers during some period of the inclusion period. Hence, we speculate that clinicians most likely chose to perform or not perform AVS based on their preferences or guided by case specifics. AVS was more frequently performed on patients with higher age, male gender, longer duration of hypertension and preoperative hypokalemia. Potentially, these represent the patients with more severe hypertension and/or hyperaldosteronism since some of these factors are also known as risk factors for less favorable clinical outcomes after adrenalectomy [12, 33, 34]. Furthermore, AVS was more frequently done in case of smaller tumor size, bilateral disease or normal adrenal anatomy on CT. Recently, Williams et al. also showed that AVS was more frequently performed in case of male gender and smaller tumor size on CT in univariable analysis. Furthermore, they indicated that AVS was used more often in case of lower blood pressure, higher ARR and lower estimated glomerular filtration rate [35].

The results of multivariable analysis within our cohort showed CT findings were independently associated with performing AVS. The presence of bilateral disease or normal adrenal anatomy on CT proved to be the most important trigger for clinicians to use AVS in daily practice. Likewise, 96.1% of patients with bilateral disease or normal adrenal anatomy on CT also underwent AVS. In contrast, patients with a clear unilateral nodule on CT and especially patients with larger tumors were less likely to undergo AVS. This further supports that CT findings most likely have the highest influence on the choice to perform or not perform AVS in daily clinical practice. Furthermore, hypokalemia proved to be independently associated with performing AVS. Nevertheless, it should be noted that Umakosi et al. [27] recently showed a higher percentage of hypokalemia in patients with APA compared to bilateral adrenal hyperplasia, 87% versus 21%, respectively. Therefore, one could argue to perform AVS less frequently in case of hypokalemia. This seems to be in line with the 2016 guideline allowing the omission of AVS in case of a clear unilateral cortical adenoma on CT when this is combined with hypokalemia, age < 35 years and marked aldosterone excess [9]. Within this study, only 6% of patients met these conditions and, consequently, this new recommendation only has marginal influence on daily practice.

Additionally, this study shows that work-up and surgery for PA also is performed in some centers with relatively low volume compared to centers in which PA is a clinical spearhead. This could have resulted in lower guideline adherence due to lower expertise and could stress the importance of further centralization of the treatment of PA. Furthermore, the low adherence can be due to the variety of diagnostic tests available of which some can be considered expensive, laborious or technically demanding. Also, it has been reported that it takes an average of 17 years for research evidence to reach clinical practice and, therefore, time to adoption of guidelines can take up to years or decades [36].

This study has some limitations. Similar to the majority of studies on PA, the retrospective design is a weakness. As a result, this study is more prone to missing data compared to prospective studies. Potentially, this could have led to lower rates of performed preoperative measurements of the ARR and confirmatory testing, as we chose to classify these modalities as not performed when the results were not known within the local patient files or referral letters. On the other hand, the retrospective design most likely is appropriate to evaluate different types of work-up strategies in clinical practice, as it reduces the influence of study protocols on decisions made by clinicians and therefore reflects daily practice. Because this is a surgical cohort, medically treated patients were not included. Therefore, we do not know if the results are representative for the medically treated PA population. Due to the different laboratory assays and reference values within the participating centers, we chose to not analyze absolute values of biochemical measurements. Therefore, marked aldosterone excess was defined as an aldosterone level above the local reference range, instead of the > 20 ng/dL (550 pmol/L) cutoff suggested with the 2016 guideline [9]. Consequently, this could have influenced the 40.7% of patients meeting the criteria for omission of confirmatory testing.

The blood pressure-related outcomes within this cohort were published earlier and therefore not reported within this manuscript [15, 16]. As presented within these and other recently published studies, complete cure of hypertension after the operation is far from a certainty [12, 15, 16, 37]. Potentially, this is due to the large variability in work-up strategies which stresses the importance of evaluating how we currently perform the work-up to surgery for PA. Although this study showed no difference in cure of hypertension between patients with and without a work-up as recommended by the guidelines, our study cohort actually is not suitable for properly investigating the potential influence of the presented uniformity in work-up strategies on the outcomes after surgery. This is due to the retrospective design which is prone to confounding by indication.

In conclusion, this study examined the work-up to surgery for PA within current worldwide daily clinical practice. The results demonstrate large variability in work-up strategies both within and between the medical centers resulting in relatively low guideline adherence. If we want to further improve the benefits of surgery for PA in the future, we should strive for a more uniform work-up to surgery worldwide.