Introduction

Pituitary adenoma (PA) is the second most frequent intracranial neoplasm and presents clinically as an incidental finding or with endocrine or mass effect manifestations [4]. Over the past decades, transsphenoidal surgery has established itself as the gold standard, first-line treatment for most subtypes of PA [15, 45, 82].

Recently, the influence of gender on clinical outcomes has seen a massive increase in interest among the scientific community, as outcome differences with clinical relevance have been established in various domains such as cardiovascular disease, autoimmune disease, and infectious disease. [11, 20, 41, 49, 50, 56, 62, 63, 65, 70, 80, 85, 87, 90]

Although there is literature concerning gender differences in the biology of the pituitary gland and in the presentation as well as the biology of PA, gender differences regarding the outcome of patients who underwent transsphenoidal resection of PA are poorly understood. [3, 25, 61, 63, 74, 76, 77, 88] It is currently unknown whether the different physiological hormone levels or any other gender differences may impact patient selection, success of treatment, or hormonal cut-off values [28, 32, 33].

Published data on this topic are scarce, and the authors are not aware of any literature review on gender differences in pituitary surgery, although there is one study assessing gender differences in non-surgical aspects of non-functioning PA (NFPA) [25]. Systematic reviews and meta-analyses can lead to more realistic results with better generalisability and less risk of bias compared to single studies [64]. In this study, we systematically reviewed the literature to evaluate the influence of gender and sexual hormones on outcomes after endoscopic or microscopic transsphenoidal PA surgery.

Materials and methods

Overview

A systematic review was carried out to identify any studies reporting at least one of (1) GTR (rate of radiological gross total resection), (2) rate of new endocrinological deficits, or (3) biochemical remission (for patients with hormone-secreting adenomas) after resection of PA stratified by gender or by preoperative sexual hormone (estrogen, testosterone, prolactin). Title and abstract screening, full-text review, and data extraction were handled independently by two reviewers (ST and SH), and disagreements at any stage were resolved by discussion and consensus. Persisting disagreements were resolved by discussion with a third reviewer (VS). We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol [64].

Search strategy

The PubMed/MEDLINE database was searched to identify eligible articles. The search strategy included combinations of the following terms: pituitary; adenoma; surgery; resection; transsphenoidal; gender; sex; male; female; prolactin; testosterone; estrogen; gross total resection; GTR; deficit; endocrine; endocrinological; and biochemical (Supplementary Table 2). Word variations and exploded medical subject headings were searched for whenever feasible. Additionally, reference lists were hand-searched to identify further studies of interest. The last comprehensive search was conducted on April 30, 2023.

Study selection

Only in vivo studies enrolling humans of all age groups in English, Italian, French, Dutch, and German were considered. As no controlled trials were anticipated, prospective and retrospective single-arm cohort studies and case series of adult individuals were also included. We excluded pediatric cases series. Case reports and small case series with less than 5 patients were excluded. To be considered, patients had to undergo endoscopic or microscopic transsphenoidal resection of PA. Studies had to assess at least one of the three abovementioned outcomes of interest stratified either by gender or by sexual hormone levels. In this way, we were able to rate the potential influence of sexual hormones and gender on outcomes. Studies reporting only resection of Rathke cleft cysts, craniopharyngiomas, or other lesions were excluded. We also excluded studies dealing mainly with transcranial or combined procedures. Studies reporting the outcomes of interest with a mix of targeted GTR and subtotal resection (STR) (i.e., a realistic caseload) were included. Exact cohort duplicates were excluded, although we did include updates of previously published cohorts with a sample size increase of at least 50%. Studies published before the 1st of January 1990 were excluded.

Data extraction and quality assessment

We extracted the following information if available from all included publications: study design and year of publication, number of patients, mean patient age and gender distribution, data on prolactin, testosterone, and estrogen levels, as well as data on GTR, new endocrinological deficits, and biochemical remission among patients with secreting adenomas. The methodological quality of included studies was graded using the GRADE framework [38].

Statistical meta-analysis

Based on anticipated heterogeneity and low event rates among studies, a random-effects analysis model (Mantel–Haenszel) that assesses odds ratios (OR) was chosen as the primary statistical method [39]. Cochran’s Q and I2 were used to evaluate heterogeneity, and a P < 0.1 was considered as relevant heterogeneity. All statistical analyses were carried out in RevMan version 5.4. Forest plots were generated to illustrate the main results of the meta-analysis.

Results

Literature search

As seen in the PRISMA flowchart in Fig. 1, the PubMed/Medline search provided 3238 articles to which none were added through other sources. After duplicate removal (n = 35), 3203 records were screened and 294 were assessed for eligibility through full-text screening, concluding 40 studies included in qualitative synthesis, all of which were also eligible for quantitative meta-analysis [2, 6,7,8,9,10, 12, 14, 17, 21, 27, 29, 31, 34, 40, 42, 47, 48, 51,52,53, 55, 57,58,59, 67, 68, 75, 78, 79, 81, 83, 86, 89, 91,92,93,94,95,96].

Fig. 1
figure 1

Flowchart of the literature eligibility assessment process

Study characteristics

The details of the 40 included studies are summarized in Table 1. We identified 31 studies reporting postoperative biochemical remission [2, 6, 8, 10, 12, 14, 21, 29, 31, 34, 40, 42, 47, 48, 51,52,53, 55, 57,58,59, 67, 68, 75, 78, 79, 81, 86, 89, 92, 94], eight reporting rate of GTR [7, 17, 27, 68, 75, 91, 94, 95], five reporting incidence of postoperative diabetes insipidus (DI) [7, 21, 58, 83, 93], four reporting preoperative levels of prolactin [21, 67, 94, 96], one study reporting rate of postoperative hyperprolactinemia [94], one study reporting both postoperative adrenocorticotropic hormone (ACTH) and thyroid-stimulating hormone (TSH) deficiency [9], one study reporting postoperative panhypopituitarism [83], and one study reporting any endocrinological deficit [55], while each of those endpoints was stratified by gender. No studies were found that reported preoperative levels of testosterone or estrogen, postoperative follicle-stimulating hormone (FSH), luteinizing hormone (LH) deficiency, growth hormone (GH) deficiency, or postoperative rate of syndrome of inappropriate antidiuretic hormone secretion (SIADH) by gender. Endpoints reported by one or less study could not be analyzed and are reported in Supplementary Table 1.

Table 1 Details of selected studies

Detailed qualitative interpretation of all analyzed outcomes including detailed certainty assessments is shown in Table 2.

Table 2 Summary of findings

Biochemical remission

Overall, 31 studies including 3605 patients (1410 male, 2195 female) were analyzed via random-effects meta-analysis, which showed an odds ratio of postoperative biochemical remission in males vs. females of 0.83 (95% CI 0.59–1.15). Heterogeneity was high with a I2-value of 71% (P < 0.00001) and the overall effect was 1.13 (P = 0.26) (Fig. 2). Twenty-three of the included studies were rated at a GRADE certainty of “very low” due to their small sample size while six were rated at a GRADE certainty of “low” (Table 1). Overall, after a detailed certainty assessment, the certainty of the outcome of this analysis was very low (Table 2).

Fig. 2
figure 2

Results of random-effects meta-analysis. a Rate of biochemical remission. b Rate of gross total resection. c Rate of postoperative diabetes insipidus. d Level of preoperative prolactin in ng/ml

Rate of GTR

In the evaluation of eight studies including 1155 patients (518 male, 637 female) via random-effects meta-analysis, an odds ratio of GTR in male vs. female patients showed to be 0.68 (95% CI 0.34–1.39). Again, heterogeneity was high with a I2-value of 74% (P = 0.0003) and the overall effect was 1.04 (P = 0.30) (Fig. 2). Five of the included studies were rated at a GRADE certainty of “very low” due to their small sample size while three were rated at a GRADE certainty of “low” (Table 1). Overall, after a detailed certainty assessment, the certainty of the outcome of this analysis was very low (Table 2).

Postoperative diabetes insipidus

The rate of postoperative DI in male vs. female patients was analyzed via random-effects meta-analysis including five studies with 598 patients (240 male, 358 female). The odds ratio was shown to be 0.40 (95% CI 0.26–0.64) with a low heterogeneity with a I2-value of 0% (P = 0.96). The overall effect was 3.92 (P < 0.0001) (Fig. 2). Three of the included studies were rated at a GRADE certainty of “very low” due to their small sample size while two were rated at a GRADE certainty of “low” (Table 1). Overall, after a detailed certainty assessment, the certainty of the outcome of this analysis was low (Table 2).

Preoperative level of prolactin

Four studies including 315 patients (108 male, 207 female) were analyzed via random-effects meta-analysis. The mean difference in the preoperative level of prolactin in male vs female patients was shown to be 11.62 (95% CI − 119.04–142.27) with high heterogeneity of I2 = 79% (P = 0.003). The overall effect was shown to be 0.17 (P = 0.86) (Fig. 2). Three of the included studies were rated at a GRADE certainty of “very low” due to their small sample size while one was rated at a GRADE certainty of “low” (Table 1). Overall, after a detailed certainty assessment, the certainty of the outcome of this analysis was very low (Table 2).

Discussion

The aim of this study was to identify gender differences in PA surgery. In our meta-analysis, there was a significantly higher rate of postoperative DI in female patients after endoscopic or microscopic transsphenoidal PA surgery, and although there was some data in isolated studies suggesting the influence of gender on postoperative biochemical remission, rate of GTR, and preoperative prolactin levels in patients after endoscopic or microscopic transsphenoidal PA surgery, these findings could not be confirmed in this meta-analysis and demonstrated no statistically significant effect of gender after endoscopic or microscopic PA surgery.

Gender differences are an important and established influence on clinical outcomes in various domains. [41, 50, 62, 63, 65, 70] While there is evidence of gender differences in clinical presentation and tumor size of PA [3, 16, 19, 25, 44, 68, 73, 84, 94], there is still little data available on its influence in PA surgery. Furthermore, in published studies, few data on baseline characteristics and outcomes concerning gender and sexual hormone status of participants is included: preoperative levels of testosterone or estrogen, postoperative FSH, LH or GH deficiency, or postoperative frequency of SIADH are not reported in any study and only single studies reported postoperative hyperprolactinemia [94], postoperative panhypopituitarism [83], postoperative ACTH and TSH deficiency [9], and any endocrinological deficit [55] stratified by gender (Supplementary Table 1).

In our analysis, we found overall no statistically significant impact of gender on biochemical remission of PA after surgery (Fig. 1). In single studies, however, differences by sex on biochemical remission are reported: Park et al. (2017) [68] found worse outcomes of male patients in comparison to premenopausal female patients with GH secreting adenomas. In addition, Yoo et al. [94] found worse outcomes in male than female patients with prolactinomas, and Arasho et al. [3] reported a worse outcome in male than female patients with prolactinoma, although in patients with non-functioning PA, significantly worse outcomes were observed in female than in male patients. The reasons for both these findings remain unclear but might be explained due to the difference in the distribution of patients with female patients with prolactinoma typically presenting at a lower age while not only older age at presentation but also larger tumor size and a possibly more aggressive biology of male patients with prolactinoma have been discussed, which in turn might lead to better outcomes in female patients [1, 13, 18, 19, 23, 24, 26, 94]. Although in prolactinoma a difference in the distribution of patients may be the reason for gender-specific outcomes, in other subtypes of PA the reason for such a difference, as in single studies there seems to be, remains unclear and future studies are needed to not only fully establish such a difference, but also a possible reason for it [3, 68]. The findings of these single studies suggest that in specific subtypes of PA and in specific age groups, sex and gender might have an impact on the biochemical remission of PA after surgery.

The impact of gender on GTR also showed no statistically significant difference between male and female patients. These findings are consistent with the findings of Park et al. (2017) [68], where no statistically significant influence of gender on the rate of GTR was found. Contrary to these findings, Yoo et al. (2018) [94] reported a significantly lower number of GTR in male patients, although this study included a much lower sample size than the analysis of this study and the study of Park et al. [68].

In our analysis, we found a significantly higher rate of postoperative DI in female than male patients. With a low heterogeneity of our analysis, these findings are consistent with included studies. It should be mentioned that there is one study by Joshi et al. [43] which specified postoperative DI by gender for all transsphenoidal surgery and, when analyzing for PA surgery alone, did not find a statistically significant difference of gender on postoperative DI and did not discuss this in detail. Although the rate of postoperative DI in female patients was significantly higher in our analysis, the possible reason for this outcome remains largely unclear. One possible explanation for this difference is a possible age difference between female and male patients at presentation, as prior studies have shown there to be a difference in age of presentation regarding gender in different subtypes of PA, most notably in prolactinoma, as mentioned above [63, 94]. In the studies included in this analysis, the two biggest studies Tiwari et al. [83] and Yasuda et al. [93] found lower age to be a risk factor for developing postoperative DI but neither of those studies analyzed or reported age of presentation by gender and its possible influence on this higher risk of postoperative DI at a lower age [83, 93]. So while there may be gender differences in age of presentation and both lower age at presentation and female gender have now been linked to higher rates of DI, it remains unclear if the two are linked or independent risk factors for postoperative DI. Additionally, the underlying reasons for a higher incidence of DI in both remain unclear while both a more aggressive surgical approach in younger patients and a smaller pituitary gland with a therefore higher vulnerability to resection in females have been discussed [83, 93]. While in our analysis we found a low heterogeneity of included studies, it must be stated that definitions of DI did differ significantly between included studies, most noticeably in the two biggest included studies Tiwari et al. [83] and Yasuda et al. [93] While in Yasuda et al. [93] DI was defined as “(1) polyuria: urinary flow greater than 250 ml/h for more than 2 h and (2) urinary hypoosmolarity: defined as a urinary density less than 1005,” Tiwari et al. [83] simply defined DI as a prescription for desmopressin at the time of discharge. While considering that patients were operatively treated and hospitalized for PA adenoma and it is therefore highly likely that a prescription of desmopressin in this circumstance will have meant postoperative DI, we cannot be sure that in Tiwari et al. [83] the diagnostic criteria of DI was homogeneous in their institution over included years, that this DI is a new postoperative phenomenon, or that, in fact, desmopressin was prescriped for DI in the first place [22, 30, 54, 60, 71, 72].

The levels of preoperative prolactin did not differ significantly between male and female patients, although this part of the meta-analysis has to be carefully interpreted due to the high heterogeneity, low sample size, and due to the sensitivity of meta-analyses of mean differences toward non-normal distributions of the source data, which could not be judged from the original publications. Within the literature on medically treated prolactinomas, the studies of Delgrange et al. [24], Khare et al. [46], and Nishioka et al. [66] all reported a significantly higher level of pre-treatment prolactin in male than female patients and with that a strongly correlating tumor size. Reasons for these bigger tumors in male patients are controversial and might be explained by either a longer delay of diagnosis due to fewer early symptoms of hyperprolactinemia or the greater proliferation potential of these tumors in male patients [24, 46, 66]. As explained above, the meta-analysis on prolactin levels that was possible from the included studies has to be carefully considered. In addition, while in most cases medical treatment is the appropriate initial treatment for prolactinoma [5, 66, 69] and in the abovementioned studies all patients were at least initially treated medically, our analysis only included patients that underwent transsphenoidal PA surgery, and therefore, levels of serum prolactin might be different than in patients that are initially or purely treated medically.

In nearly all the analyzed endpoints, except for the rate of DI, there was high heterogeneity of included studies. This is most likely due to the very different results of included studies regarding the impact of gender on analyzed outcomes, which in turn might be due to the small to very small sample sizes of most included studies. While high heterogeneity is not particularly desired, it could not be avoided in our analysis due to the very little available literature on the influence of gender on PA surgery.

Most included studies (32/40) were given a GRADE rating of “very low” with the remaining studies rated “low” (Table 1). This rating again was due to the low to very low number of either patients overall or outcomes reported by gender in included studies.

In terms of certainty assessment, the risk of bias was rated as not serious in all analyzed outcomes according to the GRADE framework [37]. Inconsistency was rated serious in analyzed outcomes of biochemical remission and GTR. In both outcomes, this was due to a high heterogeneity and low sample size of included studies as mentioned above. In postoperative DI, inconsistency was rated as not serious, as the analysis showed a low heterogeneity. In preoperative levels of prolactin, inconsistency showed to be very serious, due to the abovementioned high heterogeneity, low sample size, and sensitivity of meta-analyses of mean difference toward non-normal distributions of data. Indirectness was rated as not serious in all analyzed outcomes according to the GRADE framework [35]. Imprecision was rated as serious in both postoperative biochemical remission and DI, as very serious in postoperative GTR, and as extremely serious in preoperative levels of prolactin according to the GRADE framework [36], while in all outcomes, plausible residual confounding like age, comorbidities, and surgical indication would reduce the demonstrated effect. Overall, according to the GRADE framework, the analysis of postoperative DI had a low overall certainty, while the analysis of postoperative biochemical remission, GTR, and preoperative levels of prolactin all had a very low overall certainty (Table 2). This overall level of certainty is not surprising considering there were relatively few studies reporting outcomes by gender, a low to very low sample size of those who did, and a therefore relatively low sample size in all analyzed outcomes with most, except for postoperative DI, showing high heterogeneity.

Although our meta-analysis did not find a statistically significant difference of gender in postoperative biochemical remission, rate of GTR, or preoperative prolactin levels, these findings do not establish that there is no difference of gender in these outcomes at all. As this study searched for differences in all transsphenoidal pituitary adenoma surgery regardless of specific subtype, this generalization can lead to misleading findings and possible influences of gender in these subtypes may be overlooked. Furthermore, we cannot assure the homogeneity of included studies concerning age, comorbidities, and surgical indication. Nonetheless, our analysis found a higher rate of postoperative DI in female than male patients and while a possible influence of gender on postoperative biochemical remission, rate of GTR, or preoperative prolactin could not be found in our meta-analysis, still, in single studies, it appears that gender may have an influence on outcomes after pituitary surgery.

While there was a significantly higher rate of postoperative DI in female patients after endoscopic or microscopic transsphenoidal PA surgery, our analysis of the influence of gender on postoperative biochemical remission, rate of GTR, and preoperative prolactin levels did not demonstrate a statistically significant effect. Further research and studies with larger sample sizes and considering PA subtypes and different age groups (premenopausal vs. postmenopausal) are needed to establish a clear understanding of their impact on PA surgery. While reporting data stratified by gender and sexual hormones would be relatively easy, there is still little data reported as such and its significance remains to be examined. Future studies concerning PA surgery should report their data by gender or sexual hormones and ideally further assess their impact on PA surgery.

Limitations

The main limitation of this study is the general lack of gender-specific data reported in publications and the therefore relatively small sample size, although reporting data by gender would be simple and might lead to new evidence regarding gender sciences and its impact on neurosurgery and medicine as a whole.

Furthermore, we analyzed all PA treated with endoscopic or microscopic transsphenoidal surgery as a group and, because of the limited reporting and small sample sizes, did not specify PA by subtype. Gender and sexual hormones might have varying influences on surgical outcomes of different subgroups and a generalization of all PA may lead to misleading results. Additionally, the homogeneity of included studies concerning age, comorbidities, and surgical indication cannot be assured.

Another limitation of this analysis is that most included studies classified PA according to clinical phenotype. While as mentioned our analysis did not differ between subtypes of PA, there might have been differences in the classification of PA in included studies as PA subtypes defined as its clinical phenotype is not always identical to respective pathological studies, and the classification of tumors may have changed over the time period in which included studies were published.

Moreover, we cannot be sure that included studies homogenously defined reported outcomes as, for example, and as mentioned before, postoperative DI was not homogenously defined in all included studies.

Additionally, due to the small sample sizes, there is high heterogeneity in our analyses, and as a meta-analysis, there might be inherent publication bias in this study.

Conclusions

After an extensive literature search, we analyzed 40 studies regarding the influence of gender on endoscopic or microscopic transsphenoidal PA surgery. In our meta-analysis, there was a significantly higher rate of postoperative DI in female patients after endoscopic or microscopic transsphenoidal PA surgery, and although there was some data in isolated studies suggesting the influence of gender on postoperative biochemical remission, rate of GTR, and preoperative prolactin levels, these findings could not be confirmed in this meta-analysis and demonstrated no statistically significant effect. Further research is needed and future studies concerning PA surgery should report their data by gender or sexual hormones and ideally further assess their impact on PA surgery.