Introduction

Pituitary adenomas are benign tumours of the pituitary gland that can be classified clinically on whether they are functioning (hypersecreting pituitary hormone/s) or non-functioning adenomas. Surgical management through endoscopic endonasal transsphenoidal resection is an accepted and commonly used technique to remove these tumours, demonstrating superior rates of gross total resection than traditional microsurgical techniques [1].

Over the past few years, there have been multiple studies demonstrating that a learning curve exists for endoscopic pituitary surgery [2, 3]. However, there appears to be significant variability in the way these studies report their outcomes. This makes understanding and comparing studies challenging.

This study aims to systematically review the literature regarding outcomes for endoscopic pituitary surgery and how this may be related to a surgical learning curve.

Literature search

A search strategy was devised according to the 2020 Preferred Reporting Items of Systematic Reviews and Meta-Analyses (PRISMA) statement [4] (refer to supplementary Figure 1). An electronic search of the databases Medline, Scopus, Embase, Web of Science and Cochrane Library databases was performed from inception until the 7th of May 2023. To identify articles investigating how outcomes for endoscopic pituitary surgery change as a surgeon gains more experience, the following search terms were applied: (pituitary OR pituitary adenoma) AND (visual OR ophthalmology OR endocrine OR hormonal OR resection OR outcome) AND (learning curve OR experience) with prior checking in the MeSH database to include synonyms.

The database search was further supplemented by a search of the reference lists of included studies as well as checking the related article function provided by each database. Titles and abstracts were screened to identify potentially relevant studies. All potentially relevant articles, or articles where it was unclear based on the abstract, were assessed by reviews of the full-text articles.

Articles were deemed eligible if they (1) recorded preoperative information regarding visual function and endocrine function; (2) examined endocrine and ophthalmological function postoperatively; (3) reported complications; (4) performed statistical analysis on outcomes after dividing patients into groups based on when surgery was performed. Studies were excluded when (1) they did not provide long-term follow-up for endocrinological outcomes; (2) a focus of the article was not to examine the learning curve for endocrine outcome; (3) if patients undergoing microscopic pituitary surgery were included; (4) if the surgical technique changed significantly during the study period.

Data extraction

All data was reviewed independently by 2 authors (NC and CO) and discrepancies cross-checked in a consensus meeting.

The following data was obtained from the included studies: number of patients, location of surgery, time period when operations occurred, who performed the surgeries, role of skull base ENT surgeons, duration of follow-up, how the groups were divided temporally, division of functional and non-functional tumours, all available preoperative endocrine information, visual function, average operative time, postoperative CSF leak rate, endocrine outcome, visual outcome and extent of resection.

Quality assessment

We used a modified quality assessment tool incorporating the Cochrane Collaboration tool to assess the methodological quality of the included articles [5]. The quality assessment tool (Table 1) assessed: demographic details, preoperative variables, postoperative variables, complications and learning curve. The same two authors (NC and CO) then evaluated the risk of bias in the individual articles using a modified version of the Cochrane Collaboration method (Table 2). Discrepancies were resolved after discussion and consensus amongst all authors.

Table 1 Quality assessment tool
Table 2 Grading of quality assessment

Results

Study selection

From the literature search, 78 articles were identified through searching Medline, Scopus, Embase, Web of Science and Cochrane Library databases (refer to Fig. 1). Fifty-nine were initially excluded based on the content of the title or the abstract. The most common reason for exclusion was being unrelated to assessing the learning curve. Eighteen articles [2, 3, 6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21] proceeded to full-text review with 10 articles [2, 3, 7,8,9, 11, 12, 15, 20] being selected for inclusion. Of the 8 articles excluded, 2 articles [18, 19] were excluded because they did not assess the operative learning curve, 3 articles [6, 14, 16] did not report on the rate of endocrinopathy in a way that it could be examined in the context of a learning curve, 1 article [21] reports on the same group of patients within a larger database, 1 article [13] was excluded as it reported other sella pathology and 1 article [10] was excluded because the surgical technique changed significantly during the study period.

Fig. 1
figure 1

PRISMA flowchart demonstrating the results of study selection

Study characteristics

Of the 10 included studies, there was significant variation in the number of patients, duration of follow-up, methodology and method of reporting for outcome variables.

Kenan et al. [2] reported on 78 patients who underwent endoscopic endonasal transsphenoidal resection of a pituitary adenoma at Kocaeli University Hospital in Turkey between 1997 and 2005. Operations were performed by one of the two senior authors initially with otolaryngology assistance, but later without. The minimum duration of follow-up was 6 months with variable follow-up ranging from 44.8 months to 9 months. The groups were divided into an early group of 9 cases from 1997 to 2000 and a late group of 69 cases from 2001 to 2005. Endocrine outcomes for cure were defined for prolactinomas and somatotropinomas with other functional adenomas being excluded; patients with non-functioning adenomas did not receive assessment. Variables reported included operative time, postoperative cerebrospinal fluid (CSF) leak rate, rate of diabetes insipidus (DI), degree of resection on MRI scan at 6 months, formal visual assessment at 1 month and endocrine assessment at an undefined time interval.

Leach et al. [3] reported on the first 125 patients who underwent endoscopic endonasal transsphenoidal surgery at the Department of Neurosurgery Royal Salford Hospital in Manchester between 2005 and 2007. All procedures were performed by one surgeon who had been previously performing microscopic transnasal surgery for pituitary adenoma resection and had received 6 months of training in endoscopic pituitary surgery prior. The mean duration of follow-up was 18 months. The groups were divided into two 15-month periods of 53 patients from April 2005 to June 2006 and 72 patients from July 2006 to September 2007. Endocrine outcomes were defined for non-functioning and functioning adenomas. Variables reported included operative time, postoperative CSF leak rate, rate of DI, degree of resection on magnetic resonance imaging (MRI) at 4–6 months, basal and reserve pituitary function at 2 to 3 weeks and ophthalmological assessment at 4 to 6 weeks.

Bokhari et al. [8] reported on 79 consecutive patients who underwent endoscopic endonasal transsphenoidal resection of a pituitary adenoma at St. George Public and Private Hospitals between July 1998 and September 2010. All procedures were performed by a single neurosurgeon. The mean follow-up for patients was 38 months. Patients were divided into 3 equal groups of 27, 26 and 26 cases respectively. These groups spanned between the years 1998–2004, 2004–2006 and 2006–2010. The endocrinological cure for functional adenomas is clearly defined, but the assessment for non-functional adenomas is not clear. Variables reported included postoperative CSF leak rate, rate of DI, degree of resection on MRI scan, formal visual assessment and endocrine assessment.

Chi et al. [9] reported on 80 consecutive patients who underwent endoscopic endonasal transsphenoidal resection of pituitary adenoma at the Department of Neurosurgery Renji Hospital Shanghai between 2011 and 2012. All procedures were performed by a neurosurgeon without otolaryngology assistance who previously performed microscopic transsphenoidal procedures and spent 3 months training in endoscopic techniques. The duration of follow-up is undefined, but all patients had at least 1 month of follow-up. Patients were divided into an early group of patients 1 to 40, and a late group of patients 41 to 80. Endocrine outcomes for functional adenomas are defined, but non-functioning adenomas are not assessed. Variables reported include operative time, postoperative CSF leak rate, rate of DI, extent of resection on MRI, formal visual field assessment and endocrine assessment.

Shou et al. [17] reported on 178 consecutive patients from March 2011 to May 2014 who underwent endoscopic endonasal transsphenoidal pituitary adenoma resection at the Department of Neurosurgery, Shanghai Pituitary Tumour Centre. All procedures were performed by 2 neurosurgeons without otolaryngology assistance. All patients had at least 12 months of follow-up. Patients are divided into 2 groups with 89 patients in each group. It is unclear over what time these 2 groups encompass. Endocrine outcomes for functional adenomas are defined for somatotropinomas and prolactinomas, but not for corticotropinomas or non-functional adenomas. Variables reported included rates of resection on MRI scan, postoperative CSF leak, rate of DI and endocrine outcomes. Visual outcome is not reported.

Qureshi et al. [15] reported on 78 consecutive patients who underwent endoscopic endonasal transsphenoidal surgery for resection of pituitary adenoma at the Department of Neurosurgery Rush University Chicago, between 2006 and 2012. All procedures were performed by a single team of neurosurgeons and otolaryngologists. Patients had at least 6 weeks of follow-up. Patients were divided into an early group of 9 patients and a late group of 69 patients based on post hoc analysis. Endocrine outcomes for functional cure are not defined and assessment of non-functioning adenomas is not defined. Variables reported included operative time, rate of postoperative CSF leak, rate of DI, degree of resection on MRI scan, visual field and endocrine outcomes.

Kim et al. [12] reported on 331 patients who underwent endoscopic endonasal transsphenoidal resection of non-functioning pituitary adenomas at Seoul National University Hospital from 2010 to 2016. Operations were performed by a single surgeon. Post hoc analysis using receiver operating characteristic curve analysis was performed to determine the number of surgical cases for a difference in gross total resection. This created 2 groups: 0–100 cases between 2010 and 2011 and 101–331 cases between 2012 and 2016. Endocrine outcomes for non-functioning adenomas were defined at a specified time interval. Functional adenomas and apoplexy were excluded. Variables reported included rate of postoperative CSF leak, rate of DI, degree of resection on MRI scan, endocrine assessment and visual assessment.

Younus et al. [20] reported on 600 consecutive patients who underwent endoscopic transsphenoidal resection of a pituitary adenoma at the Department of Neurosurgery Will Cornell Medicine, New York, between 2004 and 2018. All surgeries were performed by a single neurosurgeon with assistance from an otolaryngologist and a neurosurgery resident or fellow. Patients were divided into 4 quartiles with 150 patients in each. Endocrine outcomes for functional adenomas are defined at a specified time interval, whereas non-functional adenomas are not defined. Variables reported included rate of postoperative CSF leak, rate of DI, degree of resection on MRI scan, endocrine assessment and visual assessment.

Boetto et al. [7] reported on 53 patients who underwent endoscopic transsphenoidal resection of a non-functioning pituitary adenoma between November 2017 and November 2020 at the Department of Neurosurgery, Montpellier University Medical Centre. Cases were performed by a single neurosurgeon with ENT assistance. Patients had a minimum of 12 months follow-up. Patients were divided into an early and late cohort of 30 patients and 23 patients respectively. These cohorts covered the first 2 years and the final year of the study. All patients underwent endocrine assessment at defined time intervals. Patients with functional tumours were excluded. Variables reported included operative time, rate of postoperative CSF leak, rate of DI, degree of resection on MRI, endocrine assessment and visual assessment.

Huang et al. [11] reported on 273 patients who underwent endoscopic transsphenoidal pituitary adenoma resection between December 2014 and August 2021 at Shanghai Changzhgen Hospital. All procedures were performed by 3 neurosurgeons without the assistance of an ENT surgeon. Endocrine outcome is defined for functional adenomas at a defined interval, but not defined for non-functional adenomas. Cases were divided into an early, middle and late period of 91 cases each. Variables reported included operative time, rate of postoperative CSF leak, rate of DI, degree of resection on MRI scan and endocrine assessment. Visual outcome is not reported.

Demographic findings

Patient demographics

Patient demographics are reported variably between studies. Variables reported included age, gender, type of pituitary adenoma, preoperative endocrinopathy, visual function and radiological factors. These are reported in Table 3.

Table 3 A table demonstrating the demographic and preoperative data for the included studies

Outcome findings

Outcome variables are reported inconsistently between studies. The variables are presented in Table 4 and summarised below.

Table 4 A table demonstrating the outcome variables for the included articles

Rates of gross total resection were reported in all 10 articles [2, 3, 7,8,9, 11, 12, 15, 17, 20]. Six of the articles [2, 9, 11, 12, 17, 20] demonstrate a significant improvement in the degree of resection in later cases compared to the earlier cases. Three of the articles [7, 8, 15] report an improvement, but it was not statistically significant. One article [3] commented on the rate of ‘large tumour residual’ between early and late surgical groups.

Average operative time was reported in 5 articles [2, 3, 7, 11, 15] where a statistically significant reduction in operating time occurs in later surgical groups compared to earlier groups. The remaining 5 articles [8, 9, 12, 17, 20] do not report the operative time.

CSF leak rate was reported in all 10 articles [2, 3, 7,8,9, 11, 12, 15, 17, 20]. CSF leak rate was only reported as an overall percentage in 4 articles [3, 8, 12, 17]. The remaining 6 articles [2, 7, 9, 11, 15, 20] reported CSF leak rate in each surgical group. Two articles [7, 9] demonstrated an increase in the rate of postoperative CSF leak from 2.5 to 7.5% and 0 to 8.6% respectively. The remaining 4 articles [2, 11, 15, 20] reported either stable CSF leak rates, or decreasing rates that were not statistically significant

Visual outcomes were reported in 7 articles [3, 7,8,9, 12, 15, 20]. Of these, 1 article [3] reported a statistically significant difference in the proportion of patients with visual field improvement between surgical groups. One other article [12] reported an OR 2.15 (1.25–3.7) for the effect surgical experience would have on visual recovery. The remaining 5 articles [7,8,9, 15, 20] reported a trend showing high rates of good visual outcomes in the late surgical groups compared to the earlier groups, or had high rates of visual improvement between all groups. There were 3 articles [2, 11, 17] that did not report visual outcomes.

Endocrinological outcomes were variably reported in all 10 articles [2, 3, 7,8,9, 11, 12, 15, 17, 20].

Rates of endocrine outcome or cure for functional adenomas were reported in 7 articles [2, 3, 8, 9, 11, 17, 20]. Of these articles, all reported a trend towards improving rates of endocrine cure between early and late surgical groups with 3 articles [8, 9, 20] demonstrating a statistically significant change. Of the remaining 3 articles, 2 [7, 12] articles only reported on non-functioning pituitary adenomas and 1 article [15] did not report endocrine cure outcomes.

Rates of endocrine outcome or dysfunction for non-functional adenomas were reported in 8 articles [2, 3, 7,8,9, 12, 15, 20]. Of these, 6 articles [2, 3, 7, 12, 15, 20] reported on how the rate of endocrine outcome changed between surgical groups. Three articles [2, 3, 7] report on the rate of new or worsened anterior pituitary insufficiency/hypopituitarism between early and late surgical groups. These outcomes are as follows: Kenan et al. 5 to 0%, Leach et al. 17 to 25%, Boetto et al. 13 to 4.3%. Two articles [8, 9] report that “all patients who were eupituitary preoperatively remained eupituitary postoperatively”. One article [20] reported on the rate of postoperative patients that were eupituitary between surgical groups, demonstrating a trend to improve rates over time. One article [15] reported the rate of new panhypopituitarism between surgical groups, 11% in the early group and 13% in the late surgical group. One article [12] examined all pituitary hormones postoperatively and reported the following: 18.7% eupituitary pre- and postoperatively, 6.3% normalised, 15.4% improved but not normalised, 27.2% unchanged, 32.9% worsened. This article also reported an OR 1.23 (0.65–2.32) for predicting if surgical experience affected endocrine outcomes. Two articles [11, 17] did not report endocrine outcomes for patients with non-functional adenomas.

Rates of permanent diabetes insipidus were reported in 8 articles [2, 3, 7,8,9, 11, 12, 15]. Of these articles, 4 articles [2, 3, 7, 15] report on how the rate of permanent DI changes between surgical groups. The remaining 4 articles [8, 9, 11, 12] report the overall rate of permanent DI. The rates are low ranging from 0 to 6% and therefore do not demonstrate any trends.

Surgical technique used was clearly defined in all articles [2, 3, 7,8,9, 11, 12, 15, 17, 20]. These techniques did not change during the study period.

Study quality

Overall study quality was determined to be moderate in 3 articles [7, 9, 12] and low in 7 articles [2, 3, 8, 11, 15, 17, 20], these are demonstrated in Table 5. No articles were of high methodological quality. Common features between articles of low study quality included defining the method of quantitative visual assessment, defining tumour size and tumour invasiveness radiologically, defining the method of endocrine assessment for functioning and non-functioning adenomas, defining the timing interval for when visual assessment, endocrine assessment and postoperative imaging occurred, defining the complete hormonal function of non-functioning pituitary adenomas and reporting what further treatments if any were required after long-term follow-up.

Table 5 Quality assessment consensus table

Discussion

Rates of gross total resection

Overall, this review demonstrates that 6 articles [2, 9, 11, 12, 17, 20] in the literature report a statistically significant improvement in the proportion of patients receiving a gross total resection with increased surgical. The remaining articles [7, 8, 15] demonstrated a trend towards improvement, but it was not significant. Based on these findings, gross total resection appears to be an outcome that follows a learning curve. This may be explained by surgeons becoming more comfortable attempting resection of residual disease adherent to neurovascular structures, or becoming more adept at using the endoscope to visualise tumour remaining in obscured regions of the operative field. It is not possible based on the variety of methodologies to determine in greater detail the relationship between surgical experience and degree of resection. This would be challenging to analyse given the heterogeneity of pituitary adenomas and the different surgical goals depending on the case.

CSF leak rate

Overall, this review demonstrates no statistically significant association with rates of postoperative CSF leak as surgical experience increases. For the articles that did report CSF leak rate in each surgical group, there were 2 articles with increased rates in later surgical groups, and the other 4 articles reported stable or improved trends. It is not possible to make any recommendations given the small numbers of postoperative CSF leak rates between different articles. Another confounding factor when assessing this outcome is the variability amongst studies in who was performing the reconstruction and closure following tumour resection: in some studies, it was performed by neurosurgeons, in some by ENT surgeons and in one series was initially performed by ENT but than later began to be performed by the neurosurgeons. Previous literature has shown that the rate of postoperative CSF leak more broadly reduces over time as skull base teams gain experience [22].

Visual outcome

Our review demonstrates 2 articles [3, 12] that show surgical experience significantly increases the chance of visual improvement and/or recovery. The remaining articles also demonstrate improvement, but it is not statistically significant. Due to the variability in how visual outcome is reported between articles, it is not possible to determine whether it improves with surgical experience.

Endocrine outcome

Overall, our review demonstrates that endocrine outcomes are variably reported in the literature making direct comparisons between articles challenging. A surgical learning curve is easier to demonstrate for functional adenomas compared to non-functional adenomas. Three articles [8, 9, 20] demonstrate a statistically significant increase in the proportion of patients achieving an endocrine cure with increasing surgical experience.

Non-functional adenomas were reported less frequently in the literature. Only 1 article [12] examined all pituitary hormones postoperatively and reported whether patients improved, stabilised or worsened.

Learning curve

This systematic review demonstrates that there are some outcome variables that do improve with surgical experience. These include degree of gross total resection, visual outcome and rate of endocrine cure for functional adenomas. It is not clear what drives these improvements, but may relate to increased surgeon confidence and aptitude in accessing more difficult-to-reach components of the tumour, thereby allowing more complete resection and potentiating decompression of the visual apparatus or removal of the hypersecreting adenoma. However, the way these articles examine surgical experience is not consistent and is largely based on arbitrary post hoc analysis. In addition, each outcome variable is not reported consistently between articles. This is most significant for outcomes such as endocrine function in non-functioning adenomas or visual outcomes.

Furthermore, it is important to note that the combination of each surgical team varied between articles. This may have affected the individual results of each article. Unfortunately, further analysis of this is not possible for the reasons stated earlier.

If more research is to be undertaken into understanding the learning curve for endoscopic pituitary surgery, a more rigorous and systematic approach to outcome reporting is required. This will enable accurate and translatable assessments of outcomes between articles. Characterisation of what outcomes have a longer learning curve may help focus training on particularly difficult components of the surgery. This training could be enhanced through the use of novel surgical training tools such as surgical simulation models [23].

We have developed an outcome reporting tool that we have implemented in our institutional skull base unit to facilitate systematic and standardised data collection about patients having pituitary surgery (refer to Table 6). The tool is pragmatic and easy to complete, but contains a variety of clinically significant variables.

Table 6 Proposed prospective data collection tool for patients undergoing endoscopic pituitary surgery. Data points entered are either dichotomous yes/no answers (previous radiotherapy), numerical grade or size (Knosp grade, maximal tumour diameter), multiple choice single answer (endocrine hormone function), or multiple choice multiple answer (reconstruction technique)

Limitations

There are several limitations to this review. The main limitation is that there was significant variation in the outcomes that studies reported, and heterogeneity in the way each outcome was measured. This precluded meta-analysis that could quantitatively assess the learning curve and how it impacted individual outcome measures. Included studies were retrospective in nature with the attendant bias related to the choice of the outcome measures after the outcomes had occurred.

Conclusions

We have demonstrated that a learning curve exists for some outcome variables for endoscopic pituitary surgery. However, there is significant heterogeneity in the current body of literature which makes clear comparisons difficult. If more research is to be undertaken to better define factors involved in shaping the learning curve for endoscopic pituitary surgery, we would recommend a rigorous and systematic approach to outcome reporting. Prospective observational studies may be the best study design to investigate this learning curve. Better defining this surgical curve will help improve patient safety by allowing more targeted and efficient training for surgical trainees.