Introduction

Retropubic mid-urethral sling (MUS) surgical techniques vary among surgeons, with use of a Mayo Scissor as a spacer being most commonly used [1, 2]. There is limited research on optimal tensioning methods for achieving favorable patient outcomes [1, 3, 4]; existing studies assessing intra-operative sling tensioning primarily focus on outcomes at 1 year. A retrospective chart review comparing Kelly clamp versus Babcock clamp tensioning found similar rates of post-operative urinary retention and surgical revision, though without reporting on stress urinary incontinence resolution [4]. A subsequent randomized clinical trial (RCT), the "MUST trial," demonstrated that use of a Babcock clamp holding a fixed loop of the sling to determine tensioning resulted in comparable objective and subjective SUI cure rates with the more commonly known Mayo Scissor spacer tensioning, with lower rates of intervention for post-operative bladder outlet obstruction and overactive bladder (OAB) symptoms, but a potential increase in vaginal mesh erosion [3].

These findings suggest that the Babcock clamp technique applies less tension, yet it remains unclear if this translates to discrepancies in longevity between the techniques, given the known decline in MUS cure rates over time [5,6,7]. There is debate regarding degradation of polypropylene mesh after implantation [8,9,10], leaving the possibility that failed MUS over time could be partially due to loss of structural integrity. Additionally, given that age has been implicated in recurrent SUI [11], failure of MUS over time could be resultant of development of intrinsic sphincter deficiency or generalized age-related weakening of the urethral sphincter in the years after MUS insertion. For both possibilities, a “tighter” MUS may provide benefit for long-term SUI cure, despite higher rates of post-operative urinary retention. Post-operative uroflowmetry indicated higher rates of subclinical bladder obstruction with the "Scissor" technique through slower mean flow rates. The likelihood of subsequent surgical intervention among patients exhibiting subtle obstruction such as this remains unknown, given that > 50% of MUS revision surgeries occur more than 1 year post-insertion [5]. Moreover, sustained increases in post-MUS vaginal erosion rates have not been observed since the original MUST trial, which has prompted inquiries into the origin of the discrepancies in this outcome seen in the initial study.

We initiated a cohort study to conduct a 5-year follow-up on the original MUST trial participants. We hypothesized that the “Scissor” technique would have higher rates of self-reported SUI cure at the 5-year follow-up. Goals of the study were to ascertain whether differences between the two tensioning techniques developed with respect to SUI cure, if differences existed between tensioning groups with respect to repeat surgical intervention, and to explore possible reasons for the elevating vaginal erosion rate seen in the original trial.

Materials and Methods

All 318 participants included in the original trial were eligible for cohort follow-up. Participant characteristics and details of the original surgical technique have been previously published [3]. Original participants were contacted approximately 2 months before their 5-year surgical anniversary to inform them of the follow-up study and to seek consent to repeat participation. Given that the 5-year follow-up window occurred during the COVID-19 pandemic, the cohort was developed with primary data collection being collected and managed using REDCap electronic data capture tools hosted at the Clinical Research Unit (CRU), Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada [12, 13]. For those participants who preferred to complete paper copies of the questionnaires, these were delivered and returned by postal mail with subsequent data entry into REDCap. This approach avoided in-person contact as there was a moratorium on in-person research visits within the relevant university and health care institutions. The original MUST RCT utilized an in-person examination to objectively measure the presence or absence of urine loss using a standardized cough-stress test physical examination [3]. Given the aforementioned COVID-19-related organizational directives to minimize unnecessary physical contact, we were unable to include this outcome in this study. However, results from previous MUS trials demonstrate high correlation between patient-reported and clinician-measured outcomes of SUI [3, 7, 14], supporting the decision that patient-reported outcomes in isolation would generate important data relevant to clinical decision making.

To ensure that all surgical interventions related to the original MUS insertion were included in the analysis, and to limit recall and attrition biases, administrative health data were linked to all original 318 trial participants. Administrative data sources included the Discharge Abstract Database (DAD), which captures all inpatient surgical procedures and hospital admissions, and the National Ambulatory Care Reporting System (NACRS), which captures all same-day surgeries and emergency room visits. These datasets adhere to International Classification of Diseases, Canadian Version-10, and the Canadian Classification of Health Intervention standards of coding [15, 16], and have previously been utilized to study longer term outcomes of MUS surgery in Canada [5, 17, 18].

Participants maintained classification by the randomization groups from the original study. Group demographics were compared by absolute standard difference (ASD) to test for a continued balance between randomization groups [19]. In keeping with the original trial, the primary outcome for the 5-year follow-up was bothersome versus nonbothersome SUI symptoms, defined by a score of ≥ 2 on Question 3 of the Urinary Distress Inventory (UDI-6) [3, 20]. Secondary outcomes included bothersome versus nonbothersome OAB symptoms, as defined by a score of ≥ 2 on either Question 1 or Question 2 of the UDI-6; summary scores of the UDI-6, Incontinence Impact Questionnaire (IIQ-7) [20], and International Consultation on Incontinence Questionnaire Female Lower Urinary Tract Symptoms Long Form (ICIQ-FLUTS LF) module questionnaires [21]; as well as the rate and timing of repeat surgery related to SUI symptoms and/or complications from the original mid-urethral sling. Mean difference (MD) and 95% confidence interval (CI) were calculated for proportional and numerical data; repeat surgeries were expressed as both incidence rate and cumulative incidence, as defined by the time between the original MUS surgery and the subsequent surgery [5].

Subsequent surgery for SUI and/or mesh complications was determined through patient report and administrative data. Validated Canadian Classification of Health Intervention (CCI) codes, which indicated a subsequent surgery for SUI treatment and/or to remove or revise implanted surgical devices or mesh, removal of a urethral foreign body, urethral dilation, retropubic or transvaginal urethrolysis, or repair of a urethrovaginal fistula were used to flag subsequent surgery in administrative data [5]. After identification of subsequent surgery by either participant report and/or occurrence of a validated code in the administrative data sources, the specific case was confirmed by chart review to determine the nature of the procedure confirmed. Descriptive statistics reported the frequencies of diagnostic International Statistical Classification of Diseases and Related Health Problems, 10th Revision, Canada (ICD-10-CA) codes and the CCI codes represented by these procedures.

A manual review of all mesh erosion cases recorded in the dataset was performed, in response to questions that have arisen since the original MUST paper as to explanation of mesh erosions in the original Babcock group that were not sustained after conclusion of the RCT. Given that Babcock was a novel technique for the study surgeons, we classified each Babcock case relative to the surgeon’s first Babcock case in the trial to explore for any temporal relationship in mesh erosion and introduction of the Babcock technique into a surgeon’s practice.

An a priori power calculation was performed for this study’s primary outcome, assuming a follow-up rate of 75% at 5 years (n = 238 over both arms) based on prior MUS follow-up studies [7], as well as an SUI cure rate of 69% at 5 years using Question 1 of the UDI-6 [7] and a non-inferiority limit of 15% [22], which has previously been determined as the threshold for surgeons to favor one surgical approach to the treatment of SUI over another [1, 14]. Calculations revealed that a sample of this size would have > 80% power with alpha = 0.05 to exclude a difference in participant-reported SUI cure.

Although all the original trial participants are biologically female, the original MUST trial methodology did not collect self-identified gender as part of demographic characteristics. Realizing this oversight, at the 5-year follow-up we did collect self-reported gender identity, as well as a measure of gender expression, in alignment with expert opinion that incorporation of the complexities of gender should be improved in health research [23, 24]. Given that within a gender identity, an individual may report differing levels of femininity and masculinity, measurement of gender expression is an analytic approach to better understanding how the nuances of gender are connected to experiences of health [25]. A secondary analysis of gender expression as a binary variable of gender polar with exclusively feminine traits (only feminine scores reported, masculine score of zero) or diversity in gender expression with mixed feminine and masculine traits (≥ 1 feminine score, ≥ 1 masculine score) was planned given that gender expression has previously been associated with patient perceptions in pelvic floor disorders [25]. Additionally, sensitivity analyses using a more conservative definition of bothersome SUI and OAB symptoms as defined by a score of ≥ 1 on corresponding questions of the UDI-6 were planned to ensure robust findings.

The original study protocol was registered at ClinicalTrials.gov (NCT02480231), and inclusion/exclusion criteria and technique details have been previously described [1, 3]. This study obtained ethics approval from the relevant university ethics board. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement for reporting of observational studies was followed [26].

Results

The follow-up cohort recruited 260 (81.8%) of the original participants to complete questionnaires at 5 years post-surgery. The most common reason for nonparticipation was the inability to make contact with the original participant. Of these participants, 22 were considered lost to follow-up at 12 months but opted to provide follow-up data at 5 years post-surgery (Fig. 1). There was no difference in the proportion of individuals completing self-reported questionnaires at 5 years between the original Babcock (n = 127) and the Scissor (n = 133) groups (p = 0.759). Administrative data linkage was available for all of the 318 original participants.

Fig. 1
figure 1

Consolidated Standards of Reporting Trials flow diagram. Asterisk indicates responses to at least one outcome

At 5 years post-surgery, the only demographic that statistically differed between groups was marital status, with a higher proportion of participants in the Scissor group now reporting being divorced, separated, or widowed (p = 0.049). As information on marital status was not collected for the original trial, it is unknown if this difference was present at allocation or whether it occurred as a change in relationship status over time. The overall balance in participant characteristics demonstrated that the randomization from 5 years prior was still valid to address confounding. At 5-year follow-up, all participants self-reported their gender as “woman” (Table 1).

Table 1 Patient characteristics at baseline and 5-year follow-up by two mid-urethral sling tensioning techniques (N = 260)

No statistically significant differences existed in the primary and secondary outcomes, nor in the sensitivity analysis using a more conservative definition of OAB and SUI symptoms (Table 2). Analysis stratified by gender expressions indicated a lower proportion of bothersome OAB symptoms (p = 0.022) and lower median score of ICIQ-FLUTS LF (p = 0.039) in the Babcock group among gender polar women (Table 3).

Table 2 Primary outcomes at 5-year follow-up (N = 260)
Table 3 Primary outcomes at 5-year follow-up stratified by gender expressions (N = 256a)

There were 29 administrative data records indicating subsequent SUI- and/or MUS-related surgical intervention after index MUS and 11 reported by participants as related to their original MUS and/or recurrent SUI. This represented 29 individuals, of whom 8 underwent multiple SUI- and/or MUS-related surgical procedures in the 5-year follow-up window. Analysis of repeat surgeries demonstrated similar cumulative incidence of repeat surgery at 5-year follow-up, with the majority of cases occurring in the 1st year after the original procedure (Table 4). Manual review of the repeat surgical intervention revealed that mesh erosions occurred for 4 of the original 7 study surgeons, notably excluding those surgeons with the highest volume of cases in the trial. A pattern was demonstrated with all cases resulting in vaginal mesh erosion occurring within the 1st year of the attending surgeon’s first Babcock case. Notably, 71.4% of the “mesh erosion” cases took place within the first 6 months of the attending surgeon’s index Babcock-tensioning case.

Table 4 Incidence rates of subsequent surgeries up to 5 years post-MUS tensioning by Scissor and Babcock (N = 318)

Discussion

This follow-up of the MUST trial compared prevalence of bothersome SUI symptoms at 5 years after retropubic MUS surgery tensioned by the Mayo Scissor versus the Babcock clamp technique. In addition to self-reported SUI symptoms, median scores of three validated bladder questionnaires and rates of subsequent surgery related to the participant’s original MUS procedures and/or on-going SUI symptoms were additionally compared. No differences were found between the two groups; rates of self-reported SUI cure at 5 years remained high.

Strengths of this study include longer-term follow-up of an RCT. Exploration of participant demographics at 5 years demonstrated minimal differences between the two groups, suggesting that analyzing within the original allocation was still valid and that the randomized approach would continue to mitigate confounding. Although follow-up at 12 months is the most common follow-up period for MUS RCTs [27], we designed this longer-term study in response to the data paucity and resulting calls for longer follow-up studies as “essential” to understanding the benefits and risk of MUS [27]. Additional strengths include the ability to engage participants previously lost to follow-up at 12 months, and the universal linkage of administrative data to all original participants, reducing attrition and recall bias. Limitations of the study include the lack of in-person follow-up. Although previous work at our institution suggests high correlation between subjective participant-reported and objective cure defined by standardized cough stress testing [3, 7, 14], the lack of an in-person study visit precluded the ability to obtain a physical examination and collect the uroflowmetry parameters, which were part of the original trial [3]. Owing to the recruitment profile of the original RCT, this follow-up may have limited generalizability to very young patients, those with extreme body habitus, marginalized populations, and patients with more complex incontinence or prior SUI surgeries. The secondary analysis by gender expression improves generalizability through recognition of the fact that expressions of femininity and masculinity varies among cisgender women; the finding of a difference in OAB scores in the Babcock arm requires further research to understand how one’s self-expression intersects with lower urinary tract symptoms.

This study also builds upon the existing literature on retropubic MUS tensioning [3, 4], demonstrating that cure of SUI symptoms at 5 years is not associated with the specific tensioning technique. Differences in proportions of individuals reporting OAB in the original 12-month study subsequently attenuated and were no longer statistically significant in the 5-year data. Of note, > 10% of participants reported bothersome OAB at the 12-month follow-up, yet at 47.3%, the proportion was noticeably higher 5 years later. This raises the possibility that health changes in the years following MUS insertion, such as those related to aging [28, 29], may have a greater impact on OAB rates than the MUS tensioning technique.

This study offers important insights into surgical decision making. Given that urinary retention may require additional interventions such as catheterization or sling loosening, [30], which results in additional burden on the patient, the favorable findings of reduced post-operative bladder outlet obstruction with the Babcock clamp in the original MUST trial are now bolstered by this study’s reassuring data on 5-year efficacy. As both tensioning techniques were found to have similar 5-year outcomes, continence rates, and patient satisfaction, surgeons can balance this information with their comfort managing post-operative urinary retention to make informed choices regarding sling tensioning methods and optimize individual patient outcomes and satisfaction following MUS procedures.

Additionally, this study suggests a mechanism to explain the signal regarding vaginal mesh erosion reported in the original MUST trial. Mesh erosion cases appeared to have occurred early in a surgeon’s exposure to the Babcock technique. The team’s interpretation is that mesh erosion rates declined after surgeons began “tapping” the MUS mesh into the sub-urethral space upon releasing the Babcock clamp, prior to incision closure. This purposeful step was adopted while the MUST trial was underway, as it effectively flattens the mesh loop and moves the mesh out of direct contact with the vaginal epithelium. Since conclusion of the MUST trial, the Babcock technique has become the prevalent tensioning mechanism at the study’s main institution, without evidence of elevated mesh erosion rates. The study team hypothesizes that the cause of erosions might have been the proximity of the folded mesh upon immediate release of the clamp to the healing incision and/or the edge of the MUS being non-intentionally caught in the suture closure. Given that the reproducible aspects of the Babcock technique are advocated to be of most benefit to surgeons earlier in their career, or those who perform a low volume of MUS, a video of the technique has been developed that includes this “tapping” step (Video).

In conclusion, both the Scissor and Babcock techniques for tensioning retropubic MUS yield comparable and favorable rates of SUI cure at 5 years. Furthermore, there is no sustained discrepancy in symptoms of OAB, disease-specific quality-of-life scores, or the need for subsequent surgeries between the two techniques. Given that the Babcock technique demonstrates lower rates of immediate post-operative urinary retention, there are potential early post-operative benefits to its use and surgeons can confidently consider incorporating the Babcock technique into their MUS practice as an additional intra-operative tensioning option.