Introduction

Dynamic structural abnormalities of the anorectum and pelvic floor can cause symptoms of obstructed defecation and fecal incontinence [1], and are found in an important subgroup of patients with chronic constipation [2, 3]. The most common abnormalities (either singly or together) are rectocele and intussusception [3, 4]. While parameters for diagnosis and intervention vary [1, 5, 6], structurally significant rectoceles and high-grade intussusception (i.e. those descending to the level of anal canal or beyond) [1] may benefit from surgical repair in well-selected patients. Procedures broadly aim to reinforce the rectovaginal septum (mainly focused on rectocele) [7,8,9], excise part of the rectal wall (most commonly using stapling devices) [10,11,12], or suspend the rectum (mainly forms of rectopexy) [13, 14]. The varying popularity of numerous procedures to address these problems reflects the fact that no single approach has achieved obvious clinical primacy and also that there is no high-quality evidence base for decision-making [15].

Laparoscopic ventral mesh rectopexy (LVMR) was first described for external rectal prolapse (ERP) in 1992 [16], and has progressed into international practice as a relatively safe, minimally invasive approach to internal rectal prolapse (IRP) with or without rectocele [17,18,19,20]. While some large patient series provide general support for LVMR in populations of patients with a mix of symptomatic presentations (obstructed defecation or incontinence) due to ERP or IRP [17,18,19, 21, 22], the utility of LVMR for patients with obstructed defecation and IRP is not well-supported by published evidence. Indeed, a previous UK National Institute for Health Research-funded systematic review included only 18 studies with a total of 1238 patients [23]. Of these, the vast majority of included studies provided only level IV (Oxford) evidence. Furthermore, outcomes have generally been based on poorly validated measures, e.g. patient global rating scales and the obstructed defecation syndrome (ODS) score [24,25,26], which were originally developed to evaluate the effect of surgery [27, 28]. There is concern that objectively determined long-term outcomes of LVMR using validated measures will not match those from enthusiasm-driven case series (as has been the case for numerous other surgical procedures for chronic constipation) [29], and this question has become more important with the international scrutiny of mesh-related complications in general. [30].

Therefore, a UK-based multicenter randomized controlled trial (RCT) was designed to determine the clinical efficacy of LVMR compared to controls at short-term follow-up (24 weeks). Secondary objectives were to determine: (1) the clinical effectiveness of LVMR in the medium-term (to 48 weeks to a maximum of 72 weeks), and (2) preoperative determinants of outcome. A detailed description of the study protocol was published elsewhere. [31].

Materials and methods

Patients

This UK multi-institutional RCT was conducted from March 1, 2015, to January 31, 2019, as part of a UK National Institute of Heath Research-funded programme grant (PGfAR: RP-PG-0612-20001) aimed at developing the evidence base for the management of chronic constipation in adults, which is currently lacking [32].

A stepped-wedge randomized trial design permitted observer-masked data comparisons between patients awaiting intervention with those who had undergone surgery. Contrary to most stepped-wedge trials individual patients were randomized rather than utilizing cluster sampling. This is, in effect, a modification of a standard parallel-group, waiting-list control design, but with several advantages. First, a stepped-wedge design is more efficient and thus improves recruitment feasibility (the bane of nearly all surgical trials). Simulation demonstrated that a parallel-group design required a much larger sample size than that proposed for the current study at the same power. Second, the trial design meant that there was only a one-in-three chance (rather than one-in-two chance for a parallel group) of waiting 6 months for surgery, which was more acceptable to patients. The study received national ethical approval (15/LO/0609) and all patients provided their written informed consent. The study was registered with the ISRCTN Registry (ISRCTN11747152 [https://doi.org/10.1186/ISRCTN11747152]).

Eligibility criteria were: (1) age 18–70 years; (2) self-report of problematic constipation; (3) symptom onset > 6 months prior to recruitment; (4) symptoms meeting the American Gastroenterological Association definition of constipation [33]; (5) refractory constipation after a minimum basic standard (lifestyle and dietary measures and ≥ 2 laxatives or prokinetics) tried with no resolution of symptoms and no time requirement; (6) ability to understand written and spoken English (due to questionnaire validity); (7) ability and willingness to give informed consent; (8) failure of non-surgical interventions (minimum of nurse-led behavioral therapy) [34]; (9) IRP as determined by clinical examination and defecography, using the following criteria: (a) recto-anal or recto-rectal intussusception ± other dynamic pelvic floor abnormalities (e.g. rectocele, enterocele, excessive perineal descent); (b) deemed to be obstructing and/or associated with protracted or incomplete contrast evacuation by expert review. [35].

Exclusion criteria were: (1) significant organic colonic disease (red flag symptoms, e.g. rectal bleeding not previously investigated), inflammatory bowel disease, megacolon or megarectum (if diagnosed beforehand), severe diverticulosis/stricture/birth defects deemed to contribute to symptoms; (2) major colorectal excisional surgery; (3) current overt pelvic organ (bladder, uterus, and/or external rectal) prolapse or disease requiring obvious surgical intervention other than LVMR; (4) previous rectopexy; (5) sacral nerve stimulator in situ; (6) rectal impaction (as defined by digital and abdominal examination); (7) significant neurological disease (e.g. Parkinson’s disease, spinal injury, multiple sclerosis, diabetic neuropathy); (8) significant connective tissue disease (e.g. scleroderma, systemic sclerosis, systemic lupus erythematosus [not hypermobility alone]); (9) significant medical comorbidities and activity of daily living impairment (Barthel index ≤ 11); (10) major active psychiatric diagnosis (e.g. schizophrenia, major depressive illness and mania); (11) chronic regular opioid use (at least once daily) deemed to be the cause of constipation based on temporal association of symptoms with onset of therapy; (12) pregnancy or intention to become pregnant during study period; (13) known severe intra-abdominal adhesions.

Final review by pelvic floor multidisciplinary decision team (as per National Health Service [NHS] England recommendation) [36] to confirm appropriateness for surgery was performed for all patients.

Randomization and masking

Participants were randomized to three arms with different delays before surgery (Fig. 1). In group I, LVMR was performed at T0; in group II, at 12 weeks (T12); in group III, at 24 weeks (T24). In all arms, there was a period of 4 weeks post-eligibility screening to arrange the logistics of surgery and ensure that patients had returned to their normal life routine after various assessments. Randomization was stratified by center. The Pragmatic Clinical Trials Unit (PCTU) at Queen Mary University of London developed a validated online randomization system, which was accessed by suitably trained and delegated researchers at recruiting sites and followed the PCTU-approved standard operating procedure for the study.

Fig. 1
figure 1

The CapaCiTY trial 3 Consolidated Standards of Reporting Trials (CONSORT) flow diagram. One patient did not undergo surgery; this patient continued to participate and was included in analysis on intention-to-treat principles

Patients and clinicians were necessarily aware of allocation to different waiting times. However, to minimize bias, a blinded researcher collected outcome data. For quantitative analysis, an analysis plan was developed and signed off by investigators and statisticians who were blind to allocation status and index intervention.

Intervention

LVMR was performed according to a standard technique [31, 37, 38], starting with a peritoneal incision at the level of the sacral promontory and extending caudally (avoiding the hypogastric nerves along the side of the mesorectum) to the deepest part of the pouch of Douglas, and continued down the rectovaginal septum to the pelvic floor. The mesh was sutured to the ventral aspect of the distal rectum and further fixed to the lateral seromuscular borders of the rectum proximal and distal to the incised pouch of Douglas ± pelvic floor. If deemed necessary, the posterior vaginal fornix was elevated and sutured to the anterior aspect of the mesh to allow closure of the rectovaginal septum and correction of a mid-compartment prolapse (if present). The type of mesh inserted was left to surgeon’s choice (not being dependent on any specific clinical grounds). All participating surgeons had performed a minimum of 50 LVMR previously.

Surgery was performed as a day case or short stay procedure [39]. Postoperative management followed routine clinical care. Laxative use was standardized to a weaning course of macrogol transdermal delivery system (TDS) immediately postoperatively for 1 day, then reduced according to ease of bowel movements.

Surgical quality assessment

Adherence to the agreed procedural technique for the first included patient from each center was independently remotely assessed by a delegated surgical team provided by the Pelvic Floor Section of the Association of Coloproctology of Great Britain and Ireland. Monitoring and quality control were conducted remotely via video submission and assessed against the standardized LVMR protocol and defined assessment criteria [31, 37, 38]. Monitoring took the form of planned, random and triggered sessions (Supplementary Table 1).

Outcomes

The primary clinical outcome was Patient Assessment of Constipation Quality of Life (PAC-QOL) score [40]. This widely used, psychometrically robust measure of overall treatment response with concurrent validity to patient global ratings of success has been used by previous behavioral therapies and surgical trials, including LVMR [41], in chronic constipation [42]. For a chronic condition such as chronic constipation, a difference of 1.0 point in the primary outcome (score range = 1–4, with higher scores meaning higher negative effects on quality of life) was considered clinically important and also the minimum required to justify the cost and invasive nature of LVMR, or of a more complex and expensive treatment [43].

Secondary outcomes measures included 14-day diary data prior to each assessment (to record bowel frequency, whether each evacuation was ‘spontaneous’ [no use of laxatives] and/or ‘complete’, concurrent medication, health contacts, time away from normal activities including work, since the patient’s last visit), Generalized Anxiety Disorder scale (GAD-7) [44], Patient Health Questionnaire-9 (PHQ-9) [45], St Marks incontinence score [46], Pelvic Organ Prolapse/Urinary Incontinence Sexual Questionnaire (PISQ-12) [47], avoidant and ‘all or nothing’ behavior subscales of the chronic constipation Behavioral Response to Illness Questionnaire (CC-BRQ) [48], the Brief Illness Perception Questionnaire (BIPQ) [49], the EuroQol Visual Analogue Scale (EQ-VAS), the EuroQol Health Outcome Measure (EQ-5D-5L) [50], and the global patient satisfaction/improvement score on a five-point Likert scale. LVMR has a number of specific complications in addition to the general risks of surgery. These were recorded for outcome reporting. The study (not being of a medicinal product) did not record unrelated adverse events.

Participant, surgeon and research staff experience was investigated through individual digitally recorded telephone or in-clinic interviews up to 1 year after surgery with a purposively selected sample to represent a range of demographics. Separate consent was taken for interviews. Data were analyzed using a pragmatic thematic and qualitative analysis.

Follow-up

The study duration allowed for follow-up to a maximum of 96 weeks (i.e. 24 months) with data collection at 0, 12, 24, 36, 48 weeks post run in (stepped wedge) and thence at 12-week intervals within the cohort assessments at 60, 72, 84 and 96 weeks post run in. Thereafter, participants left the study and returned to ‘routine clinical care’ as determined within their local National Health Service institution (or were recruited to subsequent trials).

Statistical analysis

The sample size was calculated using the primary clinical outcome [40] by simulation using the ‘simsam’ package in Stata® V.14.2 (Stata Corporation, College Station, Texas, USA). Using a stepped wedge design, we hypothesized that PAC-QOL score at any time point during follow-up will be approximately 1.0 point lower (better) than preoperative participants. We assumed that PAC-QOL score followed a normal distribution over all time points with a standard deviation (SD) of 1.5 points and with a correlation between repeated assessments equal to 0.5 points. Simulation showed that detection of a 1-point difference in PAC-QOL score at 6 months with 95% power (purposely chosen to reflect the magnitude and risk of intervention) at the 5% significance level required 34 participants in each of the three arms. Allowing for a 10% loss to follow-up, a sample size of 38 was needed per arm, for a total sample size of 114 patients across the 3 arms. Should the correlation between repeated assessments be < 0.5 points, a sample size of 114 will still provide at least 90% power for the study. This was calculated using the same simulation procedure with correlations of 0.3 and 0.1 points.

The primary outcome was analyzed as a continuous variables on intent-to-treat basis at 24 weeks post-surgery. PAC-QOL scores at the time-points T0, T12, T24, T36, T48, T60, and T72 weeks post run in period in the three arms were analyzed using a mixed linear regression model, adjusting for a random effect of participant and a fixed effect of time since randomization, to estimate mean differences between PAC-QOL score before and after LVMR. To model the effects of surgery, dummy variables were used to indicate if participants had already received treatment before each follow-up time. The Kenward–Roger correction was employed to account for inflated type I error rates due to the small sample size. The contrast of primary interest was between the score at 24 weeks after surgery and the score at baseline. Some outcomes were scores calculated by summing the responses to all the questions in a questionnaire. If fewer than half of the questions were unanswered the missing responses were imputed with the mean of the available cases. All outcomes were analyzed under a ‘missing at random’ assumption (i.e. assuming that ‘missingness’ depended only on outcomes that had been observed). Patient Assessment of Constipation‐Symptoms (PAC-SYM) scores [51] were analyzed by the same approach as above. Binary outcomes were summarized at 24 and 48 weeks post-surgery with number and percent indicating problems and an odds ratio comparing 24 and 48 week outcomes to baseline. Data were analyzed using Stata® V.14.2 (Stata Corporation, College Station, Texas, USA).

Results

From March 1, 2016, to January 31, 2019, of 42 eligible patients, 28 (100% females) were randomized from 6 UK institutions (Fig. 1), representing a significant under-recruitment of the 114 patients required by sample size calculation. The main reason for poor recruitment was the evolution of the mesh controversy during the trial [30], which stopped many centers from delivering the procedure and many patients coming forward for surgery.

Of the 28 patients, 9 were assigned to the T0 arm, 10 to the T12, and 9 to the T24 arm (Fig. 1). The characteristics of the 28 patients are presented in Table 1. There were no substantial differences in baseline characteristics between trial arms. The outcome measures at baseline, collected before the 4-week run in period, are summarized in Supplementary Table 2. Biologic mesh was used in three patients (one patient per group).

Table 1 Demographic and clinical characteristics of randomized patients

Safety analyses

There were no conversions to open surgery. Thirty adverse events were reported by 16 patients, of which 20 were considered to have possible causality related to surgery (Supplementary Table 3); however, none had any long-term sequelae. There were five serious adverse events of which four were deemed to be related to surgery. Three of these were for postoperative pain (expected), one was for pneumonia and none resulted in long-term patient harm (Clavien–Dindo I). No patients had mesh erosions.

Clinical effectiveness

Two patients dropped out of the study before the primary end point and a further 7 failed to complete the primary outcome, which was, therefore, completed by only 19 patients. There was a substantive reduction in estimated PAC-QoL score at 24 weeks compared with the baseline of 1.09 points (95% CI – 1.76, – 0.41], p = 0.0019), exceeding that sought by design (1.0 points). A similar magnitude of change was observed for the modelled secondary outcome (i.e. PAC-SYM score, – 0.92 [95% CI – 1.52, – 0.32], p = 0.0029) (Table 2). Reductions in scores were sustained at later time points, accepting a strong chance of attrition bias.

Table 2 Total PAC-QoL and PAC-SYM scores at baseline and follow-up points post-surgery, with 95% CI and p value for change from baseline to each follow-up point

Secondary outcomes are shown in Table 3, with positive directional effects for nearly all outcomes, with some quite substantial improvements in measures, including > 25% scalar improvements in psychological measures (PHQ-9 score, GAD-7 score, St Mark’s Incontinence Score and EQ-VAS score). Global patient satisfaction was 2.7 points at 24 weeks (i.e. closest to ‘very satisfied’), although this dropped to 2.2 points (i.e. closest to ‘moderately satisfied’) at 48 weeks. This result was mirrored in the global patient improvement score (EQ-VAS score 0–100 points between ‘no effect’ and ‘complete cure’), which was 72.2 points at 24 weeks and 56.5 points at 48 weeks.

Table 3 Continuous secondary outcomes, with unadjusted estimate of difference in mean scores at 24 and 48 weeks post-surgery compared to baseline

The results of the qualitative analysis are shown in Supplementary Table 4.

Discussion

Our analysis of clinical effectiveness showed a reduced symptom burden and improved disease-specific QoL. The magnitude of the effect of surgery (estimated reduction of 1.09 points in PAC-QoL at 24 weeks) was greater than the minimum clinically important difference sought by design (mean change 1.0 points) and this change was statistically significant. In addition, significant and clinically important improvements in PAC-SYM score, coexistent fecal incontinence, and bowel frequency provided further evidence of the benefit of surgery. The findings of the primary outcome showed a continued improvement for the duration of the study period (estimated 1.38-point reduction in PAC-QoL at 72 weeks), and were supported by a panel of secondary outcome measures, accepting inferential limitations posed by potential attrition bias.

The reduction in anxiety surrounding the use of mesh as time passes and the production of updated consensus guidance on patient selection and operative technique may make further study in this are feasible.

Although some adverse effects were reported, LVMR was safe and well tolerated overall. Patient experience of LVMR was, on the whole, positive. Some patients did not find surgery to be the ‘miracle cure’ that they were looking for and some reported negative experiences in the perioperative period, but for the most part these were not related to the operation itself. Some patients also found benefit from the dietary and behavioral changes that they initiated as a result of advice that they were given as part of the perioperative care package. The mesh controversy dominated staff experience. Developing one nationally recognized information sheet and LVMR surgical certification may assist with patient and surgeon anxieties in future.

Over the last decade, LVMR has become an increasingly popular surgical option for patients with high-grade IRP associated with symptomatic presentation manifest as constipation or incontinence. Table 4 shows data of 12 previous studies published since 2010, 7 of which were prospective [17, 21, 52,53,54,55,56] and 5 retrospective [18, 19, 25, 41, 57] in design. Notable amongst these is the large retrospective cohort study of Consten, et al. [18] which reported outcomes of LVMR in a cohort of 919 patients from 2 centers, with a medium-term follow-up (median, 34 months; range 4 months–12 years). Within the cohort, 677 patients had a main diagnosis of IRP. While some data were unsegregated by baseline phenotype (there were a mix of symptomatic presentations and prolapse type: IRP vs. ERP), the investigators reported resolution of ODS symptoms in approximately 70%. Cumulative risks of mesh complications were low (1.5% after 3, 2.9% after 5, and 4.6% after 10 years), particularly for mesh erosions or infection (1.5% at 10 years), as opposed to cumulative symptom recurrence rates, which were higher as compared to other studies (7.5%, 11.1%, and 14.3% at 3, 5, and 10 years, respectively).

Table 4 Studies reporting outcomes of laparoscopic ventral mesh rectopexy (LVMR) in patients with internal rectal prolapse (IRP)

In contrast to nearly all previous studies (Table 4), we explored disease-specific QoL using two validated instruments. Tsunoda et al. [55] used validated instruments (Short-Form 36 Health Survey [SF-36], Fecal Incontinence QoL scale [FIQL], and Patient Assessment of Constipation-QoL [PAC-QOL]) to assess quality of life after LVMR in 25 patients with IRP (all females) and 19 with ERP. Compared to the preoperative assessment, almost all the scale scores on the three quality of life instruments significantly improved over time. Gosselink, et al. [41] compared the functional results of LVMR for obstructed defecation secondary to high-grade IRP in 109 patients with normal and 42 with delayed colonic transit. Although preoperative PAC-QOL scores were higher (worse) in the latter group, the total PAC-QOL score was significantly improved in both groups at 12 months (p < 0.001). The Gastrointestinal Quality of Life Index (GIQLI) was also improved in both groups. The same authors showed equivalent GIQLI improvements in a series of 50 incontinent patients undergoing LVMR for high-grade IRP (p = 0.01) [21].

Limitations

Several limitations associated with our RCT warrant mention. The study was severely hampered by under-recruitment (28 out of a planned 114 patients). The media scrutiny of the use of mesh undoubtedly affected both patient and surgeon perception and willingness to take part in the study. Some centers paused or abandoned LVMR totally in the light of the mesh controversy, and there was a perception that for others the heightened scrutiny of practice in the protocol also negatively affected recruitment. Difficulties in attracting centers to recruit pre-dated the zenith of the mesh publicity and also reflected wide variation in practice across the UK in terms of both patient selection and LVMR operative technique. The attention we paid to strict inclusion and exclusion criteria (based on guidance from the Pelvic Floor Society, London, UK) [36] led to increased scrutiny in many centres where such surgery was previously being undertaken without rigorous application of these criteria. This regrettably led to a rapid revision (manifest as a drop-off) in the number of patients eligible for recruitment to the trial. With the media storm blowing up and the Cumberlege report in preparation [58], there was never a time when the results of this trial were more needed.

Included patients had a high symptom burden and long duration of symptoms that had been refractory to previous treatments, including a minimum of bowel habit training by a specialist practitioner. Patients had been thoroughly investigated and, therefore, could be considered both ‘hard to treat’ and ‘carefully selected’ for surgical intervention.

However, despite these setbacks, the main aim of the trial, namely to determine the effect size of surgery for the first time in a high-quality experimental design, and thus improve on the level IV evidence provided by 18 case series (as outlined in our systematic review) [23], was addressed, albeit at a lower than desirable level of statistical power.

Conclusions

Our results show substantial symptomatic benefit (more than we sought by design) to a cohort of highly selected patients from LVMR performed to a standardized technique.