Introduction

Small bowel obstruction (SBO) is common: It accounts for 20% of abdominal surgery in patients presenting with acute abdominal pain [1], and around one-quarter of SBO admissions culminate in surgery [2]. In the developed world, adhesions remain the most common cause. The limitations of plain abdominal radiography are now appreciated widely, and a seminal 1991 paper transitioned diagnosis towards CT scanning [3], which determines both the cause and level of obstruction more accurately. With improved diagnosis by CT, the pivotal clinical question then becomes whether surgery is necessary or not? The old surgical maxim of, “Never let the sun set on a small bowel obstruction,” has been replaced by a shift towards more conservative management, with around 75% of patients now avoiding an operation [2]. Nevertheless, untreated obstruction can culminate in irreversible mural ischaemia and intestinal perforation, a surgical catastrophe with considerable morbidity and mortality. The dilemma familiar to every general surgeon is thus: Operate too soon and expose the patient to unnecessary surgical risk; too late, and the patient is in extremis.

It is surprising that SBO prognostication still adopts a “try it and see” approach, bolstered by surgical experience. Failed conservative treatment, signs of peritonism, and clinical concerns for ischaemia may precipitate surgery [2]. While it is seemingly intuitive that a multivariable prognostic model incorporating clinical, laboratory, and imaging factors could predict surgical requirement, no models are used widely. While some authors have investigated radiological predictors of surgery, they have largely excluded clinical and laboratory predictors [4,5,6,7,8,9,10,11]. However, it is highly unlikely that surgeons will adopt a model that ignores fundamental clinical factors [12]. Accordingly, we performed a systematic review of clinical, laboratory, and imaging factors that might predict SBO resolution without the need for surgery. We then meta-analysed potential predictors to identify those most likely to contribute usefully to model development.

Materials and methods

Our institution does not require ethical permission for secondary research using primary literature. The research is reported according to the Preferred Reporting terms for Systematic Reviews and Meta-analyses (PRISMA) [13].

Target condition, search strategy, and study selection

We wrote a protocol and then developed and piloted a search string to identify imaging, clinical, and laboratory variables (including existing models) potentially predictive of SBO resolution without operative management (Online supplementary material 1). We used terms to identify studies of bowel obstruction/ileus. We included terms to identify prognostic research. We limited to adults and excluded narrative reviews, editorials, letters, etc. We searched the US National Library of Medicine PUBMED journal citation database (http://www.ncbi.nlm.nih.gov/pubmed) from 2001 to 2021 inclusive. This search was supplemented by a subsequent review of references cited by included studies. The search was performed in September 2021 by two radiologists (VE, DB), one of whom had > 10 years of experience in systematic review design, data extraction, and analysis.

Inclusion/exclusion criteria

We included studies investigating imaging variables potentially predictive of SBO resolution without the need for operative management (or, depending on how the research was framed, factors that predicted surgical intervention). We excluded studies limited to colonic obstruction; solely paediatric studies; studies solely of malignant obstruction; studies limited to early post-operative obstruction (a separate entity with different aetiology); studies limited to inflammatory bowel disease or chronic functional obstruction. We did not specifically set out to exclude studies that did not include intravenous contrast for CT scanning because we anticipated that the large majority would administer contrast, and contrast only influences one potential predictor variable, mucosal enhancement. We also anticipated that any representative consecutive series would include some patients in whom IV contrast is contraindicated but from whom other potential predictors could be extracted. We excluded studies reporting less than five patients in either operative or conservative groups, since these are underpowered.

Data screening and extraction

Potential studies were identified via scrutiny of the online title and abstract, and definite exclusions were discarded. The full text was obtained for those remaining potentially eligible. Uncertainty was resolved by face-to-face consensus meetings between all authors. Mindful of the distinction between prognostic and diagnostic data [12] and aware that studies frequently confound these, we piloted extraction on the initial 10 studies identified to ascertain potential literature quality and thenceforth review viability. We found data could be categorised into five broad headings: study design, patient characteristics, radiological predictors, non-radiological predictors (further subdivided into clinical or laboratory predictors), and clinical outcomes. Thereby informed, we developed an extraction sheet (Microsoft Excel) populated subsequently by selected studies as follows: Study design, patient characteristics (demographics), imaging variables, clinical variables, laboratory variables, and the overall outcome (surgery vs. conservative treatment, expressed as contingency tables). If intra-operative findings such as the presence of ischaemia/strangulation were reported, or if resection was performed, we also extracted these. We noted whether the authors had attempted to develop a predictive score or model.

Risk of bias and applicability assessment

We assessed the study methodological quality and potential sources of bias using a modified Quality Assessment of Diagnostic Accuracy Studies (QUADAS 2) tool [14]. This consisted of the four main domains that assessed patient selection, index tests, reference standards, and patient flow through the study. QUADAS applies to diagnostic studies, so we adapted for prognosis via three additional questions: (1) Did sufficient participants exhibit the primary outcome of interest (defined as > 20 events per study); (2) Were at least three standard clinical variables reported in addition to imaging factors and, if not, did authors justify this; (3) Were at least three predictor estimates reported with non-statistically significant results? We assigned categories of “low/high/unclear.”

Analysis

Extracted data were analysed and expressed as simple summary statistics. We intended to meta-analyse both imaging and non-imaging predictor variables where sufficient data were presented. We excluded predictors reported by less than five individual studies to avoid under-powered meta-analysis. We anticipated heterogenous data and sought predictor association with conservative or operative outcomes rather than precise estimates of strength or interpredictor comparisons. Also, because of anticipated heterogeneity, we intended meta-analysis to reflect general evidence across studies rather than providing precise estimates regarding specific definitions, situations, measurements, and thresholds. Data were extracted as 2 × 2 tables or univariable odds ratios (OR); 2 × 2 results were converted into ORs for meta-analysis. A random-effects meta-analysis used methods of DerSimonian and Laird, with the estimate of heterogeneity taken from the inverse-variance fixed-effect model. Summaries across predictors and individual study results for each predictor were presented as forest plots. High ORs may be due to a small sample size and/or study bias, so we investigated additional factors to determine the strength of evidence for each predictor rather than relying simply on statistical significance. First, two medical statisticians examined the width of the 95% confidence intervals (CI) around the overall effect of each predictor because narrow CIs indicate greater statistical power behind evidence. They then identified predictors where the OR point estimate was consistently above or below 1.0 across all the individual studies that were meta-analysed, indicating reliable results. They also considered whether the OR was “credible” because ORs far removed from 1 usually indicate low statistical power or unreliable evidence.

During extraction, it became apparent that some studies reported outcomes as ischaemia/no ischaemia rather than surgery/no surgery. We therefore analysed these outcomes separately. Two medical statisticians (TP, SM) used STATA 14.2 (StataCorp) for meta-analysis.

Results

The PRISMA flowchart is presented in Fig. 1. The literature search identified 4530 potential articles. After the title and abstracts review, 39 underwent full-text evaluation, and 8 were excluded for the following reasons: one was a systematic review [15]; two reported insufficient surgical data [16, 17]; three reported laboratory or imaging variables that failed to reach the five article threshold [18,19,20]; one did not differentiate small from large bowel obstruction [21]; and one with excessive selection bias [22]. This left 31 studies for inclusion [4,5,6,7,8,9,10,11, 23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44].

Fig. 1
figure 1

Study PRISMA flowchart

Table 1 describes the characteristics of the included studies. In total, 4638 patients were analysed, with a median sample size of 128, ranging between 44 [28] and 313 [23]. Mean patient age per-study ranged from 52 [9] to 73 years [28]. The male-to-female ratio was approximately equal across most studies, except one that reported 77% female participants [28]. Most studies (29, 94%) were single centre, with one study conducted across three USA hospitals [7] and one across four French hospitals [25]. Most studies (23, 74%) gathered data retrospectively via case-note review. Seven were prospective cohort studies [7, 11, 29, 33, 34, 39, 41]. Only one study was a randomised controlled trial [26].

Table 1 Characteristics of studies included in the review, including raw data for patients treated conservatively versus surgically and those with and without small bowel ischaemia at surgery

Risk of bias

The risk of bias is reported in Online Supplementary Material 2. Only 11 (35%) studies raised no concerns regarding the risk of bias [4, 5, 7, 8, 26, 28, 30, 31, 33, 40, 41]. Analysis revealed an unclear or high risk of bias for patient selection in 9 studies (29%) [9, 23, 24, 27, 32, 35, 38, 39, 43]. Additionally, the risk of bias for the predictor and reference standard domains was high or unclear in 42% and 32% of studies, respectively. The main factors underpinning high or unclear risk of bias were failure to report how outcomes were determined and/or difficulty in understanding whether predictors were interpreted without prior knowledge of the clinical outcome and vice versa. In contrast, applicability scores fared much better, with only one article raising concerns and, even then, only judged “unclear” [23].

Imaging, clinical, and laboratory predictors

Computed tomography (CT) was the modality most commonly used to assess patients, as reported in 29 (94%) studies (Table 1). In addition, five studies combined CT and abdominal radiography (AXR), while two studies used AXR alone [39, 44]. To simplify data presentation, we merged the different terms used for similar predictors into eight groups; for example, “rebound tenderness,” “guarding,” and “peritonitis” were merged under “peritonism” (Table 2). A total of 29 potential predictor variables were identified for meta-analysis, representing 14 imaging, 10 clinical, and 5 laboratory variables (Table 3).

Table 2 Description of merged terms used for the review
Table 3 Potential imaging, clinical, and laboratory predictor variables extracted for the review

Predictive score/model

Eight studies (26%) proposed a predictive model, with three of these describing a risk scoring system. Five of these studies used a combination of CT findings as predictors, including presence of a transition point, small bowel dilatation, intraperitoneal free fluid, reduced bowel wall enhancement, and the presence of closed-loop obstruction [4, 5, 8, 10, 11]. One study included a clinical variable (absence of flatus [7]) and another included a laboratory variable (hyponatraemia [9]) in addition to CT findings. A third proposed a combination of age, nasogastric aspirate volume, and the presence of free fluid on CT to stratify participants into high- and low-risk surgical groups [6].

Meta-analysis: surgery versus conservative management

The strength of evidence for predicting surgery was evaluated in 23 (74%) studies. Of the 29 potential predictors identified, 21 were reported in at least five studies and were meta-analysed. Figure 2 shows meta-analysis results for all 21 predictors, while Online Supplementary Material 3 shows individual study and meta-analyses for each predictor. Using the approach based on narrow confidence intervals and consistent OR, described under “Analysis,” we identified five predictors where the evidence to predict surgery was “strong.” There were three imaging variables: Peritoneal free fluid (OR 3.24, 95%CI 2.45 to 4.29); high-grade obstruction (OR 3.58, 95%CI 2.46 to 5.20); mesenteric inflammatory changes (OR 2.61, 95%CI 1.94 to 3.50). There were two clinical variables: abdominal distention (OR 2.43, 95%CI 1.34 to 4.42) and peritonism (OR 3.97, 95%CI 2.67 to 5.90). Previous abdominopelvic surgery was a strong predictor of conservative management (OR 0.58, 95%CI 0.40 to 0.85).

Fig. 2
figure 2

Forest plot of potential imaging, clinical, and laboratory predictor variables reported in more than 5 studies, ordered by confidence interval width. Increasing odds ratio (OR) favours surgery and decreasing OR, conservative management

Meta-analysis: ischaemia versus no ischaemia

The strength of evidence for predicting ischaemic small bowel at surgery was evaluated in ten (32%) studies. Of the 29 potential predictors identified, 10 were reported in at least five studies and were meta-analysed. Figure 3 shows meta-analysis across all 10 predictors while Online Supplementary Material 4 shows individual study metanalyses for each predictor. We identified three predictors where evidence to predict small bowel ischaemia at surgery appeared strong. There were two imaging variables: Peritoneal free fluid (OR 3.49, 95%CI 2.28 to 5.35) and bowel (mural) thickening (OR 3.26 95%CI 1.91 to 5.55). There was one laboratory variable (elevated WBC, OR 4.76, 95%CI 2.71 to 8.36). WBC used various thresholds across studies: ≥ 10 [32, 36], > 10 [24, 38], > 10.5 [10], > 12 [30]. We identified no predictors where evidence to exclude small bowel ischaemia at surgery appeared strong.

Fig. 3
figure 3

Forest plot of potential imaging, clinical, and laboratory predictor variables reported in more than 5 studies, ordered by confidence interval width. Increasing odds ratio (OR) favours ischaemia identified at surgery and decreasing OR, no ischaemia

Discussion

A 2015 meta-analysis investigated CT findings that predicted small bowel ischaemia at subsequent surgery for small bowel obstruction [15]. Analysis of 768 patients from nine studies found that reduced mural enhancement was associated with surgical ischaemia, whereas the absence of mesenteric fluid effectively ruled out strangulation [15]. However, it is unlikely that surgeons will restrict their decision-making to imaging alone. Because of this, we performed a systematic review of potential predictor variables that extended beyond CT, into clinical and biochemical realms. In addition to the meta-analysis of factors to predict ischaemia at surgery, we also looked for associations predicting conservative versus operative management. Furthermore, we did not simply rely on statistical significance following meta-analysis to identify promising predictors but based our selection on the strength of statistical evidence, including assessments of consistency and credibility (with non-credible results reflected by excessive OR values and/or excessively wide confidence intervals).

We meta-analysed 14 imaging, 10 clinical, and 5 laboratory predictor variables but could only identify strong evidence of potential utility for 8 of these. Four were imaging variables derived from CT. Peritoneal free fluid, mesenteric inflammation, and high-grade obstruction (defined by clinical suspicion of complete or nearly complete obstruction) all predicted surgical management, whereas peritoneal free fluid and bowel thickening predicted ischaemia at surgery. Three clinical variables demonstrated potential utility to distinguish surgical from conservative management: Abdominal distension and peritonism predicted surgery, whereas a history of previous abdominopelvic surgery predicted conservative management. No clinical variable appeared predictive of ischaemia at surgery, either positively or negatively. Elevated WBC was the only non-imaging variable that appeared useful in that scenario and was the only promising laboratory predictor overall. The fact that Millet’s analysis [15] was restricted to surgical ischaemia may explain why we identified more predictors. Patients ultimately undergoing surgery are a small proportion of those presenting with obstruction, and the fact that study outcomes were more frequently expressed as surgery vs. conservative management (vs. surgical ischaemia or not), presented us with a greater selection of primary research. It should also be noted that many predictor variables were undefined by the authors. For example, “peritoneal free fluid” or “ascites” was not quantified, and of the six studies meta-analysed for “bowel thickening,” only two defined this, and those definitions differed, being “more than 2 mm” in one study [28] versus “greater than 3mm” in the other [8].

Management of SBO remains problematic. A 2018 review stated that “there has been no reliable clinical method for predicting failure of nonoperative management for adhesive SBO” [45]. Accordingly, the primary aim of our review was to identify both imaging and non-imaging factors that should be investigated for the development and evaluation of a multi-variable model predicting treatment strategy for small bowel obstruction. Ideally, such a model would identify those patients unlikely to respond to conservative management and therefore at high risk of ischaemia. At the same time, an accurate model could identify those patients destined to resolve conservatively, and thus avoid surgery. Unfortunately, the large majority of existing literature that investigates predictors of surgery and/or ischaemia is single-centre, and therefore potentially suffers from spectrum bias and/or insufficient power. A single-centre study by Scrima and colleagues sensibly included both imaging and non-imaging factors, but their model was not evaluated [42]. Zielinski and co-workers developed a model based predominantly on CT scanning, finding vomiting, free intraperitoneal fluid, mesenteric oedema, and absent “small bowel faeces” sign, predictive of surgery [10]. The authors then evaluated the model prospectively, adding more predictors and removing predictors initially found significant during development [11]. The American Association for Surgery of Trauma has advocated an “anatomic severity schema” for small bowel obstruction that was validated subsequently in 351 patients [46].

Despite this work, no model has been implemented widely, and surgeons still rely on their clinical impression combined with imaging findings, suggesting that the models lack external validity or are difficult to implement in daily practice. We aimed to facilitate model development via meta-analysis, which allows the mathematical synthesis of potential predictor variables investigated across multiple studies and centres [12]. At the same time, it is statistically undesirable to meta-analyse a limited number of studies unless they are very large, and with sufficient outcomes. This drove our a priori decision to limit meta-analysis to variables reported in five or more individual studies. For example, several small single-centre studies have suggested that water-soluble contrast follow-through (WSCFT, usually “Gastrografin”) may not only diagnose the presence and site of small bowel obstruction, but may also have both prognostic and therapeutic implications. A 2016 meta-analysis found that WSCFT reduced subsequent surgery significantly (OR 0.55) and was 92% sensitive and 93% specific for predicting non-operative management [47]. We particularly wished to investigate this variable since it appears to be widely implemented by surgeons in daily practice. However, we were unable to identify sufficient primary research to allow meta-analysis.

Our review does have limitations. We selected articles that investigated potential imaging predictors of surgery, and of ischaemia at surgery. While we also chose to analyse non-imaging variables, any research that investigated these in the absence of an imaging test would have been omitted. This is because, a priori, we hypothesised that it would be highly unlikely for any high-quality research to omit imaging since the narrative literature suggests CT is the single most useful investigation, and CT has a role both for diagnosis and as a reference standard in patients managed conservatively, which will constitute the majority. Restriction to variables examined in five or more individual studies meant that some potential predictors were not meta-analysed; we have explained the statistical assumptions that underpin our decision. Because we found the risk of bias affected a large proportion of primary studies, our findings should be interpreted with caution. While we were careful not to promote variables simply on the basis of statistical significance following meta-analysis, we also wish to draw readers’ attention to the difference between statistical evidence of utility and clinical conviction. Any potential model development should not ignore variables that are used widely, even where statistical support is absent. WSCFT is the obvious example.

In summary, a systematic review and meta-analysis of imaging, clinical, and laboratory variables of patients with small bowel obstruction identified 6 potential predictors associated strongly with the need for surgery (5 positively and 1 negatively; 3 were derived from CT), and 3 associated strongly with ischaemia at surgery (all positively; 2 were derived from CT). The development of future multivariable models to guide the management of small bowel obstruction should concentrate on variables that appear to display strong evidence of potential utility. Factors that have not been investigated sufficiently well in the primary literature but which enjoy considerable clinical support should also be considered.