Introduction

Acute appendicitis is one of the most frequent indications for emergency surgical admission [1]. Traditionally, the treatment of acute appendicitis has always been surgical appendicectomy, even if there is good evidence that at least some cases of appendicitis can be treated conservatively. The recent Covid-19 pandemic and the subsequent guidelines to avoid potentially unnecessary surgery during the peak of the pandemic to reduce the workload of our already strained health systems have taught us that a significant percentage of patients with acute appendicitis can be treated conservatively, thus reducing the risks of a surgical operation [2]. Identifying those patients is not as straightforward as it could seem, and failure of conservative management is still quite high [3]. Generally speaking, the efficacy of conservative management depends on the histologic changes of the appendix, being more likely in the early stages of acute appendicitis and in the absence of a faecalith obstructing the lumen of the appendix [4]. However, it is quite difficult to predict the degree of inflammation of the appendix preoperatively, and CT scan has gained popularity as a diagnostic and prognostic tool for acute appendicitis [5]. Unfortunately, CT is not the ideal screening test, as it is expensive and carries a low but discrete risk of radiation-related morbidity. Furthermore, any diagnosis based on single laboratory tests is not as specific as we would need to differentiate complicated vs non-complicated appendicitis and is associated with a non-insignificant risk of false positive. With this study, we tried to build up a predictive model to stratify preoperatively patients with a diagnosis of acute appendicitis to identify those with advanced (or complicated) disease based on easily available laboratory variables.

Materials and methods

Electronic notes of patients operated of appendicectomy from 1 September 2016 to 31 August 2021 were retrospectively retrieved and analysed as part of an audit of the Surgical Emergency Unit of the Oxford University Hospitals NHS Foundation Trust. Further analysis and discussion have been conducted in collaboration with the Digestive and Emergency Surgery Unit of the Hospital of Terni (Italy).

This audit was approved by the Audit Committee of the Oxford University Hospitals NHS Foundation Trust. Ethical committee approval was not deemed to be necessary as data were collected retrospectively and were all anonymised.

Inclusion criteria were given as follows: age ≥ 16yo, preoperative diagnosis of appendicitis, laparoscopic or open operation. Exclusion criteria were given as follows: age < 16yo, BMI > 35, significant comorbidity (cardiac, liver or renal disease, cancer diagnosis, peripheral vascular disease, immunodeficiency, diabetes, coagulation disorder), pregnancy and appendicectomy as part of another operation. Preoperative clinical and laboratory data were retrieved and recorded into an electronic database (Microsoft Excel for Mac v.16.66.1) along with the histology findings on the operative specimen. Cases with > 20% of incomplete data were excluded. Missing data were excluded listwise.

Patients were divided into four categories according to the histologic findings: 1—no appendicitis, 2—mucosal appendicitis, 3—transmural appendicitis and 4—gangrenous appendicitis (including perforated appendixes).

Continuous variables were first analysed for skewness (− 0.5 to 0.5 is normal distribution) and then compared with the ANalysis Of VAriance test (ANOVA). Subsequently, the variables that gained statistical significance at univariate analysis were introduced into a multinomial (ordinal) logistic regression analysis to identify the independent prognostic factors for the histopathology findings (1, 2, 3 or 4 as above). Subsequently, the same factors were entered into a binomial logistic regression analysis to identify the prognostic factors for “gangrenous appendicitis” vs “non-gangrenous appendicitis” or “non-inflamed appendix” (dependent variable) and to create a predictive model, whose model fit measures and ROC curve were calculated. Sensitivity and specificity of the models were calculated. Starting from the last predictive model (“gangrenous vs non-gangrenous appendicitis”), variables were progressively removed until the maximum specificity of the model was obtained. This allowed the identification of a simplified predictive model for “gangrenous appendicitis”.

Statistical analyses were performed with the applications StatPlus for Mac v.8.0.1.0 and Jamovi v.1.2.9.0. Continuous variables are approximated to the thousandths. P values are approximated to the thousandths. P values < 0.05 are considered to be significant.

Results

We retrieved clinical and laboratory data of 2275 patients who had appendicectomy with a preoperative clinical diagnosis of acute appendicitis and fulfilled the inclusion criteria. At histology, 414 (18.2%) did not have any appendicular inflammation, 219 (9.6%) had a mucosal only appendicitis, 1205 (53.0%) had transmural inflammation and 437 (19.2%) had gangrenous appendicitis.

Results of univariate analysis are reported in Table 1. Albumin, CRP, lymphocytes count, neutrophils count, WBC count, CRP/Albumin ratio, CRP/MPV ratio, albumin/MPV ratio and neutrophils/lymphocytes ratio were found to be directly or inversely associated with severity of appendicitis. In particular, a strong correlation was evident between lymphocytes count and degree of inflammation and CRP/Albumin ratio and degree of inflammation.

Table 1 Univariate comparison of laboratory variables

These two variables—lymphocyte count and CRP/albumin ratio—resulted particularly abnormal in patients with gangrenous appendicitis (Fig. 1).

Fig. 1
figure 1

Descriptive plot of lymphocyte count and CRP/Albumin ratio by histopathology group. Histopathology: 1—no inflammation, 2—mucosal inflammation, 3—transmural inflammation and 4—gangrenous appendicitis

Tables 2, 3 and 4 report the results of multivariate analysis. Figures 2 and 3 show the ROC curve and predictive measures of the predictive models. Of the factors entered into the multivariate analysis, only lymphocyte count and CRP/Albumin ratio resulted to be significantly independent prognostic factors for the severity of inflammation (Table 2). When the dependent variable was “gangrenous appendicitis”, albumin, lymphocyte count, neutrophil count, WBC count and CRP/Albumin ratio were independent prognostic variables (Table 3). The predictive model for “gangrenous appendicitis” has low sensitivity but high specificity (Fig. 2). The simplified model obtained by consecutive regression analyses with progressive elimination of variables (backward stepwise regression) to identify the model with the highest specificity is shown in Table 4 and Fig. 3. However, it must be highlighted that R2 for all those models is quite low, from 0.082 to 0.132, which means that only 8–13% of the variability can be explained by the models.

Table 2 Multivariate analysis by ordinal logistic regression and model fit measures
Table 3 Multivariate analysis by binomial logistic regression and model fit measures
Table 4 Simplified predictive model for gangrenous appendicitis
Fig. 2
figure 2

Receiver operating characteristic (ROC) curve of the predictive model for gangrenous appendicitis (see Table 3)

Fig. 3
figure 3

Receiver operating characteristic (ROC) curve of the simplified predictive model for gangrenous appendicitis (see Table 4)

Discussion

Acute appendicitis is one of the most common reasons for emergency surgical admission [1]. In the past, almost invariably, a clinical diagnosis of acute appendicitis, often but not always associated with laboratory confirmation (leucocytosis), led to a surgical operation of appendicectomy. As a consequence, the rate of negative appendicectomies (i.e. whose macroscopic or pathologic examination did not find any sign of inflammation) was quite high. The advent of laparoscopy, with its diagnostic capabilities, should have reduced the rate of negative appendicectomies [6, 7] but this has never been definitely proven [8]. In actual facts, a certain number of cases where symptoms would suggest acute appendicitis were due to other diseases such as tubo-ovarian inflammation, ectopic pregnancy, active ileitis or colitis or simply to mesenteric adenopathy, among the others. In some cases, even laparoscopic exploration is not able to find the cause of right iliac fossa pain. Guidelines on the diagnosis and treatment of acute pain in the right iliac fossa are not always consistent and there are still areas for discussion. Laparoscopic exploration is nowadays considered the gold standard for diagnostic and therapeutic purposes [67]. While the indication for appendicectomy is straightforward in case of macroscopically confirmed acute appendicitis, it has been suggested not to remove the appendix if another possible cause for pain has been detected and possibly treated [9]. The issue remains in those cases where abdominal exploration does not reveal any abnormality in a patient with clear symptoms of appendicitis. The European Association of Endoscopic Surgery Guidelines and the World Society of Emergency Surgery Guidelines suggest removing the appendix anyway, considering the possibility of a mucosal only appendicitis which is not visible from the serosal surface [7, 10], but this approach may be considered an overtreatment and expose to unnecessary, albeit low, risk of complications in cases of innocent appendixes [11]. Ultrasound scan is rarely diagnostic of acute appendicitis, and the rate of false negatives is quite high [10]. Nonetheless, most surgeons still consider blood tests and abdominal USS the basis of the diagnosis of acute appendicitis. To reduce the risk of unnecessary appendicectomies, some Authors and guidelines suggested a liberal use of preoperative CT scan [7]. However, it is well known that also CT scan can be associated with false negative results and it is not immune from radiation-associated risks, in particular in young people [12]. Magnetic resonance imaging (MRI) is expensive and time-consuming, and its real indications in emergency are usually limited to pregnant patients and in case of suspected inflammatory bowel, unless new MRI protocols are developed and implemented specifically for the emergency evaluation of right iliac fossa pain [13]. Up to date, no reliable test, either imaging or laboratory based, has been found to be able to accurately predict the presence of acute appendicitis.

On the other side of the coin, the recent Covid-19 pandemic has completely changed our attitude towards some urgent conditions, teaching us that in some cases refraining from surgery can be the safer option [14]. This has been the case with acute right iliac fossa pain [15]. More recent evidence suggest that most cases of suspected acute appendicitis can have a non-operative management [7], because they can be due either to mild appendicitis that resolves spontaneously or to some other ailment not deserving a surgical operation, such as pelvic inflammatory disease [2]. Some evidence seems to suggest that any non-complicated acute appendicitis can be treated non-operatively unless the presence of a faecalith in the lumen of the appendix prevents its drainage and may increase the risk of mucocele or perforation [4]. Advanced imaging—either CT or MRI—can rule out or confirm the presence of an obstructed appendix, but, as already mentioned, those investigations are not without risks or downsides. Some other evidence confirms that an appendicectomy is indicated only in gangrenous or perforated appendicitis, while non-complicated appendicitis can be treated conservatively, even if one fifth of patients treated non-operatively require an appendicectomy within 30 days from the first diagnosis [16]. Unfortunately, at the moment there is no test that has a clinically significant predictive value for gangrenous appendicitis or that can help us foresee the extent of appendicular inflammation, if any.

Thus, the need to develop a reliable, quick, safe and easily available test to select the patients with acute right iliac fossa pain who may benefit of a surgical operation arises. Any patient seen in the Emergency Department with acute abdominal pain gets at least a blood sample for basic analysis. If correctly interpreted possibly in an aggregated fashion, those values can be highly informative and may help predict the degree of inflammation. Traditionally, white blood cells (WBC) count has been considered a marker of inflammation, along with C-reactive protein (CRP) [7], but WBC and CRP are not highly specific of the degree of inflammation and cannot be used for precise patient selection [17]. Other possible markers of inflammation, such as procalcitonin and interleukin, are not routinely checked in emergency [18]. The Alvarado score [19] and the Appendicitis Inflammatory Response (AIR) Score [20] were laudable attempts at enhancing the diagnostic efficacy but they are not completely reliable and not widely used [7, 21]. The Alvarado score has high sensitivity (99%) to rule out appendicitis when its score is < 5, but its overall sensitivity is between 57 and 93%, with specificity ranging from 81 to 100% [22]. An AIR score ≥ 5 is highly sensitive for appendicitis (90%), but overall sensitivity ranges from 78 to 92% and specificity from 63 to 97% [22]. To overcome the uncertainties associated with these scoring systems, various combinations of imaging and laboratory tests have been proposed, but diagnostic laparoscopy without imaging is still considered a viable option in particular in patients who scored high at Alvarado or AIR [22]. Augustin et al. [23] proposed the Appendicitis Tri-Modal Prediction Score (ATMOS) for the differential diagnosis of right iliac fossa pain in pregnant women, but it has not been externally validated yet.

Our study on a relevant number of cases was aimed at identifying any prognostic factor with significant predictive value among the most common variables easily available from any laboratory at the admission of the patient. Although several variables were associated with histopathology findings at univariate analysis, multivariate analysis confirmed that only lymphocyte count and CRP/Albumin ratio were independently associated with the degree of inflammation. Both variables’ odds ratios are below 1, which means they both have an inverse correlation. In other words, it looks like low lymphocyte count and low CRP/Albumin ratio can be predictive of the severity of inflammation. However, only 8% of the variability of severity of inflammation can be predicted by this model.

Low lymphocyte count is often associated with high neutrophil count; therefore it would seem logical that in acute appendicitis, neutrophils to lymphocyte ratio should be increased. In actual facts, neutrophils/lymphocytes ratio showed a positive correlation with the degree of inflammation at univariate analysis, but this was not confirmed at regression analysis. In other terms, our study showed that neutrophils/lymphocytes ratio may not be reliable to select patients with gangrenous appendicitis who may need an emergency surgical operation.

On the contrary, low lymphocyte count has been identified as an independent prognostic variable who significantly correlates to the degree of inflammation.

Similarly, while CRP per se did not qualify as a diagnostic tool in our study, the CRP to Albumin ratio is significantly and independently associated with severity of appendicitis. CRP is a well-known marker of inflammation, but it is not specific. Albumin is often erroneously considered a nutritional marker but, on the contrary, it has been demonstrated to be much more reliable as an inverse inflammatory marker [24]. Therefore, CRP/Albumin should be directly correlated with the degree of inflammation, as high CRP and low Albumin are both linked to inflammation. In fact, our simplified predictive model considers only low lymphocyte count and high CRP to albumin ratio to significantly correlate with the presence of gangrenous appendicitis. This model has high accuracy (81%) and high specificity (98%).

In other words, low lymphocyte count and CRP/Albumin can be used as markers to select those patients with acute right iliac fossa pain who would benefit from an operation of appendicectomy, i.e. those with gangrenous appendicitis, among those who have a clinical diagnosis of acute appendicitis. Patients with a different laboratory pattern may be considered for non-operative management.

Strengths of this paper are the conspicuous sample size and the fact that an easy predictive model was built up from easily available blood variables.

However, there is also a significant limitation, as the models have low R2, making us infer that basic blood tests may not be specific enough to have a clinically significant prognostic power. This consideration may prompt someone to propose the use of much more expensive and not readily available tests, such as IL-2, to allow an accurate selection of patients, but acute appendicitis is a common presumptive diagnosis in emergency surgery and a condition with low social impact; for this reason, the expenses associated with high-priced tests may not be justified and sustainable. More research may be needed to see if the “clinical acumen” of experienced surgeons may be more reliable than any laboratory or imaging test in the selection of patients with acute right iliac fossa pain.

However, the reliability of the model must be externally validated on large series.

In conclusion, low lymphocyte count and high CRP to albumin ratio can have a role in the selection of patients with suspected acute appendicitis who may deserve a surgical operation, but other predictive models, possibly incorporating clinical examination, imaging and blood tests, should be considered to reach clinical relevance.