Evaluation of the Appendicitis Inflammatory Response Score for Patients with Acute Appendicitis

de Castro, S. M. M.; Ünlü, Ç.; Steller, E. Ph.; van Wagensveld, B. A.; Vrouenraets, B. C.

doi:10.1007/s00268-012-1521-4

Evaluation of the Appendicitis Inflammatory Response Score for Patients with Acute Appendicitis

Open access
Published: 24 March 2012

Volume 36, pages 1540–1545, (2012)
Cite this article

Download PDF

You have full access to this open access article

World Journal of Surgery Aims and scope Submit manuscript

Evaluation of the Appendicitis Inflammatory Response Score for Patients with Acute Appendicitis

Download PDF

S. M. M. de Castro¹,
Ç. Ünlü¹,
E. Ph. Steller¹,
B. A. van Wagensveld¹ &
…
B. C. Vrouenraets¹

5824 Accesses
78 Citations
4 Altmetric
Explore all metrics

An Erratum to this article was published on 14 June 2012

Abstract

Background

Acute appendicitis is still a difficult diagnosis. Scoring systems are designed to aid in the clinical assessment of patients with acute appendicitis. The Alvarado score is the most well known and best performing in validation studies. The purpose of the present study was to externally validate a recently developed appendicitis inflammatory response (AIR) score and compare it to the Alvarado score.

Methods

The present study selected consecutive patients who presented with suspicion of acute appendicitis between 2006 and 2009. Variables necessary to evaluate the scoring systems were registered. The diagnostic performance of the two scores was compared.

Results

The present study included 941 consecutive patients with suspicion of acute appendicitis. There were 410 male patients (44%) and 531 female patients (56%). The area under the receiver operating characteristic curve of the AIR score was 0.96 and significantly better than the area under the curve of 0.82 of the Alvarado score (p < 0.05). The AIR score also outperformed the Alvarado score when analyzing the more difficult patients, including women, children, and the elderly.

Conclusions

This study externally validates the AIR Score for patients with acute appendicitis. The scoring system has a high discriminating power and outperforms the Alvarado score.

Validation of the Appendicitis Inflammatory Response (AIR) Score

Article Open access 06 April 2021

Performance and diagnostic accuracy of scoring systems in adult patients with suspected appendicitis

Article 06 July 2023

Predicting Acute Appendicitis? A comparison of the Alvarado Score, the Appendicitis Inflammatory Response Score and Clinical Assessment

Article 23 September 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

In 1880, Robert Lawson Tait performed the first appendectomy for appendicitis in England [1]. Now, more than 130 years later, this most common of all surgical diseases can still be a diagnostic problem. This is demonstrated by the high negative laparotomy rates documented in the literature. A study performed in 2005 in the Netherlands found that approximately 15% of the patients underwent a negative appendectomy, a number similar to another large Swedish study [2]. The negative appendectomy rate was 13% in another large North American study [3].

It is safe to assume that the negative laparotomy rate declined to approximately 10% with the routine use of ultrasonography (US) [4]. The higher sensitivity of computed tomography (CT) seems to have had an even greater effect on the negative laparotomy rate, which has decreased even further to 5–10% [4, 5]. In many European countries, most surgeons still consider acute appendicitis to be a clinical diagnosis and do not routinely perform imaging studies [6].

Scoring systems have been designed to aid in the clinical assessment of patients with acute appendicitis. The Alvarado score is the most well known and best performing in validation studies, but it has some drawbacks [7–9]. Its construction was based on a review of patients who had been operated with suspicion of appendicitis, whereas the score is supposed to be used on all patients with suspicion of appendicitis. Also, the score does not incorporate C-reactive protein as a variable, although many studies have shown the importance of C-reactive protein in the assessment of patients with appendicitis [10].

The recently introduced appendicitis inflammatory response (AIR) score was designed to overcome these drawbacks [11]. This score incorporated the C-reactive protein value in its design and was developed and validated on a prospective cohort of patients with suspicion of acute appendicitis.

The objective of the present study was to externally validate the AIR score on a consecutive cohort of patients with suspicion of acute appendicitis and compare the AIR score’s performance to the Alvarado score.

Methods

The present study selected consecutive emergency room patients with suspicion of acute appendicitis between January 2006 and January 2009. The population consisted of all patients who complained of sudden-onset, non-traumatic abdominal pain. The data of these patients were previously used for a different study evaluating the use of imaging for acute appendicitis [12].

A senior surgical resident initially examined the patients, and the decision to operate was subsequently confirmed by a senior surgical staff member. Imaging by means of US or CT was used selectively in the present study and at discretion of the surgeon. The surgical procedures consisted of either a laparotomy or diagnostic laparoscopy followed by a laparoscopic appendectomy. The diagnosis of acute appendicitis during laparoscopy was established on the basis of macroscopic findings. A macroscopically normal appendix found at laparoscopy was left in situ. The diagnosis of appendicitis was confirmed histologically in all resected specimens. Appendicitis was pathologically diagnosed when infiltration of the muscularis propria by neutrophil granulocytes was seen [13]. Patients were classified into two groups: (1) phlegmonous appendicitis and (2) advanced appendicitis, defined as a macroscopic gangrenous appendix with or without perforation. A periappendicular abscess confirmed on CT was defined as an appendix that is surrounded by a fluid collection and extensive tissue infiltration, which prevents spread of infection into the free abdominal cavity.

Variables recorded to evaluate the scoring systems include nausea, vomiting, anorexia, migration of pain to the right lower quadrant (RLQ), pain in the RLQ, rebound tenderness, muscular defense, body temperature, high white blood cell (WBC) count, proportion of polymorphonuclear leukocytes, and a high level of C-reactive protein (CRP). These variables are necessary to calculate both the Alvarado score and the AIR score. The two scores are based on different variables, with different points assigned to each variable. In the pediatric population, the child’s history obtained from the parents was used if the patient was too young to give a complete history. An overview of the scoring system is given in Table 1.

Table 1 Characteristics of the appendicitis inflammatory response (AIR) score and the Alvarado score^a

Full size table

Statistical analysis was performed with SPSS statistical software (SPSS Inc, Chicago, IL). A p value of <0.05 was considered statistically significant. Pearson’s chi-square test was used to test if differences between dichotomous groups were significant. Fisher’s exact test was used when a table had a cell with an expected frequency of less than 5. The area under the receiver operating characteristic (ROC) curves was used to examine the performance characteristics of the two scoring systems.

Results

The present study included 941 consecutive patients with suspicion of acute appendicitis. There were 410 male patients (44%) and 531 female patients (56%). General patient characteristics are shown in Table 2. The present cohort was older compared to the original AIR cohort. Otherwise, the two cohorts compared remarkably well. The mean patient age was 32 years, with a range of 1–97 years. Of the 941 patients, 201 (21%) were younger than 18 years of age.

Table 2 Patient characteristics

Full size table

Overall, 346 of the 941 patients (37%) had appendicitis: 244 patients had pathologically proven phlegmonous appendicitis, and 92 had pathologically proven advanced appendicitis. Another 10 patients had a periappendicular abscess. These 10 patients were classified as part of the advanced appendicitis group, resulting in a total of 102 patients with advanced appendicitis. Of the remaining 595 patients (63%) with no appendicitis, an alternate diagnosis was found in 220 patients (Table 3). At operation, a pathologically normal appendix without an alternate diagnosis was found in 41 patients (4%). Nonspecific abdominal pain was found in the remaining 334 patients. All patients underwent routine follow-up and did not receive antibiotics unless an alternate diagnosis indicated antibiotic use.

Table 3 Patients with an alternate diagnosis

Full size table

The area under the ROC curve of the AIR score was 0.96 and significantly better than the area under the curve of 0.82 of the Alvarado score (p < 0.05). The AIR score also outperformed the Alvarado score in the analysis of the more difficult to diagnose patients, including women, children, and the elderly (Table 4).

Table 4 Discriminating capacity of the AIR score compared to the Alvarado score, according to patient gender and age using receiver operator characteristic (ROC) curve analysis

Full size table

A score of greater than 4 points gave a similar sensitivity for the AIR score and the Alvarado score (0.93 vs. 0.90, respectively) but gave a much higher specificity (0.85 vs 0.55, respectively) (Table 5). This corresponds to a negative predictive value of 0.95 for the AIR score compared to 0.90 for the Alvarado score. Five hundred thirty-three of the 941 patients (57%) were classified by the AIR score to the low-risk group with fewer than 5 scoring points, including 18 patients with phlegmonous appendicitis and 7 with advanced appendicitis (Table 6). The corresponding result for the Alvarado score was 359 patients (38%), including 27 phlegmonous appendicitis patients and 8 advanced appendicitis patients. Of the 595 nonappendicitis patients, the AIR score correctly classified 508 patients (85%) to the low-risk group, compared to 324 patients (55%) for the Alvarado score.

Table 5 Diagnostic characteristics of the AIR score and Alvarado score according to the cutoff points

Full size table

Table 6 Distribution according to the diagnostic test zone and diagnosis for the AIR score and the Alvarado score

Full size table

A score greater than 8 points had a lower sensitivity for appendicitis for the AIR score compared with the Alvarado score (0.10 vs. 0.29). However, this was associated with a higher specificity (1.00 vs. 0.95, respectively). These scores translate to a positive predictive value of 0.77 and 1.00 for the AIR and the Alvarado scores, respectively. The AIR classified 36 patients to the high-risk group. All of them had appendicitis. The corresponding figure for the Alvarado score was 130 patients, 100 of whom had appendicitis.

The AIR score classified 41 of the 89 negative appendectomies (46%) to the low probability group and none to the high probability group, compared to 10 patients (11%) and 21 patients (16%), respectively, for the Alvarado score.

If the AIR score had hypothetically been implemented in evaluating the present cohort, the data would translate into 533 patients (57%) who would have been observed as outpatients and spared further diagnostic work-up. Twenty-five of these patients (5%) (18 patients [3%] with phlegmonous appendicitis and 7 patients [1%] with advanced appendicitis) would have been missed but probably discovered during routine follow-up the next day, as their score got higher. Thirty-six patients with a high probability of appendicitis (4%) could undergo direct surgery without any negative appendectomies. The remaining 372 patients (40%) would fall in the intermediate group and would undergo diagnostic imaging, thus safely preventing costly imaging in 544 (941−(372+25)) patients (58%).

Discussion

The present study shows that the AIR score has a good statistical discrimination for patients with acute appendicitis and outperforms the Alvarado score. The discriminatory property of the AIR score remains high in the more difficult to diagnose patients (e.g., women, children, and the elderly).

Nowadays, the use of US or CT in patients suspected of having appendicitis is common. However, imaging does not perform well in patients with low and high prevalence of the disease, and CT should be used selectively to minimize exposure to ionizing radiation [14]. Moreover, false negative results may delay surgery and subsequently increase morbidity [15].

A clinical scoring system estimates the probability of appendicitis in a patient and should aid in the decision-making process for treatment because of its simple design and application. There are a number of reasons to use scoring systems in managing cases of appendicitis. A clinical score may be suitable as an instrument for selecting patients for immediate surgery, further examination with imaging techniques, or observation. The score can be repeated during active observation and influence the decision to operate. It must be emphasized that the intent of the scoring system is not to establish a primary diagnosis of appendicitis, but simply to discriminate objectively when there is uncertainty.

Another reason to use such a scoring system is to better describe the patients that are included in clinical studies and thereby facilitate the comparison of results. Many studies performed in patients with appendicitis suffer from selection bias. For instance, two recent studies that compared the use of antibiotics to routine surgery for patients with acute appendicitis reported favorable results with the antibiotic treatment [16, 17]. Unfortunately, the severity of disease in these patients was unclear because the decision to cross over to the surgery group was left to the surgeon. The possible bias in severity of disease was the main critique of these studies after their publication [18–22]. Similarly, the value of diagnostic laparoscopy is dependent on many factors, but it also is limited to the types of patients enrolled in a particular study and thus the prevalence of the disease in a particular population. For instance, including many suspicious cases would lower the yield of laparoscopy significantly, whereas randomly selecting patients with abdominal pain would increase the yield significantly. A validated scoring system can aid in better comparing the results of these studies.

A study on malpractice lawsuits from North America found that appendicitis ranks third among lawsuits, even though appendicitis is the cause of acute abdominal pain only about 5% of the time or less [23]. An objective validated scoring system could legally strengthen decisions made in the emergency room and could avoid malpractice liability. Most claims involve misdiagnosis or delayed diagnosis, and common pitfalls include poor documentation.

The most commonly known scoring system is the Alvarado score. The Alvarado score was first reported in 1986 and was based on the weight of several significant variables found in 305 patients with acute appendicitis. Other variations on the Alvarado score have also been developed but do not differ much [24, 25]. These scoring systems never enjoyed wide application because of their suboptimal discriminatory properties. The AIR score was first reported in 2008. It was based on data collected prospectively from 545 patients admitted for suspected appendicitis at four hospitals. The score was developed on 316 randomly selected patients and evaluated on the remaining 229 patients. It was based on similar values to the Alvarado score, but it also included C-reactive protein as a new variable. A recent meta-analysis showed that when both an elevated WBC count and elevated C-reactive protein level are present, there is a fivefold increase in the positive likelihood ratio for acute appendicitis [10].

Routine use of an Alvarado-like scoring system was evaluated in a large German study comparing 870 patients who did not receive routine scoring with 614 patients who were evaluated with a Alvarado-like scoring system [26]. The scoring system consisted of eight variables developed in another study and validated on a Dutch population [24]. The scoring system also did not include C-reactive protein, and it found no difference in the rates of perforated appendix, negative appendectomy, or complications between groups. However, it did find a significantly lower delayed appendectomy rate (2 vs. 8%) and a lower delayed discharge rate (11 vs. 22%) in the group that routinely used the scoring system.

A conditional strategy with CT only after negative or inconclusive US yielded a sensitivity of 94% in a recent study of patients with acute abdominal pain [27]. In the present cohort, 372 patients (40%) would fall in the intermediate group, and, hypothetically, if they all underwent imaging with this strategy, there would be 22 patients (2%) with a negative appendectomy. Thus the negative appendectomy rate could potentially decline from 10% in the present cohort to 2% in the present cohort with the AIR scoring system.

The AIR score probably works better in the pediatric population than the Alvarado score because the variables scored are easy to apply to children. The Alvarado score requires children to identify nausea, anorexia, and migration of pain. This is probably the reason why the Alvarado score compares best to the AIR score in the adolescent age group, because this group closely mimics the initial cohort on which the Alvarado score was designed.

The management of patients with suspected acute appendicitis is still challenging, and the optimal management strategy is still unknown, even after the introduction of US, CT, and diagnostic laparoscopy. This study externally validates that the AIR score has a high discriminating power and outperforms the Alvarado score. This score could aid in selecting patients who require timely surgery or those who require further evaluation. Finally, the score could safely avoid hospitalization and unneeded investigations in patients in whom the diagnosis is unlikely. Such a scoring system is important for future research to better compare results. But first, a proper prospective randomized controlled trial evaluating the effect of introducing such a score in a relevant patient population has to be performed.

References

Seal A (1981) Appendicitis: a historical review. Can J Surg 24:427–433
PubMed CAS Google Scholar
Andersson RE, Hugander A, Thulin AJ (1992) Diagnostic accuracy and perforation rate in appendicitis: association with age and sex of the patient and with appendicectomy rate. Eur J Surg 158:37–41
PubMed CAS Google Scholar
Hale DA, Molloy M, Pearl RH et al (1997) Appendectomy: a contemporary appraisal. Ann Surg 225:252–261
Article PubMed CAS Google Scholar
Cuschieri J, Florence M, Flum DR et al (2008) Negative appendectomy and imaging accuracy in the Washington state surgical care and outcomes assessment program. Ann Surg 248:557–563
PubMed Google Scholar
Wagner PL, Eachempati SR, Soe K et al (2008) Defining the current negative appendectomy rate: for whom is preoperative computed tomography making an impact? Surgery 144:276–282
Article PubMed Google Scholar
Poortman P, Oostvogel HJ, de Lange-de Klerk ES et al (2009) The use of imaging in the case of suspected acute appendicitis: opinion of Dutch surgeons. Ned Tijdschr Geneeskd 153:B376 [in Dutch]
PubMed Google Scholar
Alvarado A (1986) A practical score for the early diagnosis of acute appendicitis. Ann Emerg Med 15:557–564
Article PubMed CAS Google Scholar
Owen TD, Williams H, Stiff G et al (1992) Evaluation of the Alvarado score in acute appendicitis. J R Soc Med 85:87–88
PubMed CAS Google Scholar
Douglas CD, Macpherson NE, Davidson PM et al (2000) Randomised controlled trial of ultrasonography in diagnosis of acute appendicitis, incorporating the Alvarado score. BMJ 321(7266):919–922
Article PubMed CAS Google Scholar
Andersson RE (2004) Meta-analysis of the clinical and laboratory diagnosis of appendicitis. Br J Surg 91:28–37
Article PubMed CAS Google Scholar
Andersson M, Andersson RE (2008) The appendicitis inflammatory response score: a tool for the diagnosis of acute appendicitis that outperforms the Alvarado score. World J Surg 32:1843–1849
Article PubMed Google Scholar
Unlu C, de Castro SM, Tuynman JB et al (2009) Evaluating routine diagnostic imaging in acute appendicitis. Int J Surg 7:451–455
Article PubMed CAS Google Scholar
Marudanayagam R, Williams GT, Rees BI (2006) Review of the pathological results of 2660 appendicectomy specimens. J Gastroenterol 41:745–749
Article PubMed Google Scholar
Terasawa T, Blackmore CC, Bent S et al (2004) Systematic review: computed tomography and ultrasonography to detect acute appendicitis in adults and adolescents. Ann Intern Med 141:537–546
PubMed Google Scholar
Flum DR, McClure TD, Morris A et al (2005) Misdiagnosis of appendicitis and the use of diagnostic imaging. J Am Coll Surg 201:933–939
Article PubMed Google Scholar
Styrud J, Eriksson S, Nilsson I et al (2006) Appendectomy versus antibiotic treatment in acute appendicitis. a prospective multicenter randomized controlled trial. World J Surg 30:1033–1037
Article PubMed Google Scholar
Hansson J, Korner U, Khorram-Manesh A et al (2009) Randomized clinical trial of antibiotic therapy versus appendicectomy as primary treatment of acute appendicitis in unselected patients. Br J Surg 96:473–481
Article PubMed CAS Google Scholar
Spanos CP (2009) Letter 1: Randomized clinical trial of antibiotic therapy versus appendicectomy as primary treatment of acute appendicitis in unselected patients. (Br J Surg 96:473–481). Br J Surg 96:1223–1224
Article PubMed CAS Google Scholar
Brown E (2009) Letter 1: Randomized clinical trial of antibiotic therapy versus appendicectomy as primary treatment of acute appendicitis in unselected patients (Br J Surg 96:473–481). Br J Surg 96:952
Article PubMed CAS Google Scholar
Patel V, Ahmed K, Ashrafian H (2009) Letter 3: Randomized clinical trial of antibiotic therapy versus appendicectomy as primary treatment of acute appendicitis in unselected patients (Br J Surg 96:473–481). Br J Surg 96:953
Article PubMed CAS Google Scholar
van LA (2009) Letter 5: Randomized clinical trial of antibiotic therapy versus appendicectomy as primary treatment of acute appendicitis in unselected patients (Br J Surg 96:473–481). Br J Surg 96:954
Paice AG, Ali H (2009) Letter 6: Randomized clinical trial of antibiotic therapy versus appendicectomy as primary treatment of acute appendicitis in unselected patients (Br J Surg 96:473–481). Br J Surg 96:954
Article PubMed CAS Google Scholar
Selbst SM, Friedman MJ, Singh SB (2005) Epidemiology and etiology of malpractice lawsuits involving children in US emergency departments and urgent care centers. Pediatr Emerg Care 21:165–169
Article PubMed Google Scholar
Ohmann C, Franke C, Yang Q et al (1995) Diagnostic score for acute appendicitis. Chirurg 66:135–141
PubMed CAS Google Scholar
Enochsson L, Gudbjartsson T, Hellberg A et al (2004) The Fenyo-Lindberg scoring system for appendicitis increases positive predictive value in fertile women—a prospective study in 455 patients randomized to either laparoscopic or open appendectomy. Surg Endosc 18:1509–1513
Article PubMed CAS Google Scholar
Ohmann C, Franke C, Yang Q (1999) Clinical benefit of a diagnostic score for appendicitis: results of a prospective interventional study. German Study Group of acute abdominal pain. Arch Surg 134:993–996
Article PubMed CAS Google Scholar
Lameris W, van RA, van Es HW et al (2009) Imaging strategies for detection of urgent conditions in patients with acute abdominal pain: diagnostic accuracy study. BMJ 338:b2431

Download references

Open Access

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Author information

Authors and Affiliations

Department of Surgery, St. Lucas Andreas Hospital, Jan Tooropstraat 164, 1061 AE, Amsterdam, The Netherlands
S. M. M. de Castro, Ç. Ünlü, E. Ph. Steller, B. A. van Wagensveld & B. C. Vrouenraets

Authors

S. M. M. de Castro
View author publications
You can also search for this author in PubMed Google Scholar
Ç. Ünlü
View author publications
You can also search for this author in PubMed Google Scholar
E. Ph. Steller
View author publications
You can also search for this author in PubMed Google Scholar
B. A. van Wagensveld
View author publications
You can also search for this author in PubMed Google Scholar
B. C. Vrouenraets
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. M. M. de Castro.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

de Castro, S.M.M., Ünlü, Ç., Steller, E.P. et al. Evaluation of the Appendicitis Inflammatory Response Score for Patients with Acute Appendicitis. World J Surg 36, 1540–1545 (2012). https://doi.org/10.1007/s00268-012-1521-4

Download citation

Published: 24 March 2012
Issue Date: July 2012
DOI: https://doi.org/10.1007/s00268-012-1521-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Evaluation of the Appendicitis Inflammatory Response Score for Patients with Acute Appendicitis