Background

In 2020, the COVID-19 pandemic had a major impact on many aspects of life worldwide. The pandemic called for novel, unprecedented, and untested measures. The effects of these measures and the unintended consequences of COVID-19 and implemented policies will form the basis for studies for many years in the future even and if normality returns.

In many countries, the pandemic has caused an increasing burden on the healthcare system, which led to down-prioritization and delays in elective diagnoses and surgeries. The effects on emergency care, however, are less well understood. Due to its stochastic nature, the incidence of emergency pathologies should be the least influenced; however, lockdown measures have led to unexpected and unforeseen decreases in emergencies. Furthermore, there is reason to believe that emergencies were not immune to postponements and delays either. Acute appendicitis, a common emergency pathology, should in theory be less impacted than other emergency pathologies such as some forms of trauma (with decreasing incidence due to decreased social interactions during lock-down), or non-COVID chest infections (with decreased incidence due to increased hygiene measures, but increased diagnostics due to being a differential diagnosis to COVID-19 pneumonia), for example.

Multiple studies have been published regarding appendicitis during COVID-19; however, the results have been somewhat conflicting. Delay in diagnosis (on the part of patient/guardian and/or health care) can lead to higher frequency of complications. There is no exact consensus on what constitutes complication in appendicitis. Perforation signs (such as wall-defect, non-regional fibrino-purulent exudate, abscess, and extra intestinal faecal/bowel material are usually considered the most common complications seen at surgery [1]. These signs correspond to defects in mucosal enhancement or wall of appendix, extraintestinal air (local or free), and abscess formation on CT, which we adopted for this study. Most studies have pointed out a decrease in the number of uncomplicated appendicitis but similar number of complicated, suggesting spontaneous resolution of uncomplicated cases. Others have noted the consequent increase in the proportion of complicated cases and in some cases also an increase in the absolute number (prevalence) of complicated cases coupled to a longer prehospital delay [2,3,4,5,6,7].

Normally, appendicitis and its complications require a surgical treatment. Especially during the COVID-19 pandemic, due to many factors, there has been a push to conservative treatment [8], which, in turn, has relied more on an image-based diagnosis. CT (computed tomography) represents the best diagnostic modality in terms of availability and accuracy [9]. The pooled sensitivity and specificity of low-dose CT are 96% and 93%, respectively, while the same figures for standard-dose CT are 96% and 92%, respectively, as reported in a recent meta-analysis [10].

Telemedicine Clinic Ltd. provides out-of-hours emergency teleradiology reporting to a large number of client hospitals in several European countries and has therefore unique access to vast volumes of emergency CT cases. These cases are reported by board-certified, trial-case-selected radiologists with more than 5 years’ experience in emergency radiology.

In this study, the goal was to compare the incidence of appendicitis and its complications (perforation and abscess) in a large cohort of emergency abdominopelvic CT reports obtained in two different European countries during the first months of the COVID-19 pandemic and compare it to the corresponding pre-pandemic period.

Materials and methods

Case selection

All emergency, out-of-hours abdominopelvic CT scans reported by Telemedicine Clinic Ltd. during April to June 2019 and 2020 for the UK and Sweden were collected. Cases were electronically sent to from 56 hospitals in the Sweden and 38 hospitals in the UK. Out-of-hours was defined as 10 pm to 7.30 am local time for Sweden everyday, and 5 pm to 9 am local time for the UK. These hours are based on the needs of the client hospital and their contracts. Additionally, for the UK, data from July 2019 and 2020 were collected. The hospitals are listed in the Supplementary Table.

Cases which were performed out-of-hours but not on an emergency basis were excluded. The initial cohorts consisted of 11,395 and 9975 emergency reports in 2019 and 2020, respectively (Fig. 1).

Fig. 1
figure 1

Flow chart of cases

The reports were filtered for the keyword “app” which narrowed the cohort to a total of 7612 cases. Reports including a history of appendectomy, no mention of appendix or appendicitis in the report, clear unequivocal denial of appendicitis, or description of a normal appendix were excluded by the case selector, author MRT.

Case allocation and reader assessment

A total of 1547 reports were the core of the further analysis. Electronic reports were anonymous and did not show any dates including the date or time of the scanning. The reports were then coded and divided randomly among four senior, board-certified surgeons and radiologists (two each). Each reviewer received 280 unique reports (that only they assessed) in addition to 427 common reports (which were assessed by all readers). The readers were blinded as to which cases are the common or unique ones. The common cases were used for the assessment of agreements.

The readers were asked to score the reports from 1 to 5 regarding the diagnosis of appendicitis, appendiceal perforation, and abscess where 1 was considered definitely negative for diagnosis, 2 probably negative, 3 unclear, 4 probably positive, and 5 definitely positive. For further analysis, scores 4 and 5 were considered positive, while scores 1 and 2 were considered negative, and score 3 was considered unclear.

Measurement of agreement and further analysis of common cases

The reports were divided into three groups according to scores: 1 + 2, 3, and 4 + 5. Fleiss’ Kappa was used for inter-rater agreement levels defined as 0.20 or less as slight agreement, 0.21–0.40 fair agreement, 0.41–0.60 as moderate agreement, 0.61–0.80 as good or substantial agreement, and 0.81 and above as excellent or perfect agreement.

The final score was based on the most scores received if scored the same by three or four readers. In other instances, the reports were presented to a fifth reader, an emergency radiologist (FL) to assign a final score. FL had access to the scores assigned by the four scorers but was blinded to who the reader had been for each score.

Final analyses

The final analyses were done only on reports with probable or definite diagnoses of appendicitis (scores 4 and 5). In these cases, the demographics (age, gender), the prevalence of perforation, and abscess both in terms of proportion (number of complicated cases divided by the number of appendicitis) and the absolute number (prevalence) of complications were compared between 2019 and 2020. The statistical analysis was done for differences between years with X2 when applicable. A p-value of < 0.05 was considered statistically significant.

Ethical aspects

All data regarding patient identification were anonymized prior to extraction, and therefore, ethical approval for the study was not required. There are no individual data in any form. All datasets could be deposited upon request.

Results

Demographics

The mean patient age in 2019 was 49.7 (range 3.2–95.9, median 50.3, standard deviation 18.5) and in 2020 was 47.5 (range 2.7–93.8, median 46.7, standard deviation 18.3). Although patients seemed to be slightly younger in 2020, the difference was not statistically significant. In the UK, the youngest ages of patients were 16.3 and 14.0 years old in 2019 and 2020, respectively. In the Sweden, however, children as young as 3.2 and 2.7 had been imaged with CT. Otherwise, no difference could be observed between the countries and the years.

The female/male ratio was 52.5/47.5 in 2019 and 48.5/51.5 in 2019 and 2020, respectively, with no significant difference (p-value = 0.68).

Number of out-of-hours abdominopelvic CT scans with query appendicitis

Fewer out-of-hours abdominopelvic CT scans with query appendicitis were performed in 2020 compared to 2019, in both Sweden (6111 vs 6433, p-value < 0.001) and UK (5591 vs 7223, p-value < 0.001) (Fig. 2).

Fig. 2
figure 2

Number of acute CT abdomen and pelvis performed

The final finding of appendicitis according to reporting scores is presented in Table 1.

Table 1 Final findings of appendicitis in 2019 and 2020

Inter-rater agreement

The interobserver agreement (Fleiss’ Kappa) level for the presence of perforation and abscess were 0.64 and 0.77, respectively, which were considered a good agreement.

Sweden

The detailed results are shown in Table 2 and Figs. 3 and 4. On average, 91 patients were diagnosed with appendicitis each month in 2020, with the lowest number in April and the highest in June.

Table 2 Res ults of CT-based diag nosis of appendicitis, p erforation, and abscess formation each month in 2019
Fig. 3
figure 3

Swedish results for 2019 and 2020

Fig. 4
figure 4

Percentage of Swedish patients diagnosed with perforations and abscess formations in 2020 vs 2019

In 2020, both the mean proportion of appendicitis patients presenting with perforation and abscess (p values 0.02 and 0.02, respectively) as well as the mean prevalence (absolute number of patients) with perforation and abscess were significantly higher (p-value 0.004 and < 0.001, respectively) compared to 2019 (Fig. 4).

The UK

The detailed results are shown in Table 2 and Figs. 5 and 6. On average, 65 patients were diagnosed with appendicitis each month in 2020, with the lowest figure in April and the largest in July.

Fig. 5
figure 5

UK results for 2019 and 2020

Fig. 6
figure 6

Percentage of patients diagnosed with perforations and abscess formation during 4 months in UK

Unlike Sweden, when compared with 2019, the number of CT-diagnosed cases of appendicitis diagnoses in 2020 was lower in the UK (p < 0.001).

In 2020, both the mean proportion of appendicitis patients presenting with perforation and abscess (p values 0.02 and < 0.001, respectively) as well as the mean prevalence (absolute number of patients) with perforation and abscess were significantly different (p-value 0.007 and < 0.001, respectively) compared to 2019 (Fig. 6). However, unlike Sweden, these figures were initially lower but increased in July.

Discussion

This observational study demonstrates some similarities and differences between the UK and Swedish patterns in the number of diagnosed cases of appendicitis and its complications. In both countries, the COVID-19 outbreak started in January 2020, and in the present study, we can detect a much lower number of performed acute CT of the abdomen in April 2020 than the corresponding month in 2019. In the Sweden, there was an increase in the number and proportion of patients diagnosed with complicated CT-verified appendicitis (perforation/abscess) in May and June 2020 compared to April 2020.

In the UK, the results were not homogenous during the observation period. The decrease in number of abdominopelvic CTs persisted until July. Initially, both the number of cases with appendicitis and complications decreased and this trend continued for several months. The proportion of cases with complications were also initially lower in 2020; however, the number and proportion of patients with abscesses were dramatically increased in July 2020.

Our study illustrates some findings not previously shown. One is that figures and patterns from different countries might be different. This could well be due to public measures and is beyond the scope of this article. Another important point is the temporal course of complications. Especially looking at the data from the UK, one might get different results if different time intervals are chosen for the observation.

Several studies have published results showing an increased proportion of complicated cases in the COVID-era [2,3,4, 8,9,10]. However, some studies have found only reductions of uncomplicated cases [5, 11,12,13,14,15,16] or no change [17]. Several of these latter studies are registration studies having the advantage of having more data from more centres. They are prone to problems with registration studies however. It is conceivable that investigator/physician-initiated studies such as our own might be triggered by observed clinical trends. Our study has some points of strength not seen in prior studies. It is the largest study on actual cases and not registries. It is also the only study encompassing two countries and many hospitals. Additionally, our study is probably the only study where the date of examinations was blinded, thus less likely to bias.

The results of our study are based on CT exams. CT is more accurate than any presurgical examination or test for diagnosis of appendicitis and detection of its complications. Although CT results are probably not as perfect as surgical exploration, surgical explorations are not always offered to all patients especially during COVID [18]. One caveat is that CT, similar to surgery, is not performed on every patient. In order to decrease this confounding factor, we chose to compare the same hospitals and countries during 2019 and 2020. We had access to other countries that were not the same in 2019 and 2020 and decided not include them. We cannot rule out that a higher percentage of patients suspected of appendicitis underwent CT during 2020 [19]. The use of teleradiology CT activity has been shown to correlate with COVID-19 epidemiology [20]. CT however provides another advantage not seen with surgery, i.e., the dates can be blinded and thus avoid some bias.

There are multiple reports of delay in diagnosis and treatment of patients with chronic diseases, including cancer [21, 22]. The increased mortality and morbidity from delay in diagnosis and treatment may only emerge at a later stage [23,24,25]. Ostensibly, emergency health care should be more dependant on the individual patient’s health and choice rather than on the pandemic. Evidence points out similar trends elsewhere. For instance, fewer acute coronary presentations but a higher proportion of severe cases [26].

Full and severe lockdown measures have not been associated with lower COVID-19 mortality [27]. One of the interesting aspects concerning Sweden was its rather different approach to lockdowns. The Swedish approach was somewhat different from many other countries. It relied more on the informed consent of its citizens rather than the use of coercion and mandates. It, therefore, implemented milder forms of lockdowns, in the hope that the population would show more compliance with healthcare measures in the long term. The UK adopted a similar approach initially but soon changed course. As with other observational studies, it is difficult, if not impossible, to evaluate causal relationships [28]. Although lockdowns should not in theory lead to fewer people being able to seek emergency health care, i.e., system delay (although some data as well anecdotal evidence suggest otherwise [16]), it may have led to increased anxiety and fear among patients [29] which in turn might have impacted patients seeking medical attention in time (patient delay).

Our study also demonstrates a different trend in appendicitis and its complications between the UK and Sweden. It is conceivable that in both countries patients with appendicitis sought medical attention later in 2020 than in 2019. However, the degree of delay was more pronounced in the UK. It could be that in the Sweden delays were in the order of hours and days, and thus, the rise of complication rate was evident early. In the UK, we suspect that patients were imaged much later, perhaps weeks later. This led to a significant drop in the number of diagnosed cases first, but as time progressed the complicated cases started coming in. Whatever the reason for such different patterns, it should be a reminder of the importance of timing of an observation.

Interestingly, the pattern in April 2020 is similar to 2019 in the UK. This might have been due to UK being hit by COVID-19 later than Sweden leading to initially very few diagnosed, largely uncomplicated cases.

Our observational study has some limitations; some of which have been addressed briefly above. One is that our studied period does not encompass the whole 24 h period. We were unable to retrieve data during normal working hours and therefore unable to include them. To mitigate this issue, we did study the same hours of operation in 2019 and 2020, but a potential bias can still remain. Another issue is reliance on CT and its reports. We did not have access to data from patients who did not have CT (i.e., who underwent ultrasound only or went directly to surgery). This coupled with reliance on CT reports—albeit blinded to date of exam—can create other potential biases.

Conclusion

In both countries, there was an increasing number and proportion of complications toward the end of the observation period; however, there was variation between observed time period and countries.