Background

With about 60 million global infections and more than 1.5 million deaths at the end of 2020, the COVID-19 pandemic has radically changed the world [1]. Hospitals and healthcare systems had to face a significant number of infected patients needing treatment. Consequently, we significantly reduced surgical activity in the elective setting of about 30 million surgical procedures worldwide in a period of 12 weeks [2]. Nevertheless, surgeons cannot postpone emergency and oncological procedures. Therefore, they issued their recommendations [3,4,5] suggesting caution while performing surgery. Especially at the beginning of the pandemic, major surgical societies and colleges advised using non-operative management of appendicitis and recommended against laparoscopy [6, 7].

In the beginning, the focus was the safety of the operators [8]. Later, the potential worsening of SARS-CoV-2 pneumonia showed a high lethality rate, especially after general anaesthesia [2, 9]. Despite the growing consciousness of the phenomenon, very little attention focused on the effects of the delay in diagnosis and management of surgical diseases. A recent international web survey reported increased non-operative management of acute appendicitis during the COVID-19 outbreak [10].

The lifetime risk of acute appendicitis is 6.7% for women and 8.6% for males [11]. We estimated that about 300.000 patients in the U.S. undergo appendectomy annually, with a raw incidence of 98 cases/100,000 people [12]. Even though non-operative management is suitable for uncomplicated cases [13], laparoscopy remains the standard for treating appendicitis [14].

The study hypothesises that a significant reduction in the number of emergent appendectomies was observed during the pandemic, restricted to complex cases. The study aimed to analyse emergent surgical appendectomies during the pandemic on a national basis and compare it to the same period of the previous year.

Methods

The CRAC study (ChiRurgia Appendiciti COVID-19, COVID-19 Appendicitis Surgery) is a national multicentre, retrospective, observational cohort study to assess the surgical outcomes of patients undergoing an appendectomy. The study compares data collected in the 2 months of March–April 2019 with those of March–April 2020. The Italian Society of Endoscopic Surgery and new technologies (SICE) endorsed the study. The protocol obtained the approval of the Ethical Committee for Clinical Trials of Treviso and Belluno on May 7, 2020 (ID: license 883/CE Marca, Italy).

One hundred fifty eight surgical units of the 448 registered in the Italian Ministry of Health registry (35.3%) adhered to the study, and 113 (71.5%) contributed to sharing data. We collected data through a Google form. The study's primary outcome was the number of appendectomies performed during each of the two months, classified according to the American Association for the Surgery of Trauma (AAST) score [15]. Secondary outcomes were the type of surgical technique (laparoscopic vs open), the number of complications classified according to the Dindo–Clavien grading system, and the mortality at 30 days [16]. Inclusion criteria were age > 18 years and occurrence of emergent appendectomy not associated with other surgical procedures. We collected gender, age, year of surgery, and Appendicitis Inflammatory Response (AIR) score [17] for each included patient. We also analysed the delay of surgery after diagnosis (< or > 24 h) and the conversion rate from laparoscopy to open. We registered hospital stay, post-operative complications classified according to Dindo–Clavien, radiological or surgical re-intervention and mortality within 30 days.

Statistical analysis

Descriptive and inferential statistics

The descriptive statistics for continuous variables were reported as the median-interquartile range (IQR), while those categorical were absolute/relative frequencies. The inferential statistics, either the Mann–Whitney/Kruskal–Wallis or the Fisher’s exact test, was applied to continuous and categorical covariates, respectively. A complete set of the univariate and multivariate binary logistic regression model estimated the likelihood of a surgical complication occurrence (dependent variable in statistics, target in machine learning). We tested eight predictors (independent variables in statistics, features in machine learning) for their potential impact on complications: three continuous (age, AIR and AAST scores) and five categorical (gender, year of treatment, surgery timing, surgical technique and conversion to laparotomy) variables. AIR and AAST, despite their categorical, ordinal nature, were treated as continuous covariates due to the high number of levels (risk of over-parametrisation). We obtained all p values by the exact two-sided method at the conventional 5% significance level. Data were analysed as of February 2021 using R 4.0.4 packages lares version 4.9.12 and H2O version 3.32.0.4 [18].

Development and validation of ML models

The function h2o_automl of the R package lares was applied to access H2O for R, an open-source distributed machine learning (ML) platform [19, 20]. We trained six different supervised ML algorithms for binomial classification for target prediction (complications occurrence): GLM (Generalized Linear Model), GBM (Gradient Boosting Machine), XGBoost (Extreme Gradient Boosting machine), Distributed Random Forest (DRF), DNN (multilayer artificial Deep Neural Network) and NB (Naïve Bayes classifier) as well as two Stacked Ensemble models, one containing all the models, the second only the best from each algorithm class. We balanced the target in the training data via resampling for all models, and no missing-values replacement was needed (only four data were missing). We split the original dataset randomly for training into 80% training set and 20% test one. We used five-fold cross-validation to compare the classifiers to decrease the risk of model overfitting. We investigated the model's performance on the test set and identified the best prediction performance by the Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve.

Results

The database included 1541 patients with acute appendicitis who underwent surgery during the two study periods. According to the inclusion criteria, 1337 (86.8%) represented the study's cohort in the analysis. The study flow diagram is shown in Fig. 1.

Fig. 1
figure 1

Study flow diagram

We collected 546 patients who underwent surgery for acute appendicitis in 2020 vs 791 in 2019. Therefore, we observed a decrease in the appendectomy rate of 31%.

We summarised data about the cohort of patients stratified by year of surgery in Table 1.

Table 1 Patients’ characteristics: Cohort of patients stratified by year of surgery

The median age of the patients was 38 years (IQR 25–55), 37 years (IQR 25–64) for those operated in 2019 vs 39 years (IQR 23–65) for those operated in 2020 (p = 0.526).

The severity of appendicitis was higher among patients undergoing emergent appendectomy in 2020 vs 2019. This severity resulted in both the AIR score (p = 0.036) and the AAST score (p = 0.008).

The surgical technique used was laparoscopy in 1206 (90.2%) cases and laparotomy in 131 (9.8%). An open technique was used in 77 (9.7%) patients during 2019 vs. 54 (9.9%) during 2020 (p = 0.926). The conversion rate was 3.9% in 2019 vs 5.7% during 2020 (p = 0.147).

A thousand one hundred and eight patients underwent surgery < 24 h from admission (80.8%) while 229 (19.2%) > 24 h. In 2019, 146 (18.5%) patients had surgery delayed > 24 h from the admission vs 83 (15.2%) in 2020 (p = 0.122).

The median hospital stay was three days (IQR 2–5), the same as in 2019 (IQR 2–4) and in 2020 (IQR 2–5) (p = 0.722).

Complications were reported in 163 (12.2%) patients, while no complications occurred in 1174 patients (87.8%). Patients with higher AAST appendicitis scores had higher complication rates. We summarised the characteristics of the cohort of patients stratified according to the occurrence of post-operative complications in Table 2. The analysis of the complications according to the Dindo–Clavien grading system showed a statically significant increased rate of adverse events in 2020 (87 patients, 15.9%) compared to 2019 (76 patients, 9.6%) (p < 0.001).

Table 2 Patients’ characteristics: Cohort of patients stratified by complication occurrence

We observed post-operative complications in 99 patients (8.4%) operated with laparoscopy, compared to 32 patients (19.6%) operated with an open technique (p < 0.001), and 19 patients (11.7%) who had a conversion from laparoscopy to an open procedure (p < 0.001).

Further treatment was necessary for 69 (5.2%) patients within 30 days from the surgical operation, of whom 15 patients (1.1%) required a radiological intervention and 54 (4%) patients required further surgery.

Three patients (0.2%) died within 30 days after surgery due to sepsis. The mortality rate was 0.25% in 2019 and 0.2% in 2020.

Analysis of complications: univariate and multivariate models

We report the binary logistic regression model results in Table 3. In the univariate model series, all covariates played a critical role in complication occurrence, except for age and gender. The determinants that increased the likelihood of complication occurrence were undergoing surgery during 2020 (+ 64%), having a unit AIR (+ 10%) or AAST (+ 28%) increase, having waited for surgery > 24 h (+ 61%), undergoing open surgery (+ 124%) and converted to open surgery (+ 114%). The multivariate logistic model AUC was 0.720 and, when comparing it to that from ML modelling, the best AUC was obtained by the GLM model (0.724). Therefore, the ML approach fully confirmed the results obtained by the classical logistic model approach. In Figs. 2 and 3, the ranking of the features (variable importance) is plotted for the top ten models and the best one (GLM), respectively [20].

Table 3 Uni- and multi-variate binary logistic regression models for complication occurrence
Fig. 2
figure 2

Variable importance for the top ten ML models

Fig. 3
figure 3

Variable importance for the best ML model (GLM)

Discussion

The CRAC study collected data from a selected cohort of Italian hospitals and showed that in March and April 2020, the number of appendectomies has drastically dropped compared to 2019. During the first wave of the COVID-19 pandemic, patients undergoing surgery were more frequently affected by more severe appendicitis forms than the previous year's timeframe, according to both the AIR and the AAST scores. The present dataset is much more comprehensive than previous international and national surveys [8, 21, 22] and multicentre retrospective studies [23, 24].

How we should interpret these data is questionable. The reduction in the overall number of appendectomies performed may have several explanations. First, individuals affected by acute appendicitis may have renounced to approach emergency rooms in hospitals for fear of SARS-CoV-2 contagion. Simultaneously, we cannot exclude that surgeon restricted the indications for surgery to more severe cases, offering milder clinical cases the opportunity of conservative medical therapy at home. Most patients with uncomplicated appendicitis require active observation and pain control [25, 26], and this is what might have happened during the first wave of the pandemic.

Since the outbreak of the COVID-19 pandemic in Europe and the U.S., much ahead, it was possible to organise a SARS-CoV-2 vaccination modelling for safe surgery [27], several recommendations issued by surgical societies and institutional bodies have supported surgery's decision-making processes, including emergency scenarios. Although the overall level of evidence of such recommendations was low, there has been a significant impact of these documents on surgeons' daily clinical practice. Globally, during the first wave of the COVID-19 pandemic, recommendations on the treatment of acute appendicitis suggested the use of appropriate non-operative treatments whenever possible to avoid overloading hospitals, already heavily burdened by SARS-CoV-2 patients. Our findings align with the ACIE Appy international survey on the global attitudes in managing acute appendicitis during the pandemic. These showed a statically significant decrease in the number of acute appendicitis patients referred to the hospitals, with only 10% of surgical units reporting > 20 referrals per month [10]. In the present study, while the absolute numbers of cases AIR score > 8 and AAST > 3 remained constant between the two years, a significant reduction in the milder cases was observed.

The second recommendation focused on the technique to adopt. Initial guidelines recommended open appendectomy in case of intra-abdominal sepsis or non-resolving disease following antibiotic [28,29,30]. Former detection of activated viruses (corynebacterium, papillomavirus, HBV, HIV) in the surgical smoke suggested that SARS-CoV-2 might behave similarly [31]. This observation inspired suggestions to avoid the use of laparoscopy. We must consider such recommendations in the existing scenario. Here, European surgeons faced the viral spread during the first wave of the pandemic, together with the lack of ultrafiltration systems, personal protective equipment, surgical workforce and routine testing of patients. However, one year after identifying the SARS-CoV-2 infection in China, an active virus has not been isolated so far from the laparoscopic plumes within the peritoneal cavity of infected subjects.

Consequently, the potential of viral spreading during laparoscopy is not known. Based on clinical judgment, surgeons should prefer a laparoscopic appendectomy when resources are available. The safe performance of laparoscopic appendectomy allows for short hospitalisation. Data from the CRAC study showed that, in Italy, the rate of laparoscopic appendectomy performed in March–April 2020 was comparable to that in the same two months in 2019. This finding contrasts with the trend in favour of open appendectomy reported in other countries [32]. Laparoscopy should always be preferred as it provided, also in this study, a lower rate of complications compared to both open surgery and conversion to open surgery. The high rate of laparoscopy is a consequence of the effort to equip operating rooms with systems for the safe evacuation of laparoscopic plumes. This effort included first homemade systems [33], then the certified ones, following the adoption of the EAES guidelines [3, 34]. Even the rate of interventions converted from laparoscopy to open surgery did not increase significantly. This can be justified by the higher complexity of the cases, as reported by AIR and AAST scores.

As the severity of appendicitis among the operated patients significantly increased during the first wave of the pandemic, this correlated with increased complication rate and severity, but not mortality. Different from perception, the circumstances did not delay admission to the emergency room. Despite the apparent difficulties in hospitals' emergency area organisation, this did not affect the surgical activity's efficiency. Nevertheless, complications observed were significantly more in 2020, as well as their severity. We observed an increase in acute appendicitis complicated with phlegmon, abscess, or diffuse peritonitis. These correspond to grades 3–5 of the American Association for the Surgery of Trauma (ASST) classification. Here, patients presenting with perforated appendicitis increased from 30.3% in 2019 to 39.9% in 2020. The CRAC study showed that those patients undergoing surgery > 24 h after observation experienced more post-operative complications. This finding aligns with the United Kingdom National Surgical Research Collaborative study on a cohort of more than 2500 patients with acute appendicitis. Here, delaying appendectomy for over 48 h was related to a statically significant increased risk of surgical-site infection and 30-day adverse events [35]. Similarly, Alore showed that patients undergoing appendectomy three days after the admission had increased 30-day morbidity and mortality [36]. In our study, the rate of complicated appendicitis reported during the pandemic period of 2020 was higher than that usually found during 2019, and generally, in contemporary literature. Therefore, we argue that it may be reasonable to prioritise patients reporting symptoms lasting > 24 h for operative management.

We must interpret our results within the context of some limitations. First, due to the urgent need for evidence on appendicitis management during the first wave of the COVID-19 pandemic, we performed only a short-term follow-up. The study design did not allow us to assess post-operative visits after 30 days from the surgical intervention. As a consequence, we might miss long-term complications, such as adhesions and incisional hernias. Second, due to the observational design, the quality of data collected depended on the quality of medical records and the researcher's interpretation of charted notes. Third, there is a considerable variation in the organisation of the emergency surgical departments across the country [37, 38]. The most relevant source of bias is probably the heterogeneity of the various centres involved in diagnostic pathways. Fourth, most likely, the reduction in appendectomies performed corresponds to an increase in non-operative management. Unfortunately, we do not have data about the non-operative management of acute appendicitis cases. Ultimately, the study has a non-randomised nature associated with any extensive database. Conclusions from non-randomised studies can be misleading because there is always a chance for selection bias, leading to underestimating or overestimating the real intervention effect. On the other hand, our study's strength lies in the fact that we demonstrated that fewer patients sought medical attention during the lock-down due to the COVID-19 pandemic acute appendicitis in the analysis of individual patients' data in Italy. Here, complicated appendicitis rates increased, leading to a higher incidence of complications than in the past.

Conclusion

During the first wave of the COVID-19 pandemic, the number of appendectomies has drastically dropped in Italy. While patients with severe appendicitis remained constant, we observed a substantial reduction in milder cases undergoing surgery. Consequently, we observed a higher complication rate, despite mitigated by an unchanged high rate of use of laparoscopy.