Introduction

The quality assessment of surgical procedures is becoming one of the priorities for different stakeholders involved in the healthcare system. Such assessment is crucial for any type of surgical procedures but for highly complex surgical procedures, such as hepatobiliary surgery, is even more crucial considering the associated morbidity and mortality risks [1,2,3]. As supported by a long-standing body of the literature, the centralization of complex surgery serves to increase the quality of care following the principle that more experience resulting from a larger number of patients treated is directly associated with better surgical outcome [4]. However, there is still no agreement among experts on precise criteria on what to centralize, where to centralize, and who should be entitled to perform complex procedures such as hepatobiliary surgery [5]. Besides, there is still no agreement on which outcome measures should be considered to assess quality and safety in surgery [6].

With the aim to refresh the discussion on quality assessment and centralization in liver surgery, here we sought to report the trend of outcome of liver surgery in Italy by using the annual cases volume collected by the “Piano Nazionale Esiti” (PNE) in 2023 [7], with the focus on the effect of hospital volume on surgical outcome. Moreover, we offered a narrative review on the centralization concept, and on different outcome measures that can be used as quality and safety metrics in liver surgery.

Methods

Study definition and source of data

This was a nationwide retrospective study of patients who underwent liver procedures for malignant liver tumors in Italy from January 1, 2022 to December 31, 2022. Data were obtained from the anonymized records of National Italian Registry PNE 2023, which is the last published by the Italian National Agency for Regional Health Services [7]. PNE collects information from all the Italian hospitals, both public and private, using the hospital discharge forms, based on the International Classification of Diseases, Ninth Revision (ICD-9-CM) [8]. The study was developed and presented according to Strengthening the Reporting of Observational Studies in Epidemiology [9].

Variable of interest and outcomes

PNE data included the case volume of liver resection per hospital, both public and private, from January 2022 to December 2022 and the crude 30-day mortality rate, meaning the death during the hospitalization and/or within the first 30 days after operation for patients submitted to liver resection from January 2020 to December 2022. Included hospital were classified according to:

  • The twenty Italian administrative regions [10].

  • The three Italian macro-regions [10]: northern, central and southern Italian regions.

  • The case volume during 2021 according to the definition proposed by the by Torzilli et al. [11] that identify three categories: (a) high volume (H-Vol), meaning more than 100 resections per year; (b) intermediate volume (I-Vol), meaning 51–100 resections per year and (c) low volume (L-Vol), meaning 21–50 resections per year. An additional category of hospital performing 20 or less liver resections, defined as very-low volume (VL-Vol) was added to the present analysis (10).

The primary endpoints of this study were:

  1. a)

    the case volume in the year 2022;

  2. b)

    the crude 30-day mortality rate in the available study period 2020–2022;

  3. c)

    the elaboration of a narrative review on centralization surgical volumes and outcome measures.

Snapshot on cancer center centralization and outcome measures in liver surgery

A snapshot on cancer center centralization and outcome measures was prepared by using Medline to identify relevant articles published before the 31st of July 2023, using a combined text and MeSH search strategy. The search terms included centralization, volume, post-operative mortality, morbidity, benchmarking, textbook outcome, liver surgery, liver resection and hepatectomy. The search was limited to articles published in English in the last 10 years, and it was further broadened by extensive cross-checking of all the references in the articles retrieved to identify eventual additional non-indexed literature.

Statistical analysis

Baseline characteristics of the study population were expressed as absolute numbers and relative frequencies measurements for qualitative variables, whereas continuous variables were presented as means with standard deviations (SD) if normally distributed, and non-normal variables were presented as medians with interquartile ranges (IQR). Statistical analysis was performed using the χ2 test for comparison of categorical variables and the Student’s t and Wilcoxon signed-rank tests for normally and non-normally distributed continuous variables, respectively. A P value < 0.05 (two-tailed) was considered statistically significant. All the statistical analyses were performed using Stata version 16.0 (StataCorp) [12].

Results

Snapshot from “Piano Nazionale Esiti (PNE) 2023”

In 2022, 6,126 liver resections for liver tumors were performed in 327 hospitals located in the 20 Italian administrative regions (Tables 1 and Fig. 1). There were 14 H-Vol centers, 19 I-Vol, 31 I-Vol and 263 VL-Vol centers. The total amount of procedures performed was 6,126 and the crude 30-day mortality rate for liver resection performed from 2020 to 2022 was 2.2% (Fig. 2).

Table 1 Snapshot from “Piano Nazionale Esiti (PNE) 2023”. Italian hospitals performing liver surgery according to the 20 administrative regions: volume, procedure performed and outcomes
Fig. 1
figure 1

Map chart displaying the number of Italian hospitals performing liver surgery, along with the identification of regions categorized by high, intermediate, low, and very low surgical volume

Fig. 2
figure 2

Map chart illustrating liver surgeries conducted across Italian regions in 2022, coupled with a representation of the 30-day crude mortality rate

Procedures performed and outcomes according to the three macro-areas

When considering the three Italian macro-areas, differences in the number of liver resections performed become evident. As outlined in Table 2, in the year 2022, 3,504 (57%) procedures took place in 150 (45.7%) hospitals in northern Italy, 1,397 (22.8%) in 82 (22.8%) hospitals in central Italy, and 1,225 (199.9%) in 96 (29.2%) hospitals in the southern Italy. Interestingly, the numbers of H-Vol and I-Vol hospitals were notably higher in the norther regions with 19 centers in the northern, 7 in the central and 7 in southern Italy. Besides, considering the inhabitants per macro-area (northern Italy ∼27 million; central Italy ∼13 million; southern Italy ∼18 millions) the distribution per inhabitant of H-Vol and I-Vol centers are significantly different: 1 center per 1,42 million of inhabitants in the north versus 1 center per 1,85 million of inhabitants in the center versus 1 center per 2,57 million of inhabitants in the south of Italy.

Table 2 Liver surgery according to the three Italian macro-areas: volume, procedure performed and outcomes

Notably, the hospital situated in the northern regions exhibited a significantly lower crude 30-day mortality rate in comparison with the central and southern regions (1.6% vs. 2.5% vs. 3.2%, respectively, p < 0.001).

Procedures performed and outcomes according to hospital volume

A total of 14 (4.2%) centers were categorized as H-Vol, 19 (5.8%) as I-Vol, 31 (9.5%) as L-Vol, and 264 (80.4%) were classified as VL-Vol (Table 3). The distribution of liver resections performed in 2022 were as follows: 2,517 (41.1%) in H-Vol units; 1,292 (21.1%) in I-Vol units; 1,006 (16.4%) in L-Vol units and 1,311 (21.4%) in VL-Vol. Remarkably, still, in 2022, 2,317 (37.8%) procedures took place in L- and VL-Vol units. Notably, out of the 264 VL-Vol units, 220 (83%) carried out ≤ 10 resections, 183 (69%) performed ≤ 5 resections, and 78 (29%) conducted only 1 operation.

Table 3 Liver surgery according to hospital volume: procedures performed and outcomes

Interestingly, the median crude 30-day mortality rate stood at 1.7%, 2.2%, 2.6% and 3.6% for H-, I-, L- and VL-Vol centers, respectively (p < 0.001). Notably, VL-Vol hospitals exhibited an almost two-fold mortality rate compared to H-Vol centers (3.6% vs. 1.7%, p) (Fig. 3). This disparity was even more pronounced for centers that performed only 1 resection, where the mortality rate was 4.6%.

Fig. 3
figure 3

Mortality rate by volume of liver surgery in Italy during 2022

Literature review on centralization, hospital volume and surgeon volume

Centralization

Centralization is defined as a process of concentration of resources, including staff, materials, infrastructures, knowledge, and expertise to enhance the quality of care achieving better outcomes. Consistently, centralization can be identified as the process that leads to the transfer from L-Vol centers to H-Vol centers, those complex and/or risky procedures that require more experiences, more resources (human, technical, structural) that are supposed to be insufficient in L-Vol centers.

Currently, in Italy the Regional Health Systems is based on the “hub-and-spoke” model [13]. Formally defined, the hub-and-spoke is a model that arranges service delivery assets into a network consisting of a leading center (hub) offering a full array of services, complemented by some secondary centers (spokes) that offer more limited services redirecting to the hub patients needing intensive care assistance [13]. Notably, this model is the one that is believed to be able to solve important disparities that limit patient access to specialized cures, such as those related to geographical territorial complexities, health professionals and technological resources which should contribute to limiting patient migration.

To date, there is a large gap between what is the hub-and-spoke model, as it is supposed to be, and what is in the real clinical practice: which cases or which pathologies are for hub and which are for spoke are, in general, not oversight by any local or central agencies. In 2018 a special committee from the European Surgical Association published a landmark paper [14] that listed twelve recommendations for the development of centralization strategies (Supplementary Table 1). To the best of our knowledge, these recommendations have not yet been fully considered nor applied by government agencies.

Surgical volumes for liver surgery

Hospital volume

Considering that the centralization of major cancer surgery in hospitals with higher annual volume of procedures significantly reduces the risk of perioperative morbidity and mortality, as reported in the literature [11, 15,16,17], most of the efforts done so far were limited to listing the main hospital requirements. Based on the landmark paper by Torzilli et al. [11], three different types of requirements were identified for hepatobiliary surgery: hospital requirements, volume requirements, organization requirements:

Hospital requirements

According to experts and to the current national regulation “Regolamento di definizione degli standard qualitativi, strutturali, tecnologici e quantitativi relativi all’assistenza ospedaliera” (Italian Law 135/2012), the hepatobiliary unit should be at least in a first level hospital that should have a series of units/services listed in Supplementary Table 2.

Moreover, H-Vol centers should either have a surgery unit dedicated to hepatobiliary surgery or a team of surgeons specialized in hepatobiliary surgery within a general/digestive surgery department.

Volume requirement

Every Hepatobiliary Unit, either as a Unit or as a team of dedicated surgeons inside a General/Digestive Surgery Division, should perform a minimum case volume per year. While no agreement exists on such a number, Torzilli et al. [11] in 2016 stated that at least 20 liver resections per year should be done for malignant diseases with 90-day mortality rate below 3%. Besides, three categories were identified according to the case volume: (i) L-Vol, meaning 21–50 resections per year; (ii) I-Vol, meaning 51–100 resections per year; (iii) H-Vol, meaning more than 100 resections per year. H-Vol centers should also be recognized as “referral units”, considering the high complexity of pathology, and as “units of excellence” when more than 5 scientific articles per year are published (6).

Organizations requirements

Hospitals should have Diagnostic-Therapeutic Flow Charts dedicated for patients with liver tumours. These charts should include any different type of liver disease and tumour, either primary or metastatic. Finally, importance for the multidisciplinary team assessment should be given, including physical space and quality working hours on a weekly basis.

The PNE data solely records hospital volume, without providing additional information on hospital and organisational requirements. This limitation precludes further interpretations.

Surgeon volume

Although hospital volume and surgeon volume are expected to correspond, this is not always the case in real-world practice. In hepatobiliary surgery, the relative importance of hospital volume versus surgeon volume is very important because both short- and long-term outcomes are dependent on hospital factors, such as the presence of an intensive care unit, and surgeon factors, such as operative technique. However, controversies exist among experts. Nathan et al. [3] showed that the protective effect of hospital hepatic resection volume persisted after case-mix adjustment for competing risk factors, while that was not the case considering the surgeon hepatic resection volume. Indeed, high- and low-volume surgeons had comparable in-hospital mortality rates after hepatectomy [3]. It is, anyway, a matter of fact that the experience of the surgeon represents a very important factor. Certainly, a better understanding of how the surgeon’s experience influences outcomes could help to develop specific healthcare strategies for improving the quality of care of hepatobiliary patients. One of the strategies could be the certification of the learning curve, that is the minimum number of procedures required to become fully proficient on a given specific hepatobiliary procedure. After that, the minimum case volume per year could become less important.

In relation to the PNE, it should be noted that there is no reporting of data on surgeons’ volume.

Literature review on outcome measures: from the crude mortality rate to composite outcomes for post-operative complications

The mortality rate represents the number of deaths within a specific population during a defined time period. The crude mortality rate encompasses all causes of death within a given time interval for a particular population. On the other hand, the risk-adjusted mortality rate takes into account predictors of mortality, making it a more refined measure. Recently, with the enhancement of surgical safety and perioperative care, research emphasis has shifted from solely reducing mortality to also addressing postoperative complications and ensuring a satisfactory quality of life [18].

Postoperative complications signify deviations from the anticipated recovery trajectory following a surgical procedure. Various classification systems for postoperative complications have been proposed, as depicted in Table 4.

Table 4 Studies proving grading systems for post-operative complications

Clavien and colleagues were the pioneers in proposing a classification for adverse post-operative outcomes, which garnered broad acceptance. In 1992 they introduced a standardized system or T29 score [19], later modified in 2004 by Dindo [20] resulting in the widely known and accepted Clavien and Dindo classification (CDC). Both systems were grounded in the patient’s health status and intervention requirements following surgery. While subsequent standards have emerged including the Complexity- and risk-adjusted model [21], the Memorial Sloan Kettering Cancer Center’s (MSKCC) grading system [22] and the Accordian severity system [23], the CDC remains the most prevalent method for grading post-operative complications. Nonetheless, a limitation of the CDC is its focus on the most severe complication experienced by the patient, disregarding less severe events and failing to capture the true overall complexity of adverse post-operative outcomes. To address this, the Comprehensive Complication Index (CCI) was developed in 2013 [24]. This index aggregates the complete burden of post-complications, considering their severity, into a unified score ranging from 0 to 100.

Furthermore, assessing the restoration of a good quality of life often employs quality of life questionnaires, such as the European Organisation For Research And Treatment Of Cancer (EORTC) Core Quality of Life questionnaire (QLQ-C30) [18]. These questionnaires are among the most widely accepted tools for evaluating quality of life.

To stay on PNE data, it records only 30-day mortality rates, which per se does not allow further interpretations. Moreover, 30-day mortality in liver surgery usually underestimates the mortality after hepatectomy, which should be recorded at 90-day [25].

Benchmarking

In addition to the traditional outcome measures, hospitals and healthcare systems are progressively adopting composite tools designed to assess and enhance the quality of care. Benchmarking, characterized as a “continuous process of measuring products, services, and practices against the toughest competitors or those companies recognized as industry leaders” [26], serves as a quality improvement mechanism. It gauges the optimal attainable results within a group of well-defined, low-risk patients to establish meaningful reference values (benchmarks) for comparing outcomes [27]. The primary objective is to determine the most favourable achievable real-world postoperative outcomes [28]. Benchmarking has gained traction to evaluate and elevate the quality of care for patients undergoing liver resections [29,30,31,32,33,34,35,36,37] (Table 5).

Table 5 Studies on benchmarking in liver surgery

The crucial steps in establishing a valid benchmark encompass: (a) selecting the intervention to be benchmarked; (b) identifying patient criteria that enable the selection of candidates; (c) defining specific key outcome indicators (benchmarks); (d) identifying eligible centres and patients; (e) calculating the benchmark values. One of the most widely accepted analytical strategies for defining benchmark values is the “Best Centre, Best Patients” (BCBP) approach. This involves selecting the best centre for treating a specific disease and focusing on low-risk patients. The benchmark values are then determined by calculating the 75th percentile for each centre based on specific outcomes. [29,30,31, 34]. An alternative analytical approach is the Achievable Benchmark of Care (ABC) method, which involves identifying the benchmark as the performance attained by the top 10% of providers, adjusted for the number of patients each provider treats [32, 33].

Textbook outcomes

Another comprehensive multidimensional composite outcome indicator that encompasses the entirety of the surgical care process is the textbook outcome (TO). The TO strives to encapsulate the concept of an ideal “textbook” hospitalization, signifying patients who do not experience adverse outcomes following complex surgical procedures [38]. While the definition of textbook outcomes may differ based on the surgery type, it typically encompasses patients who do not encounter mortality, severe complications, readmission, and exhibit favorable surrogate oncological parameters. Numerous recent studies have focused on evaluating textbook outcomes in liver surgery [36, 38,39,40,41,42]. In 2021 Görgec et al. [39] published the findings of an international multicentric clinical study on textbook outcomes (TO) in liver surgery. The TO indicators encompassed the absence of: intraoperative events ≥ grade 2 based on the Oslo classification [43]; postoperative bile leak grade B or C according to the International Study Group of Liver Surgery classification [44]; severe complications ≥ grade III according to CDC [20], in-hospital mortality, postoperative reintervention, readmission, and the presence of R0 resection margin. A recent systematic review [42] provides an overview of the contemporary international experience with TO in evaluating surgical performance after liver surgery. The review suggests that TO serves as a unified composite metric that may offer a more patient-centered approach and is better suited for quantifying optimal care and facilitating performance comparisons among centers conducting liver surgery.

Quality performance indicators

It’s important to note that while quality and safety are related, they are not synonymous. Safety pertains to preventing negative outcomes, whereas quality involves achieving positive outcomes. As previously mentioned, numerous factors contribute to safety, and in the recent times, the assessment of the quality of a specific surgical procedure is gaining increasing attention from international surgical associations. Woodhouse et al. [45] have recently formulated a set of globally accepted quality performance indicators (QPIs) for hepato-pancreato-biliary procedures. Through a modified Delphi process, three rounds of consultations were conducted with working groups comprising members of the International Hepato-Pancreato-Biliary Association (IHPBA). The final set of QPIs encompasses three categories: structure, process, and outcomes. A total of seven “core” indicators were unanimously agreed upon for liver, pancreatic, and complex biliary surgery, as outlined in Supplementary Table 3. Furthermore, an additional six procedure-specific QPIs were suggested for liver and complex biliary surgery, along with three for pancreatic surgery. These QPIs can be employed to measure and monitor the entire global process of liver surgery at an individual, unit, institutional, and/or jurisdictional level. They encompass not just clinical outcomes, but also structural and procedural characteristics. In this way, they encourage ongoing advancement and enhancement of safe and high-quality hepato-pancreato-biliary surgery on a global scale.

Discussion

The increased demands for quality assessment in liver surgery together with the expected increasing incidence of primary liver tumors in the next few years [11, 45], justified the need to conduct research studies of this type.

The aim of this study was to provide a snapshot of the current trends of outcome of liver surgery in Italy using the data from PNE 202, which refers to the year 2022 that is the last available. By looking at those numbers, it is clear that the centralization of liver surgery in Italy is far from being operational: 14 H-Vol centers performed 41% of the liver resections while 263 VL-Vol centers perform 21% of those cases. Twenty different regional health systems under the National health system should provide more H-Vol centers, at least one per region, and certainly should work to limit the dispersion of a handful of cases in these VL-Vol centers. Based on that snapshot, it appears clear that the hub-and-spoke model does not work.

Consistently, the mortality rate in these VL-Vol centers was almost two-fold the mortality rate of the H-Vol centers (3.6% vs. 1.7%). Of note, 3.6% of 30-day mortality rate could apparently be considered adequate in liver surgery except that VL-Vol centers almost surely performed small, limited resections that, probably, could have been at lower morbidity and mortality risks if they would have been performed in H-Vol centers. In other words, it is likely that H-Vol performed complex cases while L- and VL-Vol centers did not. Besides, there is a trend of association of mortality that decreases by passing from very VL-Vol to L-Vol, then from I-Vol to H-Vol centers. These results are consistent with previous studies, indicating the relationship of hospital volume with postoperative mortality [5, 11, 17]. Unfortunately, PNE data do not include hospital and organisational data, patients’s characteristics, data on morbidity, type of complications details, failure to rescue and 90-day mortality rate that, if available, would allow more analyses. In particular, 90-day mortality should be used instead of 30-day mortality in hepatobiliary surgery to catch, for instance, those cases of post-hepatectomy liver failure that may become irreversible more than one month after the operation [46]. It is not surprising, however, that when 90-day mortality is available the association between hospital volume and outcome still remains significant. Guglielmi et al. [5], in fact, recently reported about the trends in hospital volume and mortality in hepatobiliary surgery in the Veneto region confirming an increased and significant risk of 30- and 90-day mortality in L-Vol centers.

While the metric of quality is a complex process of which a given threshold case volume is just a proxy measure, it is clear that the mortality risk after hepatectomy in Italy decreases when more than 20 resections per year are considered (3.6% vs. 2.6%). Indeed, Dimick et al. [47] already reported that those hospitals that performed more than 20 liver resections per year had significantly lower mortality rate (3.9% vs. 7.6%) even at L-Vol hospitals. The same result was reported for minimally invasive liver surgery by Van der Poel et al. [48], who showed that when more than 20 minimally invasive liver surgery per year are performed the risks of conversion and complication significantly decrease. Similarly, Ardito et al. [49] reported that failure to rescue, that is the mortality after postoperative complications, was lower in H-Vol centers compared to L-Vol centers indicating how the case volume is a measure of the experience of the surgeons team in identifying and treating complications that, if unrecognized, could lead to death.

What should not be underestimated are the social and economic implications of an unequal distribution of high-volume (H-Vol) and possibly more efficient healthcare centers across the Italian territory. In fact, a significant number of patients are willing to migrate to other regions in search of more efficient healthcare facilities where waiting times before an interventional procedure are usually shorter, and outcomes appear to be better. According to the 2020 AGENAS [12] report on healthcare mobility, healthcare expenditure is strongly impacted by healthcare migration, which continues to occur from southern to northern regions. In fact, 97% of the positive balance goes to the coffers of Lombardy, Emilia Romagna, Veneto, and Tuscany (with 697.6, 338.4, 142.9, and 125.6 million euros, respectively), while 76% of the negative balance is carried by Puglia, Sicily, Lazio, Calabria, and Campania (with 192.3, 212.8, 215.9, 280.5, 319.7 million euros, respectively). This aspect should not be underestimated when analyzing the global outcomes of a surgical procedure such as liver surgery at both the national and regional levels.

The centralization process of liver surgery in many countries including Italy should be redesigned. New ideas should be given. One could be to use a twin-track approach, which could save quality and safety for patients and, at the same time, it could preserve the health care professionals that work in L-Vol hospitals: complex cases, either for tumoral presentation, for patient’s status or for surgical approach required, should be centralized in H-Vol centers (hub) while standard cases could be decentralized in I- or even L-Vol centers (spoke) performing more than 20 resections per year. VL-Vol centers (1,351 resections in 2021) should be regulated. By using this twin-track approach more safety and quality in liver surgery should be warranted. Importantly, an efficient and oversight hub & spoke model would also warrant the education and training process of young surgeons, which should be rethought [50].

In the lack of standard outcome measures, which would allow more truly comparison among centers, we here proposed a narrative review on the current outcome metrics adopted in liver surgery aiming to convince the different involved stakeholders, including patients and regulatory agencies, that the metric of quality is a complex process in which the simplification does not pay off. Indeed, the assessment of surgeon’s competency for high-risk procedures should be based on composite metrics, among which certainly the case volume together with the hospital requirements play a role. At the same time, surgeon credentialing should reflect real-world practice data rather than arbitrary benchmarks [51, 52]. A more comprehensive quality measure would come by the diffusion of the new QPIs recently proposed by the IHPBA that being a summary of composite outcomes approved by an international committee should give back a more reliable quality tool [45]. What is still missing in all these types of outcome measures and quality metrics proposed, is the metric around the patient. The age, the performance status, and the presence of comorbidity cannot be neglected as factors associated with the outcome. The use of raw indices (morbidity and mortality rates) cannot allow a true comparison among hospitals, even though in large numbers one could argue that these patient’s factors might be considered well-distributed. However, in the absence of detailed data, this assumption remains a speculation.

This study has several limitations. First, the scarcity of available data from PNE does not allow further analyses, and interpretations. Factors affecting mortality are not reported and, as said, hospital volume acts just as a proxy measure of quality. Second, data from a single year (2022) may not be representative and may not be generalized as a global perspective of the Italian experience in liver surgery. However, this nationwide study on a very large population offers a snapshot of the current trends of outcome after hepatectomy in Italy that may serve as a basis for further considerations and improvements.

In conclusion, this study showed that the centralization process with the hub-and-spoke model for liver surgery in 2022 in Italy was mostly disregarded. Approximately 41% of resections were centralized in higher volume centers with expected decreased mortality. The threshold of 20 cases per year is confirmed to be the minimum case volume. Further studies are required to better detail the factors associated with failure to rescue, and mortality and then, to better detail what is complexity in liver surgery and which centers should be entitled and qualified to perform hepatobiliary surgery.