Introduction

Colorectal cancer (CRC) is the 3rd most common malignancy and the 2nd most deadly cancer worldwide [1]. It is highly prevalent in developed countries but has started to show an increasing trend in China, partially attributed to shifts toward a high-fat, low-fiber diet [2]. CRC is prone to distant metastases, affecting over 50% of patients, with the liver being the primary site in approximately 70% of cases [3, 4]. Therapeutic options for colorectal liver metastases (CRLM) include hepatectomy, chemotherapy, radiotherapy, hepatic artery embolization, and thermal ablation, such as microwave coagulation therapy, radiofrequency ablation [5]. Currently, liver resection (LR) is acknowledged as the most effective treatment for CRLM patients, which can offer prolonged survival and, in selected cases, a chance of cure [6]. Increasing effectiveness of chemotherapy regimens, advances in surgical techniques, and improvements in perioperative patient management have expanded the boundaries of resectability [7, 8]. The current consensus proposes that a disease should be considered technically resectable as long as complete macroscopic resection is feasible while maintaining at least a 30% future liver remnant [9, 10]. Nevertheless, not all technically resectable patients experience a survival benefit from surgery, with 3-year recurrence rates reaching 60–70% [11,12,13]. The earlier the recurrence, the worse the prognosis, but the definition of early recurrence(ER) varies from 6 to 24 months [14,15,16].

Therefore, this meta-analysis aims to elucidate prognostic factors associated with ER in CRLM patients undergoing LR. Subsequently, our objective is to identify individuals with high risk of ER, who might benefit from closer surveillance and appropriate salvage therapy.

Materials and methods

Protocol and reporting

The protocol for this study was registered on PROSPERO (International Prospective Register of Systematic Reviews, www.crd.york.ac.uk/prospero) with the registration number CRD42023444091. This review was carried out according to the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines [17]. The PRISMA checklist is available in Supplementary Table S1.

Data sources and search strategy

All potentially eligible publications were retrieved from PubMed, Embase, Cochrane, and Web of Science from database inception until July 14, 2023. The search, employing the keywords “colorectal liver metastases”, “surgery”, “early”, and “recurrence”, was carried out by two investigators (YT, SFW). Supplementary File 1 included detailed information on the search strategy. Additionally, the bibliographies of included articles and relevant reviews were manually scrutinized to identify additional research and explore potentially relevant studies.

Eligibility criteria and study selection

Subjects were eligible for inclusion if the following criteria were met: (1) Prospective or retrospective studies including patients with CRLM who received liver resection; (2) Articles presenting ER rates categorized by a prognostic factor; (3) Articles reporting a relative ratio (RR) or an odds ratio (OR) (with a 95% confidence interval [CI]) or offering adequate data for RR/OR estimation; (4) No language restrictions.

Studies were excluded according to the following criteria: (1) Articles on palliative surgery; (2) Articles without sufficient data for analysis; (3) Experimental animal studies; (4) Reviews, commentaries, conference proceedings, letters, case reports, editorials, and meta-analysis.

Article screening and study selection were independently performed by two reviewers (SFW, YQW). In instances of discordance, resolution was achieved through collaborative deliberation within the research team, culminating in a final consensus.

Data extraction

The following data were extracted from each included study, and missing data were noted: (1) first author, publication year, country, period of recruitment, study type, patient count, follow-up period, overall recurrence rate, ER definition, ER rate, 5-year OS in the ER group, and inclusion/exclusion criteria (Table 1); (2) Prognostic factors, including patient characteristics (continuous variables: age, carcinoembryonic antigen [CEA], carbohydrate antigen 19–9 [CA199], binary variables: gender), primary tumor characteristics (binary variables: tumor differentiation [poor vs moderate/good], lymph node metastases [LNM], tumor stage [T3-4 vs T1-2], tumor location [rectum vs colon]), liver metastases characteristics (binary variables: number [more vs less], diameter [> 5 cm vs ≤ 5 cm], synchronous metastases, bilobar distribution, extrahepatic metastases), and therapeutic factors (binary variables: laparoscopic resection,simultaneous resection, major hepatectomy,surgical margins [positive vs negative],preoperative chemotherapy, postoperative chemotherapy,blood transfusion,postoperative complications) and clinical risk score [CRS, binary, > 2 vs ≤ 2]; (3) RRs or ORs and corresponding 95% CIs for association between each prognostic factor and ER.

Table 1 Included study characteristics

Continuous variables were summarized using median and interquartile range values, while categorical variables were expressed as counts and percentages. In cases where RR was unavailable, we either convert OR to RR or employed the events and patients counts in both exposed and non-exposed groups to calculate RR. By using standardized forms, two authors (YT, YQW) independently extracted the data from each eligible study. Disagreements were resolved by consensus or discussion with the third person (NYW).

Risk-of-bias assessment

To evaluate the risk-of-bias (RoB) at the study level, the Quality in Prognostic Factor Studies (QUIPS) tool was employed. This tool has six domains, with each domain assigned a RoB rating categorized as high, moderate, or low [38]. Studies were deemed to have low RoB if all domains were rated as low RoB or only one domain scored moderate RoB. Conversely, studies were classified as high RoB if at least one domain scored high RoB or if three or more domains scored moderate RoB. The remaining studies were attributed a moderate RoB rating.

Statistical analysis

The primary outcomes of the study focused on the RRs depicting the association between ER and prognostic factors. When available, preference was given to the most adjusted effect estimate, specifically opting for the Cox multivariable coefficient over the univariable estimate. Subsequently, all pooled outcomes were derived utilizing a random-effects model (Mantel–Haenszel method). The magnitude of the summary effects was graphically represented through forest plots.

Between-study heterogeneity was assessed utilizing the I2 statistical estimate, with an I2 value > 50% regarded as severe heterogeneity [39]. Consequently, subgroup analyses were executed to identify potential sources of heterogeneity. Assessment of reporting bias was undertaken through funnel plots and the Egger's test, specifically for prognostic factors identified in over 10 studies. A P value below 0.1 was deemed indicative of significant publication bias, prompting the execution of Trim and Fill analysis in such instances. Additionally, sensitivity analysis was performed by switching to fixed-effects models to test the robustness of the conclusions.

All statistical analyses were conducted using Review Manager software (Version 5.4) and Stata software (version 14.1). A significant two-way P value for comparison was defined as P < 0.05.

Evidence strength assessment

The grading of evidence strength for the identified associations in observational studies was based on the following criteria: Egger's test P value > 0.1, a cumulative population > 1000, and I2 < 50%. The association attained Class I (high-quality) evidence status when all three conditions were satisfied simultaneously. If two out of these three conditions were met, the association was categorized as Class II (moderate-quality) evidence. Furthermore, class III (moderate-quality) evidence was conferred upon an association when only one of the three conditions was fulfilled. Conversely, the absence of satisfaction for all of these three conditions designated an association as Class IV (low-quality) evidence [40].

Result

Study selection

Our initial search strategy identified a total of 3157 pertinent studies, of which 1064 were removed due to duplication. Following the preliminary screening of titles and abstracts, 1883 abstracts were excluded because they did not meet the inclusion criteria. Furthermore, 12 reports were inaccessible, and 198 potentially relevant articles underwent a thorough review in full text. Ultimately, 174 articles were excluded for diverse reasons and 24 selected studies were included [12, 14,15,16, 18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37], as illustrated in the PRISMA flow diagram (Fig. 1).

Fig. 1
figure 1

PRISMA flowchart version 2020

Study characteristics

This review included 24 studies, comprising 12,705 patients who underwent LR for CRLM, with a comprehensive summary presented in Table 1. Among these, twelve studies adopted a prospective cohort design [12, 18, 21, 22, 28,29,30,31,32, 34,35,36], with the remaining adopting a retrospective cohort approach [14,15,16, 19, 20, 23,24,25,26,27, 33, 37]. The publication years of the studies spanned from 2007 to 2023. In terms of geographical distribution, 15 studies originated from Asia [14,15,16, 19, 20, 23, 24, 26, 27, 29,30,31, 33, 35, 37], 7 from Europe [12, 18, 22, 25, 28, 32, 34], 1 from Australia [36], and the other one from the United States [21]. The recruitment period ranged from 1986 to 2020, with the median duration of follow-up ranging varying from 22 to 86.3 months.

Definition of early recurrence

The definition of ER exhibited variation among the studies. Twelve studies defined ER as six-month following surgery [15, 23, 24, 26, 28,29,30,31,32, 34,35,36], while five studies utilized 12 months as the cutoff for ER [16, 21, 25, 27, 37]. The ER rate ranged from 8.6% to 54.3%, and the pooled ER rate was 30.2% (95% CI, 24.1%–36.4%), indicating substantial heterogeneity across the studies (I2 = 98%, P < 0.001) (Fig. S1). Additionally, Viganò et al. applied a 3-month threshold to define very early recurrence (VER), with 11.6% of patients experiencing VER [12]. The overall CRLM recurrence rate was reported to be between 40.7% and 78.8%. For those patients who underwent early recurrence (ER), the 5-year overall survival (OS) spanned from 11.1% to 45.0% (Table 1).

Prognostic factors

A total of 22 potential prognostic factors were identified before the study, categorized into patient-related factors, primary tumor factors, liver metastasis factors, and treatment-related factors. The characteristics of these prognostic factors in the ER group were presented in Tables 2, 3, 4 and 5. As shown, the median age among patients with ER ranged from 55 to 66 years, and the proportion of males varied between 44.6% and 67.3%. Regarding primary tumor characteristics, poor tumor differentiation in the ER group ranged from 5.6% to 66.7%, and 42.3% to 55.7% of patients had LNM. As factors of liver metastases, bilobar distribution was noted in 26.7% to 74.2% of the ER group, and Jung et al. reported that up to 93.3% had synchronous metastases [24]. As reported, 11.5% to 63.3% of patients with ER had positive surgical margins, and approximately 30% to 80% of patients received chemotherapy.

Table 2 Prognostic factors of patient characteristics in ER group
Table 3 Prognostic factors of primary tumor characteristics in ER group
Table 4 Prognostic factors of liver metastases characteristics in ER group
Table 5 Prognostic factors of therapy characteristics in ER group

Assessment on risk-of-bias

The results of the RoB assessment were presented in Table 6. Employing the QUIPS tool and the criteria described above, 17 studies received a classification of low overall RoB, whereas 5 studies were assigned a moderate RoB rating. Notably, two studies were excluded due to high RoB at this stage [33, 34].

Table 6 Risk of bias assessment using QUIPS tool Risk of bias assessment using QUIPS tool

Meta-analysis for prognostic factors

A total of 21 studies, involving 5791 patients, met the eligibility criteria for the meta-analysis. One study was omitted from consideration due to its utilization of VER (3 months) as the outcome, and two additional studies were excluded on account of high RoB.

All results graphically depicted using forest plots, illustrated in Figs. 2, 3, 4 and 5. Patient- related factors such as age and male gender exhibited no correlation with ER. Elevated concentrations of preoperative CEA (RR, 1.56; 95% CI, 1.19–2.04; I2 = 81%) and CA199 (RR, 1.48; 95% CI, 1.20–1.81; I2 = 36%) were identified as potential risk factors for ER (Fig. 2). Besides, primary tumor factors associated with an increased hazard of ER encompassed poor differentiation (RR, 1.13; 95% CI, 1.03–1.25; I2 = 0%) and LNM (RR, 1.31; 95% CI, 1.17–1.48; I2 = 47%) (Fig. 3). Concerning liver metastases, an elevated risk of ER was associated with factors such as a higher number of metastases (RR, 1.46; 95% CI, 1.26–1.68; I2 = 57%), larger metastases (RR, 1.18; 95% CI, 1.04–1.34; I2 = 29%), and bilobar distribution (RR, 1.37; 95% CI, 1.21–1.55; I2 = 40%) (Fig. 4). Regarding therapeutic factors, major hepatectomy (RR, 1.16; 95% CI, 1.07–1.25; I2 = 0%), positive surgical margins (RR, 1.33; 95% CI, 1.20–1.48; I2 = 34%), and postoperative complications (RR, 1.28; 95% CI, 1.13–1.44; I2 = 30%) have been recognized as risk factors associated with ER (Fig. 5a and b).

Fig. 2
figure 2

Meta-analyses of association between patient characteristics and ER

Fig. 3
figure 3

Meta-analyses of association between primary tumor factors and ER

Fig. 4
figure 4

Meta-analyses of association between liver metastases factors and ER

Fig. 5
figure 5

a Meta-analyses of association between therapeutic factors and ER. b Meta-analyses of association between therapeutic factors and ER

However, the stage and location of the primary tumor, synchronous metastases, extrahepatic metastases, laparoscopic surgery, preoperative or postoperative chemotherapy, and blood transfusion were not found to be statistically associated with ER. All the above results are presented in Table 7.

Table 7 Summary of meta-analysis results and evidence quality

The CRS ranges from 0 to 5 points, with 1 point assigned for each of the following: LNM of the primary tumor, the interval < 1 year from primary tumor resection to the detection of liver metastasis, preoperative CEA > 200 ng/ml, more than one liver tumor, and largest tumor > 5 cm [41]. The combination of RRs in three studies showed that CRS > 2 had the potential to increase the risk of ER (RR, 1.44; 95% CI, 1.17–1.77; I2 = 0%; Egger’s P value = 0.232) (Fig. S2).

Reporting bias

Reporting bias was evaluated by funnel plot and Egger's test. Our results comparing the ER rates between groups with and without LNM of primary tumor revealed an asymmetric funnel, with a P value of 0.035 for the Egger's test (Fig. 6). By filling 4 studies using the Trim and Fill method, the recalculated pooled RR was 1.20, 95% CI (1.06, 1.37) (Fig. 7), which was not significantly changed from the initial estimate (RR, 1.31; 95% CI, 1.17–1.48). Therefore, the presence of publication bias has little significant effect on the overall finding.

Fig. 6
figure 6

Funnel plot of ER in groups with and without LNM in the primary tumor

Fig. 7
figure 7

Trim and Fill analysis of the effect of LNM on ER. The squares represent the adjusted studies

Study quality

Using the rating rules mentioned above, no evidence was rated as Class IV. High-quality (Class I) evidence showed that poor differentiation of CRC, larger and bilobar-distributed liver metastases, major hepatectomy, positive surgical margins, and postoperative complications were factors linked to an elevated hazard of ER. Among other meaningful prognostic factors, elevated levels of CEA and CA199, LNM, and a higher number of liver metastases were rated as Class II (Table 7).

Subgroup analyses

Among the prognostic factors analyzed, an elevated number of metastases was reported of having high heterogeneity (I2 > 50%). Subgroup analyses were conducted, employing diverse thresholds for defining an increased number of metastases, categorized as multiple, > 3, and > 4 metastases. As shown in Fig. 8, all subgroup analyses showed significant differences in the ER rate between cases with more and fewer metastases. Notably, the subgroup of multiple metastases showed great heterogeneity (I2 = 79%), whereas the other two groups did not. Therefore, we found that the divergent definitions of “multiple” across different articles constituted the primary source of heterogeneity.

Fig. 8
figure 8

Subgroup analyses of more metastases

Sensitivity analyses

We performed sensitivity analysis by switching to fixed-effects models on all variables. The results were consistent across all variables except in the case of preoperative chemotherapy, wherein a fixed-effects model revealed an association with diminished risk of ER (RR, 1.11; 95% CI, 1.02–1.21; I2 = 58%) (Fig. S3). However, this result was deemed unreliable and excluded.

Discussion

This is the first-ever published meta-analysis summarizing prognostic factors associated with ER following LR for CRLM. Specifically, most of the studies used a postoperative interval of 6 months to define ER, which is earlier than the ER definition for other tumors in the liver. For example, hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (iCCA) often use 1 year or 2 years as a cutoff value [42,43,44]. This review reveals that following LR for CRLM, the occurrence of ER is approximated at 30.2% (95% CI, 24.1%–36.4%). However, evidence shows that the 5-year OS rate after ER of CRLM ranges from 11.1% to 45%, while that of iCCA is only 8–11.6% [43,44,45]. This suggests that postoperative ER of CRLM is more common but the prognosis is relatively better, compared with other intrahepatic tumors. Furthermore, with the development of surgical techniques and minimally invasive local treatment strategies, CRLM patient is more likely to undergo re-resection and/or ablation after recurrence, and the 5-year OS after repeat hepatectomy is as high as 50% [46].

After pooling data from 21 studies involving 5791 patients, this meta-analysis identified ten prognostic factors that could play a crucial role in ER across four domains: patient-related factors, primary tumor characteristics, liver metastases attributes, and therapeutic factors.

As patient-related factors, elevated levels of preoperative CEA and CA199 were identified as potential risk factors for ER. However, the evidence was classified as level II due to high heterogeneity, which was attributed to varying cutoff values in different studies. Studies indicated that postoperative serum molecular markers had stronger predictive potential. Even within the normal limits, higher levels of postoperative CA199 were effective in predicting ER [15, 47, 48].

For primary tumor factors, the analysis of aggregated RR values indicated a heightened risk of ER associated with poor differentiation and LNM, both indicative of a more advanced tumor stage. The impact of LNM on ER was categorized as class II evidence due to reporting bias, but the Trim and Fill analysis indicated that the presence of publication bias had no substantial influence on the overall findings. Besides, previous investigations have validated that individuals with LNM in primary tumors exhibit an adverse OS and progression-free survival (PFS) [49,50,51]. This reveals the significance of pathological characteristics of primary colorectal tumors in the prognosis assessment following LR. Particularly for metachronous liver metastases, defined as liver metastases discovered after primary tumor surgery, these primary tumor factors may assist surgeons in identifying patients who would derive greater benefits from LR [52, 53].

Characteristics of liver metastases, such as increased size, number, and bilobar distribution, have been recognized as potential risk factors associated with ER. Tumor size and number were frequently treated as dichotomous variables, with varying cutoff values, leading to heterogeneity across studies. However, these two variables can be used to calculate the tumor burden score (TBS) [TBS2 = (maximum tumor diameter)2 + (number of liver lesions)2], and the predictive efficacy of this index has been proved to exhibit higher specificity and sensitivity compared to relying solely on tumor size and number in patients with CRLM [54, 55]. Preoperative radiological imaging can provide first indications about the risk of ER, especially gadoxetate disodium-enhanced magnetic resonance imaging (EOB-MRI) can provide greater sensitivity [56]. Furthermore, the implementation of intraoperative hepatic ultrasonography (IOUS) has been reported to identify more occult hepatic lesions missed by preoperative imaging, thereby potentially mitigating the risk of ER [57, 58].

In terms of therapeutic factors, major hepatectomy, positive surgical margins, and postoperative complications have been identified to increase the risk of ER. But no statistically significant difference was observed in the impact on ER between laparoscopic and open hepatectomy, suggesting the viability of the laparoscopic approach. Major hepatectomy is traditionally defined as the resection of three or more liver segments [59]. Indeed, the presence of more and larger, and bilobar-distributed metastases mentioned above not only represents worse tumor behavior but also increases the probability of occult intrahepatic spread and affects the radicality of the treatment. Consequently, to ensure the complete removal of all lesions, more extensive liver resection may be performed, leaving little room for salvageability [60]. Besides, R1 resection has been identified as a risk factor, implying that overlooked lesions and residual microscopic tumor cells left after surgery contribute to ER. However, it remains uncertain whether recurrence at the surgical margin or the emergence of new metastases are the primary contributors to ER in patients with positive surgical margins. Severe postoperative complications could potentially extend the immunosuppression induced by major surgeries and delay the initiation of adjuvant chemotherapy [14].

The current study failed to demonstrate the benefit of preoperative or postoperative chemotherapy in preventing ER. The reason is that, on the one hand, patients with more advanced tumors and R1 resection have a greater tendency to receive adjuvant therapy, on the other hand, the effects of different regimens and cycles of adjuvant treatment are combined. But a study revealed that > 1 chemotherapy line and progression of disease during last-line chemotherapy, were identified as independent predictors of ER, suggesting that the response to chemotherapy was more important than the chemotherapy itself [22]. Therefore, the cooperation between surgeons and oncologists is essential, especially when aggressive indications are present.

In addition to individual prognostic factors, the CRS combines tumor markers, primary tumor factors, and liver metastasis factors and was widely adopted for the prognosis of CRLM patients. In this study, class I evidence demonstrated that a CRS > 2 increase the risk of ER. Furthermore, several studies have utilized multiple prognostic factors to develop nomograms to better predict ER of CRLM [15, 19, 20]. However, the generalizability of these nomograms requires further research, and there is an urgent need to develop a universal ER risk prediction score.

When liver recurrence occurs, salvage resection had the potential to extend long-term survival [22]. However, it was noteworthy that the secondary resection rate was notably diminished in individuals with ER compared to those experiencing late recurrence, due to worsened condition status and potential surgical complications [26]. Additionally, studies indicated that the survival benefit associated with salvage resection disappeared within the subgroup of patients exhibiting more than two risk factors for ER [29]. For these patients, salvage treatment may accelerate disease progression and postoperative complications, thereby mitigating survival benefits rather than effectively controlling local recurrence. Therefore, the indications for salvage resection in patients with ER should be strictly controlled.

There are several limitations in this review. Initially, all the included studies are non-randomized control trials, introducing the possibility of confounding bias. Moreover, the combination of various ER definitions, ranging from 6 to 24 months, and amalgamation of diverse cutoff values for prognostic factors contributed to the substantial heterogeneity. In addition, only RRs were extracted and combined, not the hazard ratios (HRs), which were less persuasive due to the absence of time-related data. Despite these limitations, pooling evidence from available observational studies enabled us to synthesize relevant and generalizable prognostic factors.

Conclusion

This review offers a consolidated summary of the prognostic factors associated with ER subsequent to LR for CRLM. These findings have the potential to enhance the efficacy of surveillance strategies, refine prognostic assessments, and guide judicious treatment decisions for CRLM patients with high risk of ER. Additionally, it is essential to undertake well-designed prospective investigations to examine additional prognostic factors and develop salvage therapeutic approaches for ER of CRLM.