Background

Hepatocellular carcinoma (HCC) is the third most common cause of cancer-related death globally [1]. Unless HCC is diagnosed at an early stage when liver transplantation, hepatic resection, and radiofrequency ablation are feasible, a poor prognosis is expected due to the limited treatment options [2]. Although several treatment modalities have been applied to advanced HCC, [3, 4] only trans-arterial chemoembolization (TACE) [510] and Sorafenib [11] exhibit survival benefit. Accordingly, TACE and Sorafenib are recommended for HCC in the intermediate and advanced Barcelona Clinic of Liver Cancer (BCLC) stages, respectively [12, 13].

Alpha-fetoprotein (AFP) was first described about 40 years ago and is proposed as a screening and diagnostic tool for HCC [1419]. Recently, the national comprehensive cancer network guidelines also proposed AFP as a alternative tool for diagnosing HCC [20]. Though there is no definitive evidence to support an absolute value of AFP cut-offs [21, 22]. Moreover, several studies have reported that changes in AFP levels may predict treatment outcome [2325]. A recent study showed a correlation between AFP response and radiologic response, time-to-progression (TTP), progression-free survival (PFS), and overall survival (OS) in patients treated with loco-regional therapies [26, 27]. In contrast, controversies regarding the diagnostic role of des gamma carboxy prothrombin (DCP) remain. However, the clinical significance of DCP is emphasized in Asia [28]. Recently, its clinical usefulness was emphasized in a different ethnic group [17, 29, 30].

The primary aim of this study was to determine the clinical utility of AFP and DCP as predictors of treatment outcome in patients with HCC undergoing TACE.

Patients and methods

Patients

Between January 2003 and December 2005, a total of 270 patients underwent TACE as an initial treatment modality at Severance Hospital, Yonsei University College of Medicine, Seoul, Korea. Of these, we excluded 104 with an AFP level < 20 ng/mL [31] or DCP level < 20 mAU/mL, 47 without baseline or follow-up tumor markers, and four with curative resection after TACE. Finally, a total of 115 patients were recruited for statistical analysis and followed-up until December, 2009. The study protocol was approved by the independent institutional review board of our institute.

Diagnosis of HCC

HCC was diagnosed based on the guidelines of American Association for the Study of Liver Diseases [32]. Briefly, patients were diagnosed with HCC if they had a tumor with a maximum diameter >2 cm, typical features of HCC on dynamic computed tomography (CT) (defined as hyperattenuation in the arterial phase and early washout in the portal phase), and an AFP >200 ng/ml [32]. If the maximum diameter of tumor was 1–2 cm, dynamic CT and magnetic resonance imaging (MRI) were performed. HCC was diagnosed if coincidental typical features of HCC were noted. If the tumor did not satisfy the above criteria, a biopsy was performed. When the tumor was <1 cm, ultrasonography was repeated after 3 months.

Treatment and response-evaluation protocols

After diagnosis of HCC was confirmed, angiography was performed to identify the major arterial supply to HCC. TACE was conducted using a solution of 20–50 mg doxorubicin hydrochloride in a 5–20 ml mixed solution of lipiodol and contrast agent. Subsequently, embolization was performed using gelatin sponge particles after TACE. After TACE, occlusion of target vessels and absence of additional tumor blood supply was confirmed.

Radiological response evaluation using CT or MRI and tumor marker response were evaluated 3–4 weeks after TACE. If viable tumor remained at CT or MRI, TACE was repeated. After 2–3 consecutive TACE sessions, at the time of best radiologic response, radiological and tumor marker responses were evaluated. If compact lipiodolization was noted and no viable HCC was identified, than radiologic evaluation and tumor marker measurement were done within 3 months to assess HCC recurrence. If progressive disease was identified, other treatment modalities were considered.

Tumor marker response evaluation

Baseline AFP and DCP levels were determined before TACE. Follow up tumor marker were checked every 3 ~ 4 weeks after TACE concurrent with CT or MRI. To analyze tumor marker response, we determined AFP and DCP levels when the best radiologic response was reached after 2–3 consecutive sessions of TACE. Tumor marker response evaluation was performed when complete remission is observed after consecutive session of TACE in most of subjects, but tumor marker response also analyzed within subject who did not get complete remission. In these patients, tumor marker checked just before last TACE concluded as progressive diseases by mRECIST criteria was used. AFP/DCP levels were measured using a microparticle enzyme immunoassay (AFP, Bayer, Leverkusen, Germany; DCP, Sanko Junyaku Co., Tokyo, Japan). AFP response was defined as a reduction in AFP level of more than 50% from baseline, according to a previous study [27]. Similarly, DCP response was defined as a reduction in DCP level of more than 50% from baseline.

Radiologic response evaluation

Radiologic tumor response was evaluated using the WHO and modified RECIST criteria by CT or MRI. For the WHO criteria, minimum cross product was compared to baseline cross product for calculating change in size. Complete response (CR) was defined as complete disappearance; partial response (PR) was defined as 50% decrease; progressive disease (PD) was defined as a more than 25% increase in the cross product from maximum decrease. All other findings were defined as stable disease (SD). In the mRECIST criteria, length of the major axis of a viable tumor was compared to baseline for calculating change in size. CR was taken as the absence of any enhancing tissue, PR as a 30% decrease in enhancing tissue, and SD as a less than 20% decrease.

Calculation of progression-free and overall survival

The progression-free survival (PFS) time is defined as the time from the TACE start date to the first observation of tumor progression confirmed by mRECIST criteria or death due to any cause. To account for the fact that some patients dropped out for reasons other than tumor progression, PFS for these patients was censored at the time of last follow-up. The overall survival (OS) was calculated from the time of study entry to death. If the patient had no recorded date of death or was still alive at the time of the analysis, OS for these patients was censored at the date that the patient was last seen alive.

Statistical analysis

An independent t-test or Mann–Whitney U test was used to compare continuous variables, and a chi-square or Fisher’s exact test was used to compare categorical variables between two groups (AFP responder vs. AFP non-responder and DCP responder vs. DCP non-responder). AFP and DCP response together with conventional clinical variables at the time of entry to the study were used to identify independent factors that influence PFS and OS via the Cox proportional hazards model. Hazard ratio (HR) and the corresponding 95% confidence intervals (CI) were also indicated. Age, sex, etiology, anti-viral therapy, HBV-DNA positivity, Child Pugh classification, liver cirrhosis, GGT, baseline AFP and DCP, AFP response, DCP responce, BCLC stage, number of tumor, size of tumor, WHO response and mRECIST response were included in univariate regression test on PFS and OS. Gender, Child-Pugh Classification, GGT, AFP, DCP, number and size of tumor, BCLC stage were included in multivariate regression test on PFS. Gender, Cirrhosis, GGT, AFP response, Baseline DCP, number and size of tumor were included on OS analysis. PFS and OS were calculated using the Kaplan–Meier method and compared using a log rank test. Statistical analyses were performed using SPSS ver. 11.0 software (SPSS Inc., Chicago, Ill).

Results

Baseline characteristics

Baseline characteristics of the 115 patients are described in Table 1. The median age was 59 years and male gender predominated (n = 81, 70.4%). Hepatitis B virus was the most common etiology of HCC (n = 80, 69.6%). The median baseline AFP and DCP were 296.7 ng/mL and 231 mAU/mL, respectively. Median follow up was 26 (range, 1.2–75.9) months.

Table 1 Baseline characteristics

Tumor marker responses

After TACE, 91 (79.1%) patients showed AFP response and 24 (20.9%) were non-responders (Table 1). Baseline characteristics were similar between AFP responders and non-responders, except for the significantly higher proportion of Child–Pugh class A in AFP responders than non-responders (97.8 vs. 79.2%; P = 0.004). In contrast, 77 (66.9%) patients showed a DCP response, and 38 (33.1%) did not (Table 1). DCP non-responders showed a significantly higher proportion of solitary HCC (65.8 vs. 45.4%; P = 0.048).

Objective responses after TACE and correlation with tumor marker responses

Table 2 shows objective responses after TACE using WHO and mRECIST criteria and their correlations with AFP and DCP responses. Neither AFP nor DCP responses showed a significant correlation with radiological response evaluated using the WHO criteria. In contrast, AFP, but not DCP, response was significantly correlated with radiological response evaluated using the mRECIST criteria (P = 0.045).

Table 2 Objective responses after TACE and correlation with tumor marker responses

Independent predictors for PFS and OS

Univariate and multivariate analyses were used to identify independent predictors of PFS and OS (Table 3). Regarding PFS prediction, gamma glutamyltranspeptidase (GGT) (HR, 1.003; 95% CI 1.001–1.004; P = 0.012) and baseline AFP (HR 1.000; 95% CI 1.000–1.001; P = 0.049) were identified as independent predictors. Regarding OS prediction, male gender (HR, 2.119; 95% CI, 1.040–4.320; P = 0.039), the presence of cirrhosis (HR, 2.319; 95% CI, 1.281–4.201; P = 0.005), baseline DCP level (HR, 1.000; 95% CI, 1.000–1.001; P = 0.028), number of tumors(HR, 1.443; 95% CI, 1.035-2.011; P = 0.030) and AFP response (HR, 0.276; 95% CI; 0.147–0.518; P < 0.001) were independent predictors of OS. The Kaplan–Meier curves of PFS and OS according to AFP and DCP responses are indicated in Figures 1 and 2.

Table 3 Independent predictors for progression-free and overall survival
Figure 1
figure 1

Progression-free (PFS) and overall survival (OS) curves of AFP responder and non-responder. PFS was similar between AFP responders and non-responders (8.0 vs. 5.5 months; log rank test, P = 0.150; (A)) whereas OS were significantly better in AFP responder than non-responder. (34.9 vs. 13.2 months; log rank test, P = 0.020; (B)).

Figure 2
figure 2

Progression-free survival (PFS) and overall survival (OS) curves of DCP responder and non-responder. Both PFS and OS were not significantly different between DCP responder and non-responder (7.8 vs.7.3 months; log rank test, P = 0.755 for PFS (A) and 34.9 vs. 23.2 months; log rank test, P = 0.203 for OS (B)).

DCP response within higher DCP levels

DCP response did not show statistical significance for prediction of PFS or OS after TACE. However, when we analyzed a subgroup of patients with a baseline DCP level > 200 mAU/ml [31], a significant correlation between DCP response and OS emerged (median 26.7 months in DCP responders vs. 7.5 months in DCP non-responders; log rank test P = 0.002). Thus, application of DCP response to prediction of treatment response and the corresponding optimal cutoff values of baseline DCP level to enhance the treatment-outcome predictability of DCP response should be further investigated.

Combined serological endpoint

Sub-analysis was done to find out whether combined serological endpoint (AFP and/or DCP responses) would improve it prognostication [33]. AFP and/or DCP responders were stratified into combined tumor marker responders (cTM responders, n = 99) whereas subjects without AFP and DCP responses were stratified into cTM non-responders (n =16). PFS was similar between cTM responders and non-responders (15.1 vs. 10.5 months; log rank test, P = 0.259) whereas OS was significantly longer in cTM responders than non-responders (39.0 vs. 21.5 months; log rank test, P = 0.011). In addition, cTM response was selected as one of independent predictors of OS (HR 0.312; 95% CI 0.150-0.649; P = 0.002), together with tumor size and cirrhosis. However, cTM response did not predict PFS independently.

Discordance between cTM response and radiologic response

We divided the study population according to cTM and radiological responses. Definition of cTM response described at combined serological endpoint. Similarly, radiologic responders were defined as patients with CR/PR/SD and radiologic non-responders as those with PD using the mRECIST criteria. Using these definitions, eight (6.9%) cTM responders experienced a radiologic non-response and 12 (10.4%) cTM non-responders experienced radiologic responses; these were categorized as discordant.

Baseline albumin level was identified as an independent predictor of discordance between cTM response and radiologic response (HR, 2.747; 95% CI, 1.266–2.963; P = 0.011). Analysis of the discordant group (n = 20) revealed that the PFS and OS of eight cTM responders with radiologic non-response was not statistically different from that of 12 cTM non-responders with radiologic response (5.1 vs. 5.1 months; P = 0.828 for PFS and 33.8 vs. 7.5 months; P = 0.354 for OS) [Additional file 1].

Discussion

Tumor response evaluation using CT or MRI after systemic chemotherapy or loco-regional treatment is widely used to decide whether to treat further or change treatment strategies. However, high cost, radiation hazard, and associated cancer risks are drawbacks to these response evaluation modalities, [34] whereas tumor marker evaluation after anti-cancer treatment is rapid, easier to estimate, and less expensive. Thus, the clinical implications of tumor response evaluation using tumor markers have been continuously investigated [3538].

This study also showed that AFP response can be an independent predictor of OS in patients undergoing TACE as the first treatment modality, whereas radiologic response was not significant in predicting PFS and OS. Several previous studies have also proposed the clinical usefulness of AFP response. Riaz et al. [27] proposed AFP response after loco-regional therapy as an ancillary method of assessing tumor response and survival, as well as an early objective screening tool for progression by imaging. Furthermore, the supplementary role of AFP response for tumor response evaluation and prediction of better survival and extrahepatic metastasis in patients undergoing hepatic arterial infusional chemotherapy, localized concurrent chemoradiation therapy, and anti-angiogenic treatment such as Sorafenib and Bevacizumab has been proposed [26, 36, 37]. However, to our disappointment, AFP response didn’t show its novel value of prediction in PFS of HCC patients. It might be caused by that all subjects did not have the same follow-up period and timing of assessment scan, furthermore definition of PFS might be subject to bias. So it could be a limitation of our study.

The baseline AFP level in our study also showed statistical significance for predicting PFS in a multivariate analysis (P = 0.035). Similarly, the significance of the prognostic role of baseline AFP level has been accepted in previous studies and several staging systems [39, 40]. In contrast, baseline DCP level was significantly associated with OS in our study. Several other studies investigated the usefulness of baseline DCP level, and demonstrated that higher baseline DCP level was significantly correlated with poor prognosis in HBV-related HCC and with a higher risk of recurrence after curative treatment [35, 41]. Although the reasons why HCC with a higher baseline DCP level is related to poor prognosis remain uncertain, it may in part be due to the fact that the higher baseline DCP level is significantly associated with the presence of vascular invasion, intra-hepatic metastasis, tumor size, Tumor-Node-Metastasis (TNM) stage, and a high frequency of tumor recurrence [42].

When we consider that only viable HCC can produce AFP and that the mRECIST criteria were developed to evaluate the amount of viable HCC, it is not surprising that AFP response showed a significant correlation to tumor response evaluated by mRECIST criteria. Although DCP is also produced by viable HCC, DCP response was not correlated with mRECIST response. This may be because, in contrast to AFP, DCP can be produced in surrounding non-cancer tissues after being stimulated by HCC [42], which might have attenuated the correlation between DCP response and radiologic tumor response by the mRECIST criteria. This also supports our result that DCP response was not significant for predicting PFS or OS after TACE. Because the WHO criteria do not consider specific situations (such as necrosis and the viable portion of HCC) neither the AFP nor DCP responses were correlated with radiologic response evaluation by the WHO criteria.

We further analyzed patients showing discordant outcomes assessed by cTM response and radiologic response, because in this situation physicians must decide whether to maintain the current treatment modality or change it. Although baseline albumin level was identified as a predictor of discordance between cTM response and radiologic response, there was no statistical difference in OS between cTM responders with radiologic non-response and those with cTM non-response but radiologic response (33.8 vs. 7.5 months; P = 0.354). Because a longer OS was noted in patients with cTM response and radiologic non-response in spite of the lack of statistical significance and because the clinical significance of the cTM response might have been attenuated due to the small discordant sub-population, a larger study is needed to reveal the role of cTM response in these cases. In the same context, if the role of cTM response in discordant cases can be demonstrated, cTM non-responders who experience radiologic response may need to have their disease progression monitored closely.

Interestingly, baseline GGT level was identified as an independent prognostic factor of PFS, similar to a recent study that proposed that GGT level is an important prognostic factor in patients with BCLC-intermediate HCC treated with TACE [43]. Considering that GGT level is associated with oxidative stress and resistance to platinum drugs [44, 45], the prognostic value of GGT levels for discriminating subgroups with different risks of disease progression or mortality should be determined.

This study has limitations. First, this was a retrospective study with a relatively small sample size. Also, some study subjects were included in article published recently and analyzed about prognostic value of AFP and DCP by the same institute [46]. However, the aim of this study is clearly different from previous one focusing on the prognostic value of baseline AFP and DCP in all patients with treatment-naïve HCC. In the present study, we focused on the prognostic value of the responses of AFP and DCP by TACE. Second, we excluded patients with low baseline AFP or DCP levels before TACE. Our results might not apply to patients with low baseline AFP or DCP levels. However, because we simply investigated the prognostic values of AFP and DCP response simultaneously, we inevitably selected only patients with elevated AFP and DCP in spite of a potential selection bias. Third, because AFP can be confounded by hepatic necroinflammation, the predictability of the AFP response might have been influenced. AFP response could be poor in patients with persistently elevated ALT after TACE. Furthermore, patients with persistently elevated ALT are reportedly associated with poor OS due to more rapid impairment of the liver function. Physicians should be cautious in applying baseline AFP or AFP response to patients suffering from active hepatitis or to those without elevated baseline AFP and DCP levels. Finally, considering that TACE was introduced as a palliative treatment for intermediate-stage HCC, many HCC patients with BCLC stage A in our study who were indicated for curative treatment underwent TACE. For several reasons, these patients decide to take TACE. Some of them refused surgical resection and others are unable to surgical resection because of their medical condition such as old age(>70 yrs) and poor general conditions (ECOG PS ≥ 2 and/or cachexia). Furthermore, sub-segmental TACE using iodized oil and a gelatin sponge remains an important therapeutic option for patients with HCC in Japan [47]. This is not surprising, because sub-segmental TACE as a curative treatment modality has become widely accepted in Korea as well as in Japan.

Conclusions

The present study demonstrated the clinical importance of AFP response and baseline DCP level for predicting OS in treatment-naïve patients with HCC undergoing TACE as an initial treatment modality. In the future, a treatment algorithm incorporating tumor marker response and radiologic response, which compensate for each other’s drawbacks, should be established to more accurately predict treatment outcomes after TACE.