FormalPara Key Points

Since being introduced recently, lenvatinib (LEN) has been used for unresectable advanced hepatocellular carcinoma (HCC) in the clinical setting.

We found that thyroid dysfunction and appetite loss after the administration of LEN were independent factors associated with shorter progression-free survival.

Relative dose intensity (RDI) was also found to be an important influence on the effect of LEN.

1 Introduction

Hepatocellular carcinoma (HCC) is the second most common cause of cancer-related death worldwide. There were approximately 780,000 deaths from HCC around the world in 2018 [1]. In Japan, a nationwide surveillance program for HCC has been initiated, leading to the detection of 65% of all HCC cases at an early stage and facilitating curative surgical or locoregional treatment [2]. Although there is an initial positive response to surgery or locoregional therapy in patients with HCC, the recurrence rates after these treatments are high: 60–80% over 5 years [3]. Multiple tumor recurrence sometimes occurs during the natural course of the disease; HCC is difficult to cure in such cases and has a poor prognosis. The standard first-line treatment for intermediate HCC is transcatheter arterial chemoembolization (TACE), which is frequently performed in patients with HCC who are not eligible for curative treatment and in patients with multiple HCC recurrence [4]. Previous randomized clinical trials have reported that TACE prevents HCC recurrence for up to 6 months after curative treatment and improves survival rates [5, 6]. Despite these findings, some patients are unresponsive to TACE, and HCC progression or recurrence may occur rapidly [7,8,9].

Sorafenib is a multikinase inhibitor that inhibits the Raf–MEK–ERK signaling pathway, which in turn inhibits tumor cell proliferation and blocks vascular endothelial growth factor receptors, preventing neoangiogenesis [10]. Sorafenib, which has been shown to prolong overall survival in HCC patients, has been the only systemic therapy to be used as a first-line treatment for HCC in Asian cohorts for the last 10 years [11]. In the REFLECT trial, lenvatinib (LEN)—an inhibitor of vascular endothelial growth factor receptors 1–3, fibroblast growth factor receptors 1–4, platelet-derived growth factor receptor α, RET (rearranged during transfection), and stem cell factor receptor—was observed to be safe and noninferior to sorafenib in terms of overall survival, progression-free survival (PFS), and time to progression [12].

LEN has been applied in the clinical setting for unresectable advanced HCC as a first-line, second-line, or third-line treatment. As LEN is a novel agent, clinical studies on its effectiveness and prognostic factors remain limited [13,14,15,16]. The aim of this study was to evaluate and to determine factors associated with the PFS of patients with advanced HCC treated with LEN.

2 Methods

2.1 Study Design

This study used a retrospective, multicenter (six hospitals), observational design. A total of 79 consecutive patients with unresectable HCC who were treated with LEN from March 2018 to January 2019 were enrolled. However, only 77 patients were included in the final analysis because 2 patients were excluded for refusing computed tomography or magnetic resonance imaging. Although the inclusion and exclusion criteria for treatment with LEN were based on those used in the REFLECT trial [12], our retrospective study allowed us to include patients previously treated with one or two lines of systemic therapy, patients with a Child–Pugh score of B7, and patients on dialysis. Eligible patients had a measurable target lesion based on the Modified Response Evaluation Criteria in Solid Tumors (mRECIST) [17], a Barcelona Clinic Liver Cancer stage of B or C [18], a Child–Pugh class of A or 7B [19, 20], and an Eastern Cooperative Oncology Group Performance Status score of 0 or 1 [21]. The patients included also had controlled hypertension, sufficient liver function, and adequate bone marrow function. Patients who had a history of sorafenib or regorafenib use or were on maintenance dialysis were also included.

2.2 Protocol

Patients received 12 mg/day (body weight < 60 kg) or 8 mg/day (for body weight ≥ 60 kg) LEN (Eisai Co., Ltd., Tokyo, Japan). Dose interruptions followed by reductions (to 8 mg/day, 4 mg/day, or 4 mg every other day) due to LEN-related adverse events (AEs) were permitted. Eisai Co., Ltd. has prescribed protocols that were introduced in the REFLECT trial [12]. The AEs were graded according to the Common Terminology Criteria for Adverse Events (CTCAE) version 4.0 [22]. Based on the guidelines for administering LEN, the dose was reduced or the treatment was interrupted when a patient experienced an AE of grade 3 or above or if an unacceptable grade 2 AE occurred. The grade for each AE at its worst during the observation period was recorded. If an AE occurred, the dose was reduced or the treatment was temporarily interrupted until the symptoms improved to grade 1 or 2, based on the guidelines from Eisai Co., Ltd. The relative dose intensity (RDI) was calculated by dividing the actual dose by the ideal dose for the first 30 days of treatment [23, 24].

2.3 Follow-Up and Outcomes

Participants were observed for at least 16 weeks. During safety assessments, vital signs were recorded, hematological and biochemical laboratory testing was performed, and urinalysis was conducted at 1, 2, 4, 8, and 12 weeks after drug administration. AEs were graded according to CTCAE version 4.0 [22]. Therapeutic efficacy was determined at weeks 8–10 using the mRECIST [17]. The tumor markers α-fetoprotein and des-gamma-carboxy prothrombin were measured every 4 weeks. The liver was examined by computed tomography or magnetic resonance imaging using a triple-phase scanning protocol. Baseline data on clinical parameters, AEs, LEN dosages, tumor markers, and radiological findings were collected.

2.4 Statistical Analyses

Data are expressed as the median and range (25–75th percentiles). Statistical analyses were performed using the Kaplan–Meier method, the log-rank test, multivariate logistic regression analysis, and multivariate Cox’s hazard analysis. A P value of < 0.05 was considered significant. Factors with a P value of < 0.10 in the logistic regression and univariate Cox’s hazard analysis were included in the multivariate analysis. Data on patient backgrounds, tumor status, RDI > 70%, decreases in tumor marker levels at 30 days, and AEs were utilized in the univariate and multivariate logistic regression analyses. In addition, response (complete/partial response or no response) was added as a parameter in the univariate and multivariate Cox’s hazard analyses. Side effects included hand-foot skin reaction, diarrhea, hypertension, appetite loss, fatigue, and thyroid dysfunction. Data processing and analysis were performed using StatView software (version 5.0; SAS Institute, Cary, NC, USA).

3 Results

3.1 Patient Characteristics

A total of 77 patients with unresectable advanced HCC were included in the analysis. The patients were observed until May 31, 2019. The median dosing period was 173 days. The median patient age was 73.5 years, and 64 patients (83.1%) were male. All cases were actual clinical cases of Child–Pugh class 5A, 6A, or 7B. Among the four Child–Pugh 7B patients, one patient had previously been treated with sorafenib and regorafenib, and all four patients had a history of radiofrequency ablation and TACE. Twenty-three patients had a history of treatment with tyrosine kinase inhibitors (TKIs). A RDI of >70% for the first 30 days of treatment was achieved in 42 patients (54.5%). The median maximum tumor size was 36 mm, and 34 patients (44.2%) had more than ten tumors. Vascular invasion was observed in 33 patients (42.9%), and metastasis occurred in 27 patients (35.1%) (Table 1).

Table 1 Baseline patient characteristics

3.2 Treatment Outcomes

The mRECIST evaluation was successfully performed during weeks 8–10 in all 77 cases. Based on their mRECIST evaluations, two patients achieved a complete response and 21 a partial response (Table 2). Thirty-seven patients achieved stable disease status and the remaining 17 had progressive disease. The response rate was 29.9% and the disease control rate was 77.9%. There were 42 patients who achieved RDI > 70% (Table 1), and that cohort of patients had a response rate of 45.2% (19 patients responded). The remaining 35 patients who did not achieve a RDI of > 70% had a response rate of 11.4% (four patients responded). There was a significant difference in response between these two groups (P < 0.01).

Table 2 Treatment outcomes

3.3 Adverse Events

AEs occurred in 68 of the 77 patients (88.3%), and the total number of AEs was 182. The median time to AE occurrence after treatment initiation was 13.5 days (interquartile range: 7–23 days). The adverse events observed are listed in Table 3. Appetite loss was the most frequent AE (31 occurrences), followed by fatigue (26 occurrences), hypertension (23 occurrences), and diarrhea (23 occurrences). Almost all of the other AEs were of grade 1 or 2. The number of grade 1 AEs was 89 and the number of grade 2 AEs was 70. Twenty-three patients (29.9%) had grade 3 or 4 AEs. Two grade 4 AEs were observed: a case of liver dysfunction and a case of chronic heart failure. Ultimately, 52 patients (67.5%) required a dose reduction and 47 patients (61.0%) discontinued their treatment due to AEs. The most common reason for dose reduction or discontinuation was appetite loss. Cessation or dose reduction of LEN occurred in 28 of the 31 patients (90.3%) in our appetite loss (≥ grade 1) group and in 34 of the 46 patients (73.9%) in our no appetite loss group (P = 0.07). Cessation or dose reduction of LEN occurred in 12 of the 14 patients (85.7%) in our thyroid dysfunction (≥ grade 2) group and in 50 of the 63 patients (79.4%) in our thyroid dysfunction (< grade 2) group (P = 0.59). The initial dose was reduced in 12 patients based on the judgment of the primary doctor. Excluding these 12 patients, we compared the frequency of AEs between the Child–Pugh 5A patients (n = 31) and the Child–Pugh 6A patients (n = 32). There was no significant difference in the rate of occurrence of grade ≥2 AEs between these two groups (24 patients [77.4%] vs. 24 patients [75.0%], P = 0.83).

Table 3 Adverse events (AEs)

3.4 Factors Contributing to Treatment Outcomes

The multivariate analysis revealed that Child–Pugh 5A (odds ratio [OR] 4.77, 95% confidence interval [CI] 1.31–17.4, P = 0.018), RDI > 70% (OR 5.61, 95% CI 1.40–22.5, P = 0.015), and decrease in AFP level (OR 10.3, 95% CI 1.81–58.7, P < 0.01) were independent factors associated with the response to LEN (Table 4).

Table 4 Univariate and multivariate logistic regression analysis of factors associated with the response to LEN

3.5 Progression-Free Survival

The estimated median PFS was 5.6 months (Fig. 1). The PFS rates at 3, 6, and 12 months were 81.2%, 49.8%, and 34.8%, respectively. Patients with a thyroid dysfunction grade of ≥ 2 presented significantly worse PFS (median PFS: 8.0 vs. 4.2 months, P = 0.014) (Fig. 2). Patients who experienced a loss of appetite tended to have a worse PFS (median PFS: 8.7 vs. 5.2 months, P = 0.075) (Fig. 3). Patients with a tumor size of ≥ 40 mm exhibited significantly worse PFS (median PFS: 8.3 vs. 4.4 months, P = 0.033) (Fig. 4). On the other hand, patients who had good liver function (Child–Pugh class 5A) presented significantly better PFS (median PFS: 9.4 vs. 4.7 months, P < 0.01) (Fig. 5). We also performed a subanalysis comparing the PFS of Child–Pugh class 5A patients with the PFS of Child–Pugh class 6A patients. There was a significant difference in PFS between the two groups (median PFS: 9.4 vs. 4.8 months, P = 0.023). Patients with a RDI of > 70% exhibited significantly better PFS (median PFS: 9.3 vs. 4.7 months, P = 0.016) (Fig. 6). Patients who showed a complete or partial response also presented significantly better PFS (median PFS: 9.3 vs. 4.7 months, P = 0.046) (Fig. 7). We did not assess the overall survival because only 12 patients (15.6%) died during the follow-up period.

Fig. 1
figure 1

PFS of all patients following LEN treatment initiation. The estimated median PFS was 5.6 months. PFS progression-free survival

Fig. 2
figure 2

PFS of patients with a thyroid dysfunction grade of ≥ 2 and the PFS of patients with a thyroid dysfunction grade of < 2. Patients with a thyroid dysfunction grade of ≥ 2 presented a significantly worse median PFS (8.0 vs. 4.2 months, P = 0.014 by log-rank test). PFS progression-free survival

Fig. 3
figure 3

PFS of patients with appetite loss (of any grade) and the PFS of patients with no appetite loss. Patients with appetite loss tended to have a worse median PFS (5.2 vs. 8.7 months, P = 0.075 by log-rank test). PFS progression-free survival

Fig. 4
figure 4

PFS of patients with a tumor size of ≥ 40 mm and the PFS of patients with a tumor size of < 40 mm. Patients with a tumor size of ≥ 40 mm exhibited a significantly worse median PFS (8.3 vs. 4.4 months, P = 0.033 by log-rank test). PFS progression-free survival

Fig. 5
figure 5

PFS of patients with a Child–Pugh class of 5A and the PFS of patients with a Child–Pugh class of 6A or 7B. Child–Pugh class 5A patients exhibited a better median PFS (9.4 vs. 4.7 months, P < 0.01 by log-rank test). PFS progression-free survival

Fig. 6
figure 6

PFS of patients with RDI > 70% and the PFS of patients with RDI ≤ 70%. Patients with RDI > 70% presented a significantly higher PFS (median PFS: 9.3 vs. 4.8 months, P = 0.016 by log-rank test). PFS progression-free survival, RDI relative dose intensity

Fig. 7
figure 7

PFS of patients who showed a complete or partial response and the PFS of patients who did not show a complete or partial response. Patients who showed a complete or partial response presented a significantly higher PFS (median PFS: 9.3 vs. 4.7 months, P = 0.046 by log-rank test). PFS progression-free survival

3.6 Factors Contributing to Progression-Free Survival

The multivariate analysis revealed that a thyroid dysfunction grade  of ≥ 2 (hazard ratio [HR] 4.57, 95% CI 2.05–10.2, P < 0.01), appetite loss (HR 3.58, 95% CI 1.72–7.52, P < 0.01), and a tumor diameter of ≥ 40 mm (HR 2.27, 95% CI 1.17–4.40, P = 0.015) were independent factors associated with poor PFS. In contrast, Child–Pugh class 5A (HR 0.41, 95% CI 0.19–0.90, P = 0.027) and complete or partial response (HR 0.40, 95% CI 0.17–0.95, P = 0.039) were independent factors associated with a better PFS (Table 5). No other factors had a significant effect in the multivariate analysis.

Table 5 Univariate and multivariate Cox’s hazard analysis of factors associated with PFS

4 Discussion

LEN is associated with a higher response rate than other TKIs (sorafenib and regorafenib) [11, 12, 25]. In previous studies, the response rate to LEN (based on the mRECIST) reached 40–60% [12, 14]. In our study, the response rate was 29.9%, which—although higher than the response rates to previous TKIs—was lower than those reported previously for LEN [12, 14]. This result could be due to differences in patient characteristics. Only 44.2% of all patients were of Child–Pugh class 5A, and around half of those patients had advanced HCC (33 patients had vascular invasion and 27 patients had metastasis); moreover, 23 patients (29.9%) were treated with LEN as third-line chemotherapy.

Although the occurrence rate and distribution of AEs in this study were similar to those reported previously [12, 15], the most frequent AE was appetite loss (31 patients [40.3%]), followed by fatigue (26 patients [33.8%]). These AEs proved difficult to control compared to hand-foot skin reactions, diarrhea, hypertension, and thyroid dysfunction. Only a limited number of agents that target these symptoms are available: dexamethasone, domperidone, metoclopramide, and some Chinese herbal medicines (Juzentaihoto, Hochuekkito, and Rikkunshito). In total, 52 of the 77 patients (67.5%) required a LEN dose reduction, and 47 of the 77 patients (61.0%) ultimately stopped taking LEN altogether; both of these rates were higher than those for other TKIs (sorafenib and regorafenib). Thus, AEs associated with LEN should be carefully monitored and treated to promote LEN treatment continuation.

The overall response rate was 29.9%, which was lower than that in the REFLECT trial. This may be due to differences in patient background between the studies, as our study included Child–Pugh 7B patients, patients with tumor thrombosis of the main portal trunk, a tumor burden of over 50% of the whole liver, and patients on dialysis. Logistic regression analysis revealed factors that were associated with the response to LEN. RDI was found to be significantly related to complete or partial response. Recently, the therapeutic effects of LEN at particular dosages have been reported [23, 24]. Those works concluded that achieving a sufficient RDI was crucial to therapeutic efficacy. Our data also showed that those who achieved a RDI of > 70% during the first 30 days of treatment had a significantly higher response rate than those who did not achieve that RDI level (45.2% vs. 11.4%, P < 0.01), in agreement with the previously mentioned reports.

The median PFS in our study was 5.6 months, which was shorter than that found in the REFLECT study [12]. This result may also be due to differences in patient characteristics between the studies. Kaplan–Meier curves of PFS stratified by AEs, tumor diameter, and liver function revealed factors associated with PFS that were subsequently confirmed to be independent predictive factors in the multivariate analysis. Based on the proportion of Child–Pugh class 5A cases and the tumor diameters (which represent the tumor burden), our results appear to be quite reasonable [16]. However, the mechanisms underlying the associations of appetite loss (any grade) and thyroid dysfunction (≥ grade 2) with PFS are unclear. Appetite loss could lead to a deterioration in nutritional status; the serum albumin level after administering LEN for 1 month was 3.2 g/dl in the appetite loss group and 3.5 g/dl in the no appetite loss group (P = 0.013). The albumin level was associated with Child–Pugh class, and a decrease in Child–Pugh class reduced the likelihood of continuing with LEN treatment. Also, the treatment period tended to be shorter in the appetite loss group (147 days vs. 183 days in the no appetite loss group, P = 0.16). Thyroid dysfunction has been reported to be an AE that is positively related to PFS [13]. However, in the current study, thyroid dysfunction (≥ grade 2) was negatively related to PFS. Discontinuation or dose reduction of LEN was observed in 12 of the 14 patients (85.7%) in our thyroid dysfunction (≥ grade 2) group and in 50 of the 63 patients (79.4%) in our thyroid dysfunction (< grade 2) group (P = 0.59). It is unclear why thyroid dysfunction was related to poor PFS, so this requires further study [13]. Although RDI > 70% was found to be an independent factor linked to complete or partial response, it did not retain statistical significance in the multivariate analysis of factors associated with PFS. However, complete or partial response was a significant influence on PFS, and RDI > 70% was observed to be one of the factors associated with responders. Thus, it appears that a RDI of > 70% is linked to response to LEN.

The limitations of this study include its retrospective nature and the small number of participants. It also utilized a multicenter design, and the mRECIST criteria are sometimes difficult to use in clinical practice and require a high level of expertise, implying that the quality and reliability of evaluation may not have been consistently high. Also, as LEN is a novel agent, the observation period was necessarily short, and we could not determine the overall survival rate.

This clinical study has highlighted the influence of appetite loss and thyroid dysfunction on PFS. In particular, appetite loss was difficult to control and was closely related to reduced serum albumin levels. This AE must be managed to ensure that an appropriate dose of LEN is maintained over a sufficient period to achieve favorable outcomes.

5 Conclusions

Our study revealed that thyroid dysfunction and appetite loss after the administration of LEN are independent factors that shorten progression-free survival. These AEs must be managed in order to maintain the relative dose intensity and therefore achieve better outcomes.