The question of whether to perform an inguinal (IL) or ilio-inguinal lymphadenectomy (I-IL) for patients with metastatic melanoma to the inguinal lymph nodes with no clinical or imaging evidence of pelvic disease has been controversial and inconsistently resolved in different institutions for many years.1 There is uncertainty about the incidence of pelvic node involvement in patients with metastatic inguinal disease and whether patients benefit from the addition of pelvic lymphadenectomy in terms of a lower rate of disease recurrence, which has been reported in those with node-positive disease.2 Any oncological advantage of I-IL over IL needs to be balanced with the disadvantage of the potential extra morbidities related to pelvic dissection.3,4,5 These include wound infections, bleeding, hernia, dehiscence/necrosis, seroma, and lymphoedema.3

A meta-analysis of 20 studies in 2016 found that leg lymphoedema developed in approximately 33% (range 25–42%) of patients who underwent IL or I-IL for melanoma.4 Chronic physical discomfort and reduced function can plague lymphoedema patients, as well as troublesome recurrent infections and negative body image.6 Additionally, lymphoedema can result in significant financial burden on those affected because of treatment and compression garment expenses, time off work, and sometimes, the need to change occupations.7

It is generally thought that IL results in lower rates of lymphoedema than I-IL because of fewer removed lymph nodes and less disruption of lymphatic pathways. This view is supported by a previous cross-sectional study.5 There are, however, conflicting results arising from other studies. The prospectively designed MSLT3 and ANZMTG 01.02/TROG 02.018 trials failed to identify increased lymphoedema risk in those who underwent additional pelvic dissection. A recent Canadian retrospective study also arrived at the same conclusion, but lymphoedema was not objectively measured.9 The variability in definition and measurement of lymphoedema, as well as follow-up duration, contributed to inconsistencies in the results.

The Evaluation of Groin Lymphadenectomy Extent for Melanoma (EAGLE FM) study sought to determine if there was a difference in disease-free survival at 5 years after randomisation to IL or I-IL. Secondary endpoints included the prevalence of lymphoedema and quality of life. This EAGLE FM substudy prospectively compared the prevalence and severity of lymphoedema between patients who have had IL and I-IL.

Methods

The EAGLE FM trial is an international, multicentre, phase III, noninferiority, prospective, randomised clinical trial, which commenced in 2015. The study recruited patients from 14 hospitals across Australia, Brazil, Italy, The Netherlands, Slovenia, and the United Kingdom. Patients were identified through participation of the surgeon and/or other multidisciplinary team members as co-investigators on the clinical trial.

Primary outcome measure was disease-free survival at 5 years after randomisation. Secondary endpoints included the prevalence of lymphoedema and quality of life. The EAGLE FM trial was suspended prematurely in 2021 because of rapid changes in systemic therapy for stage III melanoma and cessation of completion lymph node dissection for positive sentinel node biopsy after the results of MSLT II10 and DeCOG SLT.11 The lymphoedema substudy was originally intended to follow-up patients to 60 months, but it was discontinued early after 24 months of follow-up because of participant attrition. EAGLE FM follow-up is still ongoing for the primary and recurrence-related secondary outcomes.

Patients were eligible if they had a Stage III inguinal nodal metastatic melanoma from a primary cutaneous melanoma or no known primary tumour with one or multiple inguinal node(s) involved, histologically or cytologically proven as metastatic melanoma, ECOG performance status between 0 and 2 at randomisation, and were able to provide informed consent. Patients were excluded if there was distant metastatic disease on clinical examination or staging imaging, pelvic lymph involvement on sentinel node biopsy or PET/CT scan, and/or bilateral inguinal lymph node involvement. Please refer to supplementary files for full eligibility criteria and surgical protocol. Ethics approval was obtained from each institution’s review board. This trial was registered in the U.S. National Library of Medicine Clinical Trials Registry (NCT02166788).

A minimisation method was applied to randomise patients continuously into IL or I-IL groups using the MinimPy program.12 The randomisation was stratified by centre, gender, age, macroscopic versus microscopic lymph node involvement, extracapsular spread versus no extracapsular spread, and primary melanoma (known vs. unknown primary).

Lymphoedema Substudy

Lymphoedema-related assessments occurred preoperatively at baseline and at 6, 12, 18, and 24 months after surgery. The lymphoedema assessor was blinded to knowledge about the surgical groups.

Lymphoedema-related assessments included the following:

  1. 1.

    Circumference measurement and volume calculations using a measurement board, marking pencil, and tape measure.

  2. 2.

    Bioimpedance spectroscopy where available.

  3. 3.

    Muscle length assessments; Thomas test for hip flexor length and knee-to-wall test for gastrocnemius/soleus length.

  4. 4.

    Lymphoedema Quality of Life questionnaire (LYMQOL) for patients who were identified to have lymphoedema via volumetric difference or bioimpedance.

Circumferential Measures and Volume Calculations

Patients were measured in the supine position by using a measurement board placed under each leg. Circumferences were taken by using a tape measure at 10-cm intervals along the affected and unaffected legs.13,14 The raw circumferences were converted to limb volumes using a truncated cone formula, and Inter-limb Volume Differences (ILVD) were calculated as a percentage.

Multifrequency Bioimpedance Spectroscopy

Bioimpedance spectroscopy utilises the characteristics of frequency-dependent current flow to quantify changes in extracellular fluid in a participant’s limb. Impedimed SFB7 and U400 were used to perform bioimpedance spectroscopy. Both machines measure the resistance to current flow in the extracellular fluid of the operated leg compared to the contralateral side via use of surface skin electrodes, producing a Lymphoedema Index (L-Dex®). Bioimpedance spectroscopy is unaffected by variations in the participant’s weight due to a change in fat mass or muscle mass and provides an instant tool for assisting in the clinical assessment of lymphoedema change.

Lymphoedema thresholds were defined based on the International Lymphoedema Framework Best Practice Consensus Report and the few published clinical trials available for lower limb lymphoedema at that time of study inception.5,15,16,17

  1. 1.

    ILVD >10% compared to the unaffected limb or

  2. 2.

    L-Dex® >10 or

  3. 3.

    L-Dex® change of >10 points compared with baseline.

Participants were referred for lymphoedema management if they had protocol-defined lymphoedema. Lymphoedema management was at the discretion of the treating institution, including the use of compression garments for treatment or prophylaxis. Use of compression garments was recorded at each follow-up and may have started or ceased at, or between, any timepoint/s.

Muscle Length Assessments

Hip flexor length was assessed by using the position required for Thomas test. Thomas test is performed with the patient in supine lying with lower back flat on plinth. The contralateral leg is passively flexed to end of range. A positive Thomas test occurs when ipsilateral leg raises off the plinth, and this indicates short hip flexor length. Hip flexor length tests were discontinued once a negative test was achieved. Calf muscle length was measured with the patient standing in a lunge position facing the wall and with affected knee and hip extended and heel flat on the floor. The distance from the wall to the distal first phalanx was measured in this position.

LYMQOL

The LYMQOL-LEG questionnaire is a validated condition-specific Quality of Life assessment tool, which can be used for lymphoedema of the limbs both in clinical assessment and as an outcome measure.18 This tool contains domains including physical, emotional, social, role functioning, and other symptoms, such as pain and fatigue. Only participants who present with a protocol-defined lymphoedema completed the LYMQOL-LEG questionnaire.

Statistical Methods

Sample size was calculated for the hypothesis of the primary study, which is noninferior disease-free survival in IL compared with I-IL, within a margin of 7%, at 5 years after surgery. The purpose was to recruit a sample size of 634 subjects (317 in each group) to achieve 80% power at a 0.05 significance level.

Data were stratified based on the type of lymphadenectomy and the presence or absence of lymphoedema. The relationships between each stratum and independent demographic and treatment factors were analysed by using Fisher’s exact test and Wilcoxon rank-sum test. Linear hypothesis tests were conducted to see if the two surgical groups and their interaction with other variables were associated with different likelihoods of developing lymphoedema at each timepoint. Additionally, a marginal model known as Generalised Estimating Equations (GEE) was employed to statistically quantify the relationship between lymphoedema and these factors, taking into consideration the repeated measurement correlations. Independent variables with a p-value < 0.2 and the type of lymphadenectomy were considered in the GEE model. To obtain a more parsimonious model and improve accuracy, insignificant variables were removed from the final multivariate model. The reference group in the model consisted of female IL patients, age < 55 years with nondominant limb affected, average body mass index (BMI) and total node removed, and no compression garment usage.

Results

Of the 88 patients randomised, 82 completed the lymphoedema assessment at baseline (IL = 42, I-IL = 40), and 60 patients (IL = 29, I-IL = 31) completed it at 24 months (Fig. 1). Reasons for not completing the assessment include missing data and discontinuation of the study due to death and lost to follow-up. Hip flexor and calf length measures were not included in the results due to measurement error for some sites resulting in noninterpretable data. LYMQOL data were not analysed due to large amounts of missing data for the low number of patients with protocol-defined lymphoedema.

Fig. 1
figure 1

CONSORT diagram. *Lost to follow-up is defined as those who did not attend EAGLE FM study measurements from 24 to 36 months.

At baseline, there were no significant differences between IL and I-IL in terms of gender, age, BMI, dominant limb affected, geographic location, ILVD, or L-Dex® (Table 1). Race and ethnicity data were missing for most sites, because this information was not requested as a compulsory field in the 2015 study protocol. Mean (standard deviation [SD]) number of total lymph nodes was 10.9 (5.1) for IL and 21.5 (7.8) for I-IL patients. There was no significant difference found between the two surgical groups in the number of positive nodes, groin radiotherapy, micrometastases and macrometastases, or wound infection postoperatively. There were 26 serious adverse events (IL = 12, I-IL = 14). These included wound infections, excessive wound drainage, thromboembolism, and syncope. All but one event occurred within 120 days of surgery.

Table 1 Patient demographics at baseline

Baseline L-Dex® was captured for 37 participants (45%), but this reduced to 24 participants (40%) by 24 months. All available participants underwent circumferential measurement for limb volume throughout the study. Some patients with leg lymphoedema were detected by using the ILVD >10% definition but were not captured by L-Dex® > 10, whereas some were detected by L-Dex® and not by ILVD (Table 2).

Table 2 Lymphoedema detection methods

At baseline, the prevalence of leg lymphoedema was higher for IL (5/42) compared with I-IL (1/40). Peak lymphoedema prevalence occurred at 6 months for both lymphadenectomy groups, with approximately one in every two patients affected (45.9% IL [CI 29.9–62.0%], 54.1% I-IL [CI 38.0–70.1%]). Table 3 presents the summary of variables stratified by lymphoedema at 6 months. Lymphoedema prevalence decreased from 6 months, affecting 18.8% [CI 5.2–32.3%] of IL and 41.4% [CI 23.5–59.3%] of I-IL patients at 18 months. Between 18 and 24 months, the prevalence of lymphoedema rose for both surgical groups but the prevalence remained lower than at 6 months (IL 31.0% [CI 14.2–47.9%]; I-IL 45.2% [CI 27.6–62.7%]). Although there is no statistical difference between the surgical groups overall, I-IL patients had a higher lymphoedema prevalence than IL patients during all follow-up timepoints (Fig. 2).

Table 3 Summary of variables stratified by lymphoedema at 6 months
Fig. 2
figure 2

Lymphoedema prevalence over time

At 6 months, median ILVD was 6.5% (IQR 3.4–11.9%) and 10.5% (IQR 5.6–14.7%) and for those measured with bioimpedance, median L-Dex® was 5.1 (IQR 1.8–8.4) and 5.1 (IQR − 0.3 to 8.7) for IL and I-IL respectively. At 24 months, median ILVD remained low for both groups at 5.8% (IQR 3.4–9.6%) and 7.2% (IQR 3.6–14.0%) for IL and I-IL, but median L-Dex® was significantly higher for I-IL at 9.1 (IQR 0.9–13.3) compared with − 0.6 (IQR − 3.4 to 3.7) for IL (Fig. 3a, b).

Fig. 3
figure 3

a Interlimb volume difference over time. b L-Dex® over time

The severity of leg lymphoedema was mild in most cases detected at 6 months, with a median ILVD of 13.8% (IQR 11.2–17.3%) and L-Dex® of 9.4 (IQR 7.0–11.4). At 24 months, lymphoedema severity increased slightly for those affected, with median ILVD at 14.5% (IQR 10.6–18.7%) and median L-Dex® at 12.6 (IQR 9.0–17.2).

Recurrence data were available for 59% (n = 48) of our participants. Recurrence did not differ significantly between IL and I-IL groups, nor was it associated with the development of lymphoedema during follow-up. Local or in-transit recurrence occurred in 21.4% of IL and 25% of I-IL (p = 0.955) and regional recurrence occurred in 26.2% of IL and 15% of I-IL (p = 0.334).

Interestingly, 52.7% of all patients (n = 74) were using compression garments at 6 months, irrespective of whether they had lymphoedema. By 24 months, compression garment use had decreased to 41.7% (n = 60); only ten (43.5%) of the 23 patients with lymphoedema used compression. The pattern of compression garment use appeared to follow the lymphoedema prevalence of the lymphadenectomy group at that timepoint; higher usage corresponded to higher prevalence. For example, in the IL group, highest lymphoedema prevalence at 6 months of 45.9% corresponded with highest compression garment use in 43.2% of patients, whereas lowest lymphoedema prevalence at 18 months of 18.8% corresponded with lowest compression garment use in 28.1% of patients.

The final fitted multivariate model showed that the odds of developing lymphoedema between the IL and I-IL groups depends on time (Fig. 4). The odds ratio (OR) between the surgical groups substantially increased from the baseline, reaching around 10 (OR = 10.44; CI 1.32–82.40; p = 0.026), 16 (OR = 15.56; CI 1.77–137.18; p = 0.013), and 9 (OR = 9.10; CI 1.03–80.38; p = 0.047) times higher at 12, 18, and 24 months, respectively. Additionally, the likelihood of developing lymphoedema reached its highest point at 6 months (OR = 9.40; CI 3.53–25.04; p < 0.001). Linear hypothesis testing demonstrated a significant difference between I-IL and IL in their chance of developing lymphoedema at 18 months (OR = 3.49; CI = 1.15–10.56; p = 0.027). However, there was no statistical evidence to support a difference at any other timepoint or overall difference (baseline: OR = 0.22; CI 0.03–1.56; p = 0.131, 6 m: OR = 1.5; CI 0.58–3.85; p = 0.399, 12 m: OR = 2.34; CI 0.82–6.68; p = 0.113, 24 m: OR = 2.04; CI 0.69–5.99; p = 0.195, overall: p = 0.092).

Fig. 4
figure 4

Odds ratios for lymphoedema

Our multivariate model also indicated that patients older than 55 years were twice as likely to be affected by lymphoedema than those younger than 55 years (OR = 2.04; CI 1.04–4.00; p = 0.038). Male patients were associated with a significantly higher risk of lymphoedema development compared with female patients (OR = 2.26; CI 1.14–4.47; p = 0.019). Patients who used compression garments had a significantly reduced prevalence of lymphoedema by 39% on average (OR = 0.61; CI 0.37–1.00, p = 0.049).

Discussion

Despite slow recruitment and smaller than anticipated participant numbers, EAGLE FM remains the largest, prospective, randomised, controlled trial addressing the extent of surgery for inguinal nodal metastatic melanoma. Access to modern adjuvant therapies in the management of melanoma is unequal worldwide. For many patients, lymphadenectomy is still necessary for disease control. Additionally, there are patients whose comorbidities prevent them from receiving immunotherapy or targeted therapy, as well as nonresponders to these treatments.

Because of lack of statistical evidence, we cannot report whether there is any difference in lymphoedema prevalence between the surgical groups in general over the course of follow-up. However, I-IL patients did show a significantly higher chance of developing lymphoedema at 18 months compared with IL. This is consistent with the results of our previous study that indicate a higher overall prevalence for patients who have I-IL versus IL.5

Our multivariate model also confirms the odds ratio of lymphoedema between I-IL and IL increased substantially at 12, 18 and 24 months follow-up. It is possible that the effects of the more extensive I-IL are compounded over time so that factors, such as deposition of scar tissue blocking lymph pathways and lymphangiogenesis, and chronic tissue inflammation, do not become apparent until 12 months or later from surgery.19,20

The prevalence of lymphoedema was similarly high for both IL and I-IL patients at 6 months and declined for both groups from 6 to 18 months but increased again at 24 months. This would imply that for some patients, leg lymphoedema detected at 6 months, is reducible and may be reversible. It is unclear whether it is natural recovery over time or lymphoedema treatment that helped reduced the leg oedema in these patients. Regardless, this may help to reassure patients that not everyone who has leg oedema at 6 months develop chronic lymphoedema.

The severity of lymphoedema for IL and I-IL was mostly mild during follow-up. The fact that our participants were monitored regularly, similar to a prospective surveillance model, may have contributed to cases being detected early and referrals made early for lymphoedema treatment, thereby reducing the overall severity and prevalence. It is well-known that a prospective surveillance model of care for breast cancer patients reduces the prevalence and burden of chronic lymphoedema and the cost to the health system.21,22 This model of care can be generalised to benefit melanoma patients who have had lymphadenectomies and the health system that treats them.

Leg lymphoedema detected by ILVD>10% did not always correspond with an L-Dex® > 10. Since developing the EAGLE FM protocol in 2015, there has been a lowering of the lymphoedema threshold from 10 to 7, which is reflective of a change from 3 to 2SD away from the average estimate of the healthy population.23We reanalysed the data with the new threshold of 7, and it improved the joint detection rate of ILVD and L-Dex® by four patients at 6 months and one patient at 18 months, but these patients were already identified to have lymphoedema by the ILVD definition in our original analysis. Ultimately, lowering the threshold did not change the overall lymphoedema prevalence in this study. Objective lymphoedema definitions remain contentious; this extends to the ILVD threshold, with calls to both lower and raise the threshold in recent years.24,25 Until there is agreement on a universal definition, replication and comparison of studies remain problematic.

Our study demonstrated that older age was associated with an increased risk of lymphoedema. Older age is associated with enlarged lymph vessels and increased permeability in both animal and human models.26 Additionally, comorbidities, such as venous insufficiency, is more likely to affect the older age group.27 Venous insufficiency may be subclinical before surgery, but the increase in interstitial fluid after surgery can place more demand on fluid return, resulting in both venous and lymphatic failure, and leg lymphoedema.

Around half of all participants were using compression garments at 6 months, irrespective of lymphoedema status. It is unclear which patients were using their compression garments prophylactically and which patients were using their compression garment as treatment for their lymphoedema. This is indeed a limitation of our study. Compression garment use would need to be controlled and consistent during follow-up to evaluate its effectiveness. The pattern of compression garment use in our results for both groups appear to follow the prevalence of lymphoedema for that group at the timepoint of follow-up, which would suggest that they were being used for treatment rather than prophylaxis. Prophylactic use of compression garments has not been proven to reduce lymphoedema risk in melanoma patients after groin dissection, but it has been shown to reduce lymphoedema in high-risk breast cancer patients.28,29 Our multivariate model would appear to agree with the latter as our patients who wore compression garment were far less likely to have lymphoedema than those who did not wear compression.

Patients that were using compression garments as treatment for their lymphoedema would be expected to have minimal lymphoedema, potentially underestimating the overall extent of lymphoedema in the study population. On the other hand, it also would appear that many of our patients with lymphoedema did not wear, or adhere, to compression garments despite referral to a lymphoedema clinic. A previous study also found that females were more likely than males to wear compression stockings for the management of venous oedema, which would explain the higher odds of lymphoedema for males in our study.30 It is likely that patient perception of threat and their coping strategies, including their perception of lymphoedema being a problem, their ability to don garments, garment cost, comfort, and employment contributes to their adherence to compression.31,32,33

Conclusions

Despite the trend for higher lymphoedema prevalence in patients who have had I-IL over IL, our study’s small sample did not have the statistical evidence to support an overall difference in lymphoedema prevalence between the surgical groups during 24 months of follow-up. Lymphoedema severity was mostly mild in those affected.