Background

Breast cancer–related arm lymphoedema (BCRL) is a common sequelae of breast cancer treatment with a significant physical and psychosocial impact on women and their families [1, 2]. BCRL typically presents as arm swelling, whole or part, often extending into hand and fingers and sometimes the adjacent trunk quadrant, including breast lymphoedema. Symptoms may include heaviness, with reduced manual dexterity affecting activities of daily living sometime require work environment modifications or change of employment [3,4,5]. Women with BCRL typically experience greater psychological morbidity and psychosocial impact than those with breast cancer alone [2, 6]. For many women, swelling may initially subside with rest and elevation. For others, symptoms become more severe and increasingly difficult to treat, with firmer and fibrosed subcutaneous tissues and increased risk of infection due to reduced lymphatic drainage [7,8,9,10].

In recent years, prophylactic lympho-vascular anastomotic surgery and less aggressive axillary surgery (sentinel lymph node biopsy) and radiotherapy protocols have minimised axillary treatment and reduced the incidence and presenting severity of BCRL [11]. However, a pooled BCRL incidence of 14.29% (95%CI 13.79–14.79) suggests that of the 55,000 new cases of invasive cancer diagnosed each year in the UK at least 7000 will develop BCRL within 2 years following cancer treatment, with others developing BCRL many years later [12,13,14].

Lymphoedema remains a physical and psychosocial problem regardless of the amount of visible swelling or whether symptoms are mild, moderate or severe [15]. Although women are routinely advised to access early treatment should swelling occur, evidence for effectiveness of initial BCRL treatment is lacking. International consensus has for the past 30 years recommended two-phase decongestive lymphoedema treatment (DLT), a combination of conservative treatments also known as complex decongestive treatment (CDT), to reduce and control lymphoedema and avoid the complication of cellulitis [16,17,18]. While several studies have shown that women who receive initial intensive-DLT followed by maintenance self-treatment with hosiery will achieve better outcomes than those self-treating with hosiery alone, particularly for those with mild and recent BCRL onset [19,20,21,22], Dayes et al. [23] found no significant difference in outcomes (n = 105). Systematic reviews have concluded there is weak evidence for the effectiveness of DLT although insufficient to inform an optimal DLT package due to the wide range of treatment protocols, programme duration and variation in study population severity [24, 25]. Most reviews noted a predominance of underpowered studies, heterogeneity of treatment protocols and measured outcomes and lack of follow-up monitoring long-term effects [24,25,26].

International awareness of BCRL has greatly improved since an early systematic review concluded the major limiting factors for BCRL treatment were lack of awareness of the problem and insufficient resource to manage it [27]. BCRL is no longer seen as a neglected or rare condition, with women now routinely informed about self-management strategies to minimise the risk of BCRL and signposting to ensure they no longer wait months or years to access treatment [28]. However, despite the existence of many UK lymphoedema services, lymphoedema treatment capacity remains a problem, and women are routinely offered self-treatment with hosiery although the impact of this approach on treatment outcomes has not been addressed [16, 18, 29]; UK practitioners struggle to provide appropriate care to increasing caseloads of individuals with all types of lymphoedema and complex comorbidities [30, 31]. In contrast, other countries consider two-phase DLT with initial intensive-DLT to be standard care despite reporting limited service capacity for providing intensive-DLT [32] or MLD [33].

Research into BCRL management has for many years focused on surveillance and testing whether prophylactic interventions for subclinical oedema will reduce the incidence of BCRL [34]. A previous review found that evidence for the effectiveness of initial treatment for women with early established BCRL, that is, less than 12-month duration, hereafter referred to as ‘early BCRL’, is lacking [35] yet this is not considered a research priority.

The objective of this updated systematic review was to (1) identify the effect of DLT on excess arm volume (EAV) and patient-reported outcomes for women with early BCRL and (2) determine whether four years after the original review [35], there was still the need for a definitive trial; it is good practice to periodically update systematic reviews and report changes to findings [36]. The definition of early BCRL as < 12-month duration was chosen to exclude women with chronic changes to their affected arm, such as fibrosis and thickening, which would be more difficult to treat.

Method

Literature searches and inclusion criteria

Figure 1 shows the review process which was conducted according to the published protocol [37, 38], PROSPERO (CRD42015015843), as reported in the PRISMA checklist (Online Resource 1). The JBI review method was chosen because it allowed inclusion of uncontrolled quantitative studies and it was expected that relevant lymphoedema research studies could be of non-experimental design [3941].

Fig. 1
figure 1

Overview of systematic review method

Eligibility criteria and keywords (Table 1) were developed using the PICO framework and JBI terminology (type of Population, Intervention, Comparator, Outcome, Study) [40, 41]. The final search strategy combined variant terms for ‘lymphoedema’ and ‘breast cancer’ in title, abstract, subject heading and keyword (Online Resource 2). The strategy was deliberately broad as attempts to refine searches omitted potentially relevant papers. The list of included databases (see Online Resources 2 and 3) was also broad, including a wide range of medical and nursing databases, to identify relevant black and grey literature and non-English language publications with an English language abstract, with no date limits applied; early lymphoedema papers were published in journals which were not index linked.

Table 1 Eligibility criteria, structured using type of Population, Intervention, Comparison, Outcomes and Studies (PICOS)

The original searches in July 2016 [35] were updated on 6 July 2018 following the original protocol although without searching grey literature [37, 38]. The most recent search, on 8 September 2022, was limited to PubMed, CINAHL and AMED databases which was a pragmatic decision to manage the time-consuming process of updating the review by searching those databases considered most likely to capture all relevant literature from journals addressing medicine, nursing and professions allied to medicine; the review protocol was otherwise unchanged (Online Resource 3). Four lymphoedema journals (EJLRP, J.Phlebo.Lymphol, Lymphology, J Lymphoedema) were hand-searched on each occasion (Online Resource 4), and reference lists from contemporary systematic reviews of BCRL treatment were checked for additional references, including Lasinski et al. [42] in 2016 and Davies et al. [43] and Marchica et al. [44] in 2022. All search results were electronically imported into EndNote with duplicate references removed using the automated duplicates finder and by manual sorting. The remaining records were combined into a single EndNote reference library (version X9).

One reviewer (EJ), an expert lymphoedema practitioner and researcher, screened records against the review eligibility criteria (Fig. 1): firstly, to remove titles describing other populations and interventions not of interest; secondly, to remove papers without an abstract and abstracts which did not include the population, interventions and outcomes of interest; finally, to remove full papers not fulfilling the eligibility criteria (Table 1), and examining at this stage those papers with abstracts lacking sufficient detail for exclusion. Eligibility of non-English papers was confirmed using online translation tools and bilingual colleagues; no formal translation was required. Due to a lack of studies with the population of interest, studies were included which separately reported a subgroup of the whole population or one study group with < 12-month or mean < 9-month BCRL duration which would keep to a minimum the inclusion of women with duration ≥ 12-month (chronic BCRL). The number of references excluded at each stage was recorded with reasons documented only for those excluded at full paper screening. Corresponding authors were contacted where full papers contained insufficient detail to determine eligibility; papers were excluded if contact details were missing or corresponding authors did not respond.

Methodological assessment and data extraction

Two reviewers (EJ, DB) independently assessed included papers using standardised JBI critical appraisal tools to determine how each study addressed the possibility of bias in its design, conduct and analysis [35, 41]. A lymphoedema definitions list (Online Resource 5) was developed (EJ) to guide assessment by the secondary reviewer (DB). Differences of opinion were resolved through discussion, with no input required from a third independent reviewer (ER). Methodological limitations were reported rather than excluding poor quality papers.

Relevant outcomes were extracted (EJ) using the JBI standardised data extraction tool (Online Resource 6) adapted (EJ) to capture sufficient data to describe lymphoedema setting, population, interventions and outcomes of interest [41]. Corresponding authors were contacted where data were missing or unclear. Due to wide heterogeneity of interventions and outcomes of interest, findings could not be synthesised but are instead displayed in narrative and tabular format according to reported study intervention with summary statistics; relevant missing data are reported. Certainty of evidence was rated using the GRADE system with a strong recommendation (grade A) for health benefits indicating robust evidence of good quality and a weak recommendation (grade B) indicating some benefit with poor quality evidence [45]. There were insufficient data to perform two planned subgroup analyses: (1) previous lymphoedema treatment and (2) severity of swelling at baseline.

Results

Study selection

A total of 23,348 references were identified from three searches (Fig. 2), with most excluded as duplicates (n = 13,937, 60%) and by screening titles (n = 8829, 38%). Excluded abstracts (n = 329) included conference abstracts (n = 33) without a full publication. Excluded full papers (n = 243) are reported with reasons in Jeffs et al. [35] for the original search (n = 160) and in Online Resource 7 for updated searches (n = 83). Many studies did not report BCRL duration.

Fig. 2
figure 2

PRISMA flow diagram of search and study selection process

Only 10 studies fulfilled the eligibility criteria, six from the original search, one from 2018 and three from 2022. Another paper included in the original review [35] was excluded here because their findings were presented as figures and, due to lack of clear labelling, could not be compared with other studies [46]. The included studies were six RCT [21, 23, 47,48,49,50], one quasi-controlled study [51], one prospective uncontrolled study [52] and two retrospective case review studies [53, 54].

Methodological assessment

All studies had significant areas of bias, with controlled studies scoring 6–10 out of 13 items (Table 2) and uncontrolled studies 3–7 out of nine items (Table 3). Only two trials described both randomisation method and allocation concealment [21, 23]. Although five trials blinded their assessors [21, 47,48,49], participant and therapist were not blinded to allocation despite three trials testing interventions with potential for blinding participants: bandage method, exercise programme [49,50,51]; there is no sham version of manual lymph drainage (MLD), self-lymphatic drainage (SLD) or two-phase DLT [21, 23, 47, 48]. Sample sizes for this review were small (n = 24–57), affecting precision of estimates and increasing the possibility of a type II error. Six trials reported attrition (5–40%), for mostly study-related reasons [21, 23, 48, 50, 51], although only one trial conducted sensitivity analyses [23].

Table 2 Results of critical appraisal based on Joanna Briggs Institute (JBI) tool for appraising randomised controlled trials (RCT)
Table 3 Results of critical appraisal based on JBI tool for appraising quasi-experimental, non-randomised studies

All trials reported comparable study groups at enrolment and, except for their intervention of interest, treated women identically and according to their study allocation. Outcomes included in this review were measured using reliable methods, although findings were excluded for two studies which reported absolute volume [50] or only the affected arm [49]. Only one study [21] reported training assessors which would reduce measurement bias although two studies used experienced lymphoedema practitioners [48, 50]. However, heterogeneity of outcomes and measurement methods made comparisons difficult, and studies excluded from their analysis those women who did not complete allocated treatment or subsequent study follow-up.

The two retrospective case review studies had low risk of bias [53, 54]. The prospective uncontrolled study [52] lacked sufficient detail to ensure reproducibility of reported outcome measurements, used a non-validated tool to measure self-reported symptoms and did not follow study outcomes beyond the immediate post-intervention period.

Characteristics of included studies

Table 4 shows an overview of key characteristics for each study with findings reported for only the eligible group, excluding findings for intervention and comparator groups with > 12-month or mean > 9-month BCRL duration.

Table 4 Characteristics of included studies, reporting study design, participants, interventions, outcomes and follow-up

Participants

Results were available for 327 women with mean/median age from 47.5 to 61.6 years, although no study specifically addressed the population of interest. Four studies (n = 148) reported a subgroup with BCRL duration ≤ 12 months [21, 23, 52, 53], four studies (n = 138) had mean ≤ 9-month duration [47, 49, 50, 54] and two trials (n = 41) reported only one study group with mean ≤ 9-month BCRL duration [48, 50]. The number of women with prior treatment experience could not be determined as only four studies explicitly excluded women due to prior BCRL treatment [48, 51, 52]. Three studies did not address prior BCRL treatment and three excluded an unknown number of women who had received treatment within the previous 1–6 months [21, 23, 50].

The controlled studies defined BCRL by arm size difference using varying criteria, with four studies contributing a pooled mean baseline of 21%EAV [23, 48, 53, 54]. No descriptive study provided a BCRL definition. Three populations reported moderate BCRL, ≥ 20%EAV [23, 47, 53], and one reported mild BCRL, < 20%EAV [52]. Only two studies reported ISL lymphoedema severity stage [47, 53].

Study setting

Studies were conducted between 2000 and 2019 in countries with well-developed health care systems, mostly in Lymphoedema Units or Rehabilitation Departments associated with cancer services, with time periods varying from 8 months to 11 years. Studies were conducted in: Canada [21, 23], Iran [51, 52], South Korea [49, 50, 54], Turkey [47], Greece [53], Poland [48].

Interventions and comparators

The studies reported heterogeneous treatment protocols, intervention duration and components (Tables 4 and 5). Only two studies provided sufficient detail to replicate key components of both intervention and comparator groups [21, 48]. One trial [23] and three descriptive studies [52,53,54] provided intensive-DLT as the intervention, including compression bandaging (CB), MLD, exercise and skin care. A further four trials provided intensive-DLT to intervention and/or comparator group [21, 48, 49, 51]. There was heterogeneity of CB materials, application method, treatment duration (2–4 weeks) and post-intervention follow-up (0–104 weeks). Only two trials provided sufficient detail to replicate intervention and comparator [21, 48]. Detailed information regarding exercise and skin care was lacking from most studies.

Table 5 Summary of treatment components in each study

All 10 studies provided CB, for 2–5 weeks, although only one trial [50] specifically addressed the use of CB without reporting sufficient detail regarding materials and other treatment components to replicate the intervention. Only two trials described the process sufficiently to replicate CB [21, 48]. Otherwise, CB was typically described as short-stretch without indicating the number or type of bandage and sub-bandage layers.

Two trials tested the use of MLD to determine whether treatment outcomes improved with intensive-DLT compared with CB alone [21, 48]. Protocols differed in number of sessions (10 and 20) and duration (30 and 45 min). Another six studies provided MLD within their treatment protocol [23, 49, 51,52,53,54], reporting only the session duration and treatment method, whether Vodder or Foldi, which should be sufficient for trained MLD practitioners to replicate the intervention.

One study tested whether SLD enhanced CB outcomes, providing 10–15 min SLD prior to bandaging, with SLD discontinued during the median 6-week post-intervention self-treatment period [47]. Two other studies included SLD only during the post-intervention self-treatment phase so were excluded from this analysis [51, 54].

One trial tested the benefit of adding 15-min active resistance exercise (ARE) to intensive-DLT compared with intensive-DLT alone, with ARE continued throughout the 6-week self-treatment phase [49]. Six other studies reported exercise done by both study groups [23, 47, 48, 52,53,54] although only one provided sufficient detail to replicate the exercises [48]. Another trial reported relaxation exercise as the intervention with findings excluded as the intervention group was not the population of interest [51].

Outcomes

Outcomes of interest to this review included arm size, reported as relative change in EAV (Table 6) or pre- and post-intervention %EAV (Table 7); health-related quality of life (HRQOL) using the 36-item Short Form Health Survey (SF-36; RAND, USA) (Table 7); function using the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire and QuickDASH (Table 7); arm heaviness and patient-perceived benefit. Wide heterogeneity in study populations, interventions and measurement outcomes made meaningful comparison of data difficult and meta-analyses could not be performed.

Table 6 Results for studies reporting post-intervention relative (%) reduction in excess arm volume (EAV)1
Table 7 Results for studies reporting pre- and post-intervention relative excess arm volume (%EAV)

Length of follow-up

Duration of follow-up ranged from 0 to 24 months (Table 5). The longest follow-up for a prospective trial was one year [23] and two years for a retrospective study [54]. Two trials completed follow-up at 6 weeks [49, 51] which is insufficient to determine the long-term benefit or retention of treatment effect. Four studies did not follow participants beyond completion of the study intervention period [21, 50, 52, 53].

Findings

Tables 6 and 7 show findings, from the 10 studies according to intervention of interest and as published, including available p values, with additional findings for a subgroup of the study population provided by corresponding authors.

Intensive-DLT/CDT

Two trials reported eligible data for both intervention and comparator groups. Dayes [23] reported no significant difference in between-group outcomes although 25% of those receiving intensive-DLT achieved ≥ 50%EAV reduction compared with 15% of those self-treating with hosiery. McNeely [21] reported greater reduction for those receiving CB + MLD (intensive-DLT) compared with CB alone, although they did not report the statistical significance and the study sample was smaller.

Dayes [23] and Michopoulos [53] reported pre- and post-intervention %EAV, categorised as pre-intervention moderate swelling (≥ 20% < 40%EAV) respectively reducing to mild (≥ 10% < 20%EAV) and minimal (< 10%EAV) post-intervention swelling. The reason for substantially greater 81% improvement (Table 6) reported by Michopoulos [53] is unknown as their baseline 29%EAV is similar to other studies. Only Hwang [54] reported no post-intervention reduction, although they considered lack of progression over the 2-year study period a positive outcome.

There were varied outcomes for HRQOL and function (Table 7). Kim [49] reported significant post-intervention HRQOL improvement for both groups (p < 0.05) with, of note, substantially higher pre-intervention SF-36 scores for both groups than Dayes [23] who reported no significant change for both groups. Gradalski [48] reported high satisfaction with treatment outcomes, self-rating 9.4/10, although it is unclear whether women rated satisfaction with the treatment experience or outcomes.

These limited findings suggest intensive-DLT can improve BCRL symptoms although, due to conflicted findings, evidence is insufficient to report benefit.

Compression bandaging

Three studies reported demonstrated improvement with CB (Tables 6 and 7) although without shared or comparable outcomes or reporting methods [21, 47, 50]. These limited findings suggest CB can improve BCRL symptoms although, as the amount of improvement is unclear and no study compared CB with another intervention, evidence is insufficient to report benefit.

Manual lymph drainage

One trial [21] tested the effect of MLD within the intensive-DLT package although statistical difference was not reported for the eligible subgroup of the population, so it is unclear whether these findings favour inclusion of MLD (Table 7). Although Gradalski [48] conducted a similar trial, the findings cannot be reported as the intervention group was ineligible for this review. One study addressed SLD [47] with improvements to volume, HRQOL and arm function found in both groups, but they concluded SLD added no benefit to CB (Table 7).

Therefore, evidence is insufficient to report benefit of MLD or SLD within the intensive-DLT package.

Exercise

Kim [49] reported statistically significant improvement in post-intervention HRQOL for both groups receiving intensive-DLT (p < 0.05) although intergroup difference favoured the ARE intervention (p < 0.05) (Table 7). However, findings for reduction in arm size were excluded as reported using an ineligible method. There is insufficient evidence for the use of exercise to treat women with early BCRL.

Summary of findings

Few comparisons could be made between studies reporting the same intervention or comparator due to inconsistencies and imprecision of results, small samples and lack of directly comparing treatments. There were serious limitations affecting some studies more than others, including lack of allocation concealment and analysis of completed assessments rather than by intention to treat per-protocol.

Evidence was lacking for long-term benefit of intensive-DLT for this population although there is weak evidence (grade B) that decongestive treatments, whether provided in the form of CDT or CB, with or without MLD, effectively reduces lymphedema in women with early BCRL (Table 8). This review could not determine the most effective combination of treatment elements to reduce EAV or improve patient-centred outcomes, or the optimum treatment duration for women with early BCRL.

Table 8 Summary of individual study findings for women with breast cancer–related lymphoedema (BCRL) symptom duration of less than 12 months

Discussion

This review provides the only systematic examination of evidence for treatment of women with < 12-month BCRL duration, with no relevant studies published since review searches were updated in September 2022. Therefore, there is no evidence to support change in clinical practice by UK practitioners who routinely offer women self-treatment with hosiery or by practitioners in other countries offering initial intensive-DLT as standard care. The paucity of eligible papers and lack of trials recruiting only women with early BCRL highlight a big gap in research addressing management of early BCRL.

Lack of evidence for superiority of any treatment for women with BCRL has been previously reported [24,25,26, 44]. While Rangon et al. [25] suggested their review findings favoured intensive-DLT over other multimodal therapies, they included a smaller selection of trials which mostly compared variations of intensive-DLT, adding, removing or replacing DLT components. Dayes et al. [23] reported an unexpected finding of no significant difference in outcomes for women randomised to two-phase DLT or self-treatment with hosiery which was contrary to findings of an earlier trial by Badger et al. [19] which favoured CB over self-treatment using hosiery. The reason for these different findings is unknown although participants in the earlier trial [19] had no previous treatment experience, whereas an unknown number of women in Dayes trial [23] had previously received two-phase DLT or worn hosiery.

Although Hwang et al. [54] were the only study not to report improvement with intensive-DLT, they considered it meaningful that BCRL severity had not progressed over the 24-month follow-up period. Several studies highlight the importance of applying effective treatment at an early stage and monitoring symptom severity [56,57,58,59]. Two prospective surveillance studies reported progression of BCRL symptoms for 16% of women despite using hosiery for self-treatment [56, 57], although Blom et al. [56] reported worsened swelling for 57% of those not wearing compression garments. Bar Ad et al. [9] reported increased number of women progressing from mild to more severe BCRL over a 5-year review period. However, prospective surveillance studies do not address the treatment of established BCRL, including early BCRL, and no study addressed whether intensive-DLT can more effectively prevent or limit progression of BCRL symptoms. Further, the consequences of not appropriately treating established BCRL at an early stage are not considered.

This review also highlighted the paucity of research monitoring outcomes of importance to women. Few studies reported outcomes other than arm volume, although available evidence highlights the importance of monitoring symptom severity, such as arm heaviness, tightness and shoulder range of movement, and not just size [60, 61]. Women’s views of the value of treatment benefit and satisfaction with treatment were considered by only one study in this review [48], although the importance or meaningfulness for women and practitioners of the statistically significant findings reported by some studies in this review is unknown. Variation in measurement and reporting methods contributed to the paucity of outcome data, with minimal clinically important change for objective and self-report measures yet to be agreed, including arm volume, symptoms and HRQOL [10, 24, 26].

A strength of this review is the inclusion of literature from many databases with no date or language restriction. However, although recent expansion of bibliographic databases has index-linked many journals publishing lymphoedema papers, some relevant literature may have been missed by not searching all databases in 2022. In future updates, further consideration should be given to including other databases which could source additional eligible papers, for example, Embase may provide access to European studies that are not index-linked by Medline/PubMed. The impact of using single screening is unknown although we minimised the risk of missing studies at screening of title and abstract through careful use of EndNote, rechecking categories (EJ) and supervision by a senior experienced reviewer (DB). Heterogeneity of interventions and assessment methods, and variations in quality of study design and reporting, limited comparison of results and determination of the most effective treatment elements and optimal duration of treatment. Further, study samples were small, and few studies had sufficient post-intervention follow-up to address the chronicity of BCRL and potential for post-intervention treatment failure; these factors have been reported by multiple reviews of evidence for treatment of women with any BCRL duration [24, 62, 63]. Other limitations of this review highlight the paucity of research for this group, such as exclusion of findings reporting non-standard measurement methods and inclusion of one study group with mean ≤ 9-month BCRL duration while excluding the other with > 12-month duration. The lack of consensus regarding a standardised format for reporting and to inform comparisons between different protocols has been reported elsewhere [26, 64].

Conclusion

The updated searches added four studies but highlight there is still no evidence to justify change to current practice for treatment of early BCRL, whether self-treatment with hosiery in the UK or two-phase DLT typically provided by other countries. As with the findings of the original report published in 2018 [35], there was weak evidence (grade B) [65] for the impact of DLT for women with early BCRL regardless of the combination of treatment components, and the heterogeneous treatment protocols did not permit conclusions to be drawn about optimal treatment components for initial BCRL treatment or the required duration to reduce EAV and improve patient-centred outcomes.

The findings of this review confirm that, four years after the original review [35], there is still a need for a definitive trial of treatment effectiveness for women with early BCRL. Researchers with an interest in this area need to design adequately powered studies which recruit only women with < 12-month BCRL duration and monitor treatment outcomes for at least one year and preferably five years or more. Collaborations should be encouraged between groups of lymphoedema researchers to create comparable treatment protocols, outcome measures and reporting methods. Further research is also required to determine measurement methods which are meaningful to women, including minimal clinically important changes for outcomes; the inclusion of women with BCRL in designing studies could help to identify priority outcomes and select appropriate patient reported outcome measures. The impact of intensive practitioner-delivered intensive-DLT and self-treatment with hosiery on progression of BCRL severity should also be addressed as prevention of symptom progression may be as important as improving symptoms.