Background

Publicly funded welfare-type healthcare systems with a strong emphasis on equal access to healthcare are increasingly struggling with resource constraints. This requires a standardisation, with continuing re-evaluation, of what should be reimbursed and what should be rationed [1,2,3]. The nature of plastic surgery entails an element of subjectivity and studies have revealed that there is a variation in what is offered which could indicate a lack of evidence and unequal access [1, 4,5,6,7]. One procedure that has been debated, and where guidelines vary, is breast reduction due to breast hypertrophy [3, 5, 8,9,10].

Breast hypertrophy is a condition that may give rise to both physical and psychosocial symptoms, including muscle pain, such as back and shoulder pain, headache, postural changes, bra strap grooves, intertrigo, inability to participate in exercise and sports, sexual problems, bullying, body image problems, and problems with poorly fitting clothes [11]. Most of the symptoms described impair health-related quality of life (HRQoL). A breast reduction (reduction mammaplasty) is considered effective at reducing physical and psychosocial symptoms and improving HRQoL [12, 13]; however, similar effects are also achieved when a breast reduction is performed for cosmetic reasons and therefore it is difficult to distinguish which patients should be operated in the publicly funded healthcare system [14, 15]. Moreover, there is no standardisation regarding the assessment and prioritising of functional problems, such as back pain, compared to non-functional problems, for example suffering due to appearance [16].

There is no commonly accepted definition of breast hypertrophy and no consensus for when a breast reduction is indicated and should be reimbursed. Some guidelines define breast hypertrophy according to breast volume. For example, the current national Swedish guidelines [17] base the definition on anthropomorphic measurements of mean breast volume (405 ml, median 359 ml) in a population of randomly chosen women [18]. Hypertrophy is defined as at least twice the mean volume observed in the anthropomorphic measurement studies; that is, a volume of > 800 ml per breast. Previous Swedish studies, conducted before the guidelines were established, showed that many women who want a breast reduction have a volume of > 800 ml [19, 20]. Other definitions of breast hypertrophy include the Sacchini criteria [21] and bra size. The Sacchini criteria [21] are based on the mean measurement of the nipple to the inframammary fold distance and the nipple to the lateral border of the sternum distance. A mean distance of less than 9 cm is considered to indicate a small breast, 9–11 cm a normally sized breast and > 11 cm breast hypertrophy. When bra size is used, a cup D or larger is typically considered to indicate breast hypertrophy. In healthcare systems with third party payers, such as the United States system, insurance companies often assess the medical necessity for a breast reduction based on the amount of tissue that can be removed in a normal weight patient [9], usually according to the The Schnur Sliding Scale [10, 22].

The aim of this review was to examine the risks and benefits of breast reduction to treat breast hypertrophy. Specifically, the primary aims were to investigate whether breast reduction is better than no surgery, in women with symptomatic breast hypertrophy and a BMI of ≤ 35, regarding HRQoL, depressive symptoms, anxiety symptoms, sexuality-related outcomes, work ability, sick leave, physical function, pain, and patient experience, and whether it is safe. Secondary aims were to examine how the studies defined breast hypertrophy and which indications for a breast reduction were used.

Methods

Protocol

This is a systematic review and meta-analyses based on a Health Technology Assessment report [23]. The protocol was pre-registered on the webpage of The Regional Health Technology Assessment Centre of Region Västra Götaland, Sweden (HTA-centrum).

Eligibility criteria and study selection

Studies examining risks and benefits with breast reduction in breast hypertrophy were included. Included articles had to meet criteria defined in a PICO (population, intervention, comparison, and outcome) [24] (Table 1). A patient was included in the work group when the PICO was defined.

Table 1 PICO

Eligible study designs were randomised controlled trials (RCTs), non-randomised controlled studies comprising ≥ 100 patients, case series if ≥ 1000 patients were reported (only for complications), all case reports/series reporting deaths, and qualitative studies. All the authors independently assessed whether the articles met the inclusion criteria and disagreements were resolved by discussion.

Information sources and search

In June 2020 two medical librarians (authors AL, ME) together performed a search in MEDLINE All (Ovid), PubMed, Embase (Ovid), the Cochrane Library (Wiley) and APA PsycInfo (Ebsco), using controlled vocabulary (MeSH, Emtree) such as breast hyperplasia and breast reduction and relevant free-text terms. The searches for all databases were validated by discussion and are available in Additional file 1. Reference lists of relevant articles were scrutinised for additional references. The web sites of the Swedish Agency for Health Technology Assessment and Assessment of Social Services (SBU) and the Norwegian Institute of Public Health were visited. The search was limited to English, Swedish, Norwegian and Danish languages, to human studies and publications from January 1990 to June 2020. The search was limited to this time period, as a previous systematic review has demonstrated that there is no relevant literature from before this date [17]. All articles remaining after the initial selection were obtained in full text for assessment by the other authors (EWJ, SB, HH, CJ, LJ, MP, MS, FW, EH). All authors independently assessed all the full-text articles (EWJ, SB, ME, HH, CJ, LJ, AL, MP, MS, FW, EH). Final inclusion was determined by consensus.

Data collection process and data items

Data were extracted by one author and verified by another. Information collected included: first author, year of publication, study country, study design, study scope, number of patients and controls, dropouts, study groups, body mass index (BMI), age, tobacco use, definition of breast hypertrophy, resection weight and outcomes according to the PICO.

Statistical analysis

The results of each article were tabulated per outcome (Tables 2, 3, 4, 5, 6, 7). When possible, data were pooled and subjected to meta-analysis using Review Manager (RevMan) and the Metan-command in Stata version 16. Random effects model using the method of DerSimonian and Laird, with the estimate of heterogeneity being taken from the inverse-variance fixed-effect model, was used. When only median and range was reported in the original studies, median was used as a proxy for mean and range divided by 6 was used a proxy to SD since mean ± 3*SD covers about 99.7% of the population values. If SD was only reported for baseline this value was also used for follow-up. For most outcomes, meta-analysis was not possible due to heterogeneity in measures and follow-up time.

Table 2 Characteristics of included studies
Table 3 Health-related quality of life
Table 4 Depression and anxiety
Table 5 Sexually-related outcomes
Table 6 Physical function
Table 7 Pain

Risk of bias in individual studies and across studies

All included randomised and non-randomised controlled studies, as well as the qualitative study, were assessed regarding directness, risk of bias and precision, as described by the GRADE working group [25,26,27,28]. Checklists for assessing study quality, modified from the Swedish Agency for Health Technology Assessment and Assessment of Social Services (SBU) [29], were used. Certainty of evidence was assessed using the GRADE approach, as very low (GRADE ⊕), low (GRADE ⊕ ⊕), moderate (GRADE ⊕  ⊕ ⊕), and high (GRADE ⊕  ⊕  ⊕ ⊕) [30]. High quality is defined as ‘further research is very unlikely to change our confidence in the estimate of the effect’, moderate quality as ‘further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate’, low as ‘further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate’, and very low as ‘any estimate of effect is very uncertain’ [30].

Results

Study selection

The literature search identified 1355 articles after removal of duplicates. Of these, 1257 articles were excluded after screening of abstracts. Another 44 articles were excluded when they had been read in full text (Fig. 1). The 54 full-text articles left after this first selection were sent to all authors, and 15 articles were finally included in the review (Table 2). The excluded articles, with reasons for exclusion, are presented in Additional file 2.

Fig. 1
figure 1

PRISMA flow diagram

Study characteristics

Of the fifteen included articles, four were RCTs (reported in eight papers), three were non-randomised controlled studies, three were case series, and one was a qualitative study (Table 2). The majority of the included studies compared surgical intervention with no treatment (C1) and one study [31] with physiotherapy (C2, non-surgical treatment).

Risk of bias within and across studies

The RCTs had serious study limitations, indirectness, and/or imprecision. Methodological issues included unclear definition of breast hypertrophy, short follow-up, lack of blinding of patients or surgeons, control groups composed of patients waiting for a breast reduction, and a lack of inter-group comparisons. Effects were measured using validated patient-reported outcome measures. The non-randomised controlled studies had some study limitations in terms of poor evaluation of potential confounding, adherence, dropouts, and unclear definitions of breast hypertrophy. The qualitative study was assessed as being of moderate quality.

Mortality and complications

Mortality was reported in three case series (n = 104,565), all based on the same registry, NSQIP; thus, slightly overlapping. One death was reported in the population with BMI > 30, not included in this review [32]. Three RCTs, one non-randomised controlled study and three cases series, based on register data, reported surgical complications (Additional file 3). Reporting standards were heterogeneous, as complications were not predefined, and no information was given about when, how, or by whom they were diagnosed. The reported frequencies of major complications after breast reduction, such as venous thromboembolism [33, 34], varied from 2.4 to 14%, and frequencies of minor complications, such as surgical site infection and delayed wound healing, from 2.4 to 69%. Two of the included studies showed that increased BMI was a risk factor for complications [32, 33].

Health-related quality of life

Health-related quality of life was reported in three RCTs and two non-randomised controlled studies, using both generic (SF-6D, and SF-36) and disease-specific (BREAST-Q) questionnaires (Table 3). HRQoL was improved after breast reduction in all included studies, compared with no surgery. Meta-analyses (Figs. 2, 3, 4) performed for SF-6D and SF-36 scores, showed a weighted mean difference for SF-6D of 0.14 (95% CI 0.10–0.17) 6 months after surgery, implying a clinically relevant difference in HRQoL, compared with the previously suggested minimal important difference (MID) [35] (Table 3). In summary, breast reduction compared with no surgery may result in a clinically relevant improvement in HRQoL in women with breast hypertrophy (low certainty of evidence, GRADE ⊕ ⊕).

Fig. 2
figure 2

Meta-analysis of studies comparing reduction mammoplasty with no surgery using SF-6D (Health utility index score)

Fig. 3
figure 3

Meta-analysis of studies comparing reduction mammoplasty with no surgery, using SF-36 (Physical summary score)

Fig. 4
figure 4

Meta-analysis of studies comparing reduction mammoplasty with no surgery, using SF-36 (Mental summary score)

Depression and anxiety

Depressive symptoms were reported in three RCTs (n = 215) and symptoms of anxiety in two RCTs (n = 155), using different validated assessment tools and scores (Table 4). Postoperative (4–6 months) depressive symptom rates were consistently lower in women undergoing breast reduction compared with no treatment or physiotherapy. The postoperative anxiety symptoms were measured after four to six months and were significantly lower in women who had undergone breast reduction in both studies. In summary, breast reduction, compared with no surgery, may result in a clinically relevant reduction in depressive and anxiety symptoms, in women with breast hypertrophy (low certainty of evidence, GRADE ⊕ ⊕).

Sexuality-related outcomes

Sexuality-related outcomes were reported in one RCT and two non-randomised controlled studies (n = 262), using different instruments (Table 5). Sexual function, sexual well-being, and sexual quality of life were significantly improved after breast reduction compared with no surgery. In summary, sexuality-related outcomes may be significantly improved by breast reduction, compared with no surgical intervention (low certainty of evidence, GRADE ⊕ ⊕).

Work ability and sick leave

Work ability and sick leave were not reported in any of the included studies.

Physical function

Physical function after breast reduction compared with no surgery, was reported in two RCTs and two non-randomised controlled studies (n = 447) (Table 6). One RCT reported physical function in two papers [36, 37]. Statistically significant improvement in physical function after surgery was reported in the RCTs, with a follow-up time of 6 months. Significant intergroup improvement was reported in the non-randomised controlled studies regarding physical wellbeing, physical function and daily activities after surgery. In conclusion, it is uncertain whether breast reduction compared with no surgery affects physical function in women with breast hypertrophy (very low certainty of evidence, GRADE ⊕).

Pain

Three RCTs and one non-randomised controlled study (n = 420) reported pain (Table 7), measured with different instruments. Pain was significantly reduced in all studies. In summary, breast reduction compared with no surgery may result in a clinically relevant reduction of pain in women with breast hypertrophy (low certainty of evidence, GRADE ⊕ ⊕)..

Patient experiences of a breast reconstruction

One qualitative study including 50 patients was identified [38]. Most of the patients reported an increased physical activity after the operation and believed that the operation had changed their lives to the better. Nonetheless, a few patients reported a deterioration in self-image and quality of life and one patient expressed regret. Some patients were unsatisfied or distressed with the scarring (Additional file 4).

Definitions of breast hypertrophy and indications for a breast reduction

As regards definitions, three studies used the Sacchini criteria, two studies used bra cup size, and 10 studies did not report how they defined breast hypertrophy (Table 2). One study reported that a bra size of E or more in combination with ‘symptoms in the upper body associated with mammary hypertrophy’ constituted an indication for surgery [39]. None of the other studies specifically reported indications for a breast hypertrophy.

Discussion

The aim of this review was to examine the risks and benefits of breast reduction in women with breast hypertrophy, with an underlying focus on identifying specific indications for surgery in the public healthcare system.

Methodological limitations of the included studies

Several methodological limitations were identified in all included studies. Main issues included a lack of, or the use of non-validated, definitions of breast hypertrophy and of complications, a potentially biased control group, lack of blinding, a short follow-up, and insufficient reporting of inter-group results.

The main problem with the lack of definitions of breast hypertrophy and indications for breast reductions in the studies is that it is difficult to evaluate effects of treatment when the condition is not adequately defined. Moreover, not all of the studies reported the resected amount of breast tissue, further complicating the evaluation of the effects of the intervention in relation to the severity of breast hypertrophy. There are a number of limitations regarding the use of unvalidated breast measurements, such as bra size and the Sacchini criteria, that were used in the few studies [39,40,41,42,43] in this review that reported their definition. Firstly, there are no conclusive studies determining what volume/weight, in relation to body build, that gives rise to physical and/or psychosocial symptoms, and symptom relief does not seem to be correlated to the amount of tissue resected [22, 44]. Secondly, the relationship between breast volume and breast weight is not clear-cut as different breasts have different density. The ratio between adipose tissue and breast tissue varies according to genetics and hormonal status and breast tissue weighs more than adipose tissue. Thirdly, breast size measurements are uncertain [45, 46]. As regards, the use of cup sizes, they are not standardised; for example, one brand’s D cup might equal another brand’s C cup. The cup size is often based on the difference in breast circumference and rib cage circumference; that is, a difference of one inch (2.54 cm) constitutes an A cup, two inches a B cup, etc. and consequently the actual volume of the cup is substantially different depending on the circumference of the rib cage. Moreover, the model of the bra, for example if it covers the entire or only part of the breast, creates different ‘volumes’. Finally, there is a considerable difference in how women want their bra to fit; that is, women with identical breast volume might wear different bra sizes [47]. In brief, it is unclear which conditions have actually been treated in the included studies.

None of the included studies stated how complications were defined and whether they had been registered in a systematic and prospective fashion or not. Similar methodological problems have been seen previously in studies on breast reduction, where most studies only register surgical site complications in an undefined way leading to an underestimation of overall complication rates [48]. In one of few publications [48] on breast reduction where complications were classified according to a validated system, Clavien–Dindo, the complication frequency was 63%, albeit retrospectively registered. A prospective approach could give an even higher complication frequency. The most common type of complication (46%) was wound healing complications [48]. The study by Winter and associates [48] was not included in this review as the number of reported patients were 486, and the inclusion requirement of > 1000 patients for case series was therefore not met. In this review, the lower complication rates are from publications reporting figures from the NSQIP registry [32, 33, 49]. In the registry, wound complications are defined as ‘superficial infection, deep wound infection, deep or organ space infection, and wound dehiscence’ [49]. In Winter et al.’s study [48], the rate of such wound complications was 9%, and the rate of milder wound complications, not requiring an intervention, such as antibiotics or debridement, was 48%. Indeed, the studies included in this report with higher complication rates seem to have included all types of wound complications. Hence, complications are common but reported frequencies are dependent on how complications are defined and classified, explaining the wide range of frequencies observed in the present review.

In all included RCTs, patients who wanted a breast reduction were randomised either to breast reduction or to a waiting list for such surgery. Therefore, all the patients were likely biased towards a wish for a breast reduction and all the controls knew that they would receive a breast reduction eventually. It can be discussed whether such patients represent an adequate untreated control group. The practice also implies that neither the patients nor the surgeons were blinded.

Another issue limiting the directness of the results, is the short follow-up time in the included studies. According to basic plastic surgical principles, a final result can never be evaluated before at least a year has passed [50]. Most of the included studies had a follow-up time of less than one year, and therefore the measured effects might not represent the final outcome of surgery. Patients who are treated with surgery they have requested themselves, initially experience a positive effect of the surgery that might diminish over time [51]. Moreover, two of the four RCTs were conducted in the same country which might limit the generalisability of the results as cultural norms [52], and perceived need for breast reduction, might be different in other parts of the world.

Discussion of current evidence

Our review shows that complications are frequent after breast reduction. None of the included studies specifically reported the impact of different breast volumes on the effect and safety of breast reconstruction. However, the case series on complications clearly showed that a BMI equal to or higher than 30 increases the risk for complications by three-fold [32]. Moreover, the most serious complications, such as pulmonary embolism [34] and death [32], occurred in patients with a high BMI. Nonetheless, even though a high BMI clearly increases the risk for complications, there is no evidence to suggest where the exact BMI limit should be. None of the included studies specifically included an analysis of other risk factors for complications, such as smoking [53]. However, the high frequency of wound healing complications in all of the included studies could indicate that all risk factors for wound healing problems should be eliminated.

Regarding effects, breast reduction may improve HRQoL and may reduce depressive symptoms, anxiety symptoms, and pain, compared with no surgery. However, such effect can also be seen when a breast reduction is performed for aesthetic purposes [14, 15, 54]. In this context, it is unclear how such patient-reported outcomes improvement should be valued, and how patients suffering due to appearance related factors should be differentiated from patients with a mere preference for plastic surgery [16, 55]. Moreover, little is known about the long-term effect of plastic surgery on HRQoL, depression and anxiety [56]. In addition, some of the effects, such as the effect of breast reduction on depression, should be interpreted with caution, as the observed baseline values generally indicated no or mild depression [39, 41, 57]. A total of five health economic articles [40, 58,59,60,61] were identified in the literature search but only one [40] of them fulfilled the eligibility criteria for inclusion. The studies were all based on studies with small sample sizes, assessing QALY benefits by the intra-individual changes in HRQoL (i.e. lacking control group) and making the optimistic assumption that the HRQoL benefits would last the rest of the lifetime.

The effects seen in this review on HRQoL, depressive symptoms, and anxiety after breast reduction illustrate that breast hypertrophy gives rise to more symptoms than back pain and functional problems, which might indicate that a volume/size requirement is too crude a measure to decide which patients will benefit from a breast reduction and should be granted an operation in the public healthcare system. Moreover, there are no reports on which volumes/weights give rise to physical symptoms in relation to body build and other factors, further strengthening that a volume/size definition, on its own, seems inadequate to predict which patients benefit the most from a breast reduction. In brief, to create evidence-based guidelines for which patients should be granted a breast reduction in the public healthcare system, more studies are needed on the definition of breast hypertrophy and the health care need it gives rise to, as well as on the effect of treatment.

Conclusions

There are few studies and a lack of high-quality studies that evaluate the results of breast reduction and include a definition of breast hypertrophy. A breast reduction seems to have positive psychological and physical effects for women, but it is unclear which women benefit the most and which women should be offered a breast reduction in the public healthcare system. Currently, there is large variation in, and unequal access to publicly funded breast reduction. A number of priorities for further research have been identified:

  • A validated system for how breast hypertrophy should be defined, and which preoperative measurements, symptoms, and outcome variables should be reported in studies

  • Evaluation of which volumes/weights give rise to physical symptoms in relation to body build and other factors

  • Analysis of what healthcare needs breast hypertrophy gives rise to

  • A validated classification system for prospective registration of complications after breast reduction

  • Health economical evaluation of the cost-utility of breast reduction compared with no surgery

  • Further studies on women’s experiences of breast reduction.