Background

Androgenic alopecia (AGA) is characterized by nonscarring progressive reduction in the diameter, pigmentation, density, and length of hair from frontotemporal and/or vertex regions of the scalp in a distinctive pattern [1]. It is a type of androgen-dependent hair loss disorder that affects approximately 50% of men in their 50s, [2] and its prevalence and extent tends to increase with age [3]. A vital alteration to the hair cycle for AGA is that the duration of the anagen phase decreases stepwise while that of the telogen phase increases, [4] but the pathogenesis of this disorder, as yet, is not fully understood. However, it has been fairly well established that androgens, mainly dihydrotestosterone (DHT) as major regulators, have critical effects on human androgen-sensitive hair follicles and may inhibit follicles on certain areas of scalp in genetically susceptible individuals and therefore causing AGA [5]. Testosterone, the principal androgen circulating in males, is converted to DHT by 5-αreductase. It has been reported that 5α-reductase inhibitors, such as finasteride, have increased scalp hair [6]. In addition, it has also been determined that heritability plays a critical role in the miniaturization of hair follicles leading to a change in the hair cycle [7, 8].

A high prevalence of abdominal obesity, hypertension, and lower high-density lipoprotein was found in patients affected by AGA [9]. Moreover, numerous studies have shown that AGA is implicated in an increased risk of metabolic syndrome [10] and coronary heart disease [11]. AGA is not simply recognized as an abnormal benign symptom with psychological effects and cosmetic impacts, but it has been represented as a forbear for future potential chronic diseases and therefore deserves attention. In consideration of similar pathophysiological mechanisms in terms of age dependency, genetic predisposition, and hormone dependency [12, 13], researchers have hypothesized a link between AGA and cancer risk, especially hormone-related cancer. Since cancer remains a worldwide public health issue that threatens human life, early diagnosis such as tumor screening could be an effective way to reduce the incidence of cancer. Yet, there are still some obstacles that hinder the wide application of tumor screening, for example, the lack of pertinence and the substantial consumption of healthcare resources. In this sense, it is beneficial to specify the high-risk groups to be screened so that it can reduce the costs, and that is why AGA is now being considered as a vital sign.

For the past two decades, a considerable number of publications have studied the relationship between AGA and hormone-related cancer, but they haven’t yielded consistent results. Amoretti et al. [14] conducted a meta-analysis to reveal the relationship between AGA and prostate cancer in 2013, but there have been several published studies that involved more kinds of cancers. As a result, we conducted a systematic review and a comprehensive meta-analysis in order to further investigate the issue and identify potential sources of heterogeneity that might be confounders that have affected some existing conclusions.

Methods

Search strategy and selection criteria

We searched for all eligible publications that evaluated AGA and cancer risk in Embase and PubMed up to June 2016.The combined search strategy employed the terms androgenic alopecia, alopecia, baldness, bald or balding in combination with cancer, tumor, or neoplasm in combination with risk, incidence, or mortality. No language or country filters were imposed. The details of the searching terms were listed in (Additional file 1: Table S1). The selection criteria were as follows: (1) the studied participants were exposed to AGA, and there was no gender limitation; (2) the study evaluated the incidence or mortality of prostate cancer and testicular germ cell tumor; (3) the papers provided relative risks (RRs), including cancer-specific hazard ratios (HRs) or odds ratios (ORs) in combination with 95% confidence intervals (CIs), or provided related data could calculate RRs; and (4) the study design was unrestricted. If multiple publications reported overlapping data or the same data, the one with greater size or more information would be chosen. Reviews, comments, letters, notes, abstract and repeated literature case reports were all excluded.

Data extraction and quality assessment

Two reviewers (L.S and M.D) independently retrieved the information from all eligible records. The methodological quality was assessed by two authors (W.L and L.S) using the Newcastle-Ottawa Scale (NOS) with 0–3 scores defined as low quality, 4–6 scores as moderate quality, and 7–9 scores as high quality. For each study, the participant and study characteristics, number of subjects, type of controls, study design, follow-up time, type of cancer, means of AGA assessment, method of case confirmation, cohort/control selection, and AGA categories were extracted and transformed into the specially designed forms. Disagreements or uncertainties were resolved by the reviewers’ re-verification of the data. If an agreement was not still reached, an additional adjudicator (S.D) was invited into the discussion.

Data synthesis and statistical analysis

If available, multivariate-adjusted risk estimates were used for each study; otherwise, unadjusted RRs, which were calculated according to exposure distributions given in the papers, were utilized. When a study only provided risk estimates from the comparison of a subset, i.e. a specific category of hair pattern such as frontal/vertex baldness with the group of no baldness, the alternative estimates were synthesized as the summarized estimates for overall exposure. If risk estimates were presented as RRs or ORs, combined estimates were generated using the method proposed by Hamling et al. [15], and if they were presented as HRs, a fixed-effect model was conducted.

Considering potential interactions, studies were stratified by the different designs of the studies with the effects estimated by HRs for cohort studies or ORs for case-control studies. As the ORs for case-control studies approximate the RRs in cohort studies with low incidence and effects estimated generally approaching 1.0, the combination of ORs and RRs was permitted [16]. According to the results of inter-study heterogeneity appraisal using χ2-based Q statistics and I2 for statistical significance of heterogeneity, pooled ORs and HRs with 95% CI were calculated using a fixed-effect model (Mantel-Haenszel method) or random-effect model (DerSimonian-Laird method). A P value of Q statistics > 0.10 and I2 < 50% indicated little heterogeneity. Sensitivity analysis was performed to assess the influence of a single trial in the meta-analysis estimated by sequential omission of individual trials. Publication bias was assessed with a funnel plot and the Egger regression. An overall meta-analysis was carried out in all included studies, and then a specific type of cancer with over two articles was included as the subtype that was performed for further meta-analysis. Prespecified subgroup analyses were performed by, reference age (20, 30, 40, or 45) for baldness assessment, pattern of baldness (frontal, vertex, vertex with frontal, vertex with/without frontal, and frontal with vertex), amount of baldness [1st stage: I; 2nd stage: II, IIa; 3rd -4th stage: III, IIIa, III-vertex, IV, Iva; 5th–7th stage: V, Va, VI, VII which were measured by Hamilton-Norwood scale (see Additional file 2: Figure S1)], type of control, baldness assessment type (by self-reporting and by trained observers) and age of the case.

A two-sided P < 0.05 was considered statistically significant. The STATA version 12.0 (StataCorp LP, College Station, Texas, USA) was used for statistical analysis.

Results

Eligible studies

The literature retrieval identified 1562 records in Pubmed and 4476 in Embase. Ultimately, 20 publications [17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35] were finally selected for the study; the flow diagram was presented in Fig. 1. The results of methodological quality assessment indicated that all included records were of high quality (More details were shown in Additional file 3: Table S2 and Table S3). All data generated or analysed during this study are included in this published article [and its Additional file 4: Excel S1].

Fig. 1
figure 1

Flow diagram of included studies for this meta-analysis

Baseline characteristics

In total, six cohort studies [17, 18, 20, 23, 24, 27] and 14 case-control studies [19, 21, 22, 25, 26, 28,29,30,31,32,33,34,35,36] were included in the present systematic review and meta-analysis. Among these, two cohort studies [17, 18, 37] were conducted on PC-specific mortality. In the case-control studies, ten studies addressed PC [21, 22, 26, 28, 30,31,32, 34,35,36], and four addressed TGCT [19, 25, 29, 33]. PC and TGCT were performed as the subtype for further meta-analysis. The baseline characteristics were shown in Table 1.

Table 1 Characteristics of included studies: androgenic alopecia and cancer risk

Systematic review and meta-analysis

The pooled results indicated that any kind of baldness had no influence on the PC-specific mortality in a random-effect model, with an HR = 1.07 (95% CI 0.43–2.64) and moderated heterogeneity (I2 = 69.2; P = 0.071). Twenty observational studies were involved in assessing the relationship between AGA and cancer incidence, including 6 cohort studies [17, 18, 20, 23, 24, 27] and 14 case-control studies [19, 21, 22, 25, 26, 28,29,30,31,32,33,34,35,36]. The cohort study of Hawk et al. [27] with the effects estimated by RRs was combined with case-control studies in the pooled analyses. A negative association was shown when all studies were pooled, with an HR = 0.99 (95% CI 0.91–1.09). Depending on the particular method of baldness assessment, the studies were separated into two subsets. The first subset included 10 studies [20, 22, 26, 27, 29, 31,32,33,34,35] that assessed baldness by self-reporting, and the second subset included 3 [21, 25, 28] studies where it was assessed by trained observers. No association was consistent in the two subsets. A summary of the results was presented in (Additional file 5: Table S4).

Analysis of TGCT

Four case-control studies [19, 25, 29, 33] that assessed the influence of AGA on the incidence of TGCT showed a negative association (OR = 0.69; 95% CI 0.58–0.83; P < 0.001) with little evidence of heterogeneity (Fig. 2a). When the studies were stratified by histological subtypes, the results were consistent and more strongly evident in nonseminoma. When the studies were stratified by different degree of baldness on the basis of the Hamilton-Norwood scale, hair loss at 2nd stage was negatively correlated with TGCT risk compared to that at 1ST stage (OR = 0.46; 95% CI 0.30–0.72; P = 0.001). A summary of the results was presented in Table 2.

Fig. 2
figure 2

Forest plots of any AGA and the risk of PC and TGCT incidence for case-control studies: a for TGCT incidence; b for PC incidence; and c high grade PC incidence

Table 2 Meta-analysis results of association between AGA and incidence of testicular germ cell tumor

Case-control studies & analysis of PC

The pooled OR for the 11 studies [21, 22, 26,27,28, 30,31,32, 34,35,36] that assessed the association between any AGA and the risk of PC was 1.13 (0.96–1.32) (Fig. 2b). However, when studies on PC stratified by different grades/stages [22, 28, 34, 35] were combined, a significantly increased risk was observed in high grade PC (OR = 1.42; 95% CI 1.02–1.99; P = 0.038) (Fig. 2c). The effect of size demonstrated a moderate statistical heterogeneity among studies of PC. When studies of PC were stratified by grades, no evidence of heterogeneity was observed. All the meta-analysis results of association between AGA and incidence of PC are listed in Table 3.

Table 3 Meta-analysis results of association between AGA and incidence of prostate cancer

Different patterns of baldness

Based on the Hamilton-Norwood Scale, the 8 study subsets [21, 22, 26, 27, 30, 32, 33, 35] and the 4 study subsets [25, 28, 31, 34] were categorized for male pattern baldness, among which 3 variables (none or little baldness, frontal baldness and vertex with/without frontal baldness) were used for the first subset and 4 variables (none or little baldness, frontal, vertex without frontal baldness, and frontal with vertex) for the second subset. Vertex with/without frontal baldness was associated with PC (OR = 1.29; 95% Cl 1.03–1.61; P = 0.029).

Different reference age

A total of two studies evaluated AGA at reference age 20 [31, 34], five at age 30 [22, 32, 34,35,36], and four at age 40 [31, 34,35,36], respectively. No association was established for participants with AGA at ages 20, 30, and 40, and these analyses yielded moderate heterogeneity.

Different age of case

Three studies [22, 28, 32] calculated the age-stratified association. AGA was not associated with the risk of cancer incidence in either younger men (< 60 years of age) or older men (≥ 60 years of age).

Different types of controls

As controls, five studies [28, 31, 32, 34, 35] selected PC-free participants, two [21, 30] selected prostatic hyperplasia (BPH) participants, and the others selected cancer-free or benign disease participants. No significant relationship was found between AGA and PC risk when the study was stratified by the different types of controls.

Cohort studies & analysis of PC

When four cohort studies [17, 18, 23, 24] were combined, the pooled results revealed that AGA had no relationship with prostate cancer in a random-effect model, with an HR = 0.99(0.94–1.05) and moderated heterogeneity (I2 = < 0.1%). In other subgroup analysis, the results were consistent.

Sensitivity analysis and publication bias

Sensitivity analysis confirmed that no individual study influenced the overall results (data not shown). There was no evidence of publication bias in this meta-analysis indicated by the Begg’s funnel plot and Egger’s tests (Fig. 3).

Fig. 3
figure 3

Begg’s funnel plots for publication bias of the relative risk of cancer incidence: a for all included studies; b for case-control studies; and c for cohorts

Discussion

This systematic review and meta-analysis involved a total of 10,935 cases from 21 observational studies. Amoretti et al. [14] conducted a meta-analysis to examine the association between AGA and the incidence of PC, but more studies have been published to indicate its relation to other kinds of cancer. Besides, this meta-analysis also evaluated the relationship between AGA and the mortality of cancer. Moreover, several subgroup analyses were conducted to evaluate whether the association varied by subtype of PC or TGCT, reference age for baldness assessment, pattern of baldness, type of control, and age of the case, and to minimize potential confounding biases.

Cases of AGA demonstrated a 31% decrease in the incidence of TGCT compared to those with no baldness. The sample size was not sufficient as there were only four studies included in the subset but little heterogeneity was shown. AGA may be regarded as a surrogate of cumulative androgen status which was considered as the result of its components such as exogenous hormones, endogenous hormonal levels, hormonal metabolism, or individual sensitivity to hormones. TGCT is significantly different from PC at the onset age. High risk for TGCT often occurs at ages 20–45 while it occurs at later ages for PC. It has been reported that circulating testosterone, DHT, and E2 declined gradually during male aging [38]. Thus, age differences both in the incidence of different kinds of cancer and the androgen status account for a complex interplay of these four components. In addition, androgen receptor and androgen metabolic pathway genetic variation studies showed that Ser312Asn polymorphism of the luteinizing hormone receptor was linked to an increased relative risk of PC [39] and a decreased relative risk of TGCT [40]. This suggested that androgen expression and metabolism might have tissue-specific effects. Moreover, testosterone and DHT appeared to be tissue-specific as well [41]; DHT played a vital role primarily in prostate and scalp, while testosterone appeared to function in spermatogenesis, muscle, and bone. These findings suggested that a complex interaction may partly explain the reason why AGA, as a surrogate of high androgen, has been implicated in different cancer but has shown conflicting results. AGA might reflect androgen status more directly and be less affected by other factors, thus being inversely related to tumorigenesis in TGCT. In histologic specific analyses on TGCT, the pooled results indicated that AGA exposure was significantly inversely related to the risks of developing both seminoma and nonseminoma. This relationship was more predominant among nonseminoma with less heterogeneity. Given that nonseminoma is often diagnosed at ages 25–29 on average, which is 10 years earlier than seminoma [42], the risk of TGCT decreased due to hormonal related factors during this time, and they may be more relevant to nonseminoma. Also, it was possible that these differences were caused by an insufficient sample size and inadequate statistical power for stratified analysis.

For 11 of the included studies, the association of AGA with PC was not evident in the overall population, which was consistent with the previous study [42]. Some authors speculated that the earlier onset of AGA, compared with its later onset, is a risk factor for developing PC. However, all the pooled results at different AGA time points (20, 30, 40, and 45) were negative. Yet, the interference of another potential source of bias, such as inadequate sample size, recall and selection bias, and study design, could not be ruled out. But, the present results could not support this interference. Also, the pooled results for baldness assessment type, age of case, and control selection were thought to be major factors. Coincident with the overall results, no subset analyses could establish a link between AGA and the incidence of PC. Thus, it is plausible that the overall results are robust. In addition, moderate inter-study heterogeneity was found to be consistent in case-control studies which suggested that there might be some other confounders or bias that accounted for inter-study heterogeneity. Several studies [22, 28, 34, 35] categorized the cases of PC into two grades according to Gleason scores, but the cut-off scores were not congruent. A negative result was achieved when included studies only focused on high grade PC with Gleason scores of 7–10. It was interesting to note that the result turned out to be positive when a study on high grade PC with Gleason scores of 8–10 was included. However, the pooled results showed AGA was not linked to aggressive PC (defined as Gleason scores of 7–10, regional/distant metastases, or fatal prostate cancer) without a difference in any baldness pattern in the cohort studies. The Gleason score was regarded as a good indicator of PC aggressiveness. Gleason 7 was divided into Gleason 4 + 3 and 3 + 4. Pathologically advanced PC and poor prognosis were more common in the first of the two. In our present meta-analysis, however, an agreement on the exact grade of PC with Gleason score 7 was not reached. A new Gleason grading system was proposed wherein Gleason scores ≤6 were lumped into prognostic grade group I, the score of 3 + 4 = 7 into group II, the score of 4 + 3 = 7 into group III, the score of 4 + 4 = 8 into group IV, and scores of 9–10 into group V [43, 44]. Thus, whether the less aggressive Gleason 7 cases account for the negative results or not needed to be confirmed and a new Gleason grading system should be applied in further studies.

When it came to different categories of baldness, the results showed that the vertex pattern with/without frontal baldness was related to PC risk, but the frontal pattern was not. On the other side, baldness at 2nd stage which amounted to frontal baldness was related to TGCT risk, while the other categories were not. As a result, a dose-response relationship could not be obtained. If these relationships were real, they may be possibly explained by non-linear correlations between the degree of AGA and the circulating androgen status. It was difficult to explain how the results turned to be negative in the PC, when vertex without frontal baldness and vertex with frontal baldness were regarded as independent patterns in the pooled studies. Also, no specific pattern of AGA showed any link to PC risk in the cohort studies. The pathophysiological difference among the patterns of AGA was yet unknown, so a reasonable explanation could not be provided. In addition, the reference age of AGA was different in the included studies; however, Muller et al. [20] found that vertex AGA at age 40 was not associated with the risk of PC; at age 55, the vertex AGA group had a higher hazard of prostate cancer; at ages 60–70, the HR was not discernible from 1; and at age 75, the hazard of prostate cancer was lower. This indicated that the association between AGA and PC was by nature age-variant and could not be sufficiently described by a single, age-invariant estimate of relative risk. Thus, the association might be masked by inconsistent AGA assessment age. Moreover, finasteride is a type II 5-αreductase inhibitor and is commonly used to treat AGA, [37] which could theoretically decrease the incidence of PC [45]. Another important point was that numerous epidemiologic studies have shown that AGA was associated with cardiovascular disease [46]. However, most of these studies lacked information on the connection of comorbidity and the use of finasteride. Therefore, this connection was potentially a major source of bias that could influence the final results, and the positive results should be interpreted with caution.

In this meta-analysis, only two of the included studies examined the association of AGA and PC-specific mortality, which showed that inner-study heterogeneity was moderate. Hence, more research work was needed to confirm these findings.

Nevertheless, several limitations of this study must be acknowledged. First, despite several subgroup analyses to be performed, significant heterogeneity was generally observed. Given the differences of the studies in race, age, participants’ lifestyle, information collection method, sample size, duration of follow-up and so on, heterogeneity was not avoidable. Second, the number of stratified analyses was so limited that might result in invalid statistical analyses in those groups. Third, although most of the studies used multivariate statistical models to calculate the estimated RRs, the number and content of the adjusted confounders varied in each trial, which might lead to imprecision in the results. But, the most multivariable adjusted-effect estimates were chosen for analysis to minimize the confounding biases. Besides, several sources of bias, such as inherent limitations, unmeasured confounding, and the typical bias of observational studies, could have affected the observed results. Therefore, well-designed and more comprehensive studies are still needed to further evaluate the relationship between AGA and the risk of cancers.

Conclusions

The results support the hypothesis that AGA is associated with a reduction of TGCT incidence by altering testicular development and they also suggest that there is a common pathogenic pathway. For PC, positive results are only observed in vertex patterns of AGA and high grade PC, while the viewpoint that AGA can be used as a phenotypic marker for PC risk is poorly supported. In this sense, future studies should be conducted to confirm the conclusions, as well as to evaluate the potential value of this association, which may offer a reference for establishment of predictive models.