Background

Breast cancer is the most common malignancy that develops in women worldwide, its incidence continues to rise and it is responsible for the highest cancer-associated death rates [1]. It is an extremely heterogeneous disease, made up of a number of different subtypes. Classification of breast cancer into subtypes can be partly attributed to the presence or absence of receptors for the hormones estrogen (ER), progesterone (PR) and human epidermal growth factor 2 (HER2). Triple negative breast cancer (TNBC) represents a particularly important clinical subtype, characterised by an absence of ER, PR and HER2 and which therefore lacks common targets used for anti-hormone therapies [2, 3]. Although TNBCs comprise only a small percentage of all breast cancers diagnosed (10-24%), they have been recently the subject of intense investigation because of their aggressive clinical behaviour. Patients who are diagnosed with TNBC are of younger age, tend to develop tumours of larger size, and have an increased likelihood of distant metastasis and death within 5 years of diagnosis [2, 3]. Thus, TNBCs represent a major clinical problem for which targeted therapies are currently not available.

microRNAs (miRNAs) are a class of small (~22 nucleotides) non-coding RNAs that control gene expression by targeting mRNAs and triggering either translational repression or RNA degradation [4]. miRNAs have emerged as an attractive candidate for molecular biomarkers and novel therapeutic targets in cancer because of their stability, ease of detection and their ability to act as endogenous antisense regulators of entire gene sets that regulate cancer growth [5].

Several studies have identified critical roles for miRNAs in breast cancer. Iorio and colleagues were the first to report significant deregulation in miRNA expression profiles in breast cancer when compared to normal breast tissue, where they showed that the expression of several miRNAs was associated with breast cancer subtypes and clinico-pathological features including hormone receptor status, clinical stage and proliferation index [6], suggesting that miRNA expression in breast cancer may have diagnostic and prognostic value. Since then, several miRNAs have been identified that have prognostic significance, including miR-210, miR-126, miR-21 and miR-205 [711]. However, the clinical cohorts used to generate these miRNA profiles have been highly heterogeneous, in regards to clinicopathological variables (e.g. grade, tumour size, subtype). Moreover, because these studies have concentrated on using miRNA profiles to better classify breast cancer subtypes, there are few studies that have profiled all miRNAs within a particular breast cancer subtype and even fewer that have used matched normal tissue as controls in these analyses; and so it is plausible that miRNAs important in disease initiation and progression that are subtype specific have been missed. TNBCs are distinct from ER-, PR- positive tumours at the molecular level and this classification has clinical implications [12]. Therefore, the identification of miRNAs that control TNBC initiation and progression could identify individuals that have more aggressive disease and may also help to identify subgroups of patients that are more responsive to particular treatments within these subtypes.

The dissemination of primary cancer cells to the lymphatic system represents one of the first signs of metastatic spread. In breast cancer, the number of positive lymph nodes (LN) is known to have an inverse linear correlation with prognosis and survival [13]. This is not the case with TNBC, where it has been shown that any LN involvement is associated with worse disease-free and overall survival [14]. Therefore, identification of miRNAs that are differentially expressed within LN metastases in this highly aggressive breast cancer subtype may serve as a better indicator of prognosis than miRNA profiles derived from primary breast cancers.

In this study, miRNA expression profiles of 31 primary TNBCs were examined and compared to the profiles of 23 matched normal breast tissues to reveal miRNAs that were differentially expressed in TNBC and that are potentially associated with its initiation. In addition, by comparing the miRNA expression profiles of lymph node positive primary tumours and matched LN metastases, we have identified a panel of miRNAs that are associated with metastatic capability in the TNBC subtype.

Methods

Study cohort and tissue sampling

Thirty-five formalin-fixed paraffin-embedded invasive ductal carcinomas (IDCs) were obtained from the archives of the Hunter Area Pathology Service, John Hunter Hospital, Newcastle, Australia. All patients were diagnosed with grade 3 IDC between the years of 2004–2009, and were negative for ER, PR and HER2 as assessed through routine diagnostic pathology (Additional file 1: Supplementary Methods and Figure S1). Areas of tissue representing histologically normal adjacent breast tissue (NAT, where available), IDC and breast cancer metastases (in LN) were identified and confirmed by a pathologist. Micrometastases (<2 mm) were not used in the analysis. None of the areas of selected NAT contained tumour tissue and all were enriched for terminal duct lobular units (>2 in biopsied area). Patient information is described in Table 1. A 1.5 mm punch biopsy was used to punch tissue cores from the paraffin blocks using haemotoxylin and eosin stained sections of the same sample for guidance. Tumour volume in the core biopsy was >70% of the total. This study complies with the Helsinki Declaration with ethical approval from the Hunter New England Human Research Ethics Committee (Approval number: 09/05/20/5.02). In accordance with the National Statement on Ethical Conduct in Research Involving Humans, a waiver of consent was granted for this study.

Table 1 Demographic data of triple negative breast cancer cases

Extraction of RNA

Total RNA was extracted using the miRNeasy FFPE kit (Qiagen, Doncaster, VIC, Australia). RNA was quantified using the Quant-it RiboGreen RNA Assay kit (Invitrogen, Mulgrave, VIC, Australia) and purity assessed by A260/A280 and A260/230 ratios (>1.8) using the Nanodrop. The RNA integrity of selected samples was analysed using the 2100 Bioanalyser and the RNA 6000 Nano kit (Agilent Technologies, Mulgrave, VIC, Australia).

miRNA arrays

100 ng of total RNA was dephosphorylated and directly labelled with Cy3 using the miRNA Complete Labelling and Hyb Kit (Agilent Technologies). Labelled RNA was hybridised to Human miRNA microarrays (Sanger Release 14.0) according to the manufacturers’ instructions (Agilent Technologies) and scanned on an Agilent High-resolution C scanner.

miRNA array analysis

These miRNA array results have been deposited in Gene Expression Omnibus (GEO) with Accession No. GSE38167. Data from 15,000 probe features representing 904 unique miRNAs was extracted using Agilent Feature Extraction software (v10.7.3.1) and converted to background subtracted signal intensities. The extracted data was imported into Genespring GX (Agilent Technologies) where it was log2 transformed and median normalised. Seven samples did not meet quality control measurements recommended by the manufacturer and were removed from the microarray analysis (1 IDC from a LN negative patient, 1 NAT and 3 IDCs from LN positive patients; and 2 LN metastases). Of all the probes interrogated in this analysis, those corresponding to 570 miRNA transcripts were present at a signal intensity threshold above background in at least one of the tissue samples. Unpaired t-tests were used to identify miRNAs with significantly altered expression (>2-fold, p < 0.05). To correct for false positive results, a Benjamini and Hochberg False Discovery Rate (FDR) of 5.0% was used for multiple testing. Supervised hierarchical cluster analysis was performed on miRNAs that were found to be significantly different (>2 fold, p < 0.05, FDR < 0.05). Similarity in the expression patterns between miRNAs was measured by Pearson’s correlation coefficient.

Biological targets of differentially expressed miRNAs were identified by searching for the presence of conserved 8mer and 7mer sites within genes that match the seed region of each miRNA. Non-conserved sites were also included in this analysis. This analysis was performed using sRNA Target Base (starBase, http://starbase.sysu.edu.cn/index.php) [15] which integrates data from 21 Ago or TNRC6 CLIP-Seq sequence data sets with the target prediction programs Target Scan, Pictar and miRanda. The number of genes identified by each of these programs and those in common between each of the programs is shown in Additional file 1: Figure S2. Only those target genes that were predicted by all three target prediction programs (560 genes) were used for further pathway analysis. PANTHER [16] was used to annotate the biological pathways that predicted miRNA target genes were involved in as previously described [17]. Pathways with a p-value <0.05 were considered to be significantly regulated by the miRNAs.

To determine the significance of differentially expressed miRNA families and clustered miRNAs; or to annotate the function of differentially expressed miRNAs, a freely available web-based resource, TAM (tool for annotations of miRNAs), was used [18].

Semi-quantitative real-time PCR

Total RNA (5 ng) was reverse transcribed to generate cDNA using the Taqman MicroRNA Reverse Transcription Kit and Megaplex RT Human Primer Pools Set v3.0 (Applied Biosystems, Mulgrave, VIC, Australia) according to the manufacturers’ instructions. cDNA was amplified using the TaqMan PreAmp Master Mix with Megaplex Human PreAmp Primer Pools Set v3.0 (Applied Biosystems) according to the manufacturers’ instructions. Real-time PCR analysis was performed in triplicate on all samples (Table 1) using TaqMan Universal PCR mix No AmpErase UNG and TaqMan MicroRNA Assays (Applied Biosystems) according to the manufacturers’ instructions, with results quantified on a 7500 real-time PCR system (Applied Biosystems). The expression of the following miRNAs was analysed for array validation: hsa-let-7a (Assay ID: 377), hsa-let-7b (Assay ID: 2619), hsa-let-7c (Assay ID: 379), hsa-miR-100 (Assay ID: 437), hsa-miR-101 (Assay ID: 2253), hsa-miR-126* (Assay ID:451), hsa-miR-26a (Assay ID: 405), hsa-miR-26b (Assay ID: 407), hsa-miR-130a (Assay ID: 454), hsa-miR-29c (Assay ID: 587), hsa-miR-205 (Assay ID: 509), miR-210 (Assay ID: 512), RNU44 (Assay ID: 1094) and RNU49 (Assay ID: 1005). The relative miRNA expression was calculated by normalising the miRNA of interest to RNU44 (2-ΔCt). Relative expression of miRNAs was also calculated using a second normaliser, RNU49 (data not shown). The same relative expression patterns for the miRs analysed was observed when normalised with RNU49 (examples shown in Additional file 1: Figure S3). The relative expression of RNU44 and RNU49 was not significantly different between the subgroups analysed and therefore served as an appropriate normaliser for this analysis (Additional file 1: Figure S3). All pre-amplified multiplex miRNA assays were validated against uniplex miRNA assays to verify that the multiplex reaction did not affect miRNA quantitation (Additional file 1: Supplementary Methods, Table S1 and Figure S4). hsa-miR-126* and hsa-miR-205 were outside our range of acceptable PCR efficiencies and were not used for further validations.

Statistical analysis

The normality of the data distribution was tested using a D’Agostino and Pearson Omnibus test. The values were found not to have been sampled from a Gaussian distribution and thus, non-parametric statistical tests were used to compare the real-time PCR data. A two-tailed Mann–Whitney U test was used to determine if there was a statistically significant difference in the expression of miRNAs between any two subgroups. The Kruskal-Wallis rank test followed by a Dunn’s Multiple Correction test was used to determine the statistical significance of miRNA expression between multiple (>2) subgroups. Analysis of the correlation between miRNA expression and clinical parameters was performed using Spearman’s correlation test. All analysis was performed using GraphPad Prism (version 5.04) software (GraphPad software Inc., La Jolla, CA, USA).

Results

miRNAs differentially expressed in TNBC

To identify miRNAs associated with TNBC, expression profiles were analysed in 35 grade 3 invasive ductal carcinomas (IDCs) compared to 24 matched normal adjacent tissue (NAT) specimens (Table 1). Seven samples did not meet quality control measurements recommended by the manufacturer and were removed from the microarray analysis (refer to Methods). Seventy-one of the miRNA transcripts were identified as being differentially expressed between IDC and NAT (Figure 1A). Supervised hierarchical clustering of these miRNAs clearly separated NAT from IDC, suggesting that the expression of these miRNAs can discriminate these two groups (Figure 1A). Five of the differentially expressed miRNAs (miR-210, miR-100, miR-130a, let-7b, let-7c) were verified by real-time PCR and were shown to be significantly different in expression between IDC and NAT in all cases (Figure 1B). In addition, the fold change in expression (IDC versus NAT) of the five miRNAs was highly concordant (R2 = 0.9543) between microarray and real-time PCR analysis, further confirming the validity of this approach (Figure 1C).

Figure 1
figure 1

miRNAs significantly different in triple negative breast cancer. A) Supervised hierarchical cluster analysis was performed on 71 miRNAs significantly altered between tumour (n = 31, red branches, top) and normal adjacent tissue samples (n = 23, blue branches, top). Similarity in the expression patterns between miRNAs (branches shown on the left-hand panel) and between samples (branches shown on the top panel) was measured using Pearsons correlation. Distances between clustered branches represent the average distances between miRNAs and samples in the cluster. The height of each branch represents the degree of similarity within the cluster. miRNAs are coloured according to their expression level, where up-regulated expression is represented by red, down-regulated expression is represented by blue, and equal expression is represented by yellow. B) Relative quantification of miR-210, miR-100, miR-130a, let-7b and let-7c by real-time RT-PCR in normal (n = 24) and tumour samples (n = 35). Results are shown as a scatter plot of the relative normalised expression (target/RNU44) of the target miRNA (2-ΔCt). Values represent the median ± interquartile range. *p < 0.0001, **p = 0.0004. C) Correlation between real-time PCR and miRNA array results. Relative quantification of fold change (IDC v NAT) in miR-210, miR-100, miR-130a, let-7b and let-7c between normal and tumour samples by real-time RT-PCR (2-ΔΔCt, y-axis) or miRNA array (x-axis).

Of the 71 significantly altered miRNA transcripts, 39 showed increased expression and 32 showed decreased expression in IDC when compared to NAT (Additional file 1: Table S2). miRNAs are known to be located in genomic clusters [19]. Several of the miRNAs that were identified as being differentially expressed are clustered within the same genomic region, and four of these genomic clusters were significantly over-represented (p < 0.05): the miR-17-92 cluster (miR-17, miR-18a, miR19a, miR-19b, miR-20a), the miR-106b cluster (miR-25, miR-93, miR-106b), the miR-200a cluster (miR-200a, miR-200b, miR-429) and the miR-106a cluster (miR-18b, miR-19b, miR-20b, miR-363) (Additional file 1: Table S3). miRNAs are commonly grouped in families based on the similarity in their seed sequence. Three miRNA families were significantly (p < 0.05) over-represented: the miR-17 family (miR-17, miR-18a, miR-18b, miR-20a, miR-20b, miR-93, miR-106b), the miR-200 family (miR-200a, miR-200b, miR-200c, miR-141, miR-429) and the miR-130 family (miR-130a, miR-130b, miR-301a) (Additional file 1: Table S3). The majority of the differentially expressed miRNAs have previously been implicated in breast cancer including miR-200a, miR-200b, miR-200c, miR-21, miR-210, miR-205 and miR-10b [7, 8, 10, 11, 20, 21]; and their regulation was concordant with previous studies. This suggests that the majority of the miRNAs identified in this study are relevant to the initiation of all breast cancers. In addition, we identified 5 miRNAs: miR-130a, miR-1280, miR-590-5p, miR-1308, miR-17*, which to the best of our knowledge, have not previously been implicated in breast cancer (Table 2). Of note, miR-130a has been shown to be associated with chemotherapy response in ovarian cancer and lung cancer cell lines, while miR-1280 has recently been demonstrated to inhibit invasion and metastasis by targeting ROCK1 when over-expressed in bladder cancer [2224]. miR-590-5p has been reported to enhance (via the tumour suppressor PBRM1) or inhibit (via S100A10) cell growth and invasion depending on the cellular context, however, the function of miR-1308 and miR-17* has not been extensively studied [25, 26].

Table 2 Unique miRNAs identified as being differentially expressed in triple negative breast cancer

miRNAs differentially expressed in LN negative IDCs compared to LN positive IDCs

To determine miRNAs that were differentially expressed in patients with LN metastases, the primary breast cancers from 15 LN negative patients and 16 LN positive patients were compared. In this analysis, there were no miRNAs that were found to be significantly different. Additionally, the comparison of NAT from node negative and node positive women yielded no significant differences in miRNA profiles.

miRNAs associated with LN metastasis

In contrast to the report of Cascione et al. we undertook an additional investigation examining the relationship between lymph node positivity and miRNA expression [27]. We determined the miRNA profiles of LN positive and LN negative IDCs compared to their matched NAT. From these analyses, 37 miRNAs were found to be altered in LN negative patients, while 46 miRNAs were found to be significantly different in LN positive patients (Figure 2, Additional file 1: Tables S4 and S5). Supervised hierarchical clustering of these miRNAs completely separated NAT from IDC into two distinct groups in LN negative patients (Figure 2A) and was able to distinguish the majority of LN positive IDCs (two IDCs were misclassified) from matched NAT (Figure 2B). Only 10 of the miRNAs that were identified as being differentially expressed in IDC versus NAT overlapped in LN negative and LN positive patients and the direction of their regulation (up, down) when compared to matched NAT, was concordant (Figure 2C, Additional file 1: Tables S4 and S5, miRNAs in bold). These included up-regulation of miR-21 and down-regulation of miR-10b; which have well known roles in breast cancer (Figure 2C; Additional file 1: Tables S4 and S5). Interestingly, the majority of miRNAs identified as being differentially expressed in LN negative patients were up-regulated (34/37 miRNAs), while the majority of differentially expressed miRNAs were down-regulated (35/46 miRNAs) in LN positive patients (Figure 2, Additional file 1: Tables S4 and S5). These results suggest that the repertoire of miRNAs whose expression is altered in the primary breast tumour when compared to the normal breast tissue is distinct in LN positive and negative patients.

Figure 2
figure 2

miRNAs significantly different in triple negative breast cancer with lymph node metastases. Supervised hierarchical cluster analysis was performed on miRNAs significantly altered between tumour (red branches, top) and normal adjacent tissue samples (blue branches, top) in A) lymph node negative and B) lymph node positive patients. Similarity in the expression patterns between miRNAs (branches shown on the left-hand panel) and between samples (branches shown on the top panel) was measured using Pearsons correlation. Distances between clustered branches represent the average distances between miRNAs and samples in the cluster. The height of each branch represents the degree of similarity within the cluster. miRNAs are coloured according to their expression level, where up-regulated expression is represented by red, down-regulated expression is represented by blue, and equal expression is represented by yellow. C) Venn diagram representing the overlap between miRNAs regulated in lymph node negative patients (37 miRNAs, left-hand circle), lymph node positive patients (46 miRNAs, right-hand circle) and in the lymph node metastases of lymph node positive patients (63 miRNAs, bottom circle). D) Histogram depicting the fold regulation of the 27 miRNAs (highlighted in red in C) in tumour compared to NAT in lymph node negative tumours (white), lymph node positive tumours (black) and lymph node metastases (grey). Two-fold regulation is depicted by the red line.

Of note, the five unique miRNAs that were differentially regulated when all tumours were compared to all normal tissues (Table 2), were not commonly regulated when lymph node positive and negative cases were compared separately to matched normal tissue. Instead, the miRs that were positively regulated (miR-17*- 3.03, miR-590-5p- 2.15, miR-1280- 2.10) when all tumours were analysed became more highly up-regulated (miR-17*- 4.66, miR-590-5p- 3.13, miR-1280- 2.40) in the comparison of lymph node negative tumours versus matched normal tissue, but were not regulated in lymph node positive tumours. In contrast, while miR-130a was down-regulated in the comparison of all tumour tissues (−2.21), it became more strongly down-regulated in lymph node positive tumours (−3.32) when they were compared to matched normal tissues, and was not regulated in lymph node negative cases. This supports our data, that there are intrinsic differences between lymph node positive and negative TNBCs: miRs are mainly down-regulated in patients with lymph node metastases, and up-regulated in patients who do not have lymph node metastases and suggests that it is these patients (i.e. lymph node positive or negative tumours) that are driving the differential regulation (negative and positive respectively) when all patients are combined.

To determine miRNAs that were altered in LN metastases, the miRNA profile of LN metastases were compared to those of NAT (9 matched and 5 unmatched) from LN positive patients. From this analysis, 63 miRNAs were found to be differentially expressed (Additional file 1: Table S6). The majority of these miRNAs (58/63 miRNAs) were down-regulated and largely overlapped with the profile of their primary tumour (IDC, LN+; 32/63 miRNAs, Figure 2C). Many of the miRNAs overlap with those identified by Cascione et al., with 33% (21/63) of the differentially expressed miRNAs identified by our study also showing differential regulation in normal versus LN metastases comparisons in their study of TNBC [27]. No miRNAs were differentially expressed when LN positive primary IDCs and LN metastases were compared. This suggests that the profile of miRNAs differentially expressed in primary breast cancers are maintained in LN metastases from these tumours. This is also concordant with the results of Cascione et al. who found very few differentially expressed miRNAs in this comparison [27]. In contrast, only five of the 63 miRNAs that were differentially expressed in LN metastases were also differentially expressed in LN negative primary IDC’s (Figure 2C) when compared to NAT, suggesting that their miRNA profiles are largely distinct.

We reasoned that miRNAs that are differentially expressed in LN metastases and in the primary IDC of patients with LN metastases, that are not differentially expressed in patients who did not have LN metastases, may represent some of the earliest changes associated with metastatic progression. The overlap of the miRNA profiles is shown in Figure 2C, where 27 miRNAs were shown to meet these criteria. These concordantly regulated miRNAs include three members of the let-7 family (let-7a, -7b, -7c), two members of the miR-26 family (miR-26a, -26b) and three members of the miR-199 family (miR-199a-3p, -199a-5p, -199b-5p), suggesting that these miRNAs regulate similar target genes (Table 3). Furthermore, the list also contains miRNAs located within the same genomic cluster as one another, the let7a cluster (let-7a-3, -7b; and let-7a2, miR-100), the miR-195 cluster (miR-195, miR-497) and the miR-199a cluster (miR-199a, miR-214), suggesting they are co-ordinately regulated. With the exception of miR-210 and miR-135b, all miRNAs were significantly down regulated in the primary IDC and LN metastases when compared to matched NAT, but were not significantly different in primary IDCs from LN negative patients (Figure 2D and Table 3). These results were verified by real-time PCR for nine of the miRNAs (Figure 3), where the statistical significance in the difference in expression of these miRNAs was largely confirmed, with the exception of miR-101 and miR-29c, where the difference in expression of these miRNAs between normal and tumour samples in LN positive patients did not reach statistical significance.

Figure 3
figure 3

Validation of selected miRNAs significantly different in triple negative breast cancer with lymph node metastases. Relative quantification of let-7b, miR-130a, miR-101, miR-26a, miR-100, miR-210, miR-26b, miR-29c and let-7a by real-time RT-PCR in normal and tumour samples from lymph node negative and lymph node positive patients. Results are shown as a box plot of the relative normalised expression (target/RNU44) of the target miRNA (2-ΔCt). Boxes represent the median ± interquartile range. *p < 0.05, **p < 0.01, ***p < 0.0005 and +p < 0.0001.

Table 3 miRNAs differentially expressed in triple negative breast cancer patients with lymph node metastases

Biological functions and pathways regulated by the 27 miRNAs associated with LN metastasis

To annotate the function of the 27 miRNAs, we performed TAM analysis [18]. Through this analysis, the majority of miRNAs were found to function as tumour suppressor miRs (13/27 miRNAs) or were involved in human embryonic stem cell regulation (11/27 miRNAs) and these functions were significantly over-represented. A number of miRNAs in this list are also known to have functions in cell death and/or the cell cycle and these functions were also significantly over-represented (Additional file 1: Table S7).

We next used starBase and PANTHER to predict the genes and signalling pathways regulated by these miRNAs. Ten pathways were predicted to be significantly regulated by these miRNAs and many of these have already been reported to be dysregulated in TNBCs including the p53, EGFR1, Wnt and the TGFβ signalling pathways (Table 4).

Table 4 Predicted signalling pathways regulated by 27 differentially expressed miRNAs in TNBCs with LN metastases

Correlation between miRNA expression and clinical parameters

To determine if the expression of miRNAs verified in this study by real-time PCR were independently related to age at diagnosis, tumour size or the percentage of positive LNs, we tested their association using Spearman’s Rank correlation (Table 5). None of the miRNAs tested were independently associated with age at diagnosis or tumour size. However, the expression of miR-101 was negatively correlated with the percentage of LNs positive while the negative correlation of let-7b and miR-29c with the percentage of LNs positive approached significance (Table 5).

Table 5 Correlation between miRNA expression levels and clinical variables in triple negative breast cancers

Discussion

This study has investigated miRNA profiles in TNBC cases to determine miRNAs whose expression is associated with the initiation of this breast cancer subtype and those associated with its metastasis to the LN. We have identified a panel of 27 miRNAs that are associated with breast cancer progression in TNBC.

miRNAs differentially expressed in TNBC

By comparing the miRNA profiles of primary breast tumours to matched normal adjacent breast tissue, 71 miRNAs were identified as being differentially expressed. The majority of the miRNAs identified in this analysis have previously been implicated in breast cancer, suggesting that these miRNAs have a general function in the tumourigenic process in all breast cancers and that they are not specific to TNBC. This is perhaps not surprising, given the weak separation of the TNBC subtype by hierarchical clustering in miRNA expression profiles when compared to other breast tumour subtypes [28]. The results of this study are highly concordant with that of Cascione et al., with 70% (50/71) of the differentially expressed miRNAs identified by our study also showing differential regulation in normal versus tumour comparisons in their study of TNBC [27]. Differentially expressed miRNAs included members of the miR-17-92 oncogenic cluster, the let-7 family and the miR-200 family which are known to be dysregulated in several solid tumours, including breast cancer [29, 30]. Of particular note, the miR-17-92 cluster was recently shown to be up-regulated in TNBC and the basal subtype [10, 31] while miR-145 and miR-199b-5p were strongly repressed [10] consistent with our results. miR-210 and miR-21, well known oncomiRs [7, 11] were found to be over-expressed in our study. We also observed that miRNAs known to be bona fide regulators of ERα (miR-18a, miR-18b) were over-expressed [32]; while those that have been previously reported to be differentially expressed between ER-positive and -negative breast cancers (including let-7b, miR-200a, miR-21, miR-25, miR-106b) were significantly altered in this study [32].

The immunohistochemical and molecular profiles of TNBCs are similar to that of hereditary breast cancers that have mutations in the BRCA1 gene [2]. In this regard, several miRNAs identified in this study are known to participate in the BRCA1 signalling axis including miR-146a, miR-155 and miR-335 [3335]. Deregulation of these miRNAs in TNBC may contribute to altered BRCA1 signalling and could partly explain the similarities of these tumours with those in which BRCA1 function is lost.

miRNAs involved in LN positive TNBC

A significant and important finding from this study is that the miRNA profiles of LN positive primary breast cancers were strikingly distinct from that of LN negative patients. In particular, there was an overall up-regulation of miRNAs in LN negative patients and a dampening of miRNA expression in LN positive patients, suggesting that oncogenic miRNAs are associated with the development of LN negative TNBC and in contrast, that deregulated expression of tumour suppressor miRs is involved in LN positive disease. Two enzymes, Drosha and Dicer, are pivotal in the processing of pri-miRNA into mature double stranded miRNA fragments [4]. Interestingly, in breast cancer, reduced expression of Dicer has been associated with shorter metastasis-free survival and with the TNBC subtype, where it is observed in 60-78% of patients [3638]. The correlation of Dicer expression with LN metastases in TNBC patients was not examined in these studies [36, 38]. It could be hypothesised that the overall down-regulation of miRNAs observed in LN positive TNBCs in the current study is a result of reduced Dicer expression, but this remains to be determined.

miRNAs as markers for metastasis

In this study, we were not seeking to identify miRNAs that were differentially expressed in the transition of breast cancer progression i.e. primary breast cancer to metastasis or from ductal carcinoma in situ (DCIS) to IDC, as previous studies have done [10, 27, 28, 39]. In contrast to other studies, our analysis yielded no discriminatory miRNAs when matched primary breast cancer and LN metastases were compared suggesting that they are remarkably similar in their miRNA profiles and supporting the validity of our approach. The premise of the current study was to determine altered miRNA expression patterns in primary breast cancers that were also present in LN metastases. We reasoned that deregulated expression of key miRNAs that promote a highly invasive phenotype would be an early event in breast cancer progression and that these changes would be present in both the primary and metastatic lesion. Using this approach, we identified a panel of 27 miRNAs that are associated with metastasis to the LN. Given that the majority of these miRNAs are down-regulated, their value as prognostic markers remains to be determined and is the subject of ongoing investigations. However, 17/27 (63%) miRNAs are either known to be tumour suppressor miRs and/or have been shown to play a role in human embryonic stem cell regulation (Additional file 1: Table S7), supporting that deregulated expression of this panel of miRNAs is likely to result in altered differentiation states and increased invasiveness. In this regard, let-7 family members, miR-34a and miR-205 are markedly reduced in breast tumour initiating cells and are involved in epithelial to mesenchymal transition [21, 40, 41]. In addition, several of the miRNAs in this list have already been shown to have prognostic value in breast cancer including miR-210, miR-126, miR-26a, miR-125b, miR-205 and miR-214 [710, 4244]. Furthermore, our results are in agreement with Cascione et al., who found that both miR-125b and miR-497 were down-regulated and related to survival in TNBC [27].

Within our panel of metastasis-related miRNAs are a number of miRNAs whose functional role in breast cancer progression has not previously been described. These include miR-320c, miR-29c, miR-130a and miR-195 among others. There are no reports on miR-320c function to date and thus its possible role in promoting LN metastasis in TNBC remains unknown. miR-29c has previously been reported to be up-regulated in a small cohort of breast cancer cases [11], however, we found miR-29c to be down-regulated and associated with LN metastasis. In agreement with our study, miR-29c is down-regulated in chronic lymphocytic leukaemia and a range of solid tumours including colorectal cancer where it has been reported to decrease with cancer progression and is a predictor of survival and early recurrence [45, 46]. miR-130a is down-regulated in a panel of ovarian cancer and lung cancer cell lines that are resistant to chemotherapies [23, 24]. Moreover, it has recently been shown to be decreased in prostate cancer and its over-expression in prostate cancer cell lines causes the repression of key oncogenic pathways such as the MAPK pathway [47]. miR-195 was markedly down-regulated in our study. Its re-expression in breast cancer cell lines has recently been reported to reduce cellular proliferation and invasion, suggesting that it plays a key role in breast cancer progression [48].

A recent study by Buffa et al., described three miRNAs (miR-342, miR-27b, miR-150) that were prognostic for relapse-free survival in TNBC [49]. In our hands, these results were not replicated for LN metastasis and moreover, these miRNAs were not differentially expressed when breast tumour was compared to normal tissue. In addition, miR-10b has been shown to be strongly expressed in metastatic breast cancer cells, where it regulates invasion and migration [50]. However, we observed miR-10b to be strongly down-regulated in lymph-node positive and negative primary breast cancers and in LN metastases, suggesting it does not contribute to breast cancer invasiveness, and this is in strong agreement with several recent studies [6, 10, 20].

We used starBASE to predict signalling pathways regulated by the 27 miRNAs associated with LN metastasis. Interestingly, the pathways identified by this analysis have been widely reported to be deregulated in TNBC. In particular Wnt/β-Catenin, EGFR1 and the TGFβ receptor signalling axis are all known to be highly expressed in a large proportion of cases within the TNBC subtype and have been proposed as alternative treatment targets for TNBC [2, 3, 51]. A number of chemotherapeutic agents which target these pathways are currently under investigation in patients with TNBC, in both the neoadjuvant and metastatic setting [52]. The findings from our study- decreased expression of key miRNAs which target these pathways- provides a biological mechanism for up-regulation of these pathways in this breast cancer subtype. Additionally, TP53, PIK3CA and EGFR are among the four most commonly mutated genes associated with aberrant expression of interacting genes in TNBC [53], exemplifying their importance in this subtype. The predicted regulation of these pathways by miRNAs, as found in our study, reveals another mechanism for disruption of these pathways in TNBC.

Study design

A disadvantage of our study design is that samples were acquired by punch biopsy, rather than isolating individual normal breast and tumour epithelial cells by laser capture microdissection. We recognise that tissue heterogeneity and contamination by other non-neoplastic cell types including adipocytes, stroma and lymphocytes was unavoidable and that it may have contributed to the miRNA profiles generated. Our approach of selecting miRNAs that were only differentially expressed in primary tumours and matched LN metastases and that were not altered in LN negative primary tumours would have culled a large proportion of those miRNAs that are specifically altered in inflammatory responses. Moreover, those miRNAs that were differentially expressed simply because of differences in cell type composition between normal tissue (higher content of adipocytes and stroma) when compared to tumour tissue (higher content of epithelium) would have shown overlapping miRNA profiles when both LN negative and LN positive primary tumours were compared to matched normal tissues (10 miRNAs, Figure 2C) and were not included in our list of miRNAs related to metastasis (Table 3). Therefore, because of this careful process of elimination, we consider the miRNAs identified in this study (Table 3) are strongly representative of the invasive capacity of the primary tumour.

Given that the sample size used in this study is relatively small, conclusions cannot be drawn about the relationship of these miRNAs to prognosis or their specificity to the TNBC subtype. However, the tumours are homogenous with regards to size, hormone receptor status and histological grade. Furthermore, the size of this cohort (n = 35) is in line with several recent studies investigating the prognostic value of miRNAs in TNBC (Buffa et al., n = 37; Farazi et al., n = 48; Radojocic et al., n = 49) [20, 28, 49] and represents the only study to date that has analysed the full repertoire of miRNAs in lymph node positive and negative TNBC samples separately compared to matched normal adjacent tissue. The uniqueness of our study design was highly advantageous for delineating critical differences in miRNA profiles between LN positive and negative primary tumours, which would have been missed if: 1) matched normal adjacent tissue was not used and 2) only primary tumours from LN positive versus LN negative patients were compared to one another. In a recent study, Farazi and colleagues showed that there was no difference in miRNA profiles of primary breast tumours between TNBC patients who relapsed when compared to those who did not [28]. Indeed, we also found that no miRNAs were differentially expressed when LN negative versus positive primary tumours were directly compared using our statistical cut-offs. Further to this, in our study no miRNAs were found to be differentially expressed when normal tissue from LN positive patients was compared to that of LN negative patients. As such, the distinct miRNA profiles generated when LN negative and LN positive primary tumours were each separately compared to matched normal adjacent tissues are likely to reflect small, but critical changes in the regulation of miRNAs that are essential for the development of each of these respective phenotypes. Given that any LN involvement (regardless of number) is associated with worse disease-free and overall survival in TNBC [14], identification of molecular determinants such as miRNAs, that contribute to this highly invasive phenotype is urgently needed.

Conclusions

This study has provided novel insight into the repertoire of miRNAs that contribute to the initiation of and progression to LN metastasis in TNBC. These miRNAs may serve as markers for metastasis or treatment targets in the future.