Introduction

Breast conserving surgery (BCS) is a well-accepted standard of care for the treatment of both early stage breast cancer and preinvasive ductal carcinoma in situ (DCIS) [1]. Following initial randomized studies of BCS, efforts were made by pathologists studying mastectomy specimens to estimate the amount of occult breast cancer multifocality and thereby predict the likelihood of recurrence after BCS, as well as guide the extent of surgical resection margin needed to remove all invasive tumor cells. Despite initial reports of multifocality rates as high as 40–60 % [2, 3], definitions of multifocality restricted to residual tumor foci outside simulated quadrantectomies led to lower estimates of 28–38 % [4, 5]. Later mastectomy studies, eliminating more diffusely spreading invasive phenotypes from consideration (e.g., invasive lobular cancers) and further restricting primary tumors to those most eligible for BCS (stages Tis, T1–2), demonstrated that a 2-cm clear surgical margin would still leave behind occult tumor foci in up to 42 % of cases, a third of which being invasive tumor cell clusters and the remainder being occult noninvasive foci of DCIS [6].

Over the >20 years that have elapsed since the completion of six prospectively randomized trials showing that BCS followed by whole breast irradiation yielded overall survival rates comparable to mastectomy, studies of occult residual tumor burden following BCS have declined because of the impact of whole breast irradiation in significantly reducing but not eliminating potentially life-threatening loco-regional recurrences, the majority of which occur near the primary tumor site [7, 8]. However, technical, cosmetic, and economic issues still beg the question of how much normal breast tissue should be removed during BCS to achieve clean surgical margins and further minimize loco-regional recurrences [7, 8]. Moreover, with BCS, the current standard of care and an ever-growing need to extract normal host tissue DNA and RNA from the same small BCS specimen as control samples for the many high-resolution genetic and epigenetic assays now being performed on breast tumors [9]; there is renewed concern about the molecular normalcy of morphologically normal peritumor mammary tissue. Adding fuel to these clinical and technical questions are several newly recognized issues: (i) a negative margin of any width does not indicate the absence of residual occult breast tumor cells outside that margin [7]; (ii) among BCS eligible patients, the likelihood of multifocality and residual tumor volume elsewhere in the breast does not correlate with tumor size [6, 7], but is determined by tumor phenotype with triple-negative (ER-negative, PR-negative, HER2-negative) and HER2-positive breast cancers likeliest to recur [7, 10]; and (iii) “field cancerization,” characterized by genetic and epigenetic alterations in histologically normal-appearing tissue is common within 5 cm of a breast tumor margin and thought to provide a microenvironment conducive to new tumor formation [11, 12].

The term “field cancerization” was coined over 50 years ago [13], but more recently has been redefined to apply to a morphologically normal-appearing microenvironment surrounding various tumor types (e.g., breast, prostate, lung, head and neck, gastrointestinal, and skin cancers) that, when assayed by high-resolution molecular assays, exhibits a gradient of genetic and/or epigenetic aberrancies relative to more remote non-tumor-bearing areas in that organ [11, 12]. In 1996, Smith and coworkers [14] first demonstrated genetic loss of heterozygosity (LOH) in morphologically normal lobules adjacent to most breast cancers bearing that same LOH; and they postulated that lost tumor suppressor gene(s) at those loci characterized a breast tumor initiating genetic field defect. Later genetic studies subsequently showed that shortened telomeric DNA and four to five times more prevalent LOH loci characterized the field cancerization of normal-appearing tissues within 1 cm of microscopically defined tumor margins [12]. More recently, gene expression profiling of the peritumor breast microenvironment by different groups has revealed the presence of two distinct RNA expression subtypes [15, 16]: one “normal” microenvironment, independent of distance from tumor, and another “active” microenvironment, its presence dependent on distance from the tumor edge and independent of tumor phenotype yet associated with poorer patient prognosis, and its signature consistent with extracellular remodeling, wound healing, fibrosis, and epithelial-to-mesenchymal transition (EMT) [16]. Whether or not this epigenetic EMT-like phenotype within histologically normal cancer-adjacent mammary tissue also reflects any underlying genetic field cancerization effect remains unclear, as neither of those two gene expression studies evaluated genomic aberrations in the microdissected peritumor tissue, and the investigators concluded that this EMT-like phenotype was a host reaction unrelated to any specific breast cancer phenotype [15, 16].

In order to address the questions of residual breast tumor burden within the peritumor microenvironment and the genomic status of morphologically normal breast tissue surrounding aggressive breast cancers of the same phenotype, we performed a pilot study (Study 1, n = 12 cases) followed by a verification study (Study 2, n = 20 cases) on DNA samples isolated from HER2-positive breast tumors (T) and their normal-appearing mammary tissue (N), excised from 1 to 3 cm beyond the invasive tumor margin. High-resolution molecular inversion probe (MIP) arrays (20K array for Study 1, 300K array for Study 2) were used to examine differences in genome copy number gains and losses between the T/N pairs; such MIP arrays are capable of sensitive quantification of allele-specific copy number changes (ranging from 1 to 60 altered copies per locus) across the entire genome as well as accurate determination of allele-specific composition within the HER2 amplicon [1720]. We also reasoned that increased HER2 gene copies would be easier to detect than loss of normal gene copy numbers arising from occult tumor cells situated within a field of genomically normal tissue. For correlation with the 300K MIP array findings in Study 2, a more detailed microscopic re-evaluation of all N samples was performed by independent study pathologists blinded to the genomic data and employing histologic and immunohistochemical (IHC) assays to characterize the adjacent normal tissue composition bracketing the N sample cores extracted for DNA. While these retrospective studies produced microscopic evidence of extratumoral cancer cell foci in 20 % (4/20) of the N samples from the HER2-positive cases (Study 2), the 300K MIP arrays detected excess HER2 gene copy numbers along with other genomic aberrations consistent with those found in the adjacent HER2-positive tumors in 35 % (7/20) of the N samples, along with unrelated genomic aberrations in another 10 % (2/20) of samples.

Methods

Study samples

Under multi-institutional review board approval, all cryobanked (−80 °C without fixatives) tissue samples studied (32 sample pairs, average sample wet weight >100 mg) were received in two different dry ice batches (Study 1, archived before 2006; Study 2, archived before 2009) from the Stiftung Tumorbank Basel with each sample pair coded and delinked from all personal identifiers. All tumors were HER2-positive based on a well-validated quantitative immunoassay for HER2 protein expression performed in Basel [21], with only the Study 2 samples accompanied by limited annotation (age-at-diagnosis, ER and PR status, tumor size, nodal status), as summarized in Table 2, not including treatment or follow-up data. Swiss pathologists had microscopically confirmed that all the tumor samples contained >50 % invasive breast cancer cells, and that the normal-appearing adjacent mammary tissues had been excised minimally 1 cm away but no more than 3 cm away from the invasive tumor margin.

Sample processing for microscopy, DNA extraction, and MIP microarrays

Cryosectioning was performed on the central core of each sample (10 sections, 50 μm each) for DNA extraction, with flanking tissue used to generate at least five 10-μm frozen section slides for microscopic analysis and correlation with DNA studies. Frozen sections from the paired tumor and normal adjacent tissue samples were independently evaluated (blinded to DNA results) by pathologists (AL, FW) within the Clinical Pathology Laboratory of the University of California at San Francisco (UCSF) using hematoxylin and eosin staining (H&E, recording malignant and benign components including epithelial, fat, stromal, and muscle cell content) and IHC assay of epithelial HER2 expression by Hercep Test (DAKO, 0–3 scale), as previously described [22]. Core sections were DNA extracted as previously described for HER2 gene analysis by comparative genomic hybridization [23], using a standardized UCSF protocol involving proteinase K digestion, phenol/chloroform/isoamyl alcohol (PCI) extraction, and ammonium acetate/ethanol precipitation followed by TE resuspension and RNase A digestion (http://waldman.ucsf.edu/protocols/dna.frozen.htm), and then repeated extraction and precipitation. TE redissolved DNA frozen in 96-well plates (~200 ng DNA in 50 μl/well) was shipped to Affymetrix, Inc., Santa Clara, CA (Yuker Wang), where Study 1 sample pairs were analyzed on a 20K MIP array platform and Study 2 samples were later analyzed on a 300K MIP array platform, and all pre-processed data returned for further determination of allele-specific copy number aberrations (CNA) [1720].

Allelic copy number determination, genomic aberrations, and visualization tools

Pre-processed MIP array data for each sample were analyzed as previously described [19, 20] after first being segmented by a modified version of a circular binary segmentation (CBS) algorithm appropriate for single-nucleotide polymorphism (SNP) data [24, 25]. All programming scripts were performed using R (http://www.rproject.org/). Inputs into the modified algorithm were the A and B allele values on the copy number scale for every SNP (after removing those SNPs with call rate below 90 %). In the first round of segmentation, square root of the total copy number (sum of A and B alleles) was used to stabilize the variance; otherwise, the original copy number scale was used. In the second round, the absolute differences between the A and B alleles of all heterozygotes were segmented. The first segmentation was performed with a type I error of 0.009 and the second was performed with a type I error of 0.001, which maintained a typical CBS type I error of 0.01. Allele-specific means were estimated from the means of the total copy numbers and means of the differences between the A and B alleles of the heterozygotes, with the lower allele-specific mean referred to as the minor copy number and the upper allele-specific mean referred to as the major copy number. The minor copy numbers were combined across adjacent segments if t tests corresponding to the minimum SNP values were not significant by a cutoff p value of 0.05. The same was true of the major copy number and maximum SNP values. Heterozygotes were identified using an iterative binning method based on the minimum allele frequency across all SNPs. The bin between values 0 and 0.25 of size at least 0.05 that contained the lowest ratio of minimum allele values to bin size was first identified, then any SNP with a minimum allele value below the bin was estimated to be homozygous, while any above the bin was estimated to be heterozygous; SNPs in the bin were estimated to be homozygous or heterozygous in the same proportion as those outside the bin, and those with lower values estimated to be heterozygous. Allelic and total copy number differences and frequencies were analyzed using the Bioconductor R packages aCGH and DNAcopy [2628]. For some graphic purposes, segmentation files were normalized by log2 median centering each sample separately resulting in an estimated normal copy number at zero, with positive values indicating an increase in copy number and negative numbers indicating a decrease in copy number. Visualization was also performed with a custom drawing script in python using the matplotlib visualization library, with data represented in the box-plot summary view available on the UCSC Cancer Genomics Browser [29]. Visualization of the ERBB2/HER2 amplicon on chromosome 17 was performed using the Integrative Genomics Viewer (IGV) [30].

Results

In the pilot study (Study 1), genome-wide allele-specific copy number analysis was performed using 20K MIP arrays with 12 paired DNA samples extracted from cryobanked HER2-positive breast tumor samples and their adjacent normal tissue, as described in “Methods” section. The genome-wide total CNA determined from these lower resolution MIP arrays are summarized in Fig. 1, showing the CNA fraction gained or lost for each probe across all 12 tumors (a) and their adjacent normal tissue samples (b). This study was initially intended to document monoallelic amplification in breast tumors that had been scored as HER2-positive prior to archiving by quantitative immunoassay [21], and to look for evidence of genomically abnormal peritumor field defects based on earlier evidence of monoallelic HER2 transcription present in normal-appearing breast epithelium adjacent to HER2-positive tumors [23]. Individual genomic profiles confirmed the presence of a 17q12–21 HER2 amplicon in 10/12 (85 %) Study 1 tumors, only half of which appeared to show a monoallelic haplotype (i.e., single-parental origin). The minimal common region of this HER2 amplicon haplotype within these five tumors was ~400 kbp in size and consisted of the following known genes (centromere to telomere direction): CRKRS, NEURODL, PPP1R1B, STARD3, TCAP, PNMT, PERLD1, ERBB2/HER2, C17orf37, GRB7, and IKZF3. As well, all of these HER2-positive tumors appeared genomically consistent with those described in a larger high-resolution genomic study [31] of HER2-positive breast cancers by the following criteria: (i) near 40 % frequency of other chromosome 17q amplification peaks within 17q11–12, 17q21, and 17q23; (ii) >20 % frequency of other chromosome copy number gains/amplifications at 1q, 8q, 11q, and 20q loci; and (iii) near 20 % frequency of copy number losses in other chromosomes at 1p, 8p, 16q, 17p, and 18q loci. Most strikingly, however, was the detection of HER2 amplification in nearly 60 % of the matched adjacent normal samples in Study 1 (Fig. 1B), and the finding in 5/12 normals of the same monoallelic amplicon haplotype as found in their paired tumor samples. Uncertain whether these Study 1 observations signified the presence in these adjacent tissues of cancerization with normal-appearing epithelial cells displaying amplified HER2 but few other genomic aberrations, or the presence of a few contaminating malignant cells genomically similar to the nearby tumor, we launched Study 2 to verify these findings and better define the histologic nature of the normal-appearing tissue adjacent to HER2-positive breast cancers.

Fig. 1
figure 1

Frequency of significant allelic gain (green) or loss (red) based on 20K MIP array platform analysis of 12 paired Study 1 samples, plotted as a function of genomic location, with HER2-positive tumor frequencies shown above (A) and their adjacent normal breast tissue frequencies shown below (B). The 17q chromosome location of the HER2/ERBB2 amplicon is indicated above each panel. Vertical axis labels indicate fraction of samples with gained (0 to +1.0) or lost (0 to −1.0) allelic probes

Representative histologic (H&E) appearances of a HER2-positive tumor and a normal-appearing mammary tissue samples from Study 2 are shown in Fig. 2. Fully detailed characteristics for each of the 20 Study 2 tumors (age-at-diagnosis, tumor size and nodal status, ER/PR status, HER2 receptor expression, and HER2 gene copy number) are given in Table 1; as well, the corresponding adjacent normal tissue sample characteristics (histologic, IHC, MIP) are summarized in Table 2. Importantly, these HER2-positive tumors appear to be representative of those found clinically as 75 % were diagnosed at >50 years of age, half of them lacked hormone receptor expression (ER-neg, PR-neg), 60 % were node-negative, and 80 % presented as T1–2 lesions (Table 1). Also shown in Table 1, all Study 2 tumors contained HER2 amplicons (up to 918 kbp in size, from chromosome 17 positions 34179009–35097207) and MIP array estimated HER2 copy number gains averaging 13.9 allelic copies/tumor and ranging from as few as 3 to as many as 56.7 copies/tumor, well within the quantitative dynamic range of the 300K MIP array platform [1720]. Despite significant resolution (probe density) differences in the MIP arrays used in the two studies, a similar profile of genomic gains and losses across all chromosomes was apparent between the HER2-positive tumors in Studies 1 and 2. Likewise, when comparing the genome-wide log2 total copy number gains and losses between the 20 normal and tumor samples (summary box-plots in Fig. 3A), as seen with the Study 1 sample pairs, the Study 2 normals showed a striking genome-wide profile resemblance to the summary tumor profile despite the difference in their copy number scales (lower copy number values for normals than tumors), and consistent with the fact that 11 of the 20 normal samples (# 1, 3, 4, 5, 6, 7, 11, 12, 13, 14, 19) possessed perfectly normal allele-specific copy numbers across their entire genomes as would be expected of all genomically normal tissues (Supplementary Fig. 1). Thus, the summary profile of 20 normals showing allelic copy numbers that vary from 1 (log2 = 0, Fig. 3A) reflects the genomic gain and loss contributions from the aberrant genomes of only 9 of the 20 Study 2 normal samples (# 2, 8, 9, 10, 15, 16, 17, 18, 20), whose individual sample profiles are shown next to their tumor sample profiles in Fig. 3B. Of note, there was no statistical differences in age-at-diagnosis, tumor size, nodal involvement, hormone receptor (ER, PR) status, tumor HER2 expression level or estimated HER2 copy numbers between sample pairs with or without detectable genomic aberrations in their adjacent normal tissues.

Fig. 2
figure 2

Histologic (H&E) images from representative Study 2 cryosections of a HER2-positive cancer (A) and a normal-appearing cancer-adjacent sample (B). All cancer-adjacent samples were characterized for constituent histologic elements including fat, stroma, and epithelium as illustrated above and summarized in Table 2

Table 1 Summary of Study 2 tumor (T) characteristics including age-at-diagnosis, MIP array estimate of HER2 gene copy number, HER2 protein overexpression level by ELISA, estrogen and progesterone receptor (ER, PR) status, tumor size, stage (T1–4), and nodal (N0–1) status
Table 2 Summary of Study 2 adjacent normal (N) breast tissue constituents including fat, muscle, and either benign (normal or usual ductal hyperplasia, UDH) or malignant epithelium, as well as MIP array determined copy number normalcy or abnormality (CNA) and HER2 copy number gain and HER2 IHC assay (1–3+) of cancer (CA) cells identified in the N samples
Fig. 3
figure 3

Total and allele-specific copy number variations based on 300K MIP array platform analysis of paired Study 2 samples, plotted as a function of genomic location. A Summary box-plot of log2 copy number variations across all 20 normal samples and their matching 20 tumor samples. Red (increased copy number) and blue (decreased copy number) vertical lines indicate upper and lower quantile variations, colored by their median value; note different copy number scales for normal (−0.4 to +0.6) and tumor (−2 to +3) samples. Black dots indicate median values across all samples. B Genome-wide allele-specific copy number comparisons for the 9 HER2-positive tumor (T)–normal (N) sample pairs in which there was some genomic aberration detectable in the N sample, with 7 of these showing 17q12–21 HER2 copy number gains (# 2, 8, 9, 16, 17, 18, 20). Major (yellow) and minor (blue) allelic changes (scaled from 0 to 8 for all samples) are shown with estimated copy numbers >1.5 considered significant gains

Of the nine normal samples exhibiting some evidence of genomic aberration (Fig. 3B), only two (# 10, 15) showed minimal but detectable copy number changes elsewhere besides the HER2 locus and chromosome 17, suggesting the absence of contaminating tumor cells and the possible presence of a field defect or cancerization in these two samples. Of note, microscopic examination of these normal samples revealed the presence of only normal-appearing mammary epithelium in both, with one (#10) showing the unusual absence of other stromal, fat, or skeletal muscle components (Table 2) suggesting that its twofold monoallelic gain at the telomere of 11p was present in that morphologically normal-appearing mammary epithelium. The remaining seven normal samples (35 % of Study 2 normals) showed genomic changes at loci similarly involved by their matching tumor samples (Fig. 3B), including monoallelic HER2 copy number gains and a general pattern of 17q, but not 17p, aberrations similar to their adjacent tumors, as illustrated in Fig. 4 and consistent with the possibility of occult tumor cells somewhere within the adjacent normal-appearing tissue sample. While 60 % of all the normal samples contained some epithelial components, three of the seven genomically aberrant normal samples contained normal stroma and fat but no microscopically detectable benign or malignant epithelial components (Table 2), suggesting that the cryosections flanking the larger core sample extracted for DNA were not adequate to capture contaminating tumor cells likely present in the core sample. In contrast, contaminating tumor cells were confirmed in four of the seven genomically aberrant normal samples (# 2, 8, 18, 20) which contained occasional 25–200 μm clusters of malignant, HER2-positive (IHC +3) cells, as illustrated in Fig. 5, with or without additional normal-appearing, HER2-negative (IHC +1 or +2) epithelial cells elsewhere in the same normal sample (Table 2).

Fig. 4
figure 4

IGV display comparing Study 2 chromosome 17 gains/losses and HER2/ERBB2 amplicon gains in the 7 T/N sample pairs for which increased 17q and HER2 copy numbers were seen the N samples (top heatmap panel), and the 2 other sample pairs in which the N samples showed normal chromosome 17 copy numbers but aberrant genomes elsewhere (bottom heatmap panel). Heatmaps show segmented overall copy number variations with red intensities indicating regions of copy number gain, blue intensities indicating regions of copy number loss, and white indicating no copy number change. The HER2/ERBB2 locus is indicated by the red rectangle on the ideogram (~35 mb)

Fig. 5
figure 5

Histologic (H&E) and IHC (+3 HER2 overexpression) images of scattered, occult tumor cell clusters (25–200 μ) identified in four HER2-positive cancer-adjacent normal samples (# 2N, 8N, 18N, 20N)

Discussion

Long-term (13–20 years) clinical follow-up of randomized trials of BCS have revealed in breast recurrence rates ranging from 6 to 20 %, even when negative surgical margins were as wide as 1–2 cm; these and other studies have suggested that up to half of such local recurrences might actually represent new primary cancers arising in mammary tissue at risk for developing cancer [8]. However, with most surgeons currently accepting a 2 mm or less free margin width as sufficient for complete breast cancer excision [8], such recurrence rates are also consistent with earlier mastectomy studies showing occult tumor cells commonly residing more than 2 cm beyond the microscopic tumor edge, potentially capable of invasive growth and regional spread [6]. We chose to study morphologically normal-appearing mammary tissue adjacent to HER2-positive breast cancers in part because this aggressive tumor phenotype is molecularly less heterogeneous than other breast cancer phenotypes yet likeliest to recur [7, 10], and because an earlier study had suggested that its adjacent mammary epithelium might be predisposed to monoallelic HER2 amplification [23]. Therefore, given the high resolution and sensitivity of MIP arrays, we expected to find genetic evidence of either field cancerization or contaminating tumor cells in up to 20 % of normal-appearing mammary tissue samples excised from 1 to 3 cm outside the tumor margins of HER2-positive breast cancers. Surprisingly, 16/32 (50 %) Study 1 and Study 2 sample pairs produced some evidence of genomic aberration within the adjacent normal tissue samples with 14/32 (44 %) of these showing significant copy number gains at the HER2 locus.

The observed single-parental origin of monoallelic amplicon haplotypes shared by informative tumor–normal sample pairs, as well as the commonly gained loci elsewhere on 17q in adjacent normal samples concordant with characteristic 17q gains previously described in HER2-positive breast tumors [31], provided compelling support for a preliminary conclusion that contaminating tumor cells probably explained the observed genomic aberrations in these HER2-positive cancer-adjacent normal samples. On the other hand, two sample pairs in Study 2 contained aberrations in their normal tissues that did not mirror their adjacent tumor genomes, suggesting possible field cancerization effects [11, 12], although such genetic field effects have not been described previously in association with HER2-positive breast cancers. Therefore, additional microscopic study of the multiple frozen sections flanking the DNA-extracted core normal samples from Study 2, blinded to the genomic results, was considered essential for histologic insight into the source of the observed genomic aberrations, understanding that absent microscopic evidence of contaminating tumor cells in these sections would not constitute proof of tumor cell absence in the larger DNA-extracted core samples. Indeed, upon histologic and genomic correlation, 4/7 normal samples with MIP array evidence of amplified HER2 gene copies also demonstrated isolated microscopic clusters of HER2-positive cancer cells easily missed depending on the number of frozen sections cut and evaluated. Sections flanking the other 3/7 normal samples showing obvious genomic aberrations and increased HER2 gene copies revealed normal mammary fat and stromal elements but no evidence of benign or malignant epithelium, leading to the presumption that occult tumor cell foci must have been confined to the larger DNA-extracted core sample. Importantly, of the 11 adjacent samples with entirely normal genomic profiles, 7 showed normal epithelial elements and 3 of these also contained some usual ductal hyperplasia (UDH), indicating that their normal genomic profiles were not likely false-negative findings due to the absence of adjacent breast epithelium. Notably, those three normals with UDH did not show any evidence of either increased HER2 gene copy number or HER2 overexpression by IHC (Table 2), suggesting that the UDH did not represent precursor lesions related to the adjacent HER2-positive invasive tumors. Finally, sections from the two normal samples with aberrant genomes but lacking increased HER2 gene copy numbers and appearing genomically dissimilar to their matching tumors, contained histologically normal epithelial elements, consistent with prior reports of field cancerization occurring in microscopically normal-appearing breast epithelium [11, 12, 1416].

In sum, these findings suggest that using high-resolution genomic profiling, genetic aberrations commonly exist within morphologically normal breast tissue located well beyond what is commonly considered a clear surgical margin for BCS even for aggressive HER2-positive breast cancers, although the majority (55 %) of our Study 2 sample pairs revealed histologically and genetically normal tissue adjacent to the HER2-positive tumors. The genomic profiles and histologic study of our aberrant normal samples support the conclusion that tumor cell contamination rather than genetic field cancerization represents the likeliest explanation for clinically observed local recurrence rates up to 20 % following BCS despite post-operative breast radiotherapy. The observed frequency of genomically aberrant adjacent normal tissue in our two studies was consistent with earlier mastectomy studies showing frequent presence of isolated malignant cells well beyond tumor margins; and, as with many of these clinical studies, we found no correlation between our aberrant adjacent tissue genomic findings with other clinical characteristics of these HER2-positive tumors including age-at-diagnosis, size, nodal involvement, or hormone receptor expression. Importantly, this study did not address possibly common epigenetic field cancerization effects conducive to de novo tumorigenesis that would require DNA methylation, microRNA, and mRNA profiling for full characterization. As well, despite many recent molecular genetic approaches, there remain significant statistical challenges in trying to demonstrate the clonal relatedness of tumors arising in the same patient at distant sites or at different points in time [3234]; therefore, even ignoring the technical limitation of characterizing minimal tumor DNA within a pool of normal germline DNA, we cannot absolutely conclude that the genomic aberrancies in our normal tissue samples equate to contaminating tumor cells derived from the adjacent primary tumors. In the future, it is possible that deep exome or whole genome sequencing of adjacent normal tissue samples will be able to better detect and differentiate genomic aberrations such as cancer-driving versus precursor mutations; and such studies are now underway within The Cancer Genome Analysis (TCGA) network [9]. The TCGA program, in particular, recognizes the growing need to employ surgically accessible normal organ tissues for in depth and accurate genomic profiling of breast and other cancers; and the present comparison of HER2-positive tumor and normal sample pairs indicates that great caution is needed in assuming the genomic normalcy of histologically benign appearing tissue within 1–3 cm of the invasive tumor margin.