1 Introduction

Accumulation of genetic/genomic aberrations are thought to lead to a multi-step process of cancer genesis-progression, in which the functions of genes controlling the cell cycle, cell death, chromosomal stability, angiogenesis, invasion and metastasis become aberrant [13]. Epidemiologic and experimental evidences indicate that most oral cancers originate and progress with the contribution of carcinogen exposure, mainly from tobacco smoking [4, 5]. Recently, using parallel sequencing technology and a lung cell line, a total of 22.910 somatic substitutions were identified, including 134 in coding exons [6]. More than 60 chemicals derived from tobacco smoking that bind and mutate DNA within the oral mucosa cells may contribute, in particular, to create an extensive “field cancerization” [79]. Specific gene mutations are also thought to contribute to inhibit DNA repair and to increase DNA damage within the entire mucosa and, in particular, to generate chromosomal instability and aneuploidy resulting in an increased risk of developing oral cancer. The theory of “field cancerization” [7] and its more recent genetic explanation [8, 9] have generated a very attractive genetic progression model of oral cancer [10]. A critical step in this model, however, is the conversion of a patch, in which stem cells share genetic/genomic aberrations, into an expanding field with numerous aberrations, which sometimes becomes visible as leukoplakias [1115]. Along the concepts of this model, many studies have compared several types of DNA aberrations within samples from oral cancer (mainly, oral squamous cell carcinomas, OSCCs) and of oral tissue in the vicinity of OSCCs. These studies include investigations on loss of heterozygosity [9, 16, 17] and detection of molecular and cellular aberrations by in situ hybridization [18, 19], immunohistochemistry [18, 19], multiplex ligation-dependent probe amplification [17], DNA image [17] and flow cytometry [20].

Oral cancer was also characterized by different chromosomal aberrations located in specific chromosomal loci including 1p36.3, 3q26, 7p, 8q, 9p, 11q13, 16p, 16q, 17q, 20q [2127]. Recently, we have investigated the presence of DNA aneuploid sublines by hr DNA-FCM within ODFs, OPMLs and OSCCs, suggesting a model of DNA aneuploidization [20, 28]. In the present study, we aimed to investigate visually normal appearing and histologically non-dysplatic mucosa fields (ODFs) distant with respect to non-dysplastic and dysplastic OPMLs (nd- and d-OPMLs).

Our experimental approach was based on hr DNA-FCM to detect DNA aneuploid sublines. DNA near-diploid aneuploid sublines with a DI difference between 2 and 4% with respect to DNA diploid control cells could be detected [20]. A DNA-FCM-Sorter was then used to physically separate the epithelial DNA aneuploid sublines in order to extract specific DNA to submit to a genome-wide a-CGH analysis providing DNA copy number aberrations (CNAs). These combined techniques may potentially provide new evidence for the conversion of a patch (represented by the ODFs) into an expanding field, which became visible as leukoplakias (nd-OPMLs and d-OPMLs). The first aim was to address whether the ODFs and the OPMLs shared specific chromosomal changes. The second aim was to compare chromosomal aberrations with DNA aneuploidy to eventually identify genomic regions underlying the development of DNA aneuploid sublines.

2 Materials and methods

2.1 Patients and histological assessment

Patients with OPMLs were recruited in two different medical centers: the Oral Medicine and Oral Oncology Section of the University of Turin at the A.O.U. S. Luigi Gonzaga (Orbassano-Turin) and at the Department of Otolaryngology, “S. Martino Hospital” in Genoa. Patient written consent was obtained in every case according to the Institutional Ethic Committees (A.S.O. S. Luigi Gonzaga Prot. N. 11780 and S. Martino Hospital Prot. N. 1084) during an interview in which their tobacco and alcohol habits were recorded.

Nineteen patients (9 males and 10 females, with a mean age of 57.9 years, range 32–76 years) with nd- and d-OPMLs, gave their informed consent and were included in this study. The patients were also interviewed as to their tobacco habit and the results were recorded: 8 patients were non-smokers (NS) and 11 were smokers (S). Tissue samples were obtained from both the OPMLs and from distal corresponding ODFs in 18 out of 19 patients. The only exception was the ODF of the patient 16, which allowed the sampling of healthy tissue only from the buccal mucosa and not from the tongue, which was the subsite affected by the OPML. Sampling was performed by means of incisional biopsy and/or micro-biopsy as previously detailed [29], taking care to cause slight bleeding to ensure that the basal layers of the epithelium had been collected. Part of the sample was formalin-fixed to assess the presence of dysplasia by means of routine haematoxylin and eosin stained slides. Additional multiple samples were immediately frozen and stored at −20°C for later measurements by hr DNA-FCM and a-CGH. The diagnosis of OPMLs (all leukoplakias) was based on internationally accepted criteria with levels of diagnostic certainty C3-C4 [30], while the histological diagnosis for the assessment of the presence of dysplasia was carried out by a specially trained pathologist, according to the WHO guidelines [31]. The anatomical subsites, which were in common in 18 out of 19 cases for both ODFs/OPMLs, were for the vast majority of the cases buccal mucosa. ODFs were not always equally distant from the corresponding OPML lesions. In some cases, as for example, when the OPMLs were on a cheek, the ODF samples were obtained from the opposite cheek. This was an optimal situation, which identifies the ODF position as “controlateral” or “mirror” with respect to the OPML position. On the contrary, when the lesion was on the anterior floor of the mouth, ODFs could be too close to the lesion. Therefore, OPMLs close to the midline were not included in the present study. The anatomical subsites of the ODFs/OPMLs are reported in Table 1.

Table 1 Patient’s and sample characteristics with DI values by hr DNA-FCM and CNAs by a-CGH for 19 ODFs and 19 OPMLs

2.2 Flow cytometric analysis and sorting

Multiple tissue samples were processed to obtain DAPI stained nuclei suspensions following the method of Otto et al. [32] with modifications. Hr DNA-FCM of these samples was done as previously described in details using a CyflowML cytometer (Partec GmbH, Muenster, Germany) to obtain DNA content histograms and to evaluate the DNA Index (DI) [33]. DNA diploid controls were represented by gender-specific human lymphocytes. The mean CV of the corresponding DNA diploid G0-G1 peaks was 1.2 ± 0.2%. DNA near-diploid aneuploid sublines with a DI difference between 2 and 4% with respect to DNA diploid control cells could be detected [20].The DNA aneuploid sublines (DI ≠ 1) were sorted using a Cyflow Space FCM equipped with a PPCS unit (Partec GmbH) at the purity of about 99%.

2.3 DNA extraction

DNA extraction was performed using ArchivePure DNA kit (5-Prime, Hamburg, Germany) with some modifications: after proteins precipitation the DNA was purified by phenol-chloroform extraction and collected by ethanol precipitation using 20 μg of glycogen (20 mg/ml) as carrier. DNA quality was assessed by agarose gel and ND-1000 Spectrophotometer (NanoDrop Technologies, Wilmington, DE).

2.4 Analysis by a-CGH

The a-CGH analysis was carried out using oligonucleotide Human Genome CGH 105 K array platform (Agilent Technologies, Santa Clara, CA). A pool of normal male or female DNA (Promega, Madison, WI) was used as reference DNA. Briefly, 50 ng of genomic DNA were amplified by GenomePlex Whole Genome Amplification Kit WGA2 (Sigma-Aldrich, St. Louis, MO). When the total amount of DNA was lower than 50 ng the amplification was performed by Ovation WGA System (NuGEN, San Carlos, CA, USA). For each array, amplified test and reference DNAs (2 μg each) were labeled by Bioprime Labelling Kit (Invitrogen, Paisley, UK). Slides were hybridized and washed according to manufacture’s recommendations. Spot fluorescence was measured by the Agilent G2565BA Scanner and images were processed by Feature Extraction software (version 9.5.3.1) (Agilent Technologies). Data files were analyzed by Agilent CGH Analytics Software (version 3.5.14) according to ADM-2 algorithm. The quality of each experiment was assessed by QC metric tool. Oligonucleotides were mapped according to the human genome build NCBI35/Hg17 (May 2004). Raw data were uploaded in Gene Expression Omnibus: (http://www.ncbi.nlm.nih.gov/projects/geo/index.cgi, accession number: GSE28906). We have used commercially available pooled normal DNA from healthy donors. We excluded those copy number variations (CNVs) that were listed as large segment copy number polymorphisms by Genomic Variants Database (http://projects.tcag.ca/variation).

Several tests were also performed to exclude that a-CGH results could be affected by WGA, DAPI staining and the DNA-FCM-Sorting (data not shown).

2.5 Statistical analysis

Data collection, management and analyses were done using Microsoft Office Excel and the SPSS 16.0 software package (Apache Software Foundation, Chicago, IL). The association among variables in contingency tables was evaluated with the Fisher exact test (two-sided) or the chi-squared test. A p-value ≤ 0.05 was taken as statistically significant.

3 Results

Table 1 shows the main characteristics of the patients (gender, age and tobacco smoking habit), the subsites, the histology of the ODFs and OPMLs and the corresponding DI values by DNA-FCM and CNAs by a-CGH. On the basis of the histological diagnosis, the group of OPMLs in the present series included 11 leukoplakias without dysplasia (nd-OPMLs) and 8 dysplastic leukoplakias (d-OPMLs). Dysplasia was never detected in the corresponding 19 ODFs. The frequencies of DNA aneuploidy in ODFs and OPMLs were 5.3% and 32% respectively. The DI aneuploid values among the ODFs and the nd-OPMLs were in all cases near-diploid (DI ≠ 1 and DI ≤ 1.4), while in the d-OPMLs were high-aneuploid (DI > 1.4) in 40% of the cases (Table 1).

Copy number aberrations (CNAs) were observed in 11/19 OPMLs (58%) and in 7/19 ODFs (37%; Table 2). In 12 out of 19 ODFs (63%) and in 8/19 OPMLs (42%) no aberrations were found. In particular, 6 ODFs (31.6%) and the corresponding lesions showed a flat profile. ODFs and the corresponding OPMLs had specific CNAs in common in 5 out of 19 cases (29%). Two ODFs out of 19 (11%) had, instead, unique CNAs that were absent in the corresponding nd-OPMLs (Table1). The frequency of DNA aneuploidy, which increased from 5.3% for the ODFs to 32% for the OPMLs, was paralleled by an increase in the average number of CNAs from 1.9 to 6.47 (Table 2). Moreover, the presence of dysplasia among the OPMLs was associated with an increased average number of CNAs (data not shown).

Table 2 DNA ploidy and CNAs among ODFs and OPMLs

It is of note that the CNA-gains were more frequent than the CNA-losses among both the ODFs and the OPMLs. In particular, the frequency of CNA-losses among the OPMLs was 32% versus 0% among the ODFs (p = 0.02, Fisher exact test; Table 3). It is also of note that the presence of DNA aneuploidy among all samples was concomitant with an higher number of CNAs, both gains and losses. We also observed that, while segmental chromosomal aberrations (defined as CNAs within a chromosomal segment) were present in both the DNA diploid and aneuploid sublines, numerical chromosomal aberrations (gain/loss of entire chromosomes) were seen only in presence of DNA aneuploidy (Table 1).

Table 3 Relative number of ODFs and OPMLs with CNA-gains and CNA-losses

Table 4 reports the CNA-gains of 6 minimal common regions with the corresponding segment sizes and number of genes. The most frequent aberrant common region was the gain of 20q13.33-qter, observed in 7/19 cases (37%) for both the ODFs and the OPMLs. This aberrant region was more commonly detected in the d-OPMLs (62.5%) than in the nd-OPMLs (18%) (Table1). It is also of note that the frequency of 7p22.2-pter gain was slightly lower in the ODFs (21%) that in the OPMLs (32%). Similarly, the gain of 8q24.3-qter was 16% for the ODFs and 37% for the OPMLs. A cluster of 3 aberrations, the CNA-gains of the 7p22.2-pter, 8q24.2-qter and 20q13.33-qter regions, was noted simultaneously in 16% of the ODFs, in 18% of the nd-OPMLs and in 50% of the d-OPMLs.

Table 4 Detailed overview of minimal common region CNA-gains in ODFs and OPMLs

In the present series of 19 OPMLs, 8 were characterized by the presence of dysplasia (d-OPMLs) and of unique types of aberrations, which were absent or at very low frequency in all the other samples. Table 5 reports these aberrations, their start-end positions, segment sizes and the corresponding number of genes: CNA-gains in 5p13.33-pter, 16q24.2-qter and 20q13.31–q13.33 and a CNA-loss in 9p21.3. The corresponding frequencies among d-OPMLs were respectively 50%, 37.5%, 62.5% and 50%.

Table 5 Unique types of CNAs present in the d-OPMLs that were absent or at very low frequency in all the other samples

In the present series of 19 ODFs and 19 OPMLs, 10 different aberrant chromosomal regions have been detected. Table 6 presents the 2 × 2 contingency table tests of association between these aberrant chromosomal regions (CNAs with 9 gains and 1 loss) and the DNA ploidy status (31 DNA diploid and 7 aneuploid samples). The association was statistically significant for the chromosomal regions 5p13.33-pter (p = 0.0005), 20q13.31–q13.33 (p = 0.01) and 9p21.3 (p = 0.00004). It is of note that the 20q gain, which extends beyond 20q13.33 up to 20qter and was commonly present among the ODFs/OPMLs (14/38 = 37%) was only slightly associated with DNA aneuploidy (p = 0.077).

Table 6 Loss at 9p21.3, gain at 5p13.33-pter and gain of 20q13.31–q13.33 were associated with DNA aneuploidy

4 Discussion

The concept of field cancerization in oral cancer has been widely described and discussed [710] suggesting that premalignant changes may occur in any region of the oral mucosa exposed to carcinogens. According to Dakubo et al., the field cancerization represents a “process whereby cells in a particular tissue or organ are transformed, such that genetically altered but histologically normal appearing cells predate the development of neoplasia or coexist with malignant cells, irrespective of clonality” [34].

More than 60 chemicals derived from tobacco smoking that may bind and mutate a large number of genes [6] may contribute to create such an extensive oral field of cancerization. Along these concepts, a very attractive genetic progression model of oral cancer was proposed [10]. A critical step in this model, however, appears the transition from an early precursor to an expanding field with numerous aberrations, which sometimes becomes visible as leukoplakias or oral potentially malignant lesions (OPMLs) [1115].

In recent years, the powerful technique of Comparative Genomic Hybridization (metaphase-CGH and, more lately, array-CGH) has provided a genome-wide analysis of DNA copy number aberrations (CNAs) suggesting that CNA gains and losses may result respectively in hyper- and hypo-expression of specific genes [35]. These studies, which mainly analyzed oral squamous cell carcinomas (OSCCs) and more rarely OPMLs, have identified several aberrant chromosomal loci including 1p36.3, 3q26, 5p, 7p, 8q, 9p, 11q13, 16p, 16q, 17q, 20q and have contributed to a better understanding of the development of oral cancer and of its clinical implications [1115, 2127].

In the present study, we demonstrated that the vast majority of these aberrant chromosomal regions were already present in our series of OPMLs and corresponding non-dysplastic visually normal mucosa oral fields (ODFs). In particular, some specific CNAs that were present at high frequencies (between 40 and 60%) in the d-OPMLs (for example, loss at 9p21.3 and gain at 20q13.31–q13.33) were absent or at very low frequency in all the other samples.

The additional use of high resolution DNA flow cytometry (hr DNA-FCM) carries two relevant advantages. First, it allows (with the use of the technical capability of DNA-FCM-Sorting) to identify the presence of DNA aneuploid sublines and to physically separate them from the DNA diploid components and to submit therefore a relatively pure population of epithelial aneuploid nuclei to a-CGH. Second, this combined approach made it possible to compare the DNA aberrations obtained by hr DNA-FCM and a-CGH and to eventually indicate genomic regions underlying the development of DNA aneuploid sublines.

Among the 10 different aberrant chromosomal regions that we have detected for both ODFs and OPMLs, we found that loss at 9p21.3 and gains at 5p13.33-pter and 20q13.31–q13.33 were associated with DNA aneuploidy (p = 0.00004; p = 0.0005; p = 0.01). Though based on a relatively small number of ODF/OPML samples (n = 38), these results represent a new finding, which suggests that candidate genes within these chromosomal regions may be linked with mechanisms of chromosomal instability and aneuploidy. In particular, the chromosomal loss in the region 9p21.3 appears to be an interesting one, because it is characterized by a small size of only 2.16 Mb and only 9 genes, because 50% of the d-OPMLs lost this region and also because it is strongly correlated with DNA aneuploidy. Clearly, only specific types of studies may identify which specific gene/s may be responsible for such an important change as DNA aneuploidization. As far as known function of genes located in this region, we recall the gene CDKN2A (Cyclin-dependent kinase inhibitor 2A or TP16). This tumor suppressor gene, which is frequently inactivated by methylation, gene mutation and allelic deletions, was reported to play an important role in the carcinogenesis of OSCCs [36]. Also the chromosomal region 5p15.33-pter (segment size = 1.9), which was similarly associated with DNA aneuploidy, is an interesting one. Within this region, we recall the hTERT gene (Telomerase reverse transcriptase), which was implicated in telomerase-mediated chromosomal instability along with cell growth and apoptosis imbalances [37, 38].

Finally, we like to cite the RTEL1 gene (Regulator of telomere elongation helicase 1) located in the 20q13.33-qter region, which was commonly present among the ODFs/OPMLs. This gene was reported to play a crucial role in telomere maintenance, chromosomal instability, DNA repair, negative regulation of apoptosis and was found to be amplified in several human cancers [3942].

Clearly, the identification of genes responsible for chromosomal instability and aneuploidization in the early phases of oral carcinogenesis represented by ODFs and OPMLs would be a very important task. With respect to our a-CGH data, we have also posed this question: have the ODFs and the OPMLs some chromosomal aberrations in common? The observation that some CNAs at specific chromosomal regions for the ODFs (1p36.32-pter, 7p22.2-pter, 8q24.2-qter, 11p15.5-pter, 16p13.3-pter, 20q13.33-qter) were also present (in almost all cases at increasing frequencies) for the OPMLs might suggest that these aberrations could have played a role in the ODF-OPML transition and that they may be clonally related. In other words, the ODFs might represent in some cases an early precursor of the OPMLs of the same patient. It is to be noticed, however, that two ODFs with CGH aberrations had 2 corresponding nd-OPMLs without aberrations. This might also imply that the origin of either ODFs or nd-OPMLs may be polyclonal. Clearly, we are well aware that these chromosomal aberrant regions are quite large and might contain a large number of genes beside those eventually involved in the ODF-OPML transition.

With respect to the hr DNA-FCM data, we found that among the 19 ODFs of the present series, only one was characterized by a DNA Index value different from 1 (DI ≠ 1), i.e., by a DNA aneuploid subpopulation (1/19 = 5.26%). On the contrary, DNA aneuploid subpopulations were detected in 6 out of 19 OPMLs (32%). These results suggest that DNA aneuploidization is a relatively frequent early event in OPMLs and that sporadically such early aberration may already occur in ODFs within the same anatomical subsite. In no cases, however, we detected a common DNA aneuploid subline (with the same DI value) in both ODFs and corresponding OPMLs. Clearly, a much greater number of cases is needed to address this question. These data are in agreement with an independent study performed only by DNA-FCM in which we found that the percentage of DNA aneuploid ODFs and OPMLs were respectively 8.5% and 28% [20].

The absence of a DNA aneuploid component in the pure DNA diploid samples did not allow an enrichment of the epithelial component and therefore diploid stromal cells were also present with the epithelial cells. It is also to be noticed that the amount of material was not sufficient to obtain DNA diploid suspensions of cells to be marked and sorted according to epithelial specific antibodies and then to be reduced to nuclei suspensions and extracted DNA. The presence of epithelial cells (at least 50%) in the DNA diploid samples was controlled by morphological inspection from tissue samples obtained in parallel with the ODF/OPML samples.

In conclusion, specific aberrant chromosomal regions detected by high resolution a-CGH in oral visually and histologically normal fields and in potentially malignant lesions appear to be early events of the natural history of oral cancer associated with an early onset of the field cancerization and with DNA aneuploidy. Work remains to be done to identify possible specific genes within the CNA common regions, which might potentially contribute to the ODF-OPML transition and to DNA aneuploidization. The detection of these genomic aberrations, including DNA aneuploidy, which is prone to routine analysis, may help to identify individual patients with OPMLs at high risk to develop OSCCs and indicate alternative criteria for the management of these patients [34, 4345]. Clearly, the validation of this hypothesis requires long term prospective follow-up studies with an appropriate number of cases.