Introduction

Precancerous lesions of the upper aerodigestive tract keratinizing epithelium include leukoplakia, erythroplakia and mixed leukoerythroplakia (hyperplastic epithelial lesions). These clinically defined lesions have been stated to harbor an increased risk compared to normal mucosa for transformation into squamous cell carcinoma. When histopathologic evidence of cytologic and architectural atypism is present in the absence of invasion, the lesions are referred to as dysplastic and the presence of dysplasia has been considered to represent an increased risk identifier for malignant transformation over that of squamous lesions that fail to show dysplastic features [1, 2]. This review will define the microscopic features encountered in various grading systems of upper aerodigestive tract squamous dysplasia, discuss diagnostic reliability among pathologists, tabulate data on prognostication relative to carcinomatous transformation and review potential molecular and biomarker predictors.

Grading Systems

Dysplasias have been subdivided according to the degree of architectural and cytologic atypia. These grading systems have been evaluated for their prognostic utility with the assumption that mild atypical changes will have a lower risk for transformation to invasive carcinoma than those with advanced or severe changes. In the larynx, the most frequently employed systems in use among pathologists include the World Health Organization (WHO) and the Ljubljana Grading systems for vocal cord lesions; oral precancerous lesions are typically classified according to WHO [36]. The WHO grading system for both vocal cord and oral mucosa divides dysplasia into three categories, with the realization that dysplastic change is in reality a dynamic process. These three stages include mild, moderate and severe dysplasia flanked by hyperplasias or benign keratoses on the benign end of the spectrum and carcinoma in situ on the malignant extreme.

Low grade dysplasia is defined as cytologic and architectural atypia confined to the basal/parabasal layer; moderate dysplasia is characterized by atypical changes progressing into the mid spinous layer and severe dysplasia progresses into the upper spinous layer (Fig. 1). Carcinoma in situ is characterized by atypical changes from top to bottom. Many pathologists collapse severe dysplasia and carcinoma-in situ into a single category. Some pathologists have applied the cervical dysplasia paradigm to upper airway lesions: oral intraepithelial neoplasia (OIN 1, 2 and 3) or squamous intraepithelial neoplasia (SIN 1, 2 and 3) [7].

Fig. 1
figure 1

The WHO grading system for oral and laryngeal precancerous lesions. a. Benign hyperplasia (benign keratosis), b. mild dysplasia, c. moderate dysplasia, d. severe dysplasia, e. carcinoma in situ

The Ljubljana grading system devised in Slovenia by Kambic and Gale [3, 5] for laryngeal precancerous lesions divides stages of progressing dysplastic changes into three categories: “simple” hyperplasia characterized by epithelial thickening without atypical cytologic changes, “abnormal” hyperplasia represented by bulbous rete pegs and hyperplasia of the basal/parabasilar compartment with nuclear crowding and enlargement, and “atypical” hyperplasia represented by cytologic atypia extending into the upper strata (Fig. 2). Carcinoma-in situ is reserved for those lesions with pronounced top to bottom atypia and a loss of normal stratification. Table 1 compares the Ljubljana and WHO models.

Fig. 2
figure 2

The Ljubljana grading system for vocal cord precancerous lesions. a. Simple hyperplasia, b abnormal hyperplasia, c. atypical (risky) hyperplasia, d carcinoma in situ

Table 1 Vocal cord dysplasia

Grading Reliability

If any classification scheme for upper airway precancerous lesions is to be employed or universally adopted, it should be utilitarian. This utility includes reproducibility by examiner pathologists, both between (interexaminer) and within (intraexaminer) and lesional grades should have prognostic value with regard to potential for malignant transformation to squamous cancer [7]. Investigations that assess examiner reliability are listed in Table 2. Reliability data are presented as either % agreement or are expressed as Kappa statistics. In these studies, a group of examiners is asked to utilize a specified grading system and render a histopathologic diagnosis on a selection of lesions. Reponses are then compared to assess for agreement or disagreement. In those studies that address intra-rater correlations, pathologists are assigned a set of glass slides and render an initial diagnosis and subsequently, at a given future date, reevaluate the same group of slides and render a diagnosis while remaining blinded from their initial diagnoses.

Table 2 Inter- and intra-rater reliability for dysplasia grades in oral and laryngeal precancerous lesions

In general, regardless of the grading system chosen, agreement among pathologists is only fair, kappa values falling under 0.5. Vocal cord lesional assessment employing the WHO criteria showed a poor level of agreement; however, when a binary grading system that included only two diagnostic categories (mild dysplasia versus severe dysplasia) were applied, it revealed an improvement in interrater agreement with a kappa of 0.7 [8]. Among oral precancerous lesions, using the WHO criteria, kappa values hover between 0.5 and 0.6 [713]. In the investigation by Kujan et al. [9] a binary grading system did not produce a higher level of agreement over the WHO system; however, in the study by Abbey et al. [11] an 80% agreement level was realized when pathologists were asked to evaluate cases that were dysplastic (any grade) compared to cases without dysplasia. A similar level of agreement was observed within pathologists (intra-rater).

Table 3 Progression to carcinoma from dysplastic precancerous lesions of the laryngeal and oral mucosa (percentage)

These data are somewhat disappointing, being nonreproducible in many instances. If pathologists cannot agree on diagnoses for which clinicians plan treatment strategies, then the system somehow fails unless all lesions regardless of histology are treated in the same fashion. It is noteworthy that certain variables factor into disagreement among pathologists, namely presence or absence of inflammation, lesion site and biopsy method [12].When pathologists undergo a group training session on what histopathologic, cytologic and architectural features are used as criteria for each grade of dysplasia/hyperplasia, subsequent agreement reliability significantly improves [14].

Prediction of Malignant Transformation

Prognostication should be the outcome goal of histopathologic grading of precancerous lesions of the upper air passages. Many studies have been undertaken to determine the prevalence of carcinomatous transformation in clinically defined leukoplakia, erythroplakia, acute laryngitis (vocal cord erythroplakia) and chronic laryngitis (vocal cord leukoplakia) [1423] (Table 3). Five recent follow-up studies on vocal cord lesions report a range of 3.8–11.2% malignant transformation rate for all lesions regardless of histopathology (i.e. both dysplastic and nondysplastic hyperplasias); a 10.5–32% rate for lesions with histologic evidence of dysplasia has been reported [1418]. When grade of dysplasia using either the WHO or Ljubljana systems are assessed, there is a progressive increase in carcinomatous transformation rate with increasing severity of dysplastic change. Data on oral mucosal dysplasias, based on five follow-up studies discloses a malignant transformation rate range for combined dysplastic and nondysplastic lesions of 3.9–17% [1, 1922]. In the study by Silverman et al. [1], the risk for cancer progression in leukoplakias jumped from 17% for all lesions to 36% for dysplastic lesions whereas the remaining studies failed to find any statistically significant increased risk for dysplastic lesions [1922]. Holmstrup et al. [20] demonstrated a progressive increase in risk correlated with increasing degrees of dysplasia; however, their population sample was too low to achieve statistical significance. Lastly, it is noteworthy that the vast majority of precancerous lesions arising on vocal cord and oral mucosae, regardless of whether dysplastic changes are extant, do not progress to carcinoma, at least within the follow-up period specified in various reports.

Molecular Biomarkers and Prediction

A variety of proliferation markers, cell cycle cyclin proteins, cyclin kinases, oncoproteins, tumor suppressor mutations, microsatellite loss of heterozygosity (LOH), nuclear image parameters and DNA ploidy have been investigated in both oral and laryngeal carcinomas as well as dysplasias [17, 23, 24]. These investigations have provided insight into the molecular mechanisms of carcinogenesis. It is beyond the scope of this communication to review this literature, and therefore only those with promise as potential biomarker predictors will be discussed. It is noteworthy that immunohistochemical markers that mirror the changes observed microscopically are of no prognostic value. Only a marker that can be identified in premalignant lesions whose presence predicts progression offers any meaningful clinical application. Three approaches have shown utility for upper airway prognostication including the presence of aneuploidy, microsatellite instability with LOH at 3p and 9p and computer generated nuclear image analysis [2527].

Crissman and Zarbo [25] demonstrated a direct correlation between the presence of aneuploidy and degree of dysplasia in vocal cord lesions, with all severely dysplastic lesions being aneuploid. Unfortunately, this study was not progressive and did not address predictive follow-up information. No reliable ploidy data have been forthcoming in any large follow-up studies of oral mucosal dysplastic lesions. Rosin et al. [26] investigated hyperplastic and mild dysplastic oral lesions, assessing lesions that progressed to carcinoma and those that failed to progress using microsatellite LOH as a biomarker. They reported a 33 fold elevated cancer risk for lesions that progressed to cancer when LOH at 3p and/or 9p coupled with additional losses at 4q, 8p, 11q, or 17p were identified. Guillaud et al. demonstrated that this molecular predictive feature could be further augmented when combining LOH data with nuclear image analysis (nuclear phenotype score).

Discussion

Based upon current knowledge, the majority of dysplastic oral and vocal cord lesions do not progress to squamous cancer, at least in a mean follow-up period of 7 years and therefore the terms “intraepithelial neoplasia”, either oral or squamous are to be discouraged for head and neck lesions. Even the issue of an increase in carcinomatous transformation prevalence among dysplastic as compared to benign hyperplastic lesions may be questionable for oral precancerous lesions. Alternatively, an increased cancer risk in vocal cord dysplasias is extant over nondysplastic laryngeal epithelial hyperplasias, although not many long term follow-up studies have been published.

Some studies do report an increased risk for cancer when dysplasia is encountered in oral mucosal leukoplakias and erythroplakias, although only one study compared risk according to grade of dysplasia, failing to find a statistically significant correlation. Prognosis and prediction for cancer progression from precancerous lesions will continue to be based on features of dysplasia until more definitive markers can be discovered. Carcinoma-in situ should be considered a true “intraepithelial” neoplastic process and further investigations are needed to assess this severest variant of dysplastic change as a probable predictor.

Biomarkers with potential predictive value have been published and may prove to be clinically relevant. These include DNA ploidy, computerized nuclear image analysis and micro satellite instability at 3p and 9p with addition LOH at other loci.