Introduction

The histopathological diagnosis of lymphoid malignancies can be challenging, particularly when morphological, clinical and immunophenotypic findings are not conclusive. Early stages of malignant lymphoproliferations, clinical and morphological overlap with reactive conditions, absence of defining immunophenotypic abnormalities, and increasingly small diagnostic biopsy samples are common problems. Especially in peripheral T cell lymphomas, morphological heterogeneity, lack of easily identifiable immunophenotypical markers of clonality and high numbers of accompanying reactive cells increase the diagnostic challenge. Therefore, molecular analysis of T cell receptor gene rearrangements is a crucial part in the diagnostic process of suspected T cell lymphoproliferations.

Routine clonality assessment should be simple and robust, with good sensitivity and specificity rates. PCR-based techniques provide high sensitivity, speed and cost effectiveness, and can be applied to small amounts of DNA derived from formalin-fixed paraffin-embedded (FFPE) samples. For these reasons, PCR-based clonality analysis has largely replaced the previous gold standard Southern blot in daily practice [3]. However, the question which of the many published protocols for T cell clonality determination is best suited for routine diagnostics of FFPE tissues is far from settled.

To assess T cell clonality, the T cell receptor γ gene (TCRγ) locus on 7p14 has been the preferential target for clonality analysis due to its restricted germline repertoire, containing nine functional variable (V) region genes and five joining genes, which limits the number of required PCR primers, as compared to the complex TCRα and TCRβ gene loci. Since the 1990s, multiple protocols have been developed for the detection of TCRγ rearrangements, ranging from relatively simple protocols with consensus primers amplifying the different Vγ families in two PCRs with one product size distribution of TCRγ rearrangements, to multiple tubes with one or more fluorochromes for primer labelling and several size ranges for amplified polyclonal products [1, 2, 48]. Although the use of multiple primer sets covering most or all Vγ genes increases sensitivity, multiple ranges of product sizes may lead to small pseudoclonal peaks in a reduced polyclonal background if uncommonly used variable or joining gene segments are rearranged [9]. Furthermore, the number of primers in a single tube reaction may also affect results and can lead to false negativity as a result of competitive inhibition of amplification in the multiplex setting [10].

Although the high complexity of the TCRβ gene proved to be challenging for primer design, several groups have undertaken to develop protocols to detect TCRβ rearrangements with a variety of approaches, mostly using highly degenerate primers [1015]. This was shown to increase the sensitivity and reliability of T cell clonality detection when compared to TCRγ-based protocols [11, 16].

A major effort to improve and standardize protocols for clonality determination was undertaken by the BIOMED-2 concerted action BMH4-CT98-3936 which included more than 30 European laboratories and large numbers of samples. In addition to PCR assays for the TCRγ locus, primer sets for the TCRβ and TCRδ loci have been included. Since publication of the initial results [10] and commercialization of the primer sets, a multitude of further studies regarding the utility of the BIOMED-2 assays for T cell clonality analysis with freshly frozen and FFPE samples have been published. Although excellent sensitivity was demonstrated in most studies, the aspect of false positive results, which may have far-reaching clinical implications, has received considerably less attention [9, 17]. Several aspects of the BIOMED-2 assays may affect performance in daily practice. BIOMED-2 assays were developed on freshly frozen tissue [10, 18]. Although they have by now been tested extensively in FFPE tissues [17, 19], their complex design with several products distributed over a relatively large size range increases the risk for both overcalling clonality based on preferential amplification due to poor template quality, as well as false negative results because of dropout of larger product sizes. For PCRs targeting loci with limited germ line variety, separation techniques based on both product length and sequence, such as, heteroduplex analysis, are superior to methods purely separated by product size, such as, the widely used capillary electrophoresis with fluorescence-labelled primers (Genescan) [10, 20, 21]. In addition, interpretation of the complex results requires considerable knowledge of potential biological and technical pitfalls as well as significant experience of test performance under local laboratory conditions, increasing the risk for misinterpretation in less experienced laboratories [20].

The aim of this study was to compare sensitivity and specificity of two previously published, relatively simple, but broadly used TCRγ clonality protocols of the first generation with the BIOMED-2 TCRγ and TCRβ assays in a panel of well-characterized peripheral T cell lymphomas and control cases [1, 2, 10]. Using capillary electrophoresis with fluorochrome-labelled primers as detection method, we specifically wanted to test in a routine setting, whether the practical benefit of the increased sensitivity of the BIOMED-2 protocols might be affected by a higher rate of false positive or equivocal results, as has been suggested by others [9]. We therefore enriched our samples for problematic, but nevertheless, commonly investigated tissue types, such as, skin and bone marrow biopsies, and also included a panel of B cell lymphomas.

Materials and methods

Samples

A total of 52 cases of morphologically and phenotypically well-characterized, paraffin-embedded peripheral T cell lymphomas (PTCL) and 55 control cases, including, 25 B cell non-Hodgkin lymphomas (B-NHL) and 30 reactive lesions were retrieved from the Institute of Pathology and the Department of Dermatology, University Hospital Tübingen, Germany. Specific diagnoses and tissue sources are listed in Table 1. All lymphoma cases were classified according to the guidelines of the World Health Organization Classification of Tumors of Hematopoietic and Lymphoid Tissues [22] and reviewed by three of the authors (P.A., L.Q.-M., F.F.). Special emphasis was given to smaller samples with limited numbers of lymphocytes, such as, bone marrow (n = 23) and skin (n = 21) biopsies. Bone marrow biopsies were fixed in formalin, decalcified with EDTA for 7 h and embedded in paraffin, which results in good DNA quality comparable to other FFPE tissues [23, 24]. Ethics approval for the study was obtained from the local ethics committee, University Hospital Tübingen.

Table 1 Specific diagnoses and tissue sources

DNA isolation

DNA was extracted from 10-μm paraffin-embedded tissue sections. After dewaxing, standard proteinase K digestion was performed. DNA quality was assessed in all samples with the BIOMED-2 control gene PCR [10] and subsequent agarose gel electrophoresis. For both Trainor and McCarthy protocols, crude DNA extracts, as well as DNA extracts purified by phenol/chloroform/isoamyl alcohol were compared in their performance in a major subset of the samples, whereas BIOMED-2 assays were only performed with purified samples.

Clonality analysis

The PCR of TCR rearrangements of all 107 tissue samples was performed using TCRγ primer sets previously described by McCarthy et al. and Trainor et al. [1, 2]. In modification of the original reports, however, Jγ primers were modified with D4 fluorescent dyes (Sigma-Aldrich, St. Louis, MO, USA). The first approach by McCarthy et al. consists of two V-consensus primers targeting eight genes and two consensus primers for the five genes within two tubes, but does not amplify Vγ9 rearrangements. The PCR products range from 75 to 95 and 80 to 110 base pairs for the Jγ and the Jp primer sets, respectively, thus making the assay well suited for FFPE-derived DNA. The protocol by McCarthy et al. was adapted with an initial denaturation step of 98°C (30 s), 38 cycles (98°C 10 s, 57°C 60 s, 72°C 30 s) and a final step of 7 min. The second TCRγ method by Trainor et al. involves a multiplex assay with nine Vγ primers and three Jγ primers (D4 flourescently-labeled) which are divided into two tubes [2]. Amplification according to Trainor et al. was modified with an initial denaturation step of 98°C (30 s), 42 cycles (98°C 10 s, 55°C 60 s, 72°C 60 s) and a final step of 10 min. Phusion Hot Start DNA Polymerase (Finnzymes Oy, Espoo, Finland) was used for amplification in conjunction with Phusion GC Buffer containing MgCl2.

Each sample was analysed at least in duplicate by using two different DNA dilutions in case of unpurified DNA, or 30 and 100 ng of purified DNA, respectively. DNA of the cell lines Jurkat and SUP-M2 were used as positive controls.

A subset of 50 samples, including 25 T-NHL and 25 control cases was analysed in duplicate for TCRγ and TCRβ rearrangements using the commercially available BIOMED-2 assays [10] (InVivoScribe Technologies Inc., San Diego, CA) according to the manufacturer’s instruction manual with the modification of using 25-μl reactions with 250 ng of purified DNA. In these 50 case subsets, we included all samples which had rendered false negative, false positive or equivocal results with the Trainor and McCarthy TCRγ protocols (see below). Since γδ-T cell neoplasms or T-lymphoblastic lymphomas were not included in the study, the BIOMED-2 TCRδ assay was not performed.

GeneScan analysis

For GeneScan analysis, 1 μl of the PCR products was mixed with sample loading solution containing 0.45 μl of DNA Size Standard 400 (Beckman Coulter, Brea, CA, USA). The products were separated by capillary electrophoresis on the GenomeLab GeXP Genetic Analysis System and analysed by the GenomeLab GeXP software 10.2 (Beckman Coulter, Brea, CA, USA).

Evaluation criteria and statistical analysis

Criteria of interpretation

All cases were evaluated by two independent observers, and discrepancies were resolved by discussing the GeneScan images to reach consensus. The following definitions were used throughout the study:

  • Clonal: one/two reproducible peaks greater than threefold the height of the neighboring peaks within the expected size range for a given primer set

  • Equivocal: one or two clearly discernible and reproducible peaks, but less than threefold the height of the neighboring peaks for a given primer set within the appropriate size range

  • Polyclonal: amplicons of different length and sufficient quantity covering the whole expected size range in a given primer set, with a near-Gaussian curve matching or similar to the polyclonal control

  • Oligoclonal: three or more, usually not reproducible peaks within the appropriate size range and regarded as polyclonal for the purpose of the study

  • Technically not evaluable: no amplification or weak amplification product with a mean fluorescence signal intensity significantly below average with few minor peaks not reproducible in repeat PCRs

Results

Samples

A total of 52 T-NHL specimens were analysed, including 19 PTCL, not otherwise specified, 13 angioimmunoblastic T cell lymphomas (AITL) and seven cases of mycosis fungoides, among others. Diagnoses and tissue sources are listed in Table 1. The control set included 25 B-NHL and 30 specimens of reactive lymphoid tissues, reactive bone marrow (BM) biopsies and skin biopsies with a variety of well-characterized benign inflammatory dermatoses. Of the samples analysed, 95/107 (89%) showed amplification of the 300-bp band in the BIOMED-2 control PCR including all skin and BM biopsies.

Detection of clonality in primary samples with McCarthy and Trainor primer sets

All 107 cases were analysed using the protocols of McCarthy et al. and Trainor et al. Clonal TCRγ rearrangements were detected in 42 out of 52 (81%) and 43 out of 52 (83%) T-NHL samples using the McCarthy and Trainor protocols, respectively.

In total, 45 (86%) samples were monoclonal with at least one technique, and concordant monoclonal results with both methods were detected in 40 out of 45 of the cases. In contrast, six (12%) cases (four peripheral T cell lymphoma, not otherwise specified (PTCL-NOS), one AITL and one cytotoxic mucocutaneous T cell lymphoma) remained poly-/oligoclonal with both techniques, and one case was poly-/oligoclonal with one and equivocal with the second primer set. Of the six discordant cases, three were monoclonal with the Trainor primer set and poly/-oligoclonal or equivocal with the McCarthy set. Two of these cases showed a clear-cut monoclonal result with Trainor tube 1 and a polyclonal pattern with McCarthy tube 1, indicating the presence of a Vγ9 rearrangement in these cases. This was confirmed with the BIOMED-2 TCRγ primer set (Fig. 1a). The other three discordant cases included two cases monoclonal with the McCarthy assay and polyclonal or equivocal with the Trainor primer set, and finally a case with equivocal results in McCarthy, but clearly polyclonal in the Trainor assay.

Fig. 1
figure 1

GeneScan analysis of representative samples. a AITL (lymph node) with a clonal TCRγV9 rearrangement. There is a clear-cut clonal pattern in the Trainor primer set 1, but a polyclonal pattern in McCarthy primer set 1. Monoclonality is confirmed with BIOMED-2 TCRγ Tube B. b DLBCL, bone marrow biopsy. Both McCarthy and Trainor primer sets 1 show a clonal TCRγ product with prominent polyclonal background confirmed in repeats. c Incomplete DJβ rearrangements. Reactive lymph node, TCRβ Tube C. Although both products show a polyclonal pattern, the larger-sized product shows a significantly diminished intensity, indicating differences in amplification efficiency (upper lane). B-NHL with a monoclonal peak and polyclonal background in both product size ranges, indicative of an incomplete DJβ rearrangement. TCRβ Tube C (middle lane). Two PTCL with monoclonal (left) and polyclonal (right) PCR products (lower lane). d Skin biopsy with benign inflammatory dermatosis demonstrating polyclonal results with McCarthy and Trainor primer sets 1, as well as the BIOMED-2 TCRγ (Tube B) and TCRβ assays (Tube B). Note the irregular, not perfectly Gaussian distribution of products in all PCRs due to a limited number of T-cells and possibly restricted repertoire. The occasional peaks of increased intensity were not reproducible in repeats

Clonal or equivocal results were identified in six out of 55 (11%) control cases with the McCarthy protocol and six out of 53 (11%) amplifiable controls with the Trainor assay. A single reactive BM (3%) rendered a monoclonal result with both primer sets, four (13%) reactive cases showed a monoclonal or equivocal result with one of the two protocols, whereas the remaining benign controls remained polyclonal in all PCRs. Three (12%) of 25 B-NHL cases showed a reproducible monoclonal TCR rearrangement with one (one case) or both (two cases, Fig. 1b) primer sets.

Comparison with BIOMED-2 TCRγ and TCRβ protocols

From the total study set of 107 cases, a subset of 50 cases, which exhibited the required DNA quality (at least 300-bp products in the BIOMED-2 control PCR) were additionally analysed by the TCRγ and TCRβ BIOMED-2 assays (Table 1). This subset contained 25 peripheral T cell lymphomas, including all six cases, which had shown a polyclonal pattern using the Trainor and McCarthy protocols, and the two T-NHLs with suspected Vγ9 rearrangements. Additionally, 25 controls including most samples giving monoclonal or equivocal results were analysed. In this subset of 50 “difficult” cases specifically selected to test the performance of the BIOMED-2 protocols, the McCarthy primer set achieved a sensitivity of 64% (16 out of 25), whereas a sensitivity of 72% (18/25) was reached with the Trainor protocol. Detailed results are shown in Fig. 2a, b.

Fig. 2
figure 2figure 2

Results of TCRγ PCR for the 50-case subset, using the modified protocols according to a McCarthy and b Trainor, as well as c BIOMED-2 TCRγ and d BIOMED-2 TCRβ and e the combined BIOMED-2 assays. Percentages of clonal, polyclonal, equivocal and technically not evaluable results are given for T-NHL, B-NHL and reactive samples. Only incomplete DJ TCRβ rearrangements (Tube C) were identified in B-NHL samples (asterisk)

Of the 25 T-NHL cases, 23 showed a clonal result with one or both BIOMED-2 sets (TCRγ or TCRβ), including all cases found to be monoclonal with one or both previous techniques. Four of the six polyclonal cases with both Trainor and McCarthy primer sets revealed a monoclonal product with the TCRβ set, including one with incomplete DJβ rearrangement only (Tube C), increasing the sensitivity for the combined BIOMED-2 assays to 92%. One of these cases additionally revealed a TCRγ rearrangement.

Of the 15 reactive control cases analysed, four (27%) cases showed either a monoclonal (one sample), equivocal (one sample) or technically not evaluable result (two samples) with the combined BIOMED-2 TCRγ and TCRβ assays. In comparison, three out of 15 (20%) cases had shown a monoclonal or equivocal result with the Trainor or McCarthy protocols, of which one proved to be polyclonal with both BIOMED-2 sets. Of the ten B-NHL samples analysed with BIOMED-2 primers, one showed an equivocal result with the TCRγ set, and four monoclonal results with the TCRβ set. Of interest, all four samples exhibited only an incomplete DJβ rearrangement (Tube C) (Fig. 1c). In contrast, three out of ten B-NHL samples showed monoclonal or equivocal results with both McCarthy and Trainor protocols. The results of the 50-case subset with BIOMED-2 primers are depicted in Fig. 2c–e. Specificities in the B-NHL cases were 80% (McCarthy), 70% (Trainor) and 60% (BIOMED-2 TCRγ and TCRβ) versus 93% (all protocols) for reactive cases, if equivocal cases were considered together with the poly-/oligoclonal cases. For all non-T-NHL cases combined, specificity was 88% (McCarthy) versus 84% (Trainor) and 80% (BIOMED-2 TCRγ and TCRβ).

In summary, the combined BIOMED-2 TCRγ and TCRβ assays revealed a superior sensitivity of 92% and a comparable specificity of 80% as compared to the McCarthy and Trainor protocols.

Influence of tissue type, DNA quality and DNA purification

The specimen type was correlated with the PCR results to identify a potential influence of the tissue source. In the 50-case subset, false negative results were overrepresented among lymph node biopsies with McCarthy, Trainor and BIOMED-2 TCRγ protocols.

The false positive results among control cases in general were overrepresented among bone marrow biopsies. Equivocal results were observed with all protocols among reactive BM biopsies, whereas no equivocal results were found among the lymph node samples, indicating an influence of DNA quality and number of lymphocytes in the sample.

Suboptimal DNA quality, defined as showing only the 200-bp band in the control PCR, was present in 12 out of 107 samples, including six T-NHL and six control samples. Both McCarthy and Trainor protocols detected a monoclonal rearrangement in five out of six T-NHLs. Furthermore, a polyclonal result was obtained in all six control samples with the McCarthy protocol and in five out of six with the Trainor primer sets. These results are similar to the whole collective, indicating relatively robust assay performance. These samples, however, were considered inadequate according to guidelines for the BIOMED-2 primer sets and therefore not analysed further.

Purified and unpurified DNA extracts were compared with Trainor and McCarthy protocols. An improvement in assay performance was noted in a third of the investigated samples after purification.

Discussion

Detection of T cell clonality in clinical samples is an important tool for the diagnosis of T cell lymphoproliferative disorders. Consequently, a large number of PCR tests for this purpose have been developed, culminating in the multinational and multi-institutional effort, which resulted in the development of the BIOMED-2 protocols. These assays for the detection of T cell clonality targeting both TCRγ and TCRβ loci show superior sensitivity compared with most previously published protocols [10]. Clonality detection rates in T-NHL of 86–89%, 87–94% and 94–100% were reported for the TCRγ, TCRβ and the combined assays, respectively, in freshly frozen tissues. With FFPE material, analysis of either TCRγ or TCRβ renders monoclonal results in 69–81% of cases, reaching 80–100%, when both assays are evaluated jointly [3, 10, 17, 25, 26]. Our comparative analysis confirms that the addition of BIOMED-2 primers increases the sensitivity of clonality analysis in FFPE tissues of T-NHL over simpler protocols. The sensitivity of 92% for the combined BIOMED-2 assays in our study is remarkable because the 50-case subset was enriched for cases negative with both Trainor and McCarthy TCRγ protocols. BIOMED-2 assays identified clonality in four of the initially false negative cases, mostly due to positive results with the TCRβ assay. Of note, the two cases which remained negative with BIOMED-2 primers corresponded to lymph node biopsies of PTCL-NOS, which showed some, but not all, diagnostic features of AITL. In contrast, all ten extranodal biopsies with T-NHL infiltrates, including skin, bone marrow and gastrointestinal tract samples rendered positive results. This indicates that the false negativity was not due to poor DNA quality or biopsy size but rather to the inherent properties of the malignancy, such as, oligoclonality, which can be observed in a proportion of AITL, or presence of incomplete or deleterious rearrangements of the TCR or trans-rearrangements [27].

Another aim of our study was to investigate the issue of specificity, which has received considerably less attention in previous reports. A large study of reactive lymphoproliferations performed by the BIOMED-2 consortium analysed exclusively freshly frozen tissues and excluded FFPE tissues, stating that “PCR results of such samples are largely dependent on variation in formalin fixation and paraffin embedding” [18]. In daily practice, however, many laboratories commonly have to deal with FFPE samples. We therefore focussed on FFPE tissues and included a number of “difficult” samples, such as, EDTA-decalcified BM, endoscopic and skin biopsies, which are often overrepresented among samples submitted to clonality analysis. If only clearly reproducible false positive results are taken into account, BIOMED-2, McCarthy and Trainor protocols show roughly equal specificities ranging from 80% to 88%. False positive results can be due to underlying biological as well as technical factors. On one hand, minor T cell clones are detectable in reactive conditions such as autoimmune diseases or viral infections [18, 28, 29]. In addition, the presence of a monoclonal T cell population could be due to a clinically and morphologically undetectable (pre-) neoplastic disorder, such as, an early stage of T cell large granular lymphocytic leukemia in bone marrow biopsies. Of interest, most false positive results obtained with BIOMED-2 assays were due to the high incidence (40%) of reproducible, isolated incomplete DJβ rearrangements (Tube C) in our mature B-NHL panel. In B-NHL, the occasional presence of monoclonal T cell rearrangements has been described by several groups. This can either be due to cross-lineage rearrangements, which are usually found in precursor B cell neoplasms but can occasionally occur in mature B-NHL [30, 31] or may represent expansion of benign reactive T cell clones in response to the B-NHL. Of interest, a relatively high rate of incomplete TCRβ DJ rearrangements in B-NHL has been observed in another study [13]. Since isolated incomplete TCRβ DJ rearrangements without clonal products with other primer sets seem to be rare in T-NHL, we recommend to interpret these isolated findings with great caution.

Although our results show a favourable rate of false positive or equivocal results in FFPE under carefully controlled conditions, if the incomplete TCRβ DJ rearrangements are excluded, in our experience, BIOMED-2 assays are nevertheless more prone to false positive results and overinterpretation for several reasons. The PCR products generated with the BIOMED-2 primers range from 240 to 325 bp for the TCRβ set and from 80 to 255 bp for the TCRγ set, which is larger than most products generated with the McCarthy (75–110 bp) and Trainor (170–230 bp) protocols. In addition, BIOMED-2 assays generate different normal product size distributions within one PCR [8]. The separation of TCRγ rearrangement amplicons into multiple size distributions with several fluorochromes and sometimes less-than-ideal Gaussian curves makes the interpretation sometimes difficult and increases the likelihood of pseudoclonal peaks and consequently false positive results [9]. This is especially true for amplification of uncommonly used joining or variable gene segments, and if the sample contains limited numbers of T cells or poor DNA quality. That the amount of polyclonal target cells in the specimen is a critical factor for the generation of pseudoclonal peaks is demonstrated by the higher incidence of false positive and equivocal results in bone marrow biopsies. Although identification of pseudoclonal in contrast to true monoclonal products is facilitated by the performance of repeat PCRs, some of these minor non-malignant clones can be detected reproducibly. Whether the use of alternative, sequence-sensitive techniques for product separation, such as, heteroduplex analysis, allows to reduce the rate of false positive results was not examined in this study. For skin biopsies, analysis of several specimens from different sites has been recommended and can strengthen a diagnosis of cutaneous T-NHL if the same clone is detected in more than one sample [32]. However, oligoclonality or occurrence of heterogeneously distributed, different T cell clones has been repeatedly described in mycosis fungoides [33, 34].

In conclusion, our study confirms the superior sensitivity of the BIOMED-2 TCRγ and TCRβ assays in the detection of T cell clonality, even in “difficult” FFPE samples. However, the uncritical use of these protocols on FFPE samples may result in an increase of false positive results. This is especially true for extranodal samples with limited amount of lymphoid cells, poor template DNA quality and lack of experience with the interpretation of the complex product patterns. Assessment of DNA quality and routine performance of duplicate reactions are mandatory to obtain reliable results. In addition, reliance on numerical thresholds such as >3× or >5× peak height or similar mathematical and statistical algorithms without critical observance of overall amplification efficiency on one hand, and clinicopathological features on the other hand can result in incorrect interpretations. Amount and lymphocyte content of examined tissue, DNA quantity and quality, availability of previous samples for comparative analysis, and outcome of control and duplicate reactions all need to be taken into consideration before signing out molecular results. In the case of discrepancies between molecular and histological findings, these need to be addressed in the final report [20].

In addition, our study demonstrates that fairly simple, first-generation TCRγ assays such as the two protocols tested above show robust performance despite their lower sensitivity and still provide an acceptable alternative, e.g. for samples with poor DNA quality or as first test in a stepwise approach to T cell clonality analysis. Irrespective of the protocols used, determination of T cell clonality in FFPE samples requires stringent quality controls and careful interpretation of results with full knowledge of potential technical and biological pitfalls.