Introduction

The increased risk of tuberculosis (TB) in healthcare workers is well known (Seidler et al. 2005). Therefore, screening HCWs for latent TB infection (LTBI) and preventive chemotherapy is a cornerstone of TB prevention programs (CDC 2005). However, the conventional tuberculin skin test (TST) has known limitations in accuracy and reliability. Furthermore, interpretation of serial TST results is complicated by non-specific variation and because of its intradermal application, by potential boosting from precedent tests (Pai et al. 2007). The development of the interferon-γ (INF-γ) release assays (IGRA) is welcomed as a means of overcoming this problem. The IGRAs allow ex-vivo testing and therefore are not prone to boosting. In addition, the IGRAs are highly specific, giving them valuable advantages over the TST especially in Bacillus Calmette-Guérin (BCG)-vaccinated populations (Diel et al. 2006; Nienhaus et al. 2008).

As with the TST, IGRA results are determined by several factors: precision of measurement technique, intrapersonal biological variation, new infection (conversion), transient infection (Ewer et al. 2006) or transition of Mycobacterium tuberculosis (MTB) from replication to a dormant state no longer stimulating cell-mediated immune response (reversion). MTB cannot be directly observed in the body. Therefore, its presence and replication activity can only be measured indirectly by antigen-specific response in TST or IGRA. For the TST, it is common sense that test interpretation in serial testing should be based on a comparison between actual and previous TST results. Depending on circumstances, an increase in TST ≥6 mm (Menzies 1999) or ≥10 mm (ATS 2000) is believed to indicate MTB presence and activity. Preliminary data showed that, similar to TST, an easy positive/negative interpretation of serial IGRA is not warranted (Pai et al. 2006) and a more sophisticated approach to IGRA interpretation in serial testing is needed. However, data on IGRA interpretation in serial testing is sparse. The few published studies available are rather small, allowing limited conclusions only (Hill et al. 2007; Franken et al. 2007; Cummings et al. 2009). So far, different ‘uncertainty zones’ for QuantiFERON-TB® Gold In-Tube (QFT), one of the two commercially available IGRAs, have been proposed. Based on the Indian data, a person whose IFN-γ result increased from <0.20 and exceeded 0.50 IU/mL on the repeat test was considered to have a ‘true conversion’. Likewise, a person whose IFN-γ result decreased from >0.50 and fell to <0.20 IU/mL was considered to have a ‘true reversion’ (Pai et al. 2009). Based on South African data, it was suggested that an increase in IFN-γ response from below 0.35 IU/mL to above 0.70 IU/mL for the QFT assay could be used to define conversions (van Zyl-Smit et al. 2009). Because high spontaneous reversion rates were reported, when the first QFT showed INF-γ between 0.35 and 0.7 IU/mL (Yoshiyama et al. 2009), it is unknown to what extent people falling into this category benefit from chemotherapy. In our follow-up study, we analyzed conversion and reversion rates in serial testing of HCWs with QFT, depending on baseline concentration of INF-γ and TST variation as well as for different definitions of conversions and reversions. Assuming that a small variation in baseline INF-γ concentration should not result in high changes to the conversion and reversion rates, we tried to derive an uncertainty zone around the cutoff for the QFT to be used in serial testing.

Materials and methods

Study setting and study subjects

The population of this follow-up study comprises all workers of the Hospital S. João who participated in TB screening from February 2007 through September 2009. The hospital is located in the northern part of Portugal and serves as a referral center for TB. On average, 250 TB patients are treated per year and a total of 32,000 patients are admitted for all diagnosis. In addition, there are about 500,000 outpatient contacts per year. As reported from a previous study of the same hospital (Torres Costa et al. 2009), the annual incidence rate of active TB in Portuguese HCWs (192 per 100,000) was about six times higher than the one in the general population in Portugal (32/100,000) in 2006.

In accordance with CDC guidelines, HCWs in infection and TB wards are considered to be at high risk, workers with regular patient contacts in the other wards are considered to be at medium risk and workers with no regular patient contacts or no contacts to biological material are considered to be at low risk (CDC 2005). Upon commencement of employment, all workers are examined. HCWs considered to be at high risk are evaluated annually. All others are evaluated every second year or after known exposure to patients with active TB.

TST and IGRA have been performed simultaneously. TST was performed when the diameter of a previous TST was below 15 mm or when no previous TST result was known. A chest X-ray was performed when TST was ≥10 mm or IGRA was positive or in HCWs with TB symptoms.

BCG vaccination was assessed through the individual vaccination register or by scars. Following the national vaccination plan, BCG vaccination for newborns is mandatory in Portugal and until January 2000 was repeated if TST was <5 mm (National Vaccination Plan 2009). Therefore, every HCW has been vaccinated at least once.

TST was performed by trained personnel following standard procedures. In brief, 0.1 mL (2 TU) of purified protein derivate (RT23; Statens Serum Institute, Copenhagen, Denmark) was injected intradermally at the volar side of the forearm, and the transverse diameter of the induration was read 72–96 h later. A diameter ≥10 mm was considered positive. A conversion in TST was defined as a TST ≥10 mm and an increase of ≥10 mm or less stringent ≥6 mm compared to a previous TST <10 mm (Menzies 1999, ATS 2000).

Blood for the IGRA was drawn during the same appointment during which the TST and an interview were conducted. As IGRA, the QuantiFERON-TB® Gold In-Tube (QFT) assay (Cellestis Limited, Carnegie, Australia) was administrated following the manufacturer’s protocol. Concentrations above 10 IU/mL were set to 10 IU/mL because of imprecision of measurement at these high concentrations (Pai et al. 2009). According to the manufacturers, an INF-γ concentration ≥0.35 IU/mL after subtracting the NIL control is defined as a positive test result. Four different definitions for conversion and reversion were applied: (1) transgression or regression over cutoff, (2) increase from <0.2 to >0.7 IU/mL or decrease from >0.7 to <0.2 IU/mL, (3) transgression or regression over cutoff plus change ≥0.35 IU/mL, and (4) transgression or regression over cutoff plus change ≥0.50 IU/mL. Observers were blinded to the results of the TST and vice versa.

Statistical analysis

For metric variables, box plots were drawn giving the median as black line in the box and the 25 and 75 percentiles as the boundaries of the box. Chi-square tests were used for categorical data. Baseline INF-γ concentration was categorized in small increments in order to observe at which increment the highest change in conversion and reversion rates occurs. The 95% confidence intervals (CI) for proportions were calculated. If the 95% CI did not overlap, differences between proportions were considered as statistically significant.

The participants gave informed consent to the participation in the study. No agreement of an ethics commission was requested because data collection was performed during routine screening procedure following normal work protection standards (CDC 2005). For study purposes, no additional tests were performed and it is not standard practice to seek ethical approval for the anonymous analysis of routine data in Portugal.

Results

The study population comprises 679 persons working in healthcare with two consecutive QFTs. The study period covers February 2007 until September 2009. Indeterminate results were observed in nine (1.3%) HCWs. One of these nine HCWs had indeterminate results on both occasions. The characteristics of the remaining 670 HCWs as well as of a subgroup of 252 HCWs who had three consecutive QFTs are given in Table 1. The subgroup is comparable to the whole group with respect to the distribution of age, gender, BCG history, profession, risk assessment, and number of TSTs during the study period.

Table 1 Description of the study population with two consecutive QFTs and subpopulation with three consecutive QFTs

The first and second QFTs were positive in 30.0% of the HCWs (Table 2). A conversion occurred in 11.0% of those negative in the first QFT and a reversion in 22.1% of those positive in the first QFT, if a simple dichotomous approach (negative-positive and vice versa) was chosen. Reversion and conversion rates depended on the INF-γ concentration of the first QFTs. Conversion occurred in 4.8% of the 376 HCWs with an INF-γ concentration at baseline below 0.1 IU/mL but in 48.9% of the 41 HCWs with an INF-γ concentration of 0.2 to <0.35 IU/mL. The same pattern is observed for reversions. In the 49 HCWs with a baseline INF-γ concentration >7.0 IU/mL, two reversions (4.1%) occurred while in those 55 (34 + 21) HCWs with a baseline INF-γ concentration ≥0.35 to <0.7 IU/mL, about every second HCW showed a reversion. For reversion and conversion rates, a neat dose-frequency relation to the concentration at the first QFT was observed.

Table 2 Results of second QFT depending on INF-γ concentration in first QFT

The diameter of the TST was positively associated with the probability of two consecutive positive QFTs. This probability increased from 10% in those with a TST <10 mm to 31.7% for those with a TST ≥15 mm (Table 3). An increase in the second TST by at least 10 mm was seen in 61 (30.7%) of those who had a first TST <10 mm. Of these 61 HCWs 78.7% were negative in the two consecutive QFTs and 6.6% showed a conversion in QFT (Definition 1). In those HCWs with a TST of 10 ≤ 15 mm who were retested during the study period, four (2.1%) showed decreases in their TST results of ≥10 mm and seven (4.5%) of ≥6 mm.

Table 3 Results of first and second QFT in relation to TST and to change in TST

Conversion and reversion rates showed statistically significant differences, depending on the definition used (Table 4). The conversion rates were highest following TST (17.9%) and second highest when crossing the cutoff for QFT was used as a definition. The 95% CI of these rates does not overlap, indicating a statistically significant difference. Using a gray zone from 0.2 to 0.7 IU/mL and excluding all those who have at least one QFT within this gray zone from calculation resulted in low conversion (3.6%) and reversion rates (5.2%). Somewhat higher rates were obtained when, in addition to a positive–negative approach, a minimal chance of 0.35 or 0.5 IU/mL was requested for conversion and reversion. For all definitions, conversion rates were lower than reversion rates but the difference was statistically significant for the least stringent definition only (1a + b, Table 4). For the TST, reversion rates (2.1 or 3.8%) were considerably lower than conversion rates (17.9 or 22.8%).

Table 4 Number and rate of conversions and reversions depending on different definitions for ‘true’ conversions and ‘true’ reversions

In the subgroup with three consecutive QFTs, the same trend was observed (Table 5) as in the whole study group. However, the proportion of those who were positive throughout all three QFTs dropped to 14.3% (36/252) from 24.2% (162/670) with two positive consecutive QFTs in the whole group. Two (10%) reversions (one transient reversion—second QFT negative and third QFT positive) were observed in those with a baseline INF-γ concentration ≥3.0 IU/mL, while 73.3% showed a reversion at the second (n = 10) or the third (n = 1) QFT when the baseline INF-γ concentration was ≥0.35 to <0.7 IU/mL. In addition, one (6.7%) transient reversion occurred. In Figs. 1 and 2, the association between baseline INF-γ concentration and reversion or conversion can also be seen. The median of the baseline INF-γ concentration was ≥0.1 IU/mL in those with conversion in the second or in the third QFT (Fig. 1). Only one reversion occurred when the baseline INF-γ concentration was above 3.0 IU/mL (see asterisk in Fig. 2).

Table 5 Results of second and third QFT depending on INF-γ concentration in first QFT
Fig. 1
figure 1

Box plot for INF-γ concentration of the first QFT depending on whether all three consecutive QFTs were negative (− − −), the third QFT became positive (− − +), only the second QFT was positive (− + −) or the two following QFT were positive (− + +) for the subgroup with a negative first QFT (n = 197)

Fig. 2
figure 2

Box plot for INF-γ concentration of the first QFT depending on whether all three consecutive QFTs were positive (+ + +), the second and third QFT became negative (+ − −), only the second QFT was negative (+ − +) or the third QFT became negative (+ + −) for the subgroup with a positive first QFT (n = 55)

The time between the first and second QFT was, on average, 11.5 months (standard deviation 4.0 months). The time between the first and third QFT was, on average, 19.8 months (standard deviation 5.5 months). No association was observed between the time span of the QFTs and the probability of conversion or reversion in the QFT (data not shown).

Nine HCWs were diagnosed with active TB, all but two were acid-fast bacillus (AFB)-positive, culturally confirmed cases. In one person, diagnosis was based solely on PCR (Table 6). All persons with active TB were positive in the first QFT. The TST was ≥15 mm in seven and 10–14 mm in two of them. Seven HCWs had pulmonary TB, one pleural TB, and one skin TB. Six active TB cases were diagnosed within 2 months of the first QFT. The other three cases were diagnosed three, seven, and 19 months after the first positive QFT. In one case, a second QFT was performed at the time of diagnosis 3 months after the first QFT and an increase from 0.51 to 1.96 IU/mL was observed. The median of the INF-γ concentration in those with actual pulmonary TB was 2.26 IU/mL, the minimum was 0.51 IU/mL, and the maximum 6.32 UI/mL. For the HCW with pleural TB the INF-γ in the first QFT was 0.42 IU/mL and in the skin TB case it was >10 IU/mL. After diagnosis and treatment, a reversion occurred in the patient with pleural TB and a sharp decline occurred in the HCW with cutaneous TB (>10–1.04 IU/mL). For the other six cases, increases and decreases of INF-γ concentration were observed three times, respectively. A positive QFT led to diagnosis in four HCWs with no symptoms. In the other 5 HCWs with pulmonary, active TB, typical symptoms such as cough, fever, weakness, or weight loss were observed along with a positive QFT.

Table 6 Characteristics of the 9 HCWs diagnosed with active TB

Discussion

Our study is the largest follow-up study for serial testing to date. Furthermore, it is also the only study on serial testing that actually observed active TB cases, thus allowing conclusions about test interpretation in serial testing to be based on these findings. As suggested in smaller studies before (Pai et al. 2006; Pai et al. 2009; Hill et al. 2007; Franken et al. 2007; Yoshiyama et al. 2009; van Zyl-Smit et al. 2009), our data shows that a simple positive/negative approach in the interpretation of the IGRA might be misleading because of a high number of spontaneous conversions and reversions originating from INF-γ concentrations close to the cutoff for the QFT. Using an uncertainty zone around the cutoff would help to distinguish between clinically unimportant variation and true conversion and reversion. If one of the consecutive QFTs falls into this uncertainty zone, conversion or reversion is doubtful. On the basis of our data, the lower limit of the uncertainty zone could be 0.2 IU/mL and the upper limit 0.7 IU/mL because this provides the sharpest decrease in conversion and reversion rates. Even though a reversion rate with initial INF-γ concentration between 0.7 and ≤1.0 was high (17.4%), the uncertainty zone should not be extended to this range because we observed an active pulmonary TB with an INF-γ concentration of 0.92 IU/mL.

The conversion rate (11%) we observed was similar to those reported for Indian HCWs (11.6%) (Pai et al. 2006). In the Japanese HCW study, the conversion rate was lower (1.7%) (Yoshiyama et al. 2009). In a recent German HCW study, the conversion rate was 1.9% (Ringshausen et al. 2010). When applying the gray zone and defining a conversion as a transgression from <0.2 to >0.7 IU/mL, the conversion rate decreased from 11 to 3.6%. We believe the lower conversion rate to be more realistic because Portugal is a country with medium TB incidence comparable to Japan, while India is a high-incidence country. Therefore, most conversions we observed are unlikely to be explained by an increased replication rate of MTB (reactivation) or new infection with MTB.

A conversion in TST (increase ≥10 mm) occurred about three times as often as a conversion in QFT (30.7% versus 11%). Therefore, TST most likely overestimated the conversion rate. Independent of the criteria, conversion of TST was not predictive of a positive QFT. Three out of four HCWs who fulfilled the criteria for TST conversion were negative in the QFT. This casts some doubt on the validity of the change criteria in serial testing with TST.

The reversion rate (22.1%) we observed was similar to those reported for Indian HCWs (24%) (Pai et al. 2006). In the Japanese HCW study, the reversion rate was higher (52.6%) (Yoshiyama et al. 2009). In this study, 80% (eight out of ten) of the reversions had at least one INF-γ concentration falling into the above-defined uncertainty zone. In our data, spontaneous reversions were rare (4.1%) when baseline INF-γ concentration was >7.0 IU/mL. The reversion rate for a baseline INF-γ concentration between 1.0 and 3.0 IU/mL observed by us was about the same as that observed in the Indian household contact study (18.9 versus 17%) (Pai et al. 2009). Even though this reversion rate is high and is therefore unlikely to be explained by true reversion (transition from T-cell stimulating replication to dormant state of MTB), a QFT ≥ 1.0 should be considered as indicative of LTBI because four out of our eight active TB cases had baseline INF-γ concentrations between 1.0 and 3.0 IU/mL. One reason for the reversion rate being higher than the conversion rate (22.1% versus 11%) might be the ‘tendency toward the mean’. In repeated testing, random measurement errors tend to draw the results toward the mean, which in our study was below 0.35I U/mL. Our data suggest that this random measurement error is particularly import around the cutoff.

A reversion in TST occurred five to ten times less often than a reversion in QFT. Reversion in TST was not associated with reversion in QFT. Most often, those with a reversion in TST were negative in both consecutive QFTs. However, our data on reversion in TST is too sparse to draw any meaningful conclusions.

Variability of INF-γ concentration is influenced by several factors: intra-individual short-term variations of test results, variation in precision of measurement techniques, insufficient standardization of the handling of the probes. Variations caused by these factors can hardly be distinguished from variations caused by immunologic responses to increasing or decreasing replication of MTB in the persons infected. So far, only one study has measured short-term variations in IGRA results. However, this study was performed in a high-incidence country (van Zyl-Smit et al. 2009). Therefore, it was not possible to distinguish between random variation of INF-γ and immunologic response to MTB exposure. As a limitation of our data, we did not collect data on variables that might influence the QFT results. In future, studies are needed which analyze intra-individual variation of IGRA results and risk factors for this variation (e.g. alcohol consumption, time of test). Furthermore, variation of test results due to handling variations, different test kits or other technical aspects of the laboratory procedures should be analyzed.

Studies on TB prediction by QFTs show promising results in low to intermediate TB incidence countries (Diel et al. 2008; Aichelburg et al. 2009; Yoshiyama et al. 2010). In the Japanese prediction study, risk of progression to active TB increased with the concentration of QFT (Yoshiyama et al. 2010). So far, there is no study available that describes the association between the changes in IGRA and disease prediction. Only one of our nine TB cases was diagnosed after a second QFT was performed. In this patient, both QFTs were positive, with an increase from 0.51 to 1.96 UI/mL. This gives limited support to the hypothesis that QFT variations might be predictive of TB progression risk.

When using an uncertainty zone of 0.2–0.7 IU/mL, it should be kept in mind that a QFT result around the cutoff (0.35 IU/mL) does not exclude active TB. We observed a pleural TB patient with an INF-γ concentration of 0.42 IU/mL. It is known that INF-γ response in peripheral blood might be low in pleural TB patients (Hooper et al. 2009). Therefore, this observation was not surprising. Furthermore, we observed active pulmonary TB in an HCW with a positive first QFT falling into the uncertainty zone and a second QFT clearly above the uncertainty zone. It tells us that when using an uncertainty zone in serial testing, this should be done with the greatest of care. Uncertainty zone means that spontaneous, clinically irrelevant transgressions over the cutoff are probably predominant, but it does not exclude LTBI or even active TB. Sensitivity studies for QFT using active TB as a surrogate for LTBI suggest that disease activity might be inversely related to INF-γ concentration (Menzies et al. 2007; Diel et al. 2010). Therefore, as with the TST, recent exposure and clinical symptoms must be taken into account when interpreting QFT results.

Progression rate for active TB is highest in the first 2 years after exposure. Therefore, preventive treatment is most effective if recent infection is likely. In those with a positive QFT result falling into the gray zone and recent accidental and unprotected contact with infectious patients or materials, QFT should be repeated within 4 weeks after the first test. If QFT remains positive, preventive treatment should be initiated. If screening was performed because of regular contact with TB patients or infectious materials and therefore a distinction between old or recent LTBI is not possible, the next routine screening and therefore the next IGRA should be performed in 1 year. This recommendation is based on weak evidence. It should be borne in mind that low concentrations in QFT do not exclude progression to active TB, as was observed in the above-mentioned Japanese prediction study (Yoshiyama et al. 2010). However, disease progression was more likely with higher INF-γ concentrations in QFT. In future, studies are needed that analyze progression risk depending on IGRA results, changes in IGRA results, and exposure history.

Despite our increasing knowledge, several key questions about latent infection and reactivation of M. tuberculosis remain unanswered. Particularly, it should be noted that both the TST and the IGRA are designed to identify an adaptive immune response against M. tuberculosis, but not necessarily a latent infection. A positive result of currently available diagnostic tests is primarily a measure of an immunologic response to stimulation by mycobacterial antigens that should not, therefore, be equated with the presence of live M. tuberculosis in the human host. The proportion of individuals who truly remain infected with M. tuberculosis after TST or IGRA conversion is unknown. It is also uncertain how long adaptive immune responses toward mycobacterial antigens persist in the absence of live mycobacteria. For these reasons, according to the recently published TBNET consensus statement regarding latent TB and based on the informative value currently derived by IGRA and TST, the term “latent infection” would at best implicate “lasting tuberculosis immune responses” and would not necessarily identify a true latent infection with viable microorganisms and potential risk of developing active disease (Mack et al. 2009).

In conclusion, our data suggests the use of an uncertainty zone between 0.2 and 0.7 IU/mL in serial testing with QFT. As long as our knowledge regarding disease progression in QFT-positive persons is limited, in countries with limited experience in chemoprevention, persons pertaining to the uncertainty zone should be retested before being offered preventive chemotherapy.