FormalPara Key Summary Points

Why carry out this study?

There are limited data on CMV genotypes among non-hematopoietic stem cell transplant (HSCT) patients with hematological malignancies (HM) undergoing chemotherapy.

This study aimed to determine the distribution of CMV gB, N, and O genotypes and to explore their potential effect on its viral load and on clinical outcomes.

What was learned from the study?

Our study indicated that mixed CMV gB, gN, and gO genotypes were common.

Mixed glycoprotein genotypes were more frequently associated with clinical manifestation and higher viral load.

Digital Features

This article is published with digital features, including a summary slide, to facilitate understanding of the article. To view digital features for this article go to https://doi.org/10.6084/m9.figshare.14560542.

Introduction

Patients at risk of severe cytomegalovirus (CMV) infection include preterm neonates, patients with congenital or acquired immunodeficiency such as HIV, those undergoing chemotherapy or radiotherapy, and solid organ or hematopoietic stem cell transplant (HSCT) patients. CMV infections and subsequent diseases are increasingly recognized in non-transplant hematologic malignancies settings, especially in leukemia and lymphoma receiving chemotherapy. In these patients, CMV may cause either mild or even asymptomatic clinical manifestations as well as life-threatening disease [1,2,3]. However, little data are available about the impact of CMV virulence factors on infection outcome [4]. The virulence among different CMV strains may be an important factor in the occurrence and severity of CMV-associated diseases [5, 6], because of the genetic variation in functionally important genes such as glycoproteins gB, gN, and gO that are involved in host cell penetration, tissue tropism, or replication [6,7,8,9,10,11,12,13]. Five gB genotype (gB1, gB2, gB3, gB4, and gB5) have been identified on the basis of sequence variation in the UL55 gene encoding gpUL55 (gB) [14, 15]. The gpUL73 (gN) encoded by UL73 gene has four main genomic variants gN1, gN2, gN3 and gN4, with gN3 subdivided into gN3a and gN3b while the gN4 genotype has three subgroups gN4a, gN4b, and gN4c [16, 17]. Another CMV surface glycoprotein, gpUL74 (gO), encoded by UL74 gene has at least eight genetic variants, including five major genotypes (gO1, gO2, gO3, gO4, and gO5) with minor subtypes (gO1a, gO1b, gO1c, gO2a, gO2b) [9]. In immunocompromised patients, mixed CMV infection has been shown to be associated with elevated viral load [5]. However, there is no full understanding of the pathogenic mechanisms underlying this association. The relationship between CMV genetic variants and serious complications of CMV infection can depend on the additive or synergistic activity of at least two CMV glycoproteins. No evidence of genetic linkage between gN/gB genotypes has been observed previously [17]. On the other hand, there were eight significant linkages between the gN/gO genotypes as follows: gO1a/gN1, gO1b/gN3a, gO1c/gN4c, gO2a/gN3b, gO2b/gN2, gO3/gN4a, gO4/gN4b, and gO5/gN4c from different clinical settings [18, 19]. The impact of these linkages on the severity of CMV infection is not well defined. The genetic linkages between gO and gN could have a potentially functional interaction, which may lead to virus attachment, replication, and spread.

Thus, the current study investigated the distribution of CMV gB, N, and O genotypes and explored the possible relationship between different genotypes and clinical characteristics in Tunisian non-HSCT patients with HM.

Methods

Study Population

To screen patients for our study, 95 patients with newly diagnosed acute leukemia (52 acute lymphoblastic leukemia (ALL) (39 B cell ALL and 13 T cell ALL) and 43 acute myeloid leukemia (AML)) and 50 Hodgkin’s and non-Hodgkin’s lymphoma (10 HL and 40 NHL) were prospectively enrolled in the follow-up study during the period from January 2016 to December 2018. All 145 consecutive patients with leukemia/lymphoma were included, regardless of CMV serostatus. Initial treatment with conventional chemotherapy induces initial remission. Consolidation and maintenance therapy have been used as post-remission strategies. Salvage regimens were selected in case of primary refractory or at relapse. In the present study, patients who received allogeneic or autologous HSCT were excluded. During the study period, patients included did not receive CMV antiviral therapy. Most viral infections were self-limited (clear the virus without therapy).

Study Design

The study design involved prospective sampling at the time of diagnosis (before chemotherapy) and in the different phases of chemotherapy (post-induction, post-remission, and post-salvage). Plasma samples were collected from patients at diagnosis, and once every 7–14 days in the same period, but the interval was greater because sampling was not performed during periods in which patients were not hospitalized. For some patients, the number of follow-ups was limited because of early death or loss of follow-up samples. The median follow-up duration was 7.4 months range (1–36) from the start of chemotherapy for patients with acute leukemia and 6.8 months range (2–15) for those with lymphoma. Only those patients with positive CMV infection were included in this study and were evaluated with CMV load testing and genotyping of CMV glycoproteins B, N, and O. These patients included a cohort of 60 non-HSCT patients with active CMV infection.

The study design was approved by the Ethics Committee and Medical Research of the Farhat Hached University Hospital, Sousse, Tunisia (Reference number IRB00008931) and informed consent was obtained from all participants enrolled in the study before starting the research work.

Definitions and Data Collected

Active CMV infection was defined as the detection of CMV DNA (any level of plasma viral load) in plasma specimens. For each patient, CMV recurrence was defined as detectable CMV DNAemia after initially achieving clearance following the initial episode of CMV infection. A viral co-infection was assigned as the detection of DNA of more than one type of herpesvirus (HHV) in the same sample and mixed genotype was defined as the presence of more than one CMV genotype in the same sample.

The hospital records were evaluated and the clinical characteristics and microbiological data were registered for each patient. Detailed information regarding the data collected in the present study is provided in the supplementary material.

CMV Serology

Serum samples from all patients before starting chemotherapy were assessed for anti-CMV IgG antibodies with chemiluminescence assays by the Abbott Architect i2000SR.

Multiplex PCR Detection for Herpesviruses (HHVs)

HHVs DNA were extracted from plasma samples using a DNA extraction kit (QIAamp DNA Mini Kit, QIAGEN, Germany). The detection of HHVs (HSV-1/2, VZV, EBV, CMV, HHV-6A/B, and HHV-7) was performed on plasma samples according to the multiplex PCR method described by Tanaka et al. [20].

Quantitative PCR Assay for CMV DNA

Only those patients with HHVs multiplex results positive for CMV were included in the study and evaluated with CMV viral load in the first positive plasma samples using the artus CMV RG PCR kit (Qiagen, Germany) and the Rotor-Gene Q (RGQ) instrument according to the manufacturer’s recommendations. The lower limit of detection for this assay is 57.1 copies/mL. On the basis of the assay results, patients were categorized as patients with high-level CMV DNAemia (at least 1000 copies/mL) and low/moderate-level DNAemia (less than 1000 copies/mL).

Characterization of CMV gB, gN, and gO Genotypes

The first samples determined to be CMV DNA positive were further examined by genotyping envelope glycoproteins, UL55 (gB), UL73 (gN), and UL74 (gO) by the multiplex nested PCR assays as previously described [11, 21, 22].

The primers are listed in Table S1 in the supplementary material. The primers for the first round of PCR were designed on the conserved regions of UL55 (gB), UL73 (gN), and UL74 (gO) gene. After the first round of PCR, the amplicons were subjected to the next multiplex PCR round resulting in genotype determination using a set of genotype-specific primers. Positive and negative controls were included with each PCR run. Then the different genotypes were easily differentiated by agarose gel electrophoresis.

DNA Sequencing and Sequences Analysis

Randomly selected amplicons for the different gB/gO genotyping results were then confirmed by sequencing. Both forward and reverse DNA strands were sequenced with the same specific primers of the first PCR round using an ABI PRISM Big Dye Terminator Cycle Sequencing Reaction kit (version 3.1) on an ABI PRISM 3100 DNA sequencer (Applied Biosystems, Germany). Part of the gN genotyping results were further confirmed by PCR and sequencing using the gN sequencing primers. Sequencing alignment analysis was conducted using the NCBI BLAST to confirm their viral genotype.

Statistical Analysis

All statistical analyses were performed using SPSS software (version 22.0). The results are expressed as the mean ± standard deviation (SD), and percentage. Categorical variables were compared using the chi-square test or the Fisher’s exact test. All p values were based on a two-tailed test of significance. Statistical significance was defined as a p value less than 0.05.

Results

Patients’ Characteristics

Flowchart of patient enrollment is presented in Fig. 1. The overall CMV IgG seroprevalence was 96.5% (140/145). None of the patients who were negative for CMV IgG developed CMV DNAemia during the follow-up study. The prevalence of CMV DNAemia was 60 out of 145 (41.4%). According to each period, CMV DNAemia was detected in 5.5% (8/145), 11.1% (16/145), 16.4% (18/110), and 33.33% (18/54) before chemotherapy, at post-induction, post-remission, and post-salvage, respectively. Table 1 summarizes the demographic characteristics and clinical data of the enrolled patients. Sixty patients with active CMV infection (46 acute leukemia and 14 lymphomas) were included in the study between 2016 and 2018. The male/female sex ratio was 2.5. The patients were aged 1–69 years (mean 24).

Fig. 1
figure 1

Flowchart of the study population depicting the process of enrollment for analysis. Patients with CMV infection were enrolled in the study at different phases of chemotherapy: before chemotherapy, post-induction, post-remission, and post-salvage

Table 1 Demographic and clinical characteristics of the enrolled patients with active CMV infection

Detection and Quantitative PCR Assay for CMV DNA

Of the 60 patients infected with CMV, 39 (65%) patients were single infected and 21 (35%) were co-infected with other HHVs (EBV, HHV-6, HHV-7); 19 of these with two viruses and 2 with three viruses. None of those patients were positive for HSV-1, HSV-2, and VZV. All patients with active CMV infection were CMV IgG positive before starting chemotherapy.

High level of CMV DNAemia (at least 1000 copies/mL) was developed in half of the patients (30/60). All high-level cases of CMV DNAemia were at post-remission and at post-salvage phase of chemotherapy and not before chemotherapy.

Distribution of gB, gN, and gO Genotypes in CMV-Infected Patients

The distributions of glycoprotein gB, N, and O genotype are presented in Fig. 2. As observed, all the expected genotypes are present for gB, gN, and gO except for the gB5, gN2, and gN4b genotypes, none of which were present in the studied population. Sixty percent of CMV-infected patients (36/60) were found to carry mixed infections with multiple genotypes in at least one genetic locus of gB, gN, and gO. The gB, gN, and gO genotype was successfully determined in 96.6% (58/60), 93.3% (56/60), and 93.3% (56/60) of CMV-infected patients, respectively. Mixed CMV gB, gN, and gO genotypes were detected in 31% (18/58), 41% (23/56), and 46.4% (26/56) of CMV-infected patients, respectively. Among mixed gB genotypes, 72% (13/18) were dual, 22.2% (4/18) were triple, and only one was a quadruple genotype, i.e., 5.5% (1/18). Among the gN mixed genotypes, 87% (20/23) were dual gN genotype and 13% (3/23) triple genotype. Among the mixed gO genotypes, 76.9% (20/26) were dual gO genotypes and 23.1% (6/26) were triple genotypes.

Fig. 2
figure 2

Glycoprotein B, N, and O genotype distributions in non-transplant patients with HM undergoing chemotherapy. The overall distributions of gB, gN, and gO genotypes are shown in the pie charts of ac, respectively. The specific genotypes and the related percentages are indicated

Twelve patients in our cohort developed a second recurrent episode of CMV infection. Between the first and the follow-up samples investigated in the present study, all patients showed periods of undetectable CMV DNAemia. The median time between the initial and the relapse episode was 9.5 months range (2–16). Variations in glycoproteins were observed between initial and recurrent infection. An increase in the complexity of these genetic populations was seen in half of the cases, and new genotypes emerged compared to those found in the initial episode, while the others maintained the same genotypes, found in the initial episode, throughout the follow-up.

To evaluate further the accuracy of the multiplex PCR based, 24 single genotype strains were sequenced. There was no discrepancy between subtyping based on multiplex PCR assays and UL55 (gB), UL73 (gN), and UL74 (gO) sequences (data not shown).

Analysis of Glycoprotein Genotypes and Clinical Characteristics in CMV-Infected Patients

The relationship between CMV (gB) single genotypes and mixed genotypes and clinical characteristics of patients are presented in Table 2. When comparing infections involving mixed gB genotype infections and single genotype infections no statistically significant differences were observed with respect to gender status, age groups (≤ 22 years, > 22 years), underlying disease, status of the disease, recurrence of infection, clinical findings (severe neutropenia, febrile neutropenia, severe lymphopenia, anemia, thrombocytopenia, pneumonia, gastroenteritis, hepatitis, CNS disease, and skin rash), and opportunistic and infectious complications (bacteremia/fungal infections). Table 2 demonstrates that patients infected with mixed gB genotypes developed higher initial levels of CMV loads (p = 0.001), had an increased rate of fever (p = 0.025) ,and were significantly more often infected with other herpes viruses (p = 0.024) than patients with single gB genotype.

Table 2 Demographic and clinical characteristics of plasma samples from non-transplant patients with HM infected with CMV, according to gB viral genotypes

In Table 3, when comparing infections involving only single gN genotypes, single genotypes had common characteristics including gender status, age groups (≤ 22 years, > 22 years), underlying disease, status of the disease, recurrence of infection, intensity of initial viral load, clinical findings, and opportunistic and infectious complications.

Table 3 Demographic and clinical characteristics of plasma samples from non-transplant patients with HM infected with CMV, according to gN viral genotype

The presence of mixed gN genotypes was more frequently associated with profound lymphopenia (ALC < 500/µL) (p = 0.01) and increased risk of death (p = 0.042) than with single gN genotype, but this finding was not time-related (Table 3).

Because the number of infected patients with gO1a, gO1b, gO1c, gO2a, gO3, gO4, and gO5 genotypes was small, we did not perform a statistical analysis of correlations between clinical features and these gO genotypes. When comparing infections involving only single genotypes, gO2b genotype had a more unfavorable outcome (p = 0.09) than the other single genotypes, but this finding was not time-related. The mixed gO genotype group compared with only single genotype group was more associated with female gender (p = 0.015), acute leukemia disease (p = 0.036), initial high level of CMV viral load (p = 0.029), and skin rash (p = 0.01) (Table 4).

Table 4 Demographic and clinical characteristics of plasma samples from non-transplant patients with HM infected with CMV, according to gO viral genotypes

Analysis of Genetic Linkage of gO/gN Variants in Non-HSCT Patients with Mixed CMV Genotype Infection

Both gO/gN genotypes were identified from 55 CMV-infected patients. Twenty-five patients with single genotypes in both gO/gN infection were found. Thirty patients were co-infected with mixed genotypes in at least one genetic locus of gO and gN. The consistent predefined linkages (described previously) of gO/gN were found as follows: gO1a/gN1 in 16.4% (9/55), gO1b/gN3a in 1.8% (1/55), gO1c/gN4c in 20% (11/55), gO2a/gN3b in 14.5% (8/55), gO3/gN4a in 20% (11/55), and gO5/gN4c in 3.6% (2/55).The linkages of gO2b/gN2 and gO4/gN4b were not found. However, novel linkages between gO/gN genotypes were found as gO3a/gN4a in 20% (11/55), gO2b/gN3b in 16.4%, gO1a/gN3b, and gO1c/gN3b in each with 12.7% (7/55), gO1a/gN4a,gO2a/gN4c, gO2b/gN1 gO2b/gN4a, andgO3a/gN4c in 10.9% (6/55), respectively. The percentage of all associations of gO/gN genotypes in CMV-infected patients is shown in Table 5.

Table 5 Association between different gO/gN linkage groups in mixed CMV infection (N = 55)

The effects of gO/gN genotype linkages on CMV viral load are shown in Table 6. The difference between the viral load within the different linkages groups (presence and absence of linkages) was not statistically significant. However, only the presence of gO1a/gN3b linkage group showed a significantly higher viral load compared with the absence of the gO1a/gN3b linkage (p = 0.012) (Table 6).

Table 6 Analysis of gO/gN genotype and viral load

Discussion

To our knowledge there have been limited studies on the distribution of CMV genotypes and their possible role among non-transplant patients with HM undergoing chemotherapy. And, only some studies have involved analysis of gB genotypes in this setting. Moreover, no study has addressed the distribution and the role of gN and gO genotypes in this group of patients [5].

In the present study mixed CMV infection with more than one glycoprotein genotype (gB, gN, and gO) was found in 60% of infected patients with acute leukemia and lymphoma undergoing chemotherapy. The mixed CMV gB, gN, and gO genotypes were detected in 31%, 41%, and 46.4%, respectively, of the CMV-infected patient. These results are consistent with previous studies in different population groups which underlined that co-infection with different strains is common. In fact, a high proportion of mixed CMV infections with multiple glycoprotein genotypes was noted in a variety of population groups including immunocompetent [23], patients with AIDS [22, 24], allo-HSCT recipients [25, 26], solid organ transplantation (SOT) [6, 27, 28], and infants with congenital CMV infection [29]. In contrast, a lower prevalence of mixed infections was noted in other groups of infants with congenital infection [11], AIDS [30], HSCT [12, 31], and SOT recipients [6]. In Tunisia, the distribution of gB genotypes was described in one study and it was found that the gB1and gB2 genotypes were dominant with no mixed genotypes in immunocompromised patients including transplant recipients and patients with AIDS [32]. Most of these previous studies have focused on a singular viral envelope glycoprotein gene characterized by polymerase chain reaction (PCR)-based genotyping which is relatively insensitive to multiple-strain infections [18, 33, 34]. The detection of mixed infection was also hampered by tissue-cultured viruses because of the selection of a single strain [35, 36]. However, in the current study, clinical specimens were tested directly to prevent underestimation of virus diversity and we used highly sensitive multiplex nested PCR and subsequent Sanger sequencing technology at multiple loci.

A different finding was described in previous studies with almost no mixed gN genotypes, such as in newborns with congenital infection [11] and in immunocompromised patients including SOT population [37], or patients with AIDS [30]. In striking contrast, other studies showed a higher prevalence of gN mixed infection in African children [33] and 40.2% in Chinese HSCT recipients [38]. While plenty of data are available on the gB and gN genotype, little is known about the gO genotype distribution in immunosuppressed patients.

Consistent with a previous study, mixed gO genotype was the most common genotype in allo-HSCT recipients [25]. But, this finding differs from a study by Roubalova et al. who found that the gO1 genotype was the most frequently genotype identified in allo-HSCT recipients with CMV infections [39]. The differences among studies may, in part, be due to the wide spectrum of technique sensitivity, variation in geographical and/or demographic allocation of genotypes, variability in transmission, and differences in mechanisms of pathogenesis and cause of immunosuppression [15, 27, 40].

In this study most of the patients were anti-CMV IgG seropositive before chemotherapy, and most probably viral DNAemia during chemotherapy was due to reactivation or reinfection under immunosuppressive conditions rather than primo-infection. Interestingly, analysis of different samples from the same patients at a later CMV infection episode revealed the emergence of new genotypes over time in half of patients with recurrent infections. This result may have multiple explanations. CMV reactivation or reinfection seems to occur stochastically, as has previously been shown using the murine model [41, 42]. It is possible that these patients acquired new strains during the time between the first screening and follow-up. If not, both strains may have been present at both time points but not detected in the first screening samples because of a low DNAemia of minor virus populations. Moreover, quantitative differences among the different CMV genotypes could be associated with different replication efficiencies of specific genotypes. In this context, the dominating genotype can be expected to have the most pronounced impact on the course of CMV infection. Furthermore, the relative frequencies of the individual genotypes within a patient did not remain constant over time, a finding which has also been described for Epstein–Barr virus strains in mixed infections [43]. It is also of interest to consider that genotype-specific immune responses might have an influence on the rate of replication of a particular genotype [41].

There is growing evidence that the occurrence of mixed genotype infections serves as a marker of severe or prolonged complications from CMV infections in SOT and HSCT recipients [5, 6, 25, 28, 44,45,46].Whether these observations can be extrapolated to the non-HSCT setting is yet to be elucidated. In agreement with a previous report in immunocompromised patients [5], our data showed that patients who were infected with mixed gB genotypes developed higher initial levels of CMV loads and had significantly more concomitant infection with other HHVs than patients infected with single gB genotypes. It is also interesting to note that patients infected with mixed gN genotypes were more associated with profound lymphopenia (ALC < 500/µL) and increased risk of death than patients infected with single gN genotype. In a previous study single gN1 and gN4b showed a significantly different virulence and could serve as early predictors for the progression of CMV infection in transplant patients [47].

In the present study, patients infected with gO2b genotype had more unfavorable outcome than patients infected with single gO genotypes. In comparison to single genotypes, mixed genotypes were more frequently associated with female gender, acute leukemia disease, skin rash, and the initial high level of CMV viral load. In contrast, no clear correlation existed between the gO genotypes and CMV disease in clinical isolates from different geographic regions and different disease settings [9, 16, 48, 49].

On the basis of the distinct distributions of CMV gB, gO, and gN genotypes in different clinical settings and the association between mixed genotypes and serious complications, we assumed that a single polymorphic gene as a virulence marker was not sufficient to investigate the molecular epidemiology of CMV. The virulence of CMV may depend on the synergistic action of at least two glycoproteins. Different genotypes in mixed infection can interact with each other and the interaction between co-infecting genotypes may depend on the specific genotype combination. The exact mechanism of this strain interaction for CMV has not been identified, but it has been shown in the mouse model that different murine CMV variants can interact with each other through functional trans-complementation and that this can lead to impaired virus fitness [41, 42]. The linkage between gN and gO genotypes suggests possible functional interactions between these glycoproteins which may lead to virus replication, entry, and cell tropism [11, 16, 18, 19, 47]. Interestingly, a novel linkage between gO1a and gN3b was recognized in the studied patients and was significantly associated with higher viral load unlike the other defined gO/gN linkages [18, 19]. In contrast to the previous studies [16, 22, 30], no association was observed between gO1a/gN4a, gO1a/gN1, gO3/gN4a, and viral load, which may be due to limited sample numbers in this study. Therefore, further studies are needed to ascertain the strength of this new genetic linkage and to highlight the relevance of glycoprotein genotypes as virulence factors in the progression and the severity of CMV infection in immunocompromised hosts.

This study had some limitations such as the small number of patients enrolled, the monocentric type of study, and the limited geographical area surveyed. Larger epidemiological studies and improved genotype-specific real-time PCR-based assays and high-throughput sequencing technology may reveal a high level of intrahost variability and offer strong evidence that CMV exists as a complex mixture of variants in immunocompromised patients and may also clarify more the epidemiological and the clinical impact of these variants. The genetic profile of infecting viruses and relation to clinical outcome should be investigated, taking into account the host’s innate and adaptive immune responses [4, 8, 45, 50, 51].

Conclusions

Our results provide evidence that, in non-HSCT patients with HM undergoing chemotherapy, severe clinical manifestations occur mostly in patients infected with multiple CMV genotypes. The interaction of multiple genotypes during CMV infection, more specifically by gO1a and gN3b, may play an important role in the virulence of CMV in infected non-HSCT patients.