Background

Methylated 5'-terminal cap structures have been described in most eukaryotic and many viral mRNAs. In all cap structures, including the 'minimal' cap 0 (m7G(5')ppp(5')N), an N7-methylguanosine (m7G) is attached through a 5'-5' triphosphate bridge to the penultimate nucleoside. Cap 0 is usually synthesized in the nucleus by the sequential action of three enzymatic activities: mRNA triphosphatase, guanylyltransferase (GTase), and m7G-methyltransferase (MTase); this pathway was defined originally during studies of vaccinia and reovirus mRNA synthesis [1,2]. In some molecules, additional 2'-O-ribose methylations are found at the penultimate and the antepenultimate nucleosides, forming the cap 1 (m7G(5')ppp(5')Nm2'O) and cap 2 (m7G(5')ppp(5')Nm2'OpNm2'O) structures, respectively (reviewed in [3,4,5]). The capping apparatus differs significantly in fungi, metazoans, protozoa and viruses in respect of the evolutionary origin and structure of individual domains, and the arrangement of domains within subunits [5]. Hence, the capping enzymes encoded by viral, fungal and protozoal pathogens are attractive targets for specific inhibitors that would exert a limited effect on the host enzyme.

Reoviruses are non-enveloped icosahedrally symmetric viruses with two concentric protein capsids surrounding and protecting the multisegmented, double-stranded (ds) RNA genome; they replicate in the cytoplasm of the eukaryotic host cell (reviewed in [6]) The reovirus core particle can produce m7GpppGm2'OpC(pN)n-OH plus-strand RNA from each genomic double-stranded RNA segment in vitro [2], indicating that it contains all of the enzymes necessary for de novo synthesis of capped mRNA. On the basis of sequence analysis and the results of UV-crosslinking and site-directed mutagenesis it has been suggested that the 144-kDa λ2 protein possesses an S-adenosyl-L-methionine (AdoMet)-dependent methyltransferase (MTase) domain (or domains), which mediates both the cap 0 and cap 1 methylation activities in forming a 5' cap structure on reovirus mRNA [7,8,9]. Nevertheless, the failure of isolated λ2 to exhibit methylation activity limited studies to dissect its putative methyltransferase activities and domain structure at a molecular level.

The structure of the reovirus core has been recently solved at 3.6 Å resolution by Reinisch et al. (entry 1ej6 in Protein Data Bank) [10], revealing a series of seven domains in the λ2 monomer, of which one domain (amino acids 1–385) was identified as a guanylyltransferase and two domains (434–691 and 804–1,022) were found to share the three-dimensional fold with typical AdoMet-dependent MTases (reviewed in [11,12]). In the turret formed by λ2 pentamers, the GTase site from one monomer is closer to the MTase 1 in the clockwise neighbor than to other MTase domains. Reinisch et al. hypothesized that a correlation may exist between the spatial arrangement of domains in λ2, and the order of reactions in a pathway (guanylyl transfer → N7 methylation → 2'O methylation). Thus, it has been proposed that MTase 1 is the cap 0 MTase and MTase 2 is the cap 1 MTase [10]. However, no groove to guide the mRNA from one active site to another could be identified and the problem of specificities of reactions carried out by the individual MTase domains was left open.

Results and discussion

In the course of a large-scale analysis of MTase sequences and structures we observed low similarity between various ribose 2'-O-MTases from eubacteria and negative-strand RNA viruses and a fragment of the grass carp reovirus protein [13] corresponding to the carboxy-terminal part of MTase 1 of human reovirus (J.M.B. and L.R., unpublished data). This prompted us to reanalyze the sequences and structures of the reovirus MTase domains. Disappointingly, sequence database searches carried out using PSI-BLAST [14] initiated with the λ2 sequence and its fragments revealed significant similarities only between the human reovirus and grass carp reovirus proteins. Nonetheless, VAST [15] searches of the Protein Data Bank [16] revealed high similarity of both putative MTase domains to numerous known MTase structures. Detailed inspection of the coordinates superimposed using Swiss-Pdb Viewer [17] allowed us to predict which amino-acid residues might be involved in cap 0 and cap 1 methylation of reovirus mRNA.

According to VAST, MTase 1 showed highest similarity to the RrmJ (1ejo; P-value = 10-10.4) and VP39 (1av6; P-value = 10-8.1) proteins. The mutual similarity of MTase 1 and MTase 2, which can be regarded as a reference, was evaluated as high, albeit significantly lower (P-value =10-4). RrmJ and VP39 are both 2'-O-ribose MTases; RrmJ targets U2552 in eubacterial rRNA [18,19], whereas VP39 is a bonafide cap 1 MTase of vaccinia virus mRNA [20]. Remarkably, simultaneous superposition of the three structures revealed perfect conservation of the K-D-K-E tetrad of putative catalytic residues (Figures 1,2,3), which has been observed by us in other families of genuine and putative 2'-O-MTases ([21] and J.M.B. and L.R., unpublished data). These results strongly argue that MTase 1 is more likely to be a cap 1 MTase than a cap 0 MTase.

Figure 1
figure 1

Stereoview ribbon diagrams of superimposed structures of the MTase 1 domain (green), VP39 (blue) and RrmJ (magenta). MTase 1 shows elaborations of the common fold similar to those of VP39, which are altogether absent from the RrmJ structure and which are predicted to take part in cap recognition and binding. AdoMet (copied from the RrmJ structure) and the K-D-K-E tetrad of putative catalytic residues are shown in wireframe representation.

Figure 2
figure 2

Sequence alignment of the predicted human reovirus cap 1 MTase with its counterpart from grass carp reovirus (GCRV) and vaccinia virus (VP39). Residues that are invariant between the reoviruses and those predicted to participate in binding of the AdoMet, the cap moiety, and in catalysis in all cap 1 MTases, are highlighted in black; other conserved residues are highlighted in gray. Motif nomenclature follows [12].

Figure 3
figure 3

Stereoview of superimposed structures of the MTase 1 domain (ribbon in green, side chains and labels in red) and VP39 (ribbon and labels in blue, side chains in cyan), delineating the proposed binding sites for the guanine moiety and the target ribose 2'-OH. Capped mRNA is shown in orange and the methylated guanine (black label) is sandwiched between Y22 and F180 of VP39. A conformational change of both the protein and the RNA is required for the cluster of conserved aromatic residues of the reovirus MTase to bind the cap structure in a similar manner.

In the MTase 1 structure we could identify no aromatic residues that would superimpose well with Y22 and F180 from VP39, which form enhanced stacking interactions with the N7-methylated cap [20]. Moreover, a region corresponding to the guanine-binding pocket in VP39 is blocked off by a differently positioned loop in the reovirus λ2 protein structure. Nevertheless, we identified a cluster of aromatic side chains (Y460/464, F461/465 and F618/F622) that are invariant between the human and grass carp reovirus sequences and map in the vicinity of the methylated cap, if the mRNA coordinates are copied from the superimposed VP39 structure (Figure 2). It is tempting to speculate that a conformational change in this region occurring on mRNA binding would, for instance, reallocate Y460 and F618 residues of human reovirus to positions equivalent to Y22 and F180 from VP39. The sequence alignment (Figure 3) has been manually adjusted to illustrate this hypothesis.

The MTase 2 structure showed highest similarity to the glycine N-MTase (GNMT; 1xva; P-value = 10-10.1). It is noteworthy that the MTase 2 domain showed lower similarity (P-value = 10-4.3) to VP39 than to GNMT and the MTase 1 domain showed lower similarity (P-value = 10-7.9) to GNMT than to the 2'-O-ribose MTases. Figure 4 shows that in addition to the common catalytic domain, GNMT and MTase 2 share a topologically equivalent 'lid' domain made of three antiparallel β strands. In GNMT, the β-lid domain forms a wall of a large 'molecular basket' structure, which may accommodate a variety of small molecules, including AdoMet, tetrahydrofolate and polycyclic aromatic hydrocarbon molecules such as benzopyrene (reviewed in [22]). In MTase 2, the strands forming the β-lid domain are much shorter and the pocket is also smaller. However, a guanine moiety can be docked into that pocket in a way that the N7 nitrogen is presented for the methyl group donor (data not shown). Most of the amino acids that line up the putative guanine-binding pocket of MTase 2 (for instance Q925, N927, F951, R956, and E958) are conserved between the human and grass carp reovirus sequences (Figure 5).

Figure 4
figure 4

Stereoview of superimposed structures of the MTase 2 domain (green) and GNMT (blue) showing extensive similarities of both domains.

Figure 5
figure 5

Sequence alignment of the MTase 2 domain with its counterpart from grass carp reovirus. Invariant residues are highlighted in black, conservatively substituted residues are highlighted in gray. Motif nomenclature follows [12].

To our knowledge, all structurally characterized MTases that modify bases in nucleic acids and do not employ covalent bond formation with the target use aromatic or aliphatic side chains to bind the base to be methylated and stabilize it in the active site. Examples of such MTases, in which the structure of the active site was determined experimentally or predicted from sequence analysis, include enzymes generating N6-methyladenine in DNA and RNA (reviewed in [23]), N4-methylcytosine in DNA [24], and N2-guanine in RNA [25]. We hypothesize that the invariant F951 and L890 (substituted by F897 in grass carp reovirus) may be involved in van der Waals interactions with an aromatic ring of guanine. But because of the low resolution of the original structure and lack of a precise docking model, the detailed contacts between the target and the enzyme could not be predicted. Nevertheless, we believe that our model will be a useful guide for site-directed mutagenesis experiments that would allow to elucidate the role of individual side chains of the putative cap 0 MTase.

Conclusions

The data presented suggest that MTase domains 1 and 2 of reovirus λ2 protein function as cap 0 and cap 1 MTases, respectively; that is, that the previous assignments should be swapped. On the basis of detailed comparison of the available MTase structures, we predicted the substrate-binding mode for both MTase domains and proposed which residues are likely to participate in catalysis. This hypothesis may be testable using mutagenesis and/or soaking experiments with substrate analogs. We suggest that substitution of the predicted cap-binding or catalytic residues in the MTase 1 domain should result in accumulation of cap 0-methylated viral mRNAs that nevertheless lack cap 1 modification.

We also report for the first time a remarkable structural similarity between the predicted m7G MTase domain and GNMT, suggesting a common evolutionary origin for the two N-MTase families. It is noteworthy that threading analysis and subsequent homology modeling suggest a similar structure for a family of cellular and poxviral cap 0 MTases [26] that show no significant sequence similarity to predicted cap 0 MTases from the Sindbis-like supergroup of positive-strand RNA viruses [27] nor to the predicted cap 0 MTase from reoviruses analyzed herein (J.M.B. and L.R., unpublished data). On the other hand, structure prediction for the family of aminoglycoside-resistance 16S rRNA:m7G MTases suggests that it shares only a common catalytic domain, but not the β-lid domain or predicted target-binding residues, with other m7G MTases [28]. It will be interesting to determine how many times the mechanism of guanine-N7 methy-lation appeared in the evolution of AdoMet-dependent MTases and what is the phylogenetic origin of the individual viral cap 0 MTase families.