1H, 13C, 15N backbone resonance assignment of apo and ADP-ribose bound forms of the macro domain of Hepatitis E virus through solution NMR spectroscopy

The genome of Hepatitis E virus (HEV) is 7.2 kilobases long and has three open reading frames. The largest one is ORF1, which encodes a non-structural protein that is involved in the replication process and whose processing is ill-defined. The ORF1 protein is a multi-modular protein which includes a macro domain (MD). MDs are evolutionarily conserved structures found throughout all kingdoms of life. MDs participate in the recognition and removal of ADP-ribosylation, and viral MDs specifically have been identified as erasers of ADP-ribose moieties, which marks them as important players in escaping the early stages of the host immune response. A detailed structural analysis of the apo and ADP-ribose bound states of the native HEV MD would provide the structural information needed to understand how HEV MD is implicated in virus-host interplay and how it interacts with its intracellular partner during viral replication. In the present study we report the high-yield expression of the native macro domain of HEV and its analysis by solution NMR spectroscopy. The HEV MD is folded in solution and we present a nearly complete backbone and sidechain assignment for the apo and bound states. In addition, a secondary structure prediction was performed with TALOS+. The results indicate that HEV MD has an α/β/α topology very similar to that of most viral macro domains.


Biological context
Hepatitis E virus (HEV) is the most common cause of acute viral hepatitis worldwide (Chandra et al. 2010). HEV is a quasi-enveloped virus with a positive single-stranded RNA genome. It is the only member of the genus Orthohepevirus of the family Hepeviridae (LeDesma et al. 2019). According to the World Health Organization (WHO), there are an estimated 20 million cases of HEV infection every year, with 3.3 million symptomatic cases. The virus is transmitted via the fecal-oral or the zoonotic route. The latter is caused by close contact with infected animals or consumption of contaminated undercooked animal products (Doceul et al. 2016; Izopet et al. 2012; Yan et al. 2016). In general, HEV infection is a self-limiting illness which lasts a few weeks. The incubation period is 2 to 6 weeks, after which the symptoms of hepatitis develop, with fever and nausea followed by abdominal pain, vomiting, anorexia, malaise, and hepatomegaly. About 40% of patients develop jaundice (Aslan and Balaban 2020). It is worth mentioning that there is excess mortality among pregnant women and patients with chronic diseases (Chaudhry et al. 2015). In addition to the classical hepatic manifestations, HEV is responsible for extrahepatic disorders such as neurological disorders associated with Guillain-Barré syndrome and neuralgic amyotrophy (Narayanan et al. 2019; Sooryanarain and Meng 2019). No specific antiviral drug or vaccine is licensed globally for chronic hepatitis, underlining the need to develop potent viral inhibitors.
The HEV genome is 7.2 kb long with a 7-methylguanosine cap at the 5′ end and is polyadenylated at the 3′ end. It contains four open reading frames: ORF1, ORF2, ORF3 and ORF4. ORF4 overlaps with ORF1 and its transcription is controlled by an IRES-like RNA structure with an essential role in the proper function of the HEV RNA polymerase (Kenney and Meng 2019). ORF3 codes for a small 13 kDa phosphoprotein (VP13), which enhances RIG-I signaling (Nan et al. 2014a). ORF2 encodes an N-glycosylated 72 kDa protein important for capsid formation, which is an attractive target for HEV infection diagnostics and vaccine development (Nan and Zhang 2016). The largest ORF is ORF1, which occupies about two thirds of the genome and encodes the non-structural protein crucial for viral replication. This protein is composed of several functional domains: a methyltransferase (MeT/MTase), a Y undefined domain, a papain-like cysteine protease (PCP), a proline-rich hinge/hypervariable region (PPR/HVR), a macro domain, a helicase (Hel/NTPase) and an RNA-dependent RNA polymerase (Ojha and Lole 2016b; Wang and Meng 2021).
The HEV macro domain was identified as a putative interferon (IFN) antagonist (Nan et al. 2014b). In addition, its C-terminal region interacts directly with both the MTase and ORF3 proteins (Anang et al. 2016). HEV MD specifically interacts with the light chain subunit of human ferritin and suppresses its secretion in cultured cells (Ojha and Lole 2016a). HEV MD belongs to the ADP-ribose-1′′-monophosphatase (Appr-1′′-pase) family, which catalyses the conversion of ADP-ribose-1′′-monophosphate (Appr-1′′-p) to ADP-ribose (Allen et al. 2003). Recent studies on protein ADP-ribosylation suggested that viral macro domains are able to de-ADP-ribosylate Asp or Glu side chains of host proteins, which brought them into focus as promising therapeutic targets (Fehr et al. 2018; Li et al. 2016).
Table 1 List of NMR experiments acquired on a 700 MHz Bruker magnet, including the main parameters used, to perform the sequence specific assignment of the backbone of HEV MD in the free and ADPR bound forms
In the last decade, progress in understanding the crucial functions carried out by viral MDs has suggested that the MD could be a relevant antiviral target and has stimulated drug design efforts (Brosey et al. 2021; Dasovich et al. 2022; Fu et al. 2021; Ni et al. 2021; Rack et al. 2020).
Here, we present for the first time an almost complete 1H, 13C and 15N resonance assignment of the apo and ADP-ribose bound forms of HEV MD. These assignments should contribute to the understanding of the molecular mechanisms of de-ribosylation and provide starting points for inhibition or protein-protein interaction studies by NMR.
Cell suspension was supplemented with 5% glycerol, 1 mM tris(2-carboxyethyl)phosphine (TCEP) and EDTA-free protease inhibitor cocktail (Sigma-Aldrich). Three freeze-thaw cycles (liquid N2/42 °C) were performed before the sonication step. Cells were then lysed by sonication and the cell debris was cleared by centrifugation (21,000 × g, 45 min, 4 °C). The supernatant was filtered through a 0.25 μm filter and loaded on a 5 mL His-Trap HF column (GE Healthcare) charged with Ni2+. The HEV MD was purified by immobilized metal affinity chromatography (IMAC) and eluted with 200 mM imidazole, 20 mM Na2PO4, pH 8.0, 500 mM NaCl, 1 mM TCEP, 1 mM phenylmethylsulfonyl fluoride (PMSF). The eluted HEV MD was gradually exchanged into the NMR buffer (10 mM sodium acetate, 5 mM EDTA, pH 5.4) using an Amicon Ultra 15 mL centrifugal filter (Merck Millipore) and concentrated to a final volume of 1 mL. The protein was further purified by size exclusion chromatography on an ÄKTA Purifier FPLC system (GE Healthcare) with a Superdex® Increase 75 10/300 GL column (GE Healthcare) pre-equilibrated with 10 mM sodium acetate, 5 mM EDTA at pH 5.4. The protein eluted at a volume consistent with its molecular weight, indicating a monomer. The fractions containing the HEV MD were collected, concentrated to a final volume of 500 μL and stored at −80 °C. For the ADP-ribose bound state, a 100 mM stock solution of ADP-ribose sodium salt (Sigma A0752) was prepared in water. This stock solution was used to prepare the HEV MD-ADP-ribose complex by adding a tenfold molar excess of ADP-ribose to the protein.
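As a rough illustration of the tenfold-excess step, the volume of ADP-ribose stock to add follows from C1·V1 = C2·V2. The sketch below assumes a 0.5 mM protein sample of 500 μL (the sample volume at the point of ligand addition is an assumption, not stated in the text) and neglects the small dilution caused by the addition. The function name is hypothetical.

```python
def ligand_stock_volume_ul(protein_mM, sample_ul, molar_excess, stock_mM):
    """Volume of ligand stock (in microlitres) giving the requested molar excess.

    Neglects the dilution of the sample caused by adding the stock.
    """
    ligand_mM = protein_mM * molar_excess      # target ligand concentration
    return ligand_mM * sample_ul / stock_mM    # C1*V1 = C2*V2 solved for V1


# Assumed example: 0.5 mM protein in 500 uL, tenfold excess, 100 mM stock
volume = ligand_stock_volume_ul(0.5, 500.0, 10, 100.0)
print(f"{volume:.1f} uL of stock")
```

With these assumed values the added volume is small relative to the sample, so the dilution of the protein is minor.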

Data acquisition, processing and assignment
For the NMR experiments, 15N and 13C/15N labelled samples were prepared at a concentration of 0.4 mM for HEV MD in the apo form and 0.5 mM in the ADP-ribose bound form, with a protein to ADP-ribose ratio of 1:10. All samples were in a mixed solvent of 90% H2O and 10% D2O (10 mM sodium acetate, 5 mM EDTA at pH 5.4). 1H chemical shifts were referenced to the methyl signal of 4,4-dimethyl-4-silapentane-1-sulfonic acid (DSS) at 0.0 ppm; 0.25 mM DSS was used as internal standard. 13C and 15N chemical shifts were referenced indirectly to the 1H standard using a conversion factor derived from the ratio of NMR frequencies (Wishart et al. 1995). All NMR experiments were recorded at 298 K on a Bruker Avance III HD 700 MHz NMR spectrometer equipped with a four-channel 5 mm cryogenically cooled TCI gradient probe. All NMR data were processed with TOPSPIN 4.1.1 software and analysed with CARA 1.9.2a4 (Keller 2004). The NMR experiments used for sequence specific assignment are summarized in Table 1.
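The indirect referencing can be sketched as follows: the 0 ppm frequency of each heteronucleus is obtained by multiplying the observed DSS 1H frequency by a fixed frequency ratio. The ratios below are the commonly tabulated values for 13C (relative to DSS) and 15N (relative to liquid ammonia); the DSS 1H frequency used in the example is a hypothetical value for a 700 MHz instrument, and the function names are illustrative only.

```python
# Frequency ratios (Xi) relative to the DSS 1H methyl signal:
# 13C referenced to DSS, 15N referenced to liquid ammonia.
XI = {"13C": 0.251449530, "15N": 0.101329118}


def zero_ppm_frequency_mhz(dss_1h_mhz, nucleus):
    """Absolute frequency (MHz) corresponding to 0 ppm for the given nucleus."""
    return dss_1h_mhz * XI[nucleus]


def shift_ppm(observed_mhz, zero_mhz):
    """Chemical shift in ppm of a signal relative to the 0 ppm frequency."""
    return (observed_mhz - zero_mhz) / zero_mhz * 1e6


# Hypothetical DSS 1H frequency on a 700 MHz spectrometer
dss_1h = 700.13
for nucleus in ("13C", "15N"):
    print(nucleus, round(zero_ppm_frequency_mhz(dss_1h, nucleus), 6), "MHz")
```

In practice the processing software applies this conversion once the 1H reference frequency has been set, so the sketch only makes the arithmetic explicit.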

Extent of assignments and data deposition
The HEV macro domain shares low sequence homology with other MDs (i.e., AF1521, VEEV, CHIKV, SARS-COV1, SARS-COV2), as shown in Fig. 1. Indeed, the sequence identity between HEV MD and other viral MDs is low, at around 20% (23.44% with VEEV MD).
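For readers who want to reproduce such a figure from an alignment, a minimal sketch of a percent identity calculation is given below. It assumes a pairwise alignment supplied as two equal-length strings with "-" for gaps and counts only columns where both sequences have a residue; other conventions for the denominator exist and give somewhat different numbers. The toy fragments are hypothetical and are not the HEV or VEEV sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


# Toy aligned fragments (hypothetical, not the HEV or VEEV sequences)
print(round(percent_identity("MKV-LAGDT", "MRVSLA-ET"), 2))
```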
The 1H-15N HSQC spectra showed well-dispersed amide signals and narrow line widths, indicative of a well-folded monomeric polypeptide, as shown in Fig. 2a for the apo form and in Fig. 2b for the ADP-ribose bound form of HEV MD. In addition, the superposition of the 1H-15N HSQC spectra of HEV MD in the apo and bound states indicated significant chemical shift changes of the cross-peaks upon binding of ADPR, as shown in Fig. 3.
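Chemical shift changes of this kind are often summarized per residue as a combined 1H/15N perturbation, in which the 15N difference is down-weighted before the two are added in quadrature. The sketch below illustrates that calculation; it is not necessarily the analysis used here. The scaling factor of 0.14 is an assumption (other values are also in common use), and the residue labels and peak positions are hypothetical.

```python
import math


def combined_csp(d_h_ppm, d_n_ppm, n_scale=0.14):
    """Combined 1H/15N chemical shift perturbation in ppm.

    n_scale down-weights the 15N difference; 0.14 is an assumed value.
    """
    return math.sqrt(d_h_ppm ** 2 + (n_scale * d_n_ppm) ** 2)


# Hypothetical (1H, 15N) peak positions in ppm, keyed by placeholder labels
apo = {"resA": (8.30, 109.5), "resB": (7.95, 123.0)}
bound = {"resA": (8.42, 110.3), "resB": (7.96, 123.1)}

for residue in sorted(apo.keys() & bound.keys()):
    d_h = bound[residue][0] - apo[residue][0]
    d_n = bound[residue][1] - apo[residue][1]
    print(residue, round(combined_csp(d_h, d_n), 3))
```

Only residues assigned in both states enter the comparison, which is why the loop runs over the intersection of the two sets of labels.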
For the apo form of HEV MD, the analysis of the heteronuclear NMR experiments recorded on the doubly labelled sample, following the conventional backbone and sidechain assignment methodology, resulted in the sequence specific assignment of 93.93% of the resonances of the backbone atoms (HN, N, CO, Cα and Cβ) and 58.41% of the resonances of the sidechain atoms. For the ADP-ribose bound form of HEV MD, we were able to assign 95.22% and 61.63% of the resonances of the backbone and sidechain atoms, respectively. The unassigned HN and N resonances of free HEV MD belong to D810, R812, L817, C818, H819, F821 and T846. All the missing residues belong to loops or unstructured regions, indicating differences in their conformational dynamics that hamper their detection. By contrast, the signals missing in the assignment of the ADP-ribose bound form of HEV MD belong only to residues S807, L817, C818, H819 and F821. The disappearance of the above-mentioned sets of resonances in the two forms might suggest conformational variability and flexibility upon binding.
In order to identify the secondary structure elements of the apo and ADP-ribose bound forms of HEV MD, the chemical shift assignments of the backbone atoms (HN, Hα, Cα, Cβ, CO, N) of each residue in the sequence were analysed with the TALOS+ software (Shen et al. 2009). The secondary structure elements of the free HEV MD protein are organized in an α/β/α sandwich-like fold with β/β/α/β/α/β/β/β/α/β/α/β/α topology from the N- to the C-terminal residues of the native sequence, as presented graphically in Fig. 4. The order of the secondary structure segments is very similar to that of other viral and human MDs (Melekis et al. 2015; Makrynitsa et al. 2015; Lykouras et al. 2018; Tsika et al. 2022). We also report that no significant change in secondary structure elements was identified upon interaction with ADPR (Fig. 3b). The TALOS+ analysis also indicates that HEV MD adopts a fold similar to that of many viral macro domains despite its low sequence similarity (Fig. 1; Makrynitsa et al. 2019; Tsika et al. 2022).
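A topology string such as the one above can be derived from a per-residue prediction by collapsing consecutive residues of the same class into elements. The sketch below assumes the prediction is available as a string of H (helix), E (strand) and L (loop) characters and ignores runs shorter than a chosen minimum length; the function name, the threshold and the example string are all hypothetical and do not reproduce the HEV MD result.

```python
from itertools import groupby


def topology(ss_string, min_len=3):
    """Collapse a per-residue string of H (helix), E (strand), L (loop)
    into an ordered list of elements, ignoring runs shorter than min_len."""
    symbols = {"H": "α", "E": "β"}
    elements = []
    for state, run in groupby(ss_string):
        if state in symbols and len(list(run)) >= min_len:
            elements.append(symbols[state])
    return "/".join(elements)


# Hypothetical per-residue prediction, not the HEV MD result
print(topology("LLEEEELLLEEEELLHHHHHHHLLEEEELL"))
```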
Chemical shift values for the 1H, 13C and 15N resonances of the HEV macro domain in the free state and in the ADPR bound state have been deposited at the BioMagResBank (https://www.bmrb.wisc.edu) under accession numbers 51470 and 51471, respectively.
To summarize, we present in this work a method to produce and purify the native form of recombinant HEV MD in high yield. NMR analysis indicated that the polypeptide is well folded and in a monomeric state. These results will contribute to its 3D structure determination and open opportunities for the development of inhibitors with potential antiviral properties.

Conflict of interest
The authors declare no competing financial interest.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.