Resonance assignment of human LARP4A La module

Human LARP4A belongs to a superfamily of RNA binding proteins called La-related proteins (LARPs). Whilst being a positive regulator of protein synthesis and a promoter of mRNA stability, LARP4A also controls cell morphology and motility in human breast and prostate cancer cells. All LARPs share a characteristic RNA binding unit named the La-module, which despite a high level of primary structure conservation exhibits great versatility in RNA target selection. The human LARP4A La-module is the most divergent compared with other LARPs and its RNA recognition properties have only recently started to be revealed. Given the key role of LARP4A protein in cancer cell biology, we have initiated a complete NMR characterisation of its La-module and here we report the assignment of 1H, 15N and 13C resonances resulting from our studies.


Biological context
Human LARP4A is an RNA binding protein (RBP) involved in mRNA stabilisation and translation enhancement, 3′UTR polyA lengthening and miRNA processing (Mattijssen et al. 2017; Nussbacher and Yeo 2018; Yang et al. 2011). As it localises to stress granules, membraneless structures associated with mRNA turnover and protection of mRNA during stress conditions, LARP4A has been suggested to play a role in the stress response (Gilbertson et al. 2018; Yang et al. 2011). LARP4A also regulates cancer cell morphology and motility: its siRNA-mediated depletion has been shown to increase cell migration and invasion, whereas its overexpression promotes cell circularity in breast and prostate cancers (Seetharaman et al. 2016).
LARP4A binds to the 3′ polyA tail of mRNAs, and associates with translating ribosomes and protein partners including RACK1 (Receptor for Activated C Kinase) and PolyA binding protein (PABP) (Yang et al. 2011). How the cellular functions of LARP4A in RNA and tumour biology are mediated by its molecular associations with RNA targets and/or other proteins remains unclear.
LARP4A possesses a La-module, a unique RNA binding unit conserved across all the members of the La-related proteins (LARPs) superfamily and consisting of two domains, the La motif (LaM) and an RNA recognition motif (RRM1) (Bousquet-Antonelli and Deragon 2009; Maraia et al. 2017). Despite sequence conservation, the RNA targets and functions of the La-modules in different LARPs are quite diverse, but the molecular bases of this versatility remain poorly understood. The La-module of the human La protein has been extensively studied at the molecular level and its interactions with the 3′UUU-OH tail of the nascent RNA polymerase III transcripts are well characterised: the LaM and RRM1 act in synergy to accommodate the 3′UUU-OH target, with the LaM establishing the majority of the intermolecular contacts with the RNA (Alfano et al. 2004; Kotik-Kogan et al. 2008; Teplova et al. 2006). An analogous mechanism has been reported for human LARP7 and LARP6 (Martino et al. 2015; Uchikawa et al. 2015).
The La-motifs (LaM) of LARPs exhibit a high degree of sequence conservation across the superfamily, particularly in six key residues identified in human La as prime mediators of RNA recognition, namely Q20, Y23, Y24, D33, F35 and F55 (human La protein numbering). Intriguingly, in human LARP4A Y24 and F55 are replaced by Cys and Met respectively (Merret et al. 2013). Moreover, primary sequence analysis suggests that LARP4A lacks the otherwise conserved wing 2 loop at the C-terminus of the LaM and contains a short interdomain linker between the LaM and the RRM (Martino et al. 2015; unpublished). These distinctive characteristics, divergent from other LARPs, may affect the RNA binding properties of LARP4A. We therefore set out to unveil the structure and the determinants of RNA recognition of LARP4A, to understand its cellular functions and roles in cancer biology. Here, we report the chemical shift assignments of the backbone and side-chain resonances of the LARP4A La-module.

Protein expression and purification
LARP4A La-module, spanning residues 111-287, was cloned into a pET-Duet1 vector (Novagen) with a hexa-histidine tag at the N-terminus, followed by a TEV protease cleavage site. The recombinant protein was expressed in Escherichia coli Rosetta II cells (Novagen) and uniformly labelled with 15N or 15N/13C in minimal media containing 15NH4Cl (1 g/L) and 13C-glucose (2 g/L). The cells were grown to an OD value of 0.6 and induced at 18 °C with 1 mM IPTG (isopropyl β-D-1-thiogalactopyranoside) overnight. The harvested cells were resuspended in a buffer containing 50 mM Tris pH 8, 300 mM NaCl, 10 mM imidazole, 5% glycerol, one Complete protease inhibitor cocktail tablet (Roche), 2 mM phenylmethylsulfonyl fluoride and lysozyme. After sonication and clarification, the lysate was loaded on a 5 mL His-Trap (GE Healthcare) affinity column and the His-tagged protein was eluted with a gradient from 0 to 300 mM of imidazole. The protein was dialysed into a buffer comprising 50 mM Tris pH 7.25, 100 mM KCl, 0.2 mM EDTA, 1 mM DTT and digested with TEV protease at 4 °C overnight. To isolate the un-tagged protein from the protease, tags and non-digested protein, the mixture was applied onto a nickel affinity column (Generon). To eliminate any nucleic acid from the sample, the protein was further purified using a 5 mL Hi-Trap heparin column (GE Healthcare) and eluted with a KCl gradient from 0 to 1 M. The pure protein was dialysed into a buffer containing 20 mM Tris pH 7.25, 100 mM KCl, 0.2 mM EDTA and 1 mM DTT.
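As a bench-side aid, the labelled-media recipe and the elution gradients above can be turned into small calculations. The following is only a minimal sketch: the function names and the culture volume are hypothetical, and the gradient is assumed to be linear, which the text does not state.

```python
# Labelled nutrient concentrations taken from the text (g per litre of minimal medium).
LABELLED_NUTRIENTS_G_PER_L = {"15NH4Cl": 1.0, "13C-glucose": 2.0}


def nutrient_masses(culture_volume_l):
    """Return the grams of each labelled nutrient needed for a culture volume in litres."""
    if culture_volume_l <= 0:
        raise ValueError("culture volume must be positive")
    return {
        name: grams_per_l * culture_volume_l
        for name, grams_per_l in LABELLED_NUTRIENTS_G_PER_L.items()
    }


def linear_gradient(start, end, fraction):
    """Concentration at a fractional position (0 to 1) along a gradient.

    Assumption: the gradient is linear; the protocol only gives its end points.
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie between 0 and 1")
    return start + (end - start) * fraction


if __name__ == "__main__":
    # Hypothetical 2 L culture.
    print(nutrient_masses(2.0))
    # Imidazole concentration (mM) halfway through the 0 to 300 mM gradient.
    print(linear_gradient(0.0, 300.0, 0.5))
```

The same `linear_gradient` helper applies to the 0 to 1 M KCl gradient on the heparin column if the concentrations are passed in molar units.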

Extent of assignment and data deposition
The chemical shift assignment for LARP4A La-module has been deposited in the Biological Magnetic Resonance Bank (http://www.bmrb.wisc.edu/), accession number 27666.
An analysis of the backbone chemical shifts performed with TALOS+ (Shen et al. 2009) revealed that LARP4A La-module contains eight α-helices and seven β-strands distributed between the LaM and RRM1 (Fig. 1b). LARP4A LaM displays the same secondary structure topology found in other LaMs previously described (Alfano et al. 2004; Martino et al. 2015). The backbone and side-chain chemical shifts of the isolated LaM and RRM1 were also analysed, revealing that they remained largely unchanged in the context of the La-module, and suggesting that the two domains do not adopt a rigid orientation relative to one another in solution.

Fig. 1 LARP4A La-module NH amide assignment and secondary structure. a 1H-15N HSQC spectrum of human LARP4A recorded at 800 MHz and 25 °C. Amide group peaks are labelled with the residue type and numbered corresponding to the protein sequence. A closer view of the central part of the spectrum is shown for clarity. b TALOS+ prediction of secondary structure elements for LARP4A La-module. The secondary structure probabilities (red, α-helices; blue, β-strands), plotted against the residue number, are based on backbone HN, N, C′, Cα, and Cβ chemical shifts. Residues for which backbone amide resonance assignments are missing are indicated by asterisks.
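To illustrate how per-residue secondary structure probabilities of the kind plotted in Fig. 1b can be reduced to a count of helices and strands, a minimal sketch follows. It does not read TALOS+ output and is not the procedure used in this work: the function names, the 0.5 probability threshold, the minimum element length of three residues and the toy data are all assumptions made for illustration.

```python
from itertools import groupby


def to_states(probabilities, threshold=0.5):
    """Convert (p_helix, p_strand) pairs into a string of H, E or L states.

    Assumption: a residue is called helix or strand only if that probability
    reaches the threshold; everything else is treated as loop (L).
    """
    states = []
    for p_helix, p_strand in probabilities:
        if p_helix >= threshold and p_helix >= p_strand:
            states.append("H")
        elif p_strand >= threshold:
            states.append("E")
        else:
            states.append("L")
    return "".join(states)


def count_elements(states, min_length=3):
    """Count contiguous runs of H and E that are at least min_length residues long."""
    counts = {"H": 0, "E": 0}
    for state, run in groupby(states):
        run_length = sum(1 for _ in run)
        if state in counts and run_length >= min_length:
            counts[state] += 1
    return counts


if __name__ == "__main__":
    # Toy state string, not LARP4A data.
    toy_states = "LLHHHHHHLLEEEELLHHHHLLEEELL"
    print(count_elements(toy_states))  # {'H': 2, 'E': 2}
```

Applied to a full set of per-residue probabilities, the same two steps would give element counts that depend on the chosen threshold and minimum length, so any comparison with the eight α-helices and seven β-strands reported here should state those settings.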