Active human full-length CDKL5 produced in the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125

Colarusso, Andrea; Lauro, Concetta; Calvanese, Marzia; Parrilli, Ermenegilda; Tutino, Maria Luisa

doi:10.1186/s12934-022-01939-6

Active human full-length CDKL5 produced in the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125

Research
Open access
Published: 14 October 2022

Volume 21, article number 211, (2022)
Cite this article

Download PDF

You have full access to this open access article

Microbial Cell Factories Aims and scope Submit manuscript

Active human full-length CDKL5 produced in the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125

Download PDF

Andrea Colarusso^1,2,
Concetta Lauro^1,2,
Marzia Calvanese¹,
Ermenegilda Parrilli¹ &
…
Maria Luisa Tutino¹

2092 Accesses
2 Citations
Explore all metrics

Abstract

Background

A significant fraction of the human proteome is still inaccessible to in vitro studies since the recombinant production of several proteins failed in conventional cell factories. Eukaryotic protein kinases are difficult-to-express in heterologous hosts due to folding issues both related to their catalytic and regulatory domains. Human CDKL5 belongs to this category. It is a serine/threonine protein kinase whose mutations are involved in CDKL5 Deficiency Disorder (CDD), a severe neurodevelopmental pathology still lacking a therapeutic intervention. The lack of successful CDKL5 manufacture hampered the exploitation of the otherwise highly promising enzyme replacement therapy. As almost two-thirds of the enzyme sequence is predicted to be intrinsically disordered, the recombinant product is either subjected to a massive proteolytic attack by host-encoded proteases or tends to form aggregates. Therefore, the use of an unconventional expression system can constitute a valid alternative to solve these issues.

Results

Using a multiparametric approach we managed to optimize the transcription of the CDKL5 gene and the synthesis of the recombinant protein in the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125 applying a bicistronic expression strategy, whose generalization for recombinant expression in the cold has been here confirmed with the use of a fluorescent reporter. The recombinant protein largely accumulated as a full-length product in the soluble cell lysate. We also demonstrated for the first time that full-length CDKL5 produced in Antarctic bacteria is catalytically active by using two independent assays, making feasible its recovery in native conditions from bacterial lysates as an active product, a result unmet in other bacteria so far. Finally, the setup of an in cellulo kinase assay allowed us to measure the impact of several CDD missense mutations on the kinase activity, providing new information towards a better understanding of CDD pathophysiology.

Conclusions

Collectively, our data indicate that P. haloplanktis TAC125 can be a valuable platform for both the preparation of soluble active human CDKL5 and the study of structural–functional relationships in wild type and mutant CDKL5 forms. Furthermore, this paper further confirms the more general potentialities of exploitation of Antarctic bacteria to produce “intractable” proteins, especially those containing large intrinsically disordered regions.

Soluble Recombinant Protein Production in Pseudoalteromonas haloplanktis TAC125: The Case Study of the Full-Length Human CDKL5 Protein

Combined effects of double mutations on catalytic activity and structural stability contribute to clinical manifestations of glucose-6-phosphate dehydrogenase deficiency

Article Open access 21 December 2021

Folding and assembly defects of pyruvate dehydrogenase deficiency-related variants in the E1α subunit of the pyruvate dehydrogenase complex

Article 14 February 2018

Background

Cyclin-dependent kinase-like 5 (CDKL5) is a serine/threonine protein kinase involved in the development of the human brain. Dozens of mutations of the CDKL5 gene are causative of CDKL5 Deficiency Disorder (CDD; OMIM 300203; 300672) [1, 2], a severe condition that is manifested with intellectual disability, autistic behavior, motor and visual impairments, infantile-onset refractory epilepsy, and many other symptoms [3]. Although no cure for CDD exists today, some studies proved that the restoration of CDKL5 activity through either protein or genetic intervention can revert CDD symptoms in mice and human models [4,5,6]. To clearly understand the applicability of these measures, however, a deeper knowledge of CDKL5 biology, its regulation, and the genotype-phenotype relationship of each CDD mutation should be acquired. For instance, we do not know whether some CDD mutations are dominant negative. If so, either enzyme replacement therapy (ERT) or gene addition may not be equally beneficial to all CDD patients. Another important question that should be addressed involves the levels of kinase activity that should be reached not to have a detrimental effect on CDD patients. Recently, the hyperphosphorylation of CDKL5 T169 has been demonstrated to induce the kinase hyperactivation which is a stress signal that triggers cell death during acute kidney injury [7]. This outcome would suggest that the presence and the abundance of post-translational modifications (PTMs) in CDKL5 preparations could significantly alter the therapeutic potential of a CDKL5-based ERT.

These observations indicate that multiple complementary tools to purify and study CDKL5 should be developed. Bacterial expression systems are desirable both for preparative and basic research purposes. First, the PTMs of the recombinant protein expressed in a bacterium would be limited to the phosphorylations dependent on the kinase autocatalytic activity, and a comparison with CDKL5 isolated from eukaryotic sources would make it possible to understand the role of other PTMs and their impact in an ERT. For similar reasons, a prokaryotic platform can be a valid tool to support the research in finding CDKL5 interactors, substrates, and upstream regulators. Different groups have already identified some CDKL5 substrates both in the cytoplasm [8, 9] and in the nucleus [10] using human cell cultures. Furthermore, a yeast two-hybrid screening has provided a list of proteins potentially interacting with the C-terminal extremity of CDKL5, although just two have been disclosed [11, 12]. We think that a reliable bacterial system can be useful to speed up the validation process of potential interactors and/or other putative substrates present in such lists. This aspect is crucial in the case of protein kinases since they generally can activate phosphorylation cascades and distinguishing each phosphorylation step from another can be troublesome in a eukaryotic context. In this regard, bacteria have often been useful in the study of eukaryotic kinases, from Cobb’s pioneering work about the reconstruction of a MAP kinase cascade in Escherichia coli [13] to more recent examples involving the characterization of the activation process of ciliary kinases through the comparison of protein preparations from different recombinant sources [14, 15]. The recombinant production of full-length CDKL5 (flCDKL5) is a challenging task, though. Its gene encodes a transcript that is subjected to alternative splicing leading to the production of five isoforms whose functional differences, if present, have not been explored [16]. However, the most abundant CDKL5 variant in the brain is isoform 1, followed by isoforms 2 and 3, and to a very little extent by isoform 4. Whereas isoform 5—which was the most studied one in early literature—is the bigger variant, but is exclusively expressed in the testis [16]. From a compositional point of view, all five isoforms are similar to each other since their cognate transcripts just differ for a few exons. For this reason, from now on we will generally refer to isoform 1 with the name flCDKL5. As recently highlighted by an in silico analysis and schematized in Fig. 1, flCDKL5 is predicted to be mainly a disordered protein, except for its N-terminal catalytic domain [17]. In support of this hypothesis, flCDKL5 has been recently demonstrated to interact with poly(ADP-ribose) with its C-terminal extremity [10], a typical feature of proteins possessing intrinsically disordered regions (IDRs) [18]. Proteins harboring extended IDRs are often problematic to be recombinantly expressed in and purified from conventional biotechnological platforms due to their propensity to either be degraded or to condensate [19]. flCDKL5 seems to fall in this category of difficult-to-express proteins. Even if its kinase catalytic domain has been successfully expressed in insect cells, yeast [20] and even E. coli [21], reports of recombinant production of flCDKL5 are quite scarce. Worth mentioning is the co-expression assay of flCDKL5 with two different substrates that Muñoz and co-workers established in HEK293 cells to evaluate the impact of CDD mutations [8]. Nevertheless, no examples of flCDKL5 satisfying expression in bacterial cells are available. The protein is insoluble in E. coli and denaturation and refolding procedures failed to achieve the active protein [22]. Regardless of this significant drawback, Katayama and Inazu managed to develop an ingenious assay to observe the flCDKL5 autophosphorylation activity on E. coli insoluble fractions, although such an approach would make it difficult to study the interaction of the kinase with other proteins [23]. As a result, a bacterial system that allows the soluble accumulation of flCDKL5 is desirable to allow both the validation of putative interactors with a co-expression assay and to obtain the active protein in native conditions after the cellular lysis.

To pursue this objective, we here present the genetic tools we have developed to produce flCDKL5 in the model Antarctic bacterium Pseudoalteromonas haloplanktis TAC125 (also named P. translucida TAC125). This prokaryote has already proven to be a useful alternative to conventional bacteria for the synthesis of difficult-to-express eukaryotic proteins [24,25,26]. The production of flCDKL5 isoform 5 has been tested in this bacterium but the overall yield was extremely low and the cultivation temperature of 0 °C was needed to preserve the protein during its synthesis [27]. However, very recently a model developed to study the P. haloplanktis TAC125 metabolism [28] has been specifically applied to optimize the production of flCDKL5 isoform 1, which is the most abundant in the human brain [29]. Here, we report the design of two sets of plasmids: the first set has been iteratively optimized to produce flCDKL5 isoform 1 fused with multiple tags for its detection and isolation; the second set has been designed to allow the co-expression of flCDKL5 with any one of its either putative or confirmed protein substrates. The former plasmid will be used to produce flCDKL5 for future applicative purposes, while the second will be used for basic research, e.g. either to study the effect of CDD mutations on flCDKL5 activity on known substrates or validate flCDKL5 putative interactors discovered with independent assays. As proof of concept, we here demonstrate that, by using the first set of plasmids, flCDKL5 can be produced mainly in intact and soluble form in P. haloplanktis TAC125 and that, after enrichment in native conditions, the kinase is catalytically active. Furthermore, by using the second set of plasmids, we co-expressed the 10 flCDKL5 CDD mutants shown in Fig. 1 with the CDKL5 bonafide protein substrate EB2 [8, 9] and demonstrated that this bacterial tool can effectively measure the impact of each mutation on the enzymatic activity. Our future goal is to use these two platforms to extend the knowledge of CDKL5 biology and to achieve protein purification for ERT. For this reason, all the constructs presented in this work possess an N-terminal TATκ peptide that has already been used to vehiculate flCDKL5 across the blood-brain barrier [4].

Results and discussion

A multiparametric strategy to optimize full-length CDKL5 expression in Pseudoalteromonas haloplanktis TAC125

To test if P. haloplanktis TAC125 can be used to both express full-length CDKL5 (flCDKL5) for preparative purposes and as an orthogonal system for studying flCDKL5 functions and disease-related alterations, the protein kinase was produced both alone and together with one of its bonafide substrates, EB2 [8, 9] so as to carry out an in cellulo kinase assay. To do so, we first optimized the production of flCDKL5 alone, and then we applied the acquired knowledge to set up the co-expression assay. As schematized in Fig. 2a, six major steps were faced to progressively improve the production of intact flCDKL5 alone.

First, to express flCDKL5 we tested P. haloplanktis TAC125 monocistronic low copy number plasmids that were already available [30, 31]. In this phase, we focused on the selection of the best inducible promoter (Step 1 and Fig. 2b, panel (I)) and the resolution of an unforeseen abortion of mRNA translation (Step 2).

In the second part of this work, flCDKL5 synthesis was tested in bicistronic plasmids (BCDs, BiCistronic Designs), exploiting the translational coupling of the gene of interest with an optimized upstream ORF to increase the rate of translation initiation (Fig. 2b, panel (II)) [32]. This approach was pursued to increase the mRNA translation efficiency and stability. In this phase, we screened different combinations of upstream ORFs (Leader in Fig. 2b) and Shine Dalgarno sequences upstream of the Leader sequence (SD1) and the flCDKL5 encoding sequence (SD2, Step 3). We also overcame an internal translation start (Step 4) and optimized the disposition of N- and C-terminal tags to preserve flCDKL5 integrity (Step 5). Finally, the overall yield of flCDKL5 was significantly increased by the replacement of the replication origin of the plasmid with a new one that guaranteed an increased copy number (Step 6).

flCDKL5 expression with monocistronic plasmids

To define whether flCDKL5 can be produced and is detectable in recombinant Antarctic bacteria, we started this work with a conventional monocistronic asset. The flCDKL5 gene was designed to encode the human CDKL5 isoform 1—also called 107 variant [16]—with multiple tags. It possessed an N-terminal TATκ peptide that can be exploited for intracellular delivery [4], and tandem 3xFLAG and 6xHis C-terminal tags. Codon composition was automatically optimized for P. haloplanktis TAC125 by using the OPTIMIZER tool with the guided random method [34]. This CDKL5 variant was named 107 (B) and its expression was attempted in P. haloplanktis TAC125 cells using either a D-galactose inducible (pMAV) [30] or an IPTG inducible plasmid (p79C) [31], harboring a weak and strong regulatable promoters, respectively [31]. Regardless of their different expression mechanisms, the two plasmids share a common backbone and, most notably, harbor the same replication origin (OriR) derived from the P. haloplanktis TAC125 endogenous plasmid pMtBL [33]. Since pMtBL was demonstrated to be stably inherited as a single copy plasmid in Antarctic cells thanks to its partition sequences [35], we checked for the plasmid copy number (PCN) and stability of the shuttle recombinant vector p79C which is devoid of such regions. As shown in Additional file 1: Fig. S1a, the recombinant plasmid was stably kept with an average of 2–3 PCN during a 40-hour culture at 15 °C when the antibiotic selection was applied. Furthermore, IPTG induction did not seem to cause any instability (Additional file 1: Fig. S1b). Once the plasmid stability had been ascertained, 107 (B) production in the Antarctic bacterium was analyzed. After recombinant induction, 107 (B) production was not noticeable in P. haloplanktis TAC125 using SDS-PAGE on total lysates derived from either pMAV- or p79C-mediated expression (3a, left panel).

However, positive signals could be detected by anti-CDKL5 Western blot in both strains, proving that the target protein was synthesized, although at low levels (Fig. 3a, right panel). Furthermore, even if a signal compatible with flCDKL5 was visible in the Western blot analysis, a similarly intense band with a lower molecular weight was produced both by pMAV and p79C plasmids. Given that the used antibody targets the first half of the protein, this truncated product probably lacks the C-terminal extremity. To identify the molecular basis of this truncation, we analyzed the nucleotide sequence of the automatically codon-optimized 107 (B) gene and we noticed the occurrence of an A-rich region at about three-quarters (Fig. 2b, panel (I)). This polyA stretch corresponds to codons for six consecutive lysines, given that AAA is the most common triplet for this amino acid in P. haloplanktis TAC125. Since multiple authors have reported that A repetitions in mRNAs induce ribosome stalling, sliding, and accidental frameshift during protein synthesis [36,37,38], we modified this sequence by replacing five AAA codons with synonymous AAG. This new flCDKL5 variant was named 107 (G) and its expression profile was compared with the one of 107 (B). As shown in Fig. 3b, the main truncation product visible in the pMAV-107 (B) bearing strain was not produced by cells hosting either pMAV-107 (G) or p79C-107 (G). Furthermore, the IPTG inducible plasmid guaranteed a higher accumulation of the target protein over time than the D-galactose-dependent vector, a result that is in agreement with the different strengths of the two promoters [31]. For these reasons, we focused on the improvement of the p79C plasmid and the 107 (G) construct for further studies.

Bicistronic cassettes and site-specific mutagenesis to overcome an internal translational start

In the second phase of this study, we wanted to address two questions: can flCDKL5 production be increased by translational regulation? Is there an ideal disposition and combination of tags to preserve flCDKL5 extremities? As flCDKL5 is a high molecular weight protein (107 kDa) with extensive disordered regions [17], we cannot exclude possible either N- or C-terminal truncations just by electrophoretic migration. This is a non-trivial issue in the case of the here presented flCDKL5 variants since they are characterized by two flexible ends, the N-terminal TATκ and the C-terminal intrinsically disordered region (IDRs). IDRs can be a target of proteolysis and impact the half-life of proteins, but their fusion with tags can change their overall stability [39].

Hence, the second phase of our study aimed to examine and possibly improve these aspects contextually by using a technique that could both modulate translational efficiency and test different combinations of tags.

The tool we chose for this purpose is the bicistronic cassette (BCD) that introduces short optimized coding sequences upstream of the heterologous gene of interest to optimize the start of translation [32]. This approach has been used several times in different recombinant bacteria in the past and has demonstrated that the translation of a heterologous ORF can be modulated by the translational coupling with an upstream optimized short ORF. The application of this technology to finetune a psychrophilic translation system to produce a eukaryotic protein normally synthesized at 37 °C is reasonable given that the mRNA translation is differently regulated in bacteria and mammals [40] and that the temperature is likely to play a pivotal role in the kinetics of protein synthesis and folding. Given that protein synthesis is mainly limited by translation initiation and early elongation [41], we reasoned that the BCD strategy could serve as an insulator to optimize flCDKL5 synthesis so as to avoid too many interventions on the gene composition. Consequently, we implemented this technology in P. haloplanktis TAC125 using the work by Mutalik and coworkers as a model both for the design of the plasmids—as indicated in the Methods section—and for the nomenclature of the bicistronic designs [32]. Furthermore, the adopted cloning strategy allowed for the easy permutation of tags at the two extremities of flCDKL5 (see the Methods section). For these reasons BCD vectors were the most suitable tool to simultaneously analyze the effect of tags and translational control on flCDKL5 production and preservation.

The main components of a BCD cassette are: a 5’ untranslated region (5’ UTR) harboring a first SD sequence (SD1); a short open reading frame (ORF) that encodes a peptide optimized for initial translation (Leader sequence) and embeds a second SD sequence (SD2); the gene of interest whose translation driven by SD2 is coupled with the translation of the Leader sequence thanks to the overlap of a Stop and Start codons (Fig. 2b, panel (II)). The impact of variable BCD configurations on protein expression was assessed both using plasmids for the syntheses of a fluorescent reporter, pGFP [31], and a parallel set for the production of flCDKL5 (Additional file 1: Table S1 and Table 1, respectively).

Table 1 Characteristics of BCD constructs for flCDKL5 expression

Full size table

In the case of pGFP, we initially tested the first 19 residues of P. haloplanktis TAE79 LacZ as a Leader peptide, given the high expression levels of this psychrophilic β-galactosidase in P. haloplanktis TAC125 [31]. The two BCD vectors that were generated differed only for the SD2 sequence. In pBCD1, SD2 was the same as SD1, that is the natural SD sequence upstream of the lacZ ORF. In pBCD2, SD2 was optimized to embed a region that is perfectly complementary to the 3’-OH end of P. haloplanktis TAC125 16S rRNA (Additional file 1: Table S1). When the expression levels driven by these two bicistronic vectors were compared with a monocistronic cassette (p79C-pGFP), we could detect pGFP accumulation in all the cases, but pBCD2-pGFP guaranteed a significantly higher fluorescence suggesting that the optimized SD2 sequence was pivotal to increase the pGFP translation rate (Additional file 1: Fig. S2a). We also tested a different SD1—Leader combination to produce pGFP. We aimed to replace the theoretically weak SD of lacZ in position SD1 with an SD that is perfectly complementary to 16S rRNA by using an ORF that naturally possesses an optimized SD in P. haloplanktis TAC125 genome. The ORF we selected is in the trpA gene (PSHA_RS06350) of the tryptophan operon. The new plasmid was named pBCD3-H6-pGFP (Additional file 1: Table S1) and the levels of pGFP that could be accumulated were compared to pBCD1 and pBCD2 configurations. As visible in Additional file 1: Fig. S2a, although pBCD3 performed better than pBCD1, it was worse than pBCD2. This indicates that both SD1, SD2, and the Leader sequence play a role in the accumulation of the protein encoded by the second cistron. However, while the effect of SD2 seems predictable, more subtle variables may influence the roles of SD1 and the Leader cistron, as expected. The role of BCD constructs is to optimize the expression of the second cistron independently of its codon composition thanks to the insulating effect of the first cistron. On the other hand, the translation starting from the first cistron must be optimized with thorough screening, due to the lack of any insulator [32]. The addition of N-terminal 6xHis-TATκ tags did not significantly affect pGFP expression in the BCD2 configuration, indicating that multiple N-terminal tags can be added with no major detrimental effects on protein accumulation (Additional file 1: Fig. S2b). Overall, the data acquired with the fluorescent reporter indicate that BCD2 is the best bicistronic configuration we developed for recombinant expression in P. haloplanktis TAC125 and that the addition of N-terminal tags does not seem to perturb the pGFP production levels when this asset is used.

In the case of flCDKL5, the achievement of protein accumulation was less straightforward than pGFP. We tested the three bicistronic configurations (BCD1, BCD2, BCD3) that were explored for pGFP expression (Table 1). Furthermore, flCDKL5 was originally produced with two different dispositions of tags. The most relevant traits are that variant 107 (G) has C-terminal 3xFLAG and 6xHis, while 107 (K) harbors an N-terminal 6xHis and a C-terminal 3xFLAG tags (Table 1 and Fig. 4a). Anti-CDKL5 and anti-FLAG Western blots showed that pBCD2 and pBCD3 configurations gave rise to the production of a high molecular weight band that is not generated by either the monocistronic vector or pBCD1 (Fig. 4b, left and middle panels).

Apparently, p79C and pBCD1 plasmids only generated a truncated version of flCDKL5, while pBCD2 and pBCD3 allowed the synthesis of both the truncated and the putatively intact forms of flCDKL5 both as 107 (G) and 107 (K) variants. Given that such two bands are detected by the anti-FLAG antibody and that both 107 (G) and 107 (K) have a C-terminal 3xFLAG tag (Fig. 4a), the lower molecular weight band is probably N-terminally truncated. This hypothesis is also suggested by the anti-His Western blot that detected positive signals for all the strains but pBCD1-107 (K, Fig. 4b, right panel). 107 (G) was always reactive because the 6xHis tag is C-terminal in this flCDKL5 variant. On the other hand, the 107 (K) variant has an N-terminal 6xHis tag and the lack of any anti-His detection in the pBCD1-107 (K) bearing strain is indicative of the fact that in this condition flCDKL5 is exclusively synthesized in an N-terminally truncated form (detectable by anti-FLAG and anti-CDKL5 antibodies). Since the three BCD configurations just differ for elements regulating the start of flCDKL5 translation, we reasoned that such variable fragmentation patterns might be due to a translational bias. We formulated two hypotheses: (1) flCDKL5 was always N-terminally proteolyzed in P. haloplanktis TAC125, but BCD2 and BCD3 allowed for the accumulation of the intact form of the protein thanks to higher production levels (see Additional file 1: Fig. S2 for a comparison with pGFP accumulation); (2) the flCDKL5 encoding gene harbored an internal translation start that overrode the first translation start unless the latter is optimized like in the BCD2 and BCD3 configurations. A new analysis of the flCDKL5 encoding gene pointed toward the second hypothesis since it highlighted the possible existence of a cryptic internal translation initiation at the level of M10 according to the coordinates of the human CDKL5 isoform 1 (O76039, Uniprot). Although a strong Shine Dalgarno sequence could not be found upstream of the ATG encoding M10, a truncation at this level is compatible with the small electrophoretic difference between intact flCDKL5 and truncated flCDKL5 forms in our Western blot analysis (Fig. 4b).

To test this hypothesis of internal translation initiation, we planned multiple experiments. First, we designed a new flCDKL5 variant possessing a bulky tag between the N-terminal 6xHis and the CDKL5 wt residues to make the truncated product more easily distinguishable from intact CDKL5. This construct was named 107 (L) and harbors N-terminal 6xHis-Sumo tags and a C-terminal 3xFLAG (Fig. 4a). This modification served also to test our first hypothesis (i.e. whether flCDKL5 was truncated due to degradation) since the fusion of flexible extremities with globular domains can change their propensity to degradation, as already mentioned [39]. Then, we introduced site-directed mutations in 107 (G), 107 (K), and 107 (L) flCDKL5 variants to reveal the putative role of M10 as an internal translation start. In particular, we abolished the translation initiation from the first residue of 107 (G) by eliminating the first ATG and introducing a STOP codon in the second position (pBCD2-107 (G) STOP in Table 1). As revealed by anti-CDKL5 and anti-FLAG Western blots (Fig. 4c, left and middle panels, respectively), the truncated form of flCDKL5 was still produced by pBCD2-107 (G) STOP bearing cells, while the upper band was absent, as expected. This outcome suggests that an internal translation start occurred. Then, we replaced M10 of human CDKL5 with a valine to have an insight into its role in flCDKL5 fragmentation. Although such residue has different coordinates depending on the protein variant (43, 61, and 159 for 107 (G), 107 (K) and 107 (L), respectively), we will always name such mutation as M10V in reference to human CDKL5 wild type coordinates. As visible in Fig. 4c, such mutations abolished the synthesis of the truncated fragment in 107 (G) M10V and 107 (K) M10V producing strains. In the case of 107 (L) M10V, a lower molecular weight fragment at the level of the untagged protein was visible in anti-CDKL5 and anti-FLAG Western blots, but the intensity was low and comparable with all the other truncated fragments in the same lane. This suggests that such a product is unlikely to be caused by the internal translation initiation. An anti-FLAG Western blot specifically comparing the production profiles of pBCD2-107 (L) and pBCD2-107 (L) M10V strains proved this point (Additional file 1: Fig. S3). This result demonstrates that the addition of the globular Sumo domain at the N-terminal extremity of flCDKL5 did not resolve its fragmentation and that such fragmentation is unlikely to be due to proteolysis. Nevertheless, the 107 (L) M10V variant guaranteed better preservation of the N-terminal 6xHis tag than the 107 (G) and 107 (K) proteins (Fig. 4c, right panel). This outcome may be since in the latter variants the 6xHis tag is directly fused to extremely flexible regions (i.e. the C-terminal intrinsically disordered region in 107 (G) and the flexible N-terminal TATκ in 107(K), Fig. 4a).

As the last proof of an internal start occurring in flCDKL5 mRNA, we wanted to obtain the N-terminal amino acid sequence of the fragmented protein by using Edman sequencing. Since flCDKL5 is a difficult-to-purify protein, we tried to isolate such a protein fragment from a preparation of the CDKL5 catalytic domain only. We decided to produce this shortened version of CDKL5 in Escherichia coli rather than P. haloplanktis TAC125 given that previous reports demonstrate that this cell factory can be effectively used to isolate the CDKL5 catalytic domain [21]. To do so, we generated a short ORF encoding CDKL5(1-352) with a C-terminal His tag, performing a PCR on the 107 (L) encoding gene. Such construct was then expressed in E. coli BL21(DE3) cells using a pET system and the recombinant protein was isolated with an IMAC. As expected, the truncated fragment was co-purified and its N-terminal sequence was MNXF according to Edman degradation, where X is an undetermined amino acid. This indetermination was due to the little quantity of the protein fragment that could be recovered, but such a sequence is only compatible with a start in the M10 position in the CDKL5 protein, as expected. Hence, our results indicate that an internal translation start occurs when our engineered CDKL5 construct is expressed in bacteria which is typical of some eukaryotic cDNAs expressed in prokaryotes [42].

Based on this body of work, we selected pBCD2-107 (L) M10V as the best construct we tested for flCDKL5 expression. pBCD2 was preferred over pBCD1 because the latter only produced a truncated form of flCDKL5 (Fig. 4b). Moreover, pBCD2 was better than pBCD3 because the former guaranteed a higher accumulation of the full-length protein, as visible by anti-His Western blot (Fig. 4a, right panel) and confirmed by the pGFP results (Additional file 1: Fig. S2a). Furthermore, the pBCD2 configuration guaranteed increased stability of the flCDKL5 mRNA in comparison with the monocistronic p79C plasmid (2.73 ± 0.1 fold change), as demonstrated by qRT-PCR measurements that are in agreement with previous studies from other authors [43]. The 107 (L) variant was chosen because of the Sumo protective role on the N-terminal 6xHis tag (Fig. 4c, right panel). Finally, the M10V mutation was kept because it provided a solution to avoid the internal translational start. The presence of two different affinity tags at the two extremities of the protein constitutes a further advantage in sight of the future purification of flCDKL5. Double tagging has indeed been successfully used for the purification of both multidomain and intrinsically disordered proteins that are prone to fragmentation [19].

Overall, the multiparametric approach used to optimize flCDKL5 in P. haloplanktis TAC125 proved that the bicistronic strategy was pivotal to increase the yield of intact flCDKL5 from virtually absent to an amount detectable by Western blot. Nevertheless, BCDs were not sufficient to overcome an internal translational start, suggesting that the translation from the first and the internal ATGs are not in competition. This datum is corroborated by the fact that the abolition of translation from either the first ATG (pBCD2-107 (G) STOP in Fig. 4b) or the internal ATG (M10V mutants in Fig 4b) did not increase the production levels of the truncated and intact proteins, respectively, which is in agreement with a previous study [44]. Our hypothesis of testing multiple N- and C-terminal fusions with affinity tags proved to be valid because it allowed for better discrimination of flCDKL5 fragments during protein synthesis and proved that a bulky domain like Sumo can increase the preservation of the N-terminal 6xHis tag. This approach should be extended to the flCDKL5 C-terminal IDR in the pursuit of increased stabilization. Even if the main truncated fragment of flCDKL5 during recombinant expression in P. haloplanktis TAC125 was due to a translational issue, our Western blots highlight the existence of other numerous low molecular weight fragments though considerably less abundant. This pattern may be ascribable to the fragility of the flCDKL5 C-terminal tail and the fusion with a terminal globular domain could lessen its impact.

Application of high-copy number plasmids for the expression of flCDKL5 and evaluation of the kinase activity

Very recently a library of replication origins with variable PCNs has been established for plasmid maintenance in P. haloplanktis TAC125 (Calvanese M. et al., manuscript in preparation). After we had optimized the flCDKL5 expression cassette, we moved it into a high copy number backbone named pB40 with an average PCN of ~ 100. The comparison of the expression profiles of the Antarctic cells bearing the original low copy number monocistronic plasmid p79C-107 (G) with the ones hosting the high copy number bicistronic plasmid pB40-BCD2-107 (L) M10V revealed a strong difference.

While the protein achieved from the original construct was barely detectable by anti-CDKL5 Western blot, the optimized plasmid guaranteed a considerable higher accumulation of the target protein that was visible both by Coomassie staining after SDS-PAGE and by Western blot (Fig. 5a, left and right panels, respectively). Furthermore, when chemical-enzymatic lysis and fractionation of crude extracts were carried out in native conditions (see Materials and Methods for the details), most flCDKL5 appeared to be soluble (Fig. 5b). After a chromatographic enrichment with an anti-Flag resin, the recombinant kinase was tested for its catalytic activity by using an in vitro kinase assay with EB2 as a protein substrate. Briefly, 100 nM enzyme was incubated with 200 nM substrate in the presence of MgATP at 30 °C for 30 min. After protein inactivation with Laemmli buffer at 70 °C, the CDKL5-mediated phosphorylation of EB2 S222 was detected via Western blot with a specific antibody [9]. Remarkably, recombinant flCDKL5 enriched from P. haloplanktis TAC125 showed enzymatic activity on EB2, though it was less active than a commercial preparation of the CDKL5 catalytic domain from insect cells (CDKL5 ΔC, Fig. 5c). On the other hand, a catalytically inactive variant, flCDKL5 KD, did not phosphorylate EB2, as expected (Fig. 5c). The discrepancy between flCDKL5 and CDKL5 ΔC activities may be ascribable to more factors. First, the flCDKL5 C-terminal region is known to play an inhibitory effect on the enzymatic activity [7, 45, 46]. Second, the protein produced in the eukaryotic system may harbor higher phosphorylation levels that cannot be reached in a bacterial system. These two phenomena need to be dissected in the future by using CDKL5 ΔC purified from bacteria and flCDKL5 obtained from a eukaryotic system as controls, which is currently unavailable from commercial sources.

Besides these limitations, this work demonstrated that an increased gene dosage was needed to have satisfying flCDKL5 accumulation in P. haloplanktis TAC125 at 15 °C, a prerequisite that allowed the recovery of enough protein to assess its activity in vitro. To the best of our knowledge, no other bacterial platform has guaranteed to obtain flCDKL5 in native conditions as an active enzyme so far.

Use of TCD plasmids to measure the impact of CDKL5 pathogenic mutations

As the last proof of concept study, we planned to co-express flCDKL5 with one of its substrates to demonstrate that this prokaryotic platform can be used to validate CDKL5 substrates and model CDD mutations. The plasmid configuration optimized with the BCD2 design was converted into a tricistronic plasmid for the co-expression of EB2 and flCDKL5. As schematized in Fig. 6a, the TCD plasmid triggered the production of a polycistronic mRNA encoding the LacZ leader peptide, EB2 and flCDKL5.

To test whether this system can be used to reliably measure flCDKL5 catalytic activity through an in cellulo kinase assay, EB2 was co-expressed either with flCDKL5 or flCDKL5 KD, a catalytically inactive variant. As visible in Fig. 6b (left panel), the synthesis of EB2 and flCDKL5 could be contextually detected with an anti-His antibody and was consistent through biological triplicates. Furthermore, specific CDKL5-mediated phosphorylation of EB2 could be detected only when active flCDKL5 was produced. This result confirms that this assay can be used to undoubtedly measure CDKL5 specific activity.

Most of the patients affected by CDKL5 Deficiency Disorder (CDD) suffer from refractory epilepsy, hypotonia, intellectual and motor disabilities, and visual impairments [3]. However, the severity of these symptoms is variable, and clear genotype-phonotype correlations are scarce. For this reason, we wanted to test if our co-expression assay can be a valid tool to model CDD CDKL5 variants at the molecular level. To do so, 10 pathogenic flCDKL5 missense variants were co-expressed with EB2 using TCD plasmids, and EB2 phosphorylation levels were quantified. All flCDKL5 variants were hypoactive and the levels of EB2 cross-phosphorylation were variable, indicating a different impact of each mutation on flCDKL5 functionality (Fig. 7). This kind of study can be extended in the future to other known CDKL5 substrates to have a deeper understanding of CDD pathogenesis.

Conclusions

Our multivariate approach to optimize the production of flCDKL5 in P. haloplanktis TAC125 at 15 °C led to the accumulation of the target protein mainly as an intact, soluble, and active form, making feasible further developments for its purification and formulation. Furthermore, the co-expression assay with EB2 demonstrated two main points: (1) flCDKL5 is active when properly produced in bacteria; and (2) CDD missense variants can be modeled and discretized based on their hypoactivity. Such an approach will be useful in the future to support the reconstruction of CDKL5-mediated networks and to characterize the molecular effect of CDD mutations.

In addition, our results further support the more general potentialities of exploitation of the Antarctic bacterium to produce “intractable” proteins such as those endowed with a folded N-terminal domain and a long disordered tail, a common trait in the human kinome. Notably, all RCK kinases that are involved in cilia biology share such property, and our work demonstrate that P. haloplanktis TAC125 could be a suitable host to test their expression and purification for further characterization.

Methods

Bacterial strains and culture conditions

E. coli TOP10 was used for cloning purposes, while E. coli S17-1(λpir) was employed in intergeneric conjugations as a donor strain for P. haloplanktis TAC125 transformations. E. coli BL21(DE3) was used for the recombinant production of the catalytic domain of human CDKL5 and for the synthesis of human EB2.

The P. haloplanktis TAC125 strain that was used for the expression of flCDKL5 alone is KrPl LacY⁺ which lacks its endogenous plasmid pMtBL and the lon gene, and constitutively expresses the E. coli lactose permease [31]. For the double production of EB2 and flCDKL5, the KrPl strain was used, instead [31]. The induction of recombinant expression was performed at 1 OD with either 10 mM D-galactose or 5 mM IPTG for pMAV and p79C-derived plasmids, respectively. If not differently specified, the recombinant expression was protracted for 4 h.

The E. coli strains were grown in Lysogen broth (LB, 10 g/L bacto-tryptone, 5 g/L yeast extract, 10 g/L NaCl) at 37 °C with 200 rpm agitation. The BL21(DE3) recombinant strains were induced in mid-exponential phase with 0.1 mM IPTG at 15 °C for 16 h. P. haloplanktis TAC125 was grown in a rich medium (16 g/L bacto-tryptone, 16 g/L yeast extract, 10 g/L NaCl) during the preinocula development, and in a synthetic medium named GG for recombinant expression [30]. In both cases, the bacterial cultures were carried out at 15 °C with 200 rpm agitation.

For the selection of E. coli recombinant strains, either 100 µg/mL ampicillin (for pMAV derived plasmids) or 34 µg/mL chloramphenicol (for p79C derived plasmids) or 50 µg/mL kanamycin (for pET40-b derivatives) was used, depending on the selection marker. In the case of recombinant P. haloplanktis TAC125, chloramphenicol was used at 12.5 µg/mL in solid media and 25 µg/mL in liquid media, while ampicillin was always used at 100 µg/mL concentration.

Plasmids construction

Three genes encoding different flCDKL5 variants were first cloned into two monocistronic plasmids. Then, to test flCDKL5 expression in bicistronic configurations with different N- and C-terminal tags, BCDs were developed. To do so, two flCDKL5 genes were cloned into Bicositronic Entry Clone plasmids (BECs) and then short DNA fragments harboring different Leader ORFs, Shine Dalgarno sequences and N-terminal tags were inserted between the pLacZ promoter and the gene of interest (GOI). The same procedure was followed for the preparation of the pGFP-based vectors. The list of the primers used in this work is reported in Additional file 1: Table S2.

Monocistronic designs

For the recombinant expression of flCDKL5 with monocistronic plasmids, pMAV [30], and p79C [31] vectors were used. The genes encoding flCDKL5 variants named 107 (B), 107 (G) and 107 (H) were synthesized by an external company and cloned into pMAV by using NdeI/EcoRI double digestion. The transfer of the genes of interest from pMAV into p79C was achieved with the use of NdeI and SacI allowing the isolation of the recombinant genes together with the transcriptional terminator [31]. The sequences of the three genes are reported in the Additional file 1. 107 (B) and 107 (G) have a different codon composition, but they encode the same protein with an N-terminal TATκ peptide and C-terminal tandem 6xHis and 3xFLAG tags. 107 (H) differs from the other two variants for the lack of a His tag.

Entry clone designs

For the design of bicistronic plasmids (BCDs), Entry Clone plasmids (BECs) were prepared first. To do so a parental plasmid, p79BsC, was generated by cloning a synthetic fragment encompassing the LacR and pLacZ sequences of p79C into the pUCC backbone [47] using SphI/PstI double digestion (Additional file 1: Fig. 4, left plasmid). The aim of this step was the introduction of a BsaI site downstream of the pLacZ promoter.

The three BEC plasmids that were then generated shared the common feature of harboring two divergent BsaI sites upstream of the GOI, so as to allow the scarless cloning of DNA fragments between the pLacZ promoter and the GOI with the Golden Gate technology [48]. pBEC-pGFP was generated by isolating the pGFP gene from p79C-pGFP [31] via PCR using the pGFP_PstIBsaI_fw and pGFP_KpnI_rv primers, and cloning it into p79BsC with PstI/KpnI digestion (Additional file 1: Fig. S4, upper branch).

pBEC-107 (G) was developed by cloning the 107 (G) gene as two fragments, 5′ 107 and 3′ 107 (G). The former is a synthetic DNA harboring the first half of the flCDKL5 gene with left PstI and BsaI sites and a right HindIII site. 3′ 107 (G)—the second half of the gene—was extracted from p79C-107 (G) using HindIII/SacI double digestion. The two fragments were cloned between PstI and SacI into p79BsC (Additional file 1: Fig. S4, middle branch).

pBEC-107 (H) was generated in the same way as pBEC-107 (G) with the only difference that 3′ 107 (H) was used in place of 3′ 107 (G). Such fragment was obtained from p79C-107 (H) using HindIII/SacI digestion and was cloned together with 5′ 107 into p79BsC (Additional file 1: Fig. S4, bottom branch).

Bicistronic designs

To generate the various pBCD plasmids described in this work, different DNA fragments, named Strings A-F were cloned into the BEC plasmids. As depicted in Additional file 1: Fig. S5a, each String has two divergent BsaI sites at the extremities to allow Golden Gate cloning. Furthermore, every String harbors the 5’UTR of p79C (+ 1 in Additional file 1: Fig. S5a indicates the transcription start) and different combinations of SD1, SD2, Leader ORF and possibly N-terminal tags to be fused to the GOI. The various combinations of BEC plasmids and Strings generated the pBCDs described in this work (Additional file 1: Fig. S5b). Finally, to generate pBCD2-H6-TATκ-pGFP, a PCR on pBCD2-107 (K) was performed with pBCD_SphI_fw and H6-TATk_BsaI_rv primers to isolate a DNA fragment harboring the H6-Tatκ encoding sequence to be cloned into pBEC-pGFP upstream of the pGFP gene.

Mutagenesis reactions into flCDKL5 encoding genes were carried out with the QuikChange II XL Site-Directed Mutagenesis kit (Agilent). In particular, the elimination of the first ATG was achieved using M1I_G2stop_fw and M1I_G2stop_rv primers; M10V mutations were performed with M10V_fw and M10V_rv primers; Kinase Dead mutants were obtained using Mut_KK_fw and Mut_KK_rv.

High copy number bicistronic plasmids

The low copy number OriR of the recombinant plasmids was replaced by the high copy number B40 replication origin using AscI/NotI double digestion.

Tricistronic designs

To co-express the LacZ Leader peptide, EB2 and flCDKL5, a similar strategy as for the BCD plasmids was applied. First, pB40-BCD2-107 (L) M10V was converted into an Entry Clone plasmid (pB40-BEC-107 (L) M10V). To do so, a PCR fragment encompassing the first half of the 107 (L) M10V gene was amplified with CDKL5_L_PstIBsaI_fw and CDKL5_NheI_rv primers to introduce PstI and BsaI sites upstream of the 6xHis-Sumo tag encoding sequence. Then, such DNA fragment was cloned into pB40-BEC-107 (H) using PstI/NheI double digestion so as to replace the 5’ half of the flCDKL5 gene and generate pB40-BEC-107 (L) M10V (Additional file 1: Fig. S6a). Finally, the BEC plasmid was converted into the tricistronic plasmid (TCD) by cloning a DNA fragment harboring the 5’UTR, the LacZ (aa 1–19) encoding sequence, and the cmyc-EB2-6xHis gene between the two BsaI sites (Additional file 1: Fig. S6b).

For the development of TCD plasmids expressing CDD mutants of flCDKL5, TCD was subjected to mutagenesis with the QuikChange II XL Site-Directed Mutagenesis kit (Agilent) with the primers listed in Additional file 1: Table S2.

Measurement of average plasmid copy number (PCN)

The quantification of p79C-107(B) in P. haloplanktis TAC125 was carried out as previously reported [35]. Briefly, total DNA was extracted from 1 OD of bacterial cells using the E.Z.N.A. Bacterial DNA kit (Omega Bio-Tek Inc). Then, five serial dilutions of total DNA were employed as substrates in qPCR reactions with either Prom7_fw and Prom7_rv primers to detect the genomic DNA, or CDKL5_fw and CDKL5_rv to quantify the plasmid. Hence, the PCN values were calculated as follows: PCN = E_c^Ctc / E_p^Ctp, where Ec and Ep are the efficiencies obtained from the standard curves of the amplification of the chromosomal and plasmid genes, respectively, and Ctc and Ctp are the threshold cycles for the two amplicons (chromosomal and plasmid genes) in each sample.

Quantification of flCDKL5 mRNA

Total RNA was isolated using the Direct-zol RNA Kit (Zymo Research, Irvine, CA, USA) adopting the manufacturer’s instructions, followed by treatment with RNAse-free DNase I (Roche, Mannheim, Germany) to avoid genomic DNA contamination. Total RNA was reverse transcribed using SuperScript IV (Invitrogen, Carlsbad, CA, USA) according to the recommended protocol using PSHA_RS01090_rv and CDKL5_rv primers. 1 μL cDNA from each sample was used as the template for quantitative real-time PCR by using 1X PowerUp SYBR Green Master Mix (Applied Biosystems, Foster City, CA, USA) in the presence of 400 nM of specific primers (PSHA_RS01090 _fw, PSHA_RS01090 _rv; CDKL5_fw, CDKL5_rv). The reactions were run by a StepOne Real-time PCR System (Applied Biosystems, Foster City, CA, USA) and three independent sets of experiments were performed.

The thermal cycling protocol was set up as follows: UDG activation for 2 min at 50 °C; initial denaturation for 10 min at 95 °C; 40 cycles of denaturation for 15 s at 95 °C alternated with annealing/extension steps for 1 min at 60 °C. At the end of each reaction, melting curves were performed to verify the presence of a specific and unique amplification product. The housekeeping gene PSHA_RS01090 was chosen as the normalizer for variations of mRNA amounts, cDNA synthesis efficiency and plasmid DNA contamination. The expression level of the CDKL5 gene was assayed for up-regulation in the experimental samples (induction of expression) in comparison to the calibrator sample (noninduced cells, NI). The relative quantification of mRNA was expressed as fold-change and was calculated through the standard curve method (Pfaffl method):

Gene expression ratio = (E_target) ^ΔCt _target ^{(control−sample)} /(E_housekeeping) ^ΔCt _housekeeping.^{(control−sample)} [49]

Measurement of pGFP fluorescence

After recombinant production, the equivalent of 1 OD cells was harvested and washed with PBS. Then, each sample was diluted with the same buffer to achieve the best signal-to-noise ratio in the fluorescence measurements that were carried out with a JASCO FP-750 spectrofluorometer at 25 °C (excitation at 488 nm and emission at 509 nm).

Analysis of the production of flCDKL5 and EB2 proteins

To analyze total protein productions, 1 OD pellets were solubilized in Laemmli Sample buffer and heated at 90 °C for 20 min. Then, total cellular extracts were resolved by SDS-PAGE and analyzed either by Coomassie staining or Western blot. For the solubility analysis, cells were lysed in 20 mM sodium phosphate buffer pH 7.0, supplemented with 0.5 M NaCl, 10% Glycerol, 0.1% Triton X-100, 20 U/mL DNAse I, 0.1 mg/mL lysozyme, 1 mM DTT and a protease inhibitor cocktail. Then, the soluble and insoluble fractions were segregated by centrifugation. Finally, the insoluble fraction was resuspended with lysis buffer in the same volume as the soluble fraction. 10 µg of soluble extract were analyzed by SDS-PAGE and the same volume of insoluble fraction was used as a control of solubility.

After SDS-PAGE runs, proteins were electroblotted to PVDF membranes using a semidry system. After the incubation with specific antibodies, the chemiluminescent signals were developed with the ECL method.

To detect flCDKL5, the membrane was blocked with PBS, 0.05% Triton X-100, 5% w/v milk for one hour. Then, CDKL5 (D-12): sc-376314 antibody (Santa Cruz Biotechnology) was diluted 1:1000 in the same buffer. After one hour of incubation at room temperature with the primary antibody, the membrane was washed with PBS, 0.05% v/v Triton X-100 three times (5 min each) and incubated with an anti-mouse antibody diluted 1:10,000 in PBS, 0.05% v/v Triton X-100, 5% w/v milk for one hour. Then, the membrane was washed again with PBS, 0.05% v/v Triton X-100 three times and the secondary antibody was detected using the ECL method.

For anti-FLAG Western blots, the membrane was blocked with PBS, 0.2% Tween 20, 5% w/v milk, for one hour. Then, Monoclonal ANTI-FLAG M2, Clone M2 (F1804, Sigma) was diluted 1:1000 in the same buffer. After overnight incubation at 4 °C with the primary antibody, the membrane was washed with PBS, 0.2% Tween 20 three times and incubated with an anti-mouse antibody diluted 1:5000 in PBS, 0.2% Tween 20, 5% w/v milk for one hour at room temperature. Then, the membrane was washed again with PBS, 0.2% Tween 20 three times and the secondary antibody was detected using the ECL method.

In the case of anti-His Western blots, the membrane was blocked with PBS, 5% w/v milk for one hour. Then, Monoclonal Anti-polyHistidine-Peroxidase clone HIS-1 antibody (A7058, Merck) was diluted 1:2000 in PBS, 0.05% Tween 20, 5% w/v milk. After one hour of incubation at room temperature with the antibody, the membrane was washed with PBS, 0.05% Tween 20 three times and it was developed.

To measure EB2 phosphorylation of Ser222, the membrane was blocked with TBST, 5% w/v milk for one hour. Then, anti-EB2 pS222 antibody (00117739, Covalab) was diluted 1:4000 in the same buffer. After overnight incubation at 4 °C with the primary antibody, the membrane was washed with TBST three times and incubated with an anti-rabbit antibody diluted 1:2000 in TBST and 5% w/v milk for one hour at room temperature. Then, the membrane was washed again with TBST three times and was developed.

Parallel Coomassie stained polyacrylamide gels were always used to ascertain that complex samples (i.e. total and soluble lysates) were correctly balanced in Western blot analyses.

Preparation of samples for N-terminal sequencing

To produce the catalytic domain of CDKL5 in E. coli, the primers named PhSumoCDKL5_NdeI_fw and PhCDKL5dC_XhoI_rv were used in a PCR on pB40-BCD2-107 (L). This gene encoding a Sumo-tagged version of CDKL5(1–352) was cloned into the pET40-b vector in frame with a C-terminal 8xHis tag using NdeI/XhoI double digestion. After recombinant expression in E. coli BL21(DE3), the recombinant cells were resuspended in 50 mM TrisHCl pH 8.0, 0.5 M NaCl, 20 mM imidazole and lysed by sonication in the presence of a protease inhibitor cocktail. After centrifugation (14,000 g, 4 °C, 60 min), the soluble fraction was loaded onto a HisTrap of 1 mL (Cytiva) and both the full-length protein, and its N-terminally truncated fragment were collected with a linear gradient of imidazole. A sample containing approximately 30 µg of the intact catalytic domain and 6 µg of the N-terminally truncated fragment were loaded onto SDS-PAGE and then electroblotted onto a PVDF membrane using 10 mM CAPS, 10% methanol pH 11.0 as the transfer buffer. The protein bands were made visible by Ponceau S staining and submitted to Edman sequencing at the Institute of Biosciences and Bioresources (CNR, Naples).

Enrichment of 107 (L) M10V from P. haloplanktis TAC125 lysate

Recombinant P. haloplanktis TAC125 was lysed with a chemical-enzymatic method. Briefly, the cell paste was resuspended in 20 mM sodium phosphate buffer pH 7.0, supplemented with 0.5 M NaCl, 10% glycerol, 0.1% Triton X-100, 20 U/mL DNAse I, 0.1 mg/mL lysozyme, 1 mM DTT, and a protease inhibitor cocktail to reach a final concentration of 14 OD/mL. After incubation at 4 °C for 20 min, the suspension was centrifugated (14,000 g for 45 min at 4 °C) to separate the soluble fraction from the cellular debris. Then, the soluble lysate was incubated with 0.15 mL of ANTI-FLAG M2 Affinity gel (Millipore) at 4 °C for 4 h and a gravity flow column chromatography was performed. The resin was washed with lysis buffer containing 1% Triton X-100, while the elution was performed using in 0.5 mL of the same buffer containing 175 µM 3xFLAG peptide.

EB2 purification

A codon optimized EB2 gene was synthesized by an external company and cloned into the pET40-b with NdeI/BamHI double digestion. The resulting gene encodes human EB2 with an N-terminal 6xHis tag (see the Additional file 1: Information for the nucleotide sequence). After recombinant expression, E. coli BL21(DE3) recombinant cells were lysed by sonication in 50 mM TrisHCl pH 8.0, 0.5 M NaCl, 5% glycerol, 20 mM imidazole supplemented with a protease inhibitor cocktail. The soluble fraction was recovered after a centrifugation (14,000 g for 45 min at 4 °C) and loaded onto a HisTrap of 1 mL (Cytiva). The target protein was eluted with 250 mM imidazole and loaded onto a a Hiload 16/600 Superdex 200 pg using 50 mM TrisHCl pH 8.0, 0.18 M NaCl as a running buffer for a final polishing step. The final preparation was stored at – 80 °C in 40 mM TrisHCl pH 8.0, 0.15 M NaCl, 1 mM DTT, 15% glycerol at 1.8 mg/mL protein concentration.

In vitro kinase assay

EB2 phosphorylation assays with enriched CDKL5 proteins were carried out using 200 nM EB2 and 100 nM of enzyme in 30 μL of 20 mM TrisHCl pH 7.7, 0.5 M NaCl, 10 mM MgCl₂, 1 mM DTT, 0.7 mM ATP, complete protease inhibitor cocktail (Roche) and Halt phosphatase inhibitor cocktail (Thermo Fisher Scientific). The reactions were stopped after 30 min with 10 μL Laemmli Sample buffer 4 × and denatured at 70 °C for 20 min. 10 μL of each reaction were analyzed via either SDS-PAGE (for total CDKL5 and EB2 detection) or anti-EB2 pSer222 Western blot (for phosphorylated EB2 detection). As a negative control, a reaction was set up with flCDKL5 KD, a catalytically inactive CDKL5 variant. As a positive control, a reaction with commercial GST-CDKL5(1–498) (ab131695, abcam) was performed.

Statistics and reproducibility of results

The Data from the in cellulo kinase assays were statistically validated using the t-Student test comparing the mean measurements of experimental and control samples, both carried out as technical triplicates. The significance of differences between mean values was calculated using a two-tailed Student’s t-test. A p value of < 0.05 was considered significant.

Availability of data and materials

The sequences of the original flCDKL5 encoding genes (107 (B), 107 (G), 107 (H)) cloned into pUC18 have been deposited in GenBank with the accession codes ON605205, ON605206 and ON605207, respectively. The synthetic plasmid pMK-T-H6EB2 from which the EB2 encoding gene was taken for expression in E. coli is available in GenBank with the accession code ON605208, while the sequence for EB2 co-expression with flCDKL5 was obtained from pMK-RQ-BCD_LacZ-cmyc-EB2-H6, whose accession code in GenBank is ON605209. All the other constructs described in this work were derived from such sequences and the ones described in refs. [30, 31]. Correspondence and material requests should be addressed to MLT (tutino@unina.it) and AC (andrea.colarusso@unina.it).

References

Hector RD, Kalscheuer VM, Hennig F, Leonard H, Downs J, Clarke A, et al. CDKL5 variants. Neurol Genet. 2017;3(6): e200.
Article CAS PubMed PubMed Central Google Scholar
Krishnaraj R, Ho G, Christodoulou J. RettBASE: Rett syndrome database update. Hum Mutat. 2017;38(8):922–31.
Article PubMed Google Scholar
Olson HE, Demarest ST, Pestana-Knight EM, Swanson LC, Iqbal S, Lal D, et al. Cyclin-dependent kinase-like 5 deficiency disorder: clinical review. Pediatr Neurol. 2019;97:18–25.
Article PubMed PubMed Central Google Scholar
Trazzi S, De Franceschi M, Fuchs C, Bastianini S, Viggiano R, Lupori L, et al. CDKL5 protein substitution therapy rescues neurological phenotypes of a mouse model of CDKL5 disorder. Hum Mol Genet. 2018;27(9):1572–92.
Article CAS PubMed Google Scholar
Terzic B, Felicia Davatolhagh M, Ho Y, Tang S, Liu YT, Xia Z, et al. Temporal manipulation of Cdkl5 reveals essential postdevelopmental functions and reversible CDKL5 deficiency disorder-related deficits. J Clin Invest. 2021. https://doi.org/10.1172/JCI143655.
Article PubMed PubMed Central Google Scholar
Gao Y, Irvine EE, Eleftheriadou I, Naranjo CJ, Hearn-Yeates F, Bosch L, et al. Gene replacement ameliorates deficits in mouse and human models of cyclin-dependent kinase-like 5 disorder. Brain. 2020;143(3):811–32.
Article PubMed Google Scholar
Kim JY, Bai Y, Jayne LA, Hector RD, Persaud AK, Ong SS, et al. A kinome-wide screen identifies a CDKL5-SOX9 regulatory axis in epithelial cell death and kidney injury. Nat Commun. 2020. https://doi.org/10.1038/s41467-020-15638-6.
Article PubMed PubMed Central Google Scholar
Muñoz IM, Morgan ME, Peltier J, Weiland F, Gregorczyk M, Brown FC, et al. Phosphoproteomic screening identifies physiological substrates of the CDKL 5 kinase. EMBO J. 2018;37(24):1–19.
Article Google Scholar
Baltussen LL, Negraes PD, Silvestre M, Claxton S, Moeskops M, Christodoulou E, et al. Chemical genetic identification of CDKL 5 substrates reveals its role in neuronal microtubule dynamics. EMBO J. 2018;37(24):1–18.
Article Google Scholar
Khanam T, Muñoz I, Weiland F, Carroll T, Morgan M, Borsos BN, et al. CDKL5 kinase controls transcription-coupled responses to DNA damage. EMBO J. 2021;40(23): e108271.
Article CAS PubMed PubMed Central Google Scholar
Nawaz MS, Giarda E, Bedogni F, La MP, Ricciardi S, Ciceri D, et al. CDKL5 and shootin1 interact and concur in regulating neuronal polarization. PLoS ONE. 2016. https://doi.org/10.1371/journal.pone.0148634.
Article PubMed PubMed Central Google Scholar
Barbiero I, Peroni D, Tramarin M, Chandola C, Rusconi L, Landsberger N, et al. The neurosteroid pregnenolone reverts microtubule derangement induced by the loss of a functional CDKL5-IQGAP1 complex. Hum Mol Genet. 2017;26(18):3520–30.
Article CAS PubMed Google Scholar
Khokhlatchev A, Xu S, English J, Wu P, Schaefer E, Cobb MH. Reconstitution of mitogen-activated protein kinase phosphorylation cascades in bacteria. Efficient synthesis of active protein kinases. J Biol Chem. 1997;272(17):11057–62.
Article CAS PubMed Google Scholar
Fu Z, Schroeder MJ, Shabanowitz J, Kaldis P, Togawa K, Rustgi AK, et al. Activation of a nuclear Cdc2-related kinase within a mitogen-activated protein kinase-like TDY motif by autophosphorylation and cyclin-dependent protein kinase-activating kinase. Mol Cell Biol. 2005;25(14):6047–64.
Article CAS PubMed PubMed Central Google Scholar
Fu Z, Larson KA, Chitta RK, Parker SA, Turk BE, Lawrence MW, et al. Identification of Yin-Yang regulators and a phosphorylation consensus for male germ cell-associated kinase (MAK)-related kinase. Mol Cell Biol. 2006;26(22):8639–54.
Article CAS PubMed PubMed Central Google Scholar
Hector RD, Dando O, Landsberger N, Kilstrup-Nielsen C, Kind PC, Bailey MES, et al. Characterisation of CDKL5 transcript isoforms in human and mouse. PLoS ONE. 2016;11(6):1–22.
Article Google Scholar
Fahmi M, Yasui G, Seki K, Katayama S, Kaneko-Kawano T, Inazu T, et al. In silico study of Rett Syndrome treatment-related genes, MECP2, CDKL5, and FOXG1, by evolutionary classification and disordered region assessment. Int J Mol Sci. 2019;20(22):1–19.
Article Google Scholar
Altmeyer M, Neelsen KJ, Teloni F, Pozdnyakova I, Pellegrino S, Grøfte M, et al. Liquid demixing of intrinsically disordered proteins is seeded by poly(ADP-ribose). Nat Commun. 2015;6.
Paz A, Zeev-Ben-Mordehai T, Sussman JL, Silman I. Purification of intrinsically disordered proteins [Internet]. Instrumental Analysis of Intrinsically Disordered Proteins. 2010. p. 695–704. (Wiley Online Books).
Canning P, Park K, Gonçalves J, Li C, Howard CJ, Sharpe TD, et al. CDKL family kinases have evolved distinct structural features and ciliary function. Cell Rep. 2018;22(4):885–94.
Article CAS PubMed PubMed Central Google Scholar
Katayama S, Sueyoshi N, Kameshita I. Critical determinants of substrate recognition by cyclin-dependent kinase-like 5 (CDKL5). Biochemistry. 2015;54(19):2975–87.
Article CAS PubMed Google Scholar
Kameshita I, Sekiguchi M, Hamasaki D, Sugiyama Y, Hatano N, Suetake I, et al. Cyclin-dependent kinase-like 5 binds and phosphorylates DNA methyltransferase 1. Biochem Biophys Res Commun. 2008;377(4):1162–7.
Article CAS PubMed Google Scholar
Katayama S, Inazu T. Straightforward and rapid method for detection of cyclin-dependent kinase-like 5 activity. Anal Biochem. 2019;566(November 2018):58–61.
Article CAS PubMed Google Scholar
Vigentini I, Merico A, Tutino ML, Compagno C, Marino G. Optimization of recombinant human nerve growth factor production in the psychrophilic Pseudoalteromonas haloplanktis. J Biotechnol. 2006;127(1):141–50.
Article CAS PubMed Google Scholar
Papa R, Rippa V, Sannia G, Marino G, Duilio A. An effective cold inducible expression system developed in Pseudoalteromonas haloplanktis TAC125. J Biotechnol. 2007;127(2):199–210.
Article CAS PubMed Google Scholar
Unzueta U, Vázquez F, Accardi G, Mendoza R, Toledo-Rubio V, Giuliani M, et al. Strategies for the production of difficult-to-express full-length eukaryotic proteins using microbial cell factories: production of human alpha-galactosidase A. Appl Microbiol Biotechnol. 2015;99(14):5863–74.
Article CAS PubMed Google Scholar
Calvanese M, Colarusso A, Lauro C, Parrilli E, Tutino ML. Soluble recombinant protein production in Pseudoalteromonas haloplanktis TAC125: the case study of the full-length human CDKL5 protein. Methods Mol Biol. 2022;2406:219–32.
Article PubMed Google Scholar
Fondi M, Maida I, Perrin E, Mellera A, Mocali S, Parrilli E, et al. Genome-scale metabolic reconstruction and constraint-based modelling of the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125. Environ Microbiol. 2015;17(3):751–66.
Article CAS PubMed Google Scholar
Fondi M, Gonzi S, Dziurzynski M, Turano P, Ghini V, Calvanese M, et al. Modelling hCDKL5 heterologous expression in bacteria. Metabolites. 2021;11(8):1–16.
Article Google Scholar
Sannino F, Giuliani M, Salvatore U, Apuzzo GA, de Pascale D, Fani R, et al. A novel synthetic medium and expression system for subzero growth and recombinant protein production in Pseudoalteromonas haloplanktis TAC125. Appl Microbiol Biotechnol. 2017;101(2):725–34.
Article CAS PubMed Google Scholar
Colarusso A, Lauro C, Calvanese M, Parrilli E, Tutino ML. Improvement of Pseudoalteromonas haloplanktis TAC125 as a cell factory: IPTG-Inducible plasmid construction and strain engineering. Microorganisms. 2020;8(10):1–24.
Article Google Scholar
Mutalik VK, Guimaraes JC, Cambray G, Lam C, Christoffersen MJ, Mai QA, et al. Precise and reliable gene expression via standard transcription and translation initiation elements. Nat Methods. 2013;10(4):354–60.
Article CAS PubMed Google Scholar
Tutino ML, Duilio A, Parrilli E, Remaut E, Sannia G, Marino G. A novel replication element from an Antarctic plasmid as a tool for the expression of proteins at low temperature. Extremophiles. 2001;5(4):257–64.
Article CAS PubMed Google Scholar
Puigbò P, Guzmán E, Romeu A, Garcia-Vallvé S. OPTIMIZER: a web server for optimizing the codon usage of DNA sequences. Nucleic Acids Res. 2007;35(SUPPL2):126–31.
Article Google Scholar
Qi W, Colarusso A, Olombrada M, Parrilli E, Patrignani A, Tutino ML, et al. New insights on Pseudoalteromonas haloplanktis TAC125 genome organization and benchmarks of genome assembly applications using next and third generation sequencing technologies. Sci Rep. 2019;9(1):16444.
Article PubMed PubMed Central Google Scholar
Koutmou KS, Schuller AP, Brunelle JL, Radhakrishnan A, Djuranovic S, Green R. Ribosomes slide on lysine-encoding homopolymeric A stretches. Elife. 2015;2015(4):1–18.
Google Scholar
Arthur LL, Pavlovic-djuranovic S, Koutmou KS, Green R. Translational control by lysine-encoding A-rich sequences. Sci Adv. 2015. https://doi.org/10.1126/sciadv.1500154.
Article PubMed PubMed Central Google Scholar
Buskirk AR, Green R. Ribosome pausing, arrest and rescue in bacteria and eukaryotes. Philos Trans R Soc B Biol Sci. 2017. https://doi.org/10.1098/rstb.2016.0183.
Article Google Scholar
Suskiewicz MJ, Sussman JL, Silman I, Shaul Y. Context-dependent resistance to proteolysis of intrinsically disordered proteins. Protein Sci. 2011;20(8):1285–97.
Article CAS PubMed PubMed Central Google Scholar
Zhang G, Ignatova Z. Folding at the birth of the nascent chain: coordinating translation with co-translational folding. Curr Opin Struct Biol. 2011;21(1):25–31.
Article PubMed Google Scholar
Verma M, Choi J, Cottrell KA, Lavagnino Z, Thomas EN, Pavlovic-Djuranovic S, et al. A short translational ramp determines the efficiency of protein synthesis. Nat Commun. 2019;10(1):1–15.
Article CAS Google Scholar
Leith EM, O’Dell WB, Ke N, McClung C, Berkmen M, Bergonzo C, et al. Characterization of the internal translation initiation region in monoclonal antibodies expressed in Escherichia coli. J Biol Chem. 2019;294(48):18046–56.
Article CAS PubMed PubMed Central Google Scholar
Zhao Z, Liu X, Zhang W, Yang Y, Dai X, Bai Z. Construction of genetic parts from the Corynebacterium glutamicum genome with high expression activities. Biotechnol Lett. 2016;38(12):2119–26.
Article CAS PubMed Google Scholar
Whitaker WR, Lee H, Arkin AP, Dueber JE. Avoidance of truncated proteins from unintended ribosome binding sites within heterologous protein coding sequences. ACS Synth Biol. 2015;4(3):249–57.
Article CAS PubMed Google Scholar
Bertani I, Rusconi L, Bolognese F, Forlani G, Conca B, De Monte L, et al. Functional consequences of mutations in CDKL5, an X-linked gene involved in infantile spasms and mental retardation. J Biol Chem. 2006;281(42):32048–56.
CAS PubMed Google Scholar
Lin C, Franco B, Rosner MR. CDKL5/Stk9 kinase inactivation is associated with neuronal developmental disorders. Hum Mol Genet. 2005;14(24):3775–86.
Article CAS PubMed Google Scholar
Parrilli E, Giuliani M, Tutino ML. General secretory pathway from marine antarctic Pseudoalteromonas haloplanktis TAC125. Mar Genomics. 2008;1(3–4):123–8.
Article PubMed Google Scholar
Engler C, Kandzia R, Marillonnet S. A one pot, one step, precision cloning method with high throughput capability. PLoS ONE. 2008;3(11): e3647.
Article PubMed PubMed Central Google Scholar
Pfaffl MW. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001;29(9):45.
Article Google Scholar

Download references

Acknowledgements

This work is dedicated to Elettra Yvonne, Sofia, Elena, Arianna, Filippo, Raina, Gianluigi and all the beloved CDD patients all over the world, to their extraordinary parents and relatives. We feel responsible for your hope. Open access publication fees was kindly financed by the Department of Chemical Sciences, “Federico II” University of Naples.

Funding

This research was funded by a research grant from the University of Pennsylvania Orphan Disease Center on behalf of the Loulou Foundation to MLT (pilot award no. CDKL5-20-101-08), by the Italian Parents’ Association “La fabbrica dei sogni 2—New developments for Rett syndrome” (to A.C., M.C., C.L., E.P. and M.L.T.), and by the Italian Parents’ Association “CONRETT ONLUS” (to A.C. and C.L.).

Author information

Authors and Affiliations

Department of Chemical Sciences, “Federico II” University of Naples, Complesso Universitario Monte S. Angelo-Via Cintia, 80126, Naples, Italy
Andrea Colarusso, Concetta Lauro, Marzia Calvanese, Ermenegilda Parrilli & Maria Luisa Tutino
Istituto Nazionale Biostrutture e Biosistemi-I.N.B.B., Viale Medaglie d’Oro, 305-00136, Rome, Italy
Andrea Colarusso & Concetta Lauro

Authors

Andrea Colarusso
View author publications
You can also search for this author in PubMed Google Scholar
Concetta Lauro
View author publications
You can also search for this author in PubMed Google Scholar
Marzia Calvanese
View author publications
You can also search for this author in PubMed Google Scholar
Ermenegilda Parrilli
View author publications
You can also search for this author in PubMed Google Scholar
Maria Luisa Tutino
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.C., E.P. and M.L.T. designed the project, A.C. was responsible of its experimental execution, of data collection and analysis, and wrote the manuscript. C.L. developed some bicistronic constructs for the expression of CDKL5 and pGFP, set up the in vitro activity assay and purified flCDKL5. M.C. handled the isolation of B40 origin of replication, the production of some flCDKL5 missense mutated genes, and the setup of the first in cellulo co-expression assay. C.L. and M.C. established the EB2 production and purification protocol. All the authors revised the manuscript and approved the content.

Corresponding author

Correspondence to Maria Luisa Tutino.

Ethics declarations

Consent for publication

not applicable.

Competing interests

Authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Characteristics of BCD constructs for pGFP expression. Table S2. List of primers used in this work. Fig. S1. Average plasmid copy number (PCN) of pP79-107 (B). Fig. S2. Ranking of bicistronic designs (BCDs) with a fluorescent reporter. Fig. S3. flCDKL5 production profiles with pBCD-107 (L) plasmids. Fig. S4. Development of Bicistronic Entry Clones. Fig. S5. Development of Bicistronic Designs. Fig. S6. Development of Tricistronic Designs.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Colarusso, A., Lauro, C., Calvanese, M. et al. Active human full-length CDKL5 produced in the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125. Microb Cell Fact 21, 211 (2022). https://doi.org/10.1186/s12934-022-01939-6

Download citation

Received: 04 July 2022
Accepted: 24 September 2022
Published: 14 October 2022
DOI: https://doi.org/10.1186/s12934-022-01939-6

Active human full-length CDKL5 produced in the Antarctic bacterium Pseudoalteromonas haloplanktis TAC125

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Soluble Recombinant Protein Production in Pseudoalteromonas haloplanktis TAC125: The Case Study of the Full-Length Human CDKL5 Protein

Combined effects of double mutations on catalytic activity and structural stability contribute to clinical manifestations of glucose-6-phosphate dehydrogenase deficiency

Folding and assembly defects of pyruvate dehydrogenase deficiency-related variants in the E1α subunit of the pyruvate dehydrogenase complex

Background

Results and discussion

A multiparametric strategy to optimize full-length CDKL5 expression in Pseudoalteromonas haloplanktis TAC125

flCDKL5 expression with monocistronic plasmids

Bicistronic cassettes and site-specific mutagenesis to overcome an internal translational start

Application of high-copy number plasmids for the expression of flCDKL5 and evaluation of the kinase activity

Use of TCD plasmids to measure the impact of CDKL5 pathogenic mutations

Conclusions

Methods

Bacterial strains and culture conditions

Plasmids construction

Monocistronic designs

Entry clone designs

Bicistronic designs

High copy number bicistronic plasmids

Tricistronic designs

Measurement of average plasmid copy number (PCN)

Quantification of flCDKL5 mRNA

Measurement of pGFP fluorescence

Analysis of the production of flCDKL5 and EB2 proteins

Preparation of samples for N-terminal sequencing

Enrichment of 107 (L) M10V from P. haloplanktis TAC125 lysate

EB2 purification

In vitro kinase assay

Statistics and reproducibility of results

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Table S1.

Rights and permissions

About this article

Cite this article

Share this article

Keyword

Search

Navigation