Introduction

DNA vaccine technology has been successfully applied to the treatment of cancer [1], autoimmune diseases [2], and allergies [3], as well as a prophylactic approach against several infectious diseases from bacterial and viral origin [4]. DNA vaccines are effective at eliciting a T-cell response as well as the production of antibodies against the protein encoded by the plasmid [5]. Protective immunity has also been observed following DNA immunization [6]. Additionally, DNA vaccination presents commercial advantages compared to conventional approaches since plasmids can be manufactured by generic processes, including standardized quality control and storage conditions that are independent of the encoded sequence [7].

Regardless of their potential, an effort has been made to improve their performance through the use of several optimization strategies such as the (1) improvement of the immunological properties of the vector itself through the addition of CpG sequences, capable of acting as an adjuvant [8]; (2) maximization of antigen expression through the use of improved vectors using strong constitutive promoters, addition of regulatory elements that work as transcriptional enhancers, or through codon optimization of the antigen gene sequence [911]; (3) co-administration of DNA with cytokines and chemokines to amplify the immune response and modulate the Th1 and Th2 profile [1214]; (4) combined administration of DNA with lipid complexes, microparticles or other delivery formulations to increase transfection and expression efficiencies [1517].

Furthermore, improvement of antigen delivery to the immune system is also considered to be a fundamental aspect for DNA vaccine development, since several studies report that the localization of the expressed protein can significantly influence the efficacy of DNA vaccines [18]. Therefore, several molecular approaches have been attempted including the use of signal sequences which target the antigen to intracellular compartments involved in the processing and presentation pathways MHC I [19] and MHC II [20, 21].

The invariant surface glycoprotein (ISG) [22] and the N-terminal trans-sialidase (nTSA) [23] of Trypanosoma brucei were used as model proteins in this study. These antigens have been selected for the construction of DNA vaccine prototypes against African Trypanosomiasis [23]. The two proteins were fused to GFP and to three different targeting sequences: (i) a secretion signal (sc) composed of the first 21 amino acids of the human tissue plasminogen activator signal sequence [24] that promotes protein secretion to the extracellular space, (ii) the lysosomal-associated membrane protein-1 (LAMP) sequence [25] that directs protein to the lysosomes, and (iii) the adenovirus e1a sequence [26] that allows protein transport to the endoplasmic reticulum (ER).

It has been reported that introduction of DNA sequences into a vector backbone can significantly affect plasmid stability and resistance to nuclease attack, reducing the number of active plasmid copies that are able to reach the nucleus [27, 28]. The addition of DNA sequences can also influence mRNA stability and compromise transgene expression. Therefore, plasmid copy number and mRNA levels were determined in order to assess the effect of addition of targeting sequences on plasmid stability, transfection, and transcription efficiencies. Protein expression was analyzed by microscopy and quantified by flow cytometry. Here we present a systematic analysis of the performance of vectors encoding three targeting sequences used in vaccine development. This study is an important contribution for a rational design of DNA vaccines.

Materials and Methods

DNA Vaccine Prototype Design and Purification

A detailed description of the plasmids used in this study is outlined in Table 1. The plasmids were initially constructed using the pVAX1/lacZ (Invitrogen) as the backbone vector, to be used as candidate vaccines against African Trypanosomiasis. For the purpose of this study, the lacZ reporter gene was replaced by the Green Fluorescence Protein (GFP), generating pVAX1GFP [27] (3697 bp). Firstly, the ISG [22] gene from T. brucei was amplified by PCR (1571 nucleotides) and inserted into pVAX1/lacZ. This construct (pISGlacZ) produces a cytoplasmic protein that was used as a control of the experiments. The second construction (pscISGlacZ) was generated through the addition of the coding sequence for the first 21 amino acids of the human tissue plasminogen activator signal sequence [24] upstream of the ISG gene. To generate the third construct (pISGlacZlamp) the LAMP [25] (Lysosomal-associated membrane protein-1) nucleotide sequence was inserted downstream into the pISGlacZ plasmid. The fourth plasmid (pe1aISGlacZ) was constructed to include the adenovirus e1a ER signal sequence [26] upstream and in frame with ISG. Since there are reports in the literature that suggest that an efficient transport to the lysosomes is dependent on protein loading to the secretory pathway by an ER targeting signal [29], the e1a sequence was also added to the pISGlamplacZ plasmid to obtain a fifth construct (pe1aISGlacZlamp). Finally, in all plasmids the reporter gene lacZ was replaced by the GFP gene (pISG, pISGlamp, pscISG, pe1aISG, and pe1aISGlamp).

Table 1 Features of ten plasmids encoding either ISG or nTSA genes located upstream of the reporter gene GFP

The above plasmids were used to construct a second set of vaccines prototypes (pTSA, pTSAlamp, pscTSA, pe1aTSA, pe1aTSAlamp) where ISG was replaced with the N-terminal sequence of the TSA gene (1416 nucleotides) [23], also amplified from T. brucei genomic DNA by PCR.

Automated DNA sequencing was performed (Stab Vida Inc., Portugal) on these constructs to check correct sequence and alignment between GFP, antigen genes, and targeting sequences.

Plasmids were amplified in Escherichia coli DH5alpha and purified using the High Pure Isolation Kit (Roche). One microgram of pDNA was added to the lipofectamine solution. Each plasmid was diluted to the same concentration in order to use the same volume of purified DNA in each transfection assay.

Cell Culture and Transfection Assays

Transfection assays were performed using Chinese Hamster Ovary (CHO) cells. This cell line was selected as a model to analyze if differences in transfection were introduced by the addition of targeting sequences in the parameters studied in this work. CHO cells are a standard mammalian cell line used for transfection and expression of a broad range of recombinant proteins [30]. Moreover, they are considered to be an appropriate model for studies designed to assess plasmid content in eukaryotic cells [27, 31]. However, results present here may not necessarily translate to other cell lines since protein expression, plasmid half life, and transfection levels vary using different models [27].

The starting culture was seeded in 75 cm2 flasks using F-12 (Ham) nutrient mixture (Gibco, UK), containing 10% (v/v) fetal bovine serum (FBS) (Gibco, UK) and supplemented with 1× non-essential amino acids (Gibco, UK), 1× gentamicin (Gibco, UK), 1× sodium pyruvate (Gibco, UK), and 1× antibiotic–antimycotic (Gibco, UK). Cultures were incubated in 5% CO2 humidified environment at 37ºC. Cells were trypsinized after reaching confluence and then seeded into 24-well culture plates (4 × 104 cells per well). The cells were then incubated for 48 h and transfected using the combination of 1.5 μl of lipofectamine (Lipofectamine 2000TM, Invitrogen, UK) with 1 μg of each plasmid per well. Lipofectamine is a cationic lipid that forms a complex with plasmid DNA. The negative control of this experiment was set using cells transfected with plasmid DNA to which no lipofectamine was added. Finally, the medium was replaced 12 h after the transfection to remove plasmids not internalized by cells.

Protein Expression and Intracellular Localization

Protein expression was first assessed by fluorescence microscopy (Olympus 6X40 with an Olympus U-MWB mirror cube unit; excitation filter BP 450–480, barrier filter BA 515). Cells were transfected as described above and observed at 24, 30, and 48 h post-transfection. It was concluded that the best time point to visualize protein expression was at around 30 h. Therefore, all other parameters investigated in this study were assessed at this time point.

Protein expression was quantified by flow cytometry, a more precise method when compared to fluorescence microscopy. At 30 h post-transfection, cells were washed three times with phosphate buffered saline (PBS), trypsinized, and resuspended in 2 ml of PBS supplemented with 10% of FBS. The cells were analyzed using the cytometer FACscan Scalibur (Becton–Dickinson, USA). In each run the forward scatter (FSC), side scatter (SSC), and green fluorescence (FL1) parameters were registered. When FSC is plotted against SSC, it is possible to isolate the debris and dead cells from the population of cells to be analyzed. Subsequently, using the FL1 parameter, the selected population of cells was divided between the non-transfected cells, displaying background FL1 (determined using the auto-fluorescence of the negative cells), and the transfected cells, emitting green fluorescence. A direct correlation between the transfection level (defined as the percentage of cells expressing green fluorescence above the threshold level defined by the natural autofluorescence of the control non-transfected cells) and the mean fluorescence was found. As the fluorescence levels are more relevant to the analysis performed here, only the mean fluorescence data are presented in the “Results” section. The CellQuest Pro software (Becton–Dickinson, USA) was used for the analysis of these parameters.

Plasmid Copy Number

The plasmid copy number in cells transfected with each one of the constructs was analyzed by quantitative real time PCR [31]. Thirty hours post-transfection, the cells were washed three times with 1 ml of PBS in order to remove non-specifically bound pDNA. Cells were then trypsinized and centrifuged for 8 min at 160g. Subsequently, the cell pellet was washed and resuspended in 2 ml of the PBS. The number of cells was determined by the average of two performed microscope counts using a Neubauer chamber. Finally, cells were centrifuged to remove the supernatant and stored at −80ºC until use.

Immediately upon thawing, cell pellets were resuspended in PCR-grade water to obtain a final concentration of 6,250 cells/μl [27]. The set temperature of the first cycle of the amplification PCR programme is enough to induce cell lysis. Two microliters of this solution was added to the PCR reaction mixture (1× SYBR Green mixture, 0.4 μM of each primer, and 3 mM of MgCl2 in a total volume of 20 μl). A reaction containing 2 μl of control cells (cells transfected with pVAX1GFP plasmid to which no lipofectamine was added) was included in each PCR run to be used as a negative control of the assay. A second negative control of the PCR reaction was set up using PCR-grade water instead of the control cells.

Plasmid quantification was performed by promoting the amplification of a 108 bp sequence from the eGFP gene using the 5′-TCGAGCTGGACGGCGACGTAA A-3′ forward primer, and the 5′-TGCCGGTGGTGCAGATGAAC-3′ reverse primer. PCRs were carried out in a Roche LightCycler® detection system using the FastStart DNA Master SYBR Green I kit (Roche, Germany). Reactions were incubated at 95ºC for 10 min to activate FastStart DNA polymerase, followed by 30 cycles of 10 s at 95ºC, 5 s at 55ºC, and 7 s at 72ºC. After the final extension cycle, reaction mixtures were kept at 70ºC for 30 s, and heat-denatured over a temperature gradient at 0.1ºC/s from 70 to 95ºC.

The plasmid copy number was determined based on a calibration curve, constructed using a range of concentrations from 2 pg up to 2,000 pg of pVAX1GFP. Each plasmid dilution was prepared using PCR-grade water that was then added to 2 μl of control cells.

GFP mRNA Content

The GFP mRNA levels in cells transfected with each one of the plasmids were quantified using a quantitative reverse transcription RT-PCR method [27]. Cells were harvested 30 h after transfection and stored at −80ºC until use. Total RNA was purified using the Pure RNA Isolation Kit (Roche, Germany) and quantified by absorption at 260 nm. Subsequently, 300 ng of total mRNA purified was used as template for the reverse transcription reaction performed using the First Strand cDNA Synthesis Kit for RT-PCR (AMV) (Roche, Germany). Samples were treated with DNase to ensure removal of any DNA contamination. Two microliters of the cDNA reaction was then used in the quantitative real time PCR reaction, performed as described above for the quantification of the plasmid copy number per cell.

Statistics

Differences in the plasmid copy number, mRNA and mean fluorescence levels between plasmids were analyzed performing an ANOVA (General Linear Model, Tukey’s comparison test) with the level of confidence being P < 0.05. Analyses were performed with the Minitab statistical package, version 15 (Minitab Inc.).

Results

In this work, ISG or nTSA genes from T. brucei were fused with GFP reporter and with sequences targeting proteins to extracellular space (sc), to lysosome (LAMP or the combination of LAMP and e1a) or to the ER (e1a). Even though these sequences have been included in several vectors used as DNA vaccines, the effect of the selected targeting sequences in cell plasmid uptake and stability, mRNA transcription and protein expression are examined and discussed on a comparative basis.

Effect of the Signal Sequences on Plasmid Uptake/Stability and GFP mRNA Transcription

No significant differences were found between the number of plasmid copies in cells transfected with each of the constructs (Fig. 1). Therefore, plasmid uptake was not affected by the differences in the size of the different plasmids analyzed. Furthermore, the addition of ISG gene and targeting sequences appears not to have introduced sensitive regions to nuclease attack into the vector backbone.

Fig. 1
figure 1

Plasmid copy number CHO cells transfected with pVAX1GFP- and ISG-encoding plasmids at 30 h post-transfection. Data shown resulted from the analysis of three independent transfection assays, each one performed in duplicate. One way ANOVA (Tukey’s comparison test) was performed in order to compare pDNA copies in cells transfected with each construct. No significant differences were observed (P > 0.05)

mRNA determination showed that cells transfected with the plasmid pVAX1GFP, which codes for the smaller protein (GFP), had significantly more transcripts than cells expressing the ISG-GFP fusion proteins (Fig. 2). No significant differences in the levels of GFP mRNA were detected in cells transfected with pISG and with each one of the plasmids encoding a targeted protein (Fig. 2).

Fig. 2
figure 2

mRNA quantification of CHO cells transfected with pVAX1GFP- and ISG-encoding plasmids at 30 h post-transfection. Data shown resulted from the analysis of three independent transfection assays, each one performed in duplicate. One way ANOVA (Tukey’s comparison test) was performed in order to compare the number of copies of GFP mRNA in cells transfected with each construct. pVAX1GFP has a significant higher number of mRNA GFP copies compared with the rest of the groups (P < 0.05). However, no significant differences between ISG constructs were observed (P > 0.05)

Effect of the Signal Sequences on Protein Expression

The expression of ISG fused to GFP was analyzed by fluorescence microscopy (Fig. 3). Fluorescence of cells transfected with pVAX1GFP is significantly higher than cells transfected with any of the other constructs. Moreover, in this case, GFP is found throughout the cytoplasm and nucleus as described previously elsewhere [32]. Nuclear fluorescence was not observed in cells that received the ISG constructs.

Fig. 3
figure 3

Microscopy pictures of cells transfected with each of the constructed plasmids. Cells transfected with pVAX1GFP were set as the control of the transfection experiments (amplification ×100)

Differences in fluorescence intensities were also observed between the ISG encoding plasmids, especially between pe1aISGlamp and pISGlamp and cells transfected with pISG plasmid. The later cells were brighter and presented an even distribution of GFP in the cytoplasm. The low fluorescence levels of cells expressing the lamp sequence made it difficult to point out the exact localization of e1aISGlamp and ISGlamp proteins. However, in some of these cells it was possible to observe fluorescence not only in the cytoplasm, but also localized in vesicles. In the case of cells transfected with the secretory sequence, the fluorescence is more intense close to the cellular membrane. Lastly, no obvious differences were found between fluorescence of e1aISG and ISG proteins.

Quantification of protein expression was performed using flow cytometry. This method is presented here as an alternative method to immunofluorescence assays to assess protein expression and targeting. CHO cells transfected with both ISG and nTSA coding plasmids were analyzed. Fluorescent green cells were observed in cells transfected with each new construct, confirming that protein expression was successful. In addition, cells expressing the different fusion proteins were markedly lower (up to 100 times) when compared with the fluorescence of cells expressing the reporter gene GFP. Differences between the fluorescence levels of the different ISG proteins were also observed. The mean fluorescence of cells transfected with pISG was significantly higher (P < 0.05) when compared with cells transfected with pISGlamp and pe1aISGlamp (Fig. 4a). The fluorescence pattern of cells transfected with pTSAlamp and pe1aTSAlamp is also reduced when compared to the cytoplasmic protein encoded by pTSA (Fig. 4b). No statistically significant differences (P > 0.05) were observed in the mean fluorescence obtained when transfecting cells with plasmids pe1aISG (Fig. 4a) or pe1aTSA (Fig. 4b). Finally, a decrease in the mean fluorescence of cells transfected with plasmids that encode a secreted protein was only observed in cells transfected with pscTSA (Fig. 4b) and not with pscISG (Fig. 4a).

Fig. 4
figure 4

Mean fluorescence of cells transfected with a plasmid ISG and b nTSA constructs. Cells were collected at 30 h post-transfection. The averages and standard error of the mean shown are from three (ISG) and two (nTSA) independent assays, each one performed in duplicate. ANOVA (General Linear Model, Tukey’s comparison test) was performed in order to compare differences on the fluorescence levels between transfected plasmids. Fluorescence levels that are significantly different (P < 0.05) from the ones detected in cells transfected with pISG or pTSA plasmids are signed (*). The mean fluorescence values in cells transfected with pVAX1GFP were 1336 ± 238 and 1573 ± 30, respectively, used as control experiments to assess the fluorescence of cells transfected with ISG and TSA constructs. AU arbitrary units

Discussion

When combined with plasmid DNA, cationic lipids (lipoplexes), as the one used in this study, are internalized by cells using an endocytic pathway including a clathrin-dependent and -independent pathway and a caveolae-mediated uptake. Upon internalization, lipoplexes are trapped in endocytic vesicles that converge to late endosomes and finally to lysosomes where degradation occurs [33]. This topic was extensively reviewed elsewhere [34, 35]. The timing and the exact mechanisms responsible for plasmid DNA release from endo-lysosomal compartments into the cytoplasm are not clear, but this is considered to be one of the major limitation steps to transgene expression. Once in the cytoplasm, plasmid DNA is degraded by Ca2+ sensitive cytosolic nucleases that reduce even more the number of plasmid copies capable of reaching the nucleus [36]. In fact, 106 naked plasmid molecules are needed to transform a single cell, from which 102 to 104 are detected in the nucleus following plasmid delivery using cationic lipids [37]. The fact that only a minority of the total number of plasmid copies internalized by cells is detected in the nucleus is also a result of the processes that regulate plasmid uptake to this compartment. Plasmid DNA can be trapped inside the nucleus following the reorganization of the nuclear envelope in active dividing cells. However, since plasmid DNA is also found in non-mitotic cells other processes have to regulate nuclear import. In fact, nuclear entry can be mediated by DNA sequences containing binding sites for transcription factors [38, 39]. In conclusion, a number of physical and biochemical barriers determine the efficiency of gene expression. As demonstrative of the complexity of these processes, a recent study has shown that intracellular elimination and DNA release from the different non-viral vectors correlate poorly with transgene expression [40].

One of the objectives of this study was to assess if the addition of targeting sequences had introduced sensitive spots to nuclease attack to the vector backbone. The time point selected for the performed analysis is within the period when plasmid DNA is more vulnerable. Our group and others have demonstrated that the half-life of pDNA in cells following transfection using cationic lipids is approximately 20 h [31, 35, 41]. However, this time-frame can change depending on the carrier system used [40].

The addition of extra sequences to plasmids can introduce secondary structures more nuclease labile that may lead to a decrease in intracellular plasmid stability. The decreased resistance to nucleases action ultimately results in a lower transgene copy number, and therefore compromises gene expression [28]. Plasmid susceptibility to nuclease attack due to the addition of the selected targeting sequences, as well as the ISG gene, was unknown. Previous work has demonstrated that the GFP gene did not introduce sensitive regions, such as single-stranded secondary structures, into the pVAX1 vector (Invitrogen) [27, 28]. Since no differences in plasmid copy number were observed between cells transfected with each construct is reasonable to conclude that the addition of ISG gene sequence and the targeting sequences did not affect plasmid susceptibility to nuclease degradation. On the other hand, the increase in plasmid size up to 1.6 kb did not compromise plasmid DNA uptake.

As transfected cells presented similar plasmid content an approximate number of GFP transcripts were expected to be detected. However, a higher number of GFP transcripts were determined in cells transfected with pVAX1GFP, possibly due to the fact that the transcript is much smaller, ultimately contributing to a higher transcription rate and stability. Importantly, no significant differences were found between the ISG constructs. This means that these plasmids have similar transcription efficiencies and comparable mRNA stability, even though they contain different signal sequences.

The amount of expressed protein was determined indirectly by the amount of molecules with biological activity as measured by the fluorescence of GFP. Since LAMP sequence or its combination with e1a signal promotes protein transport to the lysosome where protein degradation occurs, the decrease in the fluorescence levels was expected. The data obtained suggest that protein sorting to the lysosomes was achieved. Microscopy analysis also indicates that this transport was effective, since fluorescence appeared to be localized inside vesicles, possibly part of the lysosomal pathway. On the other hand, transport to the endoplasmic reticulum does not result in protein degradation. Therefore, no significant decrease of fluorescence was likely to be observed. Again, this is supported by flow cytometry and microscopy data.

At the time of the analysis, only cells expressing the plasmid encoding TSA in conjunction with the secretory signal showed a decrease in the fluorescent levels. The intrinsic characteristics of each protein might help to explain these differences. The ISG protein has a predicted transmembrane domain located between the amino acids 469 and 491, as predicted by the TMHMM tool (available on http://www.cbs.dtu.dk/services/TMHMM/) [42]. The presence of this domain might retain the protein in the cell membrane, hampering or delaying its secretion to the extracellular space. In fact, a more intense fluorescence was observed near the cell membrane, which corroborates this hypothesis. On the other hand, no such type of domains is found in the TSA protein.

Since cells transfected with the different plasmids show similar levels of plasmid content and GFP transcripts, it is reasonable to assume that the differences observed in the fluorescence levels relatively to the control is the result of an effective transport of the model proteins to the expected compartments (extracellular space, lysosomes, and endoplasmic reticulum).

It has been demonstrated that an efficient antigen sorting to the MHC I and MHC II pathways improves DNA vaccines immunogenicity. The endogenous protein encoded by the DNA vector should not have access to the MHC II pathway that primarily processes exogenous antigens [43]. This limitation can be overcome if a targeting signal is used that promotes protein sorting to the lysosomes [44], compartments involved in the generation of epitopes to be presented to the immune system when complexed with MHC II molecules. On the other hand, when the antigen encoded by a DNA vaccine is expressed in the cytoplasm it can be processed in the context of MHC I pathway. However, protein sorting to the proteasomes [45] and to the endoplasmic reticulum has also been engineered to increase the efficiency of DNA vaccines [19] by increasing antigen loading to compartments involved in the MHC I pathway.

The sequences selected for this study have been used in conjunction with various antigens of both bacterial [46] and viral [21, 47, 48] origins, as well as part of therapeutic DNA vaccines against cancer [49]. This is illustrative of their general applicability in DNA vaccine design. However, it should be noted that differences might be observed in protein expression levels and transfection efficiencies if different cell lines and animal models had been used. Additionally, mRNA expression levels have been considered to be construct specific. Nevertheless, this work presents a systematic in vitro study on the influence of signal sequences with potential interest for DNA vaccines development at the level of plasmid uptake and susceptibility to degradation, mRNA transcription and protein expression.