Affinity protein purification

Protein engineering and production, as a widely spread methodological platform, is providing reagents for catalysis, drugs for pharmaceutical industries and in general, instruments for analytical research in structural biology, proteomics, interactomics and drug design, among others. If secreted, recombinant proteins produced in prokaryotic or eukaryotic cell factories are found diluted in the medium, or alternatively, they occur as minor components of extremely complex macromolecular mixtures if retained inside the cell. Therefore, they must be purified so as achieve conveniently high concentration and purity levels required for stable storage and characterization or for efficient use in biological systems or in catalysis. The identification of peptide tags for single-step affinity-based purification, upon end terminal fusion, has provided an advantageous tool that represents a faster, easy and cost-effective alternative to multiple-step chromatographic separations [1, 2]. Protein tagging allows the purification of virtually any protein without previous knowledge of its properties or those of its expected contaminants, since the capturing event relies on tag's rather than on protein's features.

Being absent in the natural target protein, affinity tags are expected to be removed upon purification, to keep the target protein sequence as natural as possible [3, 4]. During the upstream cloning strategies, recombinant proteins can be designed to contain target sites for trans-acting proteases [4] or alternatively, functional endoproteolytic peptides [5] to remove the additional peptides. The Tobacco Etch Virus TEV protease (TEVp) is probably the most used agent for trans tag removal, because of its high specificity for the TEVp target site (ENLYFQ↓G) compared to the most promiscuous behaviour of alternative enzymes such as factor Xa, enterokinase and some picornaviridae 3C proteases [4]. On the other hand, inteins are among the most used cis-acting tag-removal agents [68].

However, for technical convenience, the foreign peptide is often not removed after purification, as for instance, the C-position of the cleavage site in the recognition sequence of most endoproteases makes specially difficult the complete removal of C-terminal added stretches [1]. Obviously, the modification of the primary amino acid sequence is expected to have effects on physicochemical parameters of the protein such as the distribution of charged amino acids and the final conformation, and also on biological properties such as in intra- or intermolecular interactions, solubility and enzymatic activity. However, some observations suggest that such effects might be extremely soft or even convenient. In this context, a few additional end terminal amino acids seem not affecting the packing of the crystals or atomic interactions, leading to the correct 3D structure determination [9]. Furthermore, the presence of purification tags might positively affect the solubility, productivity and proteolytic stability of recombinant proteins [1014], which would provide an added value to these protein segments. These observations have prompted researchers to evaluate affinity tags not only for their usefulness as tools for downstream but also as midstream folding-assistant agents [1]. Finally, some analytical procedures to determine in vivo protein-protein interactions (such as the tandem affinity purification) are based on the presence in the model protein of affinity purification tags [15, 16], relying on the assumption that these additional segments do not dramatically influence the protein's natural capability to perform cross-molecular interactions.

Histidine-rich peptides in Biotechnology

Histidine-rich peptides (often H6 but also other peptide versions, Table 1) are probably the most used protein purification tags [1]. Being short sequences, His-tags do not add significant metabolic load to the protein production process and they can be easily incorporated to the protein by simple genetic engineering at the upstream level. Since the interaction between a His-tag and its ligand does not require any specific conformation, binding is feasible under either native or denaturing conditions (e.g. urea 8 M or guanidinium hydrochloride 6 M) [17] as well as under oxidizing or reducing conditions, resulting in high technological flexibility and easy scalability [18].

Table 1 Representative examples of His-rich peptides used in biotechnology as affinity tags for protein purification.

Immobilized metal affinity chromatography (IMAC) [19] is a separation principle based on the differential and reversible affinity of transition metal ions such as Zn2+, Cu2+, Ni2+, and Co2+ for histidines [20], due to the coordination bonds formed between metal ions and amino acid side chains exposed on the protein surface. Electron-donor atoms (N, S, O) present in the chelating compounds of the chromatographic support are capable of coordinating metal ions and forming metal chelates, which can be bidentate, tridentate, etc., depending on the number of occupied coordination bonds [21]. Four of the six coordination sites on the octahedral Ni2+ center are occupied by the nitrilotriacetic acid (NTA) ligand, and the remaining coordination sites are occupied by two of the six imidazole moieties in the H6-tag. The model in which the interaction of the metal ion with His-tag locates in residues n and n+2 of the tag is confirmed by the fact that IMAC ligands can bind to His-tags consisting of consecutive as well as to alternating histidine residues.

As the chelating ligand is used to fix the metal to agarose, iminodiacetic acid (IDA) was initially employed and it is still in use today in many commercial IMAC resins. This technology was improved by the invention of the chelating ligand NTA in the 1980s [22].

The amino acid histidine shows the strongest interaction with immobilized metal ion matrices, as electron donor groups on histidine imidazole ring readily form coordination bonds with the immobilized metal. In Biacore experiments, the dissociation rate of a H6-tagged protein to Ni-NTA was estimated to be 1 × 10-6 to 1.4 × 10-8 M at pH 7.0-7.4. The same study also showed that two histidines separated by either one or four other residues, are the preferred binding motifs [23]. Interestingly, selection of an optimum tag by a phage-displayed library showed that tags with only two histidine residues possessed chromatographic characteristics superior to those of the most commonly used H6-tag [24]. Adsorption of a protein to the IMAC support is usually performed in neutral or slightly basic media in which imidazole nitrogens in histidine residues are non-protonated. Elution of the target protein is achieved by protonation, ligand exchange or extraction of the metal ion by a stronger chelator, like EDTA.

Histidine-rich peptides in Nanomedicine

In a very different context, histidine-rich peptides are largely exploited in drug delivery as powerful endosomal escape agents, by their incorporation into drug-loaded nanoparticles or polyplexes to be internalized by target cells through endosome formation. During centripetal intracellular trafficking, most of the uptake routes converge into endocytic vesicles, where both vehicle and cargo may become enzymatically degraded when late endosomes are transformed into lysosomes [25]. Within endosomes, the imidazole group of histidine, with a pK around 6.0, gets protonated under the increasingly acid conditions. This step recruits Cl- ions with a consequent osmotic swelling and endosomal cracking (the 'proton sponge' model [26]), which results in the cytoplasmic release of the nanoparticle and escape from lysosomal enzymes. Histidine-rich peptides have been incorporated into polymers, liposomes and proteins (including virus-like particles, VLPs) (Table 2). In non-viral gene therapy, such histidine-empowered vehicles show up to almost 104-fold transgene expression improvement when compared with their corresponding histidine-lacking, parental versions [27]. Despite other membrane-active peptides, poly-histidines remain neutrally charged at neutral pH, avoiding non specific binding to serum proteins and consequent inactivation of the particle [28].

Table 2 Representative examples of His-rich peptides used in nanomedicine as endosomolytic agents.

The number of histidines and their distribution in the amino acid sequence of the tag determine endosomal escape efficiency of His-tagged vehicles and nanoparticles. Lo and Wang [27] showed that the endosomolytic activity of Tat-H10 fusion peptides could be 10-fold higher than Tat-H5 and Tat-H20 versions. Also, 10 His residues distributed at both sides of Tat peptide (C-5H-Tat-5H-C), are 102-fold more efficient than its equivalent C-Tat-10H-C. Although the optimal number of His residues is believed to be ranging between 5 and 10 (similar to the figures seen in His-based affinity tags, Table 1), the incorporation of only one His residue is sufficient to grant membrane-active properties to a drug delivery system [29]. Interestingly, although most of His-tags seem to favour intracellular targeting exclusively through proton sponge activities, LAH4 (KKALLALALHHLAHLALHLALALKKA) is believed to interact with the endosomal membrane at pH 7, due to its amphipathic nature, and disrupt its physical integrity upon subsequent protonation [30].

On the other hand, H6 has been recently found involved in intermolecular protein-protein interactions, leading to the self-assembling of C-terminal His-tagged building blocks into nanoparticles, useful as vehicles in non viral gene therapy [31, 32]. The nature of such organizing interactions is yet to be completely understood, and might involve other specific amino acids such arginines. Since the resulting nanoparticles seem to be more efficient in transgene expression when formed at slightly acidic pH values [32], its seems reasonable to speculate that protonated, overhanging H6 tails interact with anionic protein areas of siding building bocks and also with DNA. A deeper understanding of the interactions promoted by His-rich peptides could offer tools for the rational engineering of nanoparticle formation and DNA encapsulation, both issues of critical relevance in bionanomedicine [33, 34].

Conclusions

Protein production scientists are usually unaware of the potent biological properties of histidine-rich peptides for which they are used in nanomedicine (essentially endosomal disruption but also cross-molecular electrostatic interactions), which are activated at acidic pH. The same (or very similar) His-rich peptides used in affinity chromatography, incorporated into different type of nanoparticles (most ranging from H2 to H10, Table 2) to favour intracellular trafficking in drug delivery [25], are equally employed as affinity tags for separation and often not removed after protein purification (Table 1). These remaining agents might render unexpected protein behaviour, when exposed to cells or nucleic acids, and specific measures for control should be implemented in biological assays when technical obstacles make His-tag removal inconvenient upon protein purification.