Aggregation and associated conformational stress of cell factories (both prokaryotic and eukaryotic) are major concerns in recombinant protein production, resulting in low yields, unstable production and limited solubility and biological activity of the products [19]. Basic research on protein folding and the routine implementation of several analytical procedures such as circular dichroism, mass spectrometry and infrared spectroscopy (mostly incorporated from amyloid research) [4, 1016] have expanded our understanding of how polypeptide chains cross-interact and aggregate in vivo. In bacteria, probably the most studied cell factories, aggregation as inclusion bodies, a quite common event during production of heterologous polypeptides [17, 18], is now observed as a complex physiological event in which cellular agents, including chaperones [6, 19, 20], proteases [2123] and actin-like proteins [24] are coordinately acting [24, 25] in the frame of the cell's protein quality control machinery [2628]. Despite aggregation as inclusion bodies might represent a source of relatively pure proteins for further refolding or extraction [2933], or unexpectedly, a new type of nano-microparticulate biomaterials for biotechnological and biomedical applications [3439], the use of recombinant proteins for most of biotechnological and biomedical applications requires fully soluble protein versions. A particular issue in recombinant protein aggregation is the occurrence of soluble aggregates (less apparent that large aggregates), that are being progressively recognized in production processes. These soluble clusters adopt a spectrum of forms (mainly fibrilar, spherical or amorphous) [40, 41] and might be the in vivo physiological precursors and structural components of bacterial inclusion bodies [24, 42]. Very different approaches have been explored at upstream, midstream and downstream levels to minimize aggregation during recombinant protein production (Figure 1). Such strategies, eventhough being mostly a trial-and-error process, often result in significant improvements of protein solubility [4346].

Figure 1
figure 1

Tool boxes through which protein solubility can be enhanced at different stages of protein production and postproduction pipelines. Targets for improvement during in vivo administration are summarized in the red framed box, some of them being modulated by protein aggregation. Appropriate codon selection [82, 83], using weak promoters or low copy number plasmids and protein engineering [84] are the most common upstream strategies (yellow box). Growth at sub-optimal temperatures [85, 86], mild induction of gene expression, co-production of chaperones [8789] or protein production in protease-deficient strains [90, 91] or in mutants with altered redox properties [92] favor correct protein folding (orange box). Buffers and purification conditions should be optimized as per protein basis to prevent aggregation [9397]. Alternatively, soluble protein species can be obtained by refolding inclusion body proteins [3032] or by extracting functional proteins from inclusion bodies by mild procedures [29] (green box). Once purified, aggregation during storage or administration of protein drugs can be inhibited by appropriate excipient formulations or by chemical modification [50, 60, 62, 98] (blue box). Also, the use of delivery systems, either through protein adsorption onto nanoparticles, nano and micro encapsulation or embedding in biocompatible materials stabilize proteins, expand their half-life in the body and permit a sustained release, resulting in enhanced bioavailability and reduced toxicity [63, 64, 99]. Upstream protein engineering strategies that enhance solubility during production can also affect aggregation and performance of protein drugs upon administration. Also, by this approach, novel functions that improve pharmacological performance of proteins can be gained without necessarily enhancing solubility (bottom, green framed box).

Desirably, soluble versions of recombinant proteins should keep such soluble status in post-production stages, that is, during storage and use. This need is specially acute in the case of proteins intended for therapeutic uses [47, 48]. Protein drugs are commonly administered parenterally [49], what makes protein aggregation in stocks or upon administration a main concern in the Pharma industry (see for instance, http://www.eahp.eu/content/download/25193/164355/.../CoverStory20-21.pdf.pd). The high concentration at which proteins drugs are stored and administered [50] specifically favors aggregation [51]. In this context, diverse analytical procedures have been developed and specifically adapted to the detection of therapeutic protein aggregation [5254]. Importantly, aggregation does not only render drug inactivation during storage, and fast clearance, reduction of activity, limited bioavailability and proteolytic digestion upon administration, but it also stimulates undesired immunogenicity [55]. This is a critical issue in clinics as severe side effects observed upon prolonged protein administration (as in the case of insulins, interferons, erythropoietin and growth hormone) are antibody-dependent [5659].

Chemical modification of proteins and the use of appropriate excipients (Figure 1) are the most taken approaches for protein drug stabilization [6062]. Obviously, emerging concepts in Nanotechnology, Nanomedicine and in Material Sciences offer new biocompatible vehicles for protein encapsulation or embedding, mainly at the nanoscale, through which the stability, tissue targeting and bioavailability during drug delivery are dramatically enhanced [63, 64]. Many among those such nanostructured materials are from bacterial origin [65].

Being proteins flexible molecules suitable to be re-designed by genetic methods, upstream protein engineering, one of the main approaches to prevent aggregation during production (Table 1, up), is also useful to stabilize protein drugs during use (Table 1, bottom). In addition, modification of the protein primary sequence permits a fine tuning of protein features such as oligomerization, activity, cell targeting and cell penetration, that represent additional values in the performance of a protein drug (Table 1; Figure 1, bottom). In this regard, protein engineering is revealed as an extremely flexible approach to enhance the stability of proteins during production, storage and use, but also to improve their performance in in vivo uses. Reduction of aggregation is expected to minimize immunogenicity, increase proteolytic stability, improve bioavailability and limit side-effects, as aggregation has a pivotal role in all these issues [55, 59, 6669]. In addition, protein modification can offer added values to protein drugs, by conferring novel functions that improve pharmacological performance without necessarily enhancing solubility (Figure 1, green framed box). These include cell or tissue targeting or enhanced cell penetration by the fusion to a cell receptor ligand or an antibody [7074], enhancing half-life and bioavailability by fusion to transferrin [75], albumin [76], or albumin-binding peptides [77] and crossing the brain-blood barrier (BBB) by the incorporation of cationic peptides [78]. Creating multifunctional proteins by the appropriate combination of protein domains in a single polypeptide chain is being especially explored for the construction of protein-only artificial viruses, in which the therapeutic nucleic acids are encapsulated by chimerical protein building blocks [72, 7981]. Further exploration of protein engineering focused on post-production issues is strongly required and it should allow the emergence of optimized drugs to fulfill their increasing demand.

Table 1 Protein engineering strategies to reduce aggregation or derived effects during either production or administration, illustrated by representative examples.

Conclusions

Stability and solubility of recombinant proteins is a critical issue at both production and post-production stages. For a biomedical use of proteins as pharmaceuticals, high solubility not only supports stability but it also enhances bioavailability and reduces immunogenicity and undesired toxic effects. Among other approaches to stabilize protein drugs, such as chemical modification, proper formulation and encapsulation, protein engineering is a very flexible route to improve protein folding during production and reduce aggregation during storage and in vivo. Furthermore, the modification of protein primary sequence permits to confer additional functional values, such as binding to serum albumin, binding to cell surface receptors and cell membrane (or BBB) crossing, thus improving biodistribution, expanding the half-life and enhancing the biological potential of the drug. The wide spectrum of possibilities of post-production-addressed protein engineering is probably to be yet realized.