A novel toolbox for E. coli lysis monitoring
- 2.6k Downloads
The bacterium Escherichia coli is a well-studied recombinant host organism with a plethora of applications in biotechnology. Highly valuable biopharmaceuticals, such as antibody fragments and growth factors, are currently being produced in E. coli. However, the high metabolic burden during recombinant protein production can lead to cell death, consequent lysis, and undesired product loss. Thus, fast and precise analyzers to monitor E. coli bioprocesses and to retrieve key process information, such as the optimal time point of harvest, are needed. However, such reliable monitoring tools are still scarce to date. In this study, we cultivated an E. coli strain producing a recombinant single-chain antibody fragment in the cytoplasm. In bioreactor cultivations, we purposely triggered cell lysis by pH ramps. We developed a novel toolbox using UV chromatograms as fingerprints and chemometric techniques to monitor these lysis events and used flow cytometry (FCM) as reference method to quantify viability offline. Summarizing, we were able to show that a novel toolbox comprising HPLC chromatogram fingerprinting and data science tools allowed the identification of E. coli lysis in a fast and reliable manner. We are convinced that this toolbox will not only facilitate E. coli bioprocess monitoring but will also allow enhanced process control in the future.
KeywordsBioprocess monitoring Data science tools Lysis Escherichia coli Chromatogram fingerprinting HPLC
Escherichia coli is one of the most popular host organisms for recombinant protein production (e.g., [1, 2]). However, strong induction of recombinant protein production results in great cell stress and high metabolic burden, potentially leading to cell death and lysis . Therefore, it is of utmost importance to monitor the physiological state of the cells to minimize product loss. Flow cytometry (FCM) is the predominant method to monitor and quantify E. coli cell death. However, FCM devices are expensive and therefore often not available. Furthermore, FCM measurements need manual intervention and often require time-consuming, offline sample preparation. In contrast, spectroscopic methods, such as RAMAN and near infrared spectroscopy (NIR), can be used for online monitoring [4, 5]. Owing to the high magnitude of multi-dimensional data derived from these methods, multivariate data analysis (MVDA) is used for data interpretation [6, 7]. However, the continuously changing media background, changing morphologies, as well as changing process parameters (e.g., aeration) cause inaccuracy in measurements and thus limit the applications of these methods. Thus, alternative strategies for bioprocess monitoring are needed.
In this study, we developed a novel toolbox based on UV chromatograms as fingerprints to identify E. coli cell lysis. To date, UV spectroscopy coupled to high pressure liquid chromatography (HPLC) is implemented for real-time monitoring in downstream processes . However, we hypothesized that UV chromatographic data of E. coli bioprocess samples contain information about impurity release and lysis events and thus can also be used in upstream processing. We followed the impurity pattern of nucleic acids at 260 nm as marker for cell lysis along different E. coli bioprocesses. We combined UV chromatographic data with chemometric methods to identify lysis which may be used to define the optimal time point of harvest.
Materials and methods
E. coli BL21 (DE3) (Life technologies, CA, USA) and the pET28a(+) expression vector were used for the production of the cytoplasmic recombinant single-chain antibody fragment (scFv).
In all cultivations, a minimal medium according to DeLisa  supplemented with 0.02 g/L Kanamycin was used. Three cultivations were carried out in a DASGIP multi bioreactor system with four glass bioreactors and a working volume of 2 L each (Eppendorf, Germany). Detailed information about this fermenter setup can be found elsewhere .
An overnight preculture was used for initiating the batch phase, followed by a fed-batch phase and a subsequent induction phase (addition of 0.1 mM IPTG). pO2 and temperature were controlled throughout cultivation at 30 % and 35 °C, respectively. The pH during batch and non-induced fed-batch was kept constant at 7.2. During the induced fed-batch, the pH was either kept constant at 7.2 (Run1), or ramped from 7.2 to 5.7 (Run2) or from 7.2 to 8.7 (Run3) as shown in Electronic Supplementary Material (ESM) Table S1. Samples were taken every hour throughout the induction phase for offline determination of cell death by FCM and for chromatogram fingerprinting.
FCM was carried out according to Langemann et al. . In short, cultivation broth was diluted to stay within the linear range of the detector of the FCM device (CyFlow® Cube 8 flow cytometer, Partec, Münster, Germany). After addition of the fluorescent dyes RH414 (abs./em. 532/760 nm, staining of all plasma membranes) and DiBAC4(3) (abs./em. 493/516 nm, membrane potential-sensitive dye for assessment of viability), data were collected using the software CyView Cube 15 and analyzed with the software FCS Express V4 (DeNovo Software, Los Angeles, CA, USA). The error in FCM measurements was always below 5 %.
Multivariate data analysis
A modular HPLC setup (PATfinder™) with an auto-sampler (Optimas), pump module (Azura P 6.1 L), a multi-wavelength UV detector (Azura MWD 2.1 L) and a monolithic CIMac QA column (0.1 mL) was purchased from BIA separations (Ljubljana, Slovenia). Cell-free culture supernatants were diluted 1:5 with loading buffer (50 mM Tris-HCl, pH 8; AEX-A) to avoid deviations in the background matrix. Then, 50 μL of the prepared samples were loaded onto the column and bound proteins and nucleic acids were eluted using a linear gradient with 50 mM Tris-HCl + 1 M NaCl, pH 8 (AEX-B). Summarizing, column equilibration was done for 20 column volumes (CVs) with AEX-A, followed by sample injection, 10 CVs post-injection wash with AEX-A and elution with a linear gradient of AEX-B for 20 CVs. The time required for acquiring chromatographic data of one sample was shorter than 5 min. The column was cleaned with 1 M NaOH + 2 M NaCl for 10 CV after each sample to avoid carry-over. The flow velocity was kept constant at 283 cm/h. UV chromatographic data at 260 nm were recorded to follow release of nucleic acids. The chromatographic data were logged at a frequency of 5 Hz.
UV chromatographic raw data are usually attributed with shifts along the retention time and the baseline, which both strongly influence further data analysis. In order to overcome these shortcomings, peak alignment and baseline correction were done using icoshift  and first-order derivative, respectively. The preprocessed chromatographic UV data were then arranged as chromatogram fingerprints for further data analysis. Chromatogram fingerprints can be defined as a set of preprocessed overlaid chromatographic data which can be compared to identify and explain phenomena in a process. In our study, mean-centering and scaling of the UV chromatograms at 260 nm as fingerprints were done prior to performing PCA using SIMCA (Umetrics, Umea, Sweden).
Pattern recognition using PCA
Principal component analysis is a widely used exploratory technique which helps in decomposing huge datasets such as the matrix X of the chromatogram fingerprints. The matrix X is represented after PCA by few latent variables, called principal components (PCs). The transformation of X to PCs results in different attributes that are associated with X, called scores and loadings. The loadings of the PCs provide an overview of the variability in the X matrix. In general, the first PCs explain most of the variance in X. The loadings explain at which retention time the variance in the chromatographic data was significant. For example, the first loading would show at which retention time the variance in the chromatographic data was high. An overview of scores plotted along different PCs reveals groupings/clusters explaining similar trends and/or deviations between different samples in X.
Although the PCA score plots can be interpreted to monitor bioprocesses with respect to various PCs , multi-dimensional analysis of scores and loadings is cumbersome. Therefore, we used a univariate statistic (Hotelling’s T2) from the PCA model to follow deviations from pre-defined operating conditions in the E. coli bioprocesses .
Preprocessing of chromatographic data was done in MATLAB R2015a v8.5 (Mathworks, MA, USA). Pattern recognition using PCA was done in SIMCA v13.0 (Umetrics, Umea, Sweden).
Data acquisition and preprocessing
UV chromatographic data were acquired using a UV-VIS detector at 260 nm. After preprocessing, UV chromatographic data at 260 nm were arranged as chromatogram fingerprints as shown in ESM Fig. S1.
Pattern recognition using PCA
The FCM offline data and the Hotelling’s T2 statistics, calculated from the PCA model, are shown in Fig. 2. The Hotelling’s T2 statistics showed clear deviations from the control limit in each bioprocess. In fact, these deviations happened at the same time when cell death increased (indicated by thin dotted lines in Fig. 2). Apparently, cells started to die at different time points due to the pH ramps. Cell death resulted in lysis and thus in the release of impurities (nucleic acids), which we were able to reliably detect by UV chromatograms as fingerprints and combined data analysis. Based thereon, the time point at which the bioprocess started to deviate from normal operating condition was defined as the optimal time point of harvest. With the implementation of this novel monitoring toolbox, online detection of physiological events in the bioreactor is possible, and cumbersome offline analytics along bioprocesses is minimized.
We implemented a novel toolbox comprising UV chromatogram as fingerprints and chemometric techniques to monitor cell death in E. coli bioprocesses and to define the optimal time point of harvest.
The novelty of this approach is the use of whole UV chromatogram as fingerprints, rather than single chromatogram peaks, in combination with multivariate data analysis (MVDA) tools for monitoring of bioprocesses. Chromatogram fingerprinting approaches have been only used in chemical formulations and in some downstream bioprocesses so far, but for the first time we showed the applicability of this technique in upstream process monitoring. We envision the implementation of this toolbox for monitoring different unit operations in a bioprocess, such as bioreactor cultivations, harvesting, and product purification, which will facilitate continuous bioprocessing and process development.
Open access funding provided by TU Wien (TUW). The authors would like to thank BIA separations (Slovenia) for providing the columns and technical support.
David Wurm and Vignesh Rajamanickam designed the study. David Wurm, Christoph Slouka, and Vignesh Rajamanickam conducted the experiments and analyzed the data. Vignesh Rajamanickam developed the model. David Wurm, Oliver Spadiut, and Vignesh Rajamanickam wrote the paper. Christoph Herwig and Oliver Spadiut supervised the work.
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
- 7.Rathore AS, Parr L, Dermawan S, et al. Large scale demonstration of a process analytical technology application in bioprocessing: use of on-line high performance liquid chromatography for making real time pooling decisions for process chromatography. Biotechnol Prog. 2010;26:448–57. doi: 10.1002/btpr.320.Google Scholar
- 8.Rathore AS, Yu M, Yeboah S, Sharma A. Case study and application of process analytical technology (PAT) towards bioprocessing: use of on-line high-performance liquid chromatography (HPLC) for making real-time pooling decisions for process chromatography. Biotechnol Bioeng. 2008;100:306–16. doi: 10.1002/bit.21759.CrossRefGoogle Scholar
- 14.Eriksson L, Kettaneh-Wold N, Johansson E, et al. Multi- and megavariate data analysis. Umea: MKS Umetrics AB; 2006.Google Scholar
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.