Real-time measurements and adjustments of critical process parameters are essential for the precise control of fermentation processes and thus for increasing both quality and yield of the desired product. However, the measurement of some crucial process parameters such as biomass, product, and product precursor concentrations usually requires time-consuming offline laboratory analysis. In this work, we demonstrate the in-line monitoring of biomass, penicillin (PEN), and phenoxyacetic acid (POX) in a Penicilliumchrysogenum fed-batch fermentation process using low-cost microspectrometer technology operating in the near-infrared (NIR). In particular, NIR reflection spectra were taken directly through the glass wall of the bioreactor, which eliminates the need for an expensive NIR immersion probe. Furthermore, the risk of contaminations in the reactor is significantly reduced, as no direct contact with the investigated medium is required. NIR spectra were acquired using two sensor modules covering the spectral ranges 1350–1650 nm and 1550–1950 nm. Based on offline reference analytics, partial least squares (PLS) regression models were established for biomass, PEN, and POX either using data from both sensors separately or jointly. The established PLS models were tested on an independent validation fed-batch experiment. Root mean squared errors of prediction (RMSEP) were 1.61 g/L, 1.66 g/L, and 0.67 g/L for biomass, PEN, and POX, respectively, which can be considered an acceptable accuracy comparable with previously published results using standard process spectrometers with immersion probes. Altogether, the presented results underpin the potential of low-cost microspectrometer technology in real-time bioprocess monitoring applications.
The precise control of bioprocesses is essential to achieve an optimum quality and yield of the desired product. A key to achieve this goal are analytical methods that provide real-time information of the current process state and critical process parameters. For some of these parameters such as pH, temperature, and dissolved oxygen concentration, real-time measurement techniques are readily available, while other parameters, such as substrate and product concentrations, usually require time-consuming offline measurements [1,2,3], which furthermore results in additional sources of analytical error due to sampling and sample preparation.
Many efforts have been made in the past to enable real-time monitoring of these parameters using different types of sensors including optical , capacitance , and ultrasound-based sensors  as well as nuclear magnetic resonance measurements . Also, spectroscopic measurement techniques such as UV-Vis spectroscopy , fluorescence spectroscopy , and Raman spectroscopy  have already been utilized for bioprocess monitoring in the past.
One of the most promising sensing approaches for in-line application is near-infrared (NIR) and mid-infrared (MIR) spectroscopy, which, combined with multivariate data analysis, have been previously used for the real-time measurement of various process parameters [11, 12]. Spectral information is usually acquired using Fourier transform infrared (FTIR) spectrometers combined with a probe immersed into the bioreactor [13, 14]. The advantages of spectroscopic methods for bioprocess monitoring are manifold and include real-time capability, its non-destructive nature, easy maintenance, and the possibility for simultaneous determination of multiple target analytes in the complex fermentation broth. However, the commonly used FTIR spectrometers and measurement probes are costly and the probes usually have to be immersed into the fermentation broth, which makes sterilization of the probes a necessity.
With the recent advent of novel miniature spectrometer technology (“microspectrometer”) based on micro-electromechanical systems (MEMS), compact, robust, and cost-effective NIR spectrometers became available and significantly lowered the hardware costs for multiple NIR sensing applications . This makes NIR spectroscopy much more attractive and creates a potential to enable completely new applications for the measurement technique. Different instrument technologies are available, with the most widespread being Fourier transform, Fabry-Pérot (FP), and dispersive spectrometers . In recent years, the potential of microspectrometer technology has already been demonstrated in spectroscopy [17, 18], hyperspectral imaging , and compressive sensing  applications.
In this work, it will be demonstrated that FP-based microspectrometers can be utilized for real-time bioprocess monitoring, even without the need for an immersion probe via measuring through the glass wall of a bioreactor in reflection geometry. This enables a completely non-invasive measurement and therefore significantly increases the practicability and convenience of NIR process monitoring.
Materials and methods
P. chrysogenum fermentation and reference analytics
Fed-batch experiments using a spore suspension of an industrial Penicillium chrysogenum strain for penicillin production were performed in a 2.7 L parallel bioreactor system (Eppendorf AG, Germany). After increase in the pH level, which indicates the end of the batch process, 300 mL of cell broth was transferred to bioreactors filled with 1700 mL defined fed-batch media (for the detailed composition of batch and fed-batch media, the reader is referred to ).
Stirrer speed (350–850 rpm) and oxygen addition to pressurized air were used to keep the dissolved oxygen above 40%, while the aeration rate was kept constant at 1 vvm. During fermentation, the pH was sustained at 6.5 by addition of KOH and H2SO4 while the temperature was controlled at 25 °C. The supplied feeds were glucose (500 g/L), penicillin V precursor (80 g/L phenoxyacetate), and the nitrogen source (100 g/L (NH4)2SO4). The process was strictly glucose limited, whereas phenoxyacetate and nitrogen were kept at non-limiting concentrations by adjusting their feed rates.
Samples for offline reference measurements were taken every 8–10 h during the fermentation process. Determination of the penicillin V (PEN) and phenoxyacetate (POX) concentrations in the filtered broth was performed by high-performance liquid chromatography (HPLC) using a ZORBAX C-18 Agilent column and 28% acetonitrile, 6 mM H3PO4, and 5 mM KH2PO4 as an elution buffer. POX eluted after 2.75 min and was quantified between 0.00275 and 0.275 g/L. PEN eluted after 10.00 min and the calibration range was between 0.00468 and 0.468 g/L. For analysis, a 1:40 dilution of media samples with 1% citric acid was injected. For biomass determination, cells were separated from a 5 mL culture broth using centrifugation at 4800 rpm for 10 min at 4 °C, washed with 5 mL deionized water and dried at 105 °C. The remaining substance was then measured gravimetrically to get the amount of dried biomass.
Robust, compact, and low-cost NIR Fabry-Pérot (FP) microspectrometers (NIRONE Sensors, Spectral Engines, Finland) were attached to the glass wall of the bioreactors before the fermentation process was started. Various types of FP microspectrometers covering different wavelength regions are available and the selection of the right wavelength range is crucial for extracting relevant information. NIR monitoring of biomass, PEN, and POX concentrations in the same fermentation process using a broadband Fourier transform NIR spectrometer coupled to an immersion probe was already conducted in the past . Therefore, the published results were used to identify the most relevant wavelength regions for the analytes of interest. It was concluded that the most relevant available wavelength regions were 1350–1650 nm (“sensor 1”) and 1550–1950 nm (“sensor 2”). A photograph and a schematic drawing of the experimental setup are shown in Fig. 1. This setup enables a probeless and completely non-invasive acquisition of NIR spectra in reflection geometry. Spectra were taken every second and then averaged over approximately 60 s to lower the influence of short-lived disturbances such as air bubbles in the broth while still allowing for real-time monitoring of the fermentation process. As a light source, the built-in halogen lamps of the microspectrometers were used, which means no external light source was necessary to conduct the measurements. In between the two monitored fermentation processes, the spectral sensors were completely removed and reattached to the bioreactor, possibly resulting in slightly different sensor positions and/or orientations. The resulting differences in the recorded spectral signals are eliminated by the calculation of absorbance spectra using the spectrum before inoculation of the reactor with the P. chrysogenum batch culture for each fermentation process as reference as well as the applied spectral pre-processing.
Multivariate data analysis and partial least squares regression
All spectra were pre-processed by calculating the 1st derivative and employing a 2nd-order polynomial fit on a window size of five using a Savitzky-Golay (SavGol) filter. For the monitoring of biomass concentration, additionally, a standard normal variate (SNV) normalization was applied. For each sensor, the entire spectral range was used for all calculations and modelling procedures. Principal component analysis (PCA) and partial least squares (PLS) regression were carried out in Unscrambler X 10.5.1 (Camo Analytics, Norway). All PLS models were fitted using data from one batch and validated using data from another, independent batch produced two months later. A total of 18 and 15 reference values for each analyte were available for the calibration and validation batch, respectively. The number of latent variables used in the PLS models was optimized using leave-one-out cross-validation.
Results and discussion
Two full fermentation runs were pursued in order to test the suitability of MEMS-based microspectrometer technology for probeless monitoring of biomass, PEN, and POX in a P. chrysogenum batch fermentation. Figure 2a shows raw and smoothed (SavGol) 1st derivative spectra from sensor 1 and sensor 2 recorded over the course of the calibration batch. The most prominent changes in the pre-processed spectra were found around 1400 nm (sensor 1) and 1875 nm (sensor 2), which can be mainly attributed to CHx and ROH oscillations in the second and first overtone regions, respectively . This is in line with previous measurements done in a similar process environment . On the other hand, the changes visible in the raw spectra are rather unspecific, and can be mainly attributed to scattering effects. This nicely underpins the importance of proper pre-processing in order to uncover the relevant information in the spectra. A principal component analysis (PCA) of the fused spectra of both sensors was undertaken in order to investigate if the data from the two batches are consistent (Fig. 2b). As expected, the time evolution of the batches is captured by the first principal component (PC1), which accounts for most of the observed variation (93%) in the data. Notably, the second batch trajectory starts on a slightly lower and ends on a higher PC1 score. This indicates a higher inoculum size and lower biomass in the beginning and the end of the batch, respectively, which is confirmed by the corresponding dry weight analysis (results not shown). On the other hand, the first batch displays high variability along PC3 towards the end of the batch, which can be attributed to elevated scattering effects towards higher optical densities (occurring at higher biomass concentrations).
The pre-processed spectra from the first batch process, combined with the corresponding reference measurements, were used to establish PLS models for the prediction of biomass, PEN, and POX concentrations. Reference measurements were done in triplicates and three subsequently recorded spectra were used for each averaged reference value in order to facilitate the identification of potential outliers. However, in the data presented herein, no clear outliers were identified and all acquired data points were used for both calibration and validation. In order to compare the two sensors and identify the most suitable one, separate models were calculated for each sensor individually. In addition, models based on the fused spectra from both sensors were established. The regression coefficient vectors for biomass, PEN, and POX that yield the best predictive performance are shown in Fig. 3. The regression vectors for biomass and POX were calculated using spectral data from both sensors, utilizing three and four latent variables (LVs), respectively. For the calculation of PEN, the data from sensor 1 (i.e., the first 31 data points) was sufficient and the best results were achieved using four LVs.
The regression coefficient vector for biomass determination has clear extrema at the positions of strongest changes in the pre-processed spectra at around 1400 nm and 1875 nm (Fig. 2), which are indicated with a gray background in Fig. 3. The absorption at around 1400 nm has a negative and the absorption around 1875 nm has a strong positive contribution. Additionally, the extremum at 1640 nm is highlighted to emphasize the different spectral dependencies of the calculated regression vectors for PEN, POX, and biomass. This indicates that the models are responding to different chemical signatures in the spectral data, despite them showing similar concentration trends as shown below in Fig. 5. The differences in the spectral responses in the regression vectors fit nicely to the observed differences in the absorption spectra of PEN and POX in water published elsewhere . The predictive performance for each of the established models as well as the number of latent variables (# LVs) is summarized in Table 1.
Except for PEN, the best correspondence between measured and predicted values in terms of coefficient of determination (R2) and cross-validation error (RMSECV) was obtained when fusing the spectra from both sensors achieving 0.98 and 0.93 (R2) and 1.64 g/L and 0.4 g/L (RMSECV) for biomass and POX, respectively. When tested on data from the second batch, the corresponding models achieved high accuracies for prediction of biomass and POX, clearly outperforming the models established using only the spectra from either one of the single sensors. In contrast, prediction of PEN was most accurate when using only the sensor operating in the 1350–1650 nm regime, whereas data fusion with the spectra from the second sensor yielded poor predictive performance irrespective of the number of LVs included in the model. This can also be seen in the graphs in Fig. 4 where the values given by the regression models are plotted against the corresponding reference measurements for both cross-validation (blue) of the calibration data and predictions for the validation batch (red).
A reasonably good RMSECV value of 2.19 g/L was also achieved for the PLS model with only spectral data from sensor 1 for the biomass content (upper left in Fig. 4), but when the model was applied to data from the validation batch, poor agreement between model and reference measurement was observed (RMSEP = 10.51 g/L). Upon a closer look on the data, it becomes clear that the main error in the model values stems from an offset and a slightly wrong slope. As has been shown elsewhere , additive PLS (aPLS) modelling can be used in a scenario like this to significantly reduce the errors of the model values by applying an additional PLS regression to the residuals of the initial regression model. Indeed, if aPLS is used to correct for the slight changes in measurement conditions between the first and second batch, greatly improved RMSEP values for the biomass content of about 1.56 g/L can be achieved (regression curves not shown). This value is on par with the RMSEP achieved with the PLS model using fused spectral data from both spectral sensors, but requires an additional modelling step.
Figure 5 shows the time-resolved predictions of the best performing models (highlighted in gray in Fig. 4) along with the offline reference values over the course of batch 2 (validation batch). It can be seen that biomass and PEN are overestimated especially in the first 20 h of the fermentation, while the penicillin V sidechain, POX, seems to be estimated correctly in this timeframe. The subsequent sudden increase in POX concentration, indicated by three of the measurements between 20 and 60 h, points towards possible outliers since they have quite a large estimated error and no significant POX addition occurred in this timeframe.
The normalized root mean squared error (NRMSE) of the models for the three analytes, biomass, PEN, and POX, was calculated by normalizing the RMSEP with the total range of the reference data. This calculation yielded values of 9.8%, 18.0%, and 15.9%, respectively.
In order to judge the performance of the established PLS models, the NRMSE of the validation batch was compared with the estimated error of the reference analytics. This error was assessed by analyzing the reference measurements from several similar batch processes and calculating the mean relative standard deviation for each value. This leads to relative errors of 5.4%, 7.3%, and 5.7% for biomass, PEN, and POX, respectively. The relative errors of the reference measurement follow a similar trend as the NRMSE of the established models. In both cases, PEN shows the largest while biomass shows the smallest relative error. By multiplying these values with the mean reference value for the validation batch, the absolute errors for this batch can be estimated to be around 0.86 g/L, 0.39 g/L, and 0.15 g/L for biomass, PEN, and POX, respectively.
It should be noted here, that since only one sample was taken from the bioreactor for each measurement, the sampling error, which is one of the main sources of error in analytical chemistry, is not considered in this estimation. Here, only the measurement error due to sample preparation and reference instrumentation is covered by the stated error bars. Therefore, this can only be seen as a lower boundary of the deviation from the offline measurement to the actual value of the analyte in the fermentation broth. The actual absolute errors of the presented method can thus be expected to be even lower.
When comparing the achieved RMSEP values to previously published prediction errors of models that were calculated using NIR spectra acquired with an invasive in-line measurement probe, the achieved values stack up even better. For example, an RMSEP value of 1.39 g/L for biomass in a fed-batch Escherichia coli process  and 2.62 g/L, 0.34 g/L, and 0.51 g/L for biomass, PEN, and POX, respectively, for a P. chrysogenum fermentation  were reported. Except for the prediction of the PEN concentration, the values achieved in this work are on par with the previously reported results from invasive NIR measurements.
Conclusions and outlook
The potential of non-invasive NIR spectroscopic measurements in reflection geometry through the glass wall of a bioreactor for real-time bioprocess monitoring was successfully demonstrated. This was achieved by acquiring spectral data using novel NIR microspectrometer technology that is both low-cost and robust. Spectral data from two microspectrometer modules covering different wavelength ranges as well as offline reference data were used for calibrating PLS regression models for three different analytes (biomass, PEN, and POX) in a P. chrysogenum fed batch process. Validation of the established models was carried out using data from an independent batch process. Especially with regard to cost, size, and contamination risk, this approach is highly preferable over conventional NIR spectrometers connected to a measurement probe submerged into the fermentation broth while achieving similar performance. The reported approach is widely applicable and could give new insights into various different bioprocesses used in different industrial as well as scientific applications and allow for a cost-effective online monitoring and process control.
Comparison of different PLS models calibrated with single sensors and the fused spectral data of both NIR sensor modules showed that for two of the three analytes (biomass and POX), the model calculated with the fused spectral data showed the best performance. This hints towards the possibility to significantly improve the models when a third or fourth spectral sensor is used to widen the observed spectral range. This could also lead to better model performance for the determination of PEN concentration which was slightly worse than the one achieved previously via invasive measurements.
Another possibility would be to improve the performance of the multivariate analysis by using more elaborate regression methods than classical PLS. For example, domain invariant PLS (di-PLS) which can be useful to decrease influence on the model prediction quality stemming from changes in environmental conditions, instrumental response, or sample matrix , could be applied to the data. This however is subject of future research.
All data generated and analyzed during the current study are available from the corresponding author on reasonable request.
Vaidyanathan S, Macaloney G, Vaughan J, et al. Monitoring of submerged bioprocesses. Crit Rev Biotechnol. 1999;19:277–316. https://doi.org/10.1080/0738-859991229161.
Ulber R, Frerichs J-G, Beutel S. Optical sensor systems for bioprocess monitoring. Anal Bioanal Chem. 2003;376:342–8. https://doi.org/10.1007/s00216-003-1930-1.
Biechele P, Busse C, Solle D, et al. Sensor systems for bioprocess monitoring. Eng Life Sci. 2015;15:469–88.
Kiviharju K, Salonen K, Moilanen U, et al. On-line biomass measurements in bioreactor cultivations: comparison study of two on-line probes. J Ind Microbiol Biotechnol. 2007;34:561–6. https://doi.org/10.1007/s10295-007-0233-5.
November EJ, Van Impe JF. Evaluation of on-line viable biomass measurements during fermentations of Candida utilis. Bioprocess Eng. 2000;23:473–7. https://doi.org/10.1007/s004499900179.
Resa P, Elvira L, De Espinosa FM. Concentration control in alcoholic fermentation processes from ultrasonic velocity measurements. Food Res Int. 2004;37:587–94. https://doi.org/10.1016/j.foodres.2003.12.012.
Brecker L, Weber H, Griengl H, Ribbons DW. In situ proton-NMR analyses of Escherichia coli HB101 fermentations in 1H2O and in D2O. Microbiology. 1999;145:3389–97. https://doi.org/10.1099/00221287-145-12-3389.
Noui L, Hill J, Keay PJ, et al. Development of a high resolution UV spectrophotometer for at-line monitoring of bioprocesses. Chem Eng Process. 2002;41:107–14. https://doi.org/10.1016/S0255-2701(01)00122-2.
Faassen SM, Hitzmann B. Fluorescence spectroscopy and chemometric modeling for bioprocess monitoring. Sensors (Switzerland). 2015;15:10271–91.
Ávila TC, Poppi RJ, Lunardi I, et al. Raman spectroscopy and chemometrics for on-line control of glucose fermentation by Saccharomyces cerevisiae. Biotechnol Prog. 2012;28:1598–604. https://doi.org/10.1002/btpr.1615.
Lourenço ND, Lopes JA, Almeida CF, et al. Bioreactor monitoring with spectroscopy and chemometrics: a review. Anal Bioanal Chem. 2012;404:1211–37. https://doi.org/10.1007/s00216-012-6073-9.
Claßen J, Aupert F, Reardon KF, et al. Spectroscopic sensors for in-line bioprocess monitoring in research and pharmaceutical industrial application. Anal Bioanal Chem. 2017;409:651–66. https://doi.org/10.1007/s00216-016-0068-x.
Koch C, Posch AE, Goicoechea HC, et al. Multi-analyte quantification in bioprocesses by Fourier-transform-infrared spectroscopy by partial least squares regression and multivariate curve resolution. Anal Chim Acta. 2014;807:103–10. https://doi.org/10.1016/J.ACA.2013.10.042.
Arnold SA, Gaensakoo R, Harvey LM, McNeil B. Use of at-line and in-situ near-infrared spectroscopy to monitor biomass in an industrial fed-batch Escherichia coli process. Biotechnol Bioeng. 2002;80:405–13. https://doi.org/10.1002/bit.10383.
Ebermann M, Neumann N, Hiller K, et al (2016) Tunable MEMS Fabry-Pérot filters for infrared microspectrometers: a review. In: Piyawattanametha W, Park Y-H (eds). International Society for Optics and Photonics, p 97600H.
Antila J, Tuohiniemi M, Rissanen A, et al. MEMS- and MOEMS-based near-infrared spectrometers. In: Encyclopedia of analytical chemistry. Chichester: John Wiley & Sons, Ltd; 2014. p. 1–36.
Erfan M, Sabry YM, Sakr M, et al. On-chip micro–electro–mechanical system Fourier transform infrared (MEMS FT-IR) spectrometer-based gas sensing. Appl Spectrosc. 2016;70:897–904. https://doi.org/10.1177/0003702816638295.
Wiedemair V, Mair D, Held C, Huck CW. Investigations into the use of handheld near-infrared spectrometer and novel semi-automated data analysis for the determination of protein content in different cultivars of Panicum miliaceum L. Talanta. 2019;205:546–53.
Kilgus J, Langer G, Duswald K, et al. Diffraction limited mid-infrared reflectance microspectroscopy with a supercontinuum laser. Opt Express. 2018;26:30644. https://doi.org/10.1364/OE.26.030644.
Gattinger P, Kilgus J, Zorin I, et al. Broadband near-infrared hyperspectral single pixel imaging for chemical characterization. Opt Express. 2019;27:12666. https://doi.org/10.1364/OE.27.012666.
Posch AE, Herwig C. Physiological description of multivariate interdependencies between process parameters, morphology and physiology during fed-batch penicillin production. Biotechnol Prog. 2014;30:689–99. https://doi.org/10.1002/btpr.1901.
Luoma P, Golabgir A, Brandstetter M, et al. Workflow for multi-analyte bioprocess monitoring demonstrated on inline NIR spectroscopy of P. chrysogenum fermentation. Anal Bioanal Chem. 2017;409:797–805. https://doi.org/10.1007/s00216-016-9918-9.
Burns DA, Ciurczak EW. Handbook of near-infrared analysis: CRC Press; 2008.
Luoma P, Natschläger T, Malli B, et al. Additive partial least squares for efficient modelling of independent variance sources demonstrated on practical case studies. Anal Chim Acta. 2018;1007:10–5.
Nikzad-Langerodi R, Zellinger W, Lughofer E, Saminger-Platz S. Domain-invariant partial least squares regression. Anal Chem. 2018;90:6693–701. https://doi.org/10.1021/acs.analchem.8b00498.
Financial support was provided by the Austrian Research Promotion Agency (FFG) under the scope of the COMET programme within the research project Photonic Sensing for Smarter Processes (PSSP) (contract no. 871974) and the Austrian Competence Centre for Feed and Food Quality, Safety and Innovation (FFoQSI), funded by the Austrian ministries BMVIT, BMDW and the Austrian provinces Niederoesterreich, Styria, Upper Austria, and Vienna.
This article does not contain any studies with human or animal subjects performed by any of the contributing authors.
Conflict of interest
The authors declare that they have no conflict of interest.
Published in the topical collection Advances in Process Analytics and Control Technology with guest editor Christoph Herwig.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Zimmerleiter, R., Kager, J., Nikzad-Langerodi, R. et al. Probeless non-invasive near-infrared spectroscopic bioprocess monitoring using microspectrometer technology. Anal Bioanal Chem 412, 2103–2109 (2020). https://doi.org/10.1007/s00216-019-02227-w