Design of experiment (DOE) applied to artificial neural network architecture enables rapid bioprocess improvement

Rodriguez-Granrose, Daniel; Jones, Amanda; Loftus, Hannah; Tandeski, Terry; Heaton, Will; Foley, Kevin T.; Silverman, Lara

doi:10.1007/s00449-021-02529-3

Design of experiment (DOE) applied to artificial neural network architecture enables rapid bioprocess improvement

Research Paper
Open access
Published: 27 February 2021

Volume 44, pages 1301–1308, (2021)
Cite this article

Download PDF

You have full access to this open access article

Bioprocess and Biosystems Engineering Aims and scope Submit manuscript

Design of experiment (DOE) applied to artificial neural network architecture enables rapid bioprocess improvement

Download PDF

Daniel Rodriguez-Granrose^1,2,
Amanda Jones¹,
Hannah Loftus¹,
Terry Tandeski¹,
Will Heaton¹,
Kevin T. Foley^1,3,4 &
…
Lara Silverman^1,3

8229 Accesses
25 Citations
2 Altmetric
Explore all metrics

Abstract

Modern bioprocess development employs statistically optimized design of experiments (DOE) and regression modeling to find optimal bioprocess set points. Using modeling software, such as JMP Pro, it is possible to leverage artificial neural networks (ANNs) to improve model accuracy beyond the capabilities of regression models. Herein, we bridge the gap between a DOE skill set and a machine learning skill set by demonstrating a novel use of DOE to systematically create and evaluate ANN architecture using JMP Pro software. Additionally, we run a mammalian cell culture process at historical, one factor at a time, standard least squares regression, and ANN-derived set points. This case study demonstrates the significant differences between one factor at a time bioprocess development, DOE bioprocess development and the relative power of linear regression versus an ANN-DOE hybrid modeling approach.

Hybrid modeling as a QbD/PAT tool in process development: an industrial E. coli case study

Article Open access 15 February 2016

Designing a Model-Driven Approach Towards Rational Experimental Design in Bioprocess Optimization

Machine learning: an advancement in biochemical engineering

Article 21 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Due to multivariate datasets, sophisticated data interpretation is increasingly attractive to bioprocess professionals [1,2,3]. Traditional regression techniques have limitations, including the shapes that they can use to model data, poor extrapolation properties, and sensitivity to outliers [4]. Modeling with artificial neural networks (ANN) overcomes many of these shortcomings by implementing many smaller models to interpret sections of data [5].

An ANN is an algorithm which finds relationships by performing discrete computations in artificial “neurons” and fitting these computations into a larger model [6]. Each neuron in an ANN models data using a distinct activation function [6,7,8]. ANN can be optimized by changing the number of neurons and type of activation functions [5, 7, 9]. Although previous bioprocess ANN approaches have shown success, they primarily rely on complex machine learning techniques and frequently require coding experience to implement [2, 8, 10,11,12].

Design of experiments (DOE) is a method of employing mathematics to generate optimal experimental conditions [13,14,15]. DOE serves two distinct roles in our project. The first is to establish experimental conditions which test four bioprocess inputs for their ability to maximize cell proliferation. We then model this bioprocess dataset using standard least squares (SLS) regression and ANN [16]. DOE’s second use in our project is to evaluate neuron activation functions and quantities in each layer of the ANN. By creating a DOE which optimizes an ANN using neuron numbers and activation functions as inputs and model outputs as desirability functions, we can systematically explore relationships to find the optimal ANN architecture [17, 18]. DOE in this context replaces traditional machine training activities with directed, mathematically optimized, exploration of inputs and outputs. Therefore, the ANN-DOE approach ensures appropriate and efficient leverage of ANN to maximize cell proliferation. Further background on the terms and techniques used in this manuscript can be found in Online Resource 1.

A well modeled bioprocess shows us which combination of inputs results in optimal outputs. To test the various modeling approaches, we utilize a bioprocess unit operation related to the manufacture of an allogeneic cell therapy for low back pain, which is currently in clinical evaluation (Clinicaltrials.gov NCT03347708 and NCT03955315) [19]. In this case study, we test how the cell seeding density, media supplement percentage, media exchange volume during routine feeding, and cell line maximize cell doublings.

In addition to modeling our bioprocess dataset with SLS regression and ANN, we test a historical bioprocess set point optimized with one factor at a time (OFAT) experimentation. Growing cells at each set point allows us to directly compare the relative merits of OFAT and DOE process optimization, as well as the value of modeling a bioprocess dataset with SLS regression or ANN-DOE hybrid modeling. We hypothesize that applying DOE to ANN architecture will result in an optimized network which we can use to model our bioprocess and improve cell doublings compared to a linear regression, OFAT-derived set point, or our historical process set point. If successful, this approach will improve the process with optimized set points and provide a valuable tool with which to assess future bioprocess operations.

Materials and methods

All analyses and modeling were conducted using JMP Pro v. 14.0 on a 2019 MacBook Pro running Mojave 10.14.6.

Establishing a DOE bioprocess dataset

Intervertebral disc material was obtained from two recently deceased donors under IRB approval and transported to the lab in Hypothermasol (Biolife Solutions) containing gentamycin (Mediatech) and amphotericin B (Mediatech). Each donor was processed independently into a distinct cell line. Nucleus pulposus tissue was dissected from the intervertebral discs using scalpels and tweezers. Cells were isolated from nucleus pulposus tissue using NB5 collagenase (Nordmark). Isolated cells were expanded in vented cap T150 attachment culture flasks (Corning) in the presence of DMEM/F12 (Corning), amphotericin B (Mediatech), gentamycin (Mediatech), and a cocktail of other proprietary media supplements. Temperature and pH were passively controlled by growing cells in a 37 °C, 5% CO₂, incubator. At confluency, cells were dissociated from the flask using TrypLE (Thermo Fisher Scientific), formulated in 90% Characterized Fetal Bovine Serum (Hyclone), 10% DMSO (Protide Pharmaceuticals), and cryopreserved at − 196 °C for subsequent use.

At the time of testing, cells were thawed at 37 °C, washed with phosphate-buffed saline (pH 7.2; Thermo Fisher Scientific), and counted using a K2 automated cell counter (Nexcellom Bioscience). Cells were then passaged onto T25 attachment culture flasks (Corning) and grown under the same conditions as above except in the case of DOE parameters. Following 7 days of growth, cells were dissociated from the flask and counted using the K2 automated cell counter. Doublings were calculated using the formula: doublings = 3.32[log (total viable cells at harvest/total viable cells at seed)]. Microscopic images of the attached cells were obtained immediately prior to harvest.

Twenty-four unique conditions were derived from a D-optimal DOE. DOE input parameters were cell line, seeding density, media supplement percentage, and media exchange percentage. Each input parameter was investigated from the lowest (− 1) to highest (+ 1) points of our historically investigated ranges except for cell line, which accounts for two unique allogeneic cell lineages. All primary interactions, 2nd level interactions, and 2nd level powers were given necessary estimability in the DOE dialog with the response output as doublings. Finally, an SLS regression model of the bioprocess dataset was created.

Analysis of bioprocess using DOE of neural network architecture.

In our ANN-DOE hybrid, 32 feedforward neural networks were constructed. Each ANN model in this exercise models the ability of cell line, seeding density, media supplement percentage, and media exchange percentage from the bioprocess dataset to predict doublings. The architecture of these 32 neural networks was chosen by running a D-optimal DOE with ANN node number and activation functions as input parameters (Table 1) and ANN output quality as response functions (Table 2). For ANN-DOE inputs, up to 100 linear, 100 TanH (sigmoid) and 100 Gaussian (bell curve) activation functions were evaluated at two levels each. ANN quality was evaluated using the coefficient of determination (R²) and Standard Square Error (SSE) for the training model as well as the difference between R² and SSE between the training and validation datasets.

Table 1 Neural network DOE input levels

Full size table

Table 2 Neural Network DOE response functions

Full size table

All primary terms, 2nd level interaction terms, and 2nd level powers estimability were set to necessary. Five random starts were employed for each ANN. The bioprocess dataset was randomly split into a training (n = 16) and validation (n = 8) dataset. The training and validation datasets were maintained for all models. To find our optimal ANN architecture, we made an SLS regression model of the ANN-DOE. A new ANN of the bioprocess unit operation was created using the maximally desirable ANN architecture.

The doublings predicted by our training set versus the actual doublings observed in our validation set were compared for goodness of fit in the JMP Pro 14 Compare Model dialog. If successful, our optimized ANN should have a higher R² and lower average absolute error (AAE) than a linear regression model or the non-optimized neural networks, when measuring the goodness fit for doublings.

In vitro model qualification

Cells were grown in triplicate using the optimal seeding density, media supplement percentage, cell line, and media exchange percentage as defined by the following models: the SLS regression theorized optimum, the ANN theorized optimum, our OFAT set point, and a historical set point. Average doublings and standard deviation (SD) for each condition were calculated. A successful run will have a statistically improved doublings compared to our OFAT or historical set point, as measured by an LSMeans Differences Student’s T test at an α of 0.050.

Results

Model creation and evaluation

This bioprocess DOE was evaluated using 32 unoptimized ANN (Fig. 1a). The 32 unoptimized ANNs were modeled using SLS regression. Parameters with an effects test p > 0.05 were removed from the regression model. Desirability of R² Training was maximized, whereas R² Delta, SSE-Training, and SSE-Delta response values were minimized (Fig. 1b). By providing equal weight to all four model output functions, the maximum desirable ANN was determined to be a 1-layer, 91 Gaussian ANN (Fig. 1c). All ANN response values were recorded (Online Resource 2) and compared (Fig. 2) (Online Resource 3):

R² Training represents our ANN ability to model the training dataset. Our DOE-ANN hybrid (R² Training = 0.97) outperformed the mean unoptimized ANN (Mean R² Training = 0.92, SD 0.08). The improvement suggests that our modeling approach improved our ability to model our training dataset.

The R² Delta values represent the difference between R² values in our training and validation data sets. The DOE-ANN model had the lowest R² Delta value (R² Delta 0.01) of all ANN tested (Mean R² Delta 0.06, SD 0.4). Reduction in our R² Delta value suggests that our ANN has an improved ability to model true process relationships, rather than overfitting or underfitting our data. This optimized R² Delta value is more important than the R² Training value because it is an indication of consistency between training and validation data sets.

SSE-Training represents the error in our training model calculated as the sum of squares for error. The DOE-ANN model (SSE-Training = 1.69) outperformed the mean (SSE-Training = 5.24, SD 5.70) of the unoptimized models. The improvement of SSE-Training in our optimized model suggests a reduced rate of error in our training dataset due to our DOE-ANN modeling approach.

SSE-Delta represents the difference between SSE values in our training and validation data sets. The optimized ANN-DOE (SSE-Delta = 0.83) outperformed the mean SSE-Delta from the 32 models (Mean SSE-Delta = 4.37, SD 3.82). The improvement in R² Training alongside the simultaneous reduction in SSE-Training, R² Delta, and SSE-Delta suggests that we improved the ANN modeling capacity while simultaneously reducing error compared to 32 unoptimized models with a wide range of model architectures.

In addition to direct comparison of the ANN-DOE response functions, models were also compared using the model comparison dialog in JMP Pro v 14.0 (Fig. 3). Specifically, the doublings predicted by our training set versus the actual doublings observed in our validation set were compared for goodness of fit using the 33 ANNs (32 from the ANN-DOE, plus one optimized ANN) and an SLS regression model of the bioprocess dataset. There are three key model comparability results. First, compared to the SLS regression model (R² = 0.95, AAE 0.39), some of the 32 unoptimized ANNs created have lower R² and higher error (R² = 0.81, AAE = 0.76), whereas some have higher R² and lower errors (R² = 0.98, AAE = 0.13). This underperformance by some but not all of the 32 unoptimized ANNs shows that not every ANN is capable of improving SLS regression modeling capability. Second, the optimized ANN (R² = 0.99, AAE of 0.10) had higher R² and lower AAE than any of the 32 unoptimized models. This suggests that our ANN-DOE hybrid approach converged on a model with better modeling power and reduced error more than was possible by chance using 32 ANN with a wide range of architecture. Third, the optimized ANN outperformed the SLS regression model in both R² (0.04 improvement) and AAE values (0.29 improvement). Together, these three key results show that not all ANNs outperform SLS regression and that our ANN is optimized to fit the validation dataset with more power and lower error than any of the other models generated, making it an optimal model for our bioprocess.

In vitro model qualification

To test the real-world applicability of our in silico models, we tested the bioprocess set points in vitro. SLS regression and a 1-layer, 91 Gaussian ANN were each used to model the bioprocess DOE. When comparing the SLS regression optimized desirability set point versus the 91-Gaussian neural network optimized set point, the cell line and seeding density were calculated to be the same. However, the ANN determined that a larger percentage of the media supplement and a lower media exchange percentage could be beneficial when compared to the SLS regression optimum. The coded values for each condition are shown in Table 3 where −1 represents the lower end of the investigated range, 1 represents the high end of the investigated range, and 0 represents the center point.

Table 3 Process Setpoints and Theorized Optimum by Model

Full size table

Flasks were grown in triplicate using the bioprocess DOE set points, historical set point, OFAT set point, Regression Setpoint, and 91-Gaussian Setpoint. After harvesting all flasks, doublings were calculated (Fig. 4a). Visually all cells exhibit the expected elongated fibroblast-like morphology (Fig. 4b). Flasks grown using the historical set point and ANN are the most confluent. However, as predicted, the historical set point resulted in lower doublings.

The flasks grown using OFAT optimization performed significantly better (4.86 doublings, SD 0.15) than flasks grown at our historical set point (3.14 doublings, SD 0.18). However, many of the flasks grown during our DOE runs outperformed the OFAT optimization flasks. This outcome demonstrates that there are still significant gains to be made by looking at multiple variables in a DOE fashion.

The flasks grown using the SLS regression modeling (6.19 doublings, SD 0.49) significantly outperformed those optimized with OFAT experimentation. Further, of the three flasks grown using our SLS modeled set points, two outperformed all DOE runs. This performance shows that we found a true process relationship which allows us to significantly and reliably improve cell growth compared to our OFAT experimentation. However, the SLS model and ANN model did not agree on optimal set points.

All three flasks grown in ANN theorized optimum (6.91 doublings, SD 0.21) outperform all 33 other conditions. This demonstrates that our optimized ANN modeled our bioprocess better than our SLS regression model. The performance of the ANN suggests that a process optimum was found and modeled with fidelity which we were unable to be capture with a regression line. The ANN optimum also outperformed all DOE runs, some of which had more frequent media exchanges, and higher percentages of media supplement. Thus, using the ANN model allows for improved growth with reduction in media which will save on resources.

Finally, the 12 in vitro model qualification flasks were modeled with SLS regression and evaluated with post hoc tests. The regression model exhibited significance as measured by ANOVA (p < 0.01, R² = 0.97) (Online Resource 4). Comparisons of least squares mean show that the ANN set point increased doublings by 0.69, 2.05, and 3.77 over the SLS setpoint, OFAT setpoint, and historical setpoint, respectively. Further, there were statistically significant differences between all four experimental groups as measured by Least Squares Means Differences Students Tukey at α = 0.05 (Fig. 4c). The statistically significant differences in doublings between all four groups show that the different models do grant different levels of process understanding, resulting in different abilities to optimize cell growth. The 11.6% improvement in cell doublings between the ANN- and SLS-derived setpoints suggests that a process optimum was found and modeled with fidelity which we were unable to be capture with a regression line. The ANN also outperformed all DOE runs, some of which had more frequent media exchanges, and higher percentages of media supplement. Thus, using the ANN model allowed for improved growth with reduction in media which will save on resources.

Discussion

Improvement of process understanding without the need for laboratory experiments is inherently attractive to the bioprocess professional. In silico, our optimized ANN performed well for our quality outputs R² Training, SSE Training, R² Fit and SSE Fit. These high-quality outputs indicate that our DOE approach found an improved ANN configuration compared to 32 unoptimized ANN. Further, our ANN demonstrated better fit for doublings with lower error than all 32 non-optimized ANN and the SLS model. This improvement in model fit suggests that our ANN-DOE approach models true process relationships that were not previously captured with SLS regression or unoptimized ANN models.

In vitro, we exhibited differences between each method of bioprocess development. The ANN-DOE showed 11.2% improvement in doublings over SLS regression, and 42.2% improvement over OFAT experimentation. The comparison of development pathways shows how a DOE dataset can significantly improve a process compared to OFAT experimentation regardless of whether ANN or SLS regression is used to model the data set. However, using the ANN-DOE, we found a process set point which significantly improved bioprocess outcomes compared to the already capable linear regression model.

This manuscript describes four tiers of bioprocess development efficiency, tests them rigorously with in silico and in vivo methodologies, and demonstrates statistically significant differences between each process outcome. Between the improved modeling capability in silico and significant increase in process doublings in vitro, we can conclude that our ANN-DOE hybrid approach to process development efficiently leveraged ANN to improve bioprocess outputs beyond the capabilities of an SLS regression model. The improvement of 0.69 doublings over SLS regression, and improvement of 2.05 doublings over OFAT experimentation indicates that the approach should be tested on other bioprocess operations.

The value of the ANN-DOE approach, beyond achieving an improved process model, is that it bypasses the complex training and validation exercises generally used in machine learning in favor of evaluating at the network architecture using DOE techniques already familiar to many bioprocess professionals. Modeling of an established bioprocess dataset in this manner is relatively quick and inexpensive to conduct.

Using DOE to evaluate machine learning models, a single scientist can use a laptop to improve the process set points using only a few hours of work and somewhere between a day to a week of computing power, depending on the complexity of the model and power of the computer. It is likely that the greatest value of ANN modeling will come not from evaluation of development studies but from modeling perturbations of established bioprocess manufacturing operations. However, given that ANNs are notoriously unreliable [5], we recommend always verifying manufacturing set points using a small-scale model before making drastic changes in the manufacturing process.

Despite its preliminary success, the combination of DOE with ANN could be improved upon in several ways. Increasing the size of the bioprocess dataset or augmenting our ANN-DOE hybrid model using standard DOE augmentation techniques would result in higher fidelity models. Only 3 neuron types are discussed here, but the DOE principals herein should be applicable for any neuron type and for many deep learning problems. Although either a traditionally boosted ANN or a DOE derived ANN is sufficient to model a process, further investigation into the harmonization of the two approaches would be beneficial.

With the immense value derived from process improvements for pharmaceuticals, and the rapid turnaround time of the ANN-DOE Hybrid approach, ANNs can provide an incredible investment to reward ratio. Beyond its application in the bioprocess field, we believe the principals herein may offer a robust and thoughtful way to explore the creation of ANN architecture across disciplines.

Data availability

Additional Data in Supplemental Materials.

Code availability

Not applicable.

References

Rathore A, Singh S (2015) Use of multivariate data analysis in bioprocessing. BioPharm Internat 28:26–78
Google Scholar
Jörg Schubert RS, Dors M, Havlik I, Lübbert A (1994) Bioprocess optimization and control: application of hybrid modelling. J Biotechnol 35(1):51–68. https://doi.org/10.1016/0168-1656(94)90189-9
Article Google Scholar
Ignova M, Glassey J, Ward AC, Montague GA (1997) Multivariate statistical methods in bioprocess fault detection and performance forecasting. Trans Instit Measure Control 19(5):271–279. https://doi.org/10.1177/014233129701900507
Article Google Scholar
Guthrie WF (2020) NIST/SEMATECH e-Handbook of Statistical Methods (NIST Handbook 151). Nat Instit Standards Technol 1:3. https://doi.org/10.18434/M32189
Article Google Scholar
Baughman DR, Liu YA (1995) Neural networks in bioprocessing and chemical engineering. Academic press, NY
Google Scholar
Rosenblatt F (1958) The perceptron: a probabilistic model for information storage and organization in the brain. Psychol Rev 65(6):386–408. https://doi.org/10.1037/h0042519
Article CAS PubMed Google Scholar
Olgac A, Karlik B (2011) Performance analysis of various activation functions in generalized MLP architectures of neural networks. Internat J Artif Intell Expert Syst 1:111–122
Google Scholar
Baharin A, Abdullah A, Yousoff SNM (2017) Prediction of bioprocess production using deep neural network method. Telkomnika (Telecommun Comput Elect Control) 15:805–813
Article Google Scholar
JMP® 14 (2018) Predictive and specialized modeling. SAS Institute Inc., Cary
Google Scholar
Vlassides S, Ferrier JG, Block DE (2001) Using historical data for bioprocess optimization: modeling wine characteristics using artificial neural networks and archived process information. Biotechnol Bioeng 73(1):55–68
Article CAS Google Scholar
Vaněk M, Hrnčiřík P, Vovsík J, Náhlík J (2004) On-line estimation of biomass concentration using a neural network and information about metabolic state. Bioprocess Biosyst Eng 27(1):9–15. https://doi.org/10.1007/s00449-004-0371-3
Article CAS PubMed Google Scholar
Steyer JP, Pelayo-Ortiz C, González-Alvarez V, Bonnet B, Bories A (2000) Neural network modelling of a depollution process. Bioprocess Eng 23(6):727–730. https://doi.org/10.1007/s004490070001
Article CAS Google Scholar
Weissman SA, Anderson NG (2015) Design of experiments (DoE) and process optimization a review of recent publications. Organic Process Res Dev 19(11):1605–1633. https://doi.org/10.1021/op500169m
Article CAS Google Scholar
Montgomery DC (2020) Design and analysis of experiments. Wiley, Hoboken, NJ
Google Scholar
Mandenius C-F, Brundin A (2008) Bioprocess optimization using design-of-experiments methodology. Biotechnol Prog 24(6):1191–1203. https://doi.org/10.1002/btpr.67
Article CAS PubMed Google Scholar
Björck Å (1996) Numerical methods for least squares problems. Other titles in applied mathematics. Soc Indust Appl Mathe 10(1137/1):9781611971484. https://doi.org/10.1137/1.9781611971484
Article Google Scholar
Lasheras FS, ViláN JAV, Nieto PJG, DíAz JJDC (2010) The use of design of experiments to improve a neural network model in order to predict the thickness of the chromium layer in a hard chromium plating process. Math Comput Model 52(7–8):1169–1176. https://doi.org/10.1016/j.mcm.2010.03.007
Article Google Scholar
Balestrassi P, Popova E, Paiva A, Lima J (2009) Design of experiments on neural network’s training for nonlinear time series forecasting. Neurocomputing 72:1160–1178. https://doi.org/10.1016/j.neucom.2008.02.002
Article Google Scholar
Silverman LI, Dulatova G, Tandeski T, Erickson IE, Lundell B, Toplon D, Wolff T, Howard A, Chintalacharuvu S, Foley KT (2020) In vitro and in vivo evaluation of discogenic cells, an investigational cell therapy for disc degeneration. Spine J 20(1):138–149. https://doi.org/10.1016/j.spinee.2019.08.006
Article PubMed Google Scholar

Download references

Acknowledgements

The studies in this manuscript are privately funded by DiscGenics Inc. This material is also based in part upon work supported by the National Science Foundation Graduate Research Fellowship under Grant No. 1451511. Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Funding

This work was supported by DiscGenics Inc. This material is also based in part upon work supported by the National Science Foundation Graduate Research Fellowship under Grant No. 1451511. Any opinion, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Author information

Authors and Affiliations

DiscGenics Inc, Salt Lake City, Utah, USA
Daniel Rodriguez-Granrose, Amanda Jones, Hannah Loftus, Terry Tandeski, Will Heaton, Kevin T. Foley & Lara Silverman
Department of Biochemistry and Molecular Biology, University of Miami, Miami, FL, USA
Daniel Rodriguez-Granrose
Department of Neurosurgery, University of Tennessee Health Science Center, Memphis, TN, USA
Kevin T. Foley & Lara Silverman
Semmes-Murphey Clinic, Memphis, TN, USA
Kevin T. Foley

Authors

Daniel Rodriguez-Granrose
View author publications
You can also search for this author in PubMed Google Scholar
Amanda Jones
View author publications
You can also search for this author in PubMed Google Scholar
Hannah Loftus
View author publications
You can also search for this author in PubMed Google Scholar
Terry Tandeski
View author publications
You can also search for this author in PubMed Google Scholar
Will Heaton
View author publications
You can also search for this author in PubMed Google Scholar
Kevin T. Foley
View author publications
You can also search for this author in PubMed Google Scholar
Lara Silverman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: DR-G; Methodology: DR-G; Formal analysis and investigation: DR-G; Lab Work: HL, AJ, TT, WH, DR-G; Writing—original draft preparation: DR-G; Writing—review and editing: LS, DR-G, HL, AJ, TT, WH, KF; NSF GRFP Funding: DR-G, Supervision: LS.

Corresponding author

Correspondence to Daniel Rodriguez-Granrose.

Ethics declarations

Conflicts of interest

D Rodriguez-Granrose, H Loftus, A Jones, T Tandeski, W Heaton, K Foley and L Silverman are employees at DiscGenics. D Rodriguez-Granrose, H Loftus, A Jones, T Tandeski, W Heaton, K Foley and L Silverman own stock or stock options in DiscGenics. K Foley is on the board of DiscGenics.

Ethics approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 560 KB)

Supplementary file2 (PDF 148 KB)

Supplementary file3 (PDF 118 KB)

Supplementary file4 (PDF 206 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rodriguez-Granrose, D., Jones, A., Loftus, H. et al. Design of experiment (DOE) applied to artificial neural network architecture enables rapid bioprocess improvement. Bioprocess Biosyst Eng 44, 1301–1308 (2021). https://doi.org/10.1007/s00449-021-02529-3

Download citation

Received: 11 September 2020
Accepted: 04 February 2021
Published: 27 February 2021
Issue Date: June 2021
DOI: https://doi.org/10.1007/s00449-021-02529-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Design of experiment (DOE) applied to artificial neural network architecture enables rapid bioprocess improvement

Abstract

Similar content being viewed by others

Hybrid modeling as a QbD/PAT tool in process development: an industrial E. coli case study

Designing a Model-Driven Approach Towards Rational Experimental Design in Bioprocess Optimization

Machine learning: an advancement in biochemical engineering

Introduction