Rheology-Informed Neural Networks (RhINNs) for forward and inverse metamodelling of complex fluids

Mahmoudabadbozchelou, Mohammadamin; Jamali, Safa

doi:10.1038/s41598-021-91518-3

Rheology-Informed Neural Networks (RhINNs) for forward and inverse metamodelling of complex fluids

Article
Open access
Published: 08 June 2021

Volume 11, article number 12015, (2021)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Rheology-Informed Neural Networks (RhINNs) for forward and inverse metamodelling of complex fluids

Download PDF

Mohammadamin Mahmoudabadbozchelou¹ &
Safa Jamali¹

5753 Accesses
30 Citations
4 Altmetric
Explore all metrics

Abstract

Reliable and accurate prediction of complex fluids’ response under flow is of great interest across many disciplines, from biological systems to virtually all soft materials. The challenge is to solve non-trivial time and rate dependent constitutive equations to describe these structured fluids under various flow protocols. We present Rheology-Informed Neural Networks (RhINNs) for solving systems of Ordinary Differential Equations (ODEs) adopted for complex fluids. The proposed RhINNs are employed to solve the constitutive models with multiple ODEs by benefiting from Automatic Differentiation in neural networks. In a direct solution, the RhINNs platform accurately predicts the fully resolved solution of constitutive equations for a Thixotropic-Elasto-Visco-Plastic (TEVP) complex fluid for a series of flow protocols. From a practical perspective, an exhaustive list of experiments are required to identify model parameters for a multi-variant constitutive TEVP model. RhINNs are found to learn these non-trivial model parameters for a complex material using a single flow protocol, enabling accurate modeling with limited number of experiments and at an unprecedented rate. We also show the RhINNs are not limited to a specific model and can be extended to include various models and recover complex manifestations of kinematic heterogeneities and transient shear banding of thixotropic fluids.

Data-driven constitutive model of complex fluids using recurrent neural networks

Article 02 August 2023

Data-driven selection of constitutive models via rheology-informed neural networks (RhINNs)

Article 03 August 2022

Constitutive Models of Complex Fluids

Introduction

Complex fluids are a broad class of materials, in which the macroscopic response of the fluid to an applied deformation or load is determined by the state of microstructure. In contrast to conventional fluid mechanics problems, where the viscosity of the fluid remains constant, the material functions of the complex fluids depend on the rate and time of applied deformation^{1,2,3,4,5,6,7,8,9,10}. To predict these complex fluids’ behavior under flowing conditions, it is indispensable to present closed-form constitutive equations that correlate the microstructural and kinematic variables of the material to the state of stress. Efforts in developing such constitutive equations are thus as old as the science of rheology itself^11,12,13. The constitutive models of choice become more intricate, as the fluid’s response to a deformation becomes rate or time dependent, leading to an inevitable increase in the number of model parameters. Hence, more experimental protocols are needed to determine these parameters and to describe the system under question. Nonetheless, even constitutive equations with several model parameters commonly fail to capture the rheology of a complex system subject to a series of different flow protocols.

Complex fluids often exhibit a time-dependent stress response under flow owing to their inherent viscoelastic and/or thixotropic timescales^14,15,16. Thixotropy observed in many complex fluids generally manifests in the sensitivity of the viscosity to the history of the applied strain rate^17,18,19. Thixotropic effects originate from evolution of the material’s microstructure as a result of the interplay between shearing forces exerted by the flow and the natural structure formation^20,21,22. Thus, in thixotropic constitutive equations, one will critically need to solve for the time evolution of a structure parameter under flow. On the other hand, the local shear stress/rate that the material experiences determines the rate of structure break-up under flow. Hence, detailed multi-component constitutive models that fully capture different rate and time dependent phenomena commonly involve systems of coupled differential equations. Many constitutive models have been proposed to recover thixotropic response of a complex fluid^{23,24,25,26,27}. For an ideal thixtropic fluid, also referred to as Thixo-Visco-Plastic (TVP) fluid, the shear stress depends on the structure parameter, $\lambda $, which itself evolves with time as shown in Eq. (1)²³. In this equation, $\sigma _y$ is the yield stress, $\eta _s$ and $\eta _p$ are background and plastic viscosities, ${\dot{\gamma }}$ is the applied deformation rate, and $k_+$ and $k_-$ are the build-up and breakage coefficients of the structure parameter.

$$\begin{aligned} \begin{aligned} {\left\{ \begin{array}{ll} \sigma (t)&{}= \sigma _y \lambda (t) + (\eta _s + \eta _p \lambda (t)){\dot{\gamma }}(t) \\ {\dot{\lambda }}(t) &{}= k_+ (1 - \lambda (t)) - k_- \lambda (t) {\dot{\gamma }} (t) \end{array}\right. } \end{aligned} \end{aligned}$$

(1)

Colloidal gels commonly show thixotropic, static and dynamic yielding, rate-dependent shear thinning, and elsatic response under different flow protocols and are referred to as Thixotropic Elasto-Visco-Plastic (TEVP) fluids^{3,28,29,30,31}. Thus, in addition to TVP model parameters, TEVP constitutive equations include the elastic modulus (G) of the fluid as well. A TEVP model is shown in Eq. (2), including six different model parameters.

$$\begin{aligned} \begin{aligned} {\left\{ \begin{array}{ll} {\dot{\sigma }}(t) &{}= \frac{G}{\eta _s + \eta _p}[-\sigma (t) +\sigma _y \lambda (t)+ (\eta _s + \eta _p \lambda (t)){\dot{\gamma }}(t)] \\ {\dot{\lambda }}(t) &{}= k_+ (1 - \lambda (t)) - k_- \lambda (t) {\dot{\gamma }}(t) \end{array}\right. } \end{aligned} \end{aligned}$$

(2)

While the model presented in Eq. (2) recovers a number of rheological features of TEVP fluids, in order to fully capture the response of the fluid to a Large Amplitude Oscillatory Shear (LAOS) flow protocol, a more sophisticated plastic component has to be considered. Iso-Kinematic Hardening (IKH) model^4,32 decouples the applied shear rate into plastic and viscoelastic contributions and introduces the back strain in order to account for the evolving microstructure from one cycle to next in oscillatory flows. This leads to a complex constitutive equation that predictably captures wide range of material behavior with different flow protocols. The general description of IKH model is shown as Eq. (3). The function f(.) is determined based on the viscoelastic model of choice that leads to acquisition of various models consisting of 9-15 parameters that are inevitably challenging to be determined. In this set of coupled ODEs, A is the back strain, m and q are the material constants, and ${\dot{\gamma }}_p$ is the plastic component of the applied shear rate.

$$\begin{aligned} \begin{aligned} {\left\{ \begin{array}{ll} {\dot{\sigma }}(t) &{}= f(\sigma ,C,A(t),\lambda (t),{\dot{\gamma }}(t)) \\ {\dot{A}}(t) &{}= \dot{\gamma _p} - (q|A(t)|)^m sign(A(t))|\dot{\gamma _p}(t)| \\ {\dot{\lambda }}(t) &{}= k_+ (1 - \lambda (t)) - k_- \lambda (t) |\dot{\gamma _p}(t)| \end{array}\right. } \end{aligned} \end{aligned}$$

(3)

As is clearly evident in Eq. (3), the number of parameters required to fully capture the rheological response of a complex fluid to an applied deformation increases very rapidly and eventually becomes computationally prohibitive. Moreover, these parameters are not necessarily based on a physical merit, and are often challenging to fit. Thus, an exhaustive list of experimental protocols are usually taken to fully parametrize a given model for a specific system. Even then, the emergence of multiple length and time scales due to structure break-up/formation, non-ideal behavior of the material under investigation, experimental artifacts, and many more delicate details can lead to erroneous predictions. This is even more evident when real-life and industrial complex fluids of interest that contain multiple components are considered. Thus, numerical platforms that reduce the computational complexity of implementing a fully resolved constitutive model, or decrease the number of experiments required to identify a system’s model parameters are of great interest.

Over the past few years, Machine Learning (ML) algorithms have found their way in all avenues of science and engineering. With an ever-increasing computational power and the ability to process large data sets, data-driven models have become indisputable and powerful tools. With a limited number of studies utilizing ML algorithms^33,34,35,36, the field of soft matter and more specifically rheology is lagging behind in leveraging such advanced methodologies. This is partially due to the ambiguous consequences of the produced meta-models and their adherence to the fundamental underlying physics. However, these issues would be effectively attenuated by executing the appropriate type of ML approach.

Traditional ML algorithms, regardless of their type, depend on abundance of data to be accurately predictive. This means it is absolutely essential to train the considered ML algorithms on extremely large enough data set. Moreover, most of ML algorithms are suitable for interpolation [when they are trained on a sufficiently large data sets], and are often incapable of out-of-range predictions (extrapolation). Recent physics-based ML algorithms not only include the physical governing equations of choice, but also diminish the need for big data sets. The groundbreaking work of Raissi et al.³⁷ on “Physics-Informed Neural Network” (PINN) paved the way for physics-based ML algorithms to address these issues. The central concept is to directly add physical governing equations to the neural network (NN) framework to achieve a meaningful meta-model. By incorporating the governing physical laws, and constraining the NN framework to adhere to these physical laws, the need for large training data sets can also be eliminated. It is worth mentioning that the scope of this work is limited to problems in which the constitutive model describing the material of choice is known. In the case with unknown governing laws, the pathway to embedding the physical laws into the training process has to change accordingly. One such method would be to introduce the physical intuition to the NN implicitly and by means of physics-based synthetic data, generated from constitutive laws³³.

In this study, we present Rheology-Informed Neural Networks (RhINNs) for direct and inverse solution of complex rheological constitutive models. In a direct solution, RhINNs are employed as an alternative platform for solving systems of Ordinary Differential Equations (ODEs) on predicting the rheology of complex fluids. In the inverse solution however, RhINNs are used to learn the hidden rheology of complex fluids with only a handful of data sets. To this end, we first describe the meta modeling approach using NNs in the form of RhINNs. Thereupon, results are presented for both direct and inverse problems, followed by concluding remarks and outlooks.

Problem setup and methodology

Neural Networks are a sub-class of supervised ML algorithms³⁸, consisting of many interconnected processing elements called neurons. Neurons process and predict data by creating a computational structured framework where the complex relations between the inputs and outputs is revealed as a function. Each NN consists of three main layers: input layer, output layer, and several hidden layers. Each of these hidden layer contains several neurons, and each neuron has an specific weight and bias. These networks learn to minimize their deviations from the actual data by adjusting the weights and biases between different neurons and layers within the structure of the network. In other words, the weights and biases of the neurons are changed continuously to generate an emend response when new inputs are provided. NNs generate meta-models based on these correlations in statistical variations of complex systems. In a purely statistical method, the training process for the NN is agnostic to the physical governing equations. Here however, we directly solve for nonlinear problem without any prior assumptions, linearization, or local time-stepping. We benefit from recent developments in automatic differentiation³⁹ to differentiate the NN with respect to its input coordinates and model parameters. In other words, we include the physical laws explicitly into the NN architecture. Figure 1 shows a schematic description of RhINNs. For visual purposes, Fig. 1 contains a NN with only three hidden layers and four neurons per layer, with 2 input parameters as time (t) and shear rate $({\dot{\gamma }})$, and 2 output parameters as shear stress $(\sigma )$ and structure parameter $(\lambda )$. We should mention that in the definition of $f_i()$, shear rate as ${\dot{\gamma }}$ plays an important role, since it reflects on the kinematics of the imposed flow protocol. Hence, including such function is a necessity either implicitly (as a function in the physical governing law) or explicitly (as one of the inputs of RhINNs). We chose to go with the latter, since we are offering a more generic framework without affecting the predicted results. We performed a comprehensive analysis to determine the effects of number of hidden layers and number of neurons in each hidden layer on the performance of our proposed RhINNs, which are presented in Appendix B.

In a data driven solution framework, the solution of the constitutive equation of choice is being inferred without any data, and the only thing that is needed is the constitutive equation itself and the initial conditions to the problem of interest. In this framework, one can think of the NN as an alternative ODE or PDE solver, where inputs are correlated directly to the predictions. These inputs and their corresponding predictions are used to calculate the residual of the constitutive model at hand, and the goal of the NN is to minimize this residual. Only then one can assure that the training process is informed by a physical intuition. On the other hand and in a data driven discovery framework, the input of the RhINNs is a $n \times 3$ matrix, in which the first and second columns are time and shear rate (${\dot{\gamma }}$), respectively, and the final column is the shear stress at that particular shear rate and time measured experimentally or calculated numerically. Since experimental observation of structure parameter is not feasible, we cannot include this information into the training process. Here by knowing the experimental measurements of only shear stress, the goal of NN is to minimize the residual for the constitutive model of choice and return the predicted model parameters.

In general, a system of ODEs with two independent variables can be written as Eq. (4).

$$\begin{aligned} \begin{aligned} {\left\{ \begin{array}{ll} \dot{y_1} &{}= {\mathcal {F}}_1(y_1,y_2,t) \\ \dot{y_2} &{}= {\mathcal {F}}_2(y_1,y_2,t) \end{array}\right. } \end{aligned} \end{aligned}$$

(4)

In this system of ODEs, $y_1(t)$ and $y_2(t)$ are the hidden solution and ${\mathcal {F}}_i$ are nonlinear operators in the time domain of [0, T]. As a motivating example, a TEVP material [described by Eq. (2)], represents a system of ODEs with two equations. Hence, the definitions of $f_i()$ shown on the right box of Fig. 1 would turn into $f_1(\sigma ,\lambda ,{\dot{\gamma }})=\frac{G}{\eta _s + \eta _p}[-\sigma (t) +\sigma _y \lambda (t)+ (\eta _s + \eta _p \lambda (t)){\dot{\gamma }}(t)]$ and $f_2(\sigma ,\lambda ,{\dot{\gamma }})=k_+ (1 - \lambda (t)) - k_- \lambda (t) {\dot{\gamma }}(t)$. By adjusting the correspondence from one neuron to another, and from one layer to another in a NN, a meta-model is produced to correlate the output results based on a series of new input variables. The variables of a RhINNs are learned by minimizing the loss function, that captures the residual of each equation in addition to the the discrepancy between the predicted and the actual Initial Condition (IC) during the training process. Eqs. (5) and (6) present the RhINNs loss functions for the direct and the inverse problems, respectively.

$$\begin{aligned} MSE_{Dir}= & {} MSE_{R} + MSE_{IC} \end{aligned}$$

(5)

$$\begin{aligned} MSE_{Inv}= & {} MSE_{R} + MSE_{d} \end{aligned}$$

(6)

In our system, and in Eqs. (5) and (6), $MSE_{R}$ (Eq. 7) is the residual calculated from the system of ODEs, $MSE_{d}$ (Eq. 8) is the deviation of RhINNs predictions from actual values , and $MSE_{IC}$ (Eq. 9) is discrepancy between the actual and the predicted values of the initial conditions. In practice, initial conditions are imposed by calculating the predicted output at t=0 and adding the discrepancy between the predicted value and actual initial condition as defined in Eq. (8) to total loss function. It should be noted that in an inverse approach, existence of IC is not a necessity.

$$\begin{aligned} MSE_{R}= & {} \sum _{j=1}^{N_{eqs}} \frac{1}{N_{R_j}} \sum _{i=1}^{N_{R_j}} |Residual_{(equation_j)}(t_i)|^2 \end{aligned}$$

(7)

$$\begin{aligned} MSE_{IC}= & {} |Predicted_{IC}-Actual_{IC}|^2 \end{aligned}$$

(8)

$$\begin{aligned} MSE_{d}= & {} \sum _{i=1}^{N_d} |Predicted(t_i)-Actual(t_i)|^2 \end{aligned}$$

(9)

In an inverse problem, the model parameters are chosen to be variables that can be changed throughout the optimization process. After initialization, a total loss is calculated based on Eq. (6). Afterward, these variables are consistently changed during the optimization process until the loss function is minimized (and becomes zero in an ideal case). After reaching a certain criteria, the training process stops and the model parameters are presented. It should be mentioned that there are no strict boundaries set for any of the parameters used in this work.

Results and discussion

As describe previously, the ultimate goal is to develop a reliable and accurate platform for fast data-driven solution of complex time and rate dependent constitutive equations. Thus here, the scope of our study is limited to demonstrating RhINNs as a robust alternative meta-constitutive model. In the following, several flow protocols of rheometric significance are solved in both direct and inverse problems, referred to as data driven solution and data driven discovery respectively. In data driven solution the NN is employed to find an answer in a certain domain for an existing set of equations and initial conditions. On the other hand, with the inverse problems, i.e. data driven discovery, the characteristics of a system of ODEs and hence material’s properties are predicted using the data at hand and the system of ODEs.

Data driven solution

RhINNs are devised and employed as alternative tools to solve systems of ODEs used in complex fluid modelling. In the training process, only the system of ODEs and the initial conditions are used without any additional data, hence the output of the NN will be the solution to the constitutive model. First, we consider different models outlined in Eqs. (1), (2), and (3) to show the capability of RhINNs in solving various constitutive equations. Then, we explore the role of rheometric protocol by solving for the stress response under a range of different deformation protocols. Note that there exist a number of viable options as thixotropic constitutive models to be adapted here; however, we are considering the three different constitutive models in Eqs. (1), (2), and (3), as they provide an increasing level of complexity with considerable number of model parameters involved in the IKH model: Eq. (1) is the simplest model for a thixotropic material with five (5) model parameters and a single algebraic equation coupled with an ODE, Eq. (2) includes elasticity with an additional parameter, and Eq. (3) contains a total of nine (9) model parameters and three coupled equations. Figure 2 shows the comparison between the ground truth solution of different thixotropic constitutive models and RhINNs predictions with parameters based on Table 1 and in a start-up of shear flow protocol with ${\dot{\gamma }}=0.1\,\text {s}^{-1}$. While Table 1 outline the choice of model parameters used in Fig. 2 for each model, we performed similar benchmarking with a wide range of parameters and initial conditions, and found that the RhINNs predictions are not limited by the choice of parameters or the initial conditions. Results in Fig. 2 clearly indicate that RhINNs’ predictions closely track the ground solution of the shear stress response, regardless of the choice of model. The value of the microstructure parameter, $\lambda $, ranges from zero for a fully destructured/fluidized system, to unity for a fully structured material, ex. unyielded gel. Comparing the Fig. 2a,b, where fully fluidized and fully structured systems are compared, it is evident that the RhINNs predictions remain valid by changing the initial conditions as well. The Fig. 2c,d respectively show the RhINNs-predicted flow curve as well as the ground truth solution of TEVP, and IKH constitutive models, with increasing levels of complexity.

Table 1 Values of the model parameters used for the flow curves presented in Fig. 2.

Full size table

The regression plot of the trained model for a direct problem with a TEVP model at the shear rate of ${\dot{\gamma }}=0.1\,[1/\text {s}]$ is shown in Fig. 3. As the figure shows, there is an excellent correlation between the predicted solution and the ground solution in this case, suggesting that the training is performed properly.

In the next step, we sought to investigate the role of the shear rate magnitude on the RhINNs predictions for the stress response of TEVP fluids. This is particularly important with respect to application of any data driven methodology to rheometric flows and predictions, where differences in the magnitude of applied rates and resulting stresses are commonly presented in logarithmic scales. Since the yield stress and the steady state shear stress, depending on the applied deformation rate, can greatly differ in their magnitude, it is critical to ensure that the neural network provides a reliable prediction for the low shear stress regime and the high shear stress regime alike. In other words, one has to ensure that the small values of stress and the residuals for the correlations in this shear regime are not screened by the large stresses at the highest deformation rates. To do this, we consider the Eq. (2) to be the constitutive model of choice with simple start up of shear protocol. Five different shear rates from ${\dot{\gamma }}=0.01\,\text {s}^{-1}$ to ${\dot{\gamma }}=100\,\text {s}^{-1}$ are presented to cover four decades of change in shear rate. As presented in Fig. 4 the predictions made by the RhINNs and the ground solution of the TEVP fall exactly on top of one another for all shear rates studied here.

In practice, a number of different flow protocols are commonly applied to a complex fluid to probe the relevant material function, properties and characteristic timescales. Start-up of shear, flow hysteresis or ramp cycles, small amplitude oscillatory shear (SAOS), LAOS, and step shear rate are among the most common rheometric flow protocols that can be used in order to fully investigate a thixotropic fluid. The data-driven methodologies commonly fail to capture the details of changes in a flow protocol since the equations are not fully solved, but are merely correlated in time. The traditional data-driven methodologies, such as deep neural networks without introduction of physical laws, commonly fail to capture the details of changes in a flow protocol since the equations are not fully solved, but are merely correlated in time. For instance, even if enough data is used for accurate training of a deep neural network for constant shear rate flow protocol, the network learns to predict the steady state response of the material to an applied deformation rate at long time (longer than the material timescale). Thus, when a flow protocol involves change of direction or magnitude at a later time, classical neural networks lose their ability to track the experiment entirely. Nonetheless, RhINNs does not suffer from the same deficiency and is able to recover these rate changes in different protocols. Figure 5 shows the comparison between the RhINNs prediction and the ground solution of the shear stress response of a TEVP fluid (Eq. 2) under flow hysteresis and LAOS experiments. The rheological hysteresis area is a hallmark of thixotropic fluids, where a ramp down followed by a ramp up shear protocol, returning to the initial shear rate (which is large enough to fluidize the entire system and erase any thermokinematic memory) results in close-loop flow curves. The physical significance of such protocols is the fact that the magnitude of this area strictly depends on the characteristic timescale at which the material begins to erase its memory to the previous deformation. Hence, such methods are used commonly to characterize the thixotropic timescale in TEVP fluids^20,21. On the other hand, the so-called Lissajous curves that describe the shear stress response of a fluid to a large oscillatory shear deformation have been studied extensively in order to characterize time and rate dependent complex fluids such as TEVPs^{4,26,32,40,41,42,43}. In both protocols, RhINNs closely mimics the ground solutions of the TEVP constitutive model over the entire range of shear rates and amplitudes (for brevity, only one frequency and amplitude is presented).

In order to fully probe the ability of our RhINNs methodology to predict the stress response of a TEVP fluid to temporal changes in the imposed shear rate, a more complex step rate-change protocol was applied. Figure 6 represents the comparison between the RhINNs predictions and the ground solutions of the TEVP constitutive model for the shear stress response of a complex shear rate protocol applied to the complex fluid: initial shear rate of ${\dot{\gamma }}=100\,\text {s}^{-1}$ is applied for 50 s, followed by a linear ramp down to ${\dot{\gamma }}=0.1\,\text {s}^{-1}$ over the next 50 s. Upon reaching ${\dot{\gamma }}=0.1\,\text {s}^{-1}$ the deformation rate is kept constant for the third 50 s of the protocol, followed by a final step-up to the initial shear rate of ${\dot{\gamma }}=100\,\text {s}^{-1}$. The results in Fig. 6 clearly indicate that even with a complex shear rate protocol, RhINNs gives a robust predictions with virtually no deviation from the ground solution of the constitutive equation.

Data driven discovery

As described previously, a major leap forward in constitutive modelling of complex fluids and in material design and discovery can be made by enabling data driven methods that recover material functions from a limited number of experiments. Practically, a series of different experiments are performed in order to fit a particular model that describes observed rheological behavior, in order to determine the model parameters and hence material properties of a thixotropic fluid. Of particular interest is to determine the timescales and kinetics of structure break-up and formation under different flowing conditions. Thus in this section, we seek to find the model parameters and material’s time constants from a series of simple flow curves. To do this, we employ our RhINNs methodology to solve for the inverse problem, and predict the material parameters from the shear stress response, i.e. find the hidden rheology with a limited set of data. This is done pedagogically, and by beginning with an assumption of partial information regarding material properties. The ultimate goal is to identify the number of experiments required to provide an accurate prediction for the material properties using the inverse RhINNs platform. These properties include the elastic modulus and the yield stress of the fluid, as well as the time constants required to recover the temporal evolution of the structure parameter. The number of data points commonly collected over a particular rheometric flow protocol greatly depends on the material under investigation, the inherent timescales associated with the material and with the flow protocol, etc.. Nonetheless, typically rheological data are collected and represented in logarithmically spaced intervals to reveal the material functions/timescales with respect to the applied flow protocol. Here, we used $\sim 200$ data points to remain in a relevant data size with respect to the common experiments. In the first step, we seek to identify the time constants for the time evolution of the structure parameter from the shear stress response, assuming that the yield stress and the elastic modulus of the fluid are known. Namely, flow curves such as ones presented in Fig. 2 are provided, and the RhINNs is used to recover the model parameters. Table 2 represents the actual and predicted values of $k_+$ and $k_-$, using a single shear flow curve, whether that is start up of flow, step rate, ramp cycle or SAOS/LOAS. For all various flow protocols, RhINNs recovers the time constants for the structure evolution, having the rest of parameters, with less than one percent error. The exception is the oscillatory shear protocol, for which the error rises to smaller than 5 percent. Nonetheless, this is extremely accurate considering that for each of these protocols, only one set of shear stress response is provided for RhINNs to learn the hidden rheology.

Table 2 RhINNs-predicted values for $k_+$ and $k_-$ in different shear rate protocols. For all cases $G = 30\,[\text {Pa}]$, $\sigma _y = 0.5\,[\text {Pa}]$, $\eta _s = 5\,[\text {Pa s}]$, and $\eta _p = 1\,[\text {Pa s}]$ are known from the material properties.

Full size table

In a similar fashion, we also investigated the impact of imposed shear rate on the performance of the RhINNs in solving the inverse problem, and finding the time constants for the structure evolution, $k_+$ and $k_-$. Table 3 shows the RhINNs predictions and the actual values for five (5) different shear rate magnitudes as studied in the direct problem. Evidently, the RhINNs-predicted time constants are closely tracking the actual values, with better efficiency in the intermediate range of applied shear rates. This could simply be explained by the fact that at the two extremeties, the low stress and high stress responses of the material become more dominant and thus slightly impact the overall predictions. Nonetheless, RhINNs predictions remain in the range of less than 5 percent error for all shear rates studied. Alternatively, Fig. 7 shows the time evolution of the structure parameter using RhINNs-predicted time constants for the flow curves previously seen in Fig. 4, compared to ground solution of the same parameter from a TEVP model. In these curves, RhINNs is solving for the time evolution of the microstructure parameter, based on a single shear stress vs. applied shear rate flow curve.

Table 3 RhINNs predicted values for $k_+$ and $k_-$ in different values of simple shear rate. For all cases $G = 800\,[\text {Pa}]$, $\sigma _y = 20\,[\text {Pa}]$, $\eta _s = 20\,[\text {Pa s}]$, and $\eta _p = 20\,[\text {Pa s}]$ are known from the material properties.

Full size table

One of most important factors contributing to the performance of our proposed RhINNs is the sensitivity of the method to noisy data. Indeed most of the experimental results are naturally associated with some level of noise, due to experimental artifacts and unknown variables affecting the results. To ensure applicability of RhINNs to real-world experimental data, we investigated the effect of noisy data on parameter prediction of RhINNs in an inverse solution. We are considering one of the cases presented in Table 3 with shear rate of ${\dot{\gamma }}=0.1\,[1/\text {s}]$ in a start-up of a flow. We intentionally introduce different levels of noise based on uncorrelated Gaussian noise process to the data at hand. Table 4 represents the results of the predicted coefficients for the structure evolution, $k_+$ and $k_-$. Upon addition of 5% noise to the data, the predictions remain in very good agreement with the ground solution. This further confirms that the proposed RhINNs algorithm is not compromised by the noisy data and the predictions stay realistic and explanatory of the material under question.

Table 4 RhINNs predicted values for $k_+$ and $k_-$ in a start-up of a flow with shear rate of ${\dot{\gamma }}=0.1\,[1/\text {s}]$ with noisy data. For all cases $G = 800 [\text {Pa}]$, $\sigma _y = 20\,[\text {Pa}]$, $\eta _s = 20\,[\text {Pa s}]$, and $\eta _p = 20\,[\text {Pa s}]$ are known from the material properties.

Full size table

We also interrogated the performance of our inverse RhINNs methodology to determine the entire list of material properties/model parameters from a limited number of experiments. To do this, we have provided the time evolution of the shear stress response of a TEVP fluid to our RhINNs platform and ask for the model to predict six (6) model parameters involved: the two time constants for the kinetics of structure formation and break-up, the elastic modulus, yield stress, and the background and plastic viscosities. Table 5 represents the actual against RhINNs-predicted values of all of these material properties provided the simple shear rate flow curves. The predictions are in excellent agreement with the actual values.

Table 5 RhINNs-predicted values for all coefficients based on 10 different experiments in a start-up of a flow with shear rates ranging between 0.1 and 1 [1/s].

Full size table

As demonstrated in Figs. 5, 6 and Table 5, the RhINNs platform accurately predicts the time evolution of the shear stress response of a thixotropic fluid under different flow protocols having the material properties or vise versa. Hence, combining the forward and inverse solutions, i.e. data-driven solution and discovery, one can recover the material properties through a series of simple experimental protocols followed by accurate prediction of the material behavior under a different more complex flow. This is investigated here by evaluating the possibility of predicting the stress behavior of a TEVP fluid under complex shear rate protocol, given its stress response to a simple shear experiment. Figure 8 presents the RhINNs predictions following two different provided data sets for a TEVP fluid under LAOS protocol: i. elastic modulus, yield stress, background and plastic viscosities are known, as well as the time evolution of the shear stress for a single applied shear rate, and ii. no information is available for the material, but shear stress responses are available for ten (10) different applied shear rates. It should be mentioned that both of these scenarios present realistic experimental protocols. For instance, from a single flow protocol, one can measure the yield stress value, the terminal Newtonian viscosity at the highest shear rates, and the background viscosity knowing the chemical nature of the background fluid. Alternatively, one may have virtually no information about these material properties, but able to run a series of simple shear rate protocols. Regardless of the available information, the RhINNs architecture provides an excellent prediction compared to ground solution of a TEVP fluid. In both of these settings, a data driven discovery RhINNs is in serial with a data driven solution RhINNs (Fig. 8).

Conclusion

In this work, we introduced and studied the performance of an adaptable and comprehensive data-driven algorithm for constitutive meta-modeling of complex fluids with respect to their rheological behavior. The proposed Rheology-Informed Neural Networks, RhINNs, is capable of taking advantage of NN versatility in solving constitutive equations for both direct and inverse problems. In the direct problems, the RhINNs can be used as an alternative method for solution of coupled ODEs with excellent accuracy and efficiency. This is particularly of interest with respect to complex rheological constitutive models that are commonly challenging to be implemented within CFD platforms of choice. In the inverse solution, referred to as data-driven discovery, the RhINNs accurately recovers the material properties and the model parameters having only a limited number of data sets and rheometric measurements. Due to presence of different timescales and different effects depending on the flow history, traditional approaches require several experimental protocols tested to find the best parameter fitting of a complex fluid model and to describe the system under question. Here and using RhINNs, we show only one (assuming we have partial information) or 10 (for a brand new material) simple start-up of a flow experimental data are sufficient to calculate the model parameters with a very good accuracy. This provides an extremely powerful platform for employing data-driven and machine learning algorithms in areas of research where often small sizes of data available prevents a meaningful predictive capability to be devised. To test the robustness of our proposed method, we showed one can easily determine the model parameters with a great accuracy, regardless of the type of experimental data at hand. We demonstrated that the incorporation of a physical intuition into the neural network architecture in the form of a constitutive model significantly improves the predictive ability of the algorithm. We also argue that even with a similar computational efficiency for the training of RhINNs compared to that of the traditional approaches, the main advantage of RhINNs methodology (and in general, similar science-based data-driven techniques) lies within reduction of the required data to determine model parameters and thus full characterization of a material with respect to any given rheological or thixotropic constitutive relation of interest. We need to stress on the fact that the goal of current work is not to provide a replacement for ODE solvers in either direct or inverse problem, but solely introducing a data driven method that can further be used as a powerful platform for integration of non-Newtonian constitutive laws of interest. It should also be noted that while inverse solution through common ODE backpropagator solvers will return a constant solution for characterization of a material, RhINNs (and other data-driven techniques) improve upon availability of more data and thus provide a more reliable characterization over time as well. RhINNs methodology introduced here can be directly used in order to significantly reduce the number of experiments required for probing different material properties and model parameters.

References

Colombo, J. & Del Gado, E. Stress localization, stiffening, and yielding in a model colloidal gel. J. Rheol. 58, 1089–1116. https://doi.org/10.1122/1.4882021 (2014).
Article ADS CAS Google Scholar
de Souza Mendes, P. R. Modeling the thixotropic behavior of structured fluids. J. Non-Newtonian Fluid Mech. 164, 66–75. https://doi.org/10.1016/j.jnnfm.2009.08.005 (2009).
Article CAS MATH Google Scholar
de Souza Mendes, P. R. Thixotropic elasto-viscoplastic model for structured fluids. Soft Matter 7, 2471. https://doi.org/10.1039/c0sm01021a (2011).
Article ADS CAS Google Scholar
Dimitriou, C. J. & McKinley, G. H. A comprehensive constitutive law for waxy crude oil: a thixotropic yield stress fluid. Soft Matter 10, 6619–6644. https://doi.org/10.1039/C4SM00578C (2014).
Article ADS CAS PubMed Google Scholar
Gurnon, A. K. & Wagner, N. J. Microstructure and rheology relationships for shear thickening colloidal dispersions. J. Fluid Mech. 769, 242–276. https://doi.org/10.1017/jfm.2015.128 (2015).
Article ADS MathSciNet CAS MATH Google Scholar
Gelbart, W. M. & Ben-Shaul, A. The, “new’’ science of “complex fluids’’. The J. Phys. Chem. 100, 13169–13189. https://doi.org/10.1021/jp9606570 (1996).
Article CAS Google Scholar
Masschaele, K., Fransaer, J. & Vermant, J. Flow-induced structure in colloidal gels: direct visualization of model 2D suspensions. Soft Matter 7, 7717–7726. https://doi.org/10.1039/C1SM05271C (2011).
Article ADS CAS Google Scholar
Rogers, S. A., Vlassopoulos, D. & Callaghan, P. T. Aging, yielding, and shear banding in soft colloidal glasses. Phys. Rev. Lett.https://doi.org/10.1103/PhysRevLett.100.128304 (2008).
Article PubMed Google Scholar
Vermant, J. & Solomon, M. J. Flow-induced structure in colloidal suspensions. J. Phys. Condens. Matter 17, R187–R216. https://doi.org/10.1088/0953-8984/17/4/r02 (2005).
Article ADS CAS Google Scholar
Wagner, N. J. & Brady, J. F. Shear thickening in colloidal dispersions. Phys. Today 62, 27–32. https://doi.org/10.1063/1.3248476 (2009).
Article CAS Google Scholar
Herschel, W. H. & Bulkley, R. Konsistenzmessungen von Gummi-Benzollösungen. Kolloid-Zeitschrift 39, 291–300. https://doi.org/10.1007/BF01432034 (1926).
Article Google Scholar
Bingham, E. C. An investigation of the laws of plastic flow. Bull. Bureau Standards 13, 309. https://doi.org/10.6028/bulletin.304 (1916).
Article Google Scholar
Gillespie, T. An extension of Goodeve’s impulse theory of viscosity to pseudoplastic systems. J. Colloid Sci. 15, 219–231. https://doi.org/10.1016/0095-8522(60)90024-6 (1960).
Article CAS Google Scholar
Mewis, J. Thixotropy—a general review. J. Non-Newtonian Fluid Mech. 6, 1–20. https://doi.org/10.1016/0377-0257(79)87001-9 (1979).
Article CAS MATH Google Scholar
Mujumdar, A., Beris, A. N. & Metzner, A. B. Transient phenomena in thixotropic systems. J. Non-Newtonian Fluid Mech. 102, 157–178. https://doi.org/10.1016/S0377-0257(01)00176-8 (2002).
Article CAS MATH Google Scholar
Barnes, H. A. Thixotropy—a review. J. Non-Newtonian Fluid Mech. 70, 1–33. https://doi.org/10.1016/S0377-0257(97)00004-9 (1997).
Article CAS Google Scholar
Larson, R. G. & Wei, Y. A review of thixotropy and its rheological modeling. J. Rheol. 63, 477–501. https://doi.org/10.1122/1.5055031 (2019).
Article ADS CAS Google Scholar
Larson, R. G. Constitutive equations for thixotropic fluids. J. Rheol. 59, 595–611. https://doi.org/10.1122/1.4913584 (2015).
Article ADS CAS Google Scholar
Wei, Y., Solomon, M. J. & Larson, R. G. A multimode structural kinetics constitutive equation for the transient rheology of thixotropic elasto-viscoplastic fluids. J. Rheol. 62, 321–342. https://doi.org/10.1122/1.4996752 (2018).
Article ADS CAS Google Scholar
Divoux, T., Grenard, V. & Manneville, S. Rheological hysteresis in soft glassy materials. Phys. Rev. Lett.https://doi.org/10.1103/PhysRevLett.110.018304 (2013).
Article PubMed Google Scholar
Jamali, S., Armstrong, R. C. & McKinley, G. H. Multiscale nature of thixotropy and rheological hysteresis in attractive colloidal suspensions under shear. Phys. Rev. Lett.https://doi.org/10.1103/PhysRevLett.123.248003 (2019).
Article PubMed Google Scholar
Jamali, S., Armstrong, R. C. & McKinley, G. H. Time-rate-transformation framework for targeted assembly of short-range attractive colloidal suspensions. Mater. Today Adv.https://doi.org/10.1016/j.mtadv.2019.100026 (2020).
Article Google Scholar
Goodeve, C. F. & Whitfield, G. W. The measurement of thixotropy in absolute units. Trans. Faraday Soc. 34, 511. https://doi.org/10.1039/tf9383400511 (1938).
Article CAS Google Scholar
Coussot, P., Nguyen, Q. D., Huynh, H. T. & Bonn, D. Viscosity bifurcation in thixotropic, yielding fluids. J. Rheol. 46, 573–589. https://doi.org/10.1122/1.1459447 (2002).
Article ADS CAS Google Scholar
Wei, Y., Solomon, M. J. & Larson, R. G. Quantitative nonlinear thixotropic model with stretched exponential response in transient shear flows. J. Rheol. 60, 1301–1315. https://doi.org/10.1122/1.4965228 (2016).
Article ADS CAS Google Scholar
Armstrong, M. J., Beris, A. N., Rogers, S. A. & Wagner, N. J. Dynamic shear rheology of a thixotropic suspension: comparison of an improved structure-based model with large amplitude oscillatory shear experiments. J. Rheol. 60, 433–450. https://doi.org/10.1122/1.4943986 (2016).
Article ADS CAS Google Scholar
Jacob, A. R., Moghimi, E. & Petekidis, G. Rheological signatures of aging in hard sphere colloidal glasses. Phys. Fluidshttps://doi.org/10.1063/1.5113500 (2019).
Article Google Scholar
de Souza Mendes, P. R. & Thompson, R. L. A critical overview of elasto-viscoplastic thixotropic modeling. J. Non-Newtonian Fluid Mech. 187–188, 8–15. https://doi.org/10.1016/j.jnnfm.2012.08.006 (2012).
Article CAS Google Scholar
Joshi, Y. M. & Petekidis, G. Yield stress fluids and ageing. Rheol. Acta 57, 521–549. https://doi.org/10.1007/s00397-018-1096-6 (2018).
Article CAS Google Scholar
Radhakrishnan, R., Divoux, T., Manneville, S. & Fielding, S. M. Understanding rheological hysteresis in soft glassy materials. Soft Matter 13, 1834–1852. https://doi.org/10.1039/C6SM02581A (2017).
Article ADS CAS PubMed Google Scholar
Jamali, S., McKinley, G. H. & Armstrong, R. C. Microstructural rearrangements and their rheological implications in a model thixotropic elastoviscoplastic fluid. Phys. Rev. Lett.https://doi.org/10.1103/PhysRevLett.118.048003 (2017).
Article PubMed Google Scholar
Geri, M., Venkatesan, R., Sambath, K. & McKinley, G. H. Thermokinematic memory and the thixotropic elasto-viscoplasticity of waxy crude oils. J. Rheol. 61, 427–454. https://doi.org/10.1122/1.4978259 (2017).
Article ADS CAS Google Scholar
Mahmoudabadbozchelou, M. et al. Data-driven physics-informed constitutive metamodeling of complex fluids: a multifidelity neural network (MFNN) framework. J. Rheol. 65, 179–198. https://doi.org/10.1122/8.0000138 (2021).
Article ADS CAS Google Scholar
Janes, K. A. & Yaffe, M. B. Data-driven modelling of signal-transduction networks. Nat. Rev. Mol. Cell Biol. 7, 820–828. https://doi.org/10.1038/nrm2041 (2006).
Article CAS PubMed Google Scholar
Solomatine, D. P. & Ostfeld, A. Data-driven modelling: some past experiences and new approaches. J. Hydroinform. 10, 3–22. https://doi.org/10.2166/hydro.2008.015 (2008).
Article Google Scholar
Solomatine, D., See, L. & Abrahart, R. Data-driven modelling: concepts, approaches and experiences. In Practical Hydroinformatics. Water Science and Technology Library (eds. Abrahart, R. J., See, L. M., Solomatine, D. P.), vol. 68, https://doi.org/10.1007/978-3-540-79881-1_2 (Springer, Berlin, Heidelberg, 2009).
Raissi, M., Perdikaris, P. & Karniadakis, G. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 378, 686–707. https://doi.org/10.1016/j.jcp.2018.10.045 (2019).
Article ADS MathSciNet MATH Google Scholar
Brunton, S. L., Noack, B. R. & Koumoutsakos, P. Machine learning for fluid mechanics. Annu. Rev. Fluid Mech. 52, 477–508. https://doi.org/10.1146/annurev-fluid-010719-060214 (2020).
Article ADS MATH Google Scholar
Baydin, A. G. et al. Automatic differentiation in machine learning: a survey. J. Mach. Learn. Res. 18 (2018).
Blackwell, B. C. & Ewoldt, R. H. A simple thixotropic-viscoelastic constitutive model produces unique signatures in large-amplitude oscillatory shear (LAOS). J. Non-Newtonian Fluid Mech. 208–209, 27–41. https://doi.org/10.1016/j.jnnfm.2014.03.006 (2014).
Article CAS Google Scholar
Blackwell, B. C. & Ewoldt, R. H. Non-integer asymptotic scaling of a thixotropic-viscoelastic model in large-amplitude oscillatory shear. J. Non-Newtonian Fluid Mech. 227, 80–89. https://doi.org/10.1016/j.jnnfm.2015.11.009 (2016).
Article MathSciNet CAS Google Scholar
Min Kim, J., Eberle, A. P. R., Kate Gurnon, A., Porcar, L. & Wagner, N. J. The microstructure and rheology of a model, thixotropic nanoparticle gel under steady shear and large amplitude oscillatory shear (LAOS). J. Rheol. 58, 1301–1328. https://doi.org/10.1122/1.4878378 (2014).
Article ADS CAS Google Scholar
Armstrong, M. J., Beris, A. N., Rogers, S. A. & Wagner, N. J. Dynamic shear rheology and structure kinetics modeling of a thixotropic carbon black suspension. Rheol. Acta 56, 811–824. https://doi.org/10.1007/s00397-017-1038-8 (2017).
Article CAS Google Scholar

Download references

Acknowledgements

MM and SJ would like to acknowledge support by Northeastern University’s Gap Fund program.

Author information

Authors and Affiliations

Department of Mechanical and Industrial Engineering, Northeastern University, Boston, MA, 02115, USA
Mohammadamin Mahmoudabadbozchelou & Safa Jamali

Authors

Mohammadamin Mahmoudabadbozchelou
View author publications
You can also search for this author in PubMed Google Scholar
Safa Jamali
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.M. and S.J. conceptualized the research, developed the methodology, analyzed the results and wrote the main manuscript text. M.M. performed the investigations and prepared all figures and tables. Both authors reviewed the manuscript.

Corresponding author

Correspondence to Safa Jamali.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mahmoudabadbozchelou, M., Jamali, S. Rheology-Informed Neural Networks (RhINNs) for forward and inverse metamodelling of complex fluids. Sci Rep 11, 12015 (2021). https://doi.org/10.1038/s41598-021-91518-3

Download citation

Received: 23 February 2021
Accepted: 27 May 2021
Published: 08 June 2021
DOI: https://doi.org/10.1038/s41598-021-91518-3
Springer Nature Limited

This article is cited by

Reptation theory-similar deep learning model for polymer characterization from rheological measurement
- Javad Rahmannezhad
- Heon Sang Lee
Korea-Australia Rheology Journal (2024)
A Review of Physics Informed Neural Networks for Multiscale Analysis and Inverse Problems
- Dongjin Kim
- Jaewook Lee
Multiscale Science and Engineering (2024)
Fractional rheology-informed neural networks for data-driven identification of viscoelastic constitutive models
- Donya Dabiri
- Milad Saadat
- Safa Jamali
Rheologica Acta (2023)
Data-driven constitutive model of complex fluids using recurrent neural networks
- Howon Jin
- Sangwoong Yoon
- Kyung Hyun Ahn
Rheologica Acta (2023)
Scattering-Informed Microstructure Prediction during Lagrangian Evolution (SIMPLE)—a data-driven framework for modeling complex fluids in flow
- Charles D. Young
- Patrick T. Corona
- Michael D. Graham
Rheologica Acta (2023)

Rheology-Informed Neural Networks (RhINNs) for forward and inverse metamodelling of complex fluids

Abstract

Similar content being viewed by others

Data-driven constitutive model of complex fluids using recurrent neural networks

Data-driven selection of constitutive models via rheology-informed neural networks (RhINNs)

Constitutive Models of Complex Fluids

Introduction

Problem setup and methodology