Abstract
Developing constitutive models that can describe a complex fluid’s response to an applied stimulus has been one of the critical pursuits of rheologists. The complexity of the models typically goes hand-in-hand with that of the observed behaviors and can quickly become prohibitive depending on the choice of materials and/or flow protocols. Therefore, reducing the number of fitting parameters by seeking compact representations of those constitutive models can obviate extra experimentation to confine the parameter space. To this end, fractional derivatives in which the differential response of matter accepts non-integer orders have shown promise. Here, we develop neural networks that are informed by a series of different fractional constitutive models. These fractional rheology-informed neural networks (RhINNs) are then used to recover the relevant parameters (fractional derivative orders) of three fractional viscoelastic constitutive models, i.e., fractional Maxwell, Kelvin-Voigt, and Zener models. We find that for all three studied models, RhINNs recover the observed behavior accurately, although in some cases, the fractional derivative order is recovered with significant deviations from what is known as ground truth. This suggests that extra fractional elements are redundant when the material response is relatively simple. Therefore, choosing a fractional constitutive model for a given material response is contingent upon the response complexity, as fractional elements embody a wide range of transient material behaviors.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
The quest for finding appropriate closed-form constitutive models to represent certain material behaviors has witnessed decades of rigorous research, which was initially galvanized mainly by those observed behaviors (Bingham 1916; Morrison 2001; Bird et al. 1987). Viscous, elastic, plastic, and thixotropic behaviors are commonly capable of categorizing material responses to an imposed actuation. In particular, viscoelastic materials, in which the stress response decays over some characteristic time, inherit the viscous response of fluids and the elastic one from solids. The time for the decay to occur is defined as the relaxation time of the fluid, i.e., the time needed for the material to exhibit an elastic response and relax, which leaves the material with a steady-state viscous response. Canonically, the simplest constitutive model that captures linear viscoelasticity is the Maxwell modelFootnote 1 (Morrison 2001), whose (scalar) shear stress response (\(\sigma _{21}=\sigma \)) under the imposition of a simple shear is as follows:
where \(\eta \) and G are the viscosity (in Pa\(\cdot \)s) and elastic modulus (in Pa), respectively, the ratio of which (\(\eta /G\)) may be thought of as a relaxation time, t is time (in s), and \(\dot{\epsilon }(t)\), in s\(^{-1}\), is the imposed deformation rateFootnote 2. In fact, the Maxwell model is derived by assembling a spring (representative of the elastic response) and a dashpot (the viscous response) in series, superimposing the deformation (\(\epsilon (t)\)) of these two elements, and solving for \(\sigma (t)\). Polymer melts with a small and narrowly distributed molecular weight under small deformations are adequately modeled by the Maxwell constitutive equation. In another canonical model, the Kelvin-Voigt assembly considers the spring-dashpot elements stacked in parallel, which is suitable for creep (stress-controlled) experiments, for instance. While these simple models are based on a single characteristic relaxation time, in real-world viscoelastic fluids, a wide range of relaxation times are commonly present. Classically, this can be addressed by assembling a number of spring-dashpot (Maxwell) branches in parallel and thus capturing a range of relaxation times, also known as the generalized Maxwell model. On the other hand, in many rheologically relevant materials such as gels and worm-like micellar solutions and a range of polymeric systems, the material exhibits a power-law response, i.e., \(\epsilon (t)\propto t^a\). Such power-law responses are not well described by canonical rheological models such as the Maxwell or Kelvin-Voigt models (Tschoegl 2012; Jaishankar and McKinley 2013).
As mentioned above, the viscoelastic response inherits the elastic response of a solid (\(\sigma (t)\propto \epsilon \)) and the viscous behavior of a Newtonian fluid (\(\sigma (t)\propto \frac{\partial \epsilon }{\partial t}\)). Scott-Blair (Scott-Blair 1947) proposed a compact way to describe viscoelasticity by borrowing the concept of fractional derivatives (Stankiewicz 2018; Jaishankar and McKinley 2013):
where E and \(\tau \) are the elastic modulus (in Pa) and relaxation time (in s), respectively, and \(0\le \alpha \le 1\) is the fractional derivative order, effectively creating an element that interpolates between the constitutive responses of a spring and a dashpot; see Fig. 1. Such elements are usually referred to as Scott-Blair or spring-pot elements. In this formalism, the product of E and \(\tau ^\alpha \) (in Pa\(\cdot \)s\(^\alpha \)) may be thought of as a quasi-property, \(\mathbb {V}\), whose unit depends on the fractional derivative. The seemingly unphysical perception of this quasi-property (and its non-integer dimension in time) may be justified by the fact that \(\mathbb {V}\) represents the firmnessFootnote 3 of a material (Scott-Blair and Coppen 1942).
Fruitful efforts have been made to approximate (and model) the observed behavior of different fluids using the concept of fractional derivatives. Highly anomalous butyl rubber (Scott-Blair 1947), soft particulate gels (Bantawa et al. 2022), cheese (Bonfanti et al. 2020; Faber et al. 2017), Xanthan gum (Jaishankar and McKinley 2014), and biological samples such as highly viscoelastic bovine serum albumin (BSA) (Jaishankar and McKinley 2013) have been successfully modeled with respective fractional models, while the classical counterparts of these models often require an exceedingly large number of the canonical Maxwell (or Kelvin-Voigt) elements and thus circumscribing the applicability of classical approaches to model viscoelastic samples with power-law responses (Tschoegl 2012; Jaishankar and McKinley 2013). Consistent and approachable methods, therefore, are called for to identify fractional models apposite to such viscoelastic samples. The issue, however, is that solution of fractional constitutive models, compared to classical ordinary differential equations that are used in rheological constitutive models, is far from trivial. As such, platforms that can automatically solve for different forms of fractional models may pave the way for further development of this class of compact models.
In a forward problem, a constitutive equation, e.g., a fractional viscoelastic model (FVEM), is at hand with all the initial conditions and parametric information, i.e., the fractional derivative order (\(\alpha \)) and quasi-properties, and the task is to solve for a rheological property such as \(\sigma (t)\). Several methods have been proposed and employed in the literature to solve FVEMs, such as the Laplace transform (Mainardi 2010; Feng et al. 2020; Fang et al. 2020), finite-difference (Shen 2020; Lin et al. 2007; Diethelm et al. 2005), finite-element (Alotta et al. 2018), and the Adomian decomposition method (Tripathi et al. 2010), among others. However, in inverse problems, measurable observations are gathered from a sample with missing information about the constitutive equation that governs the rheological process, and the goal is to recover those missing parameters by leveraging the collected data. The algorithms mentioned above, however, often struggle in the face of inverse problems and ill-posedness (Jagtap et al. 2022), meaning that a solution may be either nonexistent or not unique. The parameter recovery of generalized Maxwell models (by stacking finite spring-dashpot units) can quickly become prohibitive as the number of units increases. Data-driven methodologies, on the other hand, can tolerate ill-posedness and thus are suitable choices when tackling inverse (and forward) problems (Chen et al. 2020) to provide unified platforms for the identification and discovery of FVEMs.
Machine learning platforms have shown promise to obtain forward and inverse solutions of differential equations alike (Raissi et al. 2019; Karniadakis et al. 2021; Raissi et al. 2020; Raissi 2018; Karnakov et al. 2022; Li et al. 2022). By inducing physical intuition into the training process, physics-informed neural networks (PINNs) were developed for the forward and inverse analysis of many scientific problems (Cai et al. 2022; Lawal et al. 2022; Ishitsuka and Lin 2023; Asrav and Aydin 2023; Tang et al. 2023). By implicitly (or explicitly) informing the neural net (NN) of a rheological constitutive model, several rheology-informed neural network (RhINN) platforms were recently developed (Mahmoudabadbozchelou and Jamali 2021; Mahmoudabadbozchelou et al. 2021, 2022b, a; Saadat et al. 2022). Other efficient and promising data-driven algorithms and tools have also been developed to recover parameters in rheological problems of interest (Freund and Ewoldt 2015; Armstrong et al. 2017; Singh et al. 2019; Thakur et al. 2022; Reyes et al. 2021). Generally, RhINNs can be used to detect, construct, and solve a variety of rheological constitutive models in the form of ordinary differential equations. Nonetheless, as these platforms have not been developed and tested for fractional derivatives, this work specifically focuses on the identification of FVEMs using RhINNs in an inverse implementation.
Here, as a subset of all the possible combinations of fractional viscoelastic models, three configurations of FVEMs, namely, the fractional Maxwell model (FMM), fractional Kelvin-Voigt model (FKVM), and fractional Zener model (FZM), which is a fractional Maxwell unit connected to a spring-pot in parallel are selected. The goal is to recover the fractional derivative order of each spring-pot element, assuming that other quasi-properties are known from previous experiments or rheometry. This task is far from trivial, as inverse-problem solvers for fractional constitutive equations are yet to be popularized, and the pressing need for material discovery propels the numerical and data-driven methods to confine the parameter space of inverse problems and eventually recover those parameters. While these cases by no means encompass all the possible combinations and intricacies of FVEMs, the selected configurations nonetheless hand valuable information about the applicability of RhINNs in solving fractional inverse problems. The ultimate goal is to develop a generalizable platform for the data-driven solution of fractional constitutive models without compromising accuracy as the complexity of the underlying response/model increases.
Problem setup and methodology
In “Fractional Maxwell model,” “Fractional Kelvin-Voigt model,” and “Fractional Zener model,” the fractional Maxwell, Kelvin-Voigt, and Zener models are introduced, respectively, in terms of their fractional differential equations (FDEs) and analytical solutions. These three models are also mechanistically depicted in Fig. 2. Then, in “Rheology-informed neural networks and fractional derivatives,” the concepts behind rheology-informed neural networks (RhINNs) and the numerical procedure to solve FDEs are discussed.
In this work, the analysis is performed by studying the synthetically generated data using the analytical solutions (as opposed to seeking digitized experimental measurements from the literature) because by doing so, the ground truth fractional derivative orders can be tuned to evaluate the robustness in the recovery of fractional derivative orders.
Schematic description of the fractional a Maxwell, b Kelvin-Voigt, and c Zener models using spring-pot elements. The objective of this work is to recover the fractional derivative orders, i.e., \(\alpha \) and \(\beta \) in fractional Maxwell and Kelvin-Voigt models and \(\alpha \), \(\beta \), and \(\gamma \) in the fractional Zener model
Fractional Maxwell model
As mentioned in “Introduction,” each Scott-Blair element is characterized by two parameters (E and \(\tau \)) and a fractional derivative order, \(\alpha \). In the fractional Maxwell model (FMM), two Scott-Blair (spring-pot) elements ((\(E_1,\tau _1,\alpha \)), (\(E_2,\tau _2,\beta \))) are connected in series, similar to the element arrangement in the classical Maxwell model; see Fig. 2a. By assuming equality of stress experienced by each Scott-Blair element (\(\sigma _1=\sigma _2=\sigma \)) and additivity of deformations (\(\epsilon =\epsilon _1+\epsilon _2\)), one may derive the Maxwell FDE (Schiessel et al. 1995; Schiessel and Blumen 1993):
where \(\tau = (E_1\tau _1^{\alpha }/E_2\tau _2^{\beta })^{1/(\alpha -\beta )}\) and \(E = E_1(\tau _1/\tau )^{\alpha }\). In deriving Eq. 3, it is assumed that \(0 \le \beta < \alpha \le 1\) without loss of generality (Schiessel et al. 1995).
The solution of Eq. 3 upon imposing a step strain (\(\epsilon (t)=\epsilon _0 H(t)\), where H(t) is the Heaviside step function) can be derived analytically, and the relaxation modulus, G(t), in Pa, is expressed as follows:
where \(\mathcal {M}_{\kappa ,\mu }\) is the generalized Mittag-Leffler function defined as follows (Schiessel et al. 1995):
where \(\Gamma (.)\) is the gamma function.
In “Fractional Maxwell model,” the solution of Eq. 3 under a unit step strain (\(\epsilon _0=1\)) is sought. In other words, \(\epsilon (t)=H(0_+)=1\), and \(G(t)=\sigma (t)\). Therefore, the solution of Eq. 3 and the analytical relaxation modulus (Eq. 4) coincide and are directly comparable.
Fractional Kelvin-Voigt model
For the case of the fractional Kelvin-Voigt model (FKVM), two Scott-Blair elements (with parameters similar to that in “Fractional Maxwell model”) are stacked in parallel; see Fig. 2b. The deformation, similar to the classical Kelvin-Voigt model, is identical in each branch, and the stresses are additive. By simply adding the stress response of two Scott-Blair elements (Eq. 2) and setting \(\epsilon _1=\epsilon _2=\epsilon (t)\), one may derive the stress response of FKVM (Schiessel et al. 1995):
where E and \(\tau \) continue to be defined as in “Fractional Maxwell model”. The creep compliance, J(t), in Pa\(^{-1}\), is then defined as follows:
where \(\sigma _0\) is the step-stress in a creep experiment. By assuming a unit step-stress (\(\sigma (t)=\sigma _0=1\)), the solution of Eq. 6 (\(\epsilon (t)\)) and the exact solution (Eq. 7) will become identical.
Schematic illustration of the RhINN architecture. In this figure, the material response is assumed to be the relaxation modulus, G(t), which is applicable to the fractional Maxwell and fractional Zener models. For the fractional Kelvin-Voigt model, this output is replaced with the creep compliance, J(t). The fitting parameters (Params) in RhINNs, which are the fractional derivative orders, are defined trainbale
Fractional Zener model
The classical Zener model consists of a Maxwell model in parallel with a spring. In the fractional Zener model (FZM), three fractional elements with parameters \((E_1, \tau _1, \alpha )\), \((E_2, \tau _2, \beta )\), and \((E_3, \tau _3, \gamma )\) are arranged, as displayed in Fig. 2c. The total mechanical response of the model with constraints \(0 \le \beta < \alpha \le 1\) and \(0 \le \gamma \le 1\), and by setting \(\tau = (E_1\tau _1^{\alpha }/E_2\tau _2^{\beta })^{1/(\alpha -\beta )}\), \(E_0 = E_1(\tau _1/\tau )^{\alpha }\), and \(E = E_3(\tau _3/\tau )^{\gamma }\) is given as follows (Schiessel et al. 1995):
The relaxation modulus, G(t), of FZM under a stress-relaxation test with constraints mentioned above is given by the following:
Similar to FMM, a stress relaxation test under a unit step strain is assumed here, i.e., \(\epsilon (t)=H(0_+)=1\), and \(G(t)=\sigma (t)\). Thus, the solution of Eq. 8 and the analytical solution (Eq. 9) can be compared side-by-side.
It is worth mentioning that the range of material response using the analytical solutions, i.e., Eqs. 4, 7, and 9, highly depends on the fractional derivative orders (and quasi-properties, which are assumed given here; see the outset of “Results and discussion”). Consequently, to keep the discussion concise, readers are encouraged to refer to the material response plots provided in “Results and discussion” for more quantitative information relevant to the respective data sets.
Rheology-informed neural networks and fractional derivatives
For all three cases introduced above, the only input to the neural network (NN) is time, t, while the NN output is the material response, which is \(\sigma (t)\) (or G(t)) for the FMM and FZM cases, and \(\epsilon (t)\) (or J(t)) for the FKVM; see Fig. 3. The neural network is composed of several hidden layers, each containing a predefined number of neurons. The trainable variables, i.e., the weights and biases of the neurons plus the fitting parameters (Params in Fig. 3), which are the fractional derivative orders in each case, are learned by minimizing a composite loss function:
where \(\phi _d\) is the loss due to the discrepancy between the predicted material response (\(G_p\)Footnote 4) and the ground truth material response (\(G_{gt}\)), defined as the following mean-squared error (MSE):
Moreover, the residual (Res in Fig. 3) between the two sides of each FDE (\(\phi _f\)) can be calculated, which is also a mean squared error. As an example, this residual for the case of FMM will be as follows:
where \(n_r\) is the number of residual points that are used to define artificial input arrays in time. In writing the second equality in Eq. 12, a unit step-strain function is assumed for \(\epsilon _i\), where the \(\alpha ^{th}\) fractional derivative of a constant function \(y(t)=1\) is known to be \(t^{-\alpha }/\Gamma (1-\alpha )\) (Garrappa et al. 2019).
The objective in an inverse problem, as stated in “Introduction,” is to recover the fitting parameters of an embedded constitutive equation, which are the three FDEs introduced in “Fractional Maxwell model,” “Fractional Kelvin-Voigt model,” and “Fractional Zener model.” In a RhINN implementation, these FDEs are interrogated at discrete points in time (\(n_r\) in Eq. 12). To calculate the residual loss component (\(\phi _f\)), it is thus necessary to adhere to a definition of fractional derivatives best suited for the problem at hand. Among the existing definitions (or senses) of fractional derivatives (Baleanu 2016), the Riemann-Liouville (RL) and Caputo definitions seem to receive more attention (Jiang and Zhang 2020). Despite the equivalency of these two senses for certain viscoelastic cases (e.g., the fractional Zener model (Bagley 2007)), the Caputo solution is the suitable choice when (integer-order derivative) information about the boundary and/or initial conditions is available (Mainardi 2010; Li et al. 2011), which is the case here: the zeroth-order derivative of G(t) and J(t) curves is given, and thus, the initial conditions are accessibleFootnote 5. Thus, the Caputo sense is employed in this work to handle the fractional derivative of the outputs (\(\sigma (t)\) or \(\epsilon (t)\)) with respect to the input, t. The \(\alpha ^{th}\) order (\( 0 \le \alpha \le 1\)) fractional derivative of f(t) w.r.t. time in the Caputo sense is defined as follows (Baleanu et al. 2016):
In writing Eq. 13, the lower limit of integration is set to \(t=0\). It is worth mentioning that fractional derivative orders higher than unity are calculable using the Caputo sense; however, fractional derivatives are employed for viscoelastic materials here, and thus, the derivative order is deemed to remain within zero and one.
Equation 13 is not readily implementable on NNs, and thus, one will need to approximate the integral term. In this work, a well-established finite-difference algorithm (Diethelm et al. 2005) is used to calculate the Caputo derivative of f(t):
where h is the (uniform) step size (\(h=t/n_r\)), \(f_0\) is the value of f(t) at \(t=0\), and \(a_{n,n_r}\) is the quadrature weights derived from a product trapezoidal rule:
which leads to an error of order \(h^{2-\alpha }\). The hereditary nature of the Caputo fractional derivative lies in the fact that the derivative at each point depends on the past information of f(t) in time.
In this study, \(n_r=100\) artificial points are uniformly distributed between 0.001 and 5s in time to calculate the residual loss, Eq. 12, and train RhINNs. Further increasing \(n_r\) resulted in negligible improvement in parameter recovery. One justification for selecting a small training time window is that the chosen numerical approach for the fractional derivative (Eq. 14) becomes unstable for non-uniform grids. Therefore, by increasing the time window, fewer points are available at the beginning of time, which can compromise the parameter recovery. The exact number of points is used to generate the ground truth data (G(t) or J(t)), which are then fed into the RhINNs.
One crucial step in solving inverse problems using RhINNs is the selection of NN hyperparameters, i.e., the NN parameters that are selected (and optimized) before the training process. For all three FDEs, four hidden layers, each containing ten neurons with a tanh activation function, are employed. Insignificant variation in the achieved convergence (and the execution time) was observed for other values of neuron or layer count. The trainable fractional derivative orders to be recovered are constrained to remain between 0.01 and 0.99 for numerical stability and also to respect the assumptions made to derive Eq. 14. A piecewise learning rate was employed that gradually decays in time to ensure gradient stability and also to find a compromise between the accuracy and the code execution time. Rather than a hard error threshold, the training process is halted once the recovered parameters (or the composite loss, \(\phi \)) plateaus and does not change. Each FDE inherits a particular degree of complexity, and a hard loss threshold that can effectively terminate the training process of one FDE might not apply to the other two. The initial conditions for the fitting parameters are also tabulated in Table 1. The neural network can tolerate initial conditions different than the ones tabulated below, but for unphysical initial conditions (e.g., \(\alpha <\beta \) in the FMM cases), there may be convergence issues. The loss minimization task is handled by TensorFlow’s built-in tf.keras.optimizers.Adam optimizer. The entire optimization task can also be performed using SPSVERBc1’s L-BFGS-B method in our code, but since this method is more computationally expensive than Adam, it is more convenient to perform Adam and use L-BFGS-B if need be.
Results and discussion
For all the FDE cases studied here, the compound elastic modulus (E) and relaxation time (\(\tau \)) are assumed given and were set to their respective values used when generating the exact data. The inevitability of this bold assumption lies in the fact that the (unique) recovery of fractional derivative orders, along with all the properties of the Scott-Blair elements, was infeasible for some cases. Nonetheless, assuming missing values of E and \(\tau \), the compound elastic modulus is accessible by taking the initial slope of the stress (or the relaxation modulus in a step-strain stress relaxation test) response in time. The relaxation time, in a similar fashion, is derivable by extracting the time needed for the material to relax the elastic response, which is the time range before the stress (or G(t)) plateau. Admittedly, these pieces of information may demand rounds of experiments. However, it is safe to assume that these parameters can be at least estimated with an acceptable level of confidence. Also, it is worth mentioning that our preliminary work suggests that all parameters can be recovered through RhINN solutions in the majority of the cases studied, but further work on that front is required to provide a robust and generalizable framework. As such, here and in this work, we limit the study to the recovery of fractional derivative orders. These fractional derivative orders are not readily accessible from rheometry, and RhINN is interrogated to precisely pinpoint these derivative orders.
Fractional Maxwell model
For the fractional Maxwell model, as mentioned in “Fractional Maxwell model,” RhINN is interrogated to recover the fractional derivative orders of the elements, i.e., \(\alpha \) and \(\beta \), as shown in Fig. 2a, under a stress relaxation test with a unit step-strain (\(\epsilon (t)=H(t)\)). The neural network is trained on a broad range of \(\alpha \) and \(\beta \) values with \(0 \le \beta < \alpha \le 1\). Moreover, the other two model parameters, i.e., \(E={1}\)Pa and \(\tau ={20}\)s, are assumed given; see the explanation in “Results and discussion.” In Fig. 4, the total relative absolute error (RAE) (in %) between the RhINN predictions and the ground truth values for a wide range of \(\alpha \) and \(\beta \) values is displayed. In each tile, the top and bottom numbers represent the recovered \(\alpha \) and \(\beta \), respectively, which can be cross-checked with their corresponding exact values given for each row and column. The maximum total RAE for the entire parameter space of \(\alpha \) and \(\beta \) is less than 10%, which indicates that RhINN is capable of recovering the fractional derivative order of FMMs.
Total relative absolute error (RAE) in fractional derivative order values (\(\alpha \) and \(\beta \)) compared to their respective exact values for the fractional Maxwell model under a stress relaxation test with a unit step-strain function. The numbers in each tile, from top to bottom, represent the recovered \(\alpha \) and \(\beta \) values, respectively, with their exact values given by their corresponding row and column. The upper triangle is inaccessible since in deriving Eq. 3, it is assumed that \(0 \le \beta < \alpha \le 1\)
From a rheologist’s vantage point, having a precise fractional derivative order recovery might seem secondary to having a measurable rheological property, which is the relaxation modulus, G(t), in the case of a stress relaxation test. That is, the ultimate goal in constitutive modeling is enabling accurate prediction of material response, and as such, the absolute value of the parameters involved in a model may not necessarily mean as much. Since the ground truth formula (Eq. 4) to generate the data is available, it is possible to calculate the recovered relaxation modulus (G(t)) by inserting the recovered \(\alpha \) and \(\beta \) values along with the given values of E and \(\tau \) and then comparing the recovered G(t) with the ground truth one. Such material responses are plotted in Fig. 5 for four combinations of \(\alpha \) and \(\beta \), with the RhINN predictions and the exact G(t) values shown in solid and dashed lines, respectively. The plots in Fig. 5 indicate that even for the case of the largest RAE reported, recovered relaxation spectra is closely tracking the ground truth solution. Recalling the definition of Scott-Blair elements (Eq. 2), the lower value of the fractional derivative order (\(\beta \)) is a manifestation of the high-frequency material response, which is the elastic response in viscoelastic materials. Therefore, changing \(\beta \) should be observable in the initial G(t) response in Fig. 5. Here, The relaxation modulus is lower for the smaller values of \(\beta \) since G(t) is proportional to \(t^{-\beta }\), which decreases as \(\beta \) drops; see Eq. 4.
The relaxation modulus, G(t) of the fractional Maxwell model. The RhINN (solid line) response is calculated by inserting the recovered sets of \(\alpha \) and \(\beta \) into Eq. 4, plotting the material response for a time window equal to that of the training data (0.001 and 5s), and is compared with the ground truth (dashed line) relaxation modulus. The elastic modulus (\(E={1}\)Pa) and relaxation time (\(\tau ={20}\)s) are assumed given
Fractional Kelvin-Voigt model
RhINNs are then applied to recover fractional derivative orders of the elements in the fractional Kelvin-Voigt model (FKVM) under a creep test using a unit step-stress function for various (\(\alpha \), \(\beta \)) combinations with constraint \(0 \le \beta < \alpha \le 1\). In each case, the other model parameters, \(E= 1\)Pa and \(\tau = 20\)s, are assumed given. The total RAE between RhINN predictions and the ground truth values for different fractional derivative orders are shown in Fig. 6. The RAE values are mostly within a 20% band, with the exception of some scenarios where the overall relative absolute inaccuracy reaches up to 50%. Note that in those examples, the small value of \(\beta \) results in a large RAE. The higher RAE scenarios were further investigated by examining the recovery of each particular fractional derivative order (as represented by the values in the tiles). It was found out that the higher order, \(\alpha \), is consistently recovered more accurately than \(\beta \). One possible reason for higher deviation in \(\beta \) recovery could be that RhINN was trained over a short time window, and the higher fractional derivative order (or more viscous element) dominates the material response in the Kelvin-Voigt model within short timescales.
Total relative absolute error (RAE) in the fractional derivative orders (\(\alpha \) and \(\beta \)) with respect to their exact values in the fractional Kelvin-Voigt model under creep compliance for various combinations of fractional derivative orders with constraint \(0 \le \beta < \alpha \le 1\). The numbers in each tile, from top to bottom, represent the recovered \(\alpha \) and \(\beta \) values, respectively, with their exact values given by their corresponding row and column
As previously presented for the case of FMMs, and to support our interpretation, the material response is also plotted using the predicted and ground truth fractional derivative orders. In Fig. 7, the creep compliance (J(t)) is depicted for cases where there are significant differences between the predicted and ground truth fractional derivative orders. In this figure, the recovered derivative orders are inserted into Eq. 7 over a larger time window, i.e., \(1 \times 10^{-2}\le t/\tau \le 1 \times 10^{4}\) to monitor the out-of-sample response of J(t) using the recovered parameters. Across the range of derivative orders, the predicted and the ground truth responses are in good agreement at shorter times, e.g., \(t/\tau < 10\). Despite the high RAE in recovering \(\alpha \) and \(\beta \) at smaller values of \(\alpha \) (the upper left cases in Fig. 6), the J(t) response nonetheless commits a much smaller error, indicating that the combination of \(\alpha and \beta \) determines the accuracy of J(t) and not their individual values. However, disparity between the predicted and ground truth J(t) responses arises at longer times due to the relatively substantial error in recovering \(\beta \). This can also be alleviated by increasing the training time window, which was set to be within 0.001 and 5 s throughout this study.
The creep compliance, J(t), response of the fractional Kelvin-Voigt model using the recovered (solid) and exact (dashed) fractional derivative orders. Other parameters, \(E= 1\)Pa and \(\tau = 20\)s, are assumed known. The recovered derivative orders are inserted into Eq. 7 over a larger time window, i.e., \(1 \times 10^{-2}\le t/\tau \le 1 \times 10^{4}\) to assess the out-of-sample response of J(t) using the recovered parameters
Fractional Zener model
Finally, RhINN is used in this case to recover the fractional derivative orders (\(\alpha \), \(\beta \), and \(\gamma \)) of the three Scott-Blair elements in the fractional Zener model (FZM) under a stress relaxation test using a unit step-strain function for a wide range of derivative order values with constraints \(0 \le \beta < \alpha \le 1\) and \(0 \le \gamma \le 1\). Other model parameters (\(E_0=E= 1\)Pa and \(\tau = 20\)s) remained constant, as before. The NN hyperparameters are also similar to those used in “Fractional Maxwell model” and “Fractional Kelvin-Voigt model.” Figure 8 summarizes the total relative absolute error in recovering the three fractional derivative orders. The RAE values are generally within a 30% band for the derivative orders, with the exception of a few scenarios where the relative absolute error hits 100%. In Fig. 8 b and c, these higher-error cases occur when \(\gamma \) is equal to one of the other two derivative orders (\(\beta \) and \(\alpha \), respectively). On the other hand, it should be noted that some recovered fractional derivative orders match their respective exact values. Thus, the true test of the RhINN order recovery can be done by comparing the relaxation moduli using the recovered parameters and benchmarking against the ground truth solution.
Total relative absolute error (RAE) in the recovered fractional derivative orders (\(\alpha \), \(\beta \), and \(\gamma \)) with respect to their exact values for the fractional Zener model under a stress relaxation test using a unit step-strain function for various combinations of derivative orders with constraints \(0 \le \beta < \alpha \le 1\) and \(0 \le \gamma \le 1\). The numbers in each tile, from top to bottom, represent the recovered \(\alpha \), \(\beta \), and \(\gamma \) values, respectively
In Fig. 9, the recovered derivative orders are inserted into Eq. 9 over a larger time window, i.e., \(1 \times 10^{-3}\le t/\tau \le 1 \times 10^{4}\) to assess the out-of-sample response of the material response using the recovered derivative orders. The relaxation modulus, G(t), using both the recovered and exact fractional derivative orders, is shown in Fig. 9 for the cases in which the highest values of RAE were observed. Despite the significant relative absolute inaccuracy in the recovered fractional derivative orders, the predicted relaxation moduli closely track the exact values for all scenarios at shorter times \(t/\tau <1\). At longer times (unseen by the RhINN), however, a significant discrepancy between the predicted and actual responses is observed for these cases. Once again, this can be explained by recalling the fact that the training time window (0.001 and 5s) was not large enough to encompass all the intricacies of the material response at longer times. The selection of the time window was also constrained by the computational expenses of fractional derivatives. We remind the reader that in this particular algorithm, time has to strictly be spaced linearly, and since small time intervals are required at short times, this prevents one from a comprehensive and long time training process. Therefore, employing more versatile and efficient fractional derivative algorithms compatible with nonlinear step sizes should be at the core of future efforts on this topic. It should be stressed, though, that out-of-sample prediction using RhINNs (or any other numerical toolkit) is accurate only up to a threshold, and the numerical solver usually cannot stray too far from the data that were given during the training stage.
The relaxation modulus, G(t), of the fractional Zener model using the recovered (solid) and exact (dashed) fractional derivative orders. Other parameters (\(E_0=E= 1\)Pa and \(\tau = 20\)s) are assumed known and constant. The recovered derivative orders are inserted into Eq. 9 over a larger time window, i.e., \(1 \times 10^{-3}\le t/\tau \le 1 \times 10{4}\) to monitor the out-of-sample response of G(t) using the recovered parameters
Conclusion
In this work, RhINNs were developed and employed to recover the fractional derivative orders of the elements in fractional viscoelastic Maxwell, Kelvin-Voigt, and Zener models based on single sets of (synthetically generated) data. Moreover, the derivative orders were used to generate the stress relaxation response of the fractional Maxwell and Zener models upon the imposition of a unit step-strain and the creep compliance response of the fractional Kelvin-Voigt model under a creep test using a unit step-stress function. A systematic study on the recovery of the derivative orders over their permitted parameter space (between zero and unity) was performed. With various combinations of the fractional derivative orders (and their respective constraints), multiple scenarios were examined in each of the three fractional models. Overall, in most situations, RhINNs are found to retrieve the fractional derivative orders of all the components in each of the models in the same range as the ground truth values used in the generation of the data. In other instances, the total relative absolute error (RAE) for some of the derivative orders is found to be substantial. However, the relaxation modulus (or creep compliance) using the recovered derivative orders closely mimics the input experiments with minimal deviations. This is mainly because the solution of these models to an applied flow protocol is not unique within the time window under question, meaning that several combinations of the derivative orders can replicate the same overall behavior. This is more noticeable when more complex models (with more spring-pot elements) are considered. As such, for the derivative order recovery of the fractional Zener model, the recovered parameters do not necessarily follow the ones used to generate the data; nonetheless, the retrieved order values do indeed result in an accurate recovery of the behavior. Also, in this work, the neural network was trained over a short time window in terms of the data provided, and in some cases, those time windows are likely not long enough for all model components to respond appropriately to an applied stimulus. Moreover, due to the hereditary nature of fractional derivatives and the need to sum all the past information to get the fractional derivative at each point, the RhINN convergence was found to be time-consuming (typically within tens of minutes, or an hour), which leaves room for further improvement in the computation time and efficiency of fractional problems using RhINNs. While this work presents a robust platform for the inverse data-driven solution of fractional constitutive models, further investigation is warranted on recovering all model parameters (including elastic moduli and relaxation times) and other flow protocols. Also, training the RhINNs over a longer time window could improve the efficiency of fractional derivative recovery.
Notes
The adjective “constitutive” is reluctantly attributed to the Maxwell model because this model, in its basic form, is not of tensor order, with objectivity issues.
In this work, the symbol \(\epsilon (t)\) is used for the shear deformation, as the more frequent candidate, \(\gamma \), is reserved for a fractional derivative order; see “Fractional Zener model”.
This is despite the fact that a unanimous definition of firmness (and other physiological features of matter such as springiness and rubberiness) is yet to be found and agreed upon (Wagner et al. 2017).
The concepts behind RhINNs and fractional derivatives are introduced using the fractional Maxwell model, and therefore, the material response is shown with G(t). The relevant symbol for the FKVM case would be J(t).
Due to the successive integrodifferential nature in the RL sense, this method is capable of approximating fractional integrals, too; consult the end discussion in (Mainardi 2010).
References
Alotta G, Barrera O, Cocks A et al (2018) The finite element implementation of 3D fractional viscoelastic constitutive models. Finite Elem Anal Des 146:28–41. https://doi.org/10.1016/j.finel.2018.04.003
Armstrong MJ, Beris AN, Wagner NJ (2017) An adaptive parallel tempering method for the dynamic data-driven parameter estimation of nonlinear models. AIChE Journal 63:1937–1958. https://doi.org/10.1002/aic.15577
Asrav T, Aydin E (2023) Physics-informed recurrent neural networks and hyper-parameter optimization for dynamic process systems. Comput Chem Eng 173(108):195. https://doi.org/10.1016/j.compchemeng.2023.108195
Bagley R (2007) On the equivalence of the Riemann-Liouville and the Caputo fractional order derivatives in modeling of linear viscoelastic materials. Fractional Calculus and Applied Analysis 10:123–126
Baleanu D (2016). A survey of numerical methods for the solution of ordinary and partial fractional differential equations. https://doi.org/10.1142/9789813140042_0002
Baleanu D, Diethelm K, Trujillo J et al (2016) Fractional calculus: models and numerical methods. World Scientific
Bantawa M, Keshavarz B, Geri M et al (2022) The hidden hierarchical nature of soft particulate gels. ArXiv Preprint
Bingham EC (1916) An investigation of the laws of plastic flow. Bulletin of the Bureau of Standards 13:309–353
Bird RB, Armstrong RC, Hassager O (1987) Dynamics of polymeric liquids, Volume 1: Fluid Mechanics. Wiley-Interscience
Bonfanti A, Kaplan JL, Charras G et al (2020) Fractional viscoelastic models for power-law materials. SoftMatter 16:6002–6020. https://doi.org/10.1039/D0SM00354A
Cai S, Mao Z, Wang Z et al (2022) Physics-informed neural networks (PINNs) for fluid mechanics: a review. Acta Mech Sinica 1727–1738. https://doi.org/10.1007/s10409-021-01148-1
Chen Y, Lu L, Karniadakis GE et al (2020) Physics-informed neural networks for inverse problems in nano-optics and metamaterials. Opt Express 28(11):618. https://doi.org/10.1364/OE.384875
Diethelm K, Ford N, Freed A et al (2005) Algorithms for the fractional calculus: a selection of numerical methods. Comput Methods Appl Mech Eng 194:743–773. https://doi.org/10.1016/j.cma.2004.06.006
Faber T, Jaishankar A, McKinley G (2017) Describing the firmness, springiness and rubberiness of food gels using fractional calculus. Part II: Measurements on semi-hard cheese. Food Hydrocoll 62:325–339. https://doi.org/10.1016/j.foodhyd.2016.06.038
Fang C, Shen X, He K et al (2020) Application of fractional calculus methods to viscoelastic behaviours of solid propellants. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 378(20190):291. https://doi.org/10.1098/rsta.2019.0291
Feng YY, Yang XJ, Liu JG (2020) On overall behavior of Maxwell mechanical model by the combined Caputo fractional derivative. Chin J Phys 66:269–276. https://doi.org/10.1016/j.cjph.2020.05.006
Freund JB, Ewoldt RH (2015) Quantitative rheological model selection: good fits versus credible models using Bayesian inference. J Rheol 59:667–701. https://doi.org/10.1122/1.4915299
Garrappa R, Kaslik E, Popolizio M (2019) Evaluation of fractional integrals and derivatives of elementary functions: overview and tutorial. Mathematics 7:407. https://doi.org/10.3390/math7050407
Ishitsuka K, Lin W (2023) Physics-informed neural network for inverse modeling of natural-state geothermal systems. Appl Energy 337(120):855. https://doi.org/10.1016/j.apenergy.2023.120855
Jagtap AD, Mao Z, Adams N et al (2022) Physics-informed neural networks for inverse problems in supersonic flows. ArXiv Preprint. https://doi.org/10.1016/j.jcp.2022.111402
Jaishankar A, McKinley GH (2013) Power-law rheology in the bulk and at the interface: quasi-properties and fractional constitutive equations. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 469(20120):284. https://doi.org/10.1098/rspa.2012.0284
Jaishankar A, McKinley GH (2014) A fractional K-BKZ constitutive formulation for describing the nonlinear rheology of multiscale complex fluids. J Rheol 58:1751–1788. https://doi.org/10.1122/1.4892114
Jiang Y, Zhang B (2020) Comparative study of Riemann-Liouville and Caputo derivative definitions in time-domain analysis of fractional-order capacitor. IEEE Transactions on Circuits and Systems II: Express Briefs 67:2184–2188. https://doi.org/10.1109/TCSII.2019.2952693
Karnakov P, Litvinov S, Koumoutsakos P (2022) Optimizing a discrete loss (ODIL) to solve forward and inverse problems for partial differential equations using machine learning tools. ArXiv Preprint
Karniadakis GE, Kevrekidis IG, Lu L et al (2021) Physics-informed machine learning. Nature Reviews. Physics 3:422–440. https://doi.org/10.1038/s42254-021-00314-5
Lawal ZK, Yassin H, Lai DTC et al (2022) Physics-informed neural network (PINN) evolution and beyond: a systematic literature review and bibliometric analysis. Big Data and Cognitive Computing 6:140. https://doi.org/10.3390/bdcc6040140
Li C, Qian D, Chen Y (2011) On Riemann-Liouville and Caputo derivatives. Discret Dyn Nat Soc 2011:1–15. https://doi.org/10.1155/2011/562494
Li Z, Meidani K, Farimani AB (2022) Transformer for partial differential equations’ operator learning. ArXiv Preprint
Lin YC, Chung YC, Wu CY (2007) Mixing enhancement of the passive microfluidic mixer with J-shaped baffles in the tee channel. Biomed Microdevices 9:215–221. https://doi.org/10.1007/s10544-006-9023-5
Mahmoudabadbozchelou M, Jamali S (2021) Rheology-informed neural networks (RhINNs) for forward and inverse metamodelling of complex fluids. Scientific Reports 11(12):015. https://doi.org/10.1038/s41598-021-91518-3
Mahmoudabadbozchelou M, Caggioni M, Shahsavari S et al (2021) Data-driven physics-informed constitutive metamodeling of complex fluids: a multifidelity neural network (MFNN) framework. J Rheol 65:179–198. https://doi.org/10.1122/8.0000138
Mahmoudabadbozchelou M, Kamani KM, Rogers SA et al (2022) Digital rheometer twins: learning the hidden rheology of complex fluids through rheology-informed graph neural networks. Proc Natl Acad Sci 119(e2202234):119. https://doi.org/10.1073/pnas.2202234119
Mahmoudabadbozchelou M, Karniadakis GE, Jamali S (2022) NN-PINNs: nonNewtonian physics-informed neural networks for complex fluid modeling. Soft Matter 18:172–185. https://doi.org/10.1039/D1SM01298C
Mainardi F (2010). Fractional viscoelastic models. https://doi.org/10.1142/9781848163300_0003
Morrison FA (2001) Understanding rheology, vol 1. Oxford University Press, New York
Raissi M (2018) Deep hidden physics models: deep learning of nonlinear partial differential equations. J Mach Learn Res 19:932–955
Raissi M, Perdikaris P, Karniadakis G (2019) Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys 378:686–707. https://doi.org/10.1016/j.jcp.2018.10.045
Raissi M, Yazdani A, Karniadakis GE (2020) Hidden fluid mechanics: learning velocity and pressure fields from flow visualizations. Science 367:1026–1030. https://doi.org/10.1126/science.aaw4741
Reyes B, Howard AA, Perdikaris P et al (2021) Learning unknown physics of non-Newtonian fluids. Physical Review Fluids 6(073):301. https://doi.org/10.1103/PhysRevFluids.6.073301
Saadat M, Mahmoudabadbozchelou M, Jamali S (2022) Data-driven selection of constitutive models via rheology-informed neural networks (RhINNs). Rheol Acta 721–732. https://doi.org/10.1007/s00397-022-01357-w
Schiessel H, Blumen A (1993) Hierarchical analogues to fractional relaxation equations. Journal of Physics A: Mathematical and General 26:5057–5069. https://doi.org/10.1088/0305-4470/26/19/034
Schiessel H, Metzler R, Blumen A et al (1995) Generalized viscoelastic models: their fractional equations with solutions. J Phys A Math Gen 28:6567–6584. https://doi.org/10.1088/0305-4470/28/23/012
Scott-Blair GW (1947) The role of psychophysics in rheology. J Colloid Sci 2:21–32. https://doi.org/10.1016/0095-8522(47)90007-X
Scott-Blair GW, Coppen FMV (1942) The subjective conception of the firmness of soft materials. The American Journal of Psychology 55:215. https://doi.org/10.2307/1417080
Shen LJ (2020) Fractional derivative models for viscoelastic materials at finite deformations. Int J Solids Struct 190:226–237. https://doi.org/10.1016/j.ijsolstr.2019.10.025
Singh PK, Soulages JM, Ewoldt RH (2019) On fitting data for parameter estimates: residual weighting and data representation. Rheol Acta 58:341–359. https://doi.org/10.1007/s00397-019-01135-1
Stankiewicz A (2018) Fractional Maxwell model of viscoelastic biological materials. BIO Web of Conferences 10(02):032. https://doi.org/10.1051/bioconf/20181002032
Tang S, Feng X, Wu W et al (2023) Physics-informed neural networks combined with polynomial interpolation to solve nonlinear partial differential equations. Computers & Mathematics with Applications 132:48–62. https://doi.org/10.1016/j.camwa.2022.12.008
Thakur S, Raissi M, Ardekani AM (2022) ViscoelasticNet: a physics informed neural network framework for stress discovery and model selection. ArXiv Preprint
Tripathi D, Pandey S, Das S (2010) Peristaltic flow of viscoelastic fluid with fractional Maxwell model through a channel. Appl Math Comput 215:3645–3654. https://doi.org/10.1016/j.amc.2009.11.002
Tschoegl NW (2012) The phenomenological theory of linear viscoelastic behavior: an introduction. Springer Science & Business Media
Wagner CE, Barbati AC, Engmann J et al (2017) Quantifying the consistency and rheology of liquid foods using fractional calculus. Food Hydrocoll 69:242–254. https://doi.org/10.1016/j.foodhyd.2017.01.036
Acknowledgements
The authors are grateful for insightful discussions with Dr. Gareth McKinley and also acknowledge the support from the National Science Foundation’s DMREF program through Award #2118962.
Funding
Open access funding provided by Northeastern University Library.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Donya Dabiri, Milad Saadat, and Deepak Mangal contributed equally to this work.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Dabiri, D., Saadat, M., Mangal, D. et al. Fractional rheology-informed neural networks for data-driven identification of viscoelastic constitutive models. Rheol Acta 62, 557–568 (2023). https://doi.org/10.1007/s00397-023-01408-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00397-023-01408-w