Introduction

Metformin (1,1-dimethylbiguanide), average molecular weight 129.164 g/mol, is a \(+ 1\) cation at physiological pH with apparent volumes of distribution (dogs) [1] so large that it may be problematic to use classical pharmacokinetic methods to calculate those volumes. Metformin is used among numerous indications as the default first-line treatment for type II diabetes and for cancer chemotherapy in humans. There is a developing interest in treating feline and canine cancer with metformin, which in dogs is more typically used as an anti-hyperglycaemic agent to treat obesity or insulin resistance [1, 2]. In humans, the postulated cancer cell susceptibility to metformin is from defective mitochondrial oxidative phosphorylation and low glucose levels [3]. Metformin is non-metabolised and exclusively plasma-phase renal cleared at an estimated 90–100% of total renal plasma flow in humans [4]. Seen in association with impaired renal function is dose related lactic acidosis, a rare but serious condition [5,6,7] suggesting a maximum safe dosing. Also, antineoplastic chemotherapy with metformin potentiates temsirolimus side effects [8], such that secondary metformin effects during therapy are also concerning. For example, extra-mitochondrial secondary metformin effects are suppression of glycogenolysis, and enhanced cellular glucose uptake relevant to metformin’s pharmacodynamics in type II diabetic therapy [9].

During oral dosing, washed jejunal tissue biopsies had about 30–\(300\,\times \) higher metformin concentrations than that in plasma (human) [10]. Metformin accumulates in the cytosol of erythrocytes, small intestine, skeletal and cardiac muscle cells, hepatocytes, and brain [4, 11,12,13]. Metformin’s primary effect is to suppress gluconeogenesis in mitochondria [14]. Metformin mitochondrial concentration from cultured rat hepatoma cells was \(1000\,\times \) that in plasma at 60 h [15]. Metformin’s extra-mitochondrial drug effects appear to occur in other intracellular locations. For example, control of gluconeogenesis was time-delayed using a same-dose regimen [16], and unrelated to blood concentrations [17]. As mitochondria have a very high affinity for metformin, and most eukaryotic cells (except mature erythrocytes) contain many mitochondria, which occupy up to 25 percent of the cytoplasmic volume [18], and are the sites of metformin tumoricidal drug action, with elapsing time one would expect a predominance of body drug mass to be in close proximity to the mitochondrial effector sites.

Intracellular localised metformin effects and plasma drug concentration are kinetically distinct as supported by the observations in rats, dogs and humans that half-lives of drug mass in urine or in erythrocytes are approximately 4.2 to 11.1 times longer than in plasma [4, 13, 19,20,21,22]. The difference of half-lives relates to persistent redistribution. To cause persistent redistribution, it would not be enough to add a solitary extra compartment to our model, but with elapsing time even more compartments would be needed. Variable-volume, variable half-life pharmacokinetic modelling is a relatively recent development that allows volume of distribution to be added to a model with elapsing time [23]. The additional concept of a half-life that is not static may seem novel, but was shown to arise in nonequilibrium states (i.e., when the mass half-life is longer than the plasma half-life), and occurs transiently before a compartmental model achieves dynamic equilibrium (i.e., when mass half-life and plasma half-life are equal). Thus, rather than just accept that metformin plasma concentration half-lives are poorly characterised [11], we note that Xie et al. [4] as well as Sambol et al. [24], the latter for oral multidosing, have commented that metformin half-life increased with elapsing time. Indeed, bolus intravenous metformin plasma half-life was 1.5 h as last sampled at 8 h [20], 4.5 h as last sampled at 12 h [22], and 20.4 h, as last sampled at 72 h (dogs) [1], such that there is good evidence that metformin plasma half-lives increase with elapsing time.

To duplicate metformin half-lives of drug mass in urine and erythrocytes that are persistently multiple times longer than in plasma, classical drug models like the sum of exponential terms (SET) and noncompartmental (NC) exponential models are perhaps not the best choices as only a terminal drug mass to plasma concentration half-life ratio of one is possible for those models. SET functions yield transient nonequilibrium models whose time to dynamic equilibrium can be prolonged 7.5 times but not eliminated using gamma distributions with greater than exponential log-convexity enforced using adaptive Tikhonov regularization [23, 25, 26]. It is not surprising then that for many bolus drug experiments, a late-time exponential tail underestimates the actual amount of drug remaining in the body [5, 27]. Many intravenously administered drugs has been observed to follow the very heavy-tailed, power-law at late times [28,29,30,31,32,33,34]. The statistical form for a power law is the Pareto distribution (PD), which like the Cauchy distribution, has tails so heavy they confer unusual statistical properties and have been given the name fat-tailed distributions. Power laws are scale independent and intrinsically fractal. Fractal drug models are consistent with carrier transport [35], and metformin is principally hOCT transported [7]. However, a power-law model does not accurately model the first few hours of concentration [36, 37]. The incorporation of a power function tail into a semi-infinite support model, \(0\le t<\infty \), was explored with fractional calculus [38], but without consideration of mass conservation and nonequilibrium dynamics. Explored is whether the inclusion of power function tail into a temporally more inclusive model of serum or plasma concentration will yield the permanent nonequilibrium that explains and duplicates the observed persistent mass to concentration ratio of half-lives and whether that forms an accurate enough concentration model to suggest clinical usefulness.

Convolution of a washout model (a monotonic decreasing function) with a gamma distribution (GD) has been used before for correcting the contribution to total residence time of early concentrations [37]. In the current paper, a gamma-Pareto type I convolution (GPC) concentration density function of time was used to model the pharmacokinetics of single bolus intravenous metformin injections is each of seven dogs, most having 22 time-samples collected over 20 min to 72 h. In a GPC model, the PD models the terminal tail of the drug’s plasma concentration after the lighter-tailed gamma density (GD) has decayed. Hyperglycaemia therapy with metformin multidosing has shown a progressively increasing effect of decreasing fasting plasma glucose for at least the 8 weeks of testing [39]. Thus, the longer metformin drug mass in tissue may be better linked to drug effect than serum concentration [17], and the modelling introduced should generate accurate drug mass curves from serum data alone, and to be useful should allow for a multidosing regimen that rapidly establishes a constant drug mass in the body, as opposed to dosing for many weeks for that to occur. This may be relevant for cancer chemotherapy as the alternatives are either to wait for months for tissue build up of metformin, which may not provide optimal therapy for rapidly growing tumours, or to give large dosages without knowing when unnecessarily high metformin tissue concentration will occur.

The residual drug masses imply tissue retentions and excretions that were contrasted with published urinary excretion and erythrocyte retention results. The modifications of pharmacokinetic parameters resulting from the application of the new models are presented and discussed for bolus intravenous metformin disposition curves and body drug mass as functions of time and their half-life functions together with some distinct pharmacodynamic implications arising from them. As some of the results are compared to exponential tailed model results, some statistical characterisation and comparative data analysis of exponential based models is also provided.

Background

To obtain ratios of half-lives of mass and concentration that do not tend to unity from persistently non-equilibrated models required defining half-life as a function of time. Said ratios naturally arise in the context of variable apparent volume of distribution models, \(V_d(t)\). The first mathematically correct \(V_d(t)\) models were proposed by Niazi for SET functions [40]. Those models were simultaneously compartmental models and variable volume models, but the interpretations of those different models are distinct. Niazi’s work was generalised by some of us for any density function supported on the time is \([0,\,\infty )\) interval having a varying apparent volume of drug distribution in time, \(V_d(t)\), with half-life expressions that vary in time [23]. For variable volume modelling \(V_d(t)\) is a drug-o-centric volume of distribution in time. For metformin, the drug is either in the body, or has been eliminated to the urine. We do not know where the M(t) drug mass is in the body. We only know that mass is concentration times volume. That apparent volume of distribution is the imaginary volume that the drug would occupy if it were everywhere at the same concentration as it was observed to be in plasma at that time, and CL eliminates drug mass from that single volume no matter how big that volume is.

The general basic equations of variable volume modelling with variable half-life and constant clearance are listed as Eqs. (1)–(10). Note well that these equations are for bolus conditions; dose delivered rapidly under intravenous conditions, and do not include dose that remains in any syringe, tubing or vial, nor any depo caused by inadvertent subcutaneous extravasation, i.e., a common, but largely unrecognised problem [41]. Drug mass, M(t), means drug mass in the body, not drug mass excreted. Equations (1, 2) require clearance \( CL \) to be proportional to plasma/serum concentration and constant in time. Strictly speaking, the observed concentrations, \(C_{obs}(\mathbf {t})\), for these two equations should be measured where elimination is occurring, which for metformin is where the renal artery contents are cleared in the kidney, and the C(t) model fit to those observations. Note that Eq. (1) reduces to \(D= CL \; AUC \) for \(t=0\), where \(D=M(t=0)\) is the dose, \( CL \) is plasma/serum clearance and \( AUC \) is the area under the curve of C(t) from \(t=0\) to \(\infty \). Equation (2) is a mass ratio; the quotient of Eq. (1) evaluated at t divided by its evaluation at \(t=0\), is the complementary cumulative density function and defines a density function’s right-hand tail area (pdf often have two-tails). As a matter of convenience, the complementary cumulative density function is referred to as a survival function \(S(t)\mathop {=}\limits ^{\textsf {def}}1-\text {CDF}(t)\), where CDF(t) is the cumulative density function and \(\text {CDF}(t)\mathop {=}\limits ^{\textsf {def}}\int _0^{t } \text {pdf}(u ) d u\). This is common practice despite the confusion it causes. That is, a survival function in the strictest sense is discrete, which is not how the term is being used here. Although the name survival arose from actuarial usage as the fraction of an initial population surviving in time, here it is the fraction surviving in the body of a unit dose mass at time t. Sometimes, survival functions are also used to calculate “Relative tail heaviness” as per the Appendix Section. Survival functions and CDF are dimensionless (units cancel).

A list of equations for bolus-dose, constant-clearance, intravenous conditions follows:

Mass as an f(t),

$$M(t) = { CL} \int _t^{\infty } C(u ) \, du.$$
(1)

Survival function definition,

$$S(t)\mathop {=}\limits ^{\textsf {def}}\int _t^{\infty } \text {pdf}(u) \, d u =\frac{M(t)}{M(0)} = \frac{\int _t^{\infty } C(u ) \, d u}{\int _0^{\infty }C(u ) \, d u}.$$
(2)

Conservation of mass,

$$M(t)={{V}_{d}}(t)\,C(t).$$
(3)

Concentration parsing,

$$-\frac{C\,'(t)}{C(t)}=-\frac{M\,'(t)}{M(t)} +\frac{V'_d(t)}{V_d(t)}.$$
(4)

Two time-sample exponential half-life,

$$\xi =-\ln (2)\dfrac{t_{i+1}-t_{i}}{\ln (C_{i+1})-\ln (C_i)}.$$
(5)

Instantaneous half-life,

$$t_{1/2};f(t) \mathop {=}\limits ^{\textsf {def}}-\ln (2)\dfrac{f(t)}{f'(t)}.$$
(6)

Half-life of drug mass,

$${t}_{1/2};M(t)=\frac{\ln ( 2)}{ CL }{{V}_{d}}(t).$$
(7)

Half-life of concentration,

$${t}_{1/2};C(t)=\frac{\ln ( 2)}{ CL +V'_d\left( t \right) }\,V_{d}( t).$$
(8)

Ratio of M(t) to C(t) half-lives,

$$\frac{t_{1/2};M(t)}{t_{1/2};C(t)} =\frac{ CL +V'_d( t)}{ CL }.$$
(9)

Harmonic sum of half-lives,

$$\frac{1}{t_{1/2};C(t)} =\frac{1}{t_{1/2};M(t)}+\frac{1}{-t_{1/2};V_d(t)}.$$
(10)

Equations (3)–(10) are variable apparent volume of distribution equations and/or variable half-life equations. Of these equations, Eq. (3) and the derivative of its logarithm, parsing of concentration Eq. (4), are the direct results of conservation of mass. For these equations, \(V_d(t)\) at \(t=0\) is either undefined for a short time when C(t) models a peripheral venous sample or a C(t) is used that models more proximal concentrations at the injection site itself. Equation (4) relates that the negative relative change in concentration (i.e., a positive number for decreasing concentration) is the negative relative change of mass (also typically positive) plus the relative drug volume of distribution apparent growth, which latter dilutes/rarefies the relative concentration. For a constant volume of distribution, \(V_{d{\text {-}} RT } = CL \; RT \) can be used, where RT is whichever type of residence time is appropriate to the model under consideration—see the “Residence Time” Appendix Subsection.

Equation (5) is common knowledge for any two time-samples, \(\{t_i,C_i\}\) and \(\{t_{i+1},C_{i+1}\}\). Use of this equation for non-compartmental conditions when \(t_{i+1}-t_{i}\) is small is done with median values from all of the dogs’ samples to obtain tractable results—see the “Noncompartmental time-sample group, median half-life” Appendix Subsection for details. However, Eq. (5) only applies for monoexponential conditions or in the limit if we substitute \(f(t)\approx C_i\), and \(t_{i+i}=t_{i}+\Delta t\) and let \(\Delta t\rightarrow 0^+\), where f(t) is any continuously differentiable function. That yields Eq. (6), which is of classical mechanical type, and applies equally well to the evolution of half-life of satellite orbital decay [42] as to variable half-life of concentration of any drug. Half-lives can be both positive and negative. For an f(t) that increases in time, the \(t_{1/2}\) is negative. For example, this occurs when concentration is increasing in time in a peripheral vein shortly after intravenous injection. It easier to understand a negative half-life of an increasing function of time when some maximum total value of a measurement is being approached in time by subtracting the measured values from that maximum and viewing the result as a positive half-life of a decreasing function in time, as is done, for example, using total dosage administered for a urinary drug recovery \(t_{1/2}\) calculation. However, growth of any function in time, including drug mass accumulating in urine, actually has a negative half-life.

For the half-life of drug mass from the Eq. (6) definition, we substitute Eq. (3) for f(t), and the derivative of Eq. (1) for \(f'(t)\), which yields Eq. (7). To find the plasma/serum clearance half-life, Eq. (8), we eliminate the mass terms in concentration parsing Eq. (4) using \(M(t)=V_{d}(t)\,C(t)\) and \(M'( t )=- CL \,C(t)\), then rearrange terms to put them into the form of Eq. (6). \(V'_{d}(t)\) is concentration dilution having the same units as \( CL \), e.g., ml min\(^{-1} \)kg\(^{-1}\). Next, we form the ratio of the mass and concentration half-lives above to yield Eq. (9). We hypothesise that this can have different concentration parsing limits, \(V'_d\left( \infty \right) =0 \text { or}>0\), [23] and Eq. (33) below, respectively. In the first case, \(V'_d( t )\) eventually shrinks to zero, volume dilution of concentration stops, and dynamic equilibrium is achieved. Thereafter, mass half-life is the same as the half-life in plasma/serum. In the second case, \(V'_d(\infty )\) is a positive constant and \({t}_{1/2};M(t)\) remains longer, e.g., as witnessed as \(-t_{1/2}\) of metformin mass in urine, than \(t_{1/2};C(t)\) in plasma/serum. The half-life of \(V_d(t)\) adds harmonically, i.e., reciprocally, with mass and concentration half-lives, as follows. Substituting the mass and concentration half-lives, Eqs. (8, 9), into concentration parsing Eq. (4) yields Eq. (10).

Constant infusion

Let us assume that elimination rate is proportional to concentration, and that superposition is valid. Using as impulse-response the infinitesimal of an intravenous bolus C(t) model and as constant input of that model, \(\theta (t)\), the unit step function in the role of its transfer or control function, their convolution is a constant infusion concentration model. Then as \(\theta (x)*\text {pdf}(x)\,(t)=\text {CDF}(t)\), where CDF is the cumulative density function,

$$\begin{aligned} {{C_{in}(t)}}&=C( x )*\theta ( x )\left( t \right) \nonumber \\&=\int _{0}^{t}{C( x ) dx}=\int _{0}^{\infty }{C( x )dx} \int _{0}^{t}{\text {pdf} \left( x \right) dx} \nonumber \\&=\textit{AUC}_{\mathrm{bolus}}\text {CDF}\left( t \right) \equiv {{ C_{SS} }}\text {CDF}\left( t \right) =\dfrac{D_R}{ CL }\text {CDF}(t), \end{aligned}$$
(11)

where the ’in’ of \(C_{in}(t)\) is for ’infusion’, and where \({C}_{SS}={D_R}/{ CL }\) is the steady state concentration [23]. \(D_R\), an acronym of dosing rate, was used and elsewhere \(\hbox {R}_0\) or \(k_0\) are often used. A \(\hbox {CDF}(t)\) has a maximum amplitude of 1 terminally; the occurrence time of a steady state concentration, \(C_{SS}\). Thus, to scale \(\hbox {CDF}(t)\), we multiply it by \(C_{SS}\) the magnitude of which is \(\int _0^\infty C(x)dx\) from the above convolution, such that \(C_{SS}\equiv \textit{AUC}_{\mathrm{bolus}}\). Note that there is no bolus physically, just a result that can predict the magnitude of \(C_{in}(t)\) in terms of a previously observed bolus function.

Methods

This section includes new material such as constant average drug mass multidosing and the explanatory statistical treatment of models as scaled density functions.

Data collection

The metformin serum concentration data was published elsewhere [1] was reused to reexamine basic modelling assumptions. In that study, seven healthy adult mixed-breed dogs were used including 2 sexually intact males, 2 neutered males, 1 sexually intact female, and 2 spayed females. The dogs were between 2 and 3 years old and weighed between 25.7 and 29.2 kg. The animal study was approved by the University of Saskatchewan’s Animal Research Ethics Board, and adhered to the Canadian Council on Animal Care guidelines for humane animal use. Following single intravenous bolus doses of metformin in each of seven dogs, (18.2 mg/kg in dog 1 and 19.5 mg/kg in the other six dogs), 19 to 22 serum samples drawn between 20 min and 72 h were used for modelling—see the Appendix “Concentration data and fitting it”, as well as the .xlsx file data worksheet in Supplementary Materials 1 for further information.

Fitting metformin concentrations

Area under the curve (AUC) inherits its units from those of the curve being used to calculate that area. For example, density functions have a total \(AUC _{\mathrm{pdf}}\) of 1, i.e., \(\int _{0}^\infty \text {pdf}(t)\,dt\mathop {=}\limits ^{\textsf {def}}1\), with no units (dimensionless) whereas for concentration \(AUC _{C(t)}\) has units like \(\frac{\text {mg}\cdot \text {h}}{\text {L}}\). The acronym pdf was used for density functions even though the p for probability was irrelevant. That is, concentration is not a probability, and a fraction of total AUC per unit time at time t density function, i.e., \(\hbox {pdf}(t) =C(t)/\textit{AUC}_{C(t)}\) (e.g., units \(\hbox {h}^{-1}\)), obeys similar rules to those that would apply to any other density per x-axis unit function (pdf) in whatever context the x-axis units and (xy) coordinated values provide. In our case, fitting data with \(C(t)=\kappa \text { pdf }(t)\) yields a concentration curve model, C(t), where \( AUC_{C(t)} =\kappa \).

Peripheral venous drug concentration is observed, \(C_{obs}(\mathbf {t})\), to be smoothed and delayed by its passage through the circulatory system causing zero initial concentration, \(C_{obs}(0)=0\), to be observed peripherally. In this paper, a virtual washout signal with a \(t=0\) start time, i.e., a gamma distribution, is convolved with a Pareto distribution having a 25—30 s start time to yield a peripheral C(t) model. Ideally, one should minimise bolus error; to not leave drug in the needle track within the dermis by using a large flush bolus injectate to wash the needle and to deliver a high-concentration, low drug volume quickly enough to improve and standardise circulatory mixing [41].

To select disposition curve models one needs to be mindful of which statistical distributions have the best chance of being useful. These selection criteria include inspection of each distribution’s parameter types. A distribution’s parameter types influence the distribution’s derivative fidelity for following the curve shape, and influences the well-posedness of the AUC determination, i.e., whether a unique (global) solution that is not sensitive to noise, called a stable solution exists. Stable solutions were obtained in high precision using the Nelder-Mead global optimisation method [43], The practical choice for the modeller is to either find a robust concentration model whose fit to the curve is accurate or to relax the goodness-of-fit criteria, for example by treating AUC as an ill-posed integral of the first kind with use of an inverse method for fitting [25]. Distributions generally take one or more of three types of parameters, those of shape, location and scale. For example, the exponential distribution has a scale parameter, but no shape or location parameters. A location parameter indicates where a distribution is positioned on the x-axis. Depending on the distribution, the location may be a best measurement of centrality, but more generally just indicates an x-axis position. Compared to a distribution without a shape parameter, if we choose a distribution with one or more shape parameters, not only will our goodness of fit generally improve, but the curve shape will be better fitted. Some functions, like the gamma distribution (GD), which has a shape and a scale parameter, may still have an ill-posed AUC, such that fitting with Tikhonov regularization rather than ordinary regression may be needed to find a Tikhonov well-posed clearance at the price of a small penalty in goodness-of-fit [25].

To fit concentrations we used proportional error minimisation, i.e., \(1/C_{obs}^{\;2}(\mathbf {t})\) weighted ordinary least squares fitting of models to data [37, 44,45,46,47]—see the “Concentration data and fitting it” Subsection of the Appendix for details on this and on proportional assay error, how to quantify root mean square proportional error (rrms), and noncompartmental methods. The GPC function was the experimental model fit to data. It is calculated from hundreds of sums using 65 decimal place arithmetic to mitigate roundoff error. The GPC algorithm with Nelder-Mead regression was designed for precision and accuracy to converge to 30 decimal places. The GPC algorithm used, Eq. (25) below, purpose written in Mathematica 12.0.0.0 code, was not optimised for speed such that the run time for each case was \(34\pm 12\) min on a 2.7 GHz Intel Core i7 CPU using four parallel processors. In post hoc testing this was reduced to 28 min using series acceleration methods, i.e., not much faster and that despite reducing the call to the GPC algorithm from 45 to 19 ms average. Thus, much of the elapsed time for regression is due to factors other than GPC algorithm run time, for example performing constrained rather than unconstrained regression and the use of extended precision. Individual subject parameter statistics (CV%) are not a feature of the global optimisation search method used, and see the “Parameter errors from model-based bootstrap” Appendix Section for a computationally intensive analysis of algorithmic robustness. For E2, parameter CV% are typically calculated from local optimisation gradients, e.g., using the Levenberg–Marquardt method and similar methods, which may not be the CV% from least error, i.e., global, solutions, and only indirectly examine for robustness. Sample-time groups are used in the text, especially in “Noncompartmental time-sample group, median half-life”. These are merely groups of samples having in common the same times following bolus injection that were used for drawing samples from all of the subjects.

Dose regimen models

The simplest dose regimen is repeat constant dosing. Another dose regimen of possible interest for vasoactive drugs, e.g., dipyridamole, would be to keep the mean plasma drug concentration constant for each dose interval. A dose regimen for drugs like metformin that have major effects in cytosol and mitochondria and that accumulate in cytosol, may have as a goal to dose for the same mean drug mass retained in the body during each dose interval.

Let us assume first order elimination and superposition. Then, a protocol for keeping a constant mean unit dose drug mass in the body starts with a unit intravenous dose from the identity function for a density, \(\int _0^\infty \text {pdf}(t)\,dt=1\), break it into dose periods of \(\tau \), and divide that series by \(\tau \) to calculate the average fraction of a unit dose mass per unit time during each dosing interval, which is the average mass over the dose interval, which written from the survival function equals \(\Delta S(t)/\tau \), i.e., \(S\tau (i)=\frac{1}{\tau }\left\{ S[\tau (i-1)]-S(\tau i)\right\} \), for \(i=1,2,3,\ldots \)

$$\begin{aligned} \frac{1}{\tau }= & {} \frac{1}{\tau }\int _0^\tau \text {pdf}(t) d t\nonumber \\&+\frac{1}{\tau }\int _\tau ^{2\tau } \text {pdf}(t) d t +\frac{1}{\tau }\int _{2\tau }^{3\tau } \text {pdf}(t) d t +\dots \end{aligned}$$
(12a)
$$\begin{aligned}= & {} S\tau (1)+S\tau (2)+S\tau (3) +\dots \ . \end{aligned}$$
(12b)

During the first dose interval of duration \(\tau \), to augment the dosage to yield an average drug mass during \(\tau \) to be a unit dose, the dose administered is augmented by multiplying the unit dose by the reciprocals of the first terms of right sides Eqs. (12a) and (12b) to yield the dose factor, DF

$$\begin{aligned} DF (1)=\frac{\tau }{\int _0^\tau \text {pdf}\,(t)\, d t} =\frac{1}{S\tau (1)}\ge 1\;\;. \end{aligned}$$
(13)

To obtain a dose factor for a constant average unit dose mass during the second \(\tau \) interval one includes the contribution from the first dose second dose interval, and the second dose first dose interval, which simplifies to yield

$$\begin{aligned} DF (2)= DF (1) \left[ 1- DF (1)\;S\tau (2)\right] \;\;. \end{aligned}$$

For \(n\ge 2\), this equation generalises to be the recursion,

$$\begin{aligned} DF (n)= DF (1) \left[ 1-\sum _{i=1}^{n-1} DF (i)\, S\tau (n+1-i)\right] . \end{aligned}$$
(14)

Note that as \(n\rightarrow \infty \), \( DF (n)\) goes to that fractional dose multiplier, \({>}\,0\), that allows an average of a unit dose to be maintained in the body during the \(n^{\text {th}}\) dose interval. A word of caution, for accurate results from Eq. (14), the terminal tail of the model used must accurately match plasma/serum drug concentrations.

Distributions

This subsection presents the distributions (pdf) used. Some properties of these distributions are: The convolution of two pdf is a pdf. The AUC of a pdf is 1. \(C(t)=\kappa \, \text {pdf}\) where \(\kappa \) is the AUC of C(t). A survival function, S(t), is the t to \(\infty \) integral of the pdf. For intravenous bolus conditions, regardless of distribution type, when \(t=0\), \(S(0)=1\), and the entire dose fraction is within the body.

Gamma distribution

The gamma distribution (GD) is given by

$$\begin{aligned} \text {GD}(a,b;t) =\,\dfrac{1}{t}\;\dfrac{e^{-b \, t} (b \, t)^{\,a} }{\Gamma (a)} \;\theta (t)\;, \end{aligned}$$
(15)

where the gamma function satisfies \(\Gamma (a)=\int _0^{\infty } e^{-t} t^{a-1}\,dt\), and \(\theta (t)\) is the unit step function, i.e., 0 for \(t<\,0\) and 1 for \(t\ge 0\). The GD is an exponential density (ED, below) when \(a\,=\,1\). The GD also has a (\(+\,\infty \)) discontinuity at \(t\,=\,0\) when \(0\,<\,a\,<\,1\). However, that discontinuity is integrable with zero area in the limit as \(t\rightarrow 0 \). Thus, the survival function (\(S_{\mathrm{GD}}\)) is defined at \(t\,=\,0\), and,

$$\begin{aligned} S_{\mathrm{GD}}(a,b;t)=\dfrac{\Gamma (a,b\, t)}{\Gamma (a)} \;\theta (t)=Q (a,b\, t)\;\theta (t)\;, \end{aligned}$$
(16)

where the upper incomplete gamma function satisfies \(\Gamma (a,z)=\int _z^{\infty } u^{a-1} e^{-u}d u\), and Q(az) is its regularised form. The GD rate parameter is b, whereas \(\frac{1}{b}\) is the scale parameter. The shape parameter for the GD is a. The shape parameter aids in fitting disposition curves and their shapes. There is no location parameter.

Pareto distribution

Negative power functions of time have an undefined (\(+\,\infty \)) discontinuity at \(t\,=\,0\) that is not integrable (\(+\,\infty \)). Thus, a power function as a density function could have a minimum positive start time, \(\beta >\,0\), thereby allowing for integration that avoids the discontinuity. Just such a function is the type I Pareto distribution (PD),

$$\begin{aligned} \text {PD}(\alpha , \beta ;t)=\dfrac{\alpha }{t} \left( \dfrac{\beta }{t}\right) ^{\alpha } \;\theta (t-\beta )\;. \end{aligned}$$
(17)

Note the shift to the right of the unit step function; \(\theta (t-\beta )\), i.e., the PD function is zero for \(t<\,\beta \). The PD survival function (\(S_{\mathrm{PD}}\)) is,

$$\begin{aligned} S_{\mathrm{PD}}(\alpha ,\beta ;t)=\left\{ \begin{array}{ll} \left( \dfrac{\beta }{t}\right) ^{\alpha } &{} t\ge \beta \\ \;\;1 &{} 0\le t<\beta \\ \;\;0 &{} t<0\\ \end{array}\right. \;. \end{aligned}$$
(18)

\(\beta \) is the scale parameter. The shape parameter is \(\alpha \). There is no location parameter.

Sums of exponential term distribution

Sums of exponential terms (SET) models are equivalently \(C_{\mathrm{SET}}(t) =\theta (t)\;\sum _{i=1}^n c_i \,e^{-\lambda _i\,t}\), where n is any positive integer including 1, and without loss of generality \(\lambda _1\ge \lambda _2\ge \lambda _3...\ge \lambda _n>0\). SET models have been incorrectly criticised as having meaningless \(c_i\) coefficients. In a statistical context note that \(\lambda e^{-\lambda t}\) is an exponential density, then let \(\sum _{i=1}^n p_i= 1\), where the \(p_i\) (not probabilities here) are the fractions of the model’s total density attributed to each exponential density term. Then,

$$\begin{aligned} \text {ED}_n(t)= \;\theta (t)\;\sum _{i=1}^n p_i\,\lambda _i \,e^{-\lambda _i\,t}, \end{aligned}$$
(19)

where the \(\lambda _i\) decay coefficients are rate (i.e., 1/scale) parameters. The \(c_i\) of SET functions relate to the \(\hbox {ED}_n\) parameters as \(c_i=\kappa \, \lambda _i\, p_i\), where concentration is \(C_{\mathrm{SET}}(t)=\kappa \, \text {ED}_n\) in which \(\kappa =\int _0^{\infty }C_{\mathrm{SET}}(t)dt= AUC (C_{\mathrm{SET}})\). Note that a scaled monoexponential, an E1, is not a monoexponential density, ED, as E1\(\,=\textit{AUC}_{\mathrm{E1}} \;\text {ED}\). So when we refer to a biexponential we write E2, where a biexponential density is an ED\(_2\). The survival function is

$$\begin{aligned} S_{\mathrm{ED}n}= \;\theta (t)\;\sum _{i=1}^n\,p_i \,e^{-\lambda _i\,t}\,. \end{aligned}$$
(20)

Note that \(\hbox {ED}_n\), \(S_{\mathrm{ED}n}\) and SET only have scale parameters (or their reciprocals). There are no shape parameters to aid for fitting curve shapes or for extrapolation, and the solutions for \(\hbox {SET}_{n\ge 2}\) lack robustness. For example, biexponentials (E2) solved for 4 time-samples sometimes have solutions as \(\in \mathbb {C}\), not \(\in \mathbb {R}\) [36]. There are no location parameters.

The gamma-Pareto convolution distribution, basic formulas

Derived from series expansion of the GD exponential, Eq. (15), followed by convolution with a PD, Eq. (17) and summation of the expanded parts,Footnote 1 the GPC model inherits two shape parameters, a and \(\alpha \), and two scale parameters, b and \(\beta \) from its convolved pdf’s,

$$\begin{aligned}&\text {GPC}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) \nonumber \\&\quad =\text {GD}( a,b;x)*\text {PD}(\alpha , \beta ;x) \;(t)\nonumber \\&\quad =\theta (t-\beta ) \frac{\alpha b^a\beta ^\alpha }{\Gamma (a)}\nonumber \\&\qquad \sum _{n=0}^{\infty } \frac{1}{n!}(-1)^n b^n t^{a-\alpha +n-1} B_{1-\frac{\beta }{t}}(a+n,-\alpha ) \;\;, \end{aligned}$$
(21)

where the incomplete beta function is \(B_z(a,b)=\int _0^z u^{a-1} (1-u)^{b-1}d u\). The GPC function has a concentration start time, \(\beta >0\), and a circulatory mixing start time of 0. In practice, GPC models chain, i.e., convolve, two independent washout functions (monotonic decreasing; GD iff \(a\le 1\) and a PD) that only when combined produce a circulatory peak concentration. The GD washout models relate to the circulatory mixing that starts at \(t = 0\), whereas the PD models relate to the injection to sampling venous-venous delay of a few dozen seconds (\(=\beta \)) and the terminal rate of the increase in apparent volume of distribution in time [\(CL /\alpha \), Eq. (33), below].

For use with M(t), and other functions, we integrate the GPC function from 0 to t to construct its cumulative density function (proof by differentiation),

$$\begin{aligned}&\text {CDF}_{\mathrm{GPC}}\left( \begin{array}{ll} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) \nonumber \\&\quad =\theta (t-\beta )\frac{\alpha \beta ^{\alpha }b^a}{\Gamma (a)} \nonumber \\&\qquad \times \sum _{n=0}^{\infty } \frac{(-1)^n b^n}{n! (a-\alpha +n)}\nonumber \\&\qquad \left\{ \begin{array}{l} \left( t^{a-\alpha +n}-\beta ^{a-\alpha +n}\right) B_{1-\frac{\beta }{t}}(a+n,-\alpha ) \\ \quad -\beta ^{a-\alpha +n} \left[ \frac{\left( \frac{t}{\beta }-1\right) ^{a+n}}{a+n} +B_{\frac{\beta }{t}}(-\alpha ,a+n)-B(-\alpha ,a+n)\right] \end{array}\right\} \;, \end{aligned}$$
(22)

where for \(z=1\), \(B_z(a,\,b)\) becomes the beta function \(B(a,b)=\frac{\Gamma (a)\,\Gamma (b)}{\Gamma (a+b)}\). As it is needed for the half-life of concentration, we take the derivative of the GPC,

$$\begin{aligned}&\text {GPC}\,'\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) = \theta (t-\beta )\frac{\alpha \beta ^{\alpha } b^a}{\Gamma (a)}\nonumber \\&\quad \times \sum _{n=0}^{\infty } \frac{(-b)^n}{n!} t^{-1} \bigg[ (a-\alpha +n-1) t^{a-\alpha +n-1} \nonumber \\&\qquad \qquad \left. B_{1-\frac{\beta }{t}}(a+n,-\alpha ) +\beta ^{-\alpha } (t-\beta )^{a+n-1}\right] \;\;. \end{aligned}$$
(23)

Asymptotes of the gamma-Pareto convolution

Two asymptotes to the GPC function were identified for t is a sufficiently long time. The first is the Pareto Type I distribution itself, and is the desired result; to write a power function tail into a more general PK model,

$$\begin{aligned} \text {GPC}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) \;\sim \; \alpha {\beta }^\alpha {t}^{-1-\alpha }\;, \end{aligned}$$
(24)

and is a straight line on a log-log plot, where (\(\sim \)) means asymptotic to. This occurs because the GD tail decays quickly and the PD tail decays very slowly, see “Relative tail heaviness”.

The second GPC asymptote was obtained by Taylor series expansion of Eq. (21), with the proof appearing as the .pdf file labelled Supplementary Materials 2,

$$\begin{aligned}&\text {GPC}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) \nonumber \\&\quad =-\theta (t-\beta ) \frac{\alpha b^a \beta ^\alpha }{\Gamma (a)} t^{a-\alpha -1} \sum _{n=0}^{\infty } \frac{(- b\,t)^{n} }{n!} B_{\frac{\beta }{t}}(-\alpha ,a+n)\nonumber \\&\qquad +\theta (t-\beta ) \alpha b^{a}\beta^{\alpha} \Gamma(-\alpha)t^{a-\alpha -1} \, _1\tilde{F}_1(a,a-\alpha ;-b\,t)\;, \end{aligned}$$
(25)

where \(_1\tilde{F} _1 (a;b;z)\mathop {=}\limits ^{\textsf {def}}\Gamma (b)\sum _{k=0}^{\infty } \frac{(a)_k }{(b)_k}\frac{z^k}{k!}\) is the regularised form of Kummer’s confluent hypergeometric function of the first kind wherein \((a)_k\) and \((b)_k\) are Pochhammer symbols, i.e., generalised ascending factorials, \((a)_k=a (a+1) \ldots (a+k-1)=\Gamma (a+k)/\Gamma (a)\). The closed form term (beginning \(+\theta \)) of Eq. (25) is positive when \(0<\alpha <1\) and \(b\,t\) are sufficiently large, while the limit of the sum converges to zero. This forms a second GPC asymptote that approaches the GPC more rapidly than the PD tail, and finds use for computing the GPC when t is long, e.g., approximately \(>100\) h. That is, for \(0<\alpha <1\) and t sufficiently long, that term and the GCP are asymptotic to each other,

$$\begin{aligned}&\text {GPC}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) \nonumber \\&\quad \sim \; \theta (t-\beta )\alpha b^{a}\beta^{\alpha}{\Gamma (-\alpha )} t^{a-\alpha -1} \, _1\tilde{F}_1(a;a-\alpha ;-b\,t)\;. \end{aligned}$$
(26)

What the concentration models imply

Drug mass

The drug mass remaining in the body, is from the GPC CDF, Eq. (22) applied to Eq. (2),

$$\begin{aligned} M(t)=D\, S_{\mathrm{GPC}}=D\,\left[ 1-\text {CDF}_{\mathrm{GPC}} \left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) \right] \ , \end{aligned}$$
(27)

where \(M(0)=D\) is the dose.

Half-life functions

The half-life function of metformin model concentration, \(t_{1/2};C(t)\), as observed in a peripheral venous sampling site, comes from substituting the \(\hbox {GPC}(t)\), Eq. (21), and its derivative Eq. (23) for the f(t) and \(f'(t)\) of Eq. (6),

$$\begin{aligned} t_{1/2};C(t)=-\ln (2)\;\frac{\text {GPC}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) }{\text {GPC}\;'\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) } \,. \end{aligned}$$
(28)

Next, the half-life of drug mass remaining in the body from Eqs. (2), (6), (21), and (22) is

$$\begin{aligned} t_{1/2};M(t) = \ln (2)\frac{ 1-\text {CDF}_{\mathrm{GPC}}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) }{\text {GPC}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) }\;. \end{aligned}$$
(29)

The asymptotic half-life of the GPC concentration model was obtained by substitution of the first GPC asymptote, Eq. (24), and its derivative into half-life Eq. (6), and is linear

$$\begin{aligned} t_{1/2};C(t)\sim \frac{\ln (2)}{\alpha +1}\;t\;;\qquad t\rightarrow \infty \; . \end{aligned}$$
(30)

Similarly, the half-life of drug mass is asymptotic to

$$\begin{aligned} t_{1/2};M(t)\sim \frac{\ln (2)}{\alpha }\;t\;;\qquad t\rightarrow \infty \;, \end{aligned}$$
(31)

a linear function of time. Equations (30) and (31) suggest that metformin GPC half-lives can be any time, no matter how long, just by measuring that half-life at a correspondingly late time. From Eqs. (31) divided by (30) the terminal ratio of half-lives of drug mass remaining in the body to serum concentration is

$$\begin{aligned} \frac{t_{1/2};M(t)}{t_{1/2};C(t)}\sim \frac{\alpha +1}{\alpha }; \qquad t\rightarrow \infty \end{aligned}$$
(32)

which as \(\alpha >0\), is \(>1\). That this limit is not equal to 1, as it would be for lighter-tailed models, exemplifies why Pareto distribution tails have been called fat-tailed historically; because of their provocatively different properties from those of lighter-tailed distributions.Footnote 2 There are now two expressions for the mass to concentration ratio, one of them for any variable volume model, Eq. (9), and the other from the PD asymptote to the GPC, Eq. (24). Combining these expressions yields the terminal rate for volume of distribution apparent growth for our fat-tailed model of \(V'_d( \infty )\),

$$\begin{aligned} V'_d( t)\sim \frac{ CL }{\alpha };\,\,\,t\rightarrow \infty \;. \end{aligned}$$
(33)

As \(\alpha \) is dimensionless, \(V'_{d}\); the rate of growth of \(V_d\), has the same units as \( CL \). Rewriting of the half-life harmonic sum Eq. (10) and solving for the asymptote of \(t_{1/2};V_d(t)\) yields its negative half-life (growth)

$$\begin{aligned} -t_{1/2};V_d(t)\sim \ln (2)\,t\,;\;\;t\rightarrow \infty \;. \end{aligned}$$
(34)

Results

The gamma-Pareto convolution (GPC) distributions’ parameters for metformin in seven dogs appear in Table 1. As a global search routine, the Nelder-Mead method does not generate case-wise CV% results. For that information, please refer to the “Parameter errors from model-based bootstrap” Appendix section. Figure 1 shows GPC models fit to the data over four orders of magnitude of serum concentration of metformin and 72 h of data sampling for seven subjects. Also shown as straight lines on the same log-log plots are the GPC’s Pareto power-function asymptotes, Eq. (24). A back extrapolated GPC concentration model is shown in Fig. 2, with zero initial concentration that cannot be shown in the Fig. 1 log-log plots. For semi-log plots please see the .pdf file labelled Supplementary Materials 3. Goodness of fit was shown as relative error plots of residuals of fitting in Fig. 3 for the GPC models and Fig. 4 for biexponential (E2) models. Recall (Appendix “Concentration data and fitting it”) that relative errors are identically \(1/C^2\) weighted residuals, and note that the sample-times were approximately equally logarithm of time distributed. Regression in y does not see the x-axis spacing, thus the equal spacing between grouped sample-times on these plots.

From Table 1 the GPC models mimicked experimental concentrations and achieved an early concentration peak ranging from 0.66 to 6.17 min total elapsed time. GPC shape parameters from the GD were \(a<1\) for which there were infinite concentrations at \(t=0\). These GD curves are washout or monotonic decreasing curves with a rate (1/scale) parameter, \(b\approx 1\) h\(^{-1}\). The GPC PD parameters are \(\alpha \) and \(\beta \). The PD shape parameters, \(\alpha \), in the power functions’ exponents, were less than one, which obviates calculation of MRT as per Appendix Subsection “Residence Time”.

Table 1 Shown are parameters from gamma densities (GD), Pareto densities (PD) and both from Gamma-Pareto convolution (GPC) fitting of concentrations data for seven dogs
Fig. 1
figure 1

Log-log profiles of metformin concentrations versus time in seven dogs. The black circles are data. The red circles are the five samples left out following outlier testing. The blue curves are the GPC concentration models and the green lines are the Pareto (power function) asymptotes of the GPC models. Note the maximum distance between the blue and green curves at 1–2 h. That is the peak effect time of GD convolution. At those times, the convolution of GD function’s slow mixing increased the PD function magnitude several fold

Fig. 2
figure 2

This shows the early time pdf, i.e., each having an AUC of 1, of the GPC model (blue) and the gamma density (GD, in red) for dog 5. The GD is shown truncated at its top, and is the faster decaying of two GPC constituent functions. Note that the GPC starts a small time (25 s here) later than the GD, and the GD is less than the GPC for late time

The GPC earliest, or start, time is \(\beta \). The start times, \(\beta \), of the PD (and thus of GPC distributions), known as the PD scale parameter could not be measured directly (no data at that time) and the unconstrained parameter introduced surfeit variability into the concentration and mass results. By testing \(\beta \) every 5 s from 10 to 35 s and plotting the errors of fit, a good compromise turned out to be an estimated circulatory first arrival time of drug of 25 to 30 s. The GPC harmonic mean residence time (H-MRT) was calculated as in Appendix Subsection “Residence Time”. Then \(V_{H{\text {-}}MRT}= CL _{\mathrm{GPC}}\,H{\text {-}} MRT \)—see Table 1.

Equation (19) gave the statistical context of constant multipliers of the exponential terms of SET functions. For E2 models \(c_1e^{-\lambda _1 t}+c_2e^{-\lambda _2 t}= AUC \bigg[ p_1 \left( \lambda _1e^{-\lambda _1 t}\right) +p_2\left( \lambda _2 e^{-\lambda _2 t}\right) \bigg]. \) In this, any \(f(t)=\lambda e^{-\lambda t}\) term is a pdf, where \( AUC _{\mathrm{pdf}}=1\). Then, \(p_1+p_2=1\), \(p_1=\frac{c_1 \,\lambda _2}{c_1\,\lambda _2 +c_2 \,\lambda _1}\) and \( AUC _{C(t)}=\frac{c_2}{\lambda _2\, p_2}.\) Biexponential fit parameters in this format appear in Table 2.

Table 2 Shown are biexponential (E2) statistical parameters from seven dogs

Fitting for proportional error was performed as per the “Concentration data and fitting it” Appendix Subsection. The results of the selection of the GPC and E2 fit functions and their parameters are next characterised by goodness of fit. As per Table 3, the GPC mean relative root mean square (rrms) error was 8.6%, where 10% or less was taken to be good a priori as elsewhere [37, 51], and where the residuals appear in Fig. 3. The GPC residuals were narrow in range, and approximately normally distributed.

Biexponential model (E2) residual errors—see Fig. 4—averaged 16.1% rrms or nearly twice that of the GPC fit error, using the same fitting methods.Footnote 3 The GPC intravenous bolus arrival time, \(\beta \), was highly constrained, which adds only a fraction of a degree of freedom, df, to the 4 other parameters. To put it another way, using a 25 to 30 s window for \(\beta \) gives an 8.61% residual error of fitting compared to 8.65% for a fixed 25 s time, which latter has exactly 4 df comparable to the 4 df of an E2, but which had a significantly smaller rrms than E2’s 16.1%. The E2 residuals are much more structured than the GPC residuals, see the “Parameter errors from model-based bootstrap” Appendix Section for details. The E2 model residuals had an overall ’M’ shape, which was also identified for each individual E2 curve. The two ’M’ humps are one each from the two local exponential density distributions of a biexponential (a triexponential has three humps). E2 model residuals were non-normally distributed (approximately a three parameter Weibull distribution) and had much wider 95% confidence interval than GPC residuals with an early and late time elongated tail in the underestimating direction (down in the figure). AUC was thus systematically underestimated using E2 models. As implied by the structured residuals, SET models do not systematically approximate the derivatives of concentration. That is, biexponentials fit to this data had slopes roughly analogous to how a broken 24 h clock shows the correct time three times within 72 h—see Fig. 5. This is the effect of not having explicit shape parameters in SET models; the shape of the data, i.e., the derivatives, are only transiently approximated.

Fig. 3
figure 3

GPC fit relative errors (± is over/under estimate) plotted for sample-time groups in temporal sequence. Note the normal distribution of error, the narrow the 95% confidence intervals, and the closeness to zero error of the connected black circles of mean values for samples. The red connected circles are mean values of 5 left-out samples

Fig. 4
figure 4

E2 fit relative errors (± is over/under estimate) plotted for sequential samples groups. Note the skewed distribution of error, the wide 95% confidence intervals, and the M-shaped variation of the connected black circles of mean values for samples. The mean values of 5 noisy samples left out not shown

Fig. 5
figure 5

An example E2 fit to data (dog 6, blue curve). The black circles are the time samples. The approximate times for correct E2 slopes are shown at the point of tangency of the log-log tangent red line segments. Note the M-shaped wandering of the E2 function above and below the data locations, and especially the underestimation of sample concentration as well as incorrect slopes below the earliest and latest time samples

Unexplained fraction is \(1-R^2\), a measure of goodness of fit, herein from the multiple correlations of \(\ln (model)\) and \(\ln (data)\)—see Eq. (39). Table 3 shows these unexplained fractions from the E2 models (0.84% mean) to be multiple times greater than from GPC models (0.22% mean) for which the two-tailed Wilcoxon signed-rank of \(p=0.0156\) suggested significant difference.Footnote 4

Table 3 Shown is clearance, CL from biexponential, E2; noncompartmental, NC; and gamma-Pareto convolution, GPC, models, where relative root mean square (rrms) error of fit and unexplained fraction are shown as percentages for E2 and GPC

Clearance

In addition to the fit errors, Table 3 shows clearance (CL) values for three models: the GPC, E2 and non-compartmental (NC) models, the latter as per the last paragraph of the “Concentration data and fitting it” Appendix Subsection. The NC CL was significantly less than the biexponential CL, but still significantly greater than the GPC CL from Eq. (27) (both \(p=0.0078\), 1-tailed paired Wilcoxon tests). The NC and GPC correlation was good, \(r^2=0.9840\), and the NC CL was \(3.6\ \hbox {ml}\, \hbox {min}^{-1}\, \hbox {kg}^{-1}\) greater than GPC CL. Note that for the GPC model \(V'_d(t)\), i.e., rate of apparent volume growth with the same units as CL, achieved a limiting value of \(6.0\pm 0.6\) (\(\hbox {mean} \pm \hbox {SEM}\)) times the CL.

Drug mass

Table 4 shows GPC model relative AUC-values values at various times where 21.1% occurred following the last sample-time corresponding to \(78.9\%\,\pm 8.2\%\) elimination of the dose by 72 h. Figure 6 plots drug mass remaining from the GPC model Eq. (27) application of \(M(t)=D\; S(t)\). Note, although dog 1 had the least CL (\(9.8\ \hbox {ml}\, \hbox {min}^{-1} \, \hbox {kg}^{-1}\)—Table 1) that dog had the greatest drug mass eliminated at 72 h (91.3%). Dog 1 needs less CL to clear mass as that dog’s volume of distribution from harmonic residence time, \(V_{H{\text {-}} MRT }\), of 0.069 (L/kg)—Table 1—was an extremely reduced volume outlier. GPC’s \(V_{H{\text {-}} MRT }\) classifies volumes similarly to biexponential \(V_{ SS }\)—see the “Residence Time” section of the Appendix for further information.

Table 4 AUC for \(C_{\mathrm{GPC}}(t)\) curves as the percent of dose eliminated is shown for early time from 0 to 20 min, sample-times from 20 min to 72 h, late-time from 72 h to \(\infty \), and from 0 to 72 h
Fig. 6
figure 6

Linear-log plot of metformin (as base) mass retained in seven dogs as calculated from \(D\cdot S(t)\) of the \(\hbox {GPC}_{C(t)}\) model. Note how the dose (in mg per kg body weight) is retained in time. Note the constant mass until the Pareto density scale, \(\beta \); the vascular first transit times of the models, occurred at 25 or 30 s. Dog 1 received a lesser dosage than the other dogs, and also rapidly cleared mass

Half-lives

Half-life defined at the instant in time it is measured was presented as Eq. (6) and in [23]. The fat-tailed GPC distribution, unlike lighter-tailed functions, yielded half-lives of mass and concentration that did not converge to the same value. Instead, they converged to different constants times the elapsed time—Eqs. (31) and (30)—where the ratio of those half-lives converged to a constant as shown in Fig. 7. The time following peak ratio for 95% of the ratio of half-lives of M(t) and C(t) to equal terminal occurred at a mean time of 9.8 h (range 7.4 to 13.1 h), well within the \(72\text { h}\) data collection time. The mean terminal ratio—obtained without urine collection—was 7.0 (range 4.8–9.3)—see Fig. 7 and Eq. (9). The mean terminal ratio of mass to concentration half-lives is 1 for SET functions. Figure 8 shows this effect for our E2 models for which by 21 h 8 min the average \(t_{1/2};M(t)\) to \(t_{1/2};C(t)\) ratio collapsed to \(<1.001\) and rate of apparent volume growth, i.e., \(V'_d(t)\), became vanishingly small as the models entered dynamic equilibrium. From Eq. (34) as illustrated in Fig. 7d, the GPC models \(-t_{1/2};V_d(t)\) eventually converged to \(\ln (2)\,t\). For an E2 model’s \(t_{1/2};V_d(t)\) the equivalent asymptotes are

Fig. 7
figure 7

GPC model \(t_{1/2}\) functions of time. Panel a shows \(t_{1/2};C(t)\); tightly grouped concentration \(t_{1/2}\) functions. Panel b shows \(t_{1/2};M(t)\); more dispersed drug mass \(t_{1/2}\) functions. Panel c shows the ratios of \(t_{1/2};M(t)\) to \(t_{1/2};C(t)\) converged to a mean of \(7.0\times \) by \(\approx 10\ \hbox {h}\). Panel d shows that the (negative) half-life of \(V_d(t)\) became proportional to \(\ln 2\;t\), by \(\approx 20\ \hbox {h}\)

$$\begin{aligned} -t_{1/2};V_d(t)\;\sim \;\ln (2)\frac{ \lambda _2 p_2 }{(\lambda _1-\lambda _2)^2 p_1}\;e^{(\lambda _1 -\lambda _2) t};\;\;t\rightarrow \infty \;, \end{aligned}$$

where \(p_2=1-p_1\) and where \(\lambda _1>\lambda _2\), which functions reached distinctly different asymptotes that grew much faster (\(\approx 10^{19}\) by 72 h) than the single-valued-line GPC asymptote. Thus, the result for the GPC model that \(-t_{1/2};V_d(t)\sim \ln (2)\,t\) is simpler than the comparable results for ED\(_{n\,\ge \,2}\) models.

Fig. 8
figure 8

Shown are E2 mass and concentration half-life ratios for metformin in seven dogs. Note the convergence of these ratios to 1 after approximately 21 h. In effect, the two compartments function as a single compartment after that time and volume growth collapsed to trace amounts

The Appendix “Noncompartmental time-sample group, median half-life” subsection provides the statistical properties of the noncompartmental, distribution-free method used to corroborate the trending of \(t_{1/2};C(t)\) values in time. (One does not use mean half-life values for this as they are unstable and do not measure centrality.) Figure 9 plots the stable noncompartmental (NC) median values at the geometric mean time for each pair of sample times of the each of 7 dogs’ \(t_{1/2};C(t)\) for the earliest sample-time pair, then the next earliest and so forth for all 21 pairs of sample-time groups using Eq. (5). This plot is overlaid with the (stable) median values of \(t_{1/2};C_{\mathrm{GPC}}(t)\) model curves evaluated at those same geometric mean times. In that figure, the NC model median exhibited a significant trend for increasing values with elapsing time. There is independence between NC half-lives that do not share common time samples, i.e., the odd sample groups, \(\{1,3,5,7,\ldots 21\}\), and the even sample groups \(\{2,4,6,8,\ldots 22\}\) are each independent for trending. Thus, the overall trending of NC half-lives was from the data and not from any assumption made. That is, the NC model median \(t_{1/2};C(t)\) trending was due to metformin concentration kinetics. The GPC concentration half-lives smoothly followed the trending of the NC median \(t_{1/2};C(t)\) results, due to the two shape parameters in the GPC models that allowed reduced-error, derivative matching to serum concentration samples, which in turn allowed for accurate half-life functions to be calculated using Eq. (6). No convincing trend toward a terminal \(t_{1/2}\)-value was seen within the data. When expressedFootnote 5 parametrically \(\ln \left[ t_{1/2};C_{\mathrm{NC}}(t)\right] =1.014\;\ln \left[ t_{1/2};C_{\mathrm{GPC}}(t)\right] \) with 95% CI of 0.966 to 1.062 for slope, and a discarded, not significantly different from 0 intercept, had multiple correlation \(R^2= 0.9820\) and Pearson \(r^2=0.9823\). Lin’s concordance correlation [52] was \(\rho _c=0.9906\), i.e., in the almost perfect range of the McBride scale [53] of \(\rho _c>0.99\), with most of the \({<}\,2\%\) error being NC noise. From this, these two very different measurements are almost perfectly measuring the same thing. To be clear, each adjacent sample group pair contained up to 14 samples for extraction of the NC median \(t_{1/2}\). This has the same meaning as a plot of the terminal exponentials of NC models fit to data ending at the last time of that sample group pair, and, the more temporal data was included, the longer the \(t_{1/2}\) was and Fig. 9 shows that \(t_{1/2};C(t)\)was a function of time for our data that was also almost perfectly fit by the median GPC model. When this same NC data was parametrically log-log plotted with the median E2 \(t_{1/2};C(t)\), rather than median GPC models, the plot with NC \(t_{1/2};C(t)\) was a highly nonlinear sigmoidal function and the concordance dropped to \(\rho _c=0.9463\), or in the 0.90 to 0.95 moderate concordance range, and with no 95% confidence interval (CI) overlap with the GPC 95% \(\rho _c\) CI’s. That is, E2 \(t_{1/2};C(t)\) did not measure the same thing as \(t_{1/2};C(t)\) from GPC and/or sample group median NC \(t_{1/2}\).

Fig. 9
figure 9

This shows how the noncompartmental (NC) median half-lives of seven dogs vary with predicted median GPC model half-lives. The NC median \(t_{1/2};C(\mathbf {t})\) model values are shown as black circles. These are discrete serum concentration half-lives from adjacent time sample groups. Each pair of adjacent time sample-groups exhibited a different half-life with a definite overall trend for increased half-life for increasing time of measurement. More robust results were obtained from the continuous median GPC concentration half-life curve (blue) from all seven dogs

Continuous infusion simulations

Constant infusion curves with limiting concentration, \(C_{ SS }\), were calculated indirectly from their IV bolus models’ CDF as implied by Eq. (11). These are shown for the E2 and GPC models for dog 6 as Fig. 10. For dog 6, the GCP infusion simulation of the right panel of Fig. 10 suggests that metformin’s persistent volume of distribution growth—see Eq. (4)—appeared to cause a major delay in the time (\(t\gg 72\ \hbox {h}\)) it takes concentration to approach a steady state, \(C_{ SS }\), suggesting that measuring metformin \( CL \) based on saturation during a constant infusion experiment would be intractable. However, constant infusion is considered by many to be a reference standard for measuring \( CL \). To see why this might be the case, let us assume that the GPC model is the more accurate measurement and that we observe a curve height of \(C_{ SS }=\dfrac{D_{\mathrm{R}}}{ CL }\), Eq. (11). Thus, if one drew a reduced \(C_{ SS }\) height thinking that the data is from a quickly convergent E2 infusion function, a \( CL \)-value that is 20% too large (\(0.991/0.826-1\)) would be produced. From Table 3 the NC and E2 CL-values were respectively 25% and 41% greater than the GPC \( CL \)-values for dog 6, such that a constant infusion measurement from an underestimated \(C_{SS}\) would have appeared to reflect a better standard than other methods that also use exponential approximations to data. However, assigning an underestimated terminal asymptote can be more misleading in another context, i.e., the completeness of collection of drug mass in urine to establish the percent drug recovery. For that problem, estimating percent recovery from the administered dose is numerically much more reliable than from asymptotic completeness based on curve shape.

Fig. 10
figure 10

Shown are constant infusion simulations from a biexponential bolus model (left panel) a GPC bolus model (right panel) from dog 6 data. Each steady state concentration \(C_{ SS }\) is plotted as 100% (red lines). As seen in the figure, at 72 h the biexponential model is already at 99.1% of \(C_{ SS }\), whereas the GPC model is only at 82.6% at that same time

Simulated dosing regimen for constant drug mass in the body

To illustrate a constant mean body drug burden during any dose interval, \(\tau \), Eqs. (13) and (14) were applied to dog 6 data, which dog had moderate 72 h drug retention (17.4%—Table 4). The intravenous dose loading needed to maintain a constant average body drug mass in every \(\tau =12\ \hbox {h}\) dose interval is shown in Fig. 11. As the plot is semi-log, a constant half-life would be a straight line, whereas in each dose interval, the plot is predominantly log-convex suggesting that the half-life increases for elapsed time during each dosing interval. As time progressed, the peak to trough ratios decreased and half-lives increased as per Eq. (6). Overall, the drug mass peak to trough variation varied less than one-tenth as much as it did for concentration for the same dosing intervals. The serial dose factor multipliers for constructing Fig. 11 were calculated to be 4.478, 1.931, 1.172, 0.844, 0.668, 0.558, 0.482, 0.427, 0.385, \(0.351,\ldots, 0.136\) times the body drug mass retained, where the latter number, 0.136, was the fractional dose administered under equilibrium conditions to maintain an average body burden of one dose during an eventual 12 h dose interval. This corresponds to an eventual \(1/0.136=7.37\) times as much drug in the body as is being given in each dose, with the caveat that it would take 6 weeks of twice daily dog 6 dosing to drop to within 10% of terminal maintenance dosing.

Fig. 11
figure 11

Semi-log plot of dog 6 simulations of Q12 h IV metformin multidosing for unit mean dose mass in the body. The amplitude of peak-to-trough ratio decreased markedly with time from 7.1 during the first dose interval to 1.5 during the 84–96 h dose interval. See text for the constant mass loading dose regimen

Discussion

This work gathers together treatments of models of drug concentration as AUC scaled density functions, variable apparent volumes of distribution equations with half-life defined as a real number variable of time. Also, the density function approach to convolution, a new definition for sums of exponential terms coefficients, an alternative expression for constant (inhomogeneous) volume of distribution from residence time that arises from a new density function model for concentration for power function tailed drugs; the gamma-Pareto type I convolution (GPC) were presented. The gamma-Pareto convolution (GPC) model chains a faster gamma distribution (GD) with a slower type I Pareto distribution (PD), the latter a normalised left truncated power function. As data during the first venous drug arrival times were not available, the \(\beta \)-parameter-values of the PD models were constrained to be 25–30 s and mimic reasonable vascular-drug arrival times. As per Table 1 both \( a \; \& \; \alpha <1\), and both the GD and PD were log-convex. However, their convolution; the GPC distributions, like the concentration curves they mimic, only became log-convex a short time after the model’s peak concentration circa 2.7 min was achieved—see Table 1. The GD models were lighter-tailed than the first compartment E2 \(\lambda _1\) but nonetheless augmented the GPC amplitudes out to relatively late times, circa 20 h—see Fig. 1. GPC models are asymptotic with elapsing time to their second constituent distributions, the Pareto distribution (PD) power functions, Eq. (24). As \(\alpha <1\) for both the PD, and the GPC function asymptotic to it, their mean residence times (MRT) were undefined, and different mean times were found using harmonic MRT (H-MRT). Unlike \(V_d(t)\), which grew with time, \(V_{H{\text {-}} MRT }= CL _{\mathrm{GPC}} \,H{\text {-}} MRT \) were constant with values \(<1\) (L/kg) that were significantly Spearman rank correlated to E2 \({V_{ SS }}\) values—see Table 1 and the Appendix “Residence Time” Subsection.

Neither non-equilibrated states nor variable volume modelling prohibit multicompartmental analysis. For example, a two-compartment GPC model would result from assigning a central compartment and a peripheral volume assigned that concentration as a function of time that conserves systemic mass. One might think that without classical multicompartmental modelling of cell membrane transport that pharmacogenomic effects would be undetectable. Dog 1 had the lowest \( CL \) and highest 72 h mass eliminations while its terminal rate of volume growth, \(V'_d(\infty )\), a parameter only meaningful for fat-tailed models, also had the least value in this short series of seven dogs. The reduced \(V'_d(\infty )\)-value indicates a relatively reduced rate of tissue absorption of the drug and may be due to a genetically deficient transporter protein. In effect, the requirement for conservation of mass in a single compartment variable volume model allowed physiological parsing into mass loss only to renal clearance and redistribution to be quantified as concentration dilution from apparent volume growth. GPC distributions imply a single fractal recirculatory structure that could be elaborated mathematically. For an introduction to variable volume models and fractals the reader is urged read reference [54]. Although compartments and fractals are of theoretical interest, a model’s usefulness arises from the scope of experimental results that it clarifies. Accordingly, this discussion focuses next on what GPC imply models, and how they might be used.

Implications

The usual definition of renal clearance for a non-metabolised renal-only cleared marker is

$$\begin{aligned} CL _{\mathrm{urine}}=\frac{\text {U}\,\text {V}}{\text {P}} \end{aligned}$$
(35)

where U is drug concentration in urine, V is urine volume, and P is drug concentration in blood plasma. The assumption is that such a system is in dynamic equilibrium. \(\text {U V}/\text {P}\) for intravenous bolus conditions can be generalised to nonequilibrium states by multiplying it by either side of Eq. (9), i.e.,

$$\begin{aligned} CL =\frac{{{t}_{1/2}};M(t)}{{{t}_{1/2}};C(t)} \;\frac{\text {U V}}{\text {P}}=\frac{ CL +{V}_{d} \,'\left( t \right) }{ CL }\;\frac{\text {U V}}{\text {P}}\,. \end{aligned}$$
(36)

To see this is so, one multiplies Eq. (10) by \(t_{1/2};C(t)\) to obtain the fractions of concentration that term-wise parse to mass elimination and volumetric dilution, as follows

$$\begin{aligned} 1&=\overbrace{\frac{t_{1/2};C(t)}{t_{1/2};M(t)}}^{\text {Mass}} +\overbrace{\frac{t_{1/2};C(t)}{-t_{1/2};V_d(t)}}^{\text {Volume}} \end{aligned}$$
(37a)
$$\begin{aligned} 1&=\underbrace{\frac{ CL }{ CL +V'_d( t)}} _{\text {Mass}}+\underbrace{\frac{V'_d(t)}{\; CL +V'_d\left( t\right) }}_{\text {Volume}}\ \ . \end{aligned}$$
(37b)

Then, to correct plasma concentration, P, in the denominator of Eq. (35) to mass elimination conditions, we multiply its numerator by the reciprocals of the mass terms of Eqs. (37a) and (37b), because these reciprocal mass terms are both sides of Eq. (9), and are equal to each other. In other words, for a non-metabolised, renal-excreted drug, the molecules of drug that have shown up in urine, are only those molecules within the plasma that are parsed at that instant of time into urine as opposed to being ’redistributed’ by ongoing apparent-volumetric concentration dilution.

How long the nonequilibrium state appears to persist depends on the model used. Monoexponential models lack nonequilibrium; they are instantly mixed. As all higher \(\hbox {ED}_n\) models have an instantly mixed volume, a premature achievement of a terminal volume carries over to biexponential and higher models. In transient nonequilibrium \(V'_{d}( t )\) eventually disappears—see Fig. 8. To delay the time to dynamic equilibrium one can use NC models or heavier-tailed GD models. For example, a population biexponential model of a GFR marker was observed to have a 7.5 times shorter time to 95% of terminal mass half-life than GD models with enforced log-convexity [23].Footnote 6

For a fat-tailed model, the nonequilibrium state is permanent and the ratio of half-lives achieved converged to \(\frac{\alpha +1}{\alpha }= 7.0 \pm 1.5 \) (\(\hbox {mean}\ \pm \ \hbox {standard deviation}\))—Eq. (32) and Table 1. As \(\alpha \) (dimensionless) is the shape parameter of the Pareto distribution, and as the Pareto distribution is the GPC terminal tail asymptote, only the plasma concentration tail information would need to be extracted from data to quantify a late-time nonequilibrium \(\frac{\text {U V}}{\text {P}}\) experiment. On average \( CL =7.0\,\frac{\text {U V}}{\text {P}}\) to within \(5\%\) for \(t>9.8\ \hbox {h}\)—see Fig. 7c.

The other shape parameter from the gamma distribution (GD) part of the GPC model, a, aids in shape fitting of the early data. The two shape parameters of the GPC model allow for good derivative fitting between the data and the model; \(C_{obs}(\mathbf {t})\) and \(\hbox {GPC}\left( \begin{array}{cc} a &{} b \\ \alpha &{} \beta \end{array} \Big |\,t\right) \). Even when GD models were used as a stand-alone; without convolution, provided that the \(0<a<1\) shape had been selected by adaptive regularization, that shape parameter variation allowed matching to the different shapes of the disposition curves for fluid overloaded patients [55]. That E2 models were inefficient for metformin curve fitting was suggested by their poor goodness-of-fit—see Figs 3 and 4—and by their derivatives being unrelated to the data shape—see Fig. 5. From Table 3, the E2 models averaged 16.1% rrms error with 0.84% unexplained fraction, the GPC models’ errors were significantly better at 8.6% rrms with 0.22% unexplained fraction.

Thus far, there have been interesting implications from GPC modelling, for example, an apparent volume of distribution growth within the body that was terminally 6.0 times the renal clearance, that \(\text {U}\,\text {V}/\text {P}\) needed correction, that terminal half-life of mass appearance in urine was 7.0 times the serum drug concentration half-life value, that the drug mass retained in the body was substantial and that \(V_d\) and \(t_{1/2}\) increased with elapsed time during multidosing. Although direct testing of these findings was not within the scope of the current study, some of these results can be contrasted with prior study results, as follows.

Clearance

If metformin is only cleared by renal plasma flow (RPF) extraction, then its clearance should be \(\le \hbox {RPF}\). We estimated RPF from p-aminohippurate (PAH) dog data as PAH renal clearance divided by PAH arteriovenous extraction ratio, \(\text {RPF} =\frac{ CL _{\mathrm{PAH}}}{\text {E}_{\mathrm{PAH}}}\). Thus, \( CL _{\mathrm{PAH}}=13.51\,(\hbox {ml}\, \hbox {min}^{-1} \, \hbox {kg}^{-1})\) (mean, 75 dogs) [56] was divided by the \(\hbox {E}_{\mathrm{PAH}}=0.7564\) (metadata; weighted average 65 dogs) from three series [57,58,59] to yield \(\text {RPF}\approx 17.86\,(\hbox {ml}\, \hbox {min}^{-1} \, \hbox {kg}^{-1})\). From Table 3 the GPC model metformin \( CL \) was 15.1 (\(\hbox {ml}\, \hbox {min}^{-1} \, \hbox {kg}^{-1}\)) or 84.8% of estimated RPF,Footnote 7 i.e., a result less than the 90–100% metformin extraction fraction expected for humans [4]. There are multiple confounders, including possibly inaccurate \(CL _{\mathrm{PAH}}\), and canine tubular resorption of PAH. However, the metformin extraction for the GPC model at 84.8% was greater than the nominal canine \(\hbox {E}_{\mathrm{PAH}}\) of 75.6%, where the latter in humans is circa 90% and is called effective RPF. Less plausible and significantly greater results were seen for noncompartmental (NC) \( CL \) of \(18.7\ (\hbox {ml}\, \hbox {min}^{-1}\, \hbox {kg}^{-1})\) at 104.7% of expected RPF with even worse results for E2 \( CL \) at 22.1 (\(\hbox {ml}\, \hbox {min}^{-1}\, \hbox {kg}^{-1}\)) or 123.7% of expected RPF. We attribute the increased E2 \( CL \)-values to AUC underestimation from under-extrapolation of concentration beyond the underestimated earliest and latest sample concentrations—see Figs 4 and 5.

The above results suggested that NC curve methods using exponential extrapolation largely prevented underestimation of slope magnitude under the time-samples themselves, but did not prevent a significant underestimation of AUC in the extrapolated tails. This supported the proposition that heavier-tailed GPC models captured more of the otherwise under-extrapolated AUC—see Fig. 3 and Table 4. From a prior work with a GFR marker, back-extrapolation using 120- to 240-min data validated with withheld early time-samples was more accurate using a \(A-B \ln (t)\) fit function than using exponential back extrapolation, where the exponential back-extrapolation led to an underestimation of AUC [36]. This agrees with the NC overestimation of \( CL \)-values seen elsewhere from exponential under-extrapolation of early AUC [60] and of terminal tail areas [5, 25, 27]. Thus for multiple reasons, the \( CL \)-values from the GPC models seemed more plausible than the alternative models applied to the same data.

Metformin drug mass

The survival of drug mass in the dogs’ bodies at 72 h in Table 4 of 21.1% remaining represents a 5 fold decrease from the initial dose. By way of comparison, the serum concentration decreased 1000-fold from estimated peak concentration over 72 h—see Fig. 1. As metformin extraction in the dog appeared to be less than estimated RPF, Clearance above, we assumed that drug mass decrease was from renal excretion. Thus, at 72 h on average 78.9% was GPC model renal excreted. Sheen and Sirtori et al. list human urinary metformin recoveries at 60 to 72 h, i.e., 86% and 78.9%—see Table 1 of [19] and Table II of [20]. A 99.9% 48 h urine recovery from IV metformin was calculated by Pentikäinen et al. [21], but may have been inflated from a graphic estimation of the location of maximum cumulative urine mass, thus subject to error as suggested in Fig. 10 and surrounding text. Overall, the drug excretion implied here was similar to published human values, such that we cannot reject the hypothesis that the 72 h GPC model excreted metformin estimates were accurate.

Metformin half-life

As summarised in Table 5 by collecting metformin in human urine Tucker et al. found 4.2 times longer mass than concentration half-life at 12 h [22] and Pentikäinen et al. [21] found a ratio of 5.1 times at 10 h. Collecting urine at 8 h Sirtori et al. found a ratio of 11.1 times [20]. During once a day multidosing of oral metformin in a PBPK model Robert et al. found an 8.8 times longer metformin half-life in erythrocytes, a mitochondrial depleted tissue marker, than in plasma [13]. In Table 5, the tissue (or urine) drug mass to plasma (or serum) ratios of half-lives averaged \(7.3\pm 1.6\ \hbox {SEM}\). The GPC model ratios averaged \(7.3 \pm 0.1\) SEM. The average E2 model half-life function ratios in this series at the same times (\(8,10,12,72\ \hbox {h}\)) would be 3.2, 1.9, 1.3, 1.0, which ratios are not as large as any of the published or GPC half-life ratios. Note that light-tailed modelling does not accommodate mass to concentration half-life ratios persistently greater than 1 until at least 168 h as RBC versus plasma \(t_{1/2}\) in rats [4].

Table 5 Metformin \(t_{1/2}\) in plasma [P, C(t)] and urine [U, M(t)] as \(t_{1/2};C(t)\) and ratios of \(t_{1/2}\) of M(t) and C(t)

The GPC model half-life of metformin serum concentration, Eq. (28), is asymptotically proportional to elapsed time, and predicts this trend with approximately twice the half-life observed for the exponential models in this series and in Table 5. This latter is not dissuasive as Fig. 4 clearly shows biexponential underestimation of late time-samples, which is consistent with underestimation of terminal half-lives for E2. Examination of the plot of median \(t_{1/2}\) of grouped time-samples of metformin concentration half-lives in Fig. 9 provided strong evidence that longer data collection times yielded longer half-lives. Table 5, Figs. 9 and  8 show that \(t_{1/2}\) of the mass and concentration, as first documented here, increased enough with elapsed time to provide a modelling agreement with experimentally observed ratios like \(\frac{t_{1/2};M(\infty )}{t_{1/2};C(\infty )} =\frac{\alpha +1}{\alpha }>1\) from persistent redistribution, \(V'_d(\infty )= \frac{ CL }{\alpha }\), i.e., old observations only now explained using new modelling concepts.

A dosing strategy for possible use for therapy

In human 850 mg three times daily metformin for 5 days, in comparison to a single dose, Sambol et al. noted drug effect versus no (or lesser) effect for a single dose [24]. Also noted were the effects with elapsed time noted above, increased \(C_{max}\), higher \(V_{d}\) consistent with drug mass accumulation, and a prolongation of \(t_{1/2}\) from 7.2 to 19.8 h. The tentative explanation for this was ’The \(V _{d} \)...is probably larger after multiple doses...’ Moreover, the lethargic pace adopted clinically of two weeks before changes are made to metformin dosing, the several months allotted to measuring outcomes in clinical trials [61], the relative sequestration of metformin into erythrocyte tissue versus plasma with elapsing time [13], the decreasing fasting plasma glucose for at least 8 weeks of multidosing [39] are consistent with the expanding apparent volume of drug distribution, increasing half-lives in time, and increasing sequestration of drug in tissue that form the basis for the “Simulated dosing regimen for constant drug mass in the body”. Although diabetes, as present in Sambol et al.’s patients may reduce glucose transport and retard the time course of drug effects, another hypothesis would be that the majority drug effect is retarded due to slow tissue accumulation in all subjects, including our normal dogs. In any population, a faster initiation of therapeutic cytosol drug content may be available by using the dose loading method hypothesised in this paper, which may find eventual relevance for metformin dosing for cancer chemotherapy and/or other indications.

Synopsis

Sum of exponential terms models have non-zero initial volumes problematic for calculating \(t_{1/2};M(t)\). For example, a monoexponential model has the same concentration half-life as its half-life of drug mass, \(\frac{\ln (2)}{\lambda }\), and as \(V'_d(t)=0\), there is no volume growth (or redistribution). Using monoexponentials in 1959, Walser et al. [62] found a multiple day stubbornly persistent delay between plasma disappearance of radiolabeled urea and its appearance in urine of approximately one hour. This implicates ongoing redistribution. It is only hindsightFootnote 8 that resolved the paradox in assuming no redistribution and finding irreducible redistribution. This adynamical constraint on apparent volume growth is only transiently mitigated by the inclusion of additional terms in a SET function, e.g., to 21 h for our E2, see Fig. 8.

Herein, fat-tailed metformin models offered for the first time, results consistent with the observed half-life of mass appearance in urine as multiple times the serum drug concentration half-life values with drug mass versus serum drug concentration half-lives which agreed by their ratios and amounts to otherwise irreconcilable results. There were plausible drug masses at 72 h, a plausible explanation for increased \(V_d\) and \(t_{1/2}\) with multidosing with a plausible strategy for better multidosing for drug effects, and more plausible clearance-values than from the other methods tested. All of these things suggest potential usefulness for this type of modelling for extracting information and prediction that is problematic using current models for metformin kinetics.

We sought to determine why metformin half-lives did not appear to agree between publications, to develop a hypothesis concerning why metformin control of plasma glucose levels had delayed onset, e.g., following 4-weeks of oral dosing [16], and to speculate concerning the lack of a direct correlation between drug effect and blood metformin concentrations [17]. A single explanation for these phenomena may very well be that the mass half-life is persistently longer than the serum half-life and that mitochondria and cytosol are absorbing much of this mass, although reversibly. All these findings are consistent with high affinity of the drug in or in proximity to the sites of drug action.

Limitations

The Introduction Section listed a number of references to drugs with heavy tails [5, 27,28,29,30,31,32,33,34, 63,64,65,66]. However, the potential clinical penetrance for fat-tailed data-spanning models is not known. For example, a power function tail for metformin does not seem to have been previously identified. There was no evidence for nonlinearity for the intravenous route of administration at the drug levels in this study. However, nonlinearity might occur at higher drug levels. A PBPK-type study where blood, urine, and tissues are all collected in an animal study may be appropriate for further investigation. Only a prospective study that measures both drug mass and drug effect can be used to validate how useful the drug mass with drug effect relationship hypothesised for the GPC model is for metformin.

The assay used allowed for 72 h of serum data collection [67]. Due to a technical error, the earliest time-samples were at 20 min [1]. Approximately 20.4% of the AUC occurred before 20-min—see Table 4. This was verified indirectly by comparing the \( CL \) to renal plasma flow as estimated from metadata, implying that a lot of this early AUC would be missed using NC methods. Due to this early time back-extrapolation some parameters may be somewhat imprecise but not others, e.g., \( AUC _0^{20\text {-min}}\) but not \(t_{1/2};C(t>20 \text {-min}\)). The last two time-samples were erratic especially for dogs 3, 4 and 5 but not for dogs 1, 2, 6 and 7 as seen in Fig. 1. The apparent increased magnitude and variability of some of the last two time-samples had no clear cause and was not prior identified [1].

The GPC model with simplex method global search optimisation has a long run time, and further refinement of the algorithm for speed would be useful. As a new PK model convergent to a power function tail, the GPC model has only been applied to metformin. How well the GPC model would estimate the plasma/serum concentrations of other intravenously injected substances is unknown at this time. In common with other first applications of a fit function for intravenous drug dosing, there was no modelling of oral dose data, tissue drug concentrations, or correlation with simultaneously measured drug excreted in urine, the applicability is only for linear systems, population kinetics were not explored in the usual sense, and the fitting algorithm used does not allow for case-wise error statistics of parameters, per se, necessitating calculation by other means, for example, as appears in the “Parameter errors from model-based bootstrap” Appendix Section.

Conclusion

Fat-tailed models of metformin concentration were fit to data from 7 dogs. The fat tails altered the usual context of some pharmacokinetic parameters. Metformin was cleared at 84.8% of estimated total renal plasma flow. At 72 h there was 21.1% unexcreted metformin. Half-lives eventually became linear functions of the sample-times. The mean ratio of mass to concentration \(t_{1/2}\ 95\%\) converged to 7.0 by 9.8 h. Multidosing for constant drug mass rather than serum drug levels was simulated and may allow for an earlier onset and better maintenance of drug effects but is in need of validation.