INTRODUCTION

Skin is the largest organ of the human body and is composed of three readily distinguishable layers: (a) the stratum corneum (SC), which is the outermost part of the epidermis, (b) the living epidermis, and (c) the dermis, penetrated by a highly complex network of capillaries involved in the removal of drugs from the skin into the systemic circulation. In addition, several pilosebaceous and sweat glands are dispersed throughout the skin, in various numbers and size, depending on body site. The SC is well-recognized as a heterogeneous two-compartment system of flattened keratinized cells embedded in a multilamellar lipid matrix mainly composed of neutral lipids and ceramides (14). It has a thickness of about 10 μm (57) and is composed of about 20 cell layers (5). Its main homeostatic functions are restriction of excessive water loss to the external environment, moisture retention, cell cohesion and desquamation. In addition, SC is considered to be the rate-controlling membrane for transport of xenobiotics across the skin (although, for very lipophilic molecules, partitioning into the more hydrophilic viable tissue may constitute the greater resistance (2,8)). Therefore, topical skin bioavailability assessment of xenobiotics is essential for both the (trans)dermal delivery of pharmacological active drugs and from a toxicological point of view (9).

For systemically delivered oral products, bioavailability is typically defined as the rate and extent at which a drug reaches the general circulation from an administered dosage form. Dermatological drug products include a broad array of preparations which are designed to exert a local effect in diseased skin following topical application on the skin surface. The objective is to maximize drug concentration at the site of action within the skin with, ideally, a minimal systemic uptake (10,11). Thus, systemic availability may not properly reflect local cutaneous bioavailability (as it does for transdermal products, on the other hand, which are designed to deliver drug into the systemic circulation). Moreover, topical doses tend to be so small (typically 2–5 mg of product/cm2) that serum and/or urine concentrations are often undetectable using conventional assay techniques (12). A further complicating factor is the lack of knowledge of the drug concentration needed at the skin target site (with the exception of antifungal agents whose target site is the SC surface). Topical bioavailability has been more rigorously defined as the temporal pattern of free drug concentration at the target site [the so-called C* concept (10)], but the value of this approach remains largely theoretical due to the difficulty of quantifying drug within the skin.

What options are available, then, for assessment of topical bioavailability and bioequivalence? Although a drug’s permeability coefficient through human skin can be estimated, for example, from the molecular weight and octanol–water partition coefficient (13), this information is not sufficient to estimate topical bioavailability. The algorithms available are able to approximate a value for the maximum drug flux across the skin, but cannot quantify the potential impact of particular vehicles (or vehicle constituents) on skin barrier function (14). Equally, these calculations assume that the thermodynamic activity of the drug remains constant during the application period, a situation which does not pertain, of course, when one or more formulation component is volatile (15) or the dose administered is very small (i.e., a finite dose situation).

It follows that an experimental approach is necessary, and the objective of this review is to evaluate different techniques currently under consideration. We begin with a discussion of the different procedures which are accepted by the regulatory authorities at this time, and which contribute to the approval of innovative, and/or generic, topical drug products.

CURRENT OPTIONS FOR TOPICAL BIOAVAILABILITY/BIOEQUIVALENCE ASSESSMENT

As mentioned above, for topical dermatological products, pharmacokinetic measurements in blood, plasma or urine are usually not feasible because of the very low concentrations achieved in these typical sampling compartments. There is also debate as to whether measurable levels in the blood (when detectable), for example, are relevant to local, topical drug bioavailability. On the other hand, the cumulative amount of drug reaching the systemic circulation is pertinent with respect to potential toxicity and may indicate the need to limit topical application to avoid undesirable side-effects.

For certain topical drugs, specifically the corticosteroids, pharmacodynamic measurements represent an accepted approach with which to establish bioequivalence between different formulations (and are used, as well, in the development of new chemical entities in the same therapeutic class). These drugs produce skin blanching at the site of application and this response has been correlated with clinical efficacy (16). The original Stoughton–McKenzie vasoconstriction assay (17) employed visual evaluation of the degree of blanching by trained observers. As this approach is rather subjective (18), it is now considered sensible to use a chromameter to quantify the blanching response (1922). This method has been reviewed in detail elsewhere (20).

While the advantages of the vasoconstriction assay for topical steroids are clear, the approach has the obvious limitation that it cannot be applied to all drugs. Indeed, the literature contains very few other examples in which pharmacologic response has been used to assess topical drug availability. Evaluation of vasodilatation provoked by nicotinic acid esters, and its inhibition by anti-inflammatory drugs, has been performed using laser Doppler velocimetry and chromametry (2327). The reduction in erythema induced by topical ibuprofen has been correlated with drug levels in the epidermis (27). Equally, topical retinoids have been shown to provoke increases in transepidemal water loss in a time- and dose-dependent fashion (2830), although the generality of this effect has not been established (29).

It follows that, at the present time, topical drug bioavailability and bioequivalence must be assessed via clinical studies, which are typically costly and time-consuming (31); regulatory agencies have therefore been seeking alternative methodologies, especially for bioequivalence studies to ease the burden on those attempting to commercialize useful products at competitive prices. While much debate has centered on the potential of different in vitro techniques, in particular the use of Franz diffusion cell studies to compare different formulations, these methods have not been accepted as useful contributors to the overall determination of bioavailability and bioequivalence. Currently, the relevance, reliability and reproducibility of the in vitro approach have not been demonstrated to the level necessary for the regulatory bodies; major concerns centre around the source and supply of human skin tissue, the ‘quality-control’ of the experiments, in general, and the absence of an effective clearance mechanism for drugs (especially lipophilic species) from the membrane. Only for demonstrating the benign effect of minor formulation changes are in vitro experiments, using model, polymer membranes, authorized by, for example, the US Food and Drug Administration.

Nevertheless, there remains a real need to develop valid, effective and economical approaches with which to measure topical drug bioavailability, not least for the evaluation of bioequivalence when generic products are submitted for marketing approval. The remainder of this review examines, primarily, two methodologies currently under close examination for the measurement of topical bioavailability and bioequivalence: namely, the tape-stripping and microdialysis techniques.

TAPE STRIPPING

Tape stripping involves sequentially removing microscopic layers (typically 0.5–1 μm) of stratum corneum. It is usually performed by placing an adhesive tape-strip onto the skin surface, followed by gentle pressure to ensure a good contact, and subsequently removal by a sharp upward movement. The procedure is relatively painless and non-invasive, given that only dead cells (corneocytes) embedded in their lipid matrix are removed. Even if skin stripping remains a form of environmental insult, a homeostatic repair response in the epidermis is rapidly elicited (32) which results in rapid restoration of barrier function (33).

Tape stripping is used in various fields of cutaneous biology: for example, to evaluate the barrier function (34), to investigate dermatopathologies such as inflammatory or neoplastic disorders or xerotic conditions (35), to monitor gene expression (36,37), to investigate pH profiles (38), to evaluate animal skin as a surrogate for human skin (39), to induce Toll-like receptor 9 expression (40) and so on (41). In dermatopharmacology, tape stripping is used to assess cutaneous drug or excipient levels in the skin after topical dermatological treatment, either in the removed tape-strips, or directly in the tape-stripped skin. It follows that tape stripping is a particularly useful technique to assess the local bioavailability of drugs whose target site is the SC itself (such as antifungal agents (4346), UVA/UVB filters (4751) or antiseptics (52).

The tape stripping technique may also offer a real alternative for local bioavailability and/or bioequivalence assessment of topically applied dermatological agents whose target is the underlying viable tissues. As the SC is usually the principal resistance to the penetration of topically applied compounds (4), it has been argued that drug levels in the barrier should be correlated with those attained in the underlying, viable components of the skin, i.e., the sites at which many dermatological diseases are manifest.

This hypothesis was tested, but only partially validated, by Rougier et al. (5356). In both animal models and human volunteers, for a range of compounds (mostly low molecular weight, low lipophilicity) delivered from several different formulations, the amount of active in the SC after a 30-min application, and subsequent removal of any remaining vehicle, was determined by tape-stripping. At a contra-lateral position, the same 30-min application and cleaning procedure was adopted but, rather than tape-stripping the SC, the penetration of chemical taken up into the skin was quantified by its urinary excretion over the next 4 days. Good correlation between the two measurements was achieved (see Fig. 1) suggesting the value of the tape-stripping approach for a simple and rapid determination of potential systemic exposure. However, the experiment only solves part of the bioavailability challenge: it provides a measurement of the extent of absorption but gives no information on rate. As such, in its originally developed form, the so-called Rougier technique is not suitable for regulatory use.

Fig. 1
figure 1

Correlation between the amount of different chemicals absorbed across skin following a 30-min application and the quantity recovered in SC tape-strips after an identical, but independent, administration procedure. Redrawn from [52].

An alternative strategy to test the hypothesis is to demonstrate correlation between drug levels measured in the SC by tape-stripping and a pharmacodynamic effect in the skin. This has been accomplished in some necessarily limited studies using (a) hydrocortisone (57) and betamethasone diproprionate (16,20), the pharmacological activity of which was assessed by skin blanching, (b) miconazole and ketoconazole, the efficacy of which against C. Albicans in vitro was determined (42), and (c) iododeoxyuridine, for which clinical efficacy in reducing lesion severity was evaluated (58). Ultimately, of course, these types of validation studies are essential if a tape-stripping approach is to be adopted for routine bioavailability and bioequivalence assessment. For the moment, however, the field is (appropriately) examining how to optimize the tape-stripping procedure so as to extract the maximum information from the experiment in the most efficient manner possible. These advances are now discussed in detail.

Method Development and Optimization

Sequential tape-stripping of the stratum corneum allows horizontal fractions of the membrane to be obtained. The tape-strips must then be extracted to recover and quantify the absorbed drug. Local bioavailability may be assessed either from the combined tape strips, or from the individual tape-strips. The method of quantification depends, of course, on the nature of the drug and the amount present on the tape-strips. Various approaches have been used ranging from UV-VIS spectrophotometry (59) to the more ubiquitous HPLC (with UV, fluorescence, or even mass spectrophotometric detection), gas chromatography (6062), scintillation counting (42,57,63) and infrared spectroscopy (6467). The key criteria are that the extraction process does not degrade the drug, that it is efficient and reproducible, and that it is free from interference from components of the SC and/or the tape adhesive. Quantification of the drug in the combined tape-strips enables the total amount in the stratum corneum to be determined.

After much discussion (31,6871), a draft guidance was published by the US Food and Drug Administration (FDA) in 1998 (21) in which the general procedures for conducting a bioavailability/bioequivalence study were described. Briefly, the guidance proposed the following steps:

  1. 1.

    After application of a drug product for a given application time, excess formulation was to be removed by an appropriate (but not specified) procedure.

  2. 2.

    Two sequential tape-strips of the SC were to be taken and discarded, the guidance arguing that material located on these samples had not been, and would not be, absorbed.

  3. 3.

    Subsequently, ten strips from the application site were to be made, combined and extracted with a suitable solvent.

  4. 4.

    The entire process was to be performed at multiple sites following different times of application (absorption phase, Fig. 2a) and different periods between the longest application time and the time at which stripping was performed (elimination phase, Fig. 2b).

  5. 5.

    The total amount of drug in the SC was then to be displayed as a function of time as a so-called dermatopharmacokinetic (DPK) profile, characterized by a maximum amount, the time at which this maximum was achieved, and an ‘area-under-the-curve’ (Fig. 3).

  6. 6.

    Simultaneous comparison of two formulations of the same drug would allow an assessment of local bioequivalence using the representative DPK parameters.

Fig. 2
figure 2

Schematic representation of a dermatopharmacokinetic (DPK) study to assess topical bioequivalence between test and reference formulations containing the same drug. a In the uptake, or absorption, phase, the SC is tape-stripped immediately after each treatment time, and the drug level in the barrier is determined. b In the clearance phase, after the maximum treatment time, the SC is subsequently stripped after progressively longer periods post-removal of the formulations. CTR = control, drug-free formulation.

Fig. 3
figure 3

DPK profiles of reference and test formulations containing the same drug. The parameters to be compared are the maximum drug level in the SC (C max ), the time at which this maximum level is reached (T max ) and the area under the SC quantity versus time profile (AUC).

Although useful as a ‘jumping-off’ point, the guidance was open to a number of criticisms, and exposed some clear flaws. In brief, while many of these problems have now been recognized, but not fully resolved, the document has therefore been withdrawn and, at the time of writing, the possibility of a refined version being developed is under examination. Among the weaknesses of the original guidance, one may cite, in particular:

  1. (a)

    The timing of the tape-stripping procedures during the absorption and elimination phases is not clearly delineated and (apart from trial-and-error) no rational approach to their determination is offered.

  2. (b)

    The number of tape-stripping procedures is considerable, rendering a DPK evaluation onerous in terms of the work involved (despite the fact that ‘pooling’ tapes for analysis does partly reduce this burden).

  3. (c)

    The unvalidated discarding of information from the first two tape-strips has never been subjected to rigorous examination; indeed, it has been argued that these data are relevant to a complete bioavailability assessment (46).

  4. (d)

    Ten tape-strips do not remove the same amount of SC from all subjects (or even, potentially, within a subject, especially if one of the formulations being compared contains an excipient which weakens SC cohesivity). As a result, drug will be measured in different ‘volumes’ of the matrix, rendering comparisons between different treatments/subjects essentially meaningless.

Nevertheless, there have been some reports of success using the DPK procedure as outlined in the FDA guidance (or at least part of it); one particularly good example showed a correlation between both the area-under-the-curve and maximum concentration parameters and the clinical efficacy of tretinoin gels (Fig. 4 (72)). Less encouraging, and ultimately responsible for the DPK guidance’s withdrawal, was an inter-laboratory comparative study (again using tretinoin) that resulted in diametrically opposite conclusions about the bioequivalence of different formulations (7375).

Fig. 4
figure 4

DPK profiles (means of n = 49) of three (0.025% v/v) tretinoin gel products in human stratum corneum (SC). Solid line = reference product (Retin-A); dotted line = generic formulation; dashed line = Avita gel (redrawn from (72)). At the time points indicated with an asterisk (*), a significant difference between Avita and both the reference and gel products exist (analysis of variance, Fisher’s protected least significant difference, p < 0.05).

The outcome of this initial work has been a serious re-evaluation of the overall worthiness of the approach and an ongoing examination of how it may be improved. The need to quantify (or, at least, to have some idea of the amount of) the SC removed is now clear, not only for the reasons already given, but because it is obvious that different tapes and different experimentalists will inevitably strip the SC in a variable fashion, making inter- and intra-laboratory comparisons impossible. Even when the type of tape is standardized, and an attempt is made to exert an equal pressure on each tape applied to the skin (76), the SC amount removed still depends on, for example, the speed with which the tape is subsequently removed from the subject and exactly where on the forearm the experiment is performed. As standardization appears very difficult to achieve, therefore, it is clearly more sensible to identify a method by which the SC removed can be quantified, thereby allowing the uptake of drug into this tissue to be normalized (and expressed, for example, as an amount per unit volume, or per unit weight, of SC). For the moment, these uncertainties mean that the minimum number of subjects required for a bioequivalence study cannot be specified.

Another, but more invasive (and probably less acceptable) option is to use a procedure which removes all the SC at once. In this case, cyanoacrylate adhesive is applied to a glass slide which is then pressed firmly to the skin for about 30 s. Subsequent removal in one quick movement removes essentially all of the SC in one complete sheet (77,78). However, validation of the method for bioavailability/bioequivalence studies has not been reported.

Various approaches to quantify the SC amount removed have been considered but, at this time, no single method has achieved universal acceptance. The most straightforward procedure (and, in fact, the one needed to calibrate all others) is to weigh each tape-strip before and after SC removal and to determine the amount (m) of tissue removed from the difference between these weights. As the area (A) of SC stripped is known, and the density (ρ) of the tissue has been published (5), it is possible to convert the SC weight into a depth, or distance (x), into the barrier: x = m/(A·ρ). Additional standardization is then possible by including in the experimental design, concomitant measurements of transepidermal water loss (TEWL) during the tape-stripping process (this can be performed either at the drug product application site, or at a position adjacent to the treated skin when the formulation contains volatile or occlusive excipients that can interfere with the TEWL measurement). Baseline TEWL (TEWL0) across unstripped SC of thickness L is given by Fick’s first law of diffusion:

$$ {\text{TEWL}}_{0} = \frac{{D \cdot K}} {L}\Delta C $$
(1)

where D and K are the diffusion coefficient of water in the SC, and the SC-viable tissue partition coefficient of water, respectively, and ΔC ∼1 g·cm−3. After tape-stripping has removed a depth x of SC, the TEWL will have increased to a new value given by:

$$ {\text{TEWL}}_{x} = \frac{{D \cdot K}} {{{\left( {L - x} \right)}}}\Delta C $$
(2)

Inversion of this second expression yields a linear relationship between (TEWL x )−1 and x, and the intercept of this line on the x-axis equals the SC thickness (L) (Fig. 5 (79)).

Fig. 5
figure 5

a Progressive increase of transepidermal water loss (TEWL) as a function of the absolute depth (μm) of stratum corneum removed. b The data in a, re-plotted as a linear relationship between (TEWL)−1 and the depth of SC removed. The x-axis intercept equals the SC thickness.

With this information in hand, all SC stripping data can now be expressed, not in terms of “amount per ten strips,” or even as amount per volume of SC, but as an amount per normalized fraction of SC removed (x/L), a strategy which allows results from disparate subjects of different SC thicknesses to be rationalized (Fig. 6 (80)). While this appears to be a sensible, general solution to the problem, it is important to point out one caveat: as the tape-stripping procedure and the TEWL measurements require a finite amount of time, analysis of the data for rapidly permeating compounds may be complicated by the fact that the time to obtain the tape-strips, and to record the changes in TEWL, may be similar to the application time, and the drug’s concentration gradient may therefore shift during the data collection process (65).

Fig. 6
figure 6

a Increase in transepidermal water loss (TEWL) as a function of the amount of SC thickness removed (x) by serial tape-stripping (n = 20). b Increase in TEWL as a function of SC thickness removed (x) normalized by the corresponding intact membrane thickness (H) for each individual.

Other methods to quantify the SC removed by tape-stripping have been proposed and demonstrated. Clearly, the gravimetric approach is not without problems (static electricity on the tapes, misleading weights due to uptake of formulation excipients, small amounts of SC removed, tediousness, etc.) and a more direct assessment would be useful. Progress has been made both with respect to the use of a protein assay to quantify keratin on the tapes, and via a spectrophotometric method evaluating the scattering/absorbance of UV/visible radiation; the latter procedure also has the potential to quantify drug on the tape-strip, assuming that an absorbance band distinct from those of the SC can be identified. Both techniques have the potential to be more ‘high-throughput’ once sufficiently optimized; however, for the moment, neither approach has been perfected, rejection of individual data has not been eliminated, and both rely upon the ‘gold standard’ weighing measurement for calibration (8186). Another option, which deserves a final word in this context, is the application of infrared spectroscopy (IR), particularly its use in vivo via the attenuated total reflectance (ATR) mode. In this approach, it is possible to semi-quantitatively assess an IR-active drug (assuming there is at least one absorbance within its IR spectrum distinct from those in that of the SC) either on a series of tape-strips or in the outermost micron or so of SC which has not yet been stripped from the skin (44,45,8791) (Fig. 7).

Fig. 7
figure 7

Schematic diagram of an attenuated total reflectance (ATR) device mounted on a Fourier transform infrared (FTIR) spectrometer. The sampling area of the crystal is made to contact with the treated skin site or the tape-strip. IR radiation from the spectrometer is deflected by a first mirror to the ATR crystal, in which it undergoes total internal reflection, penetrating superficially into the skin, before exiting the crystal and being re-directed to the detector.

The method has often been applied to a model permeant, 4-cyanophenol (the C ≡ N (nitrile) absorbance from which is particularly intense and occurs at a frequency where the SC is essentially IR-transparent (6467)), and also to other more relevant substances such as the anti-fungal drug, terbinafine (45). The IR spectrum of SC also contains information about the amount of endogenous lipid and protein present as a function of depth into the barrier (92), although this feature has not been explored as a potentially simple approach to quantify the amount of tissue removed on each tape-strip. Recent developments in IR imaging coupled with ATR, and the application of chemometrics to better “tease out” spectral information, may prove to be useful avenues for further research in this regard (93). However, even supposing the existing limitations can be overcome, the principal weakness of IR remains: namely, its relative insensitivity compared (for example) to analysis of extracted tape-strip samples by HPLC.

Interpretation of Tape-Strip Data

When drug and SC are quantified on each tape-strip removed post-treatment of the SC, and TEWL measurements have been made to determine SC thickness, it is possible to display the concentration of the permeant (C x ) as a function of its relative depth (x/L) into the barrier (Fig. 8). Assuming Fickian diffusion and the following boundary conditions:

  • the ‘dose’ of the drug applied is infinite—that is, the drug concentration in the vehicle (C v ) remains constant throughout the application time (t)

  • the SC initially contains no drug (C x  = 0 at t = 0)

  • the viable skin provides a ‘perfect sink’ for the penetrating drug (C x  = 0 at x = L for t ≥ 0)

then the profile is given by (46,64,94):

$$ C_{x} = K \cdot C_{v} \cdot {\left\{ {{\left( {1 - \raise0.7ex\hbox{$x$} \!\mathord{\left/ {\vphantom {x L}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$L$}} \right)} - \raise0.7ex\hbox{$2$} \!\mathord{\left/ {\vphantom {2 \pi }}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$\pi $}{\sum\limits_{n = 1}^\infty {\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 n}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$n$}\sin {\left( {\raise0.7ex\hbox{${n\pi \cdot x}$} \!\mathord{\left/ {\vphantom {{n\pi \cdot x} L}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$L$}} \right)} \cdot \exp {\left( {\raise0.7ex\hbox{${ - D \cdot n^{2} \cdot \pi ^{2} \cdot t}$} \!\mathord{\left/ {\vphantom {{ - D \cdot n^{2} \cdot \pi ^{2} \cdot t} {L^{2} }}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{${L^{2} }$}} \right)}} }} \right\}} $$
(3)

where K is the SC-vehicle partition coefficient of the drug and D is its diffusivity through the barrier. When data like those in Fig. 8 are fitted with the above expression, it is possible to derive two parameters, K and D/L 2, which characterize drug uptake into the SC from the applied formulation. These physicochemical parameters are relevant to DPK in that they report, respectively, on the extent and the rate of drug uptake into the SC: the partition coefficient (K) reflects the relative affinity of the drug for the SC compared to the applied formulation and is related to the extent of absorption, therefore; the ratio D/L 2 has units of (time)−1 and can be considered as a first-order rate constant for drug transport through the SC—a measure of absorption rate, in other words. Integration of the concentration profile equation yields an “area-under-the-curve” (AUC (46)):

$$ {\text{AUC}} = {\int\limits_0^1 {C_{x} d{\left( {\raise0.7ex\hbox{$x$} \!\mathord{\left/ {\vphantom {x L}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$L$}} \right)}} } = K \cdot C_{v} {\left\{ {\frac{1} {2} - \frac{4} {{\pi ^{2} }}{\sum\limits_{n = 0}^\infty {\frac{1} {{{\left( {2n + 1} \right)}^{2} }}\exp {\left( { - \frac{{{\left( {2n + 1} \right)}^{2} \cdot \pi ^{2} \cdot D \cdot t}} {{L^{2} }}} \right)}} }} \right\}} $$
(4)

which represents the total amount of drug in the SC at the end of the application period (t). Together, K, D/L 2 and AUC offer DPK parameters that can be used to objectively compare topical drug bioavailabilities from different formulations. They also allow differences in bioavailability to be explained mechanistically; for example, to answer the question whether a putative penetration enhancer elicits improved drug delivery by increasing partitioning into, or by facilitating diffusion through, the barrier. This capability has been illustrated using the anti-fungal drug, terbinafine (Fig. 8 (44)).

Fig. 8
figure 8

Terbinafine (TBF) concentration versus SC depth profiles in vivo (n = 5). The individual data points and the best fits of Eq. 1 to the results are shown following application a in the control formulation (EtOH-IPM, 50:50), and b in the same formulation containing 5% oleic acid. Redrawn from 46).

Finally, it is worth pointing out that once K and D/L 2 have been evaluated from an experimentally determined C x versus x/L profile, they can be used with the equation above to predict AUC at all other times of application (t). Providing that the parameters do not change with time [which, it must be admitted, is not always the case (45)], this means that the complete absorption profile of the drug can be derived from a relatively short-duration experiment. Again using terbinafine, some preliminary validation of this idea has been achieved [Table I (46)]. A more extensive effort, with ibuprofen as the delivered drug, has confirmed this earlier finding (94). In addition, this data analytical approach has value with respect to experimental design; with knowledge of D/L 2, the classic lag-time for diffusion (L 2/6D) across the SC can be calculated, as well as the time necessary to reach steady-state transport (∼2.7·L 2/6D (95)). This information should permit more efficient DPK studies to be performed with tape-stripping timed to provide the maximum possible discrimination, for example, between the different drug products under evaluation.

Table I Comparison Between the Experimentally Determined Values of AUC (mean ± SD; n = 4) Following 2 and 4 h of Terbinafine Application, and the Predictions Based Upon K and D/L2 Results Determined from the Data Obtained after a 30-min Application

Tape-Stripping—Unresolved Issues

Despite the clear value of a DPK approach based upon tape-stripping, there remains work to be done before its usefulness can be fully exploited. First and foremost, the method has to be validated appropriately via clear demonstrations that significant differences in drug uptake into normal SC translates into clinically distinguishable scenarios in the real world; that is, the hypothesis that the SC concentration profile reflects drug availability at the site of action must be confirmed. Logically, this will be more straightforward to accomplish with drugs having their site of action on or in the SC (e.g., anti-fungals), and it is likely that, at least initially, the case will have to be made on a drug class by drug class basis. Although it is clearly expedient to move quickly to an easier approval path for generic topicals, the value and relevance of any approach adopted must be recognized by the dermatologist and must be seen to translate into a measurable clinical outcome.

Second, the procedures to be followed, and the DPK protocol itself, have to be at least partially standardized. Quantification of the SC removed is mandatory and there is an important need to develop a simple and rapid method for this purpose. With such knowledge, complete removal of the SC is not necessary for comparative purposes, thereby allowing for a less invasive procedure; further, this means that the method does not have to be restricted to one particular adhesive tape, the choice of which can be based on methodological criteria such as ease of drug extraction, absence of components that may potentially interfere with the drug assay, and allowing removal of the SC with a reasonable number of strips (∼10–15). Cleaning the treated area of the SC prior to tape-stripping is important and must be capable of removing excess formulation efficiently without inadvertently ‘driving’ drug into the barrier. This includes displacement of vehicle from the natural “furrows” in the skin (91), the depth of which can be significantly greater than the SC thickness. Debate continues as to whether one or two tape-strips should be taken initially and discarded, the argument being that any drug lodged on these layers, even after cleaning the skin, would never be absorbed. For the moment, no consensus exists on this point but clearly it will be necessary to better define the procedures to be followed when cleaning the skin at the end of the application period. This issue is particularly important both for poorly penetrating chemicals (e.g., those of higher molecular weight) and for drug products administered in complex vehicles (such as those including liposomes or nanoparticles).

Third, the question of spatial localization of drug must be addressed; specifically, it has been recognized that the target for certain drugs may be a skin appendage (a hair follicle, or a sebaceous gland, for instance) and that particular formulations have been proposed for optimization of delivery to these structures. Exactly how tape-stripping might be used to compare vehicles which set out to achieve, for example, follicular targeting, has not been demonstrated. In this case, it would be necessary to show that SC levels are correlated with drug amounts in the appendage; however, if a formulation did specifically target a follicle, it might be logical to think that one would find less drug in the SC as a result. The problem is complex, therefore, and may need to call upon application of recent work which has attempted to deduce the contribution of follicular transport to total drug delivery by comparing SC uptake in normal skin to that in skin whose follicles have been physically sealed (9699). Obviously, further validation is required using drugs and drug products designed to act on these appendageal structures.

Finally, there is the question of the relevance of the DPK approach, which has been developed for use on normal skin, to drug performance on diseased skin. In part, this relates to the validation issue discussed above and the need to correlate DPK measurements with clinical outcome. There is also the outstanding unknown as to whether DPK, or a modified version thereof, can be adopted for use on diseased tissue. It is important to acknowledge, furthermore, that the methodology in its present state does not differentiate free and bound drug in the SC.

MICRODIALYSIS

During the last decade, microdialysis has been shown to be a promising technique for the assessment of cutaneous drug delivery (100,101). The technique is based on the passive diffusion of compounds down a concentration gradient across a semi-permeable membrane forming a thin hollow “tube” (typically a few tenths of a millimetre in diameter), which—at least, in theory—functionally represents a permeable blood vessel. Two kinds of probe are in common use: the so-called linear and concentric probes (Fig. 9). The former is the most prevalent design used because it is simple, thinner and inexpensive to manufacture from artificial kidney fibres (102104). The probe is implanted superficially into the dermis or subdermis, parallel to the skin surface via a guide cannula. The fibre is slowly perfused with a physiological solution, which equilibrates with the extracellular fluid of the surrounding tissue, exchanging substances smaller than the cut-off value of the membrane. The exchange of material is driven by passive diffusion (105), and the microdialysis technique may be used not only to monitor the extracellular concentrations of exogenous or endogenous compounds, therefore, but also to deliver drugs to a specific target tissue.

Fig. 9
figure 9

Schematic diagram of two currently used microdialysis probes for DPK experiments. In both cases, a semi-permeable dialysis membrane collects the absorbed free drug. a Single-lumen probe: the membrane is located in the middle of the probe, and the perfusate flow is mono-directional. b Dual-lumen probe: the membrane is on the tip of the probe, and the perfusate flow is bi-directional.

A key feature of microdialysis is its ability to continuously monitor the extracellular concentration of a drug in different compartments of the body. Indeed, a microdialysis probe can be implanted in virtually any body organ or tissue (106). Among the limited number of techniques available for the direct assessment of drug concentrations in dermis, cutaneous microdialysis is the only one that allows unbound analytes to be sampled. Hence, microdialysis provides a pharmacokinetic profile, with high temporal resolution, of the drug in the target tissue. As the level of unbound drug generally determines the pharmacodynamic response, the pharmacological relevance of the method is clear (Fig. 10).

Fig. 10
figure 10

Penetration of lidocaine from three different formulations into the dermis of eight volunteers measured by microdialysis (from (107)).

The feasibility of microdialysis to sample a drug in the skin has been demonstrated in animals and humans (101,107). Microdialysis has been shown to be useful (a) to assess free drug concentrations in the dermis or subcutaneous tissue, depending on probe depth, after topical application of a formulation, (b) to compare different formulations of a drug and assess bioequivalence/bioavailability, and (c) to measure whether therapeutic concentrations are reached in the skin.

Another important advantage of microdialysis is that no biological fluid is removed. In vivo, in man, multiple sampling sites can be simultaneously set up and used to determine, for example, drug penetration into the skin and the effect of different vehicles (107), cutaneous metabolism (108,109), and the impact of barrier disruption (110). This same flexibility has also allowed the impact of probe depth on the data obtained to be assessed (111,112), and has allowed the inherent variability of the method to be reduced (113).

The initial research evaluating microdialysis for topical delivery was mostly performed in rats, an animal chosen for its ease of handling rather than for its suitability as a model for human skin penetration. Proof-of-concept was established with 5-fluorouracil (103), and subsequent work evaluated, inter alia, the impact of formulation, the action of penetration enhancers, and the improved delivery possible with iontophoresis (111,112,114122)). A significant effort has also been made to correlate microdialysis recovery with barrier perturbation. Using salicylic acid as a model penetrant, uptake was shown to be proportional to barrier function as measured, for example, by transepidermal water loss (123), suggesting that the technique might be useful for assessing bioavailability in topical dermatoses.

In humans, the first applications of microdialysis involved the measurement of (a) ethanol absorption (124), and (b) cutaneous glucose levels (125,126). Subsequently, the transdermal delivery of nicotine released from a commercial patch has been investigated (127,128), and some correlation has been found between the plasma concentration and perfusate level versus time profiles. Less success was obtained with estradiol, however, with eight out of ten experiments revealing no detectable estradiol in the dialysate. Likewise, even after barrier disruption, betamethasone 17-valerate, and the highly protein-bound drug, fusidic acid, could not be successfully sampled (129). On the other hand, less hydrophobic and more quickly penetrating substances, such as non-steroidal anti-inflammatory drugs, have been monitored easily by cutaneous microdialysis (108,109,111, 130132). The cutaneous penetration of salicylic acid through normal and perturbed skin using microdialysis sampling has been shown to be more sensitive than non-invasive bioengineering techniques in detecting the effect of acetone treatment. A positive dose–response was also found between salicylic acid penetration and the degree of irritant dermatitis induced by sodium lauryl sulphate pre-treatment.

The skin penetration of various salicylate esters and their skin metabolism in dermal and subcutaneous tissue has been studied (109). The delivery of topical diclofenac was similarly demonstrated following a single application of a gel (130). The topical bioequivalence of lidocaine delivery from a microemulsion and from a conventional oil-in-water emulsion was assessed pharmacokinetically by microdialysis and the results were compared to pharmacodynamic measurements as well (107). Furthermore, microdialysis has been used to study iontophoretic delivery of propanolol, acyclovir and flurbiprofen (111,133,134). Additionally, the role of stratum corneum, and of dermal microvasculature perfusion, in the penetration and tissue distribution of the water-soluble drugs, acyclovir and pencyclovir, have been investigated by cutaneous microdialysis (135).

Microdialysis is claimed to be minimally invasive, provoking only a minor, reversible trauma caused by insertion of the guide cannula when implanting the probe (136138). Moreover, changes in the tissue surrounding the probe may affect the recovery and measurement of the analyte. Therefore, when using the microdialysis technique, an equilibration of 60–90 min following probe insertion is usually an integral part of the protocol (136,137). The effects of the probe insertion can be classified as (a) direct trauma to cells and tissues; (b) modification of blood perfusion due to an axon reflex; and (c) inflammatory or “foreign body” reactions to the probe (139). In vivo assessment of the skin after insertion of the microdialysis probe is usually performed using non-invasive measurements of TEWL, cutaneous blood flow, and histology (136139). The extent and duration of “trauma” to the skin can be modulated by the probe design and the use of a local anaesthetic. Guide cannulas with smaller dimensions generally provoke smaller reactions and the provoked increases in local blood flow (and histamine release) subside more quickly; the concomitant use of a topical anaesthetic cream can reduce the duration of vascular response to less than an hour (140142). Longer-term implantation has been studied in animal models and a general inflammatory response observed: infiltration of lymphocytes at 8 h, cell changes and cell attachment to the dialysate membrane at 24 h, fibrosis after a week, and scar tissue formation. However, apart from the initial reaction on probe insertion, tissue responses are relatively minor throughout the duration of a typical microdialysis experiment (∼8 h (103,138,143)).

Method Development and Optimization

Since microdialysis operates under non-equilibrium conditions, the concentration of analyte in the dialysate is lower than that in the extracellular fluid surrounding the probe. Relative recovery has been defined as the ratio of these concentrations, and should be independent of the absolute, free concentration of the compound in the skin (assuming no binding to the dialysis membrane). Several approaches have been suggested for determination of the in vivo recovery of the analyte by the microdialysis probe: the stop-flow method (144), the extrapolation-to-zero flow technique (142), the point of no net flux (102), and retrodialysis (145). Extensive reviews of the various techniques have been published (105). Retrodialysis is probably faster and more convenient than the other alternatives. This, “internal reference technique” (146,147), introduces a marker into the perfusate and the rate of delivery of this compound is measured during the microdialysis experiment. Recovery (R) is then determined from:

$$ R = {\text{100}} \bullet {{\left( {C_{{{\text{in}}}} - C_{{{\text{out}}}} } \right)}} \mathord{\left/ {\vphantom {{{\left( {C_{{{\text{in}}}} - C_{{{\text{out}}}} } \right)}} {C_{{{\text{in}}}} }}} \right. \kern-\nulldelimiterspace} {C_{{{\text{in}}}} } $$
(5)

where C in is the concentration of marker perfused into the fibre and C out is the concentration determined in the sample. This method assumes, and/or demonstrates, that the recovery efficiency of the target analyte is quantitatively similar to the delivery of the retrodialysis marker. It follows that the marker should physicochemically resemble, as far as possible, the analyte of interest. An important advantage is that fluctuations in the recovery of the probe during the experiment are taken into account by continuous retrodialysis of the marker. Conversely, a significant limitation is that the presence of the marker perturbs in some way (e.g., competition for binding) the local disposition of the drug.

The relative recovery is influenced by the probe, the tissue perfused and the analyte of interest (100). To increase the relative recovery, the length of the probe and its molecular weight cut-off limit should be increased, while perfusate flow should be reduced. Typically, in dermal microdialysis, the probe length is ∼1–2 cm, and the flow rate between 0.5 and 5 μl/min. The molecular weight cut-off chosen is typically in the range of 5–20 KDa to avoid protein leakage into the dialysate. For dermal microdialysis, the linear probe design is preferred whereas, for subcutaneous implantation, both linear and concentric probes have been used.

Because the dialysis fibre excludes diffusion of proteins, the samples are sufficiently purified for further analysis. The principal challenge to the procedure is sensitivity, especially for the measurement of (a) lipophilic permeants which are slowly and poorly extracted into the microdialysis perfusate, and (b) compounds with high protein binding. For example, in earlier work, fusidic acid and betamethasone-17-valerate were not detectable by microdialysis (129). Although it has been suggested that recovery may be improved by introducing co-solvents (e.g., polyethylene glycol), or cyclodextrins, or lipids, into the perfusate, or by changing its pH, the advantages of such a strategy must be balanced against potential tissue alterations and their impact upon biocompatibility (148,149). Low rates of perfusion (0.5–5 μl/min) are mandatory for compounds with low recoveries, especially those which are highly protein-bound. Nevertheless, this means that the samples are of low volume and contain small absolute amounts of analyte. The analytical challenge is therefore considerable and demands sensitive techniques; HPLC may not be sufficient and the use of specific biosensors and mass spectrometry (as well as other techniques) has been reported (138,150).

To validate the use of microdialysis, as a method to assess bioavailability/bioequivalence, a systematic comparison with existing methods is essential. Generally speaking, when different formulations of a drug have been compared, microdialysis has determined the same rank order as other approaches. However, the absolute, measured amounts of drug absorbed are different (Fig. 11). For example, the quantity of drug found in the skin after an in vitro diffusion cell experiment is higher than that detected by microdialysis (103). The explanation for this observation is that the classic in vitro technique (a) measures both bound and unbound drug, and (b) does not reproduce the effects of local blood flow on absorption.

Fig. 11
figure 11

Local availability of ketoprofen applied topically from two gel formulations: assessment a in vitro in diffusion cells (upper panel), b in vivo by tape stripping (middle panel), and c by dermal microdialysis (lower panel; A.-R. Denet and V. Préat, unpublished data).

A particularly positive, and rather unique feature, of microdialysis is that the method can be used to evaluate drug permeation across both normal and diseased skin. Furthermore, in the same psoriatic patient, penetration at a specific lesion may be compared to that through unaffected skin. In addition, as well as measuring drug concentration, microdialysis may also have the potential to quantify a biomaker, or the concentration of another surrogate measure, for therapeutic activity. Concomitant pharmcokinetic and pharmacodynamic evaluations are therefore possible.

Interpretation of Microdialysis Data

It should be emphasized that data from microdialysis experiments can be analyzed to yield practical pharmacokinetic parameters. C max, T max, absorption constant, lag time and AUC can be determined from the plot of free drug concentration in the dermis as a function of time. For example, a one-compartment model has been used to differentiate between results from two formulations of lidocaine (107) via comparison of absorption rates and lag-times. In this case, prilocaine was used as a retrodialysis calibrator. In another elegant study, cutaneous microdialysis demonstrated that the skin/plasma ratio of 8-methoxypsoralen was orders of magnitude greater following topical administration of the drug (as compared to oral dosing). Evaluation of C max in the skin, furthermore, permitted optimization of the time of UV irradiation in PUVA therapy (151). Elsewhere, topical application of the antifungal fluconazole resulted in free dermal concentrations below the minimal inhibitory concentration (112). This ability to measure whether a therapeutic concentration of a drug in the skin has been attained after topical administration is a particular advantage of microdialysis. In addition, both the applied drug and any metabolites formed by biotransformation in the skin are, in theory, detectable by this method. However, only in a few exceptional circumstances [e.g., salicylate derivatives (109)] has significant metabolism been observed.

The number of subjects necessary for a microdialysis study to determine whether two formulations are bioequivalent depends on the degree of variability observed in the data. From a practical standpoint, it has been suggested that this problem should be tackled by ensuring that each subject acts as its own control (113). Under these circumstances, and performing duplicate measurements in every volunteer, ∼20 subjects are considered sufficient to distinguish two formulations within the classic 80–125% confidence interval. In contrast, most microdialysis bioequivalence studies in the literature have been performed with six to ten volunteers (107,110).

Microdialysis—Unresolved Issues

While many studies comparing microdialysis to other methods have shown reasonable correlation, further validation work is essential if this approach is to be recognized by the regulatory agencies as an option for bioavailability and bioequivalence assessment. One key issue in the development of a standardized protocol is the reproducible insertion of the microdialysis probe to a consistent depth within the skin. Probe implantation remains a technical challenge that is only mastered with patience and practice. Although the significance of probe depth on the resulting permeation profile is a matter of some debate, it is clear that implantation in the either the superficial dermis, or the deep dermis or the subcutaneous tissue can have an obvious impact on the data obtained (111,112).

A further limitation to the routine application of microdialysis, at the present time, is analysis of the very low levels of drug in the dialysate. The most successful demonstrations of the approach have been made with compounds which readily penetrate the skin and which have decent aqueous solubilities and low protein binding. Such molecules are rare, however, and are not always relevant to dermatological therapy. Corticosteroids, on the other hand, the vasoconstriction response to which might offer a useful validation ‘marker’ for microdialysis, have never been successfully monitored by the technique (presumably due to their low permeabilities, poor aqueous solubilities and high protein binding).

In addition, even though microdialysis may be reasonably termed “minimally invasive,” its use in vivo is typically limited to experiments lasting 5–10 h. For slowly permeating drugs, therefore, the quantities absorbed in this time will be minimal, and the analytical problems quite daunting.

OTHER APPROACHES

Finally, a few words about some alternative strategies, of which only confocal Raman spectroscopy is receiving significant attention at this time.

Skin Biopsy

Skin biopsies, whether to the level of the dermis (shave biopsy) or through to the sub-cutis (punch biopsy), are invasive and generally performed under local anaesthesia. While such methods have a place in dermatological surgery (to remove warts and small tumours, for example), their application for tissue sampling and analysis post-drug application has not attracted much attention as a routine approach for use in vivo. Despite the obvious advantage of offering a ‘snapshot’ of drug disposition in the different skin layers, the aggressive nature of the biopsy rules out any chance that it might be adopted as a standard procedure. Even attempts to minimize tissue trauma (131,152) fail to render the method remotely acceptable and its use in the foreseeable future will be restricted to animal and in vitro studies.

Suction Blister

Applying a partial negative pressure to the skin disrupts the epidermal–dermal junction and forms a blister which fills progressively with interstitial fluid and serum (153,154). This liquid offers a pharmacokinetic “compartment,” therefore, in which a previously applied drug can be sampled with a hypodermic needle and quantified; if multiple blisters are raised (as is possible with certain commercially available devices), then a concentration-time profile of the drug in the skin can be obtained. While on the surface attractive, this approach is also quite invasive and causes obvious scarring, albeit over relatively small areas of skin. The technique can be used to compare topical formulations in a reasonably objective way (155,156), but the potential binding of the drug to skin tissue, especially for very lipophilic species, may mean that very low levels are present, if at all, in the blister fluid (157). Taken together, this has meant that the procedure has not been widely used and, once again, is presently viewed as too invasive for practical and routine application in topical bioavailability experiments.

Follicle Delivery

Drug delivery to the follicle following application of formulations to the skin is an established fact. Although the follicles occupy a relatively small fractional area (on average, ∼0.1%) of the surface, it appears that diffusion via these appendages is fast relative to that through intact SC, and probably accounts for the first drug molecules which penetrate deep into the skin (158). Furthermore, the follicle is a bona fide target for skin diseases, such as acne and alopecia, and this raises the issue of drug bioavailability to this specific structure. For example, it has been claimed that certain particle sizes are optimal for sequestration in the follicle (159) thereby concentrating the drug at this site and subsequently providing a sustained release to the local environment. However, a method, with which to quantify such ‘targeting’, has been difficult to validate unequivocally and no approach can be claimed to have demonstrated practical utility for quantitative bioavailability/bioequivalence assessment to the follicle. Nevertheless, there have been some innovative ideas reported in the literature (77,160,161) and, more recently, somewhat simpler methodology has been described, with which the contribution of follicular transport to the overall permeation of a drug in vivo in man has been estimated (99,162).

Confocal Raman Spectroscopy

This technique uses the confocal principle to noninvasively examine the outer layers of the skin by Raman spectroscopy. In this way, it is possible to acquire rapidly, in vivo in humans, Raman spectra of the skin as a function of depth (163). The method is able to provide, for example, information on the hydration of the SC, and the manner in which the water concentration profile across the barrier may be altered by the use of moisturizing agents (163165). Other analytes, such as urea, can also be examined via their unique Raman signature which allows them to be differentiated from the spectrum of the skin’s constituents (166).

The approach is comparable to the application of reflectance infrared spectroscopy described earlier. In distinct contrast, however, confocal Raman does not require progressive stripping of the SC to determine the concentration versus depth profile of a target substance, and this is clearly an important advantage. Like ATR-IR, however, confocal Raman requires that the molecule of interest be present at a sufficient concentration, and possess spectral features of sufficient intensity, to permit its differentiation from those of the skin (163). Therein lies, at present, the major drawback of these IR-based techniques. With confocal Raman, in addition, only relative concentrations (rather than absolute levels) can be determined. It remains to be seen, therefore, whether this significant technological development will contribute eventually to the evaluation of topical drug bioavailability and/or bioequivalence. For the moment, the method offers a unique research tool and a novel approach to evaluate skin moisturization.

CONCLUSIONS

Despite pressure on regulatory agencies, such as the FDA, there is no generally accepted method with which to evaluate the bioavailability and bioequivalence of topical drug products. For the present, therefore, with the exception of the vasoconstriction assay for corticosteroids, and despite the diversity of efforts which have been made (Table II), clinical studies are obligatory. Unlike oral administration, for example, where the blood level of a drug is a generally accepted ‘surrogate’ for its concentration at the site of action, topical drug delivery poses a more complex problem. In this case, the site of action (even when it is known, which is not always the case) is not always accessible and a suitable ‘surrogate’, e.g., the SC, has not been generally validated. Two approaches are under most intensive scrutiny: one, microdialysis, strives to make measurements of the drug in a true “biophase” of the skin; the other, tape-stripping, attempts to establish the SC as the ‘surrogate’ sampling site. In both instances, significant progress has been made and proof-of-concept demonstrated. However, the next steps, i.e., optimization of the procedure(s) and transition to a useful and practical regulatory test, remain challenges for future work.

Table II Methods to Assess Drug Penetration into and/or Across the Skin