INTRODUCTION

Bioanalytical method validation (BMV) of ligand-binding assays (LBA) involving biomarkers endogenously present in substantial quantities in matrix (e.g., serum, plasma, etc.) can create challenges to accurate quantitation of that analyte as a spiked entity in the sample (13). This is because the assay frequently cannot distinguish between added analyte and the endogenous component. The challenge of adjusting for endogenous analyte content in matrix becomes apparent when assaying validation samples (VS) and attempting to accurately quantitate the spiked component. This same challenge presents itself during selectivity determinations when multiple individual matrix samples are tested for matrix effects by addition and quantitation of spiked analytes.

Two basic methods have been described to adjust for endogenous biomarker levels in percent analytical recovery (%AR) calculations involving analyses of such VS or selectivity samples. One method (termed addition method herein) uses a comparison of the observed VS results vs. the expected total concentration, the latter of which being derived by addition of the nominal value of the spike plus the observed experimental value from the unspiked matrix control. The other adjustment calculation is termed the subtraction method. Thus, the concentration of the tested biomarker observed in the matrix blank is subtracted from the amount observed in the spiked VS, and the net difference vs. nominal spiked value is used to calculate %AR.

Both addition (e.g., 48) and subtraction (e.g., 913) methods are used for endogenous analyte adjustment, although the latter is the predominant calculation in the immunoanalytical field. Frequently, it is stated that the endogenous analyte concentration had been taken into account for %AR calculations, but the exact method was ill-defined (e.g., 1416). As will be described in “Results,” the choice of calculation method can be a critical factor regarding conclusions on the reliability of the assay and resulting data. Considering that both addition and subtraction methods are currently being used, it is essential to describe in full the calculation employed for %AR determinations.

Opinions have been put forth of late supporting the subtraction method, although arguments for or against the addition method have not been widely published. The seminal article by the AAPS Ligand-Binding Assay Bioanalytical Focus Group described both addition and subtraction for calculating %AR when endogenous signal is present within the range of a LBA (17). However, the relative merits of subtraction vs. addition methods were not addressed in that 2003 publication. Specifics about %AR calculations were not substantially discussed in the recently released 2013 FDA BMV draft guidance, nor in the Crystal City V workshop report (18). Nevertheless, influential publications support use of the subtraction method (e.g., 1921). Moreover, recent white papers on biomarker BMV by the Global CRO Council and by the Global Bioanalysis Consortium Harmonization Team state that the subtraction method should be used for adjusting endogenous biomarker content in selectivity determinations (22,23).

Crucial to this discussion and to the generalized appreciation for the importance of calculation method are published results showing direct comparisons of the two methods using experimental data. Such analyses have not been widely published. Therefore, we undertook just such a comparison using examples from our ongoing BMV studies in normal human serum (NHS) with two Meso Scale Discovery (MSD®) multiplex cytokine assays, the Proinflammatory Panel 1 and Cytokine Panel 1 systems. Endogenous levels of the 20 cytokines were observed in NHS to varied degrees and had to be accounted for during BMV.

Case studies were selected based on the presence of detectable endogenous analytes and used to directly compare %AR results with subtraction and addition calculation methods. As shown in “Results,” it was observed that subtraction consistently yielded more convincing %AR values under conditions where addition gave unreliable conclusions. Another issue along these lines relates to endogenous analytes that are above the limit of detection (LOD), but below the limit of quantitation (BLOQ). We observed herein that subtraction of assay signal attributable to the matrix is a feasible approach to account for detectable endogenous analyte levels in %AR calculations.

MATERIALS AND METHODS

Preparation of Calibration Standard and Validation Samples

The MSD V-PLEX™ Proinflammatory Panel 1 and Cytokine Panel 1 kits for human samples were purchased for these studies (Meso Scale Discovery, Rockville, MD, catalog numbers K15049G and K15050G, respectively). The kits were supplied with lyophilized calibration standard (CS) blends, a mixture of the 10 cytokines to be measured by each respective multiplex panel. The CS blend was reconstituted with MSD assay buffer for use in standard curves and for spiking NHS as described below.

Twenty NHS samples were purchased from Bioreclamation (Hicksville, NY) and screened for suitability for use in a NHS matrix pool by running in the assays with and without spiked CS blend to monitor %AR of the individual cytokines. Sera for the NHS pool were selected based on the lack of appreciable positive or negative matrix effects on analyses of the spike cytokines, meeting acceptance of 80–120% AR using the subtraction calculation. The NHS pool consisted of eight individuals selected from the initial lot of 20 NHS purchased. Seven levels of VS were prepared by spiking this pool of NHS with the reconstituted CS blend at escalating concentrations. All VS contained at least 95% NHS. VS and any reconstituted CS preparations were stored at −70°C. All kit components had an expiration of 3 years from the date of manufacturer, while reconstituted preparations were given a 30-day expiration.

Multiplex Assay System

The assay methods followed closely to those of the product insert, except as required to incorporate NHS matrix. Briefly, reconstituted CS blend was serially diluted in assay buffer to obtain the desired concentrations for the standard curves. Diluted calibrators were added 50 μL/well directly to the plate without further dilution. VS and NHS samples were diluted in assay buffer for a final minimum required dilution (MRD) of 1:2 (50%; Proinflammatory Panel) and 1:4 (25%; Cytokine Panel) when added 50 μL/well. All determinations were assayed in duplicate (two wells). The plate was incubated 2 h at room temperature with orbital shaking. The wells were washed with buffer (PBS with 2% Tween-20) before the addition of 25 μL/well assay diluent containing a mixture of 10 SULFO-TAG™-labeled, cytokine-specific detection antibodies. The plates were incubated again at room temperature for 2 h with orbital shaking. The wells were washed with PBS-Tween. MSD read buffer was added to the wells, and the plate loaded into a MSD SECTOR® Imager 2400 instrument where a voltage was applied to the plate electrodes, causing the capture labels to emit light as electrochemiluminescence units (ECLU). A regression of the ECLU by the standard curve samples vs. concentrations was performed with a 5-parameter logistic model (5-PL) using Watson LIMS™ software (Version 7.3.0.01, Thermo Fisher Scientific, Inc., Waltham, MA) and the concentrations of analytes in the samples determined following adjustment for dilution.

Relevant BMV Details

The BMV data presented herein were derived in a manner consistent with the principles of GLP referred to hereafter as regulated BMV. Lower limit of quantitation (LLOQ) is defined by the VS with the lowest passing concentration of analyte in regulated BMV (18,24,25). Less formal assay characterization protocols (variously termed exploratory, nonregulated, and method qualification) can use other means to define LLOQ as described in the “Discussion” (2527).

Standard regulated BMV parameters were assessed with these two biomarker panels, including accuracy and precision (A&P), selectivity, stability, etc. Only the A&P runs will be discussed for this study. A total of six A&P runs were conducted for each multiplex panel over multiple days by two analysts. Each run consisted of a 12-point CS blend curve, 3 buffer blanks, and 3 replicate determinations of the 7 VS and NHS matrix pool, all of which assayed in duplicate. The CS curve preparation was made fresh for each assay run, while frozen VS were used for all other determinations. Acceptance criteria as applied to these analyses included percent coefficient of variation (%CV) ≤20% except for LLOQ where ≤25% was applicable. Percent AR was considered acceptable at 80–120%, except for LLOQ where 75–125% was applicable.

Calculations

Percent total error (%TE) was used to describe accuracy and precision from multiples runs of VS. Percent TE as used herein is the sum of inter-assay accuracy (percent relative error, %RE) plus inter-assay precision (percent coefficient of variation, %CV). Typical acceptance criteria are TE ≤30%, except at LLOQ where ≤40% is applicable (24). Percent spike was calculated by adding the nominal spike value with the mean endogenous concentration of the matrix blank control and dividing the nominal spike concentration by the total in the VS and multiplying by 100. LOD was defined using the mean buffer blank ECLU plus 2.5 times the standard deviation of that mean value.

Raw mass units (e.g., pg/mL) reported for spiked VS, as generated by regression analysis with Watson software, reflects the combined presence of the spiked entity plus the endogenous component. Unadjusted %AR values are derived by division of these raw mass units by the respective spike mass unit values multiplied by 100.

The addition calculation method involved division of the observed VS analytical result (measured concentration) by the expected results, which is the sum of the nominal spike plus the endogenous biomarker concentrations. The endogenous biomarker concentrations were determined by concurrent analysis of the NHS matrix blank during the A&P run of the VS. The equation for the addition %AR calculation is as follows:

$$ \%\kern0.5em \mathrm{Analytical}\kern0.5em \mathrm{Recovery}\kern0.5em \mathrm{b}\mathrm{y}\kern0.5em \mathrm{Addition}=\begin{array}{c}\hfill \frac{\mathrm{Measured}\kern0.5em \mathrm{Conc}\mathrm{entration}}{\mathrm{Expected}\kern0.5em \mathrm{Conc}\mathrm{entration}}\hfill \\ {}\hfill \left(\mathrm{Nominal}\kern0.5em \mathrm{Spike}\kern0.5em \mathrm{Conc}.+\mathrm{Endogenous}\kern0.5em \mathrm{Conc}.\right)\hfill \end{array}\times 100 $$

The subtraction calculation method involved elimination of the endogenous component from the %AR determination. Thus, the measured amount of endogenous analyte was subtracted from the total observed analyte concentration in the VS, which was subsequently divided by the nominal spike concentration to yield AR. The equation for the subtraction %AR calculation is as follows:

$$ \%\kern0.5em \mathrm{Analytical}\kern0.5em \mathrm{Recovery}\kern0.5em \mathrm{b}\mathrm{y}\kern0.5em \mathrm{Subtraction}=\begin{array}{c}\hfill \mathrm{N}\mathrm{e}\mathrm{t}\kern0.5em \mathrm{Measured}\kern0.5em \mathrm{Spike}\hfill \\ {}\hfill \frac{\left(\mathrm{Measured}\kern0.5em \mathrm{Conc}.-\mathrm{Endogenous}\kern0.5em \mathrm{Conc}.\right)}{\mathrm{Nominal}\kern0.5em \mathrm{Spike}\kern0.5em \mathrm{Conc}\mathrm{e}\mathrm{ntration}}\hfill \end{array}\times 100 $$

Adjustments by subtraction were also conducted using assay signal, ECLU. This adjustment involved calculating the net ECLU attributable to the NHS matrix in the VS by subtracting buffer blank mean ECLU from the matrix blank mean ECLU to generate a net matrix blank ECLU value. Acceptance criteria for mean ECLU values were CV ≤25% between replicate determinations and between duplicates for each of those determination. Net matrix blank ECLU was then subtracted from the mean ECLU obtained with the NHS VS. Regression analysis vs. the buffer CS curves was subsequently conducted, and the data processed without further adjustment.

RESULTS

Case Study 1: Endogenous Analyte Adjustments Using IL-12/IL-23p40 A&P Run Data

Table I shows a comparison of addition vs. subtraction methods for endogenous analyte adjustment using data from IL-12/IL-23p40 A&P assay runs. The cumulative A&P results established that the LLOQ for the assay was 17.5 pg/mL (not shown). Endogenous IL-12/IL-23p40 was detected in the matrix blanks in the range of 40.8–48 pg/mL, depending on the run (inter-assay mean 42.3 pg/mL). Using a method described by others (1,17), parallelism was confirmed for endogenous IL-12/IL-23p40 in the NHS pool over a series of five dilutions, and following adjustment for the dilution factor, a CV of 9.0% was derived between sample means. Spike IL-12/IL-23p40 was 5.83 pg/mL for all runs, approximately 12% of total IL-12/IL-23p40 analyte levels in the VS.

Table I Case Study 1: IL-12/IL-23p40 A&P Runs Illustrate Disparity in %AR Conclusions Using Subtraction and Addition Adjustment Methods

Three runs are shown with unadjusted %AR failing for all runs. This confirms that accurate determinations of recovery required adjustment for the endogenous component. Run 1 shows near complete recovery of the spike (96%), with passing %AR values regardless of adjustment method. Run 2 yielded only an apparent 1.8 pg/mL net detectable spike (31% of the 5.83 pg/mL added) reflected in a failing %AR using the subtraction method for adjustment. However, since such a large part of the “expected” value was due to endogenous analyte, the addition method yielded a passing %AR of 91%. Finally, run 3 shows an example of unacceptably high recovery using the subtraction method (146% of spike) which would be considered passing using the addition method for adjustment. Thus, addition and subtraction methods for adjusting endogenous analyte concentrations in %AR calculations can yield disparate AR conclusions.

Case Study 2: Adjustment for Low Relative Endogenous IL-13 Levels with High Proportion IL-13 Spike Values

Shown in Table II are inter-assay results from 6 IL-13 A&P runs using VS1, VS2, VS3, and VS4 (VS5, VS6, VS7, not shown for simplicity since unadjusted values passed criteria). These data illustrate that the disparity in conclusions given by the subtraction vs. addition adjustment methods was not restricted to biomarkers with very high endogenous analyte levels. It was observed that endogenous IL-13 was approximately 3.7 pg/mL in the matrix blank as assessed by the standard curve. Percent spike as a proportion of total IL-13 in the VS ranged from 21 to 83% using the approximate 3.7 pg/mL endogenous IL-13 value.

Table II Case Study 2: Adjustment for Low Relative Endogenous IL-13 Levels with High Proportion IL-13 Spikes Using Inter-assay Data from Six A&P Runs

Unadjusted %AR failed at all VS levels shown (Table II), confirming that accurate determinations of recovery required adjustment for the endogenous component. The addition method using 3.7 pg/mL endogenous IL-13 for adjustment yielded passing conclusions at all levels, including the 1 pg/mL spiked VS1. Consequently, from such data, it would appear that the LLOQ of the assay is 1 pg/mL and the 3.7 pg/ml endogenous analyte value was a valid determination. Using this endogenous IL-13 value with the subtraction method resulted in failed VS1 and VS2, but passed at VS3 and VS4. However, the 3.7 pg/mL value should not be considered valid since it was BLOQ using the subtraction method, so the approach of using assay signal was investigated. Only VS4 passed using assay signal, identifying the LLOQ for the assay as 18 pg/mL based on these VS results.

The validity of assay signal vs. mass units to calculated %AR was examined more fully by comparison of representative A&P results for IL-12/IL-23p40, IL-15, and IL-16. A modest relationship (R 2 = 0.6693) was observed between the nominal spiked VS concentrations and unadjusted observed results (N = 42 paired values); lack of a strong relationship reflects the endogenous analytes that had not yet been accounted for in the determinations. Using the subtraction method and mass units within the range of the assays, a much stronger relationship was observed between nominal spike and derived results (R 2 = 0.9650). This was comparable to mass units calculated by subtraction of matrix signal vs. nominal spike (R 2 = 0.9809). There was a very strong relationship (R 2 = 0.9886) between adjusted mass units as derived by subtracting assay signal attributable to matrix vs. mass units derived by subtracting endogenous mass unit concentrations within the range of the respective assays. Preliminary results with individual NHS selectivity samples confirmed utility of assay signal subtraction. However, this will have to be examined in greater detail since selectivity usually involves single determinations with and without analyte and limited inter-assay comparisons.

These results demonstrate: (1) a potential pitfall using the addition method to determine LLOQ with VS, (2) potential issues associated with using mass units derived BLOQ for endogenous analyte adjustments, and (3) apparently successful use of assay signal for %AR calculations involving analytes that are detectable, but not quantifiable in this assay system.

Case Study 3: Comparison of Addition vs. Subtraction Adjustment Methods with High Endogenous IL-16 Levels

As shown in Table III, case study 3 confirms the disparity between addition and subtraction %AR calculations and also indicates the potential importance of cytokine spike to endogenous proportions in achieving a passing %AR. Shown are inter-assay A&P results (six runs) for IL-16, VS1 through VS7 in NHS. The nominal spike range was 5 to 494 pg/mL. Mean endogenous IL-16 was 167 pg/mL, with spiked quantities ranging from 3 to 75% of total IL-16 content in the VS. Parallelism was established for endogenous IL-16 in the NHS pool over a series of five dilutions, and following adjustment for the dilution factor, a CV of 16.1% was derived between sample means.

Table III Case Study 3: Comparison of Addition vs. Subtraction Adjustment Methods with High Endogenous IL-16 Levels

Adjustments for endogenous IL-16 were necessary as indicated by unadjusted VS failing %AR at all levels (Table III). The addition method yielded passing %AR at all levels, even the very lowest VS1 with 5.1 pg/mL (3%) spike. The subtraction method by mass units identified a spike of 56.5 pg/mL as the first passing VS concentration, with failed VS1, VS2, and VS3 for %TE reasons. These same low VS were BLOQ after subtraction of the signal attributable to NHS matrix.

These results support the following conclusions, including that: (1) the addition method for endogenous analyte adjustment gave passing results regardless of spike to endogenous IL-16 proportions, (2) subtraction using mass units or assay signal yielded matching conclusions about which VS passed vs. which failed, and (3) spikes ≥25% may be needed to consistently observe passing %AR using the subtraction method for endogenous analyte adjustment.

Requirements for VS Spike to Endogenous Biomarker Proportions to Consistently Observe Passing Analytical Recovery %TE Results

The data shown in Fig. 1 directly examines the relationships between spiked cytokine to endogenous cytokine levels in VS vs. resulting %TE by subtraction and addition endogenous analyte adjustment methods with mass units (pg/mL). TE ≤30% was considered passing in this analysis (horizontal red lines). The proportions of spiked cytokine (panel a) and endogenous cytokine concentrations (panel b) in the various VS for analytes IFN-γ, TNF-α, IL-8, IL-12/IL-23p40, IL-15, IL-16, and VEGF were plotted vs. the resulting inter-assay %TE from six A&P runs for each analyte using the two adjustment methods.

Fig. 1
figure 1

Successful validation samples tend to have ≥25% spike regardless of endogenous biomarker concentrations. Cumulative A&P results involving six runs and seven VS each are shown for IFN-γ, TNF-α, IL-8, IL-12/IL-23p40, IL-15, IL-16, and VEGF. Percent TE and spike were determined as described in “Materials and Methods.” a presents the relationship between %TE and percent spike as a function of total respective biomarker in the VS. Red symbols for a represent %TE values calculated using the addition adjustment method, while black symbols represent %TE values calculated using subtraction. b shows %TE calculated only by subtraction vs. endogenous biomarker levels for the respective cytokines. Black symbols in b illustrate %TE for VS having ≥25% spiked biomarker, and red symbols illustrate %TE for VS having <25% spiked biomarker

As shown in panel a, using the subtraction calculation (black symbols), spikes <25% of total tended to yield high, unacceptable %TE in the upper left quadrant of the figure. Indeed, none of the VS with <25% spike exhibited passing %TE. Conversely, spikes ≥25% tended to yield low %TE, as shown in the lower right quadrant of the figure, with passing %TE 88% of the time. Increasing the pass cut point for TE to 40% yielded 95% passing VS, with ≥25% spike using subtraction. The addition calculation method yielded dramatically different conclusions, as further illustrated by the red symbols in panel a of Fig. 1. Fully 100% of the VS passed at a TE cut point of 30%, even with spikes as low as 2–3% of total cytokine (circled data points in panel a).

The relationship between baseline endogenous biomarker levels and %TE values was also investigated as shown in panel b of Fig. 1. Only %TE values calculated using the subtraction method are shown since the addition method yielded 100% passing VS regardless endogenous analyte content. Percent TE values were broken down into VS having ≥25% spike (black symbols) vs. those with <25% spike (red symbols). While increased variability was observed with increasing endogenous biomarker concentrations, this variability was largely restricted to VS having <25% spike. There is an apparent relationship with increased endogenous biomarker levels which likely relates to correspondingly decreasing percent spike rather than endogenous concentrations per se. The data shown in Fig. 1 indicate that high A&P variability in %TE for VS is less a function of high vs. low endogenous biomarker levels in the matrix and has more to do with the proportion of spike to endogenous concentrations.

DISCUSSION

The case studies presented herein clearly support the notion that the subtraction method for adjusting endogenous biomarker content in %AR calculations produces more reliable conclusions, as contrasted to the addition method for adjustment. Three individual case studies were described in which experimental data were used to calculate %AR by the two methods. Dramatically different conclusions about acceptability of VS values were observed in each case, with results from the addition method often yielding rather unreasonable passing conclusions. Figure 1 presented the cumulative results from six A&P runs, each for seven biomarkers using seven VS (total 49 VS) for each analyte. None of the 49 VS were considered failed for %AR reasons using the addition adjustment method. This goes for VS containing as little as 2–3% spiked biomarker. This contrasts with the subtraction adjustment method where such small amounts of spiked biomarker could not be reproducibly documented with the assays at hand.

The data in Fig. 1 also indicate that successful %AR results are less a factor of very high vs. low endogenous biomarker concentrations, but rather the percent of spike to endogenous analyte content. The findings from Fig. 1 that approximately 25–30% spike to total biomarker content is needed for achieving reproducible %AR is consistent with known accuracy and precision characteristics for LBA, 30–40% TE. Thus, it is reasonable that comparable %AR conclusion might be expected when applying a 30% spike to an endogenous background of 10 or 100 pg/mL, as observed in Fig. 1, assuming sufficient sensitivity for the two respective assays.

Relationships Between the Present Findings and Other Published Works

The conclusions obtained from this study are entirely consistent with most current thinking on %AR calculations for adjusting endogenous biomarker content in spiked VS and selectivity samples (22,23). Accordingly, it is preferable to make these calculations by subtracting the endogenous component first, followed by determining the analytical recovery of the spike reference material fraction. It is nevertheless still common for the addition method to be used (48) and for publications to not state the calculation method employed (1416). These observations suggest that: (1) consensus has not yet been achieved and/or (2) it is still under appreciated that the method for calculating %AR is critical to understanding the accuracy and selectivity of biomarker assays in biological matrix.

At least two factors are responsible for this lack of consensus or under appreciation. The first is that early white papers did not indicate a preference for calculation method, implying acceptability of either subtraction or addition in %AR determinations. For example, the seminal 2003 AAPS focus group white paper describes both the addition and subtraction methods, but the two calculations were not critically compared, and only the addition method was used in the examples (17). Specifics about %AR calculations were not substantially discussed in the recently released 2013 BMV draft guidance, nor the Crystal City V workshop report (18). Thus, while there are published white papers specifying the use of subtraction in selectivity assessments (22,23), the message is not being effectively delivered.

A second factor leading to common use of both calculation methods is that published data and rationale for not using addition are still lacking. The present work attempts to rectify that situation by presenting experimental data and comparative analysis of %AR calculation methods to drive conclusions on this subject. This is the first published report to our knowledge that directly compares %AR calculation methods with experimental data, providing justification for the preference of subtraction over addition in %AR determinations.

The results and conclusions described herein are specifically applicable to situations where spiked reference material and endogenous analytes exhibit a linear mathematical relationship. Situations where nonlinear contributions of these components are observed require other remedies. Physical means have been proposed to deal with matrix effects, including increasing the MRD and use of stripped or immunodepleted matrix. Matrix effects and means to address issues with endogenous biomarker levels have been discussed in detail elsewhere (1,3,20,23).

The Basis for Divergent Conclusions with the Addition and Subtraction Methods

Addition and subtraction methods for endogenous analyte adjustments will yield exactly identical numerical values only with 100% AR conditions. Deviation from 100% AR conditions, no matter how small, will always yield different %AR values between the methods. The basis for this divergence in %AR values relates to using the endogenous component as a constant in the addition calculation, while it is eliminated with the subtraction method. The differences in %AR can be small or even negligible when using mixtures with high spike to endogenous proportions (e.g., 90 pg/mL spike with 10 pg/mL endogenous). However, even with moderate spike to endogenous proportions like 50:50, the inherent bias of the addition method can yield misleading conclusions.

To illustrate this latter point, consider the following hypothetical mixture containing 50 pg/mL spike and 50 pg/mL endogenous biomarker. The assumptions include 80–120% AR acceptance limits, and statistics like %TE and parallelism are acceptable and equal between calculations and observed results. Unadjusted assay results of 100 ± 20 pg/mL (40 pg difference) would be acceptable with addition, reflecting 80–120% of “expected” 100 pg/mL. With subtraction, unadjusted assay results would have to be 100 ± 10 pg/mL (20 pg difference) because acceptance limits concern only the spike (i.e. 80–120% of 50 pg/mL). Thus, if 80 pg/mL was the observed result, it would pass with addition (80% AR) and fail with subtraction (60% AR) because the endogenous component is treated differently between calculations.

It should be noted that the conclusions from these studies apply only to spiking defined reference materials into biological matrix samples. Addition calculations are acceptable and sometimes the only way to address %AR under certain conditions. For example, mixing of biological matrix samples containing different endogenous biomarker levels to achieve quality control samples with desired levels of that biomarker has been described and recommended (17,25,26). It is reasonable that addition would be used to derive expected concentrations under these and other circumstances.

Technical Considerations and Issues Associated with LLOQ

The distinctions between advanced biomarker method validation (i.e., regulated BMV) and protocols with less rigor (e.g., exploratory, method qualification, and category 1) have been reviewed elsewhere (1,18,20,22,2527). It has been recommended that regulated BMV of biomarker LBA be conducted to the extent possible when the results are intended to support key claims in regulatory submissions (18). A crucial component of regulated BMV is the use of VS prepared in the same biological matrix as the authentic study samples. Multiple VS are required (generally 5, but we used 7) with spiked levels spanning the expected range of the assay. The passing VS containing the lowest level of spiked analyte defines the LLOQ of the assay with regulated BMV. This is different from biomarker analytical method qualification where the LLOQ can be defined any number of ways, including the lowest concentration of analyte that can be quantitatively determined with predefined A&P, such as spiked buffer specimens, matrix specimens containing endogenous analyte, etc.

The results reported herein can have implications for LLOQ conclusions for biomarker assays undergoing regulated BMV. A circular, and potentially fallacious, set of conclusions about passing LLOQ VS and the LLOQ of an assay could be derived using the addition, but not the subtraction, adjustment method in the context of a regulated biomarker BMV. An example of this potential problem was observed regarding the data in Table II. As described, VS1 with 1 pg/mL IL-13 spike was the lowest passing VS in the series using the addition adjustment method. This could lead to the conclusion that 1 pg/mL is the LLOQ of the assay. Endogenous IL-13 was present in the matrix blank at ~3.7 pg/mL, an ostensibly valid conclusion since the addition method identified 1 pg/mL as LLOQ. However, two subtraction methods (i.e., using mass units and ECLU) identified at least a 6 pg/mL LLOQ. These latter results cast doubt on the validity of the aforementioned 3.7 pg/mL endogenous IL-13 determination and the 1 pg/mL LLOQ conclusion. The 3.7 pg/mL was BLOQ with both subtraction methods, and the likely valid results were derived using subtraction of assay signal as described subsequently.

Adjusting for Endogenous Analyte Signal Above LOD, but BLOQ

It is conceivable that endogenous levels of a particular biomarker are normally above LOD but BLOQ, but valid data in that region are desired. For example, we were attempting in another study to verify acceptance of a 1 pg/mL LLOQ VS; the matrix blank signal was above the LOD at an approximate 0.5 pg/mL level, resulting in an approximate unadjusted 150% AR. The strategy investigated for adjusting endogenous analyte content BLOQ was to eliminate that portion of the assay signal attributable to the matrix. Thus, mean buffer blank ECLU was subtracted from the mean matrix blank ECLU to yield net matrix blank signal, which was then subtracted from all samples prepared in that matrix blank. Direct interpolation of sample data points along the buffer standard curve yielded derived mass units that required no additional adjustment. The resulting mass units were highly correlated with results obtained by subtracting mass units observed within the assay range, as described relative to case study 2. This strategy was also investigated with individual NHS selectivity samples by subtracting the net signal of the unspike sample from the spiked sample signal, with comparable conclusions. These results suggest that similar data manipulations BLOQ are valid, but more detailed studies along these lines are required to confirm that supposition.

It should be pointed out that this strategy of signal subtraction would only apply to %AR calculations during BMV and not regulated analysis of trial samples when biomarker levels fall between LOD and LLOQ. As discussed by Lee and associates (1,20), it is sometime useful to accept such sample results for exploratory biomarker analyses only, but the variability associated with working in that low analyte region should be kept in mind. We have noticed that it is common for a 10–20% CV in signal to yield >30–40% CV in derived mass units when working between LOD and LLOQ. This may explain, in part, why %TE was higher with mass unit subtraction BLOQ vs %TE with signal in Tables II and III. It is generally agreed that extrapolation BLOQ in advanced (regulated) biomarker analyses is not an accepted practice.

Assay technologies other than MSD may not be amenable to this signal manipulation strategy for adjusting BLOQ endogenous biomarker content. MSD typically has a substantial range in signal (3–4 logs) and is noted for being relatively unaffected by matrix effects. Techniques such as ELISA, which are known to have higher biological matrix variability, may not yield such clean results and conclusions. Additional studies are therefore required to draw broad reaching conclusions about applicability of this signal subtraction strategy.

CONCLUSIONS

The presence of appreciable concentrations of biomarker(s) in matrix can complicate %AR determinations during BMV. We consider the subtraction method preferred over that of addition to adjust for detectable endogenous biomarker levels in %AR calculations, consistent with opinions of others (e.g., 913,20,22,23). The subtraction method can use either mass units or raw assay signal, the latter of which being applicable when the endogenous analyte is present above the LOD, but BLOQ. Optimal spike to endogenous proportions are ≥25-30%, which is expected for a LBA system with %CV and %RE acceptance criteria generally 20–25% (17).