Introduction

The standard addition method (SAM) has been being used since many years ago for instrumental analysis. According to Burns and Walker [1] and Kelly et al. [2], the first use of the SAM in instrumental chemical analysis was in polarography by Hohn in as early as 1937. Outside the area of polarography, the SAM was first used in atomic emission spectrometry in 1950, in ultraviolet-visible spectroscopy in 1960, and in 1H nuclear magnetic resonance spectroscopy in 2012 [1]. The theory of SAM in instrumental analyses was published by Bader [3] in 1980. Therefore, the idea of SAM is far from being new; it is a very old but intelligent idea to analyze a compound as accurately as possible.

In the field of forensic toxicology, the SAM was first used in 1990s; Siek and Dunn [4] reported a doxylamine overdose death using SAM quantification in 1993, and Poklis et al. [5] reported a fatal suicidal valproic acid overdose in 1998 using the SAM. In those days, it was not necessary for scientists to be nervous to present the validation data of the methods used. Therefore, the detailed method validation data were not demonstrated until early 2000s.

Since the early 2000s, making validations for instrumental analyses have been lively discussed mainly in the U.S. and Europe, and many guideline papers on validation of instrumental analyses have been published in every field of analytical chemistry [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22]. Currently, when scientists try to publish the result(s) of quantification in an international journal, the presentation of validation data has become mandatory, but they are not necessary for screening tests or semiquantitative experiments. The above guideline papers are exclusively based on the conventional matrix-matched calibration methods (MMCMs), but no papers are available dealing with validation guidelines for the SAM in all analytical chemistry fields including forensic toxicology.

In 2013, our group published a paper on the presence of several ten to hundred ng/mL of ethylene glycol, propylene glycol, and diethylene glycol in whole blood specimens collected from ten healthy subjects [23]. Although high concentrations of ethylene glycol or propylene glycol in blood at more than several hundred μg/mL in suicidal cases caused by ingesting automobile antifreezes are sometimes encountered in medico-legal cases [24], it was surprising for us to detect them in blood specimens from healthy subjects. The low concentrations of glycols are ubiquitously present even in tap water and atmospheric air. To quantify their concentrations, we had to use SAM, because there were no appropriate blank blood specimens obtainable, which absolutely contained no such glycols. Since then, we have published a considerable number of articles using the SAM together with its validation data [25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42], being included in Table 1. Moreover, we have relatively many opportunities to perform forensic autopsies of cadavers that underwent abuse of new psychoactive substances (NPS). We have conducted studies on the postmortem distribution and redistribution of NPS in various kinds of body fluids and solid tissues to find out the best matrix that contains the highest concentrations of the parent forms or metabolites of NPS, because they are the matters in which the forensic scientists are most interested. To fulfill such distribution studies of toxic compounds over quite different kinds of matrices of cadavers, the SAM seems to be a good choice for accurate quantifications of target compounds.

Table 1 Articles describing the quantification of xenobiotics using the standard addition method (SAM) in the field of forensic toxicology reported after 2010 (for abbreviations, see the footnote of this table)

The aim of this review is to explain the basic methodology of SAM experiments, its advantages and disadvantages, and finally how to present their validation data.

Literature search

Some databases, such as SciFinder, Google and Google Scholar were utilized to collect relevant reports referring to SAM. Their full texts were carefully checked according to their necessity.

How to get concentration(s) of preexisting xenobiotic(s) in a matrix by the standard addition method

For the SAM experiments, in the early stage before 2000, the standard addition with points of only four different concentrations (i.e., three calibrators) were widely employed as the final probe [1, 5, 43]. In our opinion, though it is understandable to use such calibration curves with a few calibration points due to the laboriousness of SAM, the accurateness of such a method is of concern. In addition, when we encounter a case in which the xenobiotic effect is suspected, the identity, level(s) or even order(s) of the target compound(s) are unknown at the beginning; thus we here propose a two-step calibration method. The first step is to check the presence or absence of xenobiotic(s), and if present, to roughly estimate the level(s) of xenobiotic(s) in matrix/matrices. Such information is essential to proceed to the second step for quantification of xenobiotic(s) by SAM.

The first step is simpler the better; thus we have chosen the simplest one-point standard addition test only to check the profile of xenobiotic(s) preexisting in a matrix using only two sample lots for each matrix; 0.2 g of solid tissue or 0.2 mL of body fluid is homogenized or diluted to 2.0 mL, respectively. It is divided into two sample lots of the same volume (1.0 mL). To one of the lots, known amount(s) of the standard(s) of suspected xenobiotic is/are spiked. To another lot, the same volume of water is added. The both sample lots are subjected to pretreatment/extraction procedure and finally instrumental analysis. By such an experiment, it becomes clear whether the matrix contains xenobiotic(s), and if absent, the experiment terminates at the time; if present, the calculation of each concentration (Cx) for each xenobiotic in the diluted matrix can be made as follows (also see Fig. 1):

$$C_{x} = \, [P_{0} /(P_{{\text{a}}} - P_{0} )] \times \, A_{{\text{t}}} /W,$$
(1)

where P0 is a peak area or peak height of a xenobiotic preexisting in a homogenate sample lot or a diluted body fluid sample lot without any exogenous addition; Pa is a sum peak area or peak height of the preexisting plus spiked xenobiotic compound; At is an amount (i.e., μg or ng) of a xenobiotic compound standard spiked into a sample lot; W is g wet weight or mL of the matrix contained in each sample lot.

Fig. 1
figure 1

Image for a one-point standard addition assay as the first step experiment. Pa: a sum of peak area or peak height of the preexisting plus spiked target compound; P0: peak area or peak height of a target compound preexisting in a matrix

In case of multidrug consumption, known amounts of corresponding xenobiotic standard mixture are spiked altogether into one of the two homogenate or diluted body fluid lots; this sample lot together with the corresponding blank sample lot containing the preexisting xenobiotic(s) only is subjected to the pretreatment/extraction, followed by the instrumental analyses as described above. The concentrations of all kinds of xenobiotics can be roughly calculated using the above Eq. (1). If the peak of the non-spiked sample lot is less than 1/10 that of the spiked sample lot, the first step experiment should be again performed from the beginning with reducing the amount(s) of corresponding xenobiotic(s) spiked into one of the two sample lots.

To make the above explanation for the first step experiment easier, the image of the two peaks for the one-point standard addition method conducted at the first step is shown in Fig. 1.

In the second (final) experiment, after homogenization of the tissue specimen (1.0 g wet weight) or dilution of the body fluid specimen (1.0 mL) to prepare 10 mL of sample suspension or solution, respectively, a fixed concentration of an internal standard (IS) is added to it to reduce errors during the pipetting manipulation of sample solution volumes and during instability of the target compound(s) and IS inside an analytical instrument. Then, it is divided into ten sample lots of the same volume; six sample lots are used to fulfill the final SAM measurements. The SAM calibration equation is expressed as y = ax + b, where “a” is the slope of the curve; “b” is the value of the intercept on the y axis (also see Fig. 2).

Fig. 2
figure 2

Various patterns of standard addition calibration curves to obtain an accurate concentration of a target compound. The negative intercept position indicated by closed circle (2axbx) on the x axis shows the concentration of the target compound in preferable pattern. Please note whether the concentration of each target compound is within the range of respective calibrators. open circle (negative intercept >> bx) and open square (negative intercept << bx): non- preferable patterns

To the five sample lots, different concentrations of xenobiotic (target compound) standard(s) are added. Thereby, in case of multidrug consumption, where multidrugs coexist in the same sample lot, it is not necessary to add the same concentration for every multidrug standard to each sample lot to create a SAM calibration curve. The five concentrations with an appropriate range and spaces between points for each xenobiotic standard to be spiked into the five sample lots are separately decided on the basis of information for each concentration of the preexisting drugs obtained by the first step experiment. Therefore, different concentrations of spiked multudrugs can coexist in each sample lot. However, for all of the multidrugs, the concentrations of each drug standard spiked should be aligned in the increasing order from sample lots 1 to 5. The extractions and the instrumental analyses have to be started from one sample lot spiked only with IS without addition of any xenobiotic standard, followed by the one to five sample lots (in total six points for each target compound) for each matrix. Then a ratio of (peak area of the preexisting xenobiotic compound)/(IS peak area) and five different ratios of (combined peak area of the preexisting xenobiotic compound plus each different concentration of the same xenobiotic compound spiked)/(IS peak area) are plotted against the concentration on the horizontal axis for each xenobiotic in the matrix. Along the six points, a straight line is drawn by the least square method. The straight line intercepts the horizontal axis in the negative range, which gives the preexisting concentration of each xenobiotic compound in the tissue matrix (/g wet weight) or in the body fluid matrix (/mL) (Fig. 2). If four kinds of drugs coexist in a matrix, four calibration curves with six points can be drawn to give four concentrations of the multidrugs in the matrix. This means that the concentrations of multiple preexisting xenobiotics in a matrix can be obtained by the simultaneous analysis with only six sample lots for each matrix.

The remaining four sample lots (4.0 mL) containing a fixed concentration of IS plus preexisting xenobiotic(s) are used for the validation experiments, which will be explained later.

It should be mentioned here that the IS can be omitted (see Table 1, nos. 12 and 25), and the procedure can be performed only with peak areas or peak heights (instrumental units of them) of the preexisting compound(s) and spiked calibrators. An appropriate IS is added to a matrix sample only to reduce errors as stated above. The best IS is the target compound itself labelled with stable isotopic atoms.

In our group, we usually create the SAM regression equations with five different concentrations of calibrators (totally six points including the no addition point). Although the accurateness of the experiment depends on the skillfulness of an experimenter, the patterns of the calibration curves (Fig. 2) also affect the accurateness of the quantification, because each value of the preexisting xenobiotic concentration should be located within the corresponding bx calibrator concentration range and preferably close to the midst of the range. When the three patterns of SAM calibration curves are compared, the curve indicated by closed circle pattern in Fig. 2 probably gives the most accurate result; in other curve patterns indicated by open circle (negative intercept >> bx) and by open square (negative intercept << bx), the obtained concentrations of the target compound are outside of the calibration range, and the errors in drawing each calibration curve will cause a significant shift of the intercept location on the negative horizontal axis resulting in relatively larger variations of the xenobiotic concentrations.

Advantages and disadvantages of the standard addition method

The first advantage to be mentioned for SAM is that it does not need a blank matrix (surrogate matrix) for quantification of a target compound. In the conventional MMCM, to construct a calibration curve, different amounts of each target compound should be spiked into suitable blank matrix [7,8,9,10,11,12,13,14,15,16,17,18,19,20]. However, strictly speaking, for example, the components in blood specimen collected from a cadaver are not the same or not even similar as compared to those of fresh blood collected from healthy volunteers; furthermore, there are individual differences of blood components due to different age, sex, constitution, body mass index/obesity, and the presence and absence of disorder(s) for each cadaver, and most importantly the postmortem change/decomposition for the cadaver results in marked changes in components of blood. Also for solid tissues/organs, their components are quite different according to individual differences; for validation of quantification of a life-threatening chemical or its metabolite in solid tissue, Lehmann et al. [44] quantified NPS in various body fluids and solid tissues collected at autopsy by SAM, but used blank matrices from different cadavers that were completely free of NPS exposure to perform validation tests according to the MMCM. However, it is questionable that the quantitative results obtained by SAM are strictly equal to the data obtained by MMCM.

In many institutions in Japan, the institutional review board does not approve of using forensic cadavers for getting blank matrices; they usually insist that the cadavers should be used only for investigating the direct or indirect cause(s) of deaths, and criticize the use of cadavers as blank specimens only. This is the main reason for our research group to use SAM for quantification of xenobiotics in cadaver samples collected at autopsy. However, in most advanced countries, there are no such high barriers to use autopsy samples for medical research purposes.

As the second advantage, the SAM overcomes the matrix effects and recovery rates. The obtained values are free of matrix effects and recovery rates. Gergov et al. [45] reported that SAM was very effective for overcoming the matrix effects on metformin caused by postmortem blood samples. It is true that SAM really overcomes the matrix effects and recovery rates, but other methods to overcome the matrix effects should be mentioned; most effective one is to use a stable isotope-labeled IS of a target compound, and another one is to use a highly effective extraction/purification pretreatment such as the QuEChERS method [25]. In addition, in this review, we have tried to devise new simple calculation methods of matrix effects and recovery rates for SAM, which are also explained later. The quantification of drugs in solid tissues by liquid chromatography–tandem mass spectrometry (LC–MS/MS) is most sensitive to matrix effects. The SAM is very useful for the LC–MS/MS quantification of compound(s) contained in solid tissues frequently causing serious matrix effects by the conventional MMCM.

As the third advantage of the SAM, when an appropriate IS (stable isotope-labelled compound of the target compound or the different one but is very close physicochemically) is not available, the quantification can be fulfilled without IS [45,46,47]. As stated before, in the SAM, the presence of IS is not essential, but only contributes to reduction of errors in assays.

As the main disadvantage of the SAM, the laboriousness of the experiments should be mentioned. Even to quantify a single concentration of a xenobiotic in a matrix, two peaks with and without addition of a known amount of the target compound standard should be obtained as the first step experiment (Fig. 1). Then to perform accurate quantification of preexisting compound(s), the second step experiment is needed as already shown in Fig. 2. The first step one is rough estimation of unknown level(s) of the xenobiotic(s); another is for accurate quantification with six points. If the experiment is repeated in duplicate or triplicate, the labor increases twice or three times, respectively.

According to the larger number of experiments to be done in the SAM, larger amounts of matrix containing target xenobiotic(s) become necessary [e.g., more than 1.0 mL of body fluids, and more than 1.0 g of solid tissues containing target compound(s)]. This is also a disadvantage of SAM. However, such a problem is being solved because the sensitivities of analytical instruments have markedly increased during the past 10 years, resulting in that much less amounts of matrices have become sufficient to complete the experiments currently.

Validation for the standard addition method

The conventional MMCM and SAM are fundamentally different. In the MMCM, the calibration curve is constructed by adding different concentrations of a xenobiotic standard together with IS to divided lots of blank matrix fluid or blank tissue homogenate for creating the calibrators. This means that the validation is not focused on the original sample matrix in question, but focused on the blank matrix, which may be similar to but not the same as the original sample matrix, while in the SAM, IS only and IS plus different concentrations of the xenobiotic(s) are spiked into divided sample lots composed of each real matrix fluid or homogenate (sometimes undiluted or diluted) containing the preexisting xenobiotic(s) to make calibrators before extraction procedure. The SAM validations are conducted on the target matrices, and not on the blank matrices being used in the MMCM.

In recent years, it seems likely that scientific reports using SAM are increasing, but in many of them except our reports, the validation data for SAM are either missing, incomplete or confusingly mixed with the validations of MMCM [48,49,50,51,52,53,54,55] (see Table 1). Only one report [56] describing postmortem distribution of mepirapim and acetyl fentanyl in a cadaver that died of drug poisoning presented satisfactory validation data for SAM, which were similar to those proposed by our group [25, 28, 31,32,33,34,35, 37, 39].

We think that although many of forensic toxicologists are interested in the SAM, they are usually not so familiar with how to present validation data for studies or case reports using SAM. In the following sections, we describe details of how to present the validation data of SAM.

Standard addition calibration regression equations and correlation coefficients

As shown in Fig. 2, by adding different concentrations of xenobiotic(s) to divided sample lots of diluted body fluid or tissue homogenates containing the preexisting xenobiotic(s), the concentration(s) of the preexisting xenobiotic(s) can be quantified as described before, being free from the matrix effects or recovery rates. The SAM regression equation can be expressed as y = ax + b, where “a” is the slope of the SAM calibration curve; “b” is the value of the intercept of the SAM curve on the y axis, reflecting the area or height of a preexisting compound in a matrix as described before. When y equals zero, x = − b/a, which shows the negative concentration of the preexisting compound in the tested matrix. The correlation coefficients or coefficients of determination should be greater than 0.99 within appropriate calibration range as described before, for reliable measurements of the target compounds.

Limit of detection and limit of quantification

In the SAM in the absence of blank matrices, it is, of course, impossible to create quality controls spiked with low concentrations of a target compound. Therefore, exact limits of detection (LOD) and quantification (LOQ) cannot be calculated in the same way as that of the conventional MMCM. However, there is a method to roughly estimate the LOD and LOQ; the only method is to carefully examine the height of the target compound peak in comparison with the width of fluctuating basal noise just before and after the peak. Since the concentration of the preexisting target compound had been obtained by the SAM calibration, the estimation of the concentrations based on the presumed peak heights that give the signal-to-noise (S/N) ratio = 3 (for LOD) and S/N ratio = 10 (for LOQ) can be made. If we assume that the peak height of a xenobiotic in a solid tissue is 33.5 mm; the width of fluctuating noise just before and after the peak is 2.3 mm; and the concentration of the compound has been calculated to be 24.5 ng/g wet weight, the LOD and LOQ can be calculated as follows: The S/N ratio of the peak = 33.5/2.3 = 14.6. The LOD = 24.5 ng/g × 3/14.6 = 5.03 ng/g; the LOQ = 24.5 ng/g × 10/14.6 = 16.8 ng/g. They are only the exemplifications of how to calculate them.

Intraday and interday variations

This parameter is almost equivalent to “accuracy and precision” in the MMCM. There are two procedures that we have proposed as follows.  One of the below two procedures can be chosen. In our group, we are largely using the “repeatabilities” because they reflect variation ranges at the real concentration of a xenobiotic present in the corresponding matrix.

Repeatabilities

In the conventional MMCM, intraday and interday accuracies and precisions are tested by spiking low, medium and high concentrations of the target compound standard into fluid blank matrix or homogenized blank solid tissue matrix, followed by extraction and instrumental analyses. However, to make the experiment simpler and less laborious as much as possible, we propose only repetition of the quantification of the preexisting target compound in a diluted body fluid or homogenized solid tissue matrix with IS addition only using the second step procedure (see the section of “How to get concentration(s) of preexisting xenobiotic(s) in a matrix by the standard addition method”). This method includes both intraday and interday repetitive experiments. We recommend that each number of experiments is “n = 5” based on the literature and our own experience. We think that this method seems advantageous, because the simple intraday and interday repetition is tested to gain respective percent relative standard deviation (%RSDs) at the actual concentration of the target compound preexisting in the matrix, which is equivalent to “precisions” in term of MMCM. A table example of the repeatability can be referred to reference [25].

Modified accuracies and precisions

The intraday and interday accuracies and precisions at low, medium and high concentrations of spiked target compound, which are currently standard in the MMCM, can be also examined in the SAM (see ref. [42] for details). Zero, low, medium and high amounts of a target compound together with a fixed amount of an IS (four points including zero addition) are spiked into the divided sample lots of each diluted liquid matrix or solid tissue homogenate all containing a preexisting xenobiotic, followed by extractions and instrumental analyses. The preexisting concentration (D) (for the explanation of capital alphabet symbols “A’” and “D”, also see Fig. 3) of a xenobiotic can be subtracted from total concentration (A′) of the preexisting and spiked xenobiotic for all low, medium and high levels. Each net concentration to be analyzed for the target compound is (A′–D) at each concentration level, which corresponds to each spiked concentration in the MMCM. The “accuracy” at each level can be calculated as the percent value of [(A′–D)/(the corresponding nominal concentration spiked)] × 100. The “precision” equals to the %RSD of triplicate or quadruplicate determinations of the net accuracy concentrations.

Fig. 3
figure 3

Images of some quantities of a preexisting or added target compound to calculate the matrix effect and recovery rate in the SAM analysis. Each quantity can be a peak area or height itself, a ratio of (peak area or height)/internal standard, or also finally calculated concentration value

Matrix effects and recovery rates

Even with the absence of appropriate blank matrices, it is possible to calculate matrix effect and recovery rate in SAM. Although the SAM can overcome the matrix effects and recovery rates, the readers seem to be of interest in these parameters. Four alphabetical symbols A′, B′, C and D as the quantities specified in Fig. 3 are also used to explain the calculation of matrix effects and recovery rates by using modified procedures in SAM.

Simple calculation of matrix effects and recovery rates

For the percent matrix effect, the calculation should be [(B′–D)/C] × 100. For the percent recovery rate, it should be [(A′–D)/(B′–D)] × 100. The symbols can be either peak areas, peak heights, the ratios of (peak area or height)/IS or also finally calculated concentration values.

Relative matrix effect using standard line slope

In the conventional MMCM, the matrix effect can be roughly estimated with the calibration line slope [10]. However, such a value obtained from slopes is less accurate than the above [(B′–D)/C] × 100 value. In the conventional method the calibration is created by spiking different amounts of a target compound into an appropriate blank matrix regardless of the presence and absence of a fixed concentration of IS. The difference of the calibration curves between the MMCM and SAM is the presence of preexisting xenobiotic(s) in SAM. Nevertheless, there will be no difference in the standard line slope. Therefore, standard line slope (a) can be also used as a tentative indicator of matrix effect in SAM. To obtain the control slope (ac) of SAM calibration equation, the matrix should be replaced by water or a mobile phase (neat solvents) for LC–MS/MS. The simple matrix effect (bias) calculation is [(a − ac)/ac] × 100 (%). The negative value obtained shows a suppressive effect, and positive one enhanced effect. Even if the ratio of a peak area against IS is employed, there is no problem, because IS values disappear during the calculation. In addition, it should be taken into mind that the relative matrix effect (bias) calculation can also be influenced by recovery rates of the assay within tested calibration range, which contributes to values of the slope.

Stability

As in the MMCM, it is preferable to present stability data also in studies and case reports using SAM. For this experiment, no quality controls (QCs: blank matrix spiked with target compound) are not necessary. The matrix containing the preexisting xenobiotic spiked or not spiked with the same compound can be used. The data are largely expressed as the percent values. The readers can follow the guideline by Peters et al. [8] in the conventional MMCM because there is essentially no big difference between the QC and original matrix containing preexisting xenobiotic in SAM (with or without spiking) for this experiment.

Discussion

In the present article, the procedures of how to conduct the SAM to quantify target xenobiotic(s) included or embedded in a matrix and of how to validate the SAM quantification in the field of forensic toxicology have been presented. Although major parameters of validations are based on the conventional MMCM, there are no guidelines focused on the details of SAM validation in the field of forensic toxicology and in other fields of analytical chemistry to our knowledge; many of the described details presented in this article are our in-house ones, which are not authorized at this time.

It should be mentioned that the word “MMCM” is not used so widely. For the calibrations by instrumental analyses, the terms “external standard calibration method”, “internal standard calibration method” and “SAM” are most frequently used. Although, strictly speaking, the term “MMCM” may belong to the “external standard calibration method”, but the notion of “matrix” is lacking in this term; thus the word “MMCM” seems most appropriate to use it in comparison with SAM.

As described before, the blank matrix (sometimes called “surrogate matrix”) should be almost the same as  or very similar to the sample matrix; however, even blood and urine samples collected from human cadavers 72 h or more after deaths are exactly not the same as those collected from healthy subjects due to postmortem changes such as autolysis, putrefaction and decomposition of target compound(s). Since the validation data obtained by the conventional MMCM largely depend upon QC samples composed of a blank matrix and standard compounds, the validation data do not reflect those of the sample matrix itself, but reflect those of the blank matrix. This should be kept in mind when the MMCM is used for calibration and data validation. In this respect, the SAM and its validation presented in this article exactly give the method and validation data of the sample matrix of its own. This matter can be regarded as the fourth advantage of SAM.

As can be seen in Table 1, studies and case reports dealing with SAM tend to increase from 2012 to 2020 in forensic toxicology largely because of unavailability of the blank matrix. As stated before, it is currently mandatory to add validation data when publishing such quantitative results in an international journal. According to such trend, when the SAM had to be used, the validation data should accompany their analytical results.

As shown in Table 1 dealing with reports of quantification by SAM in forensic toxicology, we have carefully checked their details of validation data; many of the scientists appeared unfamiliar with how to create validation data for SAM. This is the reason why we tried to publish a review article on how to quantify a xenobiotic compound in an impurity-rich matrix and on how to obtain validation data, which endorse the quantification results obtained by SAM.

The outlines of 31 papers dealing with SAM in the forensic toxicology are summarized in Table 1. The 12 out of 31 are not from our laboratories; to avoid the laboriousness to create SAM calibration curve, only three calibration points (in total, four points) were employed to make final quantification of target compound(s) with one-step calibration for seven out of the 12 papers. In the earliest paper (Table 1, No. 1), the concentration of endogenous γ-hydroxybutyrate was lower than the lowest calibration standard spiked as is the case of Fig. 2c, which might cause less accurateness in quantification. If one-step calibration is preferred, the number of calibrators should be increased to at least five (in total six points), and the resulting concentration should be inside the calibrator range.

There were relatively many SAM papers lacking validation data for eight out of the 12 papers. Furthermore, in two papers, only incomplete SAM validation data were presented. There are four papers, in which only quantifications were conducted by SAM, but the validation was performed according to the conventional MMCM. In view of these situations, it has become clear that many of the forensic toxicologists are strangers to how to create validation data for SAM.

These findings show that many of SAM papers in forensic toxicology are very problematic at this moment; the procedure of SAM quantification and its validation should be improved and standardized as soon as possible.

Such situations seem also similar in other fields of analytical chemistry from 1990s to nowadays; many papers related to SAM have appeared in environmental chemistry [57,58,59,60,61,62,63,64], food chemistry [65,66,67,68], clinical chemistry [69,70,71] and miscellaneous fields [72,73,74,75,76,77,78,79,80,81]. The validation approaches in non-forensic toxicology fields were also of various types like those of forensic toxicology as listed in Table 1.

Upon quantification of targeted substances in various matrices using SAM, evaluating the linearities of corresponding calibration curves should be of technical importance. In many cases of constructing calibration curves and measuring concentrations of the targeted substances using the calibrators, their calibration equations can usually be fitted to almost linear curves with a single valuable parameter using linear regression, in which correlation coefficient values being more than 0.99 within  the calibration range employed; single-parameter calibration curves typically can be expressed as linear proportional function of x parameter (concentration) and y value (response) in measurement of target compound concentrations (y = ax + b). In our experience, all 20 studies in our laboratories (Table 1) using SAM and GC–MS/LC–MS/MS gave correlation coefficients more than 0.99 and enabled the analyte concentrations with a single parameter.

However, if calibration curves contain multiple parameters  with explanatory valuables, the calibration equations can demonstrate various nonlinear or polynominal curves, such as parabola (y = ax12 + bx2 + c) and cubic curves (y = ax13 + bx22 + cx3 + d) according to each analytical condition [82]. Especially in forensic area, possible multiple parameters which can be involved in the calibration equation, are usually thought to be matrix effects and recovery rates for various kinds of matrices, or their detected signal responses in analytical instruments including LC–MS/MS [82]. Matrix effects in instrumental analysis and recovery rates in extraction procedure can vary remarkably when tested calibration range between its lower and upper levels being quite wide, because these values rely largely on either concentrations of target compounds or coeluting components in tested matrices. Therefore, the nonlinearity phenomena should also be taken into consideration upon constructing calibration curves using SAM;  although such appearance of the nonlinearity when using SAM technique plus MS techniques has not been reported in the field of forensic toxicology to our knowledge, it should be kept in mind that the SAM analyses and their validations proposed in this article can be applied to real forensic toxicological quantifications only when each calibration curve is explained as a linear model. Regarding quantitative analysis by flame atomic absorption spectrometry using SAM, where calibration curves could be demonstrated as nonlinear pattern, Kościelniak [83] proposed that the more examined samples be diluted more than fivefold; the more linearity of the calibration curve might be improved, so as to fit the curves to the linear model.

On the other hand, relative narrow calibration ranges are usually employed in measurements using the SAM in forensic toxicological area. This may be one of the reasons why we neither experienced significant errors nor nonlinearity of the SAM calibration curves.

The main interest of our research group is the distribution/redistribution of toxic substances and their metabolites in various body fluids and solid tissues collected at autopsies. In such studies, appropriate blank specimens (surrogate matrices) are usually unavailable; we have to use the SAM for such a line of studies. The main aim of such studies is to identify the location of the body, where high concentrations of the xenobiotic or its metabolite(s) can be sampled to prove the consumption of a substance by a user and also to find the best marker(s) (largely metabolite(s)) for identifying the parent compound in question.

Finally, a new methodology called “surrogate analyte approach (SAA)” should be mentioned in comparison with the SAM [84,85,86,87,88,89,90,91,92]. In this method, no blank matrix is also required, but stable isotope-labeled target compound(s) is/are essential to create calibration curve(s) and to gain validation data. All validation data of SAA do not reflect those of the blank matrix, but reflect those of sample matrix only. This is the same as that of SAM. The only disadvantage is that the stable isotope-labeled compounds are generally costly and are not available for every compound to be analyzed at the time of experiment. In the forensic toxicology field, only one paper has been published on the quantification of cocaine in street samples by the SAA in 2017, although the authors did not call it “surrogate analyte approach” [88]. Although some isotope effects on the peak areas take place for SAA, such effects can easily be corrected by using the response factor (RF):

$${\text{RF }} = {\text{ Area}}_{{{\text{stable }}\;{\text{isotope}}}} /{\text{Area}}_{{{\text{natural}}\;{\text{ analyte}}}} .$$

The first step experiment is also required for the SAA to estimate the level(s) of xenobiotic(s) in each matrix, which is described in the first step section of “How to get concentration(s) of preexisting xenobiotic(s) in a matrix by the standard addition method” in this article. As the second step of SAA, each solid tissue matrix or body fluid (sample matrix) all containing the target compound(s) is homogenized or diluted, respectively; a fixed concentration of IS is added to the homogenate or diluted body fluid in advance, which is then divided into ten sample lots of the same volume. Five different concentrations of the stable-isotopic surrogate analyte(s) are spiked into the five sample lots, respectively, to construct calibration curve(s). The linear calibration curve(s) of SAA should pass the zero point. The remaining five lots are used for conducting validation experiments also by spiking the stable isotopic surrogate analyte(s) in place of non-isotopic target standard according to guidelines for the routine MMCM methods [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22]. In the near future, the SAA method will join the SAM in forensic toxicology analyses, because many of analytical procedures, such as screening/the first step for an unknown matrix, stable-isotopic standard addition tests for constructing calibration curve(s), pretreatment/extraction and quantitative instrumental analysis for SAA method can be performed in common with those of SAM.

Both methods can equally work in the absence of blank matrix and can overcome the matrix effects and recovery rates; in addition, both of them function complementarily to each other. The most advantageous point of the SAA is that once a calibration curve of a specific compound in a specific matrix is constructed, every level of the same compound in the same matrix can be quantified using this calibration curve and RF without constructing a new calibration curve; but the most serious weak point is that when the stable isotope-labeled target compound is not available, the experiment cannot be conducted. In contrast, to quantify a compound in a sample matrix, the SAM is almost almighty if the detailed methods described in this article are followed; the method works even in the absence of non-isotopic or isotopic IS and also in the absence of the blank matrix (surrogate matrix), but it has a disadvantage for the necessity of creating a new SAM calibration curve for every new target for quantification, which is more laborious as described before than the conventional MMCM and the SAA method.

The combination of SAA and SAM will enhance the quantitative analytical capability when many stable isotope-labeled compounds including new psychoactive substances (NPS) are available in forensic toxicology laboratories, because a majority of drug abusers are consuming multidrugs [93], some of which are quantified by SAM and some of which are quantified by SAA. Such situation seems ideal for forensic toxicologists.

Conclusions

Based on the literature search together with our experience on the quantification of toxic substances and their metabolites by SAM, we have presented a review article on how to quantify a preexisting compound in a matrix by SAM, on advantages and disadvantages of SAM, and on how to present validation data for SAM without using the blank matrix. In addition, we have introduced 31 articles relating to quantification of toxic/abused compounds preexisting in matrices using SAM, published from 2012 to 2020. Because SAM is becoming an indispensable tool for quantification of xenobiotic(s) in a matrix especially when a blank matrix, or an appropriate IS is not available, and when distribution of a toxic compound and its metabolites over quite different matrices (body fluids and solid tissues of different components) of cadaver is studied, this article seems to serve as timely guidelines for utilizing SAM; this kind of review has not been published elsewhere to our knowledge.