Introduction

Hazardous chemicals co-occur. An exposure to a single, pure chemical does not exist in the real world. Instead, chemical pollution is characterized by complex multi-component mixtures that can easily comprise dozens or even hundreds of chemicals, e.g., [1, 2]. The risk of such mixtures to the environment or human health is typically higher than the risks of each of its individual components (at the concentration at which those compounds are present in the mixture). Even if all chemicals are present only at concentrations that do not cause visible toxic effects, i.e., that do not exceed their individual No Observed Effect Concentrations or Levels (NOECs, NOELs), the mixture might still cause a significant toxicity. This phenomenon has been termed the “something from nothing” behavior of mixtures, and has been repeatedly demonstrated to apply for a broad range of complex chemical mixtures, recently for example in [3, 4], see also [5,6,7] for a summary of older studies.

Even if the components are only present at regulatory “safe” concentrations (so-called Environmental Quality Standards, EQS-values), a toxic mixture effect was observed in ecotoxicological assays [8]. Unfortunately, a similar study that uses toxicological (human health relevant) endpoints, and in which all the components of a multi-component mixture are present at concentrations that are deemed “safe” from a public health perspective (based on, for example, Acceptable Daily Intakes or Tolerable Daily Intakes), is still missing, although the aforementioned “something from nothing” phenomenon has been empirically demonstrated also for toxicological assays and endpoints, see [4] and the studies reviewed in [5, 6].

Consequently, controlling the risk of individual chemicals does not necessarily ensure that the overall risk from combined exposures is acceptable, see [5,6,7] and references therein. Chemical risk assessment and management strategies need to be adapted to the reality of combined exposures [5,6,7, 9,10,11,12,13,14].

The question therefore is, in the context of inherently substance-oriented regulatory frameworks such as REACH, the Regulations on Plant Protection Products (PPR) or the Biocidal Products Regulation (BPR): how can the risks that result from the coincidental co-occurrence of different industrial chemicals, biocides, pesticides and/or other chemicals, be accounted for already during the registration or authorization of an individual substance? Specific and targeted mixture evaluations are often not feasible for coincidental mixtures, given that the evaluation of every relevant co-exposure scenario for all uses of a substance is well beyond the capacity of an individual registration/applicant, in particular because the necessary data are rarely available.

For this purpose, the use of a “Mixture Assessment Factor” (MAF) has been suggested in the European Chemicals Strategy for Sustainability (CSS) [14], which recently received specific support in a joint letter by the Environmental Ministers of Austria, Belgium, Denmark, Finland, Luxemburg, Norway, Spain and Sweden to the European Commission [15].

The CSS requests to “assess how to best introduce in REACH mixture assessment factor(s) for the chemical safety assessment of substances”. However, there are no conceptual reasons why the MAF idea should be restricted to chemicals regulated under REACH. In fact, given that humans and other organisms in the environment are exposed to mixtures comprising chemicals from basically every regulatory realm (biocides, pesticides, pharmaceutical residues, etc.), the MAF idea needs to be extended beyond REACH in the future.

The Swedish Chemicals Agency (KEMI) commissioned a first report on the MAF in 2015 [16], which provided a basis for a report to the Swedish government on instruments for future chemical risk assessment and management [6]. Both documents provide an overview of the fundamental pros and cons of the MAF as an instrument for the regulatory assessment of mixtures. A follow-up report from 2021 [17] then presented a new algorithm for calculating the MAF, see below.

A consensus seemed to emerge during recent workshops between various stakeholders from academia, research institutes, civil society organizations and representatives of many regulatory authorities that the introduction of a MAF is a pragmatic way forward to account for mixture risks in chemical regulation [18,19,20,21,22,23]. Concerns are raised especially by toxicologists working in the area of human health assessment (especially the food safety area), arguing that more precise mixture assessment tools are at hand, which should given preference [24]. It is, however, currently unclear how such instruments could be applied to highly complex mixtures of hundreds of co-occurring chemicals, many of which are toxicologically ill characterized. Industry organizations have especially raised concerns about the anticipated economic impacts of introducing a MAF [25].

In the following, I will first analyze the two fundamentally different options for estimating the numerical size of a MAF. Afterwards the different estimation algorithms will be discussed, followed by an assessment of their robustness in view of less-than-perfect input data. All algorithms presented in this paper are made available in an accompanying Github repository, implemented in Excel and R (https://github.com/ThomasBackhausLab/MAF).

These algorithms provide the scientific foundation for estimating the size of a MAF for different mixture scenarios, which in turn is the basis for implementing the MAF in a regulatory context, as outlined above.

Estimating the MAF

Operationalizing the zero pollution ambition with respect to chemical mixtures

The increasing realization that humans and the environment are too often exposed to unacceptably high concentrations of various hazardous chemicals and their mixtures led to the establishment of the European zero pollution vision for 2050, which aims to ensure that “air, water and soil pollution is reduced to levels no longer considered harmful to health and natural ecosystems” [26].

The estimation of a chemical’s maximum concentration that is “not harmful” for the exposed entity results in regulatory threshold values such as the PNEC (Predicted No Effect Concentration) or DNEL (Derived No Effect Level) for industrial chemicals that are regulated under REACH. Safe use is considered to have been demonstrated if the actual concentration of a chemical in the relevant exposure scenarios does not exceed these regulatory thresholds. That is, the aim of chemical regulation in general is that

$$\frac{{{\text{Exposure}}\,{\text{concentration}}}}{{{\text{Regulatory}}\,{\text{threshold}}}} \le 1.$$
(1)

The maximum acceptable value of 1 for this risk quotient (RQ) is the typical yardstick used during regulatory risk assessment and management.

Under the assumption that Concentration Addition is an adequate approximation of mixture risks and that the summation of RQ values is an adequate approximation of Concentration Addition ([5, 6, 27] and references therein, see also discussion below), it then follows that a mixture poses an acceptable risk if it is ensured that

$$\sum\limits_{i = 1}^{n} {\frac{{{\text{Exposure}}\,{\text{concentration}_{i}}}}{{{\text{Regulatory}}\,{\text{threshold}_{i}}}} = \sum\limits_{i = 1}^{n} {{\text{RQ}}_{i} } } \le 1,$$
(2)

where n is the number of mixture components.

This constitutes an operationalization of the previously mentioned zero pollution ambition that adequately considers the joint toxicity of chemical mixtures. It should be noted that this operationalization applies to the human health context as well as to the assessment of environmental risks of chemical mixtures.

Equation 2 implies that even if all components in a mixture have an RQ of less than 1, i.e., safe use is demonstrated for each individual chemical in the mixture, the overall RQ sum can still exceed the critical value of 1, and the resulting mixture might therefore still put people or the environment at risk. This gap in the current approach to chemical risk assessment and management has raised concern and is the underlying rational why the issue of mixture toxicities and risks is highlighted as a specific task in the CSS.

Two fundamentally different MAF classes: MAF factor and MAF ceiling

Two fundamentally different MAF classes, which will be called MAFfactor and MAFceiling in the following, are described in the literature and have been discussed at various expert workshops.

MAF factor

This MAF class is based on the reduction of the RQ value of each mixture component by a constant factor, the MAFfactor [13]:

$${\mathrm{RQ}}_{\mathrm{MAF}(\mathrm{factor})}=\mathrm{RQ}\times \frac{1}{{\mathrm{MAF}}_{\mathrm{factor}}}.$$
(3)

An inverse (1/MAFfactor) is used here simply for the purpose of ending up with values for MAFfactor greater than 1, which are easier to communicate and interpret than values in the range of 0–1. The residual risk (RR) of the mixture after the application of MAFfactor is then defined as:

$${\text{RR}} = \sum\nolimits_{i = 1}^{n} {\frac{{{\text{Exposure}}\,{\text{concentration}}_{i} }}{{{\text{Regulatory}}\,{\text{threshold}}_{i} }}} \times \frac{1}{{{\text{MAF}}_{{{\text{factor}}}} }} = \sum\nolimits_{i = 1}^{n} {{\text{RQ}}_{i} } \times \frac{1}{{{\text{MAF}}_{{{\text{factor}}}} }}.$$
(4)

Achieving the zero pollution ambition as operationalized in Eq. 2 requires to estimate the minimum MAFfactor that ensures RR ≤ 1. It is important to note that applying a MAFfactor implies that the RQ of every mixture component is lowered by a constant factor, independent of the initial size of its RQ.

In order to differentiate between substances that cause an unacceptably high risks even as a single compound (i.e., substances with an RQ > 1) and the mixture risk, [16] suggests to calculate MAFfactor only after limiting the initial RQ of every mixture component to a maximum value of 1, i.e., after assuming successful single-substance oriented risk management, implemented during substance registration or via subsequent restrictions.

MAF ceiling

The second MAF class is based on the idea that the MAF defines a new value for the maximum acceptable RQ for each compound, lower than the current value of 1 [17, 28]. The original RQ of a mixture component i is reduced to the new ceiling of 1/MAFceiling if it initially exceeds this value. Otherwise, it is left unchanged:

$${\text{RQ}}_{{{\text{MAF(ceiling)}},\,i}} = \left\{ {\begin{array}{*{20}c} {{\text{if}}\,{\text{RQ}}_{i} > \frac{1}{{{\text{MAF}}_{{{\text{ceiling}}}} }},} & {{\text{RQ}}_{i} = \frac{1}{{{\text{MAF}}_{{{\text{ceiling}}}} }}} \\ {{\text{else}}\,{\text{RQ}}_{i} \le \frac{1}{{{\text{MAF}}_{{{\text{ceiling}}}} }},} & {{\text{RQ}}_{i} \,{\text{remains}}\,{\text{unchanged}}} \\ \end{array} } \right.$$
(5)

Again, the inverse (1/MAFceiling) is used in order to end up with numerical MAFceiling values ≥ 1. As before, the residual mixture risk after the application of the MAF is then calculated as the sum of all MAF-adjusted RQ values:

$${\text{RR}} = \sum\limits_{i = 1}^{n} {{\text{RQ}}_{{{\text{MAF(ceiling)}},\,i}} }$$
(6)

Fulfilling the zero pollution ambition, (Eq. 2), requires estimating the smallest possible value for MAFceiling that ensures RR ≤ 1. It should be pointed out that the application of a MAFceiling is conceptually similar to the current approach for assessing “safe use” under REACH. The only difference is that MAFceiling is currently set to 1 and, in order to account for the fact that several chemicals share the same emission space, is now adjusted to a value greater than 1 (i.e., 1/MAFceiling ≤ 1).

Properties of MAF factor and MAF ceiling

Both MAF types share four main assumptions:

1. Multi-component mixtures behave according to Concentration Addition (CA, also called Dose Addition).

This assumption seems justified in view of the empirical data at hand. Synergistic or antagonistic interactions that would lead to a joint risk higher or lower than predicted by CA are rare for relevant multi-component mixtures, see [5, 6] and references therein.

The competing concept of Independent Action (IA, sometimes also called Effect Addition) is not considered for the calculation of the MAF, for three reasons: (1) CA- and IA-based mixture risk predictions of real-world mixtures are often virtually identical [4] and references therein, (2) if they differ, CA provides a slightly (but not excessively) precautionary mixture risk estimate, and (3) the application of IA would require the availability of the full concentration–response curves for all potentially important endpoints for all mixture components—a requirement that is almost impossible to fulfill.

2. The concentration-additive risk of a mixture is adequately estimated as the sum of individual risk quotients (e.g., the sum of PEC/PNEC ratios or Exposure/DNEL ratios) [17].

This implies that PNECs, DNELs and similar regulatory thresholds are adequate and unbiased hazard estimates. However, regulatory thresholds are often, especially for industrial chemicals, derived from few empirical data and the application of conservative assessment factors, and therefore provide hazard estimates that are biased toward precaution. Under these conditions, also RQ sums are conservative mixture risk estimates. This might be less of a problem for data-rich chemicals, such as pesticides.

It should, however, be pointed out that the application of the CA principle does not inflate the overall conservatism. In fact, CA is simply based on the harmonic mean of the component’s concentrations [5], and a CA-based mixture risk thus represents an average of the uncertainties (of the exposure as well as the hazard estimates) and assessment factors used for the individual mixture components.

3. The RQ values of the mixture components are independent of each other.

The outlined MAF approaches assume that the RQ values (i.e., hazard profiles and exposure estimates) of the mixture components are independent of each other. If risk mitigation measures lower exposure to a whole range of mixture components simultaneously, for example as a consequence of an STP plant being upgraded with a tertiary treatment, the MAF algorithms would need to be adapted accordingly. However, in the context of REACH and similar frameworks, where chemicals are assessed individually, the assumption of independence between the RQ values is, in all likelihood, fulfilled.

4. The mixture of interest is well defined.

A numerical MAF value can only be calculated for defined mixtures, as it is an application of the family of the so-called component-based mixture assessment approaches [5, 6]. What is more, only those chemicals can be considered during the MAF calculation for which exposure concentrations and adequate thresholds are known. The MAF is therefore not an estimate of the total site-specific risk. It is an estimate of the excess mixture risk, based on the assumption that a given set of exposure data adequately reflect the overall exposure. This implies that (1) the use of obviously incomplete datasets (in which only a subset of relevant chemicals is included) leads to artificially low MAF values and (2) that a MAF value is always based on an underestimation of the actual mixture risk, as there will always be at least some chemicals not included in an exposure estimate. The second point, however, is less relevant for multi-component mixtures, as long as at least the mixture risk drivers are included in the assessment.

MAFfactor and MAFceiling also differ in several important characteristics. For their usefulness in the context of substance registration under REACH one particular difference matters most: MAFfactor impacts all substance in a mixture equally, even if a component only contributes marginally to the overall risk. This shortcoming of the MAFfactor has been previously criticized [29,30,31], and, in order to address this criticism, MAFceiling was developed [17]. In sharp contrast to MAFfactor, MAFceiling only affects a small subset of the mixture components, i.e., those chemicals whose original RQ exceeds the critical value of 1/MAFceiling; see Fig. 1 and Table 1. The example shown in Table 1 demonstrates that the application of MAFfactor implies that even the RQ values of compounds 5–10, with an original RQ of just 0.01, would need to be lowered (by an improved risk assessment or even risk mitigation measures) to a value of 0.006.

Fig. 1
figure 1

The characteristics of the two fundamental MAF classes (MAFfactor and MAFceiling). The original (arbitrary) mixture comprises 3 chemicals, being present with a risk quotient (RQ) of 0.66, 0.33 and 0.33, i.e., 66%, 33% and 33% of the carrying capacity of the exposed entity (a human body (DNEL) or an ecosystem (PNEC), symbolized by the open box). For this mixture, “safe use” is demonstrated for each individual chemical. However, the RQ sum is 1.33, i.e., the mixture overburdens the exposed entity with 133% of its carrying capacity. Two different classes of MAF can be applied to estimate by how much the overall exposure needs to be reduced. MAFfactor applies an identical factor (1.3. in the example) to every substance in the mixture in order to ensure that 100% capacity is not exceeded. In contrast, MAFceiling establishes a new maximum acceptable single substance RQ of 1/3 = 0.33, which affects only 1 of the 3 mixture components (the first chemical, whose RQ exceed a value of 1/MAFceiling). This makes the RQ values of the MAF-adjusted mixture more similar to each other

Table 1 Numerical comparison of MAFfactor and MAFceiling for a mixture for which safe use is demonstrated for all components

This particular characteristic makes the MAFfactor largely unsuited for an application within the context of single substance registration or authorization under REACH or similar regulations, as it would impact the plethora of small tonnage chemicals and low-risk substances, without resulting in any conceivable improvement of actual risks. This issue can be minimized by applying MAFfactor only to a pre-defined subset of the mixture components (i.e., those with an RQ above a certain critical value), but that would basically be just another approach for approximating a MAFceiling.

MAFfactor reduces the RQ of all mixture components equally by the numerical value of MAFfactor. MAFceiling, on the other hand, implies different risk reduction requirements for different mixture components. For example, while the RQ of the first compound in Table 1 needs to be reduced by a factor of 0.8/0.28 = 2.86, the RQ values of the second and third compounds only need to be reduced by a factor of 0.3/0.28 = 1.07.

Especially in the context of highly complex multi-component mixtures it should not go unnoticed that the application of MAFceiling implies that all the chemicals with an RQ below 1/MAFceiling get a “free ride”. That is to say that, as long as a concentration-additive behavior of the mixture is assumed, those small-RQ compounds add to the mixture risk—but this contribution is not reduced by the application of the MAF.

A MAFfactor that reduces the original RQ sum to a certain pre-defined RR value is always smaller than the corresponding MAFceiling value, which is a direct consequence of MAFfactor affecting all components of the mixture while MAFceiling only affects the mixture risk drivers. The smaller value of a MAFfactor might seem tempting, but care needs to be taken to balance that against the fact that a MAFfactor would impact each and every chemical on the market.

Only the application of a MAFfactor fully preserves the original RQ distribution of the mixture, although the RQ ranking order is not affected by the application of either MAF class.

Algorithms to estimate the MAF

At least three different algorithms to establish the numerical value for MAFfactor or MAFceiling have been described in different papers, reports and opinion pieces.

Algorithm 1: represents the simplest and most basic approach: a value for MAFfactor or MAFceiling can simply be set a priori, without any consideration of the risk of the original mixture, the underlying RQ distribution or a particular acceptability criterion for the residual risk of the mixture, e.g., [22]. As a consequence, the MAF remains constant across mixture scenarios, but the residual risks fluctuate greatly, in direct relation to the absolute risk and the RQ distribution of the original mixtures.

Algorithm 2: makes use of certain characteristics of the initial mixture in order to establish a MAF. An implementation of this algorithm is presented in [28], where a MAFceiling is estimated as the number of “risk drivers” in the original mixture. The authors define a mixture risk driver as a chemical that belongs to the sub-group of mixture components that, in their totality, make up a defined percentage of the overall RQ sum, e.g., 90% of the RQ sum (subsequently denoted as MAF90%). Because this approach anchors the MAF to a characteristic of the RQ distribution of the original mixture (i.e., the number of risk driving compounds), different mixtures might result in different MAF values. However, as the approach does not anchor the MAF to the total risk of the mixture and also does not specify an acceptability criterion for the MAF application, the residual risk RR of the mixture after the application of a MAF, calculated according to this approach, varies between different mixtures.

Algorithm 3: establishes the MAF in relation to a pre-defined acceptability criterion for the MAF-adjusted mixture. An algorithm for calculating the minimum value for MAFfactor that ensures an RR of exactly 1 (with a view to the European zero pollution vision, see above) has been presented in [16]. This algorithm calculates the MAFfactor as the sum of the individual RQs, after limiting each individual RQ to a maximum of 1 (i.e., the maximum RQ that is still compatible with the current “safe use” idea under REACH). The MAFfactor is then numerically identical to the so-called maximum cumulative ratio (MCR) of the mixture after limiting each individual RQ to 1 [16].

An iterative approach to calculate the minimum MAFceiling that ensures an RR of exactly 1 has been recently published in [17]. This implementation of the MAFceiling will therefore be abbreviated as MAFexact in the following. Figure 2 provides the details on the logic of the MAFexact calculation. An implementation in Excel and R is available at https://github.com/ThomasBackhausLab/MAF.

Fig. 2
figure 2

Iterative algorithm for estimating MAFexact. RQ: risk quotient such as PNEC or DNEL. RQSum: sum of risk quotients of all mixture components. RQSumMan: RQSum after limiting the maximum individual RQ to 1

Because this algorithm takes the initial RQ distribution into account and is based on achieving a defined numerical acceptability criterion (RR ≤ 1), the MAFexact values resulting from applying Algorithm 3 are highly specific for each analyzed mixture.

Which algorithm to select?

If MAFceiling is the preferred MAF class, this raises the question on how to select between the different algorithms for estimating numerical MAF values. “Algorithm” 1 (setting an a priori MAF value) is the most pragmatic approach, but does not consider any characteristics of environmentally relevant mixtures. In particular, it is basically impossible to define scientific or policy criteria for the a priori selection of a particular numerical value beyond mere convenience and outward appearance. It would be impossible, without accompanying and more detailed analyses, to assess whether, and to what extent, certain policy goals (e.g., the European zero pollution ambition) would be achieved by the MAF implementation via “Algorithm” 1.

Algorithms 2 and 3 differ in one crucial aspect, i.e., the residual risk, which is the RQ sum of the MAF-adjusted mixture. The residual risks after an application of Algorithm 2 might differ considerably even between two mixtures that pose the same initial risk, in dependency of the RQ distribution characteristics of the original mixture. In contrast, the application of Algorithm 3 always results in a residual risk that corresponds exactly to the pre-defined acceptability criterion (RQ sum ≤ 1). This behavior is demonstrated for several arbitrary (but realistic) mixtures in Table 2, contrasting MAFexact and MAF90% (as an example of a MAFceiling). It can be clearly seen that the application of Algorithm 2 for defining a MAFceiling leads to risk over- as well as under-estimations, in comparison to the defined acceptability criterion (RQSum ≤ 1).

Table 2 Numerical comparison of MAFexact and MAF90%

In other words: Algorithm 3 (MAFexact) captures the entire dynamics of risks across all samples included in the assessment of a certain exposure scenario in the distribution of the resulting MAF values. A certain cut-off value (for example, the upper 95% percentile) of this distribution can then be used to set a sufficiently protective MAF value across tall samples (the definition of “sufficiently protective” would obviously be a policy decision). In contrast, Algorithm 2 captures the dynamics of mixture risks across a range of samples in two interlinked distributions: the distribution of MAF values and the distribution of residual risks. This makes the political discussion about a sufficiently, but not excessively, protective MAF value more difficult.

In addition, the application of Algorithm 2 also requires to select a “suitable” percentage value a priori. Shall the MAF be tied to the number of compounds responsible for 80%, 90% or 95% of the RQ sum? The smaller the percentage value, the smaller the resulting MAF but the higher the residual risk of the mixture. Algorithm 3 does not require taking such a decision.

The fact that the application of Algorithm 3 always results in an exact, pre-defined residual risk of the mixture can be considered highly advantageous, as it facilitates the conversation about suitable numerical MAF estimates and because it allows to tailor the acceptable residual risk (and hence the MAF values) to various policy and feasibility considerations. In the examples outlined in Fig. 1 and Tables 1, 2, the acceptability criterion is set to 1, corresponding to the zero pollution vision for 2050. A less stringent criterion might be set, for example, for environmental samples taken in the vicinity of a sewage treatment effluent stream, or for chemicals found in heavily modified water bodies. It is, however, doubtful whether a less stringent criterion should also be applied in the context of human health assessments.

Robustness of the MAF estimate

Neither exposure nor hazard estimates are error free. It is therefore important to compare the robustness of Algorithms 2 and 3 against errors in the RQ estimates of the mixture. Both algorithms are quite robust against risk under- and over-estimations that occur in any of the less relevant mixture components (not shown). However, MAFexact and MAF90% differ in their robustness against over- or under-estimations of the RQ values of the risk drivers. Table 3 shows that MAFexact is incredibly robust against mis-estimations of the RQ of even the most important mixture component.

Table 3 Consequences for MAFexact and MAF90% when the “true” risk of the most important mixture component is under- or overestimated by a factor of 10

This is a consequence of the fact that MAFexact introduces a ceiling for the maximum acceptable value for an individual RQ. For the MAF estimation, it is only important whether an RQ value is above or below the critical value of 1/MAFexact—the exact numerical RQ value is of no relevance (see example in Table 3). This is the reason why the numerical value for MAFexact is identical for all three mixtures shown in Table 3, despite the fact that they pose different risks, with RQ sums between 5.5 and 55.

In contrast, MAF90% is more sensitive to RQ mis-estimations (Table 3), as the numerical RQ values define the number of components needed to account for 90% of the RQ sum.

Any chemical that is unaccounted for leads to an underestimation of the mixture risk. However, if the MAF is actually applied as an upper ceiling for the maximum acceptable single substance risk contribution, any compound that is not included in the initial assessment contributes with just 1/MAFexact at maximum.

MAF exact is the preferred approach

Table 4 provides a summary of the properties of the different MAF classes. In summary, MAFexact has at least seven distinct advantages, compared to other MAF classes:

  1. 1.

    The application of a MAFexact is conceptually identical to the current approach for substance registration and authorization under REACH and similar regulatory frameworks. No new approaches are needed, only the yardstick for defining safe use changes from “safe use is demonstrated if the ratio of expected exposure to hazard threshold is at or below 1” to “safe use is demonstrated if the ratio of expected exposure to hazard threshold is at or below 1/ MAFexact”.

  2. 2.

    MAFexact focusses implicitly on the mixture risk drivers, where improved risk assessment and/or risk mitigation have a tangible impact. Substances with small risk quotients (chemicals with low emission volumes and chemicals with low hazards) are not affected.

  3. 3.

    Mixtures of no concern always result in a MAFexact of 1. This minimizes the risk of overregulating (groups of) exposure scenarios in which the overall risk is acceptable.

  4. 4.

    The dynamics of the mixture risk across a range of exposure scenarios are captured in the distribution of MAFexact values, and are not scattered across several descriptors (distribution of MAF values, distribution of residual risks after the MAF application, a priori decision on a certain percentage of chemicals to be included). This facilitates an informed discussion on suitable protection goals, cost–benefit considerations, etc.

  5. 5.

    Setting the numerical goal of applying MAFexact to “the sum of risk characterization ratios of all individual components shall not exceed 1” is an operationalization of Europe’s political goal of a “toxic-free environment” as outlined in the European Action Plan on a “Pathway to a Healthy Planet for All: Towards Zero Pollution for Air, Water and Soil”. [26]

  6. 6.

    The MAFexact allows a simple adjustment to different protection goals for different exposure scenarios, simply by setting different numerical values for the acceptable residual risk.

  7. 7.

    The MAFexact is robust against exposure and hazard data of poor quality.

Some of those characteristics are shared with the other MAF classes. However, only MAFexact combines all seven characteristics.

Table 4 Properties of the different MAF classes

A case study example

Figure 3 shows the distribution of the RQ values for the mixtures analyzed in Socianu et al. [32]. The underlying exposure data were compiled from aggregated human biomonitoring data that were recorded in the HBM4EU project, https://www.hbm4eu.eu/. 20 chemicals were included in the exposure evaluation of the adult population, 17 chemicals were considered for the infant population, using health-based guidance values for the most critical toxicological endpoint, independent of mode-of-action and grouping considerations.

Fig. 3
figure 3

Distribution of RQ values in the study by Socianu et al. [29]. A RQ values based on median exposure (50% percentile) of adults. B RQ values based on maximum exposures (95% percentile) of adults. C RQ values based on median exposure (50% percentile) of children. D RQ values based on maximum exposure (95% percentile) of children. Only the 15 compounds with the highest RQ values are plotted. DEP diethyl phthalate, BBzP butylbenzyl phthalate, DEHP di(2-ethylhexyl)phthalate, DiBP di-iso-butyl phthalate, DiDP di-isodecyl phthalate, DiNP di-isononyl phthalate, DEHTP di-2-ethylhexyl terephthalate, DPHP di(2-propylheptyl) phthalate, DINCH: di-isononyl cyclohexane-1,2-dicarboxylate, PFOS perfluorooctane sulfonic acid, PFOA perfluorooctanoic acid, BPA bisphenol A, BDE-99 polybrominated diphenylether 99, Cd: cadmium, As: arsenic compounds, Hg: mercury. RQ values taken from Socianu et al. [32]. Please note the different scales of the y-axis in the four sub-plots

Figure 3A, C represents the distribution of the median RQ values for the adult and infant population, respectively, while Fig. 3B, D represents the corresponding upper 95% exposure percentiles. No reliable RQ values could be calculated for children for arsenic compounds, pyrethroids and chlorpyrifos. As arsenic compounds actually contribute quite substantially to the overall risk in the adult population, this explains the lower RCR sum for children and emphasizes how critical it is to include all relevant compounds in the mixture assessment.

The resulting MAF values are summarized in Table 5. As expected (see above), the numerical values for MAFexact are highest, but its application affects only between 4 and 8 substances. These are the substances whose initial RQ values exceed the acceptable value of 1/MAFexact.

Table 5 MAF values determined in the study by Socianu et al. [29]

The factor by which the initial RQ values need to be reduced differs substantially between those compounds that are affected by the application of MAFexact. For example, the maximum risk reduction is required for PFOS in the upper 95% percentile scenario. Even from a single substance perspective, its risk is too high, by a factor of 2.84 (Fig. 3B). Taking MAF considerations into account, the risk contribution needs to be lowered by an additional factor of 10.1 (= MAFexact). On the other hand, the risk contribution of DEHP in the same exposure scenario (Fig. 3B) is acceptable from a single substance perspective (RQ = 0.17), but would need to be lowered by a factor of 1.7 to reach the critical value of 1/ MAFexact = 0.1, i.e., the maximum risk contribution that is acceptable also from a mixture perspective.

The application of MAF90% leads to largely similar results as MAFexact, but with a residual risk constantly exceeding 1 (Table 4). This shows, again, that the application of MAF90% (or similar types of MAF values estimated using algorithms from group 2) does not ensure that the application of the MAF would achieve the goal of the European zero pollution ambition (RCR sum of not more than 1). As discussed earlier, this might be acceptable in certain situations (e.g., mixtures in heavily modified bodies of water). But this should be an intentional regulatory decision, not a semi-random result of insufficiently accurate numerical MAF estimations.

Mixture assessment factor or mixture allocation factor?

The MAF is conceptually different from the “classical” assessment factors used in regulatory chemical risk assessment and management, e.g., under REACH. Such assessment factors conservatively account for data gaps. It is the responsibility of the registrant to adjust them according to the amount and quality of the (eco)toxicological data at hand. As a consequence, a registrant can reduce assessment factors along the guidelines provided by regulatory authorities, by providing more and better (eco)toxicological data.

In sharp contrast, a MAF is the result of a combined hazard and exposure assessment. Given that an individual registrant rarely has the possibility to impact broader co-exposure patterns, the MAF-size cannot be adjusted in the context of a typical tiered risk assessment approach.

The MAF is actually an allocation factor, similar to the allocation factors used for example by the WHO to distinguish the contributions of different exposure pathways to an overall chemical exposure during drinking water quality assessment [33]. Allocation factors reflect the notion that the exposed entity (human or environmental organisms) has a given tolerance for toxic stress and that the different exposure pathways all occupy a certain fraction of the available exposure space, i.e., they all contribute to filling up a “risk cup”, in proportion to the corresponding individual risk quotients. The risk cup / allocation factor concept is applied in the context of evaluating coincidental mixtures already for certain pesticide mixtures under the Food Quality Protection Act of the United States [34]. Taken together with the generally accepted notion that Concentration Addition is a suitable first tier of mixture risk evaluations in general [5, 6] and references therein), the risk cup concept lends itself naturally to a broader application for mixture risk assessments.

The MAF in a regulatory context

The MAF has been suggested as an instrument to account for the fact that real-world mixtures typically cause a risk that is higher than the risk of each of the individual component (at the concentration at which it is present in the mixture), without requesting a whole range of in-depth and scenario-specific mixture risk assessments from each company that registers a substance or applies for its authorization. The MAF is therefore a regulatory instrument for improving the risk assessment and risk management of chemicals, to be applied mainly in the prospective assessment during substance registration or authorization.

MAFexact, as described in detail above, is the scientific approach to estimate the maximum acceptable risk contribution of each component in a given mixture, so that the RQ sum of a given mixture reaches exactly 1. MAFexact is therefore an estimate that is specific for each analyzed mixture. There are two options on how to translate such a scientific specific MAF into a regulatory MAF, i.e., a numerical estimate that can be included in a regulatory framework. Deriving a regulatory MAF requires to decide on one numerical estimate that is representative for a broader exposure scenario, e.g., all the mixtures occurring in a given geographical area or in a defined human (sub-)population. A basic prerequisite for setting a regulatory MAF is that the understanding of co-exposure patterns in the scenario of interest and of the (eco)toxicological properties of the involved chemicals is sufficient to derive reliable estimates on maximum acceptable concentrations.

The first option for estimating a regulatory MAF is to average the exposure data from all mixtures included in a given scenario, in order to describe the typical exposure situation and then calculate a MAF value on this basis (see the case study described above). The second option is to calculate a specific MAF for each individual mixture, i.e., for each sample. From the resulting distribution of all the sample-specific MAF values, a sufficiently protective regulatory MAF can then be estimated, for example by selecting the upper 95% percentile of the distribution of sample-specific MAF values.

The second approach allows a more detailed analysis and provides a better understanding of exposure and risk dynamics. However, implementing this approach might not always be possible. For example, the individual exposure profiles of the participants in a human monitoring study are often not available for re-analysis due to privacy considerations.

It will be a major challenge to decide which samples should be grouped together for sizing a regulatory MAF. How broad or how specific are the criteria used for grouping samples to define a regulatory MAF? Are the grouping criteria defined in relation to, for example, chemical class, geographic area or exposed (sub)population)?

If the scenario definition is too broad, the resulting regulatory MAF risks being either over- or under-protective for many of the situations that were grouped together. If the scenario definition is too specific, there will be a whole collection of MAF values to be considered for a given substance, making the MAF less useful as a regulatory instrument. Finding the right balance here depends on the differences of the MAF values in scenarios of different specificity. Collecting and analyzing case studies in detail, in order to provide a better understanding of necessary MAF sizes in relation to mixture risks, will be a in important task for the future, for example in the context of the European Partnership for the Assessment of Risks from Chemicals (PARC).

The MAF reflects the acceptable contribution of an emitted chemical to the overall toxic burden of an ecosystem or a human population. It is based on a broad overview of chemical exposures, amended, if necessary and feasible, by specific considerations (e.g., presence of PBT/vPvB substances or endocrine disrupters) as well as political/socio-economic considerations such as risk–benefit evaluations. Only regulatory authorities can be expected to accrue the necessary knowledge on chemical use and emissions across different actors and use patterns, necessary to initially size and then adjust the MAF in view of evolving scientific knowledge and ever-changing exposure patterns. However, even authorities might need to develop improved exposure evaluations that go beyond current generic estimates and better take co-exposure patterns into account. Finally, only regulatory authorities can be provided with a political mandate to balance the different economic and societal demands, in order to ensure a fair sharing of the available emission space.

The size of the MAF is a result of integrating the co-exposure patterns in a given large-scale scenario with the hazard profiles of the involved compounds. Especially the former will change continuously, as chemical use and production change. Also regulatory decisions, such as use restrictions, will impact the emissions of the affected chemicals. It will therefore be important to design future chemical monitoring and modeling campaigns with a view on describing co-exposure patterns, and to systematically feed the results of these activities back into the review process of chemical regulations.

The current analysis focusses on chemicals that are toxic to the exposed entities and that have no natural background concentration. Incorporating such substances, e.g., phosphate, nitrate or metals, will require further fine-tuning and adjustment of the MAF concept.

Our current understanding of co-exposure patterns and of the (eco)toxicological potency of many of the involved chemicals is still limited, but is continuously increasing across the board. While it is clear that a MAF is urgently needed now, given that current approaches for chemical risk and safety assessment insufficiently protect against elevated mixture risks [5, 6, 13, 35], the first-generation of MAF values might need to be (partly) driven by pragmatic considerations, which should be refined in regular intervals, in view of new empirical data (and changes in chemical use and emission). As usual, the challenge will be to derive a sufficiently, but not excessively protective value, while avoiding paralysis by analysis.

Chemicals on the European market will be affected differently by the application of a MAF:

  1. 1)

    Chemicals with low emission rates and/or low hazards. Chemicals with an RQ value of less than 1/MAF will not be affected by the implementation of a MAF. However, the underlying documents that support chemical safety assessment would need to be checked accordingly, which requires a limited effort from all companies producing or importing chemicals for the European market.

  2. 2)

    Chemicals with an RQ value exceeding 1/MAF for which the RQ value can be lowered by providing better/additional data on their exposure and/or hazard. This group includes chemicals whose safety assessment is based on only few empirical data and cautious default hazard and exposure assessments. Implementing a MAF for those chemicals will require investing additional resources to improve the precision of the underlying exposure and/or hazard assessments, but does not have any direct consequences for the marketing of the chemical.

  3. 3)

    Chemicals with an RQ value exceeding 1/MAF for which the RQ value can only be lowered by implementing additional risk mitigation measures, i.e., exposure reductions.

In particular, the implementation of exposure reduction measures for the third group of chemicals will support achieving the European zero pollution ambition, as outlined in the Chemicals Strategy for Sustainability [14] and European zero pollution action plan [26].