Replacement of animal testing by integrated approaches to testing and assessment (IATA): a call for in vivitrosi

Caloni, Francesca; De Angelis, Isabella; Hartung, Thomas

doi:10.1007/s00204-022-03299-x

Replacement of animal testing by integrated approaches to testing and assessment (IATA): a call for in vivitrosi

Review Article
Open access
Published: 03 May 2022

Volume 96, pages 1935–1950, (2022)
Cite this article

Download PDF

You have full access to this open access article

Archives of Toxicology Aims and scope Submit manuscript

Replacement of animal testing by integrated approaches to testing and assessment (IATA): a call for in vivitrosi

Download PDF

11k Accesses
20 Citations
6 Altmetric
Explore all metrics

Abstract

Alternative methods to animal use in toxicology are evolving with new advanced tools and multilevel approaches, to answer from one side to 3Rs requirements, and on the other side offering relevant and valid tests for drugs and chemicals, considering also their combination in test strategies, for a proper risk assessment.

While stand-alone methods, have demonstrated to be applicable for some specific toxicological predictions with some limitations, the new strategy for the application of New Approach Methods (NAM), to solve complex toxicological endpoints is addressed by Integrated Approaches for Testing and Assessment (IATA), aka Integrated Testing Strategies (ITS) or Defined Approaches for Testing and Assessment (DA). The central challenge of evidence integration is shared with the needs of risk assessment and systematic reviews of an evidence-based Toxicology. Increasingly, machine learning (aka Artificial Intelligence, AI) lends itself to integrate diverse evidence streams.

In this article, we give an overview of the state of the art of alternative methods and IATA in toxicology for regulatory use for various hazards, outlining future orientation and perspectives. We call on leveraging the synergies of integrated approaches and evidence integration from in vivo, in vitro and in silico as true in vivitrosi.

SEURAT: Safety Evaluation Ultimately Replacing Animal Testing—Recommendations for future research in the field of predictive toxicology

Article 30 November 2014

Evolving the Principles and Practice of Validation for New Alternative Approaches to Toxicity Testing

Computer-Based Prediction Models in Regulatory Toxicology

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Toxicology has traditionally relied on animal testing as the primary evidence stream. However, the life sciences have advanced offering a continuously expanding portfolio of technologies, mechanistic understanding and data analysis approaches. To answer to the requirements of testing chemicals and products, as well as to follow the 3Rs concept of Replace, Reduce and Refine to incorporate NAM to complement and substitute complex in vivo studies, toxicology is looking to new approaches and strategies (Caloni et al. 2021). This requires considering new promising tools, like spheroids, organoids, and organs-on-chip (Lee and Lee 2020), jointly referred to as Microphysiological Systems (MPS) (Marx et al. 2016, 2020; Roth and MPS-WS Berlin 2019 2021), building relevant in vitro systems, or to combine different tests in chemico, in vitro, in silico, to mimic the different cellular and molecular events (Rovida et al. 2015), considering also the increasing attention in risk assessment on the combination of chemical substances and mixtures (Kar and Leszczynski 2019; Hayes et al. 2020).

The concept of Integrated Testing Strategies originated early on out of an ECVAM taskforce with idea of combining methods to replace animal testing (Blaauboer et al. 1999; DeJongh et al. 1999; Blaauboer and Barratt 1999). The emergence of the European REACH legislation and its enormous challenges independently furthered this discussion (Ahlers et al. 2008). In a report (Jaworska and Hoffmann 2010) commissioned by the Center for Alternatives to Animal Testing (CAAT), opportunities to better use existing data and guide future testing in toxicology by ITS were elaborated. Based on an earlier OECD workshop (OECD 2008) and earlier work (Jaworska et al. 2010), they delineated the conceptual requirements as being (a) transparent and consistent, (b) rational and (c) hypothesis-driven. Notably, these resonate strongly with the principles of Evidence-based Toxicology (Hoffmann and Hartung 2006).

We have earlier summarized the many reasons for combining tests or test results (Table 1).

Table 1 Reasons to combine tests^a

Full size table

Stand-alone methods, able to substitute completely for an in vivo test, were the frontier of the past, aiming for a full replacement, and demonstrated through validation (Hartung et al. 2004; Leist et al. 2012) to be largely applicable in specific toxicological test like skin irritation, skin corrosion, or phototoxicity testing, the 3Rs vision must take into account the difficulties to find solutions when it is necessary to mimic multiple physiological responses or complex pathways of toxicity. A possible way for overcoming intrinsic limitation of stand-alone method is the use of a testing battery in which different elements of existing information are assembled through a specific sequence of tests. These test batteries are generally defined as integrated approach in which mechanistic data are combined with different information sources. The integrated approaches most frequently applied in regulatory contest, particularly for complex toxicological endpoints as carcinogenicity or skin sensitization, are the IATA, the DA and the Adverse Outcome Pathways (AOP) (Rovida et al. 2015; Eskes 2019; Leist et al. 2017). Table 2 shows definitions for associated terms. OECD has coined the term IATA and it is, therefore, the term of choice for the field. IATA is based on integration of weighed multiple information and an expert judgment is always requested for regulatory decisions (OECD 2017a), while DA is more standardized being based on a fixed data evaluation (Casati 2018a; Eskes 2019). Conversely, AOPs are only based on mechanistic information; they describe the sequence of events that, starting from an initial perturbation at biological level due to a stressor(s), induce an adverse effect in the organism of regulatory relevance going through a series of intermediate events (Vinken et al. 2017; OECD 2018e). Notably, the usefulness of AOP to design IATA was recognized in an OECD workshop (Tollefsen et al. 2014). A fruitful aspect on evolution of alternative methods and their applicability, is the reconversion of pre-existing test, with the necessary changes, to be adopted in another domain, how has seen recently for the Reconstructed human Epidermis models (RhE), firstly applied in skin corrosion (OECD 2019a), but recently included in phototoxicity (OECD 2021a).

Table 2 Definition of reference terms for integrated approaches^a,b,c

Full size table

The term IATA has been coined by OECD^{Footnote 1} (OECD 2008), describing it as “pragmatic, science-based approaches for chemical hazard characterization that rely on an integrated analysis of existing information coupled with the generation of new information using testing strategies. IATA follow an iterative approach to answer a defined question in a specific regulatory context, taking into account the acceptable level of uncertainty associated with the decision context.” In the context of the OECD Guidance Document on the Use of the Adverse Outcome Pathways in developing IATA^{Footnote 2} (Series on Testing and Assessment No. 260 2017), an IATA generic structure was defined consisting of three sequential steps/ parts: (i) the collection of existing information relevant for the chemical under evaluation; (ii) the weight-of-evidence evaluation of the gathered information that may enable regulatory decision-making regarding potential hazard and/or risk or may indicate what additional information is needed for a sound assessment; and (iii) the generation of new testing data to support the conclusion. This, in principle, suggests the use of IATA for all hazards. Next, a review of the state-of-the-art of 3Rs approaches in topical and systemic toxicities with specific view on the state of IATA development shall elucidate innovative perspectives for the future of replacement of the traditional test.

Topical toxicities

Skin irritation and skin corrosion

Reconstructed human epidermis models (RhE), skin tissue derived from non-transformed keratinocytes cultured on insert polycarbonate support that mimic morphological, physiological and biochemical characteristics of the human epidermis (Gordon et al. 2015), are getting more and more attention, for their wide application, becoming relevant in testing cosmetics, chemicals, and in pharmaco-toxicological research (Faller et al. 2003; Kandarova et al. 2005, 2006, 2018; Netzlaff et al. 2005; Schäfer-Korting et al. 2008; Gordon et al. 2015; Kandarova and Hayden 2021).

In skin irritation and corrosion, defined as a reversible and irreversible skin damage, respectively, RhE, that represent the in vitro target organ, firstly adopted for skin corrosion in 2004 (OECD 2019a), and subsequently in skin irritation in 2010 (OECD 2019b), are applied, with different protocols, as stand-alone methods, and identified as model of total replacement based on the endpoint of cell viability, evaluated with the well-known toxicity test MTT.

In skin irritation, that historically used laboratory animals with the Draize Skin Irritation Test (OECD 2015a), the in vitro RhE Test (OECD 2019b, is the only recognized alternative method, including EpiSkin™ (SM), EpiDerm™ SIT (EPI-200), SkinEthic™ RHE, LabCyte EPI-MODEL24 SIT, epiCS^®, Skin + ^®. Actually, these methods represent already very simple testing strategies, because they are preceded by a pH measurement with testing being not necessary if the substance is a strong acid (pH ≤ 2.0) or base (pH ≥ 11.5).

For skin corrosion, two more validated methods, though non-human models, are available: the Rat Skin Trans Cutaneous Electrical Resistance (TER) (OECD 2013), an ex vivo/in vitro test, based on the use of skin from rats at 28–30 days humanely killed, where the endpoint is the alteration of the skin barrier, and the Corrositex^® assay, (OECD 2015b), that is based on the use of an artificial biomembrane, and the endpoint is the barrier damage, caused by corrosive substances, acidic and alkaline.

An implementation of RhE through new technologies like automation or bioprinting, the improvement of the model through for example the partial or total elimination of animal components from the medium, to allow the application of microfluidic techniques, could bring to a broader application of the RhE model in regulatory pharmacology and toxicology, but also in personalised medicine (Kandarova et al. 2021). Moreover, the RhE models, found application in other domains, as demonstrated recently for phototoxicity (OECD 2021a), genotoxicity (Pfuhler et al. 2021), skin sensitization (McKim et al. 2012; Saito et al. 2013; Johansson et al. 2019) and various pharmacological applications.

The methods for skin irritation and corrosion testing have been combined to an IATA in the context of the European REACH legislation and the OECD (2014a). In vivo testing is foreseen only as a last resort for example in those cases where in vitro methods are not suitable for testing the specific substance or if the results of the in vitro tests are not adequate for the regulatory need.

Phototoxicity

Photoxicity is classified as an acute toxic effect due to the activation of photoreactive chemicals by cutaneous exposure to UV or visible light. Exposure to photoreactive chemicals can occur both by topical application (UV-filters, cosmetics) and by systemic administration (drugs). These light-mediated effects may be in turn categorized as photoirritation (acute light-induced skin response to a photoreactive chemical), photoallergy (immune-mediated reaction), and photogenotoxicity (genotoxic response either directly by photoexcitation of DNA or indirectly by excitation of photoreactive chemicals). Assessment of photoxicity (mainly as photoirritation) is requested for cosmetics and drugs. Three OECD Testing Guidelines (TG) based on in vitro assays are currently available: TG 432, based on Neutral Red Uptake (NRU) by 3T3 mouse fibroblast (OECD 2019c), TG 495 based on the evaluation of Reactive Oxygen Species (ROS) formed by chemicals irradiated with a simulated sunlight (OECD 2021b), and TG 498 based on the use of reconstructed human epidermis tissues (OECD 2021a). The in vitro 3T3 NRU Phototoxicity Test was the first in vitro test included in an OECD guideline; it compares cytotoxic effect of a test substance, determined by the relative reduction in cell viability exposed to test chemical, in presence or in absence of light. Several limits have been ascribed to this test as the lack of bioavailability/biokinetics modelling resulting in poor correlation in vivo-in vitro (Onoue et al. 2013) and the high frequency of false-positive results (85% of in vitro 3T3-NRU positive assays were negative in further in vivo testing) (Lynch and Wilcox 2011) but it is generally considered robust enough. Moreover, the use of a mouse cell line has been criticized and the substitution of 3T3 cell line with human keratinocytes was proposed, to provide a more realistic experimental model with respect to the human situation (Clothier et al. 1999; Maciel et al. 2019). To date, 3D human skin models represent a flexible tool to investigate skin alterations after chemical exposure, including phototoxicity, since they take into account also some relevant in vivo parameters as skin penetration or stratum corneum barrier function (uco Dayane et al. 2018; Kim et al. 2020). In this respect, the new OECD TG 498 (OECD 2021a) fully reflects these aspects; it is based on the in vitro test system of the RhE, which closely mimics biochemical and physiological properties of the human epidermis (Dellambra et al. 2019). Moreover, the test system uses human-derived keratinocytes to reconstruct an epidermal model which maintains histology and cytoarchitecture of the human skin. This test guideline is applicable to determine the phototoxic potential of test chemicals after topical application to RhE tissues in presence and absence of simulated sunlight. Phototoxicity potential is evaluated by viability reduction of RhE tissues exposed to the test chemicals. No IATA developments were identified for the phototoxicity hazard. Potentially, the fibroblast and the RhE models could be combined into an IATA. Furthermore, the combination with genotoxicity assays for photogenotoxicity or with skin sensitization assays would be possible, which is, however, these are no standard testing requirement.

Skin sensitization

A skin sensitizer is defined a chemical substance or mixture that can induce an allergic response at the skin level (i.e., allergic contact dermatitis, ACD) after repeated dermal exposure. Currently, more than 3000 chemicals were classified as skin sensitizer (de Ávila et al. 2019). Often skin allergens have electrophilic moieties which ensure their reaction with the nucleophilic sites in skin proteins. In the current regulatory framework, skin sensitization is mandatory for many consumer products and related ingredients as cosmetics or agrochemical products. In recent years, OECD promoted some in vitro approaches addressing different mechanistic events of the skin sensitization process summarised in an AOP (OECD 2014a, b; Basketter 2016). In this AOP four sequential key events (KE) have been identified:

(a) KE-1 (defined also as Molecular Initiating Event, MIE) represents the covalent irreversible binding of electrophilic substances to nucleophilic sites of dermal proteins producing an antigenic stimulation of human immune system. KE-1 is covered by the Direct Peptide Reactivity Assay–DPRA, (OECD 2021c) in which the reactivity of test chemicals towards model synthetic peptides containing either lysine or cysteine are quantified.

(b) KE-2 represents inflammation of keratinocytes, the most common epidermal cells. Methods corresponding to this step are the KeratinoSens™ and the LuSens tests, both based on a specific cell signalling pathway of the antioxidant/electrophile response element (ARE)-dependent pathway, comprised in the ARE-Nrf2 Luciferase Test Method, (OECD 2018a). Luciferase-based reporter assays are highly sensitive and easy to use. Moreover, interferences are rare and usually restricted to direct structural interactions of test substance with the luciferase enzyme.

(c) KE-3 represents the activation of dendritic cells (DC) detected by expression of specific cell surface markers, as chemokines and cytokines, whose maturation and migration to lymph nodes provides the essential trigger for the following KE-4. Assessment of the KE-3 is quite challenging due to the complex biology of DC activation. Three assays addressed this endpoint: (i) the human cell Line Activation Test or h-CLAT method, (ii) the U937 Cell Line Activation Test or U-SENS and (iii) the Interleukin-8 Reporter Gene Assay or IL-8 Luc assay, (OECD 2018b). All these test methods both quantify changes in the expression of cell surface marker(s) associated with the DC activation after exposure to sensitisers (as CD54, CD86) and changes in cytokine IL-8 expression, also associated with DC the activation.

(d) KE-4 represents T cell proliferation and activation. This is the most complex event closely related to immune system response in vivo; so far, no validated in vitro methods are available for this endpoint, so it is generally addressed by the murine Local Lymph Node Assay, LLNA, (OECD 2010a), a validated refinement and reduction alternative in mice.

It is important to highlight that the in vitro tests of KE 1–3 if used alone are not considered to provide a level of information for risk assessment comparable to in vivo tests (e.g., LLNA assay) since they are considered insufficient to cover the complexity of the biological mechanisms occurring in vivo. Therefore, results obtained with these methods have to be used in conjunction with other relevant information, as physico-chemical properties, in IATA or DA framework (Casati 2018a; b; Kleinstreuer et al. 2018). Twelve case studies of Das were submitted to OECD including DA with fixed data interpretation procedures as well as IATA incorporating expert judgment.

Skin sensitization therefore pioneered IATA development: A generic IATA for skin sensitisation was proposed in the European Chemicals Agency’s (ECHA) Guidance to Industry on Information Requirements and Chemical Safety Assessment (Chapter R 7.a, section R.7.3 Skin sensitisation). The guidance was revised to comply with the changes made to the REACH legal text adopted in Version 5.0 2016 making the use of in vitro methods for skin sensitisation testing a standard information requirement and the primary choice over in vivo studies. Several integrated approaches have been constructed for skin sensitisation depending on the final regulatory purpose. With the aim to harmonize the different approaches and to produce at least the same level of information of LLNA assay for hazard identification (i.e., discrimination of sensitising substances from non-sensitizers) a Guideline Document on Defined Approaches for skin sensitization has been recently published as OECD TG 497 (OECD 2021d). Three DAs are included in this Guideline, two of them are also able to provide information for sensitisation potency categorisation, equivalent to the potency categorisation provided by LLNA. In this process, Kleinstreuer et al. (2018) developed generic evaluation categories and criteria for DA (Table 3).

Table 3 Qualitative evaluation categories and criteria for Defined Approaches^a

Full size table

The evaluation criteria in Table 3 from Kleinstreuer et al. (2018) are only brief bullets, which need further details. For example, costs can refer to costs per test, cost of infrastructure, labour, or time. The original publication gives more details, but the development of Defined Approaches and its evaluation is still its infancy and represent a challenge to the validation bodies.

Moving forward to skin sensitization process, the AOP developed for this endpoint also contribute to improve the understanding of mechanisms related to other immune-mediated processes such as chemical respiratory allergy, which share some toxicity pathways with skin sensitization, such as T lymphocyte activation and proliferation (Kimber et al. 2018).

Skin absorption

Skin penetration is considered relevant for occupational and public health risk assessment of specific classes of industrial chemicals such as pesticides and biocides. An OECD TG for an in vitro test for skin absorption was adopted in 2004 (OECD 2010b) and, more recently, OECD released Guidance Notes to facilitate the harmonization of the experimental data from absorption studies (OECD 2019d). TG 428 (OECD 2010b) is based on absorption of a test substance applied to the surface of a skin samples separating the two chambers (donor chamber and receptor chamber) of a glass diffusion cell. Both static and flow-through diffusion chamber are acceptable. Skin samples, to a specific thickness, from human or animal sources can be used. Normally, viable skin is preferred but standardized non-viable skin can also be used checking its integrity prior use. Since human skin is considered less permeable than that of laboratory animals (i.e., rodents), data obtained with human skin samples are considered as stand-alone data to predict the expected absorption in humans. Combination of animal, human in vitro, and human in vivo data is also suggested; for example, the “Triple Pack approach” combines three types of dermal absorption data derived from: (1) in vivo animal; (2) in vitro animal; and (3) in vitro human dermal absorption studies (EFSA Panel on Plant Protection Products and their Residues [EFSA PPR] 2011).

Suitability of commercial reconstructed human epidermis model and/or full human skin equivalents (i.e., including both epidermal and dermal layers) for absorption studies has been investigated with chemicals and drugs. Results show that these models are more permeable than human skins (Henning et al. 2009). These limited barrier properties are mainly attributable to differences in composition and non-homogeneous distribution of lipids that hampered the formation of a continuous lipid barrier (Tfayli et al. 2014). Moreover, absence of the vascular network also plays a critical role in the establishment of barrier functions. For this reason, the use of these skin models for absorption studies is still limited (Neupane et al. 2020).

Simple and reproducible alternatives to human and animal skins are also represented by synthetic artificial membranes (e.g., multi-layered silicone-based membranes) (Abd et al. 2016). They may be easily procured and stored and show less variability than biological skins. Conversely, because of the lack of the superficial barrier of the stratum corneum, they show a poor correlation with human absorption data. So, they are recommended for the initial screening of new molecules while permeation data for hazard assessment should be obtained on biological skin models (Neupane et al. 2020). No IATA developments were identified for this aspect of toxicokinetics (Tsaioun et al. 2016), notably not a hazard. In general, biokinetic (ADME) represent an enormous opportunity for IATA development.

Eye irritation/corrosion

For the evaluation of eye irritation and serious eye damage, historically one of the priorities of alternative methods (Adriaens et al. 2014), no stand-alone test is available, but combined methods in a testing strategy, with a top down and a bottom-up approach, were described (Scott et al. 2009), to replace the in vivo ocular Draize rabbit test (OECD 2017b), where the damage in iris, conjunctiva and cornea is scored.

To date, several NAMs are validated for testing singular chemicals but with limited evidence for mixtures of compounds. Non-human tests, with different endpoints, have been validated like the ex vivo/ in vitro methods of the Bovine Corneal Opacity and Permeability test (BCOP) (OECD 2017c) and the Isolated Chicken Eye test (ICE) (OECD 2018c), where the application of the 3Rs is firstly evident in the ethical sourcing of the collected eyes from slaughterhouses, and further in vitro tests like the Fluorescein Leakage (FL) Test method, (OECD 2017d), a cell based assay with the Madin-Darby Canine Kidney (MDCK) cells and fluoresceine as marker, the Short Time Exposure (STE) In Vitro Test Method (OECD 2020a), with Staten Seruminstitut Rabbit Cornea cells (SIRC), and the in vitro macromolecular test method, (OECD 2019e). The microphysiometer method was validated for non-irritant substances (Hartung et al. 2010) but so far not taken up in an OECD test guideline.

The Reconstructed human Cornea-like Epithelium (RhCE) model, a human-based 3D model (OECD 2019f) mimics the human corneal epithelium. It is an in vitro test for chemicals not requiring classification and labelling for eye irritation and serious damage, like Vitrigel^®-EIT (OECD 2019g) another model based on immortalized cornea cells.

Different commercial tissues of RhCE are reported in the OECD 492 (OECD 2019f), like EpiOcular™ EIT, SkinEthic™ Human Corneal Epithelim HCE EIT, LabCyte CORNEA-MODEL24 EIT, and MCTT HCE™ EIT, are now available (Kolle and Landsiedel 2021).

Even if these models have many advantages and domains of application, there are also some limits that draw attention: one emerging problem, to solve in the near future regulatory requirements, is testing of agrochemical formulations, that even if they fall in the definition of “mixture”, included in OECD guidelines, there are no available NAMs until now for the evaluation of serious eye damage or for skin irritation potential (Kolle and Landsiedel 2021).

IATA development for eye irritation and corrosion has progressed very much in parallel to skin irritation and corrosion with the difference that there is less agreement whether and how the different NAMs in combination can avoid animal testing. The different OECD methods are broadly accepted to identify either non-irritant or corrosive but for irritant substances there is only a weight-of-evidence approach. An IATA has nevertheless been put forward by OECD (2017e). Casati (2018a) concluded “Classification of GHS Category 2 chemicals (eye irritation) can be concluded only with a weight-of-evidence approach. Thus, the decision process within the IATA for serious eye damage and eye irritation IATA cannot be fully standardised at the moment and for this reason, there is no assurance for international use and acceptance of in vitro data in cases where conclusions are derived on the basis of weight-of-evidence”.

Systemic toxicity

Genotoxicity

Genotoxicity refers to the ability of chemical agents to damage genetic information within the cells causing mutations and/or inducing alterations in the DNA molecules, as modifications in nucleotide sequence or in double helix structure. While all mutagenic substances are genotoxic, not all genotoxic substances are mutagenic. Hence, to perform genotoxicity hazard evaluation of chemicals three different endpoints are requested at least, one for mutagenicity, as in vitro gene mutations assays, and two for DNA alteration, as chromosomal aberrations (clastogenicity), and chromosomal damage which can cause a change in their number (aneuploidy). Hence, a test battery is requested utilizing in vitro approaches followed, in some cases, by in vivo confirmation when positive results are reported in vitro. In vitro assays for genotoxicity have been validate for long time and are suitable to identify direct chemical effects on DNA while secondary genotoxicity effects (i.e., mediated by the immune system) are not resolved by standard in vitro approaches so far. A standard approach for in vitro genotoxicity includes: (i) a gene mutation test in bacteria (OECD 2020b) or in mammalian cells using two different gens (OECD 2016a, b) and (ii) a structural chromosomal test, as the in vitro mammalian chromosomal aberration test (OECD 2016c) and the in vitro micronucleus test (OECD 2016d) that identify micronuclei in the cytoplasm of interphase cells. The latter is a very popular and applied test, suitable to detect both clastogenic and aneugenic effects. Moreover, albeit not regulated by specific TGs, detection of DNA damage (as single and double strand breaks or specific DNA lesions) can provide useful information about chemical genotoxic potential. In this respect, the in vitro Comet assay (with or without incorporation of Formamidopyrimidine DNA glycosylase (Fpg)) is widely used to determine DNA effects due to oxidative stress (Kohl 2020). It has been validated (though not peer-reviewed by a validation body yet) in vitro using the human lymphoblastoid TK-6 cell line (Muruzabal et al. 2021) for 3D reconstructed human skin (Pfuhler et al. 2021). One of the main shortcomings of in vitro genotoxicity assays is that they frequently produce false-positive results, due to the coupling of high sensitivity with a relatively low specificity (Corvi and Madia 2017). The combination as a test battery where any positive test results in a positive call accumulates these.

OECD guidelines for in vitro genotoxicity testing are mainly focused on 2D models but, recently, increasing efforts have been channelled to the use and optimization of in vitro 3D models. In particular, robust protocols for genotoxicity assessment have been established for both Micronucleus and Comet assay on 3D models representative of different routes of exposure, i.e., dermal (commercial skin models), inhalation (airway model) and systemic (mainly liver spheroids) (Kooter et al. 2016; Wills et al. 2016; Shah et al. 2018; Conway et al. 2020). However, some experts believe that these assays are not mature enough to develop specific TGs (Pfuhler 2020). The improving of efficiency of in vitro genotoxicity assays is another point that need to be stressed. Development of high-throughput methodology as well as assay miniaturization (96-well microplate-based and chip-based) are considered a priority to produce a large quantity of genotoxicity data in a fast and cost-effective way (Guo et al. 2020).

With current proposals to revise the European REACH prompting in vivo confirmation of positive in vitro mutagenicity findings,^{Footnote 3} IATA development to better interpret in vitro results and possibly include steps before moving to in vivo is most promising.

Carcinogenicity

Carcinogenesis is a multistep process in which transition of normal cells into cancer cells is mediated by a sequence of biological events. Several major hallmarks of this process have been identified including genome instability, inflammation, immunosuppression, and metabolism deregulation (Madia et al. 2021). Evaluation of genotoxicity is also the preliminary step for determination of carcinogenic potential of chemicals. This is a multi-step process that involves complex biological interactions driven by many different factors (genetic, hormonal, age related, environmental, etc.), so it is extremely difficult to address this endpoint by stand-alone in vitro approaches. The in vitro cell transformation assay (CTA) covers some of the key phases of the carcinogenicity process; it was not considered robust enough for an OECD TG but it was reported in two OECD Guidelines, Syrian Hamster Cells (SHE) CTA (OECD 2015c) and Bhas 42 cell line CTA, (OECD 2016e). To date, the use of IATA is strongly recommended for carcinogenicity evaluation (Eskes 2019; Dal Negro et al. 2018) particularly for non-genotoxic carcinogens where a dedicated expert group was established at OECD in 2014 (Corvi et al. 2019; Jacobs et al. 2020). OECD recognising that the CTA alone was insufficient to address non-genotoxic carcinogenicity and that a more comprehensive battery of tests addressing different non-genotoxic mechanisms of carcinogenicity would be needed in the future, identified the need for an IATA to properly address the issue of non-genotoxic carcinogenicity. The expert working group examines the current international regulatory requirements and their limitations in respect to non-genotoxic carcinogenicity, and how an IATA could be developed to assist regulators in their assessment of non-genotoxic carcinogenicity. The development of an ITS is ongoing also in the context of the International Conference on Harmonisation (ICH), the collaboration of pharmaceutical regulators, based on knowledge of pharmacological targets and pathways, together with toxicological and other data. The European Partnership for Alternative Approaches to Animal Testing has started a project to evaluate whether this approach is applicable to the carcinogenicity assessment of pesticides. The promise of IATA for carcinogenicity was recognized in our roadmap exercise (Basketter et al. 2012; Leist et al. 2014). It is exciting to see that several high-level projects are now realizing this vision.

IATAs for carcinogenicity will require the combination of various tests as in vitro genotoxicity and carcinogenicity assays are able to address just some stages of the respective in vivo process that are characterized by multi-step components. To date, the in vitro tests cover specific and basic cellular endpoints (such as gene mutation, chromosomal damage, cell transformation) and no cell signalling or gene expression are considered, though these are increasingly considered as valuable biomarkers.

Developmental and reproductive toxicology

A very high number of laboratory animals is involved in studies for reproductive toxicology (Hartung and Rovida 2009) with many false-positives (Hartung 2009) as well as low inter-species concordance (Smirnova et al. 2018) and in the last years many alternative methods to the in vivo studies were set up and evaluated (Brannen et al. 2016; Corvi et al. 2019).

Three alternative methods, even if not accepted for full replacement, are available (Adler et al. 2011; Pistollato et al. 2021): the mouse Embryonic Stem Cell test (EST), a cell-based assay, Micromass Test (MM), the Whole Embryo Culture (WEC), based on embryonic tissue, limb and cephalic tissue, and the Whole Embryo Culture (WEC), on rat embryos (Pistollato et al. 2021).

The toxicological effects of xenobiotics on EST, is based on three different endpoints: the inhibition of differentiation on Embryonic Stem Cells (D3) and proliferation, through a cytotoxicity test, either on ESC (D3) and 3T3 fibroblasts adult cells, (Spielman et al. 2006; Brannen et al. 2016).

Even if EST is considered a model with a low biological complexity (Brannen et al. 2016), there are many advantages since the cells could be maintained in vitro, reducing costs and it can be applied in high-throughput screening (Brannen et al. 2016), or as a part of an integrated strategy (Pistollato et al. 2021).

We were not able to identify any attempts to establish IATA for reproductive toxicology in the regulatory arena, while such an approach was highly recommended in our roadmap exercise (Basketter et al. 2012; Leist et al. 2014). The EU projects ReProTect (Hareng et al. 2005) and ChemScreen (van den Burg et al. 2011) developed such a preliminary IATA, with promising results (Schenk et al. 2010). Given the resource-demanding impact of this hazard, it is not clear why so little effort is put into developing an IATA for developmental and reproductive toxicology.

Endocrine disruptors

The different and multiple mechanisms of action and pathways of Endocrine Disruptors, and MIEs, imply a complex approach for their toxicological evaluation (OECD 2018d), and, to decrease the number of animals used, the Level 2 of testing strategy, based on in vitro approach, should be implemented with regulatory validated methods. In vitro tests currently available, are focused on the action of chemicals on estrogen receptor agonists and antagonists (OECD 2021e), estrogen receptor binding (OECD 2015d), androgenic receptor agonists and antagonists (OECD 2020c), and on steroidogenesis (OECD 2011).

Future directions and perspectives are oriented to cover other toxicological endpoints, enlarging the field of action, with, for example, assays based on other hormone receptors, or specific target organ effects (Pistollato et al. 2021), and applying methodologies for a large-scale testing, like high throughput screening.

Based on US EPA ToxCast data, which tested about 2000 chemicals in hundreds of robotized assays, pioneering work showed how to use a combination of tests to predict endocrine activity.

(Browne et al. 2015; Kleinstreuer et al. 2018; Judson et al. 2020), which was accepted for the US Endocrine Disruptor Screening Program (EDSP). The respective prediction model is clearly an IATA. So far this has only been applied to estrogenic and androgenic endocrine disruption but it shows the potential of combining a number of tests.

The status of IATA development for different human hazards is summarized in Table 4

Table 4 Status of IATA development for different hazards

Full size table

In vivo testing as part of IATA

With the promise of REACH to further also the use and availability of alternative methods, animal tests were usually only placed as last resort in the IATAs developed. However, we should note first of all that the mere combination of animal tests into one can save animals used (Hartung 2018). For example, short-term toxicity tests with rodents can be combined with developmental toxicity screening assays as OECD Test Guideline 422. Similarly, chronic toxicity studies can be combined with carcinogenicity studies in rodents as OECD Test Guideline 453. A number of genotoxicity tests can be incorporated into acute and short-term in vivo assays. The potential of addressing endocrine disruptor effects within guideline studies needs to be further explored. The problem of multiple testing adding more endpoints to these studies is, however, an important caveat (Hartung 2013). The entire area of lower species such as zebrafish, C. elegans or Drosophila represent further in vivo assays as building blocks for IATA. Last but not least, legacy data of the past represent valuable information not only for the substances themselves but through read-across (Ball et al. 2016) also for similar chemicals.

Progress made in recent years and challenges for future developments toward IATA

IATA are slowly but continuously being embraced in regulatory toxicology as shown for the different hazards above. Major driving forces are the European REACH legislation as well as the US EPA’s ToxCast and EDSP program. The fact that OECD has taken over developing guidance for IATA and DA is of critical importance. OECD has for example developed General Principles for the Reporting of Defined Approaches to Testing and Assessment based on Multiple Information Sources to facilitate their regulatory use. They request a DA to be based on a fixed DIP and a defined set of information sources, associated with the following set of information:

1.
A defined endpoint
2.
A defined purpose
3.
A description of the underlying rationale
4.
A description of the individual information sources used
5.
A description of how data from the individual information sources are processed
6.
A consideration of the known uncertainties

This reminds very much of the requirements for test definition for the validation of alternative methods (Hartung et al. 2004).

An important development was the realization that the fundamental problem of evidence integration from different evidence stream is shared in IATAs, risk assessments and systematic reviews (EFSA 2018), the first prospectively and the latter two retrospectively (Fig. 1). The scheme applies to individual substances such as chemicals and final products (mixtures). Sure, not all evidence generating methods are applicable for the latter, but to some extent this hold also for individual substances (applicability domains). The distinction between prospectively and retrospectively is, however, blurring as more and more existing data are considered in testing strategies and risk assessments/systematic reviews also map testing needs. The advent of machine learning (artificial intelligence, AI) has provided us also with a new tool to do exactly this. This approach is uniquely suited to train on heterogenous datasets, which is called transfer learning. AI makes sense of Big Data (Hartung 2016), which are beside the volume and velocity they are acquired, defined by their variety (forming the 3 V of Big Data). Our earlier work showed, how 74 properties of chemicals could be used to predict hazard classification actually outperforming animal tests (Luechtefeld et al. 2018a, b). For IATA, this has not been sufficiently employed, also because we usually do not have enough training data from NAMs. This has changed with ToxCast and Tox21 and it is most promising to use such information as a source for probabilistic hazard and risk assessment (Maertens et al. 2022).

Schematic illustrating the similarity of combining various evidence streams for deterministic or probabilistic hazard assessment for individual substances or final products (mixtures). Abbreviations (see Table 1 for OECD definition): DIP = Data Interpretation Procedure, i.e., fixed algorithm for interpreting data to derive test result, IATA = Integrated Approach to Testing and Assessment, DA = Defined Approach, RASAR = Read-Across-based Structure/Activity Relationship, A.I. = Artificial Intelligence, aka Machine Learning.

Kinsner-Ovaskainen et al. (2009) gave a number of challenges for ITS/IATA:

Scientific knowledge and guidance on how to develop an ITS; how to combine the different building blocks for an efficient and effective decision-making process?
The extent of flexibility in combining the ITS components;
The optimal combination of ITS components (including the minimal number of components and/or combinations that have a desired predictive capacity);
The applicability domain of single components and the whole ITS;
The efficiency of the ITS (cost, time, technical difficulties)
Need to further discuss and to develop the ITS validation principles

These aspects were discussed in Hartung et al. (2013) but not much conceptual progress has been noted since. With respect to validation, the development of evaluation criteria (Table 3) coupled with performance assessment for the overall DA (Kleinstreuer et al. 2018) was mentioned.

Jaworska and Hoffmann (2010) noted “Because high-throughput datasets may suffer from technical and biological noise or from various technical biases and biological shortcomings, improved statistics are needed for the separation of signal from noise, as well as for better data integration annotating biologically relevant relationships. The logical interpretation of the complex signal propagation leading to an observed effect is not easily comprehensible. Therefore, computational modelling can be expected to play a crucial role in predicting the output from the signal input or system perturbation to obtain a more comprehensive, less technically biased and more accurate view of the true effect”. This call for considering the key role of data management (Wilkinson et al. 2016).

The ITS workshop (Rovida et al. 2015) noted that ITS development for hazard characterization in the context of REACH are prescribed, deterministic and serve classification, while ITS for full safety assessment should be flexible, probabilistic, and fit for purpose. We have recently addressed the important role of probabilistic approaches in general (Maertens et al. 2022). The challenges and opportunities for IATA thus have not changed very much since 2015. Some progress for individual hazards is noted as summarized in Table 3; the tremendous opportunity of this approach, however, have not yet been leveraged.

For the improvement of safety sciences and the replacement of traditional animal-based approaches by IATA, approaches should be relevant combining the different aspects of complexity, from chemicals to models. From a multi-disciplinarity perspective, already existing models coming from different scientific domains could be reconverted and applied in toxicological studies and, at the same time, toxicology could be a source for novel methodologies for other fields for cross-cutting applications to speed up the 3Rs vision.

Notes

Abbreviations

AI:: Artificial intelligence
AOP:: Adverse outcome pathways
ADME:: Absorption, distribution, metabolism and excretion
DA:: Defined approaches for testing and assessment
DIP:: Data interpretation procedure
KE:: Key events
IATA:: Integrated approaches for testing and assessment
ITS:: Integrated testing strategies
LLNA:: Local lymph node assay
MIE:: Molecular initiating event
NAM:: New approach methods
QSAR:: Quantitative structure–activity relationships
RhE:: Reconstructed human epidermis models
SAR:: Structure–activity relationships
TG:: Testing guidelines

References

Abd E, Yousef SA, Pastore MN et al (2016) Skin models for the testing of transdermal drugs. Clin Pharm Adv Appl 8:163–176. https://doi.org/10.2147/CPAA.S64788
Article CAS Google Scholar
Adler S, Basketter D, Creton S et al (2011) Alternative (non-animal) methods for cosmetics testing: current status and future prospects—2010. Arch Toxicol 85:367–485. https://doi.org/10.1007/s00204-011-0693-2
Article CAS PubMed Google Scholar
Adriaens E, Barroso J, Eskes C et al (2014) Retrospective analysis of the draize test for serious eye damage/eye irritation: importance of understanding the in vivo endpoints under UN GHS/EU CLP for the development and evaluation of in vitro test methods. Arch Toxicol 88:701–723. https://doi.org/10.1007/s00204-013-1156-8
Article CAS PubMed Google Scholar
Ahlers J, Stock F, Werschkun B (2008) Integrated testing and intelligent assessment-new challenges under REACH. Env Sci Polluti Res 15:565–572. https://doi.org/10.1007/s11356-008-0043-y
Article Google Scholar
Ball N, Cronin MTD, Shen J et al (2016) Toward good read-across practice (GRAP) guidance. Altex 33:149–166. https://doi.org/10.14573/altex.1601251
Article PubMed Central PubMed Google Scholar
Basketter DA (2016) Skin sensitisation, adverse outcome pathways and alternatives. Altern Lab Anim 44(5):431–436. https://doi.org/10.1177/026119291604400501
Article PubMed Google Scholar
Basketter DA, Clewell H, Kimber I et al (2012) A roadmap for the development of alternative (non-animal) methods for systemic toxicity testing. Altex 29(3):e89. https://doi.org/10.14573/altex.2012.1.003
Article Google Scholar
Blaauboer BJ, Barratt M (1999) The integrated use of alternative methods in toxicological risk evaluation. Altern Lab Anim 27:229–237. https://doi.org/10.1177/026119299902700211
Article CAS PubMed Google Scholar
Blaauboer B, Barratt MD, Houston JB (1999) The Integrated use of alternative methods in toxicological risk evaluation—ECVAM integrated testing strategies task force report 1. Altern Lab Anim 27(2):229–237. https://doi.org/10.1177/026119299902700211
Article CAS PubMed Google Scholar
Brannen KC, Chapin RE, Jacobs AC et al (2016) Alternative models of developmental and reproductive toxicity in pharmaceutical risk assessment and the 3Rs. ILAR J 57(2):144–156. https://doi.org/10.1093/ilar/ilw026
Article CAS PubMed Google Scholar
Browne P, Judson WM, Casey NC et al (2015) Screening chemicals for estrogen receptor bioactivity using computational model. Environ Sci Tecnol 49(14):8804–8841. https://doi.org/10.1021/acs.est.5b02641
Article CAS Google Scholar
Caloni F, Cazzaniga A, Coccini T et al (2021) Second virtual summer school: alternative methods in science: towards model complexity. Altex 38(3):510–512. https://doi.org/10.14573/altex.2106221
Article PubMed Google Scholar
Casati S (2018a) Integrated approaches to testing and assessment. Basic Clin Pharm Toxicol 123:51–55. https://doi.org/10.1111/bcpt.13018
Article CAS Google Scholar
Casati S, Aschberger K, Barroso J et al (2018b) Standardisation of defined approaches for skin sensitisation testing to support regulatory use and international adoption: position of the international cooperation on alternative test methods. Arch Toxicol 92(2):611–617. https://doi.org/10.1007/s00204-017-2097-4
Article CAS PubMed Google Scholar
Clothier R, Willshaw A, Cox H et al (1999) The use of human keratinocytes in the EU/COLIPA international in vitro phototoxicity test validation study and the ECVAM/COLIPA study on UV filter chemicals. Altern Lab Anim 27(2):247–259. https://doi.org/10.1177/026119299902700203
Article CAS PubMed Google Scholar
Conway GE, Shah U-K, Llewellyn S et al (2020) Adaptation of the in vitro micronucleus assay for genotoxicity testing using 3D liver models supporting longer-term exposure durations. Mutagenesis 35:319–329. https://doi.org/10.1093/mutage/geaa018
Article CAS PubMed Central PubMed Google Scholar
Corvi R, Madia F (2017) In vitro genotoxicity testing—can the performance be enhanced? Food Chem Toxicol 106:600–608. https://doi.org/10.1016/j.fct.2016.08.024
Article CAS PubMed Google Scholar
Corvi R, Spielmann H, Hartung T (2019) Alternative approaches for carcinogenicity and reproductive toxicity. In: Balls M, Combes R, Worth A (eds) The history of alternative test methods in toxicology. Academic Press, London, pp 209–218
Chapter Google Scholar
Dal Negro G, Eskes C, Belz S et al (2018) One science-driven approach for the regulatory implementation of alternative methods: a multi-sector perspective. Regul Toxicol Pharm 99:33–49. https://doi.org/10.1016/j.yrtph.2018.08.002
Article Google Scholar
de Ávila RI, Lindstedt M, Campos Valadares M (2019) The 21st century movement within the area of skin sensitization assessment: from the animal context towards current human-relevant in vitro solutions. Regul Toxicol Pharm 108:104445. https://doi.org/10.1016/j.yrtph.2019.104445
Article CAS Google Scholar
Dejongh J, Forsby A, Houston JB et al (1999) An Integrated approach to the prediction of systemic toxicity using computer-based biokinetic models and biological in vitro test methods: overview of a prevalidation study based on the ECITTS project. Toxicol Vitro 13:549–554. https://doi.org/10.1016/s0887-2333(99)00030-2
Article CAS Google Scholar
Dellambra E, Odorisio T, D’Arcangelo D et al (2019) Non-animal models in dermatological research. Altex 36(2):177–202. https://doi.org/10.14573/altex.1808022
Article PubMed Google Scholar
EFSA European food safety authority and ebtc (evidence-based toxicology collaboration) (2018) EFSA scientific colloquium 23: evidence integration in risk assessment: the science of combining apples and oranges. EFSA Support Publ 16(3):1396. https://doi.org/10.2903/sp.efsa.2018.EN-1396
Article Google Scholar
EFSA Panel on Plant Protection Products and their Residues (PPR) (2011) Scientific opinion on the science behind the revision of the guidance document on dermal 30 absorption. EFSA J 9(7):2294. https://doi.org/10.2903/j.efsa.2011.2294
Article Google Scholar
Eskes C (2019) The usefulness of integrated strategy approaches in replacing animal experimentation. Ann Ist Super Sanità 55(4):400–404. https://doi.org/10.4415/ANN_19_04_16
Article PubMed Google Scholar
Faller C, Bracher M, Dami N et al (2003) Predictive ability of reconstructed human epidermis equivalents for the assessment of skin irritation of cosmetics. Toxicol Vitro 16:557–572. https://doi.org/10.1016/s0887-2333(02)00053-x
Article Google Scholar
Gordon S, Daneshian M, Bouwstra J et al (2015) Non-animal models of epithelial barriers (skin, intestine and lung) in research, industrial applications and regulatory toxicology. Altex 32:327–378. https://doi.org/10.14573/altex.1510051
Article PubMed Google Scholar
Guo X, Seo J-E, Li X et al (2020) Genetic toxicity assessment using liver cell models: past, present, and future. J Toxicol Env Health Part B Crit Rev 23(1):27–50. https://doi.org/10.1080/10937404.2019.1692744
Article CAS Google Scholar
Hareng L, Pellizzer C, Bremer S et al (2005) The Integrated project ReProTect: a novel approach in reproductive toxicity hazard assessment. Reprod Toxicol 20:441–452. https://doi.org/10.1016/j.reprotox.2005.04.003
Article CAS PubMed Google Scholar
Hartung T (2009) Toxicology for the twenty-first century. Nature 460:208–212. https://doi.org/10.1038/460208a
Article CAS PubMed Google Scholar
Hartung T (2013) Look Back in anger—what clinical studies tell us about preclinical work. Altex 30:275–291. https://doi.org/10.14573/altex.2013.3.275
Article PubMed Central PubMed Google Scholar
Hartung T (2016) Making big sense from big data in toxicology by read-across. Altex 33:83–93. https://doi.org/10.14573/altex.1603091
Article PubMed Google Scholar
Hartung T (2018) Rebooting the generally recognized as safe (GRAS) approach for food additive safety in the US. Altex 35:3–25. https://doi.org/10.14573/altex.1712181
Article PubMed Google Scholar
Hartung T, Rovida C (2009) Chemical regulators have overreached. Nature 460:1080–1081. https://doi.org/10.1038/4601080a
Article CAS PubMed Google Scholar
Hartung T, Bremer S, Casati S et al (2004) A Modular approach to the ECVAM principles on test validity. ATLA—Altern Lab Anim 32:467–472
Article CAS Google Scholar
Hartung T, Bruner L, Curren R et al (2010) First alternative method validated by a retrospective weight-of-evidence approach to replace the draize eye test for the identification of non-irritant substances for a defined applicability domain. Altex 27:43–51. https://doi.org/10.14573/altex.2010.1.43
Article PubMed Google Scholar
Hartung T, Luechtefeld T, Maertens A et al (2013) Integrated testing strategies for safety assessments. Altex 30:3–18. https://doi.org/10.14573/altex.2013.1.003
Article PubMed Central PubMed Google Scholar
Hayes AW, Muriana A, Alzualde A et al (2020) Alternatives to animal use in risk assessment of mixtures. Int J Toxicol 39(2):165–172. https://doi.org/10.1177/1091581820905088
Article Google Scholar
Henning A, Schaefer UF, Neumann D (2009) Potential pitfalls in skin permeation experiments: influence of experimental factors and subsequent data evaluation. Eur J Pharm Biopharm 72:324–331. https://doi.org/10.1016/j.ejpb.2008.07.016
Article CAS PubMed Google Scholar
Hoffmann S, Hartung T (2006) Towards an evidence-based toxicology. Human Exp Toxicol 25:497–513
Article CAS Google Scholar
Jacobs MN, Colacci A, Corvi R et al (2020) Chemical carcinogen safety testing: OECD expert group international consensus on the development of an integrated approach for the testing and assessment of chemical non-genotoxic carcinogens. Arch Toxicol 94:2899–2923. https://doi.org/10.1007/s00204-020-02784-5
Article CAS PubMed Central PubMed Google Scholar
Jaworska J, Hoffmann S (2010) Integrated testing strategy (ITS)–opportunities to better use existing data and guide future testing in toxicology. Altex 27:231–242. https://doi.org/10.14573/altex.2010.4.231
Article PubMed Google Scholar
Jaworska J, Gabbert S, Aldenberg T (2010) Towards optimization of chemical testing under REACH: a bayesian network approach to integrated testing strategies. Reg Toxicol Pharm 57:157–167. https://doi.org/10.1016/j.yrtph.2010.02.003
Article CAS Google Scholar
Johansson H, Gradin R, Johansson A et al (2019) Validation of the GARD™skin assay for assessment of chemical skin sensitizers—ring trial results of predictive performance and reproducibility. Toxicol Sci. https://doi.org/10.1093/toxsci/kfz108
Article PubMed Central PubMed Google Scholar
Judson R, Houck K, Friedman KP et al (2020) Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol Pharmacol 117:104764. https://doi.org/10.1016/j.yrtph.2020.104764
Article CAS PubMed Google Scholar
Kandarova H, Hayden PJ (2021) Standardised reconstructed skin models in toxicology and pharmacology: state of the art and future development. In: Schäfer-Korting M, Stuchi Maria-Engler S, Landsiedel R (eds) Organotypic models in drug development organotypic models in drug development. Springer Nature, Switzerland AG, pp 57–71. https://doi.org/10.1007/164_2020_417
Chapter Google Scholar
Kandarova H, Liebsch M, Gerner I et al (2005) The EpiDerm test protocol for the upcoming ECVAM validation study on in vitro skin irritation tests–an assessment of the performance of the optimised test. Altern Lab Anim 33:351–367. https://doi.org/10.1177/026119290503300408
Article CAS PubMed Google Scholar
Kandarova H, Liebsch M, Spielmann H et al (2006) Assessment of the human epidermis model SkinEthic RHE for in vitro skin corrosion testing of chemicals according to new OECD TG 431. Toxicol Vitro 20:547–559. https://doi.org/10.1016/j.tiv.2005.11.008
Article CAS Google Scholar
Kandarova H, Willoughby JA, De Jong WH et al (2018) Pre-validation of an in vitro skin irritation test for medical devices using the reconstructed human tissue model epidermTM. Toxicol Vitro 50:407–417. https://doi.org/10.1016/j.tiv.2018.02.007
Article CAS Google Scholar
Kar S, Leszczynski J (2019) Exploration of computational approaches to predict the toxicity of chemical mixtures. Toxics 7:15. https://doi.org/10.3390/toxics7010015
Article CAS PubMed Central Google Scholar
Kim SY, Seo S, Choi KH et al (2020) Evaluation of phototoxicity of tattoo pigments using the 3T3 neutral red uptake phototoxicity test and a 3D human reconstructed skin model. Toxicol Vitro 65:104813. https://doi.org/10.1016/j.tiv.2020.104813
Article CAS Google Scholar
Kimber I, Poole A, Basketter DA (2018) Skin and respiratory chemical allergy: confluence and divergence in a hybrid adverse outcome pathway. Toxicol Res 7:586–605. https://doi.org/10.1039/c7tx00272f
Article CAS Google Scholar
Kinsner-Ovaskainen A, Akkan Z, Casati S et al (2009) Overcoming barriers to validation of non-animal partial replacement methods/integrated testing strategies: the report of an EPAA-ECVAM workshop. Altern Lab Anim 37:437–444. https://doi.org/10.1177/026119290903700413
Article CAS PubMed Google Scholar
Kleinstreuer NC, Hoffmann S, Alépée N et al (2018) Non-animal methods to predict skin sensitization (II): an assessment of defined approaches. Cri Rev Toxicol 48(5):359–374
Article CAS Google Scholar
Kohl Y, Rundén-Pran E, Mariussen E et al (2020) Genotoxicity of nanomaterials: advanced in vitro models and high throughput methods for human hazard assessment—a review. Nanomaterials 10:1911. https://doi.org/10.3390/nano10101911
Article CAS PubMed Central Google Scholar
Kolle SN, Landsiedel R (2021) Human derived in vitro models used for skin toxicity testing under REACh. In: Schäfer-Korting M, Stuchi Maria-Engler S, Landsiedel R (eds) Organotypic models in drug development. Springer Nature, Switzerland AG, pp 9–27. https://doi.org/10.1007/164_2020_417
Chapter Google Scholar
Kooter IM, Gröllers-Mulderij M, Steenhof M et al (2016) Cellular effects in an in vitro human 3D cellular airway model and A549/BEAS-2B in vitro cell cultures following air exposure to cerium oxide particles at an air–liquid interface. Appl Vitro Toxicol 2:56–66. https://doi.org/10.1089/aivt.2015.0030
Article CAS Google Scholar
Lee SJ, Lee HA (2020) Trends in the development of human stem cell-based non-animal drug testing models. Korean J Physiol Pharm 24(6):441–452. https://doi.org/10.4196/kjpp.2020.24.6.441
Article CAS Google Scholar
Leist M, Hasiwa M, Daneshian M et al (2012) Validation and quality control of replacement alternatives—current status and future challenges. Toxicol Res 1:8. https://doi.org/10.1039/C2TX20011B
Article CAS Google Scholar
Leist M, Hasiwa N, Rovida C et al (2014) Consensus report on the future of animal-free systemic toxicity testing. Altex 31:341–356. https://doi.org/10.14573/altex.1406091
Article PubMed Google Scholar
Leist M, Ghallab A, Graepel R et al (2017) Adverse outcome pathways: opportunities, limitations and open questions. Arch Toxicol 31:221–229. https://doi.org/10.1007/s00204-017-2045-3
Article CAS Google Scholar
Luechtefeld T, Rowlands C, Hartung T (2018a) Big-data and machine learning to revamp computational toxicology and its use in risk assessment. Toxicol Res 7:732–744. https://doi.org/10.1039/C8TX00051D
Article CAS Google Scholar
Luechtefeld T, Marsh D, Rowlands C et al (2018b) Machine learning of toxicological big data enables read-across structure activity relationships (RASAR) outperforming animal test reproducibility. Toxicol Sci 165:198–212. https://doi.org/10.1093/toxsci/kfy152
Article CAS PubMed Central PubMed Google Scholar
Lynch AM, Wilcox P (2011) Review of the performance of the 3T3 NRU in vitro phototoxicity assay in the pharmaceutical industry. Exp Toxicol Pathol 63(3):209–214. https://doi.org/10.1016/j.etp.2009.12.001
Article CAS PubMed Google Scholar
Maciel B, Moreira PH, Carmo H (2019) Implementation of an in vitro methodology for phototoxicity evaluation in a human keratinocyte cell line. Toxicol Vitro 61:104618. https://doi.org/10.1016/j.tiv.2019.104618
Article CAS Google Scholar
Madia F, Pillo G, Worth A, Corvi R, Prieto P (2021) Integration of data across toxicity endpoints for improved safety assessment of chemicals: the example of carcinogenicity assessment. Arch Toxicol 95:1971–1993. https://doi.org/10.1007/s00204-021-03035-x
Article CAS PubMed Central PubMed Google Scholar
Maertens A, Golden E, Luechtefeld TH et al (2022) Probabilistic risk assessment–the keystone for the future of toxicology. Altex 39:3–29. https://doi.org/10.14573/altex.2201081
Article PubMed Central PubMed Google Scholar
Marx U, Andersson TB, Bahinski A, Beilmann M et al (2016) Biology-inspired microphysiological system approaches to solve the prediction dilemma of substance testing using animals. Altex 33:272–321. https://doi.org/10.14573/altex.1603161
Article PubMed Central PubMed Google Scholar
Marx U, Akabane T, Andersson TB et al (2020) Biology-inspired microphysiological systems to advance medicines for patient benefit and animal welfare. Altex 37:364–394. https://doi.org/10.14573/altex.2001241
Article Google Scholar
McKim JM Jr, Keller DJ, Gorski JR (2012) An in vitro method for detecting chemical sensitization using human reconstructed skin models and its applicability to cosmetic, pharmaceutical, and medical device safety testing. Cutan Ocul Toxicol 31(4):292–305. https://doi.org/10.3109/15569527.2012.667031
Article CAS PubMed Google Scholar
Muruzabal D, Sanz-Serrano J, Treillard B et al (2021) Validation of the in vitro comet assay for DNA cross-links and altered bases detection. Arch Toxicol 95:2825–2838. https://doi.org/10.1007/s00204-021-03102-3
Article CAS PubMed Central PubMed Google Scholar
Netzlaff F, Lehr C-M, Wertz PW et al (2005) The human epidermis models EpiSkin^®, SkinEthic^® and EpiDerm^®: An evaluation of morphology and their suitability for testing phototoxicity, irritancy, corrosivity, and substance transport. Eur J Pharm Biopharm 60:167–178. https://doi.org/10.1016/j.ejpb.2005.03.004
Article CAS PubMed Google Scholar
Neupane R, Boddu SHS, Jwala RJ et al (2020) Alternatives to biological skin in permeation studies: current trends and possibilities. Pharmaceutics 12:152. https://doi.org/10.3390/pharmaceutics12020152
Article CAS PubMed Central Google Scholar
OECD (2008) Organisation for economic co-operation and development. workshop on integrated approaches to testing and assessment. OECD environment health and safety publications. series on testing and assessment No. 88
OECD (2010a) Organisation for economic co-operation and development. Test guideline 429: skin sensitization: local lymph node assay
OECD (2010b) Organisation for economic co-operation and development. Test guideline 428: skin absorption: in vitro method
OECD (2011) Organisation for economic co-operation and development. Test guideline 456: H295R steroidogenesis assay
OECD (2013) Organisation for economic co-operation and development. Test guideline 430: in vitro skin corrosion: transcutaneous electrical resistance test method (TER)
OECD (2014a) Organisation for economic co-operation and development. Test guideline 168: the adverse outcome pathway for skin sensitisation initiated by covalent binding to proteins
OECD (2014a) Organisation for economic co-operation and development. New guidance document No.203 an integrated approach on testing and assessment (IATA) for skin corrosion and irritation.
OECD (2015a) Organisation for economic co-operation and development. Test guideline 404: acute dermal irritation/corrosion
OECD (2015b) Organisation for economic co-operation and development. Test guideline 435: in vitro membrane barrier test method for skin corrosion
OECD (2015c) Organisation for economic co-operation and development. Guidance document on the in vitro Syrian Hamster Embryo (SHE) cell transformation assay
OECD (2015d) Organisation for economic co-operation and development. Test guideline 493: performance-based test guideline for human recombinant Estrogen Receptor (hrER) in vitro assays to detect chemicals with ER binding affinity
OECD (2016a) Organisation for economic co-operation and development. Test guideline 476: in vitro mammalian cell gene mutation tests using the hprt and xprt genes
OECD (2016b) Organisation for economic co-operation and development. Test guideline 490: in vitro mammalian cell gene mutation tests using the thymidine kinase gene
OECD (2016c) Organisation for economic co-operation and development. Test guideline 473: in vitro mammalian chromosomal aberration test
OECD (2016d) Organisation for economic co-operation and development Test guideline 487: in vitro mammalian cell micronucleus test
OECD (2016e) Organisation for economic co-operation and development. Guidance document 231 on the in vitro BHAS 42 Cell Transformation Assay
OECD (2017a) Organisation for economic co-operation and development. New guidance document No 203 on an integrated approach on testing and assessment (IATA) for skin corrosion and irritation
OECD (2017b) Organisation for economic co-operation and development. Test guideline 405: acute eye irritation/corrosion
OECD (2017c) Organisation for economic co-operation and development. Test guideline 437: bovine corneal opacity and permeability test method for identifying (i) chemicals inducing serious eye damage a (ii) chemicals not requiring classification for eye irritation or serious eye damage
OECD (2017d). Organisation for economic co-operation and development. Test guideline 460: fluorescein leakage test method for identifying ocular corrosives and severe irritants
OECD (2017e) Organisation for economic co-operation and development. Guidance document No. 263 on an Integrated Approach on Testing and Assessment (IATA) for serious eye damage and eye irritation
OECD (2018a) Organisation for economic co-operation and development. Test guideline 442D: in vitro skin sensitization: ARE-Nrf2 luciferase test method
OECD (2018b) Organisation for economic co-operation and development. Test guideline 442E: in vitro skin sensitization: in vitro skin sensitisation assays addressing the key event on activation of dendritic cells on the adverse outcome pathway for skin sensitisation
OECD (2018c) Organisation for economic co-operation and development. Test guideline 438: isolated chicken eye test method for identifying (i) chemicals inducing serious eye damage and (ii) chemicals not requiring classification for eye irritation or serious eye damage
OECD (2018d) Organisation for economic co-operation and development. Revised guidance document No.150 on standardised test guidelines for evaluating chemicals for endocrine disruption
OECD (2018e) Organisation for economic co-operation and development. Users’ Handbook supplement to the guidance document for developing and assessing adverse outcome pathways
OECD (2019a) Organisation for economic co-operation and development. Test guideline 431: in vitro skin corrosion: reconstructed human epidermis (RHE) test method
OECD (2019b) Organisation for economic co-operation and development. Test guideline 439: in vitro skin irritation: reconstructed human epidermis test method
OECD (2019c) Organisation for economic co-operation and development. Test guideline 432: in vitro 3T3 NRU phototoxicity test
OECD (2019d) Organisation for economic co-operation and development. Draft second edition: guidance note on dermal absorption
OECD (2019e) Organisation for economic co-operation and development. Test guideline 496: in vitro macromolecular test method for identifying chemicals inducing serious eye damage and chemicals not requiring classification for eye irritation or serious eye damage
OECD (2019f) Organisation for economic co-operation and development. Test guideline 492: reconstructed human cornea like epithelium (RhCE) test method for identifying chemicals not requiring classification and labelling for eye irritation or serious eye damage
OECD (2019g) Organisation for economic co-operation and development. Test guideline 494: Vitrigel-eye irritancy test method for identifying chemicals not requiring classification and labelling for eye irritation or serious eye damage
OECD (2020a) Organisation for economic co-operation and development. Test guideline 491: short time exposure in vitro test method for identifying (i) chemicals inducing serious eye damage and (ii) chemicals not requiring classification for eye irritation or serious eye damage
OECD (2020b) Organisation for economic co-operation and development. test guideline 471: bacterial reverse mutation test
OECD (2020c) Organisation for economic co-operation and development. Test guideline 458: stably transfected human androgen receptor transcriptional activation assay for detection of androgenic agonist and antagonist activity of chemicals
OECD (2021a) Organisation for economic co-operation and development. Test guideline 498: in vitro phototoxicity – reconstructed human epidermis phototoxicity test method
OECD (2021b) Organisation for economic co-operation and development. Test guideline 495: ROS (reactive oxygen species) assay for photoreactivity
OECD (2021c) Organisation for economic co-operation and development. Test guideline 442c: in chemico skin sensitization: assays addressing the adverse outcome pathway key event on covalent binding to proteins
OECD (2021d) Organisation for economic co-operation and development. Guideline 497: defined approaches on skin sensitization
OECD (2021e) Organisation for economic co-operation and development. Test guideline 455: performance-based test guideline for stably transfected transactivation in vitro assays to detect estrogen receptor agonists and antagonists
Onoue S, Suzuki G, Kato M et al (2013) Non-animal photosafety assessment approaches for cosmetics based on the photochemical and photobiochemical properties. Toxicol Vitro 8:2316–2324. https://doi.org/10.1016/j.tiv.2013.10.003
Article CAS Google Scholar
Pfuhler S, Downs TR, Hewitt JN (2021) Validation of the 3D reconstructed human skinmicronucleus (RSMN) assay: an animal-free alternative for following-up positive results from standard in vitro genotoxicity assays. Mutagenesis 36(1):1–17. https://doi.org/10.1093/mutage/geaa035
Article CAS PubMed Central PubMed Google Scholar
Pfuhler S, van Benthem J, Curren, R et al (2020) Use of in vitro 3D tissue models in genotoxicity testing: Strategic fit, validation status and way forward. report of the working group from the 7th international workshop on genotoxicity testing (IWGT). Mutat Res Genet Toxicol Env Mutagen 850-851: 503135 https://doi.org/10.1016/j.mrgentox.2020.503135
Pistollato F, Madia F, Corvi R et al (2021) Current EU regulatory requirements for the assessment of chemicals and cosmetic products: challenges and opportunities for introducing new approach methodologies. Arch Toxicol 5(6):1867–1897. https://doi.org/10.1007/s00204-021-03034-y
Article CAS Google Scholar
Roth A, Berlin MPS-WS (2019) (2021) Human microphysiological systems for drug development. Science 373:1304–1306. https://doi.org/10.1126/science.abc3734
Article CAS Google Scholar
Rovida C, Alépée N, Api AM et al (2015) Integrated Testing strategies (ITS) for safety assessment. Altex 32(1):25–40. https://doi.org/10.14573/altex.1411011
Article PubMed Google Scholar
Saito K, Nukada Y, Takenouchi O et al (2013) Development of a new in vitro skin sensitization assay (epidermal sensitization assay; EpiSensA) using reconstructed human epidermis. Toxicol Vitro 27(8):2213–2224. https://doi.org/10.1016/j.tiv.2013.08.007
Article CAS Google Scholar
Schäfer-Korting M, Bock U, Diembeck W et al (2008) The use of reconstructed human epidermis for skin absorption testing: results of the validation study. Altern Lab Anim 36:161–187. https://doi.org/10.1177/026119290803600207
Article PubMed Google Scholar
Schenk B, Weimer M, Bremer S et al (2010) The ReProTect Feasibility Study, a novel comprehensive in vitro approach to detect reproductive toxicants. Reprod Toxicol 30:200–218. https://doi.org/10.1016/j.reprotox.2010.05.012
Article CAS PubMed Google Scholar
Scott L, Eskes C, Hoffmann S et al (2009) A proposed eye irritation testing strategy to reduce and replace in vivo studies using bottom-up and top-down approaches. Toxicol Vitro 24(1):1–9. https://doi.org/10.1016/j.tiv.2009.05.019
Article CAS Google Scholar
Shah U-K, de Oliveira MJ, Singh N et al (2018) A three-dimensional in vitro HepG2 cells liver spheroid model for genotoxicity studies. Mutat Res Gen Tox En 825:51–58. https://doi.org/10.1016/j.mrgentox.2017.12.005
Article CAS Google Scholar
Smirnova L, Kleinstreuer N, Corvi R et al (2018) 3S – Systematic, systemic, and systems biology and toxicology. Altex 35:139–162. https://doi.org/10.14573/altex.1804051
Article PubMed Central PubMed Google Scholar
Spielmann H, Seiler A, Bremer S et al (2006) The practical application of three validated in vitro embryotoxicity tests the report and recommendations of an ECVAM/ZEBET workshop (ECVAM workshop 57). Altern Lab Anim 34(5):527–538
Article CAS PubMed Google Scholar
Tfayli A, Bonnier F, Farhane Z (2014) Comparison of structure and organization of cutaneous lipids in a reconstructed skin model and human skin: spectroscopic imaging and chromatographic profiling. Exp Derm 23:441–443. https://doi.org/10.1111/exd.12423
Article CAS PubMed Google Scholar
Tollefsen KE, Scholz S, Cronin MT et al (2014) Applying adverse outcome pathways (AOPs) to support integrated approaches to testing and assessment (IATA). Reg Toxicol Pharm 70:629–640. https://doi.org/10.1016/j.yrtph.2014.09.009
Article Google Scholar
Tsaioun K, Blaauboer BJ, Hartung T (2016) Evidence-based absorption, distribution, metabolism, excretion and toxicity (ADMET) and the role of alternative methods. Altex 33:343–358. https://doi.org/10.14573/altex.1610101
Article PubMed Google Scholar
Uco DP, Leite-Silva VR, Silva HDT et al (2018) UVA and UVB formulation phototoxicity in a three-dimensional human skin model: photo-degradation effect. Toxicol Vitro 53:37–44. https://doi.org/10.1016/j.tiv.2018.07.009
Article CAS Google Scholar
van der Burg B, Kroese ED, Piersma AH (2011) Towards a pragmatic alternative testing strategy for the detection of reproductive toxicants. Reprod Toxicol 31:558–561. https://doi.org/10.1016/j.reprotox.2011.02.012
Article CAS PubMed Google Scholar
Vinken M, Knapen D, Vergauwen L et al (2017) Adverse outcome pathways: a concise introduction for toxicologists. Arch Toxicol 91(11):3697–3707. https://doi.org/10.1007/s00204-017-2020-z
Article CAS PubMed Central PubMed Google Scholar
Wilkinson MD, Dumontier M, Aalbersberg IJ et al (2016) The FAIR guiding principles for scientific data management and stewardship. Sci Data 3:160018. https://doi.org/10.1038/sdata.2016.18
Article PubMed Central PubMed Google Scholar
Wills JW, Hondow N, Thomas AD et al (2016) Genetic toxicity assessment of engineered nanoparticles using a 3D in vitro skin model (EpiDerm™). Part Fibre Toxicol 13:50. https://doi.org/10.1186/s12989-016-0161-5
Article CAS PubMed Central PubMed Google Scholar

Download references

Acknowledgements

Supported by European Union’s Horizon 2020 research and innovation program under grant agreements No. 681002 (EU-ToxRisk) and No. 963845 (ONTOX).

Funding

Open access funding provided by Università degli Studi di Milano within the CRUI-CARE Agreement.

Author information

Authors and Affiliations

Department of Environmental Science and Policy (ESP), Università degli Studi di Milano, Via Celoria 10, 20133, Milan, Italy
Francesca Caloni
Environment and Health Department, Istituto Superiore di Sanità, Viale Regina Elena, 299, 00161, Rome, Italy
Isabella De Angelis
Center for Alternatives to Animal Testing (CAAT), Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
Thomas Hartung
CAAT Europe, University of Konstanz, 78464, Konstanz, Germany
Thomas Hartung

Authors

Francesca Caloni
View author publications
You can also search for this author in PubMed Google Scholar
Isabella De Angelis
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Hartung
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the conception and design of the work and drafted, revised and gave final approval of the version to be published.

Corresponding author

Correspondence to Francesca Caloni.

Ethics declarations

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visithttp://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Caloni, F., De Angelis, I. & Hartung, T. Replacement of animal testing by integrated approaches to testing and assessment (IATA): a call for in vivitrosi. Arch Toxicol 96, 1935–1950 (2022). https://doi.org/10.1007/s00204-022-03299-x

Download citation

Received: 18 February 2022
Accepted: 06 April 2022
Published: 03 May 2022
Issue Date: July 2022
DOI: https://doi.org/10.1007/s00204-022-03299-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Replacement of animal testing by integrated approaches to testing and assessment (IATA): a call for in vivitrosi

Abstract

Similar content being viewed by others

SEURAT: Safety Evaluation Ultimately Replacing Animal Testing—Recommendations for future research in the field of predictive toxicology

Evolving the Principles and Practice of Validation for New Alternative Approaches to Toxicity Testing

Computer-Based Prediction Models in Regulatory Toxicology

Introduction

Topical toxicities

Skin irritation and skin corrosion

Phototoxicity

Skin sensitization

Skin absorption

Eye irritation/corrosion

Systemic toxicity

Genotoxicity

Carcinogenicity

Developmental and reproductive toxicology

Endocrine disruptors

In vivo testing as part of IATA

Progress made in recent years and challenges for future developments toward IATA

Notes

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Replacement of animal testing by integrated approaches to testing and assessment (IATA): a call for in vivitrosi

Abstract

Similar content being viewed by others

SEURAT: Safety Evaluation Ultimately Replacing Animal Testing—Recommendations for future research in the field of predictive toxicology

Evolving the Principles and Practice of Validation for New Alternative Approaches to Toxicity Testing

Computer-Based Prediction Models in Regulatory Toxicology

Introduction

Topical toxicities

Skin irritation and skin corrosion

Phototoxicity

Skin sensitization

Skin absorption

Eye irritation/corrosion

Systemic toxicity

Genotoxicity

Carcinogenicity

Developmental and reproductive toxicology

Endocrine disruptors

In vivo testing as part of IATA

Progress made in recent years and challenges for future developments toward IATA

Notes

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation