Use and Safety of Anthroposophic Medicinal Products: An Analysis of 44,662 Patients from the EvaMed Pharmacovigilance Network
There is a need for data on the clinical safety of anthroposophic medicinal products (AMPs).
The main objective of this analysis was to determine the frequency of adverse drug reactions (ADRs) to AMPs, relative to the number of AMP prescriptions.
EvaMed (Evaluation of Anthroposophic Medicine) was a prospective pharmacovigilance study with the patients of 38 physicians in outpatient care in Germany. Diagnoses and prescriptions were extracted from the electronic medical records. All physicians documented ADRs of Grades III–IV and serious ADRs, seven ‘prescriber physicians’ also documented non-serious ADRs of any intensity. Patients were eligible for this analysis if they had one or more AMP prescription in the years 2001–2010, followed by one or more physician visit.
A total of 44,662 patients with 311,731 AMP prescriptions, comprising 1722 different AMPs, were included. One hundred ADRs to AMPs occurred, caused by 83 different AMPs. ADR intensity was mild, moderate, and severe in 50% (n = 50/100), 43%, and 7% of cases, respectively; one ADR was serious. Among patients of prescriber physicians, ADRs of any intensity occurred in 0.071% (n = 67/94,734) of AMP prescriptions and in 0.502% (n = 65/12,956) of patients prescribed AMPs. In subgroup analyses according to age, specific AMPs or AMP groups, dosage forms, and concentrations (altogether 11 groups), the highest ADR frequency was 0.290% of prescriptions (for one specific AMP). Among all patients, serious ADRs occurred in 0.0003% (n = 1/311,731) of prescriptions and 0.0022% (n = 1/44,662) of patients.
In this analysis from a large sample, ADRs to AMP therapy in outpatient care were rare; ADRs of high intensity as well as serious ADRs were very rare.
This safety analysis comprised 44,662 outpatients with altogether 311,731 prescriptions of anthroposophic medications.
Adverse reactions to these medications were rare (0.071% of prescriptions), serious adverse reactions were very rare (0.0003%).
In this analysis, anthroposophic medication therapy was a safe treatment.
1.1 Pharmacovigilance of Medicinal Products Used in Whole Medical Systems
The clinical safety of drugs (medicinal products, MPs) on the market is an ongoing public health issue and of paramount importance for drug regulation. Pharmacovigilance of MPs is defined as the science and activities relating to the detection, assessment, understanding, and prevention of adverse effects or any other drug-related problems. Methods used in pharmacovigilance include pre-clinical testing, spontaneous reporting of suspected adverse drug reactions (ADRs), case-control studies, cohort studies, database analyses, and clinical trials . National and supranational pharmacovigilance centers (e.g., Uppsala Monitoring Centre of the WHO , European Medicines Agency ) are central players in initiating and coordinating pharmacovigilance activities, but initiatives may also come from other stakeholders.
Modern pharmacovigilance has become differentiated to accommodate special product groups, including herbal  and other MPs used in whole medical systems (WMPs) [7, 13, 43]. Of these WMPs, some (e.g., those used traditionally in China  and on the Indian subcontinent ) were historically produced for local use. In modern times, industrial scale production developed without the rigorous quality control of modern drug manufacturing, and in many countries these WMPs are regulated as food or food supplements or are imported for use without any regulation. Some of these WMPs have been associated with repeated, severe ADRs including toxicity, and current pharmacovigilance initiatives towards these WMPs has to deal with specific challenges (e.g., classification issues, lack of standardization, contamination by heavy metals or pesticides ).
For the WMPs used in homeopathy and anthroposophic medicine (AM), the historical and regulatory situation is different: since the last quarter of the 20th century, they have been marketed in European countries such as Austria, France, Germany, and Switzerland as drugs, manufactured according to Good Manufacturing Practice standards, and subject to modern drug regulation. Toxicologically relevant starting materials (e.g., aconite, cinnabar) are highly diluted according to the safety requirements of European regulations [7, 15, 32]. Adverse reactions to these WMPs are infrequent and usually of mild to moderate severity; anaphylactic reactions occur but are very rare [14, 27, 30]. However, the large majority of these WMPs were introduced on the European market before clinical trials became widespread, and for many MPs safety data from clinical studies are still sparse. Also, there is a need for clinical safety data on children.
A feature of all WMPs is the large number of different MPs used within each medical system, often alongside non-medication therapies. To accommodate for this and other features outlined above, a five-step research strategy for whole medical systems and their WMPs has been proposed: (1) context, paradigms, philosophical understanding, and utilization; (2) safety status; (3) comparative effectiveness; (4) component efficacy; (5) biological mechanisms . In this model, effect evaluation takes place first on the level of the whole system (Step 3: comparative effectiveness) and subsequently on the level of system components (e.g., individual WMPs). Similarly, safety assessment (Step 2) includes analyses on the level of the whole medical system as well as for individual system components . In this context, the present analysis concerns MPs used in the whole medical system of AM (AMPs), and its primary scope is use (Step 1) and safety (Step 2) on the level of the whole system.
1.2 Anthroposophic Medicinal Products and the EvaMed Study
AM was developed in the early 1920s in Germany and Switzerland and is now practiced by an estimated 19,000 physicians around the world . AM therapy is employed in inpatient and outpatient settings in most medical fields and involves a large number of different AMPs, with more than 1300 AMPs marketed in Germany in the years 2012–2014. AMPs are manufactured from plants, minerals, animals, and from chemically definable substances ; quality standards for starting materials and manufacturing procedures are described in the European Pharmacopoeia (Ph.Eur.), in German [Deutsches Arzneibuch (DAB), Deutsches Homöopathisches Arzneibuch (HAB)], French (Ph.fr.) or Swiss (Ph.Helv.) pharmacopoeias or in the Anthroposophic Pharmaceutical Codex (APC) . Manufacturing procedures for AMPs include standard procedures used for the manufacturing of homeopathic or herbal MPs as well as special procedures such as the production of metal mirrors (deposits of metals in reduced state onto a surface ) by chemical vapor decomposition; the processing of herbs by fermentation, toasting, carbonizing, incineration, and digestion (heat treatment at 37 °C); and the cultivation of plants in soils pre-treated with diluted metal salts (vegetabilization). AMPs are manufactured in concentrated form as well as in homeopathic decimal potencies (involving successive 1:10 dilutions) and are administered as oral, rectal, vaginal, conjunctival, nasal, or percutaneous applications or by subcutaneous, intracutaneous, or intravenous injections . To sum up, despite some overlap with herbal and homeopathic MPs, AMPs as a group have distinct properties that could influence their clinical safety.
Safety and effects of AMP treatment have been evaluated both on the level of the whole system and for individual MPs. The most recent comprehensive systematic review of clinical studies of AM from 2011 comprised 255 studies involving AMP treatment, with 38 studies on the level of the whole system and 217 studies of individual AMPs or small AMP groups. External validity was high but many studies had methodological shortcomings. In the reviewers’ conclusion, trials of varying design and quality in a variety of diseases showed mostly good clinical outcomes, only marginal side effects, high patient satisfaction, and presumably slightly lower costs .
An occasion to evaluate the clinical safety of AMPs was given by the Evaluation of Anthroposophic Medicine Pharmacovigilance Network (EvaMed), a German prospective, multicenter, observational study on prescription and safety of MPs among physicians practicing AM. The EvaMed study employed a novel method for automatic extraction of anonymized core data from physicians’ electronic medical records, combined with a structured approach to detection and reporting of suspected ADRs during routine outpatient care. Previous analyses from EvaMed have comprised AMP prescriptions for children during 1 year (2005 ), ADRs from a small MP subgroup (Asteraceae extracts ), and the safety of all MPs over a 5.5-year period (Jan 2004–June 2009) but without data on the subgroup of AMPs . Here we present an analysis of prescription and safety of all AMPs in EvaMed. Compared with previous analyses, this report covers a longer documentation period (10 years, 2001–2010) and provides specific data on ADRs to AMPs, also in pertinent subgroups.
The primary objective of this analysis was to determine the frequency of ADRs to AMPs, relative to the number of AMP prescriptions in EvaMed, for all AMPs as well as in subgroups according to age, specific AMPs or AMP groups, dosage forms, and concentrations.
to describe characteristics of the prescription of AMPs and other MPs;
to compare the frequency of ADRs to AMPs with the frequency of ADRs to Non-AMPs (all other MPs), relative to the number of prescriptions and the number of patients prescribed AMPs and Non-AMPs, respectively.
2.2 The EvaMed Pharmacovigilance Network
EvaMed was a prospective, multicenter, observational study conducted in outpatient medical practices in Germany. Detailed descriptions of settings, participants, and data collection have been published elsewhere [24, 40]. In brief, 38 physicians (21 family physicians, 9 pediatricians, 4 internists, 4 other specialists) from 12 of 16 German Federal States participated; they had a qualification in AM, ≥ 5 years of practice experience, and used an electronic medical record system in their practice which fulfilled the technical requirements for data collection.
For each patient consultation, anonymized core data were extracted automatically from the electronic medical records: consultation date; patient age, gender, diagnoses; prescriptions of all MPs and nonmedication treatment. All physicians were obliged to link all MP prescriptions to the respective indications (diagnoses), and to document all serious ADRs as well as all ADRs of intensity III–IV to any MP. In addition, a subgroup of seven ‘prescriber physicians’ agreed to also document all non-serious ADRs of any intensity. Otherwise, the study was conducted under routine care conditions with ADRs identified at ordinary follow-up consultations, without any additional scheduled follow-up visits. Physicians were remunerated with 15 Euro for each ADR report but not for their regular participation; patients received no remuneration.
2.3 Eligibility Criteria for this Analysis
at least one prescription of an AMP documented in the period 1 January 2001 to 31 December 2010
followed by at least one visit to the EvaMed physician in the period 2 January 2001 to 31 December 2011.
These criteria were selected in order to identify patients for whom ADRs to AMPs could be detected (the EvaMed documentation started in 1997, but during the years 1997–2000 only 70 MP prescriptions were documented altogether and no ADRs to AMPs occurred; therefore these years were excluded from the analysis).
2.4 Outcome Measures
The primary outcome measure for this analysis was the frequency of ADRs to AMPs [any ADR, ADRs Grade III–IV and serious ADRs (SADRs)], relative to the number of AMP prescriptions.
properties of AMPs prescribed: starting materials, manufacturing procedures, concentration of active ingredients, administration forms;
indications for prescription of AMPs and non-AMPs;
frequency of ADRs to non-AMPs, relative to the number of non-AMP prescriptions;
properties of ADRs to AMPs and non-AMPs, respectively: intensity and type of symptoms; intensity, occurrence, duration, management, outcome, and seriousness of ADRs.
2.5 Documentation, Preparation, and Classification of Data
The term ‘prescription’ refers to the prescriptions of one MP, with prescriptions of more than one MP at one physician visit being counted separately.
The dataset for the present analysis was prepared by author KH by checking prescription data in the raw dataset, excluding prescriptions not referring to MPs (e.g., food supplements, body care products), prescriptions of unclassifiable MPs, and consultations without prescriptions. Subsequently, author AG checked, verified, and classified all AMP prescriptions.
2.5.1 Anthroposophic Medicinal Products (AMPs)
AMPs were defined according to the German Medicines Act . For practical purposes, all MPs marketed in Germany by the manufacturers Abnoba (Pforzheim, Germany), Birken (Niefern-Öschelbronn, Germany), Helixor (Rosenfeld, Germany), Wala (Bad Boll, Germany), and Weleda (Arlesheim, Switzerland) were classified as AMPs. For AMPs, the unit of analysis was each product with a separate registration or marketing authorization, corresponding to each AMP with a separate entry in the medicines lists of the respective manufacturers. Accordingly, AMPs listed together within a separate entry but marketed in different concentrations or pack sizes were grouped together.
In EvaMed, the prescribed MPs were identified according to German National Drug Code (Pharmazentralnummer, PZN). However, the PZN codes had not been used consistently for all AMPs, with some AMPs being documented in free text by proprietary names or as abbreviations. In addition, some AMP-related PZN codes were changed during the study period. These factors brought a risk for misclassifications, in particular between anthroposophic and homeopathic or herbal MPs. In order to minimize possible classification errors, all AMP prescriptions (> 300,000) were reassessed by visual inspection of each record in the dataset, and the classification as AMPs was verified by cross-checking with a database of all AMPs available on the market in Germany in the documentation period [the European Scientific Cooperative on Anthroposophic Medicinal Products (ESCAMP) database of AMPs, http://www.escamp.org, data on file].
AMPs were classified with regard to starting materials, manufacturing processes, and dosage forms, according to APC, Fourth Edition .
For individual AMPs mentioned in this paper, proprietary names were recoded as generic names: for AMPs manufactured from between one and three starting materials, the starting materials were listed in the order used in the medicines lists; for AMPs manufactured from four or more starting materials, the two first starting materials were listed, followed by ‘comp.’.
2.5.2 Other Medicinal Products
Non-AMPs were classified at the Research Institute Havelhöhe (German: Forschungsinstitut Havelhöhe, FIH) according to the Anatomical Therapeutic Chemical Classification System (ATC).
Indications for prescriptions were classified by the participating physicians according to the International Classification of Diseases, Tenth Revision (ICD-10); this was part of their routine work, since in the German Statutory Health System, physician remuneration for patient consultations requires at least one ICD-10 diagnosis for each consultation. For this analysis, diagnoses were grouped according to the ICD-10 diagnosis chapters as well as the ICD-10 diagnosis blocks.
2.5.4 Adverse Drug Reactions (ADRs)
date of onset, date of recovery;
occurrence (once, several times, continuous);
overall severity (classified according to the recommendations of the WHO Uppsala Monitoring Centre : Grades I = mild, II = moderate, III = severe, IV = life threatening);
severity of each symptom (mild, moderate, severe);
management (drug withdrawal, dose reduction, change of therapy, no change in drug and no additional treatment, other);
outcome following drug withdrawal;
outcome (recovered, not yet recovered, permanent damage, unknown, death);
current diagnoses at onset of ADR;
all MPs used at onset of ADR;
date of starting MP use;
for each MP used: physicians’ assessment of causal relation to the ADR in question.
seriousness of the ADR (a serious ADR being defined according to the International Conference on Harmonization  as an ADR that results in death, requires inpatient hospitalization or prolongation of existing hospitalization, results in persistent or significant disability/incapacity, or is life threatening);
causal relationship between documented ADR and the MPs used (classified according to WHO Uppsala Monitoring Centre : certain, probable, possible, unlikely, conditional, unclassified, unassessable) [criteria for each category listed in Online Resource 1, see electronic supplementary material (ESM)].
The causality assessment at the FIH was performed independently by two research physicians trained in ADR evaluation, using a pre-defined case report verification form. In case of disagreement between the two research physicians, an expert team of three senior physicians and two pharmacists was consulted .
Symptoms of the ADRs were classified by AG according to the Medical Dictionary for Regulatory Activities (MedDRA, Version 19.0, MedDRA MSSO, McLean, VA, USA). For descriptive purposes, the frequency of ADRs was classified according to the recommendations of the Council for International Organizations of Medical Sciences (CIOMS)  as ‘very common’ (≥ 10%), ‘common’ (1 < x < 10%), ‘uncommon’ (0.1 < x < 1%), ‘rare’ (0.01 < x < 0.1%), and ‘very rare’ (< 0.01%).
Each participating physician received a face-to-face training program to assist in the detection, classification, and reporting of ADRs .
2.6 Data Analysis
Patients fulfilling all eligibility criteria were included in the analysis. Data analysis was performed using IBM SPSS Statistics 19® (International Business Machines Corp., Armonk, NY, USA) and StatXact® 9.0.0 (Cytel Software Corporation, Cambridge, MA, USA). Missing data for individual variables were not replaced.
Analysis was descriptive. In addition, bivariate analyses of independent samples were performed using non-parametric methods: for dichotomous data, Fisher’s exact test was used; for multinomial data, the Fisher–Freeman–Halton test; for rank-ordered or continuous data, Mann–Whitney U test with estimation of median shift and 95% confidence interval (95% CI) according to Hodges–Lehmann. All tests were two-tailed. Significance criteria were p < 0.05 and 95% CI not including 0. Since this was a descriptive study, no adjustment for multiple comparisons was performed .
3.1 Patients, Physicians, and Prescriptions Included in the Analysis
Patient inclusion into the analysis
Inclusion in analysis of
≥ 1 MP prescription 1997–2010
≥ 1 MP prescription 2001–2010
No. 2 + ≥1 AMP prescription
No. 3 + ≥1 physician consultation after AMP prescription (2001–2011)
ADR, Grades III–IV only
No. 4 + patient of prescriber physician
ADR, Grades I–IV
Age groups were 0–2 years (27.1% of evaluable patients, n = 12,065/44,573), 3–6 years (17.9%), 7–10 years (11.1%), 11–17 years (7.5%), 18–44 years (17.7%), 45–64 years (12.6%), and ≥ 65 years (6.2%) with a median age of 8.0 years [range 0–101 years, interquartile range (IQR) 2.0–38.0 years, mean 20.1 years, standard deviation (SD) 22.9]. A total of 57.2% (n = 25,533/44,658) of all patients and 70.8% (n = 11,490/16,230) of adults (aged ≥ 18 years) were women. A total of 54.6% (n = 24,392/44,662) of patients were treated by family physicians, 35.4% by pediatricians, 5.6% by internists, 2.6% by dermatologists, and 1.8% by gynecologists.
Comparing patients of prescriber physicians (n = 12,956 patients) and of other physicians (n = 31,617), the patients of prescriber physicians were a median of 2.0 years younger (95% CI 2.0–2.0 years, p < 0.001), with a slightly lower proportion of females among all patients (55.9 vs 57.7%, p = 0.0003), while the gender distribution in adults did not differ significantly (72.0 vs 70.4%, p = 0.0691).
3.2 Documentation Period, Physician Visits, Prescriptions
For each patient, the documentation period from the first to the last physician visit during the study period was 0–11 months in 34.9% (n = 15,598/44,662) of patients, 12–23 months in 18.8%, 24–35 months in 14.7%, 36–47 months in 12.0%, and ≥ 48 months in 19.5%, with a median documentation period of 21.7 months (IQR 7.7–42.9 months, mean 27.1 months, SD 22.4). The number of physician visits per patient during the whole documentation period was 1–9 visits in 67.2% (n = 30,032/44,662) of patients, 10–19 visits in 20.6%, 20–29 visits in 6.9%, and 30–192 visits in 5.3%, with a median of 6.0 visits (IQR 3.0–12.0 visits, mean 9.8 visits, SD 11.1) per patient.
Compared with the other patients, the patients of prescriber physicians had a longer documentation period (median difference 50.0 days, 95% CI 39.0–62.0 days, p < 0.001) and more physician visits (median difference 1.0 visit, 95% CI 1.0–1.0, p < 0.001).
During the documentation period, the 44,662 patients received a total of 717,545 MP prescriptions, of which 43.4% (n = 311,731) were AMP prescriptions (Table 1); the remaining prescriptions were of conventional (41.2%), homeopathic (10.0%), or herbal (5.4%) MPs (hereafter summarized as ‘non-AMPs’).
Of the 311,731 AMP prescriptions, an ICD-10 diagnosis could be coded for n = 303,725 (97.4%) prescriptions.
Indications for prescriptions of anthroposophic medicinal products, most frequent diagnosis chapters of the International Classification of Diseases, Tenth Revision
Adults (18–101 years)
Children (0–17 years)
Diseases of the respiratory system
Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified
Certain infectious and parasitic diseases
Diseases of the skin and subcutaneous tissue
Mental and behavioural disorders
Diseases of the digestive system
External causes of morbidity and mortality
Diseases of the musculoskeletal system and connective tissue
Endocrine, nutritional and metabolic diseases
Diseases of the circulatory system
Diseases of the ear and mastoid process
Diseases of the genitourinary system
Injury, poisoning and certain other consequences of external causes
Diseases of the eye and adnexa
Diseases of the nervous system
Other chapters (with < 2% in each age group)
Total evaluable diagnoses
Among children aged 0–17 years, the most frequent ICD-10 diagnosis chapters were J00–J99 Respiratory diseases (32.4% of evaluable prescriptions, n = 61,127/188,668), A00–B99 certain infectious and parasitic diseases (9.9%), and R00–R99 Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified (9.8%) (Table 2). The most frequent ICD-10 diagnosis blocks were J00–J06 Acute upper respiratory infections (16.5%, n = 31,080/188,598 evaluable prescriptions), L20–L30 Dermatitis and eczema (5.8%), H65–H75 Diseases of middle ear and mastoid (5.3%), J30–J39 Other diseases of upper respiratory tract (4.3%), and J40–J47 Chronic lower respiratory diseases (4.1%).
3.3.2 Prescriptions and Products
Of the 311,731 AMP prescriptions issued in the period 2001–2010, a total of 311,381 (99.89%) prescriptions could be matched with an AMP recorded in the ESCAMP Database of AMPs, corresponding to 1722 different AMPs, which amounts to 74.2% of a total of 2321 different AMPs marketed in Germany in this period.
Of the prescribed AMPs, 8.8% (n = 152/1722) were only prescribed once during the documentation period, 22.9% of AMPs had 2–9 prescriptions each, 32.2% had 10–49 prescriptions, 27.9% had 50–499 prescriptions, and 8.2% had ≥ 500 prescriptions, with a median of 25.0 prescriptions per AMP (IQR 6.0–99.3 prescriptions, mean 180.8, SD 620.0).
The numbers of AMP prescriptions per patient (repeated prescriptions of the same AMP being counted separately) were 1–4 AMP prescriptions in 54.2% (n = 24,211/44,662) of patients, 5–9 prescriptions in 25.3%, 10–19 prescriptions in 14.1%, and ≥ 20 prescriptions in 6.4%, with a median of 4.0 AMP prescriptions per patient (IQR 2.0–8.0 prescriptions, mean 7.0, SD 9.7). Regarding the number of AMP prescriptions per patient, the patients from prescriber physicians and other physicians did not differ significantly (median difference 0.0 prescriptions, 95% CI 0.0–0.0). A total of 57.6% (n = 179,703/311,731) of AMP prescriptions were issued by family physicians, 34.4% by pediatricians, 5.1% by internists, 1.8% by dermatologists, and 1.1% by gynecologists.
Most frequently prescribed anthroposophic medicinal products
Name of anthroposophic medicinal product
All other products (each with < 1.0% of prescriptions)
Total number of prescriptions
AMPs were manufactured from a single starting material (56.2%, n = 968/1722 AMPs), from more than one starting material (36.5%) or from one or several compositions according to the APC  (7.2%). Among the AMPs manufactured from a single starting material (n = 968), the starting material was of mineral origin in 13.8% (n = 134/968), of botanical origin in 46.8%, of zoological origin in 21.7%, was a chemically definable substance in 13.0%, and a substance (metal salt or mineral) that had undergone vegetabilization in 4.6%.
Regarding concentration of active ingredients, 22.7% of evaluable AMPs (n = 389/1711) were manufactured in concentrated form including mother tinctures, 37.2% were in D1–D3 decimal potencies, 29.7% in D4–6, and 10.3% in potencies ≥ D7. Administration forms of the prescribed AMPs were oral (44.9%, n = 774/1,722 AMPs), parenteral (38.4%), cutaneous (11.7%), ophthalmic (2.3%), rectal (1.7%), and other (0.9%).
In the documentation period, the EvaMed physicians documented a total of 111 cases of ADRs to AMPs. Of these, the causal relation to the AMPs in question was assessed by the research scientists at FIH as certain (n = 33), probable (n = 37), possible (n = 30), unlikely (n = 9), and unclassified (n = 2). The 100 cases with causal relationship assessed as certain, probable or possible were classified as confirmed ADRs to AMPs and included in the ADR analysis.
The 100 confirmed ADRs were caused by a total of 109 AMPs, made up of 83 different AMPs. The cause of the ADR was one single AMP in 92.0% (n = 92/100) of ADRs and two or more AMPs in 8.0%. The numbers of ADRs caused by a single AMP was seven ADRs (1 AMP), five ADRs (3 AMPs), two ADRs (5 AMPs), and one ADR (77 different AMPs). In order to identify possible ADR clusters from similar AMPs, AMPs with similar starting materials were grouped together, yielding nine AMP groups. Of these, one AMP group (mistletoe products = AMPs with Viscum album as starting material) caused 11 ADRs and the remaining eight AMP groups caused two ADRs each.
The 100 ADRs to AMPs affected 95 patients: 90 with one ADR and five with two ADRs each. Of the 95 patients, 54 were adults aged ≥ 18 years and 51 were children aged 0–17 years, with a median age of 36.0 years (IQR 2.0–51.0 years, mean 31.9, SD 27.7). A total of 85.2% (n = 46/54) of adults and 42.5% (n = 17/40) of children with ADRs to AMPs were females.
The intensity of the ADRs to AMPs was Grade I (mild) in 50.0% (n = 50/100) of ADRs, Grade II (moderate) in 43.0%, Grade III (severe) in 7.0%, and Grade IV (life threatening) in 0.0%. The number of symptoms documented for each ADR was one symptom in 74.0% (n = 74/100) of ADRs, two symptoms in 18.0%, three symptoms in 4.0%, four symptoms in 3.0%, and five symptoms in 1.0%, with a mean of 1.4 symptoms per ADR (SD 0.79, median 1.0, IQR 1.0–2.0). Altogether, 139 ADR symptoms were documented, with symptom intensity classified as mild in 18.7% (n = 25/134) of evaluable symptoms, moderate in 59.7%, and strong (German: ‘stark’) in 21.6%. The most common ADR symptoms, according to the MedDRA classification of System Organ Classes, were Skin and subcutaneous tissue disorders (21.6% of symptoms, n = 30/139), Psychiatric disorders (19.4%), Gastrointestinal disorders (17.3%), General disorders and administration site conditions (9.4%), and Respiratory, thoracic and mediastinal disorders (6.5%).
The ADRs occurred only once in 17.0% (n = 17/100) of ADRs, they occurred several times in 40.0%, and were continuous in 43.0%. Duration of the ADR was < 24 h in 4.3% (n = 4/94) of evaluable ADRs, 1–2 days in 29.8%, 3–6 days in 36.2%, 7–13 days in 10.6%, 14–30 days in 10.6%, and 31–103 days in 8.5%, with a median duration of 4.0 days (IQR 2.0–7.5 days, mean 9.8 days, SD 17.0). Management of ADRs was drug withdrawal in 80.4% (n = 78/97) of evaluable ADRs, dose reduction in 11.3%, and no precautions in 8.2%. Outcome of the ADR was ‘recovered’ in 95.0% (n = 95/100) of ADRs and ‘not yet recovered’ in 5.0%. A list of the 100 ADRs to AMPs is presented in Online Resource 2 (see ESM).
Of the 100 ADRs to AMPs, one ADR was classified by the research scientists as a SADR. Notably, for this ADR the documentation was ambiguous with respect to the seriousness. The ADR was classified as serious because of being documented as ‘life threatening at the occurrence of the symptoms’, but the overall intensity of the ADR was not documented as Grade IV (life threatening) but as Grade III (severe). Nonetheless, the classification as a SADR was upheld in the analysis. The SADR occurred in a 45-year-old woman with pre-existing ductal invasive breast cancer (ICD-10 C50.9) and pneumothorax (J93) and was caused by a mistletoe AMP (Herba Visci albi subsp. abietis, 10 mg ampoules for subcutaneous injection). Symptoms of the SADR with respective symptom severity were breathlessness (moderate), panic (strong), paraesthesia of the hands (moderate), and anxiety (strong). The causal relationship between the mistletoe AMP and the SADR was classified as probable. Concurrently with the mistletoe AMP the patient was also using an oil of Argania spinosa, which was judged to be unrelated to the SADR. The SADR occurred once, duration was < 24 h, management was drug withdrawal, outcome was ‘recovered’.
Frequency of adverse drug reactions in relation to prescriptions
All age groups, 0–101 years
Children, 0–17 years
Patients of prescriber physicians
ADRs, Grades I–IV
Patients with ADR
Patients with prescriptions
Patients with ADR/with prescriptions (%)
Patients of all physicians
ADRs, Grades III–IV only
Patients with ADR
Patients with prescriptions
Patients with ADR/with prescriptions (%)
ADRs of any intensity occurred in 0.071% of AMP prescriptions and in 0.502% of patients prescribed AMPs.
ADRs with intensity Grades III–IV occurred in 0.002% of AMP prescriptions and in 0.016% of patients prescribed AMPs.
Frequency of adverse drug reactions to anthroposophic medicinal products, subgroup analyses
All age groups, patients of prescriber physicians
ADRs, all grades
Patients with ADR
Patients with prescription
Patients with ADR/with prescription (%)
All patients of prescriber physicians
Specific AMPs or AMP groups*
Mistletoe AMPs, ampoules
Belladonna/Chamomilla recutita, Radix comp., suppositories
Hepar sulfuris/Membrana sinuum paranasalium bovis, ampoules
Local (cutaneous, ophthalmic, rectal, other)
3.4 Non-anthroposophic Medicinal Products (Non-AMPs)
3.4.1 Prescriptions and Products
During the documentation period, the 44,662 patients received a total of 405,814 prescriptions of non-AMPs. A total of 11.6% (n = 5,189/44,662) of the patients did not have any non-AMP prescriptions, 41.5% had 1–4 non-AMP prescriptions, 20.3% had 5–9 non-AMP prescriptions, 14.8% had 10–19 non-AMP prescriptions, and 11.8% had ≥ 20 non-AMP prescriptions, with a median of 4.0 non-AMP prescriptions for each patient (IQR 1.0–10.0, mean 9.1, SD 16.3, range 0–447 non-AMP prescriptions).
The most common medication groups, according to the ATC system—first level, were Respiratory system (26.6%, n = 107,997/405,814 non-AMP prescriptions), alimentary tract and metabolism (14.5%), various (12.5%), nervous system (9.8%), cardiovascular system (8.2%), dermatologicals (6.4%), and Anti-infectives for systemic use (6.2%). The most common second-level groups were R05 Cough and cold preparations (13.7%, n = 55,475/405,814 prescriptions), V60 Homeopathic MPs1 (9.5%), R01 Nasal preparations (6.8%), J01 Antibacterials for systemic use (5.2%), N02 analgesics (4.8%), R03 Drugs for obstructive airway diseases (4.2%), and M01 anti-inflammatory and antirheumatic products (3.2%).
3.4.2 Adverse Drug Reactions to Non-AMPs
A total of 695 cases of ADRs to non-AMPs were documented, of which 682 ADRs were confirmed by the FIH research scientists. The 682 ADRs affected 637 patients and were caused by 775 different non-AMPs. Of the 682 ADRs, 29 ADRs (4.3%) were SADRs, affecting 29 different patients.
The most common causes of ADRs to non-AMPs, according to the ATC system—second level, were J07 Vaccines (46.0%, n = 353/768 evaluable non-AMPs), J01 Antibacterials for systemic use (24.1%), and R05 cough and cold preparations (7.4%).
3.5 Comparison of AMPs and Non-AMPs
Comparing the most common indications for AMPs versus non-AMPs in adults (Online Resource 3, see ESM) and in children (Online Resource 4, see ESM), respectively, the relative frequency of ICD-10 diagnosis chapters were similar: the only differences above 3% (absolute percentages) were for I00–I99 Diseases of the circulatory system (10.3% of AMP prescriptions vs 15.8% of non-AMP prescriptions), and E00–E90 Endocrine, nutritional and metabolic diseases (7.8% vs 11.5%) in adults and A00–B99 Certain infectious and parasitic diseases (9.9 vs 13.1%) in children.
3.5.2 Features of ADRs
Comparing the relative frequencies (not the absolute numbers) of features of ADRs to AMPs and non-AMPs, respectively, the ADRs to AMPs had significantly lower severity (Grade I severity being a feature of 50.0% for AMPs vs 20.4% for non-AMPs, p < 0.0001), were less likely to occur continuously (43.0 vs 79.4%, p < 0.0001), and were more often treated with drug withdrawal (80.4 vs 56.5%, p < 0.0001). Regarding seriousness, duration, and outcome of ADRs, the relative frequencies did not show any significant differences between the two groups (Online Resource 5, see ESM).
3.5.3 Frequencies of ADRs, Relative to Prescription
Frequency of adverse drug reactions in relation to prescriptions in the three largest diagnostic subgroups in adults and children, respectively
Adults (≥ 18 years)
Children (0–17 years)
Respiratory (ICD-10 J00–J99)a
Respiratory (ICD-10 J00–J99)a
Symptoms, signs, findings (R00–R99)a
Patients of prescriber physicians
ADRs, Grades I–IV
Patients with ADR
Patients with prescriptions
Patients with ADR/with prescriptions (%)
Patients of all physicians
ADRs, Grades III–IV only
Patients with ADR
Patients with prescriptions
Patients with ADR/with prescriptions (%)
Among all patients, the corresponding frequency of ADRs of Grades III–IV was 0.002% (n = 7 ADRs/311,731 prescriptions) and 0.020% (82 ADRs/405,814 prescriptions) for AMPs and non-AMPs, respectively (OR 0.11, 95% CI 0.05–0.24, p < 0.0001) (cf. also Table 6 for diagnostic subgroups). The frequency of SADRs was 0.0003% (1 SARD/311,731 prescriptions) for AMPs and 0.0071% (29 SADRs/405,814 prescriptions) for non-AMPs (OR 0.045, 95% CI 0.001–0.271, p < 0.0001).
4.1 Main Findings
This is the largest analysis of AMP safety in a prospective patient cohort so far: 44,662 patients from the EvaMed pharmacovigilance network with a total of 311,731 AMP prescriptions were followed up for a mean of 27 months. ADRs to AMPs were rare (0.071% of AMP prescriptions), high intensity ADRs and SADRs were very rare (0.002% and 0.0003%, respectively). Among 11 analyzed subgroups, the highest ADR frequency was still in the ‘uncommon’ range (0.290% of prescriptions for one specific AMP). Compared with ADRs from non-AMPs, ADRs to AMPs were less frequent (OR for ADRs to AMPs and non-AMPs, respectively, ranging from 0.045 for SADRs to 0.17 for any ADR).
4.2 Strength and Limitations
An important strength of the EvaMed study and this analysis lies in the automatic data extraction from the electronic medical records of the participating physicians. This procedure enabled the inclusion of all MP prescriptions to all patients without any selection, and helped to achieve a very large sample of children and adults with MP prescriptions, increasing the power to detect rare ADRs as well as ADRs in subgroups of interest. Further strengths include the representation of outpatient care in 12 of 16 German Federal States; a long documentation period; the coverage of three-quarters of all AMPs marketed in Germany in the study period; and a comprehensive and stringent assessment of ADRs to all MPs.
Notably, the study was not designed to test MP effectiveness, hence no clinical outcomes were documented. The safety documentation was restricted to suspected ADRs to any MP; an ongoing documentation and assessment of all adverse events following drug prescription in this very large patient sample was not feasible. Therefore, unusual or unexpected ADRs to AMPs or other MPs, not recognized as such, cannot be excluded (this question has been assessed in other studies of AMP therapy, e.g. [22, 41]). Also, the study documentation did not include any scheduled follow-up visits; accordingly, detection of ADRs was dependent on patients returning for subsequent consultations or otherwise notifying the physicians. Hence, underreporting of suspected ADRs by the patients to their physicians cannot be excluded in this study, similar to real-world medical practice.
Considering the long documentation period of 27 months on average per physician and the inclusion of all patients seen by the physicians in the study project, ADR underreporting by the physicians is also a potential limitation. Precautions against physician underreporting were taken at the levels of study organization as well as analysis: Each physician was individually trained in ADR reporting . Furthermore, the obligation of physicians to report ADRs was graded: All physicians reported the kind of ADRs that are easily noticed and where awareness may be highest, that is, serious ADRs and ADRs of high intensity. The additional documentation of nonserious ADRs of mild to moderate intensity was limited to a smaller group of ‘prescriber’ physicians who had explicitly agreed to do so. In order to have an unbiased estimate of the frequency of these ADRs in relation to prescription rates, the sample for that analysis was restricted to patients of the prescriber physicians.
For causality assessment of adverse events and suspected ADRs, different criteria and instruments are available: in a systematic review from 2008 a total of 34 different methods were identified. Out of 13 possible criteria for assigning causality (12 listed + ‘other’), most methods used five or six. The authors concluded that so far, no method had shown consistent and reproducible measurement of causality; therefore, no single method was universally accepted . In EvaMed, the causality assessment system of the Uppsala Monitoring Centre of the World Health Organization (WHO-UMC) was used . For each of the categories in the WHO-UMC system, criteria are listed (Online Resource 1, see ESM) but, unlike for many other instruments , not formalized into a fixed algorithm or probability score. The WHO-UMC method has been criticized for being subjective and imprecise , since it is to some extent based on expert opinion. However, other presumed ‘objective’ instruments may require subjective judgment (as noted for the question on “previous conclusive reports on this reaction?”  in the Naranjo score ). Algorithms lack flexibility  and may lead to loss of relevant information , while probability scores presuppose that the certainty of causality assessment can be reliably modeled in a mathematical formula, which is difficult to prove. Comparisons of different instruments are complicated by their performance varying in different situations (e.g. design , setting , types of ADR , professional background of the raters ). In EvaMed, causality assessment was performed independently by two different research scientists, with disagreements solved by discussion in an interdisciplinary expert team, which would expect to reduce any bias from subjectivity. An advantage of the WHO-UMC method categories is their widespread use amongst others in the WHO Drug Monitoring Programme, with currently 127 participating countries worldwide .
The EvaMed physicians were paid 15 Euro per filed ADR report as compensation for the time spent investigating the suspected ADR, interviewing the patient, and filing the report . These physicians worked within the German Statutory Health System, where they are paid for each patient-related service provided, while the physicians pay all expenses for office, equipment, salaries to assistants, etc. The sum of 15 Euro was calculated to be high enough for EvaMed physicians not to suffer a financial loss from ADR reporting, but low enough for this activity not to be financially attractive. Thus, neither over-reporting bias due to economic gain nor under-reporting bias due to avoidance of economic strain would be expected from this provision. Compared with total study costs, the costs for reimbursing physicians for ADR were negligible; such a fee could be considered also for other pharmacovigilance projects.
Some items of interest for safety assessment were not documented, such as the number of days or applications for each prescribed MP, and patient or caregiver compliance with prescriptions. Furthermore, the documentation of demographics and baseline morbidity was limited to age, gender, and diagnoses, offering limited possibilities to analyze factors associated with the occurrence of ADRs, apart from MP-related variables.
4.3 Comparison with Other Studies, Interpretation
The International Integrative Primary Care Outcomes Study (IIPCOS) concerned patients with acute respiratory and ear infections; therefore, our comparison was restricted to EvaMed patients with the same diagnoses. ADRs to AMPs were similarly frequent in IIPCOS and EvaMed (0.28 vs 0.34% of AMP users). Notably, the absolute number of ADRs to AMPs in IIPCOS was low (n = 2); thus, the EvaMed analysis confirms the results from IIPCOS for this patient group (Online Resource 6).
The Anthroposophic Medicine Outcomes Study (AMOS) had a much older patient sample than EvaMed and 80% of all confirmed ADRs to AMPs in AMOS occurred in adults; therefore our comparison was restricted to adult patients. In AMOS, patients and physicians repeatedly filled in follow-up questionnaires (EvaMed had no scheduled follow-up documentation) and suspected ADRs were documented independently by patients as well as physicians (EvaMed: physicians only), with one-third of the confirmed ADRs to AMPs in the AMOS analysis being documented by patients only . ADRs to AMPs were approximately 4.5 times more frequent in AMOS than in EvaMed (3.08% vs 0.69% of AMP users); this difference could be explained by the above-mentioned differences in study documentation (Online Resource 7).
SADRs to AMPs were very rare in EvaMed (0.0022% of 44,662 users) and did not occur in the two other studies.
Also, the frequency of ADRs to non-AMPs in this study can be compared with other studies. Focusing on children aged 0–17 years, who made up almost two-thirds (63.6%) of patients in this analysis, and matching for study design (observational studies), setting (outpatients), and denominator (number of patients exposed to a drug), we identified four studies from a recent, very comprehensive systematic review of ADRs in children . The frequency of ADRs to non-AMPs among drug-exposed pediatric outpatients was 6.22% in EvaMed (Table 4) and 5.92% in the four studies [17, 28, 35, 36] (weighted mean: n = 404/6824 patients, cf. Fig. 3 in ). Withstanding the limitations of such comparisons from confounding by other variables (e.g., indications and type of MPs), the incidence of ADRs to non-AMPs in EvaMed seems to be in the same order of magnitude as in other observational studies.
The primary focus of this analysis was safety of AMPs, which is from a regulatory perspective a distinct topic. However, with AM being an integrative system of medicine, AM physicians prescribe not only AMPs in their patient care but also other MPs. Accordingly, a broader, public health focus on the safety of all MPs prescribed is also relevant. In this analysis, ADRs to any MP were uncommon (0.28% of prescriptions, n = 679/243,147). Notably, within this broad focus, ADRs to AMPs were less frequent than ADRs to non-AMPs, with OR for ADRs to AMPs and non-AMPs, respectively, of 0.17. This difference was even larger for ADRs of high intensity (OR 0.11) and SADRs (OR 0.045). Notably, the two types of MPs had relatively similar indications. Therefore, in routine outpatient care, risks of AMP use seem to be low, not only in absolute terms but also in comparison to risks from other types of MPs used by the patients.
V60 is used as the ATC code in Germany for homeopathic and anthroposophic MPs, but in this subgroup analysis of non-AMPs only the homeopathic MPs were included.
The EvaMed study was funded by the Software AG-Stiftung. This analysis and publication was commissioned by the European Scientific Cooperative on Anthroposophic Medicinal Products (http://www.escamp.org) with financial support from foundations (Christophorus-Stiftung, Damus-Donata-Stiftung, Ekhagastiftelsen, Mahle-Stiftung, Software AG-Stiftung) and manufacturers (Wala, Weleda). The sponsors had no influence on the planning or conduct of the EvaMed study; the collection, preparation, analysis or interpretation of data for this paper; nor on the preparation, review or approval of the manuscript.
HJH, AG and KH wrote the analysis plan. AG and KH prepared data. AG analyzed data. HM designed and was director of the EvaMed study. HJH was principal author, had access to all data, and is guarantor. All authors approved the final manuscript.
Compliance with Ethical Standards
Quality assurance, adherence to guidelines, approval
The EvaMed study was based on secondary data provided by physicians and adhered to the recommendations for good practice in secondary data analysis from the German Working Group on the Collection and Use of Secondary Data . The study was approved by the responsible data security official. As all data were collected anonymously without any documentation on the part of the patients, and since the study did not entail any intervention or experimental research, no ethical approval was required . This report followed the STROBE guidelines for reporting of observational studies .
In the past 3 years, HJH has received research grants from manufacturers of AMPs (Wala, Weleda). Since 2012 (after the EvaMed study was completed), HM has been member of the Board of Directors of Weleda, manufacturer of AMPs. AG and KH declare that they have no competing interests.
- 2.Anonymous (2014) Arzneimittelgesetz in der Fassung der Bekanntmachung vom 12. Dezember 2005 (BGBl. I S. 3394), das zuletzt durch Artikel 2a des Gesetzes vom 27. März 2014 (BGBl. I S. 261) geändert worden ist. Bundesministerium der Justiz und für Verbraucherschutz, Bonn.Google Scholar
- 3.Anonymous (2016) EMA pharmacovigilance system manual. Version 1.2. In: European Medicines Agency, p. 26.Google Scholar
- 4.Anonymous. Facts and figures on Anthroposophic Medicine worldwide. Brussels: IVAA-International Federation of Anthroposophic Medical Associations; 2012.Google Scholar
- 5.Anonymous (2007) Frequency of adverse drug reactions (CIOMS). In: Definitions. World Health Organization, Geneva, p. 10.Google Scholar
- 6.Anonymous (1997) ICH harmonised tripartite guideline. Guideline for good clinical practice. In: International conference on harmonisation of technical requirements for registration of pharmaceuticals for human use (ICH), Geneve.Google Scholar
- 7.Anonymous. Safety issues in the preparation of homeopathic medicines. Geneva: World Health Organization; 2009.Google Scholar
- 8.Anonymous. Safety monitoring of medicinal products: guidelines for setting up and running a pharmacovigilance centre. Uppsala: Uppsala Monitoring Centre; 2000.Google Scholar
- 9.Anonymous. The safety of medicines in public health programmes: pharmacovigilance an essential tool. Geneva: World Health Organization; 2006.Google Scholar
- 10.Anonymous. The story of UMC and the WHO Programme. Uppsala: Uppsala Monitoring Centre-WHO Collaborating Centre for International Drug Monitoring; 2017.Google Scholar
- 11.Anonymous. WHO guidelines on safety monitoring of herbal medicines in pharmacovigilance systems. Geneva: World Health Organization; 2004.Google Scholar
- 26.Jones G, Maier J, Pedersen P, et al., editors. Anthroposophic Pharmaceutical Codex APC. 4th ed. Dornach: International Association of Anthroposophic Pharmacists; 2017.Google Scholar
- 28.Kaushal R, Goldmann DA, Keohane CA, et al. Adverse drug events in pediatric outpatients. Ambul Pediatr Off J Ambul Pediatr Assoc. 2007;7:383–9.Google Scholar
- 30.Kienle GS, Glockmann A, Grugel R, et al. Klinische Forschung zur Anthroposophischen Medizin—Update eines Health Technology Assessment-Berichts und Status Quo [Clinical research on anthroposophic medicine: update of a health technology assessment report and status quo]. Forsch Komplementmed. 2011;18:269–82.PubMedGoogle Scholar
- 32.Mol C, Roemer F, Scheer R, et al. Arzneimittel [überarbeitete Monographien der Kommission C]. In: Glöckler M, editor. Anthroposophische Arzneitherapie für Ärzte und Apotheker. Stuttgart: Wissenschaftliche Verlagsgesellschaft; 2008.Google Scholar
- 33.Mouton JP, Mehta U, Rossiter DP, et al. Interrater agreement of two adverse drug reaction causality assessment methods: a randomised comparison of the Liverpool Adverse Drug Reaction Causality Assessment Tool and the World Health Organization-Uppsala Monitoring Centre system. PLoS One. 2017;12:e0172830.CrossRefPubMedPubMedCentralGoogle Scholar
- 35.Rebelo Gomes E, Fonseca J, Araujo L, et al. Drug allergy claims in children: from self-reporting to confirmed diagnosis. Clin Exp Allergy J Br Soc Allergy Clin Immunol. 2008;38:191–8.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.