Introduction

To date, most epidemiological studies have focused on the effects of single environmental exposures, without considering the role of real-life multiple exposures. A major innovation has been provided by the introduction of the concept of the exposome, defined as the totality of exposures of an individual in a lifetime, and its health effects1. It consists of three domains: the external specific exposome, the external non-specific exposome and the internal exposome1,2. The external specific exposome consists of the exposures to a set of external environmental factors (e.g. air, soil and water chemicals, biological and physical pollutants, radiations, noise, etc.) to which individuals are personally exposed over lifetime2. The external non-specific exposome consists of generic exposures (e.g. climate, greenness, socioeconomic factors, etc.) usually assessed at the community level (neighbourhood, city, social group, etc.)2. The internal exposome consists of the internal body chemical environment, namely of signals resulting from internal processes (metabolic, inflammatory, omics, etc.) as a consequence of external environmental aggressions2.

Since Wild introduced the exposome concept, an increasing number of studies have applied it in the case of various diseases1,2,3,4,5,6. They have clearly indicated that as lifetime environmental exposure assessments are difficult to achieve in practice, the external exposome has to be estimated at different life stages1,5,7. In particular, the first years from conception onwards (1000 days of life) are considered a critical window, due to the physiological changes and metabolic development in fetal and postnatal life, often resulting in impaired health conditions like allergic/asthma and neurodevelopmental outcomes throughout the life span4,5,7.

As a whole, exposome studies have shown that environmental data may often be incomplete and are rarely standardized, despite the fact that the European Union has been moving toward the digitalization and the implementation of thematic harmonized datasets8,9. Furthermore, studies have also been affected by the lack of routine and Open Access FAIR data i.e. providing freely available, usable and sharable data on the considered risk factors10.

The HEALS (Health and Environment—wide Associations based on Large population Surveys, https://cordis.europa.eu/project/id/603946) and EarlyFOOD (Long-term impact of gestational and early-life dietary habits on infant gut immunity and disease risk, https://immulab.fr/cms/index.php/projects/earlyfood) were EU funded projects, which aim was to build and estimate the children’s internal and external exposomes to gain mechanistic insights into the development of major childhood diseases such as asthma, allergies, obesity, diabetes and neurodevelopmental troubles. In both projects, birth cohort data were used to implement exposome-health association studies (Environment Wide Association Studies—EnvWAS) and suggest prevention measures.

Within these two projects, an Environmental Data Management System (EDMS) was implemented, containing as much exposome data as possible freely available on the web. The EDMS is intended as a useful tool to support the scientific community and stakeholders dealing with environmental issues, as well as environmental policies.

The aim of this paper is to describe the methodology, usefulness and feasibility of EDMS implementation with Open-Access data on exposure to external non-specific and specific exposome from 10 European countries, in the context of the European HEALS/EarlyFOOD projects.

Materials and methods

Study design

An observational cross-sectional study has been undertaken to further implement EDMS with data on exposure to external non-specific and specific exposome.

Study cities/countries

The HEALS project (2013–2019) integrated a comprehensive array of novel technologies, data analysis and modelling tools for supporting exposome-health association studies in a European birth cohort from ten European cities initiated in the frame of the Exposure and Health Examination Survey (EXHES): Celje (Slovenia), Łódź (Poland), Manchester (United Kingdom, UK), Palermo (Italy), Paris (France), Porto (Portugal), Regensburg (Germany), Reus (Spain), Rijeka (Croatia), Thessaloniki (Greece). These cities were chosen to represent different European meteo-climatic, geographical, geological, cultural, and behavioural characteristics (Fig. 1).

Figure 1
figure 1

Map of the cities participating in the HEALS and EarlyFOOD projects (modified by MapChart, https://mapchart.net).

The EarlyFOOD project (2018–2022) investigated the external and internal exposome (metabolomics and microbiomics) for better understanding the development of asthma and allergies, obesity, diabetes and neurodevelopmental disorders. It involved birth cohorts from Palermo, Paris, and Reus, also participating in the HEALS study (Fig. 1).

Within the HEALS project, an EDMS, gathering information on environmental stressors (e.g. environmental pollution, food, bio-contaminants) in the ten cities participating in EXHES, was developed. Furthermore, within EarlyFOOD project, the EDMS was enriched with lifestyle, additional environmental stressors, and socio-economic characteristics for 2017–2020.

Due to the SARS-CoV-2 pandemic restrictions, the year 2020 is an outlier in relation to the considered stressors, thus it was decided to analyse it separately, wherever possible.

External exposome factors and data collection

External non-specific exposome factors included socio-demographic characteristics (population, mortality rate, life expectancy at birth, population at risk of poverty), lifestyle factors (Body Mass Index (BMI), smoking habits, alcohol and drug consumption), climatic parameters (temperature and humidity), land use/land cover (LU/LC). External specific exposome included air pollution (PM10: Particulate Matter (PM) with an aerodynamic diameter smaller than 10 µm; PM2.5: PM with an aerodynamic diameter smaller than 2.5 µm; NO2: Nitrogen Dioxide; O3: Ozone), pollen/spores, noise (from traffic and railway), drinkable water, and pesticide residues in food. In order to implement the EDMS, we conducted thorough research on supplier websites at European, national, and local levels to collect relevant data concerning the factors mentioned above. In cases where Open Data were unavailable, data were requested from the respective institutions to complete the EDMS. The current paper deals with socio-economic, lifestyle and environmental data at city or country level in accordance to the relative copyright policy (Table S4). For each factor, all the details on considered period and metrics are reported in the Supplementary Information (Tables S1S4: data source, considered variables, temporal resolution, provided statistics, data availability and inter-cities comparability). Briefly, whenever feasible, we downloaded data at the city level from single European databases, ensuring consistency in temporal resolution to facilitate uniformity of information. Basic statistical analyses were performed to allow data comparability, such as means over the specified periods, particularly in cases where data did not entirely cover the 2017–2020 period.

No experiments on humans and/or use of human tissue samples occurred in this study as well as no use of personal data (e.g. health, location, lifestyle, job).

Results

External non-specific exposome

Socio-demographic characteristics

Table 1 shows socio-demographic characteristics stratified by city. Paris was the city with the highest total mean population (n = 2,168,614), and overall females overcome males (average across cities: 52% vs. 48%). In 2017–2019, the highest mortality rate was registered in Łódź, Porto and Rijeka (14%) and the highest mean percentage of the population at risk of poverty was in Spain (21%). In 2020, a general increase in the mortality rate was shown while risk of poverty slightly increased only in France (+ 1%).

Table 1 External non-specific exposome factors between 2017 and 2020 in the ten cities contributing to HEALS and EarlyFOOD projects.

Life expectancy at birth was generally higher in Reus females (86 years) and Palermo males (81 years).

Lifestyle risk factors

Country lifestyle risk factors are reported in Table 1. In 2017 smoking prevalence was highest in Greece (37%), and a general decrease was shown in 2020, except for Greece and Croatia (+ 5% and + 1%, respectively).

In 2019, the highest percentage of pre-obese (25 ≤ BMI < 30 kg/m2) and obese (BMI ≥ 30 kg/m2) population resulted in Croatia (41%) and the UK (28%). As regards alcohol consumption, the highest “every day” and “every week” consumption was in Portugal (21%) and France (34%), respectively. In contrast, the highest percentage of “never” alcohol consumption was in Croatia (38%). On the other hand, drug consumption was highest in Croatia (62% prescribed) and Poland (45% not prescribed).

Climatic parameters

As expected according to geographical location, the hottest and wettest cities were Palermo (19 °C) and Manchester (82%), respectively (Table 1).

Land use/land cover

The highest percentage of the agricultural, blue, green, grey areas was in Reus (69%), Manchester (5%), Rijeka (84%), and Thessaloniki (98%), respectively (Table 1).

External specific exposome

Air quality

In 2017–2019, the cities with the highest value of air pollutants (µg/m3) were: Thessaloniki for PM10 (38), Łódź for PM2.5 (25), Porto for NO2 (62), and Rijeka for summer O3 (92). In 2020 a general decrease in PM and NO2 values was shown in all cities with available data, while there was not a specific uniform trend for O3 (Table 2).

Table 2 External specific exposome factors between 2017 and 2020 in the ten cities contributing to HEALS and EarlyFOOD projects.

Pollen and spores

Regarding pollen presence, Celje had the highest values (p/m3) of Ambrosia (649), Corylus (1714), Fagus (2269), Urticaceae (12,978); Łódź of Artemisia (847), Alnus (6612), Betula (14,918); Manchester of Poaceae (4829); Reus of Quercus (4469), Oleaceae (5110); Rijeka of Carpinus (1698), Cupressaceae (15,052). Overall, the total pollen load (cAPI) for the period 2017–2020, had the highest value in Łódź (39,041 p/m3). Instead, the highest value of spores’ concentration (s/m3) was in Łódź (Alternaria: 25,499; Cladosporium: 1,110,795) (Table 2).

Noise

Reus reported the highest percentage of high traffic noise levels exposure during the day and the night (Lden ≥ 55 dB: 81%; Lnight ≥ 50 dB: 59%), and Regensburg the highest percentage of high railway noise levels exposure (Lden ≥ 55 dB: 21%; Lnight ≥ 50 dB: 16%) (Table 2).

Drinking water

In 2017–2019, the cities with the highest mean values of one or more contaminants were: Thessaloniki (arsenic: 6.4 µg/L; boron: 0.21 mg/L; bromate: 6.9 µg/L), Celje (1,2-dichloroethane: 0.2 µg/L; antimony: 0.7 µg/L; benzene: 0.3 µg/L; benzo(a)pyrene: 0.003 µg/L; cadmium: 0.05 µg/L; chromium: 1.3 µg/L; mercury: 0.1 µg/L; polycyclic aromatic hydrocarbons (PAHs): 0.01 µg/L; selenium: 0.5 µg/L; tetrachloroethene and trichloroethene: 1.3 µg/L), Manchester (copper: 0.06 mg/L; fluoride: 0.7 mg/L), Paris (lead: 46.7 µg/L; nitrate: 45.8 mg/L; total pesticides: 0.09 µg/L), Porto (nitrite: 0.02 mg/L; Total Trihalomethanes (TTHMs): 42.2 µg/L; Enterococci: 1 CFU/100 mL; Escherichia coli: 6 CFU/100 mL). The values of nickel were highest in Celje and Thessaloniki (5 µg/L).

In 2020, the cities with the highest values were: Paris (bromate: 7.4 µg/L; nitrate: 47 mg/L; TTHMs: 56.3 µg/L; total pesticides: 0.09 µg/L), Celje (1,2-dichloroethane: 0.2 µg/L; antimony: 0.5 µg/L; benzene: 0.3 µg/L; benzo(a)pyrene: 0.002 µg/L; boron: 24 mg/L; cadmium: 0.05 µg/L; chromium: 1 µg/L; copper: 2.1 mg/L; cyanide: 5 µg/L; nickel: 2 µg/L; PAHs: 0.005 µg/L; tetrachloroethene and trichloroethene: 0.9 µg/L), Manchester (fluoride: 0.7 mg/L; lead: 1 µg/L), Porto (arsenic: 3.3 µg/L; mercury: 0.14 µg/L; nitrite: 0.11 mg/L; selenium: 8 µg/L; Enterococci: 32 CFU/100 mL; Escherichia coli: 2 CFU/100 mL) (Table 2).

Food pesticides

In 2017–2019, the highest percentage of food pesticide residues (Maximum Residue Levels (MRLs)) was observed in Portugal (7%), and a general increasing in the values was shown in 2020 (Table 2).

Discussion

The exposome refers to the sum of all environmental exposures that an individual experiences throughout the lifetime. Building the exposome requires collecting and integrating data from a variety of sources, including personal monitoring devices, biomonitoring, and environmental monitoring.

This paper reports the accessibility and usability of Open Data on exposure to the external non-specific (socio-demographic and lifestyle risk factors, climatic parameters, LU/LC) and specific (air, water and noise pollution, pollen/spores, food pesticides) exposome in 10 European cities/countries participating in the HEALS and EarlyFOOD projects.

Through the EDMS, inter-cities differences in demographic and environmental aspects have emerged which could affect children’s development.

Socio-economic status and lifestyle are significant determinants of children’s health and wellbeing11. The results showed that Poland, UK and Croatia had lower life expectancies and higher rates of pre-obesity/obesity. It is well-known that a higher BMI in childhood and adolescence is associated with adverse health consequences throughout the lifespan12. Also, in Croatia and Poland, the higher consumption of drugs represented a potential higher morbidity risk for new generations. Children living in western-central Europe might be at higher poverty risk and be more susceptible to indirect consequences of high parental alcohol consumption, such as abandonment and violence11. Given the high smoking prevalence rates, France, Greece, and Croatia displayed a likely higher prevalence of Environmental Tobacco Smoking. Fetal exposure to tobacco and alcohol increases the risk of adverse pregnancy outcomes and other health problems in the life course, such as obesity11.

As regards environmental risk factors, the high pollen load in Łódź and PM concentration in Łódź and Thessaloniki could increase the risk for children respiratory health and allergic sensitization13. Porto and Rijeka could expose children to a higher level of gaseous pollutants. Regarding urbanization, children of Thessaloniki lived in a highly urbanized city. Pollen is often at the origin of rhinitis and asthma2,13,14. Air pollution is a significant public health problem and urbanization is firmly correlated to air pollution15,16. Recent Italian studies showed that grey spaces are related to adverse effects on children and adults allergic status and PAHs exposure in adulthood17,18. The beneficial influence of green space was shown for wellbeing, diastolic blood pressure, salivary cortisol, heart rate, incidence of diabetes, and total and cardiovascular mortality19,20. Meanwhile, a recent meta-analysis found that green space exposure is a risk factor for childhood wheeze, asthma, and allergic rhinitis21. Other studies have highlighted how urban green areas enhance human wellbeing by providing ecosystem services absent in grey areas. These include improving air quality, offering recreational opportunities, and enriching cultural experiences22,23,24. Additionally, green areas contribute to regulating local temperatures, reducing the urban heat island effect, and moderating adverse natural events like floods22,25. The latter are particularly important in light of potential changes in meteorological patterns, such as heat waves and extreme events, resulting from climate change26. European children and especially in the “climatic hot spot” of the Mediterranean area, will live in increasingly hot climates, with persistent heat waves. Substantial and abrupt variations in temperatures are related to detrimental health effects, such as increasing total mortality and allergic morbidity2,25,26. Furthermore, the rising temperature and carbon dioxide concentrations may impact on pollen allergenicity, seasonality, distribution, and load, thus increasing pollen exposure13,14,26. It is also supposed that higher temperatures are associated with faster rates of fungal growth (but with fewer spores) and that extreme events could increase indoor dampness and consequent fungal proliferation2. These changes, ultimately, might affect children’s allergic and respiratory health2.

Less known factors were also considered. Although likely underestimated, chronic exposure to noise pollution negatively impacts physical and mental health and wellbeing. It has been estimated that 113 million people are exposed to Lden of at least 55 dB from traffic and 22 million from railways in Europe27. Safe drinking water is a fundamental human right and it is essential to health, especially for the most vulnerable categories, such as infants, children, elderly and frail people28. The increasing frequency and severity of droughts may affect both the quality and quantity of water28. Lead exposure was associated with neurodevelopmental disorders, impaired renal function, hypertension, impaired fertility, adverse pregnancy outcomes, and mortality28. Animal studies showed a likely toxic effect on male reproductive tract of boron compounds exposure28. Even bacterial contamination may be dangerous for human health. Short-term peaks in pathogen concentration may considerably increase disease risks and expose the community to the risk of intestinal and other infectious disease outbreaks28.

Using plant protection products on crops or food products may result in pesticides residues in food, potentially posing a risk to public health29. To ensure a high level of consumer protection, Regulation (EC) No 396/20,059 established a legal limit of 0.01 mg/kg (MRLs)29. The higher percentage of MRLs exceeding in Portugal and Greece than in other countries generates concerns, which need to be deepened29.

Comparison with EU and international limits

Currently, EU limits and WHO guidelines exist only for air pollution, noise and water parameters (Table 3). The differences between the EU air quality standards and the WHO recommended values have increased after the publication of the new WHO air quality guidelines (AQG) on September 22, 2021, indicating significantly lower levels than previous ones30,31. In the study period 2017–2019, the cities generally exceeded the PM and NO2 annual reference values of WHO air quality guidelines, but not of EU limits (Fig. 2, Table 3).

Table 3 References value from European Union Directive (EU) and international guidelines (WHO).
Figure 2
figure 2

Mean values of air pollutants (PM2.5, PM10, NO2, summer O3) over the period 2017–2019 (orange) and 2020 (blue) for the ten cities, and the EU limits (red line) and WHO guidelines values (yellow line). PM2.5: particulate matter with an aerodynamic diameter smaller than 2.5 µm; PM10: particulate matter with an aerodynamic diameter smaller than 10 µm; NO2: Nitrogen Dioxide; summer O3: April-September Ozone; µg/m3: microgram/cubic meter; SI: Slovenia; PL: Poland; UK: United Kingdom; IT: Italy; FR: France; PT: Portugal; DE: Germany; ES: Spain; HR: Croatia; GR: Greece.

Only Paris and Porto violate the NO2 EU limits. Despite lacking a summer limit for O3, the WHO air quality guidelines provided a seasonal recommended value, consisting of the average daily maximum 8-h mean O3 concentration in the six consecutive months with the highest average O3 concentration (60 µg/m3). Apart from Manchester, Paris and Porto, all the cities exceeded the O3 recommended value, in 2017–2019.

The EU Directive 2002/49/EC established general high noise thresholds: 55 dB for Lden and 50 dB for Lnight. The WHO European office defined lower guidelines values, based on the evidence on health effects (Table 3)32,33. Since exposed population percentage was provided according to EU limits, it is only possible to evidence their exceeding in almost every city.

Regarding drinking water, the substances above the EU limit and/or the WHO guidelines were: the 2017–2019 lead in Paris and Thessaloniki; the 2020 boron, and chlorates and chlorites in Celje and Łódź, respectively28,34. The presence of lead may be due to corrosive water effects on household plumbing systems containing lead in pipes or from the connections service to homes28. The amount of lead dissolved in the plumbing system depends on several physicochemical water properties28. Boron presence depends on the surrounding geology and wastewater discharges28. Values exceeding WHO guidelines and EU limits value of Enterococci and Escherichia coli were found in Porto drinking water. Celje drinking water had exceeded in Escherichia coli limits value, as well. The presence of bacteria in water is likely due to faecal contamination28.

SARS-CoV-2 pandemic effects

As expected, the average mortality in 2020 among the ten cities increased, probably due to SARS-CoV-2 pandemic. Overall, the Covid-19 pandemic affected urban air pollution, especially PM and NO2 (Table 2). In Europe during Covid-19 lockdowns, road transport reductions caused a temporary drop in NO2 annual mean concentrations, up to 25% in France, Italy and Spain35. During the first lockdown in April 2020, the traffic monitoring stations registered a NO2 concentrations drop up to 70%35. However, the Covid-19 lockdown measures’ impact on the annual mean level of PM10 and PM2.5 was limited, and no greater than a median reduction of 4% and 5% across all stations, respectively35. This may be because increased emissions from residential heating could have counterbalanced reductions in other sectors35.

Usefulness and feasibility

The EDMS has been implemented to provide an informative, interactive, user-friendly tool to support the scientific community and stakeholders in environmental-health studies and environmental policies. Recently, a growing interest in the role of the external exposome in disease development has led to an increasing number of studies on this topic4,5. The challenge remains finding and processing of these different data. In this paper, all the possible data that could be recovered, regarding external non-specific and specific exposome factors, has been archived. The added value of this work is that the EDMS can be further expanded and constantly updated to be used in other European settings dealing with lifetime external exposome.

However, its effectiveness is hampered by several criticisms. The difficulties in building and implementing the EDMS ranged from finding the data source to heterogeneity in the used language. In the Supplementary Information (Tables S2-S4) data search issues and source links are detailed.

One of the major concerns experienced for retrieving most environmental and lifestyle factors is the lack of a centralized database at the European level. This leads to time consuming data searches in separate national or even local sites or reports. Language barriers further exacerbate this issue, as many databases and reports are only accessible in the native language, thus requiring extensive translation efforts. Additionally, on web platforms exclusively in the native language, direct translation features within browsers were not always accessible, further complicating the translation process. Consequently, this linguistic harmonization has significantly consumed both time and staff resources, decreasing operational efficiency and productivity. Furthermore, the availability of reports rather than databases has necessitated manual extraction of data, so increasing the risk of human error and further prolonging the data retrieval process.

Another critical issue was the lack of standardization in reporting the data (e.g. different temporal resolution, parameters, unit of measures, etc.), which did not allow appropriate comparability of the various exposome factors among countries/cities.

Notwithstanding the existence of the climatic, air and noise pollution, and LU/LC European centralized databases (ECA&D, EEA and Copernicus databases), data for each of the ten studied cities were not always available. In many cases, data had insufficient coverage and therefore were unusable. In particular, this problem arose in air pollution data. The failure of European oversight of data supply, availability and quality or non-compliance of countries with European indications/regulations, results in a severe gap for multi-centre studies in this sector.

In addition, even if available, data did not completely cover the study period, weighting on the estimates of the various indicators. This aspect highlights Europe’s digital gap, which is expected to increase without faster and more comprehensive engagement in Artificial Intelligence (AI) technologies, especially in European countries with relatively low AI facilities.

Although the EU encourages the implementation of Open Data, and the adoption of the FAIR data principle, according to which data have to meet principles of findability, accessibility, interoperability, and reusability, public data availability remains limited10. Indeed, we have found the lower availability for open pollen data in most of the European countries in this paper. A public consultation of pollen data would be necessary in a world of increasing allergic population13.

Finally, regarding drinking water, not all parameters regulated by the European Union are provided in the local reports/databases, whereas it is the citizen's right to exhaustively know the most important characteristics of the supplied water.

Overall, Open Access datasets provide ecological data. In epidemiology, ecological studies are used to understand the relationship between health outcome and exposure at a population level, where the population represents a group of individuals with shared characteristics such as geography, ethnicity and socio-economic status. Ecologic studies are more often subject to confounding bias than analytic risk studies due to the lack of information on potential confounders. However, mixing specific and generic exposome can reduce such confounding bias.

In summary, the limitations in data accessibility, standardization, and spatial and temporal coverage pose significant challenges. Addressing these issues is crucial for enhancing the effectiveness of EDMS, enabling the integration of additional data on factors like pollution and socio-economic status. Our study was limited to a descriptive analysis, lacking statistical inferences and validation of the tool's effectiveness. Consequently, further research on the application of EDMS in environmental epidemiology is warranted. Nevertheless, we believe EDMS can be an excellent tool to support ecological studies by centralizing all exposures within a unified database. This not only streamlines researchers' efforts but also enhances the precision of exposome estimations. By overcoming data heterogeneity and enhancing spatial resolution of data, EDMS would facilitate comparisons of exposomes across various European populations, thereby aiding in the identification of population-specific risk factors. Consequently, policymakers are empowered to make prompt evidence-based decisions to mitigate the public health burden.

Conclusions

We assessed external exposome by implementing an Environmental Data Management System, the EDMS, in ten European cities for the first time, ensuring the accessibility and the usability of data. As expected, a wide heterogeneity of risk factors exposures was shown across cities, highlighting some critical public health issues. From our study emerged the potential of standardized data management systems to facilitate interdisciplinary collaboration and data-driven decision-making in public health. Leveraging Open-Access databases for health research can yield numerous public health benefits including enhanced accessibility, accelerated discovery, transparency and reproducibility, cost-effectiveness, long-term sustainability, and finally ethical considerations (data sharing practices, research benefits equitably distributed across diverse populations). In conclusion, by harnessing these data management systems, we can address critical public health issues and pave the way for tailored preventive measures at the population level, thus ensuring a healthier future for generations to come.