Abstract
Standard and easily accessible cross-thematic spatial databases are key resources in ecological research. In Switzerland, as in many other countries, available data are scattered across computer servers of research institutions and are rarely provided in standard formats (e.g., different extents or projections systems, inconsistent naming conventions). Consequently, their joint use can require heavy data management and geomatic operations. Here, we introduce SWECO25, a Swiss-wide raster database at 25-meter resolution gathering 5,265 layers. The 10 environmental categories included in SWECO25 are: geologic, topographic, bioclimatic, hydrologic, edaphic, land use and cover, population, transportation, vegetation, and remote sensing. SWECO25 layers were standardized to a common grid sharing the same resolution, extent, and geographic coordinate system. SWECO25 includes the standardized source data and newly calculated layers, such as those obtained by computing focal or distance statistics. SWECO25 layers were validated by a data integrity check, and we verified that the standardization procedure had a negligible effect on the output values. SWECO25 is available on Zenodo and is intended to be updated and extended regularly.
Similar content being viewed by others
Background & Summary
Spatial ecology has benefited from growth in data availability, geostatistical methods, and computing facilities, and is now central for a wide range of applications including public health1,2,3, agriculture4,5,6, and biological conservation7,8,9. Because the targets of such applications (e.g., water quality, soil nutrients, species’ environmental suitability) are controlled by multiple environmental drivers (e.g., climate, vegetation, land use and cover), their study requires working with cross-thematic data. However, in most countries, thematic data are often scattered across computer servers of different institutions (e.g., universities, federal and state offices, non-governmental organizations), and are generally lacking spatial standards, whether in terms of resolution, extent, or projection system. Consequently, a significant effort for data compilation and standardization is usually required prior to using them, which implies a non-sustainable use of time and resources, but also hinders comparisons and cross-project usage of data.
In Switzerland, many spatial datasets are publicly available from university and governmental computer servers and cover a wide range of themes including topography10, hydrography11, land use and cover12, transportation13, and several others. Although there has been recent efforts for developing web portals aimed at gathering the increasing volume of new data, such as the Swiss Data Cube (https://www.swissdatacube.org/) for remote sensing data14, or opendata.swiss (https://opendata.swiss/en) for Swiss government data, available layers are rarely provided in standard formats, so their joint use can require tedious data management and heavy geomatic operations. Currently, a ready-to-use, standard, cross-thematic, geospatial database gathering key layers for ecological research in Switzerland is lacking. Yet, thanks to the large amount of available data and the diversity of landscapes to study and protect, Switzerland is an ideal candidate to promote greater standards in scientific data, which is essential for advancing research in ecology and can inspire similar initiatives worldwide.
Here we introduce SWECO25, a 25-meter resolution raster database gathering 5,265 layers on 10 main environmental categories. The 25-meter resolution was chosen as a trade-off between spatial accuracy, resolution of input sources, and size of output database. Layers available in SWECO25 were standardized to a common spatial grid covering all of Switzerland so they all share the same spatial resolution, extent, and geographic coordinate system. SWECO25 includes both the standardized sources and newly calculated layers, such as those obtained by computing focal or distance statistics. By providing standardized spatial data for a large range of environmental themes, SWECO25 stands as a foundational contribution for more effective analyses, informed decision-making, collaboration, and sustainable development across various sectors. It should help streamlining stakeholder workflows and support them in making more accurate decisions. The variety of potential applications of national interest for Switzerland includes, but is not limited to, biodiversity conservation, glacier and snowmelt modelling, tourism and recreation management, natural hazard mitigation, energy transition planning, or ecosystem services assessment. The SWECO25 database and associated metadata are openly available on Zenodo (https://zenodo.org/communities/sweco25/).
Methods
The development of SWECO25 followed four main steps (Fig. 1): (1) dataset identification, (2) dataset selection, (3) layer processing, and (4) public upload on Zenodo (https://zenodo.org/communities/sweco25/).
Dataset identification
We identified candidate datasets for SWECO25 by screening academic geodata servers (e.g., University of Lausanne UnilGis, Zurich Polytechnic GeoVITe, University of Bern GIUBGIS, and University of Geneva GRID), Swiss governmental geodata (https://opendata.swiss/en), and consulted with a panel of ~20 scientists involved in ecological research in Switzerland. Discussions among panel members allowed sharing unpublished data and identifying gaps in available datasets (e.g., missing themes or coarse spatial resolutions). When these gaps were not solvable by applying basic geomatic operations (e.g., resampling or reprojection), research initiatives involving more advanced computational techniques were started. This was the case for developing the high-resolution climate15, and land-use and cover16 datasets.
Dataset selection
Three main criteria were used for deciding on the selection of an existing dataset for SWECO25: (i) its relevance for ecological research, (ii) a spatial extent covering all of Switzerland, and (iii) a minimum input resolution of 100 meters to preserve data accuracy after resampling.
Layer processing
Individual layers from selected datasets were processed following an eight-step standardization procedure: (i) rasterization of vector layers, (ii) reprojection to the CH1903 +/LV95 (https://epsg.io/2056) geographic coordinate system, (iii) resampling to a common spatial grid of 25-meter resolution (bilinear and nearest neighbor methods for continuous and categorical layers, respectively), (iv) transformation of discrete data to continuous values (e.g. converting discrete land use and cover classes within a grid cell to percentage cover for a particular class), (v) computation of distance statistics for linear features (e.g., Euclidean and path distance to roads and rivers), (vi) computation of focal statistics by applying a cell-level function calculating the average value in a circular moving window of 13 radii ranging from 25 meters to 5 kilometers, (vii) conversion of decimals to integer values for storage efficiency, and (viii) saving the final raster layers as GeoTIFF files.
Data Records
Following our dataset selection criteria, 16 datasets were collected (Table 1, Table 2, and Table 3). The source datasets consisted predominantly of raster files (10 out of 16) with a mean ± standard deviation (SD) spatial resolution of 32.5 ± 34.8 meters and most of them were already projected in the CH1903 +/LV95 geographic coordinate system (10 out of 16). Most of the datasets were available for a single time step (static), except the chclim25, geostat25, statpop, and sdc datasets that contained layers for several time steps. In addition, the chclim25 dataset included layers for three future greenhouse gas concentration trajectories, or representative concentration pathways (RCPs)17, RCP2.6 (“Very Low Carbon”), RCP4.5 (“Low Carbon”), and RCP8.5 (“High Carbon”). After running the standardization procedure, a total of 5,265 layers was produced and compiled together in SWECO25 (v1.0.0) for a total size of 157 GB. Figure 2 provides an overview of the diversity of layer types available in SWECO25.
All SWECO25 layers and files are following a standard naming scheme, which is also used for folder organization (Fig. 3). The tree structure of SWECO25 folders can be developed to a maximum of six levels: category, dataset, period, sub-period, scenario, and variable (Fig. 3). At its top level SWECO25 is divided into ten main environmental categories: geologic (“geol”)18, topographic (“topo”)19, bioclimatic (“bioclim”)20, hydrologic (“hydro”)21, edaphic (“edaph”)22, land use and cover (“lulc”)23, population (“pop”)24, transportation (“trans”)25, vegetation (“vege”)26, and remote sensing indices (“rs”)27. The environmental category with the most layers was land use and cover (3,304), followed by bioclimatic (428) and geologic (420). For each environmental category, the detailed list of layers can be found in the SWECO25_datalayers_details_categoryname.csv file available in its respective Zenodo repository.
Technical Validation
SWECO25 was technically validated through an automated procedure during which all layers were checked for their standard format (reference system, spatial resolution, and extent), folder structure, naming scheme, and data integrity (count of NA cells, range of values, and integer format). All layers successfully passed the validation procedure, except for several from the remote sensing category that contained a higher number of NA cells, as well as very high or low values. The propagation of errors inherent to the source data to SWECO25 is difficult to avoid and to measure. For instance, reason for NA cells is most likely artefacts from satellite imagery, whereas very high or low values arise because remote sensing vegetation indices were computed for all of Switzerland, including areas not covered by vegetation. We did not mask very high and low values, as they might still be informative for some users (e.g. provide information on the presence or absence of vegetation). The supplementary file SWECO25_datalayers_details_rs.csv available on the SWECO25 remote sensing Zenodo repository (https://zenodo.org/record/7994481) allows identifying these layers. In addition, we assessed the potential effects of spatial resampling on SWECO25 layers by comparing the values of 15'000 random points extracted from the source and resampled layers. This analysis was conducted for source layers from all datasets, except the gwn07 and tlmd3D datasets that consisted of linear features (i.e., river and road networks, respectively) from which distance layers were computed directly on the SWECO25 grid. The chclim25 dataset was also excluded from this analysis as the source dataset used the SWECO25 grid. For the “sdc” dataset, due to the number of source layers in it, only three time-steps were randomly tested for each spectral index. Results from the resampling analysis indicated very low differences between source and resampled values, with a median coefficient of variation28 ± standard deviation of 0.0066 ± 0.0348 for continuous data (32 source layers evaluated), and a median Dice coefficient29 ± standard deviation of 0.9906 ± 0.0793 for discrete data (7 source layers evaluated). A detailed version of the technical validation procedure with additional results can be found on the SWECO25 GitHub repository (https://github.com/NKulling/SWECO25/tree/main/database_validation).
Code availability
The R-code and ArcGIS toolboxes allowing to reproduce the standardization procedure, the computation of focal and distance statistics, and the technical validation are openly available on the SWECO25 GitHub repository https://github.com/NKulling/SWECO25.
References
Dlamini, S. N. et al. Review of remotely sensed data products for disease mapping and epidemiology. Remote Sensing Applications: Society and Environment 14, 108–118 (2019).
Jamison, A., Tuttle, E., Jensen, R., Bierly, G. & Gonser, R. Spatial ecology, landscapes, and the geography of vector-borne disease: A multi-disciplinary review. Applied Geography 63, 418–426 (2015).
Meentemeyer, R. K., Haas, S. E. & Václavík, T. Landscape epidemiology of emerging infectious diseases in natural and human-altered ecosystems. Annual review of Phytopathology 50, 379–402 (2012).
Oliver, M. A. in Geostatistical Applications for Precision Agriculture (ed M.A. Oliver) 1-34 (Springer, 2010).
Cambouris, A., Zebarth, B., Ziadi, N. & Perron, I. Precision agriculture in potato production. Potato Research 57, 249–262 (2014).
Mathenge, M., Sonneveld, B. G. & Broerse, J. E. Application of GIS in Agriculture in Promoting Evidence-Informed Decision Making for Improving Agriculture Sustainability: A Systematic Review. Sustainability 14, 9974 (2022).
Turner, W. et al. Free and open-access satellite data are key to biodiversity conservation. Biological Conservation 182, 173–176 (2015).
Fletcher, R. & Fortin, M. Spatial ecology and conservation modeling. (Springer, 2018).
Moilanen, A., Wilson, K. & Possingham, H. Spatial conservation prioritization: quantitative methods and computational tools. (Oxford University Press, 2009).
Swiss Federal Office of Topography [swisstopo]. The high precision digital elevation model of Switzerland swissALTI3D (2016 version). (Wabern, Switzerland, 2016).
Swiss Federal Office of Topography [swisstopo]. Hydrographic network VECTOR25 GWN07 (2007 version). (Wabern, Switzerland, 2007).
Swiss Federal Statistical Office [FSO]. Swiss Land Use Statistics. (Neuchâtel, Switzerland, 2021).
Swiss Federal Office of Topography [swisstopo]. The large-scale topographic landscape model of Switzerland swissTLM3D (2021 version). (Wabern, Switzerland, 2021).
Chatenoux, B. et al. The Swiss data cube, analysis ready data archive using earth observations of Switzerland. Scientific data 8, 295 (2021).
Broennimann, O. CHclim25: A high spatial and temporal resolution climate dataset for Switzerland (v.1). (Lausanne, Switzerland, 2021).
Giuliani, G., Rodila, D., Külling, N., Maggini, R. & Lehmann, A. Downscaling Switzerland Land Use/Land Cover Data Using Nearest Neighbors and an Expert System. Land 11, 615 (2022).
van Vuuren, D. P. et al. The representative concentration pathways: an overview. Climatic Change 109, 5–31 (2011).
Külling, N. & Adde, A. SWECO25: Geologic (geol). Zenodo https://doi.org/10.5281/zenodo.7981134 (2023).
Külling, N. & Adde, A. SWECO25: Topographic (topo). Zenodo https://doi.org/10.5281/zenodo.7973960 (2023).
Külling, N. & Adde, A. SWECO25: Bioclimatic (bioclim). Zenodo https://doi.org/10.5281/zenodo.7978760 (2023).
Külling, N. & Adde, A. SWECO25: Hydrologic (hydro). Zenodo https://doi.org/10.5281/zenodo.7981127 (2023).
Külling, N. & Adde, A. SWECO25: Edaphic (edaph). Zenodo https://doi.org/10.5281/zenodo.7981143 (2023).
Külling, N. & Adde, A. SWECO25: Land Use and Cover (lulc). Zenodo https://doi.org/10.5281/zenodo.7981110 (2023).
Külling, N. & Adde, A. SWECO25: Population density (pop). Zenodo https://doi.org/10.5281/zenodo.7981095 (2023).
Külling, N. & Adde, A. SWECO25: Transportation (trans). Zenodo https://doi.org/10.5281/zenodo.7981069 (2023).
Külling, N. & Adde, A. SWECO25: Vegetation (vege). Zenodo https://doi.org/10.5281/zenodo.7973922 (2023).
Külling, N. & Adde, A. SWECO25: Remote Sensing (rs). Zenodo https://doi.org/10.5281/zenodo.7994481 (2023).
Lovie, P. Coefficient of Variation. in Encyclopedia of Statistics in Behavioral Science (eds. Everitt, B. S. & Howell, D. C.) (Wiley, 2005).
Dice, L. R. Measures of the Amount of Ecologic Association Between Species. Ecology 26, 297–302 (1945).
Swiss Federal Office of Topography [swisstopo]. Geotechnical map of Switzerland (1967 version). (Wabern, Switzerland, 1967).
Federal Office for the Environment [FOEN]. Swiss watercourse structure and morphology (2009 version). (Bern, Switzerland, 2009).
Kaelin, K. & Altermatt, F. Landscape-level predictions of diversity in river networks reveal opposing patterns for different groups of macroinvertebrates. Aquatic Ecology 50, 283–295 (2016).
Descombes, P. et al. Spatial modelling of ecological indicator values improves predictions of plant distributions in complex landscapes. Ecography 43, 1448–1463 (2020).
Hürdler J., Prasuhn V., Spiess E. Abschätzung diffuser Stickstoff- und Phosphoreinträge in die Gewässer der Schweiz MODIFFUS 3.0: Bericht im Auftrag des Bundesamtes für Umwelt (BAFU). (Zürich, Switzerland, 2015).
Price, B., Huber, N., Ginzler, C., Pazúr, R., Rüetschi, M. The Habitat Map of Switzerland (v.1). (Birmensdorf, Switzerland, 2021).
Swiss Federal Statistical Office [FSO]. Population and Households Statistics (STAPOP) (2010-2020 versions). (Neuchâtel, Switzerland, 2020).
Swiss Federal Office for the Environment [FOEN]. sonBASE GIS noise database (2018 version). (Bern, Switzerland, 2018).
Ginzler, C. Vegetation Height Model (National Forest Inventory) (2019 version). (Birmensdorf, Switzerland, 2019).
European Environment Agency [EEA]. Copernicus Land Monitoring Service - High Resolution Layer Forest (2018 version). (Copenhagen, Denmark, 2018).
Chatenoux, B., Rodila, D., Giuliani, G. & Italiano, J. Swissdatacube Geonetwork online database (accessed on Jan-2023), https://geonetwork.swissdatacube.org (2019).
Acknowledgements
We gratefully acknowledge financial support through the Action Plan of the Swiss Biodiversity Strategy by the Federal Office for the Environment (FOEN) for financing the ValPar.CH (https://www.valpar.ch/) and SwissCatchment projects. All the institutions and contributors who made the input data freely available are gratefully acknowledged.
Author information
Authors and Affiliations
Contributions
Nathan Külling contributed to the conceptualization of the study and co-led the technical preparation of the database (all datasets), the validation, and the writing of the original draft. Antoine Adde contributed to the conceptualization of the study and co-led the technical preparation of the database (all datasets), the validation, and the writing of the original draft. Fabian Fopp contributed to the conceptualization of the study, led the preparation of the hydrologic datasets, and helped in the writing of the original draft. Anna K. Schweiger contributed to the conceptualization of the study, led the preparation of the remote sensing dataset, and helped in the writing of the original draft. Olivier Broennimann contributed to the conceptualization of the study, led the preparation of the bioclimatic dataset, and helped in the writing of the original draft. Pierre-Louis Rey contributed to the conceptualization of the study and helped in the review and editing of the manuscript. Gregory Giuliani contributed to the preparation of the remote sensing dataset and in the review and editing of the manuscript. Teresa Goicolea contributed to figure preparation and in the review and editing of the manuscript. Blaise Petitpierre contributed to the conceptualization of the study and helped in the review and editing of the manuscript. Niklaus E. Zimmermann contributed to the conceptualization of the study and helped in the review and editing of the manuscript. Florian Altermatt contributed to the conceptualization of the study and helped in the review and editing of the manuscript. Loic Pellissier contributed to the conceptualization of the study and helped in the review and editing of the manuscript. Anthony Lehmann co-led the funding acquisition, contributed to the conceptualization of the study, and helped in the review and editing of the manuscript. Antoine Guisan co-led the funding acquisition, contributed to the conceptualization of the study, and helped in the review and editing of the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Külling, N., Adde, A., Fopp, F. et al. SWECO25: a cross-thematic raster database for ecological research in Switzerland. Sci Data 11, 21 (2024). https://doi.org/10.1038/s41597-023-02899-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02899-1
- Springer Nature Limited
This article is cited by
-
Multispecies deep learning using citizen science data produces more informative plant community models
Nature Communications (2024)