Predicting the timing of ecological phenomena using dates of species occurrence records: a methodological approach and test case with mushrooms

Capinha, César

doi:10.1007/s00484-019-01714-0

Predicting the timing of ecological phenomena using dates of species occurrence records: a methodological approach and test case with mushrooms

Original Paper
Published: 18 April 2019

Volume 63, pages 1015–1024, (2019)
Cite this article

International Journal of Biometeorology Aims and scope Submit manuscript

César Capinha ORCID: orcid.org/0000-0002-0666-9755^1,2

617 Accesses
9 Citations
1 Altmetric
Explore all metrics

Abstract

Spatiotemporal predictions of ecological phenomena are highly useful and significant in scientific and socio-economic applications. However, the inadequate availability of ecological time-series data often impedes the development of statistical predictions. On the other hand, considerable amounts of temporally discrete biological records (commonly known as ‘species occurrence records’) are being stored in public databases, and often include the location and date of the observation. In this paper, we describe an approach to develop spatiotemporal predictions based on the dates and locations found in species occurrence records. The approach is based on ‘time-series classification’, a field of machine learning, and consists of applying a machine-learning algorithm to classify between time series representing the environmental variation that precedes the occurrence records and time series representing the full range of environmental variation that is available in the location of the records. We exemplify the application of the approach for predicting the timing of emergence of fruiting bodies of two mushroom species (Boletus edulis and Macrolepiota procera) in Europe, from 2009 to 2015. Predictions made from this approach were superior to those provided by a ‘null’ model representing the average seasonality of the species. Given the increased availability and information contained in species occurrence records, particularly those supplemented with photographs, the range of environmental events that could be possible to predict using this approach is vast.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Chronicles of nature calendar, a long-term and large-scale multitaxon database on phenology

Article Open access 11 February 2020

Otso Ovaskainen, Evgeniy Meyke, … Juri Kurhinen

Assessing cumulative uncertainties of remote sensing time series and telemetry data in animal-environment studies

Article Open access 25 January 2024

Ines Standfuß, Christian Geiß, … Hannes Taubenböck

Advancing our understanding of biological invasions with long-term biomonitoring data

Article Open access 05 August 2023

Phillip J. Haubrock, Laís Carneiro, … Danish A. Ahmed

Data availability

Data downloaded from GBIF can be found in https://doi.org/10.15468/dl.yiaod6 and https://doi.org/10.15468/dl.2ohxaa. The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Allouche O, Tsoar A, Kadmon R (2006) Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). J Appl Ecol 43:1223–1232. https://doi.org/10.1111/j.1365-2664.2006.01214.x
Article Google Scholar
Almeida-Neto M, Lewinsohn TM (2004) Small-scale spatial autocorrelation and the interpretation of relationships between phenological parameters. J Veg Sci 15:561–568. https://doi.org/10.1111/j.1654-1103.2004.tb02295.x
Article Google Scholar
Andrew C, Heegaard E, Gange AC, Senn-Irlet B, Egli S, Kirk PM, Büntgen U, Kauserud H, Boddy L (2018) Congruency in fungal phenology patterns across dataset sources and scales. Fungal Ecol 32:9–17. https://doi.org/10.1016/j.funeco.2017.11.009
Article Google Scholar
Bagnall A, Davis L, Hills J, Lines J (2012) Transformation based ensembles for time series classification. In: Proceedings of the 2012 SIAM international conference on data mining. society for industrial and applied mathematics, pp 307–318
Bagnall A, Lines J, Bostrom A, Large J, Keogh E (2017) The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min Knowl Disc 31:606–660. https://doi.org/10.1007/s10618-016-0483-9
Article Google Scholar
Balfour NJ, Ollerton J, Castellanos MC, Ratnieks FLW (2018) British phenological records indicate high diversity and extinction rates among late-summer-flying pollinators. Biol Conserv 222:278–283. https://doi.org/10.1016/j.biocon.2018.04.028
Article Google Scholar
Barve V (2014) Discovering and developing primary biodiversity data from social networking sites: a novel approach. Ecol Inform 24:194–199. https://doi.org/10.1016/j.ecoinf.2014.08.008
Article Google Scholar
Bird TJ, Bates AE, Lefcheck JS, Hill NA, Thomson RJ, Edgar GJ, Stuart-Smith RD, Wotherspoon S, Krkosek M, Stuart-Smith JF, Pecl GT, Barrett N, Frusher S (2014) Statistical solutions for error and bias in global citizen science datasets. Biol Conserv 173:144–154. https://doi.org/10.1016/j.biocon.2013.07.037
Article Google Scholar
Bishop TR, Botham MS, Fox R, Leather SR, Chapman DS, Oliver TH (2013) The utility of distribution data in predicting phenology. Methods Ecol Evol 4:1024–1032. https://doi.org/10.1111/2041-210X.12112
Article Google Scholar
Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn 30:1145–1159. https://doi.org/10.1016/S0031-3203(96)00142-2
Article Google Scholar
Broennimann O, Di Cola V, Guisan A (2018) Ecospat: spatial ecology miscellaneous methods. R package version 3.0. URL https://CRAN.R-project.org/package=ecospat
Buisson L, Thuiller W, Casajus N et al (2010) Uncertainty in ensemble forecasting of species distribution. Glob Chang Biol 16:1145–1157. https://doi.org/10.1111/j.1365-2486.2009.02000.x
Article Google Scholar
Chapman DS, Bell S, Helfer S, Roy DB (2015) Unbiased inference of plant flowering phenology from biological recording data. Biol J Linn Soc 115:543–554. https://doi.org/10.1111/bij.12515
Article Google Scholar
Chuine I, Régnière J (2017) Process-based models of phenology for plants and animals. Annu Rev Ecol Evol Syst 48:159–182. https://doi.org/10.1146/annurev-ecolsys-110316-022706
Article Google Scholar
R Core Team (2018). R: a language and environment for statistical computing. R Foundation for statistical computing, Vienna, Austria. URL https://www.R-project.org/
Czernecki B, Nowosad J, Jabłońska K (2018) Machine learning modeling of plant phenology based on coupling satellite and gridded meteorological dataset. Int J Biometeorol 62:1297–1309. https://doi.org/10.1007/s00484-018-1534-2
Article Google Scholar
Dietze M (2017) Ecological forecasting. Princeton University Press, Princeton
Book Google Scholar
Diez JM, James TY, McMunn M, Ibáñez I (2013) Predicting species-specific responses of fungi to climatic variation using historical records. Glob Chang Biol 19:3145–3154. https://doi.org/10.1111/gcb.12278
Article Google Scholar
Elith J, Leathwick JR, Hastie T (2008) A working guide to boosted regression trees. J Anim Ecol 77:802–813. https://doi.org/10.1111/j.1365-2656.2008.01390.x
Article CAS Google Scholar
ElQadi MM, Dorin A, Dyer A et al (2017) Mapping species distributions with social media geo-tagged images: case studies of bees and flowering plants in Australia. Ecol Inform 39:23–31. https://doi.org/10.1016/j.ecoinf.2017.02.006
Article Google Scholar
Flaxman S, Chirico M, Pereira P, Loeffler C (2018) Scalable high-resolution forecasting of sparse spatiotemporal events with kernel methods: a winning solution to the NIJ “Real-Time Crime Forecasting Challenge.” arXiv:180102858
Franklin J (2010) Moving beyond static species distribution models in support of conservation biogeography. Divers Distrib 16:321–330. https://doi.org/10.1111/j.1472-4642.2010.00641.x
Article Google Scholar
Fulcher BD, Jones NS (2014) Highly comparative feature-based time-series classification. IEEE Trans Knowl Data Eng 26:3026–3037. https://doi.org/10.1109/TKDE.2014.2316504
Article Google Scholar
García-Roselló E, Guisande C, Manjarrés-Hernández A, González-Dacosta J, Heine J, Pelayo-Villamil P, González-Vilas L, Vari RP, Vaamonde A, Granado-Lorencio C, Lobo JM (2015) Can we derive macroecological patterns from primary global biodiversity information facility data? Glob Ecol Biogeogr 24:335–347. https://doi.org/10.1111/geb.12260
Article Google Scholar
Geurts P (2001) Pattern extraction for time series classification. In: De Raedt L, Siebes A (eds) Principles of data mining and knowledge discovery. Springer Berlin Heidelberg, pp 115–127
Griffith DA, Peres-Neto PR (2006) Spatial modeling in ecology: the flexibility of eigenfunction spatial analyses. Ecol 87:2603–2613. https://doi.org/10.1890/0012-9658(2006)87[2603:SMIETF]2.0.CO;2
Hijmans RJ (2018) Raster: geographic data analysis and modeling. R package version 2, pp 7–15 URL https://CRAN.R-project.org/package=raster
Google Scholar
Hijmans RJ, Phillips S, Leathwick J, Elith J (2017) Dismo: species distribution modeling. R package version 1.1–4. URL https://CRAN.R-project.org/package=dismo
Hsieh C, Anderson C, Sugihara G (2008) Extending nonlinear analysis to short ecological time series. Am Nat 171:71–80. https://doi.org/10.1086/524202
Article Google Scholar
Hudson IL, Keatley MR (eds) (2010a) Phenological research: methods for environmental and climate change analysis. Springer, Dordrecht
Google Scholar
Hudson IL, Keatley MR (2010b) Singular spectrum analysis: climatic niche identification. In: Hudson IL, Keatley MR (eds) Phenological research: methods for environmental and climate change analysis. Springer, Dordrecht, pp 393–424
Chapter Google Scholar
Hudson IL, Keatley MR, Kang I (2011a) Wavelet signatures of climate and flowering: identification of species groupings. In: Olkkonen H (ed) Discrete wavelet transforms-biomedical applications. InTech, Vienna
Google Scholar
Hudson IL, Keatley MR, Lee SY (2011b) Using Self-Organising Maps (SOMs) to assess synchronies: an application to historical eucalypt flowering records. Int J Biometeorol 55:879–904. https://doi.org/10.1007/s00484-011-0427-4
Article Google Scholar
Huete A, Didan K, Miura T, Rodriguez EP, Gao X, Ferreira LG (2002) Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens Environ 83:195–213. https://doi.org/10.1016/S0034-4257(02)00096-2
Article Google Scholar
Humphries G, Che-Castaldo C, Bull P et al (2018) Predicting the future is hard and other lessons from a population time series data science competition. Ecol Inform 48:1–11. https://doi.org/10.1016/j.ecoinf.2018.07.004
Article Google Scholar
Isaac NJB, Pocock MJO (2015) Bias and information in biological records. Biol J Linn Soc 115:522–531. https://doi.org/10.1111/bij.12532
Article Google Scholar
Jeanneret F, Rutishauser T (2010) Phenology for topoclimatological surveys and large-scale mapping. In: Hudson IL, Keatley MR (eds) Phenological research: methods for environmental and climate change analysis. Springer Netherlands, Dordrecht, pp 159–175
Chapter Google Scholar
Kampouraki A, Manis G, Nikou C (2009) Heartbeat time series classification with support vector machines. IEEE Trans Inf Technol Biomed 13:512–518. https://doi.org/10.1109/TITB.2008.2003323
Article Google Scholar
Kuhn M, Johnson K (2013) Applied predictive modeling. Springer-Verlag, New York
Book Google Scholar
Laurinec P, Lucká M (2016) Comparison of representations of time series for clustering smart meter data. In: Proceedings of IEEE 16th international conference on data mining workshops. IEEE, pp 398–405
Laurinec P, Lucká M (2018) Interpretable multiple data streams clustering with clipped streams representation for the improvement of electricity consumption forecasting. Data Min Knowl Disc 33:1–33. https://doi.org/10.1007/s10618-018-0598-2
Article Google Scholar
Lincoff G (2017) The complete mushroom hunter: an illustrated guide to finding, harvesting, and enjoying wild mushrooms, illustrated edition. Quarry Books, Massachusetts
Loarie S (2017) We’ve reached 150,000 observers! Retrieved from https://www.inaturalist.org/blog/11756-we-ve-reached-150-000-observers
Lowe R, Coelho CA, Barcellos C et al (2016) Evaluating probabilistic dengue risk forecasts from a prototype early warning system for Brazil. Elife 5. https://doi.org/10.7554/eLife.11285
Ma J, Perkins S (2003) Time-series novelty detection using one-class support vector machines. In: Proceedings of the International Joint Conference on Neural Networks, 2003. pp 1741–1745
Marmion M, Parviainen M, Luoto M, Heikkinen RK, Thuiller W (2009) Evaluation of consensus methods in predictive species distribution modelling. Divers Distrib 15:59–69. https://doi.org/10.1111/j.1472-4642.2008.00491.x
Article Google Scholar
Mazurkiewicz N, Podlasińska J (2014) Bioaccumulation of trace elements in wild-growing edible mushrooms from Lubuskie voivodeship, Poland. Chem Ecol 30:110–117. https://doi.org/10.1080/02757540.2013.841899
Article CAS Google Scholar
Mörchen F (2003) Time series feature extraction for data mining using DWT and DFT. University of Marburg, Department of Mathematics and Computer Science, Technical Report no. 33
Moriondo M, Maselli F, Bindi M (2007) A simple model of regional wheat yield based on NDVI data. Eur J Agron 26:266–274. https://doi.org/10.1016/j.eja.2006.10.007
Article Google Scholar
Neuheimer AB, Taggart CT (2007) The growing degree-day and fish size-at-age: the overlooked metric. Can J Fish Aquat Sci 64:375–385. https://doi.org/10.1139/f07-003
Article Google Scholar
Oliver TH, Roy DB (2015) The pitfalls of ecological forecasting. Biol J Linn Soc 115:767–778. https://doi.org/10.1111/bij.12579
Article Google Scholar
Potamitis I, Rigakis I, Fysarakis K (2015) Insect biometrics: optoacoustic signal processing and its applications to remote monitoring of McPhail type traps. PLoS One 10:e0140474. https://doi.org/10.1371/journal.pone.0140474
Article CAS Google Scholar
Prank M, Chapman DS, Bullock JM, Belmonte J, Berger U, Dahl A, Jäger S, Kovtunenko I, Magyar D, Niemelä S, Rantio-Lehtimäki A, Rodinkova V, Sauliene I, Severova E, Sikoparija B, Sofiev M (2013) An operational model for forecasting ragweed pollen release and dispersion in Europe. Agric For Meteorol 182–183:43–53. https://doi.org/10.1016/j.agrformet.2013.08.003
Article Google Scholar
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, Müller M (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12:77. https://doi.org/10.1186/1471-2105-12-77
Article Google Scholar
Ruiz-Gutierrez V, Hooten MB, Grant EHC (2016) Uncertainty in biological monitoring: a framework for data collection and analysis to account for multiple sources of sampling bias. Methods Ecol Evol 7:900–909. https://doi.org/10.1111/2041-210X.12542
Article Google Scholar
Scales KL, Hazen EL, Maxwell SM, Dewar H, Kohin S, Jacox MG, Edwards CA, Briscoe DK, Crowder LB, Lewison RL, Bograd SJ (2017) Fit to predict? Eco-informatics for predicting the catchability of a pelagic fish in near real time. Ecol Appl 27:2313–2329. https://doi.org/10.1002/eap.1610
Article Google Scholar
Schäfer P (2015) The BOSS is concerned with time series classification in the presence of noise. Data Min Knowl Disc 29:1505–1530. https://doi.org/10.1007/s10618-014-0377-7
Article Google Scholar
Schäfer P, Leser U (2017) Fast and accurate time series classification with WEASEL. In: Proceedings of the 2017 ACM on conference on information and knowledge management. ACM, New York, pp 637–646
Google Scholar
Studer S, Stöckli R, Appenzeller C, Vidale PL (2007) A comparative study of satellite and ground-based phenology. Int J Biometeorol 51:405–414. https://doi.org/10.1007/s00484-006-0080-5
Article CAS Google Scholar
Tiago P, Ceia-Hasse A, Marques TA, Capinha C, Pereira HM (2017) Spatial distribution of citizen science casuistic observations for different taxonomic groups. Sci Rep 7:12832. https://doi.org/10.1038/s41598-017-13130-8
Article CAS Google Scholar
Urban MC, Bocedi G, Hendry AP, Mihoub JB, Peer G, Singer A, Bridle JR, Crozier LG, de Meester L, Godsoe W, Gonzalez A, Hellmann JJ, Holt RD, Huth A, Johst K, Krug CB, Leadley PW, Palmer SCF, Pantel JH, Schmitz A, Zollner PA, Travis JMJ (2016) Improving the forecast for biodiversity under climate change. Science 353:aad8466. https://doi.org/10.1126/science.aad8466
Article CAS Google Scholar
White MA, Thornton PE, Running SW (1997) A continental phenology model for monitoring vegetation responses to interannual climatic variability. Glob Biogeochem Cycles 11:217–234. https://doi.org/10.1029/97GB00330
Article CAS Google Scholar
Wickham H (2016) ggplot2: elegant graphics for data analysis, 2nd edn. Springer, New York
Book Google Scholar
Zadrozny B (2004) Learning and evaluating classifiers under sample selection bias. In: Proceedings of the Twenty-first International Conference on Machine Learning ACM, New York, NY, USA, pp 114

Download references

Acknowledgments

The author thanks two anonymous reviewers for valuable comments on an earlier version of the manuscript.

Author information

Authors and Affiliations

CIBIO/InBio, Centro de Investigação em Biodiversidade e Recursos Genéticos, Laboratório Associado, Campus Agrário de Vairão, Universidade do Porto, Vairão, 4485-661, Porto, Portugal
César Capinha
CIBIO/InBio, Centro de Investigação em Biodiversidade e Recursos Genéticos, Laboratório Associado, Instituto Superior de Agronomia, Universidade de Lisboa, Tapada da Ajuda, 1349-017, Lisbon, Portugal
César Capinha

Authors

César Capinha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to César Capinha.

Ethics declarations

Conflict of interest

The author declares that he has no conflict of interest.

Electronic supplementary material

ESM 1

(DOCX 2040 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Capinha, C. Predicting the timing of ecological phenomena using dates of species occurrence records: a methodological approach and test case with mushrooms. Int J Biometeorol 63, 1015–1024 (2019). https://doi.org/10.1007/s00484-019-01714-0

Download citation

Received: 11 October 2018
Revised: 15 March 2019
Accepted: 27 March 2019
Published: 18 April 2019
Issue Date: 15 August 2019
DOI: https://doi.org/10.1007/s00484-019-01714-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Predicting the timing of ecological phenomena using dates of species occurrence records: a methodological approach and test case with mushrooms

Abstract

Access this article

Similar content being viewed by others

Chronicles of nature calendar, a long-term and large-scale multitaxon database on phenology

Assessing cumulative uncertainties of remote sensing time series and telemetry data in animal-environment studies

Advancing our understanding of biological invasions with long-term biomonitoring data

Data availability

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Electronic supplementary material

ESM 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Predicting the timing of ecological phenomena using dates of species occurrence records: a methodological approach and test case with mushrooms

Abstract

Access this article

Similar content being viewed by others

Chronicles of nature calendar, a long-term and large-scale multitaxon database on phenology

Assessing cumulative uncertainties of remote sensing time series and telemetry data in animal-environment studies

Advancing our understanding of biological invasions with long-term biomonitoring data

Data availability

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Electronic supplementary material

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation