Sources of uncertainty in ecological modelling: predicting vegetation types from environmental attributes

Dale, M. B.; Dale, P. E. R.

doi:10.1556/ComEc.5.2004.2.9

Open access
Published: 30 December 2004

Volume 5, pages 203–225, (2004)

Community Ecology

Abstract

In this paper, we use decision trees to construct models for predicting vegetation types from environmental attributes in a salt marsh. We examine a method for evaluating the worth of a decision tree and look at seven sources of uncertainty in the models produced, namely algorithmic, predictive, model, scenario, objective, context and scale. The accuracy of prediction of types was strongly affected by the scenario and scale, with the most dynamically variable attributes associated with poor prediction, while more static attributes performed better. However, examination of the misclassified samples showed that prediction of processes was much better, with local vegetation type-induced patterns nested within a broader environmental framework.

Abbreviations

MML: Minimum Message Length.

Author information

Authors and Affiliations

Australian School of Environmental Studies, Griffith University, Nathan, Qld., 4111, Australia
M. B. Dale & P. E. R. Dale

Corresponding author

Correspondence to P. E. R. Dale.

Rights and permissions

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Dale, M.B., Dale, P.E.R. Sources of uncertainty in ecological modelling: predicting vegetation types from environmental attributes. Community Ecology 5, 203–225 (2004). https://doi.org/10.1556/ComEc.5.2004.2.9

Issue Date: June 2004
DOI: https://doi.org/10.1556/ComEc.5.2004.2.9
