Skip to main content

Advertisement

Log in

Information-theoretic approaches to statistical analysis in behavioural ecology: an introduction

  • Review
  • Published:
Behavioral Ecology and Sociobiology Aims and scope Submit manuscript

Abstract

Scientific thinking may require the consideration of multiple hypotheses, which often call for complex statistical models at the level of data analysis. The aim of this introduction is to provide a brief overview on how competing hypotheses are evaluated statistically in behavioural ecological studies and to offer potentially fruitful avenues for future methodological developments. Complex models have traditionally been treated by model selection approaches using threshold-based removal of terms, i.e. stepwise selection. A recently introduced method for model selection applies an information-theoretic (IT) approach, which simultaneously evaluates hypotheses by balancing between model complexity and goodness of fit. The IT method has been increasingly propagated in the field of ecology, while a literature survey shows that its spread in behavioural ecology has been much slower, and model simplification using stepwise selection is still more widespread than IT-based model selection. Why has the use of IT methods in behavioural ecology lagged behind other disciplines? This special issue examines the suitability of the IT method for analysing data with multiple predictors, which researchers encounter in our field. The volume brings together different viewpoints to aid behavioural ecologists in understanding the method, with the hope of enhancing the statistical integration of our discipline.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

  • Adolph SC, Hardin JS (2007) Estimating phenotypic correlations: correcting for bias due to intraindividual variability. Funct Ecol 21:178–184

    Article  Google Scholar 

  • Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: Csáki F (ed) 2nd International Symposium on Information Theory. Akadémiai Kiadó, Budapest, pp 267–281

    Google Scholar 

  • Allen DM (1974) The relationship between variable selection and data augmentation and a method for prediction. Technometrics 16:125–127

    Article  Google Scholar 

  • Anderson DR (2008) Model based inference in the life sciences: a primer on evidence. Springer, New York

    Book  Google Scholar 

  • Anderson DR, Burnham KP (2002) Avoiding pitfalls when using information-theoretic methods. J Wildl Manage 66:910–916

    Google Scholar 

  • Anderson DR, Burnham KP, Thompson WL (2000) Null hypothesis testing: problems, prevalence, and an alternative. J Wildl Manage 64:912–923

    Article  Google Scholar 

  • Bell AM, Hankison SJ, Laskowski KL (2009) The repeatability of behaviour: a meta-analysis. Anim Behav 77:771–783

    Article  Google Scholar 

  • Berger JO, Wolpert RL (1984) The likelihood principle. Institute of Mathematical Statistics, Hayward

    Google Scholar 

  • Burnham KP, Anderson DR (2002) Model selection and multimodel inference: a practical information-theoretic approach. Springer, New York

    Google Scholar 

  • Burnham K, Anderson D, Huyvaert K (2010) Improving inferences in ecological and behavioral science: some background, observations, and comparisons. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1029-6

  • Cézilly F, Danchin É, Giraldeau L-A (2008) Research methods in behavioural ecology. In: Danchin É, Giraldeau L-A, Cézilly F (eds) Behavioural ecology: an evolutionary perspective on behaviour. Oxford University Press, Oxford, pp 55–95

    Google Scholar 

  • Chamberlin TC (1890) The method of multiple working hypotheses. Science 15:92–96

    Article  Google Scholar 

  • Claeskens C, Hjort NL (2008) Model selection and model averaging. Cambridge University Press, Cambridge

    Google Scholar 

  • Craven P, Wahba G (1979) Smoothing noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation. Numerische Mathematik 31:377–403

    Article  Google Scholar 

  • Crawley MJ (2007) The R book. Wiley, West Sussex

    Book  Google Scholar 

  • Derksen S, Keselman HJ (1992) Backward, forward and stepwise automated subset-selection algorithms—frequency of obtaining authentic and noise variables. Br J Math Stat Psychol 45:265–282

    Google Scholar 

  • Dochtermann N, Jenkins SH (2010) Developing and evaluating candidate hypotheses in behavioral ecology. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1039-4

  • Draper NR, Smith H (1981) Applied regression analysis, 2nd edn. Wiley, New York

    Google Scholar 

  • Forster MR (2000) Key concepts in model selection: performance and generalizability. J Math Psychol 44:205–231

    Article  PubMed  Google Scholar 

  • Forstmeier W, Schielzeth H (2010) Cryptic multiple hypotheses testing in linear models: overestimated effect sizes and the winner’s curse. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1038-5

  • Fox J (2002) An R and S-PLUS companion to applied regression. Sage, Newbury Park

    Google Scholar 

  • Freckleton RP (2010) Dealing with collinearity in behavioural and ecological data: model averaging and the problems of measurement error. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1045-6

  • Garamszegi LZ, Calhim S, Dochtermann N, Hegyi G, Hurd PL, Jørgensen C, Kutsukake N, Lajeunesse MJ, Pollard KA, Schielzeth H, Symonds MRE, Nakagawa S (2009) Changing philosophies and tools for statistical inferences in behavioral ecology. Behav Ecol 20:1363–1375

    Article  Google Scholar 

  • Gelman A, Hill J (2007) Data analysis using regression and multilevel/hierarchical models. Cambridge University Press, Cambridge

    Google Scholar 

  • Ginzburg LR, Jensen CXJ (2004) Rules of thumb for judging ecological theories. Trends Ecol Evol 19:121–126

    Article  PubMed  Google Scholar 

  • Graham MH (2003) Confronting multicollinearity in ecological multiple regression. Ecology 84:2809–2815

    Article  Google Scholar 

  • Guthery FS (2007) Deductive and inductive methods of accumulating reliable knowledge in wildlife science. J Wildl Manage 71:222–225

    Article  Google Scholar 

  • Guthery FS, Brennan LA, Peterson MJ, Lusk JJ (2005) Information theory in wildlife science: critique and viewpoint. J Wildl Manage 69:457–465

    Article  Google Scholar 

  • Harvey PH, Pagel MD (1991) The comparative method in evolutionary biology. Oxford University Press, Oxford

    Google Scholar 

  • Hegyi G, Garamszegi LZ (2010) Stepwise selection and information theory in ecology and behavior. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1036-7

  • Hilborn R, Mangel M (1997) The ecological detective: confronting models with data. Princeton University Press, Princeton

    Google Scholar 

  • Hobbs NT, Hilborn R (2006) Alternatives to statistical hypothesis testing in ecology: a guide to self teaching. Ecol Appl 16:5–19

    Article  PubMed  Google Scholar 

  • Huston MA (1997) Hidden treatments in ecological experiments: re-evaluating the ecosystem function of biodiversity. Oecologia 110:449–460

    Article  Google Scholar 

  • Johnson JB, Omland KS (2004) Model selection in ecology and evolution. Trends Ecol Evol 19:101–108

    Article  PubMed  Google Scholar 

  • Jones KS, Nakagawa S, Sheldon BC (2009) Environmental sensitivity in relation to size and sex in birds: meta-regression analysis. Am Nat 174:122–133

    Article  PubMed  Google Scholar 

  • Konishi S, Kitagawa G (2008) Information criteria and statistical modeling. Springer, New York

    Book  Google Scholar 

  • Krebs JR, Davies NB (1984) Behavioural ecology: an evolutionary approach. Blackwell Scientific, Oxford

    Google Scholar 

  • Lajeunesse MJ (2009) Meta-analysis and the comparative phylogenetic method. Am Nat 174:369–381

    Google Scholar 

  • Lebreton J-D, Burnham KP, Clobert J, Anderson DR (1992) Modeling survival and testing biological hypotheses using marked animals: a unified approach with case studies. Ecol Monogr 62:67–118

    Article  Google Scholar 

  • Liang H, Wu HL, Zou GH (2008) A note on conditional AIC for linear mixed-effects models. Biometrika 95:773–778

    Article  PubMed  Google Scholar 

  • Linhart H, Zucchini W (1986) Model selection. Wiley, New York

    Google Scholar 

  • Lukacs PM, Thompson WL, Kendall WL, Gould WR, Doherty PF, Burnham KP, Anderson DR (2007) Concerns regarding a call for pluralism of information theory and hypothesis testing. J Appl Ecol 44:456–460

    Article  Google Scholar 

  • Mallows CL (1973) Some comments on Cp. Technometrics 15:661–675

    Article  Google Scholar 

  • Massart P (2007) Concentration inequalities and model selection: ecole d’eté de probabilités de Saint-Flour XXXIII—2003. Springer, Berlin

    Google Scholar 

  • McArdle BH (2003) Lines, models, and errors: regression in the field. Limnol Oceanogr 48:1363–1366

    Google Scholar 

  • McCarthy MA (2007) Bayesian methods for ecology. Cambridge University Press, Cambridge

    Google Scholar 

  • McQuarrie ADR, Tsai C-L (1998) Regression and time series model selection. World Scientific, Singapore

    Book  Google Scholar 

  • Mundry R (2010) Issues in information theory based statistical inference—a commentary from a frequentist’s perspective. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1040-y

  • Mundry R, Nunn CL (2008) Stepwise model fitting and statistical inference: turning noise into signal pollution. Am Nat 173:119–123

    Article  Google Scholar 

  • Murtaugh PA (2009) Performance of several variable-selection methods applied to real ecological data. Ecol Lett 12:1061–1068

    Article  PubMed  Google Scholar 

  • Nakagawa S, Freckleton R (2008) Missing inaction: the dangers of ignoring missing data. Trends Ecol Evol 23:592–596

    Article  PubMed  Google Scholar 

  • Nakagawa S, Freckleton RP (2010) Model averaging, missing data and multiple imputation: a case study for behavioural ecology. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1044-7

  • O’Hara RB, Sillanpää MJ (2009) A review of Bayesian variable selection methods: what, how, and which. Bayesian Analysis 4:85–118

    Article  Google Scholar 

  • Owens IPF (2006) Where is behavioural ecology going? Trends Ecol Evol 21:356–361

    Article  PubMed  Google Scholar 

  • Platt JR (1964) Strong inference. Science 146:347–353

    Article  PubMed  CAS  Google Scholar 

  • Popper KR (1963) Conjectures and refutations. Routledge and Keagan Paul, London

    Google Scholar 

  • Pötscher BM (1989) Model selection under nonstationary: autoregressive models and stochastic linear regression models. Ann Stat 17:1257–1274

    Article  Google Scholar 

  • Quinn JF, Dunham AE (1983) On hypothesis testing in ecology and evolution. Am Nat 122:602–617

    Article  Google Scholar 

  • Quinn GP, Keough MJ (2002) Experimental design and data analysis for biologists. Cambridge University Press, Cambridge

    Google Scholar 

  • Rabosky DL (2006) Likelihood methods for detecting temporal shifts in diversification rates. Evolution 60:1152–1164

    PubMed  Google Scholar 

  • Rao CR, Wu Y (1989) A strongly consistent procedure for model selection in a regression problem. Biometrika 76:369–374

    Article  Google Scholar 

  • Richards SA, Whittingham MJ, Stephens PA (2010) Model selection and model averaging in behavioural ecology: the utility of the IT-AIC framework. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1035-8

  • Ripplinger J, Sullivan J (2008) Does choice in model selection affect maximum likelihood analysis? Syst Biol 57:76–85

    Article  PubMed  Google Scholar 

  • Rissanen J (1978) Modeling by shortest data description. Automatica 14:465–471

    Article  Google Scholar 

  • Royall MR (1997) Statistical evidence: a likelihood paradigm. Chapman and Hall, London

    Google Scholar 

  • Rushton SP, Ormerod SJ, Kerby G (2004) New paradigms for modelling species distributions? J Appl Ecol 41:193–200

    Article  Google Scholar 

  • Sakamoto Y (1991) Categorical data analysis by AIC. KTK Scientific, Tokyo

    Google Scholar 

  • Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464

    Article  Google Scholar 

  • Shibata R (1981) An optimal selection of regression variables. Biometrika 68:45–54

    Article  Google Scholar 

  • Sokal RR, Rohlf FJ (1995) Biometry, 3rd edn. Freeman, New York

    Google Scholar 

  • Steidl RJ (2006) Model selection, hypothesis testing, and risks of condemning analytical tools. J Wildl Manage 70:1497–1498

    Article  Google Scholar 

  • Stephens PA, Buskirk SW, Hayward GD, Del Rio CM (2005) Information theory and hypothesis testing: a call for pluralism. J Appl Ecol 42:4–12

    Article  Google Scholar 

  • Stephens PA, Buskirk SW, del Rio CM (2007a) Inference in ecology and evolution. Trends Ecol Evol 22:192–197

    Article  PubMed  Google Scholar 

  • Stephens PA, Buskirk SW, Hayward GD, Del Rio CM (2007b) A call for statistical pluralism answered. J Appl Ecol 44:461–463

    Article  Google Scholar 

  • Stone M (1974) Cross-validatory choice and assessment of statistical predictions. Jean-le-Blanc Journal of the Royal Statistical Society, Series B 36:111–147

    Google Scholar 

  • Sugiura N (1978) Further analysis of the data by Akaike’s information and the finite corrections. Commun Stat A7:13–26

    Article  Google Scholar 

  • Symonds M, Moussalli A (2010) Model selection, multimodel inference and model averaging using Akaike’s information criterion: an introduction for statistically terrified behavioural ecologists. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1037-6

  • Takeuchi K (1976) Distribution of informational statistics and a criterion of model fitting (in Japanese). Suri-Kagaku (Mathematical Sciences) 153:12–18

    Google Scholar 

  • Towner MC, Luttbeg B (2007) Alternative statistical approaches to the use of data as evidence for hypotheses in human behavioral ecology. Evol Anthropol 16:107–118

    Article  Google Scholar 

  • Vaida F, Blanchard S (2005) Conditional Akaike information for mixed-effects models. Biometrika 92:351–370

    Article  Google Scholar 

  • Vapnik V, Chervonenkis A (1974) Theory of pattern recognition (in Russian). Nauka, Moscow

    Google Scholar 

  • Ward EJ (2008) A review and comparison of four commonly used Bayesian and maximum likelihood model selection tools. Ecol Modell 211:1–10

    Article  CAS  Google Scholar 

  • Wetherill GB, Duncombe P, Kenward M, Kollerstrom J, Paul SR, Vowden BJ (1986) Regression analysis with applications. Chapman and Hall, London

    Google Scholar 

  • Whiteheat H (2007) Selection of models of lagged identification rates and lagged association rates using AIC and QAIC. Commun Stat, Simul Comput 36:1233–1246

    Article  Google Scholar 

  • Whittingham MJ, Stephens PA, Bradbury RB, Freckleton RP (2006) Why do we still use stepwise modelling in ecology and behaviour? J Anim Ecol 75:1182–1189

    Article  PubMed  Google Scholar 

  • Zucchini W (2000) An introduction to model selection. J Math Psychol 44:41–46

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

I am grateful to D. R. Anderson, R. Freckleton, F. Guthery, R. Montgomerie, S. Nakagawa, and P. Stephens for their constructive comments at the different stages of the manuscript. Special thanks to all referees that participated in the evaluation of the contributed papers (see details at the end of this volume). P. A. Bednekoff kindly assisted during the editorial process and helped obtain reports from independent referees. During this study, I was supported by a “Ramon y Cajal” research grant from the Spanish National Research Council (Consejo Superior de Investigaciones Científicas–CSIC). The Department of Systematic Zoology and Ecology, Eötvös Loránd University, Hungary provided stimulating working place, for which I am indebted to J. Török.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to László Zsolt Garamszegi.

Additional information

Communicated by P. Bednekoff

This contribution is part of the Special Issue “Model selection, multimodel inference and information-theoretic approaches in behavioural ecology”.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Garamszegi, L.Z. Information-theoretic approaches to statistical analysis in behavioural ecology: an introduction. Behav Ecol Sociobiol 65, 1–11 (2011). https://doi.org/10.1007/s00265-010-1028-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00265-010-1028-7

Keywords