Abstract
Scientific thinking may require the consideration of multiple hypotheses, which often call for complex statistical models at the level of data analysis. The aim of this introduction is to provide a brief overview on how competing hypotheses are evaluated statistically in behavioural ecological studies and to offer potentially fruitful avenues for future methodological developments. Complex models have traditionally been treated by model selection approaches using threshold-based removal of terms, i.e. stepwise selection. A recently introduced method for model selection applies an information-theoretic (IT) approach, which simultaneously evaluates hypotheses by balancing between model complexity and goodness of fit. The IT method has been increasingly propagated in the field of ecology, while a literature survey shows that its spread in behavioural ecology has been much slower, and model simplification using stepwise selection is still more widespread than IT-based model selection. Why has the use of IT methods in behavioural ecology lagged behind other disciplines? This special issue examines the suitability of the IT method for analysing data with multiple predictors, which researchers encounter in our field. The volume brings together different viewpoints to aid behavioural ecologists in understanding the method, with the hope of enhancing the statistical integration of our discipline.


Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Adolph SC, Hardin JS (2007) Estimating phenotypic correlations: correcting for bias due to intraindividual variability. Funct Ecol 21:178–184
Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: Csáki F (ed) 2nd International Symposium on Information Theory. Akadémiai Kiadó, Budapest, pp 267–281
Allen DM (1974) The relationship between variable selection and data augmentation and a method for prediction. Technometrics 16:125–127
Anderson DR (2008) Model based inference in the life sciences: a primer on evidence. Springer, New York
Anderson DR, Burnham KP (2002) Avoiding pitfalls when using information-theoretic methods. J Wildl Manage 66:910–916
Anderson DR, Burnham KP, Thompson WL (2000) Null hypothesis testing: problems, prevalence, and an alternative. J Wildl Manage 64:912–923
Bell AM, Hankison SJ, Laskowski KL (2009) The repeatability of behaviour: a meta-analysis. Anim Behav 77:771–783
Berger JO, Wolpert RL (1984) The likelihood principle. Institute of Mathematical Statistics, Hayward
Burnham KP, Anderson DR (2002) Model selection and multimodel inference: a practical information-theoretic approach. Springer, New York
Burnham K, Anderson D, Huyvaert K (2010) Improving inferences in ecological and behavioral science: some background, observations, and comparisons. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1029-6
Cézilly F, Danchin É, Giraldeau L-A (2008) Research methods in behavioural ecology. In: Danchin É, Giraldeau L-A, Cézilly F (eds) Behavioural ecology: an evolutionary perspective on behaviour. Oxford University Press, Oxford, pp 55–95
Chamberlin TC (1890) The method of multiple working hypotheses. Science 15:92–96
Claeskens C, Hjort NL (2008) Model selection and model averaging. Cambridge University Press, Cambridge
Craven P, Wahba G (1979) Smoothing noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation. Numerische Mathematik 31:377–403
Crawley MJ (2007) The R book. Wiley, West Sussex
Derksen S, Keselman HJ (1992) Backward, forward and stepwise automated subset-selection algorithms—frequency of obtaining authentic and noise variables. Br J Math Stat Psychol 45:265–282
Dochtermann N, Jenkins SH (2010) Developing and evaluating candidate hypotheses in behavioral ecology. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1039-4
Draper NR, Smith H (1981) Applied regression analysis, 2nd edn. Wiley, New York
Forster MR (2000) Key concepts in model selection: performance and generalizability. J Math Psychol 44:205–231
Forstmeier W, Schielzeth H (2010) Cryptic multiple hypotheses testing in linear models: overestimated effect sizes and the winner’s curse. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1038-5
Fox J (2002) An R and S-PLUS companion to applied regression. Sage, Newbury Park
Freckleton RP (2010) Dealing with collinearity in behavioural and ecological data: model averaging and the problems of measurement error. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1045-6
Garamszegi LZ, Calhim S, Dochtermann N, Hegyi G, Hurd PL, Jørgensen C, Kutsukake N, Lajeunesse MJ, Pollard KA, Schielzeth H, Symonds MRE, Nakagawa S (2009) Changing philosophies and tools for statistical inferences in behavioral ecology. Behav Ecol 20:1363–1375
Gelman A, Hill J (2007) Data analysis using regression and multilevel/hierarchical models. Cambridge University Press, Cambridge
Ginzburg LR, Jensen CXJ (2004) Rules of thumb for judging ecological theories. Trends Ecol Evol 19:121–126
Graham MH (2003) Confronting multicollinearity in ecological multiple regression. Ecology 84:2809–2815
Guthery FS (2007) Deductive and inductive methods of accumulating reliable knowledge in wildlife science. J Wildl Manage 71:222–225
Guthery FS, Brennan LA, Peterson MJ, Lusk JJ (2005) Information theory in wildlife science: critique and viewpoint. J Wildl Manage 69:457–465
Harvey PH, Pagel MD (1991) The comparative method in evolutionary biology. Oxford University Press, Oxford
Hegyi G, Garamszegi LZ (2010) Stepwise selection and information theory in ecology and behavior. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1036-7
Hilborn R, Mangel M (1997) The ecological detective: confronting models with data. Princeton University Press, Princeton
Hobbs NT, Hilborn R (2006) Alternatives to statistical hypothesis testing in ecology: a guide to self teaching. Ecol Appl 16:5–19
Huston MA (1997) Hidden treatments in ecological experiments: re-evaluating the ecosystem function of biodiversity. Oecologia 110:449–460
Johnson JB, Omland KS (2004) Model selection in ecology and evolution. Trends Ecol Evol 19:101–108
Jones KS, Nakagawa S, Sheldon BC (2009) Environmental sensitivity in relation to size and sex in birds: meta-regression analysis. Am Nat 174:122–133
Konishi S, Kitagawa G (2008) Information criteria and statistical modeling. Springer, New York
Krebs JR, Davies NB (1984) Behavioural ecology: an evolutionary approach. Blackwell Scientific, Oxford
Lajeunesse MJ (2009) Meta-analysis and the comparative phylogenetic method. Am Nat 174:369–381
Lebreton J-D, Burnham KP, Clobert J, Anderson DR (1992) Modeling survival and testing biological hypotheses using marked animals: a unified approach with case studies. Ecol Monogr 62:67–118
Liang H, Wu HL, Zou GH (2008) A note on conditional AIC for linear mixed-effects models. Biometrika 95:773–778
Linhart H, Zucchini W (1986) Model selection. Wiley, New York
Lukacs PM, Thompson WL, Kendall WL, Gould WR, Doherty PF, Burnham KP, Anderson DR (2007) Concerns regarding a call for pluralism of information theory and hypothesis testing. J Appl Ecol 44:456–460
Mallows CL (1973) Some comments on Cp. Technometrics 15:661–675
Massart P (2007) Concentration inequalities and model selection: ecole d’eté de probabilités de Saint-Flour XXXIII—2003. Springer, Berlin
McArdle BH (2003) Lines, models, and errors: regression in the field. Limnol Oceanogr 48:1363–1366
McCarthy MA (2007) Bayesian methods for ecology. Cambridge University Press, Cambridge
McQuarrie ADR, Tsai C-L (1998) Regression and time series model selection. World Scientific, Singapore
Mundry R (2010) Issues in information theory based statistical inference—a commentary from a frequentist’s perspective. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1040-y
Mundry R, Nunn CL (2008) Stepwise model fitting and statistical inference: turning noise into signal pollution. Am Nat 173:119–123
Murtaugh PA (2009) Performance of several variable-selection methods applied to real ecological data. Ecol Lett 12:1061–1068
Nakagawa S, Freckleton R (2008) Missing inaction: the dangers of ignoring missing data. Trends Ecol Evol 23:592–596
Nakagawa S, Freckleton RP (2010) Model averaging, missing data and multiple imputation: a case study for behavioural ecology. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1044-7
O’Hara RB, Sillanpää MJ (2009) A review of Bayesian variable selection methods: what, how, and which. Bayesian Analysis 4:85–118
Owens IPF (2006) Where is behavioural ecology going? Trends Ecol Evol 21:356–361
Platt JR (1964) Strong inference. Science 146:347–353
Popper KR (1963) Conjectures and refutations. Routledge and Keagan Paul, London
Pötscher BM (1989) Model selection under nonstationary: autoregressive models and stochastic linear regression models. Ann Stat 17:1257–1274
Quinn JF, Dunham AE (1983) On hypothesis testing in ecology and evolution. Am Nat 122:602–617
Quinn GP, Keough MJ (2002) Experimental design and data analysis for biologists. Cambridge University Press, Cambridge
Rabosky DL (2006) Likelihood methods for detecting temporal shifts in diversification rates. Evolution 60:1152–1164
Rao CR, Wu Y (1989) A strongly consistent procedure for model selection in a regression problem. Biometrika 76:369–374
Richards SA, Whittingham MJ, Stephens PA (2010) Model selection and model averaging in behavioural ecology: the utility of the IT-AIC framework. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1035-8
Ripplinger J, Sullivan J (2008) Does choice in model selection affect maximum likelihood analysis? Syst Biol 57:76–85
Rissanen J (1978) Modeling by shortest data description. Automatica 14:465–471
Royall MR (1997) Statistical evidence: a likelihood paradigm. Chapman and Hall, London
Rushton SP, Ormerod SJ, Kerby G (2004) New paradigms for modelling species distributions? J Appl Ecol 41:193–200
Sakamoto Y (1991) Categorical data analysis by AIC. KTK Scientific, Tokyo
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464
Shibata R (1981) An optimal selection of regression variables. Biometrika 68:45–54
Sokal RR, Rohlf FJ (1995) Biometry, 3rd edn. Freeman, New York
Steidl RJ (2006) Model selection, hypothesis testing, and risks of condemning analytical tools. J Wildl Manage 70:1497–1498
Stephens PA, Buskirk SW, Hayward GD, Del Rio CM (2005) Information theory and hypothesis testing: a call for pluralism. J Appl Ecol 42:4–12
Stephens PA, Buskirk SW, del Rio CM (2007a) Inference in ecology and evolution. Trends Ecol Evol 22:192–197
Stephens PA, Buskirk SW, Hayward GD, Del Rio CM (2007b) A call for statistical pluralism answered. J Appl Ecol 44:461–463
Stone M (1974) Cross-validatory choice and assessment of statistical predictions. Jean-le-Blanc Journal of the Royal Statistical Society, Series B 36:111–147
Sugiura N (1978) Further analysis of the data by Akaike’s information and the finite corrections. Commun Stat A7:13–26
Symonds M, Moussalli A (2010) Model selection, multimodel inference and model averaging using Akaike’s information criterion: an introduction for statistically terrified behavioural ecologists. Behav Ecol Sociobiol. doi:10.1007/s00265-010-1037-6
Takeuchi K (1976) Distribution of informational statistics and a criterion of model fitting (in Japanese). Suri-Kagaku (Mathematical Sciences) 153:12–18
Towner MC, Luttbeg B (2007) Alternative statistical approaches to the use of data as evidence for hypotheses in human behavioral ecology. Evol Anthropol 16:107–118
Vaida F, Blanchard S (2005) Conditional Akaike information for mixed-effects models. Biometrika 92:351–370
Vapnik V, Chervonenkis A (1974) Theory of pattern recognition (in Russian). Nauka, Moscow
Ward EJ (2008) A review and comparison of four commonly used Bayesian and maximum likelihood model selection tools. Ecol Modell 211:1–10
Wetherill GB, Duncombe P, Kenward M, Kollerstrom J, Paul SR, Vowden BJ (1986) Regression analysis with applications. Chapman and Hall, London
Whiteheat H (2007) Selection of models of lagged identification rates and lagged association rates using AIC and QAIC. Commun Stat, Simul Comput 36:1233–1246
Whittingham MJ, Stephens PA, Bradbury RB, Freckleton RP (2006) Why do we still use stepwise modelling in ecology and behaviour? J Anim Ecol 75:1182–1189
Zucchini W (2000) An introduction to model selection. J Math Psychol 44:41–46
Acknowledgements
I am grateful to D. R. Anderson, R. Freckleton, F. Guthery, R. Montgomerie, S. Nakagawa, and P. Stephens for their constructive comments at the different stages of the manuscript. Special thanks to all referees that participated in the evaluation of the contributed papers (see details at the end of this volume). P. A. Bednekoff kindly assisted during the editorial process and helped obtain reports from independent referees. During this study, I was supported by a “Ramon y Cajal” research grant from the Spanish National Research Council (Consejo Superior de Investigaciones Científicas–CSIC). The Department of Systematic Zoology and Ecology, Eötvös Loránd University, Hungary provided stimulating working place, for which I am indebted to J. Török.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by P. Bednekoff
This contribution is part of the Special Issue “Model selection, multimodel inference and information-theoretic approaches in behavioural ecology”.
Rights and permissions
About this article
Cite this article
Garamszegi, L.Z. Information-theoretic approaches to statistical analysis in behavioural ecology: an introduction. Behav Ecol Sociobiol 65, 1–11 (2011). https://doi.org/10.1007/s00265-010-1028-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00265-010-1028-7

