Abstract
The need to count things is ubiquitous, so data in the form of counts arise often in practice. Examples include: the number of alpha particles emitted from a source of radiation in a given time; the number of cases of leukemia reported per year in a certain jurisdiction; the number of flaws per metre of electrical cable. This chapter is concerned with counts when the individual events being counted are independent, or nearly so, and where there is no clear upper limit for the number of events that can occur, or where the upper limit is very much greater than any of the actual counts. We first compile important information about the Poisson distribution, the distribution most often used with count data. Poisson regression, or models for count data described by covariates, has already been covered elsewhere. We then focus on describing models for rates and models for counts organized in tables. Overdispersion is then discussed, including a discussion negative binomial glmsĀ and quasi-Poisson models as alternative models.
Poor data and good reasoning give poor results.
Good data and poor reasoning give poor results.
Poor data and poor reasoning give rotten results.
E. C. BerkeleyĀ [ 4 , p.Ā 20]
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agresti, A.: An Introduction to Categorical Data Analysis, second edn. Wiley-Interscience, New York (2007)
Andersen, E.B.: Multiplicative Poisson models with unequal cell rates. Scandinavian Journal of Statistics 4, 153ā158 (1977)
Appleton, D.R., French, J.M., Vanderpump, M.P.J.: Ignoring a covariate: An example of Simpsonās paradox. The American Statistician 50, 340ā341 (1996)
Berkeley, E.C.: Right answersāa short guide for obtaining them. Computers and Automation 18(10) (1969)
Brockmann, H.J.: Satellite male groups in horseshoe crabs, limulus polyphemus. Ethology 102, 1ā21 (1996)
Brunswick, A.F.: Adolescent health, sex, and fertility. American Journal of Public Health 61(4), 711ā729 (1971)
Canadian CancerĀ Society: Canadian cancer statistics 2000. Published on the internet: www.cancer.ca/stats2000/tables/tab5e.htm (2000). Accessed 19Ā September 2001
Charig, C.R., Webb, D.R., Payne, S.R., Wickham, J.E.A.: Comparison of treatment of renal calculi by open surgery, percutaneous nephrolithotomy, and extracorporeal shockwave lithotripsy. British Medical Journal 292, 879ā882 (1986)
Christensen, R.: Log-Linear Models. Springer Texts in Statistics. Springer, New York (2013)
Davison, A.C.: Statistical Models. Cambridge University Press, UK (2003)
Dunn, P.K.: Contingency tables and log-linear models. In: K.Ā Kempf-Leonard (ed.) Encyclopedia of Social Measurement, pp. 499ā506. Elsevier (2005)
Dunn, P.K., Smyth, G.K.: Randomized quantile residuals. Journal of Computational and Graphical Statistics 5(3), 236ā244 (1996)
Ellison, A.M.: Bayesian inference in ecology. Ecology Letters 7, 509ā520 (2004)
Everitt, B.S., Hothorn, T.: A Handbook of Statistical Analyses using, second edn. Chapman & Hall/CRC, Boca Raton, FL (2010)
Everitt, B.S., Smith, A.M.R.: Interactions in contingency tables: A brief discussion of alternative definitions. Psychological Medicine 9, 581ā583 (1979)
Fienberg, S.: The Analysis of Cross-Classified Categorical Data. Springer, New York (2007)
Giardiello, F.M., Hamilton, S.R., Krush, A.J., Piantadosi, S., Hylind, L.M., Celano, P., Booker, S.V., Robinson, C.R., Johan, G., Offerhaus, A.: Treatment of colonic and rectal adenomas with sulindac in famial adenomatous polyposis. New England Journal of Medicine 328(18), 1313ā1316 (1993)
Gotelli, N.J., Ellison, A.M.: Biogeography at a regional scale: Determinants of ant species density in bogs and forests of New England. Ecology 83(6), 1604ā1609 (2002)
Hand, D.J., Daly, F., Lunn, A.D., McConway, K.Y., Ostrowski, E.: A Handbook of Small Data Sets. Chapman and Hall, London (1996)
Health Department of Western Australia: Annual report 1997/1998āhealth of Western Australiansāmortality and survival. Published on the internet: www.health.wa.gov.au/Publications/annualreport_9798/. Accessed 19Ā September 2001
Howes, A.L., Maron, M., McAlpine, C.A.: Bayesian networks and adaptive management of wildlife habitat. Conservation Biology 24(4), 974ā983 (2010)
Hutchinson, M.K., Holtman, M.C.: Analysis of count data using Poisson regression. Research in Nursing and Health 28, 408ā418 (2005)
Jorgensen, D.W.: Multiple regression analysis of a Poisson process. Journal of the American Statistical Association 56(294), 235ā245 (1961)
Julious, S.A., Mullee, M.A.: Confounding and Simpsonās paradox. British Medical Journal 309(1480), 1480ā1481 (1994)
King, G.: Statistical models for political science event counts: Bias in conventional procedures and evidence for the exponential Poisson regression model. American Journal of Political Science 32(3), 838ā863 (1988)
Lindsey, J.K.: Modelling Frequency and Count Data. No.Ā 15 in Oxford Statistical Science Series. Clarendon Press, Oxford (1995)
Lovett, A.A., Gatrell, A.C.: The geography of spina bifida in England and Wales. Transactions of the Institute of British Geographers (New Series) 13(3), 288ā302 (1988)
Luo, D., Wood, G.R., Jones, G.: Visualising contingency table data. The Australian Mathematical Society Gazette 31(4), 258ā262 (2004)
Maag, J.W., Behrens, J.T.: Epidemiologic data on seriously emotionally disturbed and learning disabled adolescents: Reporting extreme depressive symptomatology. Behavioral Disorders 15(1) (1989)
Maron, M.: Threshold effect of eucalypt density on an aggressive avian competitor. Biological Conservation 136, 100ā107 (2007)
Norton, J., Lawrence, G., Wood, G.: The Australian publicās perception of genetically-engineered foods. Australasian Biotechnology pp. 172ā181 (1998)
Pettifor, R.A.: Brood-manipulation experiments. I.Ā The number of offspring surviving per nest in blue tits (Parus caeruleus). Journal of Animal Ecology 62, 131ā144 (1993)
Piantadosi, S.: Clinical Trials: A Methodologic Perspective, second edn. John Wiley and Sons, New York (2005)
Siegel, R.L., Miller, K.D., Jemal, A.: Cancer statistics, 2015. CA: A Cancer Journal for Clinicians 65(1), 5ā29 (2015)
Smith, P.T., Heitjan, D.F.: Testing and adjusting for departures from nominal dispersion in generalized linear models. Journal of the Royal Statistical Society, SeriesĀ C 42(1), 31ā41 (1993)
Smyth, G.K.: Australasian data and story library (Ozdasl) (2011). URLĀ http://www.statsci.org/data
Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S, fourth edn. Springer-Verlag, New York (2002). URL http://www.stats.ox.ac.uk/pub/MASS4
Whittemore, A.S., Gong, G.: Poisson regression with misclassified counts: Applications to cervical cancer mortality rates. Journal of the Royal Statistical Society, SeriesĀ C 40(1), 81ā93 (1991)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
Ā© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
Dunn, P.K., Smyth, G.K. (2018). Chapter 10: Models for Counts: Poisson and Negative Binomial GLMs. In: Generalized Linear Models With Examples in R. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-0118-7_10
Download citation
DOI: https://doi.org/10.1007/978-1-4419-0118-7_10
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-0117-0
Online ISBN: 978-1-4419-0118-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)