Abstract
A negative binomial (NB) distribution is useful to model over-dispersed count data arising from agriculture, health, and pest control. We design purely sequential bounded-risk methodologies to estimate an unknown NB mean \(\mu (>0)\) under different forms of loss functions including customary and modified Linex loss as well as squared error loss. We handle situations when the thatch parameter \(\tau (>0)\) may be assumed known or unknown. Our proposed methodologies are shown to satisfy properties including first-order asymptotic efficiency and first-order asymptotic risk efficiency. Summaries are provided from extensive sets of simulations showing encouraging performances of the proposed methodologies for small and moderate sample sizes. We follow with illustrations obtained by implementing estimation strategies using real data from statistical ecology: (1) weed count data of different species from a field in Netherlands and (2) count data of migrating woodlarks at the Hanko bird sanctuary in Finland.
Similar content being viewed by others
References
Anscombe, F. J. (1949). The statistical analysis of insect counts based on the negative binomial distribution. Biometrics, 5, 165–173.
Anscombe, F. J. (1950). Sampling theory of the negative binomial and logarithmic series distributions. Biometrika, 37, 358–382.
Anscombe, F. J. (1952). Large-sample theory of sequential estimation. Proceedings of Cambridge Philosophical Society, 48, 600–607.
Aoshima, M., Takada, Y. (2000). Second order properties of a two stage procedure for comparing several treatments with a control. Journal of Japan Statistical Society, 30, 27–41.
Aoshima, M., Yata, K. (2010). Asymptotic second order consistency for two stage estimation methodologies and its applications. Annals of the Institute of Statistical Mathematics, 62, 571–600.
Banerjee, S., Mukhopadhyay, N. (2016). A general sequential fixed-accuracy confidence interval estimation methodology for a positive parameter: Illustrations using health and safety data. Annals of the Institute of Statistical Mathematics, 68, 541–570. http://dx.doi.org/10.1007/s10463-015-0504-2.
Bapat, S. R. (2017). Multistage sampling strategies and inference in health studies under appropriate Linex loss functions, Ph.D. thesis, August 2017, Department of Statistics, University of Connecticut, Storrs, Connecticut, USA.
Bliss, C. I., Owen, A. R. G. (1958). Negative binomial distributions with a common \(\kappa \). Biometrika, 45, 37–58.
Chattopadhyay, S. (1998). Sequential estimation of normal mean under asymmetric loss function with a shrinkage stopping rule. Metrika, 48, 53–59.
Chattopadhyay, S. (2000). Sequential estimation of exponential location parameter using an asymmetric loss function. Communications in Statistics-Theory & Methods, 29, 783–795.
Chow, Y. S., Robbins, H. (1965). On the asymptotic theory of fixed width sequential confidence intervals for the mean. Annals of Mathematical Statistics, 36, 457–462.
Chow, Y. S., Robbins, H., Teicher, H. (1965). Moments of randomly stopped sums. Annals of Mathematical Statistics, 36, 789–799.
Chow, Y. S., Hsiung, C., Lai, T. L. (1979). Extended renewal theory and moment convergence in Anscombe’s theorem. Annals of Probability, 7, 304–318.
Ghosh, M., Mukhopadhyay, N. (1979). Sequential point estimation of the mean when the distribution is unspecified. Communications in Statistics-Theory & Methods, 8, 637–652.
Ghosh, M., Mukhopadhyay, N. (1981). Consistency and asymptotic efficiency of two-stage and sequential procedures. Sankhya, Series A, 43, 220–227.
Ghosh, M., Mukhopadhyay, N., Sen, P. K. (1997). Sequential estimation. New York: Wiley.
Heijting, S. (2013). Weed count data from Heijting et al. (2007). DANS. http://dx.doi.org/10.17026/dans-zu9-r7y8.
Heijting, S., Van Der Werf, W., Stein, A., Kropff, M. J. (2007). Are weed patches stable in location? Application of an Explicitly Two-Dimensional Methodology, Weed Research, 47, 381–395. http://dx.doi.org/10.17026/dans-zu9-r7y8v
Kuno, E. (1972). Some notes on population estimation by sequential sampling. Researches on Population Ecology, 14, 58–73.
Linden, A., Mantyniemi, S. (2011). Using negative binomial distribution to model overdispersion in ecological count data. Ecology, 92, 1414–1421.
Mukhopadhyay, N. (1974). Sequential estimation of location parameter in exponential distributions. Calcutta Statistical Association Bulletin, 23, 85–95.
Mukhopadhyay, N. (1978). Sequential point estimation of the mean when the distribution is unspecified, Technical Report #312. School of Statistics: Department of Theoretical Statistics, University of Minnesota, Minneapolis, Minnesota, USA.
Mukhopadhyay, N. (2002). Sequential sampling. In A. H. Sharaawi, W. W. Piegorsch (Eds.), The encyclopedia of environmetrics (pp. 1983–1988). Chichester: Wiley.
Mukhopadhyay, N., Banerjee, S. (2014). Purely sequential and two stage fixed-accuracy confidence interval estimation methods for count data for negative binomial distributions in statistical ecology: One-sample and two-sample problems. Sequential Analysis, 33, 251–285.
Mukhopadhyay, N., Banerjee, S. (2015). Sequential negative binomial problems and statistical ecology: A selected review with new directions. Statistical Methodology, 26, 34–60.
Mukhopadhyay, N., Bapat, S. R. (2016a). Multistage point estimation methodologies for a negative exponential location under a modified linex loss function: Illustrations with infant mortality and bone marrow data. Sequential Analysis, 35, 175–206. http://dx.doi.org/10.1080/07474946.2016.1165532.
Mukhopadhyay, N., Bapat, S. R. (2016b). Multistage estimation of the difference of locations of two negative exponential populations under a modified Linex loss function: Real data illustrations from cancer studies and reliability analysis. Sequential Analysis, 35, 387–412. http://dx.doi.org/10.1080/07474946.2016.1206386.
Mukhopadhyay, N., de Silva, B. M. (2005). Two stage estimation of the mean of a negative binomial distribution with applications to mexican bean beetle data. Sequential Analysis, 24, 99–137.
Mukhopadhyay, N., de Silva, B. M. (2009). Sequential methods and their applications. Boca Raton: CRC Press.
Mukhopadhyay, N., Diaz, J. (1985). Two stage sampling for estimating the mean of a negative binomial distribution. Sequential Analysis, 4, 1–18.
Mukhopadhyay, N., Duggan, W. T. (1997). Can a two-stage procedure enjoy second order properties? Sankhya, Series A, 59, 435–448.
Mukhopadhyay, N., Duggan, W. T. (2000). On a two-stage procedure having second-order properties with applications. Annals of the Institute of Statistical Mathematics, 51, 621–636.
Mukhopadhyay, N., Duggan, W. T. (2001). A two-stage point estimation procedure for the mean of an exponential distribution and second-order results. Statistics & Decisions, 19, 155–171.
Mukhopadhyay, N., Solanky, T. K. S. (1994). Multistage selection and ranking procedures. New York: Dekker.
Mukhopadhyay, N., Zacks, S. (2017). Modified Linex two-stage and purely sequential estimation of the variance in a normal distribution with illustrations using horticultural data. Journal of Statistical Theory and Practice (in press). http://dx.doi.org/10.1080/15598608.2017.1350608.
Mukhopadhyay, N., Datta, S., Chattopadhyay, S. (2004). Applied sequential methodologies, edited volume. New York: Dekker.
R Core Team. (2014). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.
Sen, P. K., Ghosh, M. (1981). Sequential point estimation of estimable parameters based on U-statistics. Sankhya, Series A, 43, 331–344.
Varian, H. R. (1975). A Bayesian approach to real estate assessment. In L. J. Savage, S. E. Fienberg, A. Zellner (Eds.), Studies in Bayesian econometrics and statistics (pp. 195–208). North Holland: Elsevier.
Wiener, N. (1939). The ergodic theorem. Duke Mathematical Journal, 5, 1–18.
Willson, L. J., Folks, J. L. (1983). Sequential estimation of the mean of the negative binomial distribution. Communications in Statistics-Sequential Analysis, 2, 55–70.
Willson, L. J., Folks, J. L., Young, J. H. (1984). Multistage estimation compared with fixed-sample size estimation of the negative binomial parameter \(\kappa \). Biometrics, 40, 109–117.
Zacks, S. (2009). Stage-wise adaptive designs. New York: Wiley.
Zellner, A. (1986). Bayesian estimation and prediction using asymmetric loss functions. Journal of American Statistical Association, 81, 446–451.
Acknowledgements
We are grateful to Andreas Linden for giving us the permission to use bird count data found in Linden and Mantyniemi (2011) paper. We also appreciate Kees Waterman’s approval to use the dataset from Heijting et al. (2007). The Reviewer, the Associate Editor, and the Editor gave us kind encouragements to tighten the original submission by streamlining its focus. We heartily thank them all.
Author information
Authors and Affiliations
Corresponding author
About this article
Cite this article
Mukhopadhyay, N., Bapat, S.R. Purely sequential bounded-risk point estimation of the negative binomial mean under various loss functions: one-sample problem. Ann Inst Stat Math 70, 1049–1075 (2018). https://doi.org/10.1007/s10463-017-0620-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10463-017-0620-2