Skip to main content

Missing Data

  • Chapter
  • First Online:
Unobserved Variables

Part of the book series: SpringerBriefs in Statistics ((BRIEFSSTATIST))

  • 1419 Accesses

Abstract

It is very common for data to be missing and this introduces a risk of bias if inferences are drawn from incomplete samples. However, we are not usually interested in the missing data themselves but in the population characteristics to whose estimation those values were intended to contribute. Learning something about the data that are missing is thus only the first step on the way to inference. One approach is to use a direct method, such as maximum likelihood but the price to be paid is usually much greater complexity in the estimation process. Methods such as the E-M algorithm sometimes make this easier by requiring us to solve a much simpler problem many times as the estimates converge to the desired values. Sometimes it is actually advantageous to introduce hypothetical variables. Which are then treated as unobserved and an example is provided concerning a mixture of exponential distributions. A different kind of approach is to impute values to replace those that are missing. This yields a complete sample which can then be analysed in the usual way. Imputed values can be derived from the conditional distribution of the missing values given those that are observed. This possibility depends upon being able to say something about why some sample members are missing and this may be done by specifying a probabilistic loss mechanism.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood estimation from incomplete data via the EM algorithm(with discussion). Journal of Royal Statistical Society B, 39, 1–38.

    MathSciNet  MATH  Google Scholar 

  • Little, R. J. A., & Rubin, D. B. (2002). Statistical analysis with missing data. New York: Wiley. (1st edn. 1987).

    MATH  Google Scholar 

  • van Buuren, S. (2012). Flexible imputation of missing data. London: Chapman and Hall/CRC Press.

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David J. Bartholomew .

Rights and permissions

Reprints and permissions

Copyright information

© 2013 The Author(s)

About this chapter

Cite this chapter

Bartholomew, D.J. (2013). Missing Data. In: Unobserved Variables. SpringerBriefs in Statistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39912-1_10

Download citation

Publish with us

Policies and ethics