Missing Data

Bartholomew, David J.

doi:10.1007/978-3-642-39912-1_10

Part of the book series: SpringerBriefs in Statistics (BRIEFSSTATIST)

Abstract

It is very common for data to be missing, and this introduces a risk of bias if inferences are drawn from incomplete samples. However, we are not usually interested in the missing data themselves but in the population characteristics to whose estimation those values were intended to contribute. Learning something about the data that are missing is thus only the first step on the way to inference. One approach is to use a direct method, such as maximum likelihood, but the price to be paid is usually much greater complexity in the estimation process. Methods such as the E-M algorithm sometimes make this easier by requiring us to solve a much simpler problem many times as the estimates converge to the desired values. Sometimes it is actually advantageous to introduce hypothetical variables, which are then treated as unobserved; an example is provided concerning a mixture of exponential distributions. A different kind of approach is to impute values to replace those that are missing. This yields a complete sample which can then be analysed in the usual way. Imputed values can be derived from the conditional distribution of the missing values given those that are observed. This possibility depends upon being able to say something about why some sample members are missing, and this may be done by specifying a probabilistic loss mechanism.
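
The idea of introducing a hypothetical unobserved variable can be illustrated with a minimal sketch of the E-M algorithm for a two-component mixture of exponential distributions. This is not the chapter's own worked example: the function name, the starting values and the stopping rule are assumptions made for illustration. The hypothetical variable is the unobserved component label of each observation. The E-step computes the probability that each observation came from the first component, and the M-step solves the much simpler weighted complete-data problem.

```python
import math


def em_exponential_mixture(data, n_iter=200, tol=1e-8):
    """E-M for the density p*r1*exp(-r1*x) + (1-p)*r2*exp(-r2*x), x >= 0.

    Returns (p, r1, r2, ll), where ll is the log-likelihood evaluated
    in the last E-step (i.e. before the final M-step update).
    """
    n = len(data)
    mean = sum(data) / n
    # Illustrative starting values (an assumption): equal weights and
    # rates placed on either side of the single-exponential estimate.
    p, r1, r2 = 0.5, 2.0 / mean, 0.5 / mean
    prev_ll = -math.inf
    ll = prev_ll
    for _ in range(n_iter):
        # E-step: posterior probability that each x belongs to component 1.
        resp = []
        ll = 0.0
        for x in data:
            a = p * r1 * math.exp(-r1 * x)
            b = (1.0 - p) * r2 * math.exp(-r2 * x)
            resp.append(a / (a + b))
            ll += math.log(a + b)
        # M-step: weighted maximum likelihood for the complete-data problem.
        s1 = sum(resp)
        s2 = n - s1
        p = s1 / n
        r1 = s1 / sum(w * x for w, x in zip(resp, data))
        r2 = s2 / sum((1.0 - w) * x for w, x in zip(resp, data))
        # Stop once the log-likelihood has essentially stopped increasing.
        if ll - prev_ll < tol:
            break
        prev_ll = ll
    return p, r1, r2, ll
```

Each iteration has a closed form, which is the sense in which a much simpler problem is solved many times. A useful check is that after every M-step the fitted mixture mean, p/r1 + (1-p)/r2, equals the sample mean.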

References

Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood estimation from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society B, 39, 1–38.
Little, R. J. A., & Rubin, D. B. (2002). Statistical analysis with missing data. New York: Wiley. (1st edn. 1987).
van Buuren, S. (2012). Flexible imputation of missing data. London: Chapman and Hall/CRC Press.

Author information

Authors and Affiliations

London School of Economics, London, UK
David J. Bartholomew

Corresponding author

Correspondence to David J. Bartholomew.

About this chapter

Cite this chapter

Bartholomew, D.J. (2013). Missing Data. In: Unobserved Variables. SpringerBriefs in Statistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39912-1_10

DOI: https://doi.org/10.1007/978-3-642-39912-1_10
Published: 08 September 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39911-4
Online ISBN: 978-3-642-39912-1
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
