Encyclopedia of Social Network Analysis and Mining

2018 Edition
Editors: Reda Alhajj, Jon Rokne

Imputation of Missing Network Data

  • Mark Huisman
  • Robert W. Krause
Reference work entry
DOI: https://doi.org/10.1007/978-1-4939-7131-2_394

Glossary

Actor Non-response (Unit Non-response)

Missing all outgoing ties of an actor

Imputation

Substituting missing data by plausible values

MAR

Missing at random

MCAR

Missing completely at random

MNAR

Missing not at random

Multiple Imputation

Repeated stochastic imputation of a dataset to generate multiple completed datasets. These completed datasets are analyzed separately, after which the results of the analysis are pooled to generate proper estimates of parameters and standard errors

Tie Non-response (Item Non-response)

Missing some ties of an actor
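
The "Multiple Imputation" entry above describes a three-step cycle: impute repeatedly, analyze each completed dataset, then pool. The following is a minimal sketch of that cycle for a small directed network, not the procedure of any cited work. It rests on several assumptions made purely for illustration: missing ties are coded as None, each missing tie is drawn at random with probability equal to the observed density (a deliberately simple imputation model), the analyzed statistic is network density, and its within-imputation variance is approximated by a binomial formula that treats dyads as independent. Pooling follows the combining rules of Rubin (1987). All function names and the example network are hypothetical.

```python
import random
import statistics


def impute_once(adj, rng):
    """Return a completed copy of adj; None entries are filled by Bernoulli draws."""
    n = len(adj)
    observed = [adj[i][j] for i in range(n) for j in range(n)
                if i != j and adj[i][j] is not None]
    density = sum(observed) / len(observed)  # observed-tie density (assumed model)
    completed = [row[:] for row in adj]
    for i in range(n):
        for j in range(n):
            if i != j and completed[i][j] is None:
                completed[i][j] = 1 if rng.random() < density else 0
    return completed


def density_with_variance(adj):
    """Density of a completed network and a crude binomial variance for it."""
    n = len(adj)
    dyads = n * (n - 1)
    ties = sum(adj[i][j] for i in range(n) for j in range(n) if i != j)
    d = ties / dyads
    return d, d * (1 - d) / dyads  # assumes independent dyads (illustration only)


def rubin_pool(estimates, variances):
    """Pool m estimates: mean estimate, total variance W + (1 + 1/m) * B."""
    m = len(estimates)
    q_bar = statistics.fmean(estimates)
    within = statistics.fmean(variances)
    between = statistics.variance(estimates) if m > 1 else 0.0
    return q_bar, within + (1 + 1 / m) * between


def multiply_impute(adj, m=20, seed=1):
    """Impute m times, analyze each completed network, pool the results."""
    rng = random.Random(seed)
    results = [density_with_variance(impute_once(adj, rng)) for _ in range(m)]
    return rubin_pool([r[0] for r in results], [r[1] for r in results])


# Hypothetical 4-actor network: actor 1 has one missing tie (tie non-response),
# actor 3 reported no outgoing ties at all (actor non-response).
network = [
    [0, 1, 0, 1],
    [1, 0, None, 0],
    [0, 1, 0, 1],
    [None, None, None, 0],
]

pooled_density, total_variance = multiply_impute(network)
print(pooled_density, total_variance ** 0.5)  # pooled estimate and its standard error
```

The between-imputation term is what separates multiple from single imputation: it carries the uncertainty caused by the missing ties into the pooled standard error. A realistic application would replace the density-based draws with an imputation model that respects network structure.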

Definition

When confronted with missing data, researchers often want to handle the missing observations by substituting plausible values for the missing scores. This practice of filling in missing items is called imputation (e.g., Schafer and Graham 2002). Imputation has several advantages: it is more...

References

  1. Barabasi A-L, Albert R (1999) Emergence of scaling in random networks. Science 286:509–512
  2. Borgatti SP, Molina JL (2003) Ethical and strategic issues in organizational social network analysis. J Appl Behav Sci 39:337–349
  3. Borgatti SP, Carley KM, Krackhardt D (2006) On the robustness of centrality measures under conditions of imperfect data. Soc Netw 28:124–136
  4. Burt RS (1987) A note on missing network data in the general social survey. Soc Netw 9:63–73
  5. Butts CT (2003) Network inference, error, and informant (in)accuracy: a Bayesian approach. Soc Netw 25:103–140
  6. Costenbader E, Valente TW (2003) The stability of centrality measures when networks are sampled. Soc Netw 25:283–307
  7. De Leeuw ED, Hox JJ, Huisman M (2003) Prevention and treatment of item nonresponse. J Off Stat 19:153–176
  8. Dempster AP, Rubin DB (1983) Overview. In: Madow WG, Olkin I, Rubin DB (eds) Incomplete data in sample surveys vol II: theory and bibliographies. Academic, New York, pp 3–10
  9. Graham JW (2009) Missing data analysis: making it work in the real world. Annu Rev Psychol 60:549–576
  10. Guimerà R, Sales-Pardo M (2009) Missing and spurious interactions and the reconstruction of complex networks. Proc Natl Acad Sci 106:22073–22078
  11. Handcock MS, Gile KJ (2010) Modeling social networks from sampled data. Ann Appl Stat 4:5–25
  12. Hipp JR, Wang C, Butts CT, Jose R, Lakon CM (2015) Research note: the consequences of different methods for handling missing network data in stochastic actor based models. Soc Netw 41:56–71
  13. Hoff P (2009) Multiplicative latent factor models for description and prediction of social networks. Comput Math Organ Theory 15:261–272
  14. Huisman M (2009) Imputation of missing network data: some simple procedures. J Soc Struct 10:1–29
  15. Huisman M, Steglich CEG (2008) Treatment of non-response in longitudinal network studies. Soc Netw 30:297–308
  16. Kim M, Leskovec J (2011) The network completion problem: inferring missing nodes and edges in networks. In: SIAM international conference on data mining (SDM), Mesa, pp 47–58
  17. Koskinen JH, Robins GL, Pattison PE (2010) Analysing exponential random graph (p-star) models with missing data using Bayesian data augmentation. Stat Method 7:366–384
  18. Koskinen JH, Robins GL, Wang P, Pattison PE (2013) Bayesian analysis for partially observed network data, missing ties, attributes and actors. Soc Netw 35:514–527
  19. Kossinets G (2006) Effects of missing data in social networks. Soc Netw 28:247–268
  20. Liben-Nowell D, Kleinberg J (2007) The link-prediction problem for social networks. J Am Soc Inf Sci Technol 58:1019–1031
  21. Lusher D, Koskinen JH, Robins GL (eds) (2013) Exponential random graphs models for social networks, Structural analysis in the social sciences, vol 35. Cambridge University Press, Cambridge
  22. Ouzienko V, Obradovic Z (2011) Imputation of missing links and attributes in longitudinal social surveys. In: IEEE international conference on data mining workshops, Vancouver, pp 957–964
  23. Robins G, Pattison P, Woolcock J (2004) Missing data in networks: exponential random graph (p∗) models for networks with non-respondents. Soc Netw 26:257–283
  24. Rubin DB (1987) Multiple imputation for nonresponse in surveys. Wiley, New York
  25. Sande IG (1982) Imputation in surveys: coping with reality. Am Stat 36:145–152
  26. Schafer JL, Graham JW (2002) Missing data: our view of the state of the art. Psychol Methods 7:147–177
  27. Smith JA, Moody J (2013) Structural effects of network sampling coverage I: nodes missing at random. Soc Netw 35:652–668
  28. Smith JA, Moody J, Morgan JH (2017) Network sampling coverage II: the effect of non-random missing data on network measurement. Soc Netw 48:78–99
  29. Snijders TAB (2005) Models for longitudinal network data. In: Carrington PJ, Scott J, Wasserman S (eds) Models and methods in social network analysis. Cambridge University Press, Cambridge, pp 215–247
  30. Stork D, Richards WD (1992) Nonrespondents in communication network studies. Group Org Manag 17:193–209
  31. van Buuren S (2012) Flexible imputation of missing data. Chapman & Hall/CRC, Boca Raton
  32. Wang DJ, Shi X, McFarland DA, Leskovec J (2012) Measurement error in network data: a re-classification. Soc Netw 34:396–409
  33. Wang C, Butts CT, Hipp JR, Jose R, Lakon CM (2016) Multiple imputation for missing edge data: a predictive evaluation method with application to add health. Soc Netw 45:89–98
  34. Žnidaršič A, Doreian P, Ferligoj A (2012a) Absent ties in social networks, their treatments, and blockmodeling outcomes. Metodološki Zvezki 9:119–138
  35. Žnidaršič A, Ferligoj A, Doreian P (2012b) Non-response in social networks: the impact of different non-response treatments on the stability of blockmodels. Soc Netw 34:438–450
  36. Žnidaršič A, Ferligoj A, Doreian P (2017) Actor non-response in valued social networks: the impact of different non-response treatments on the stability of blockmodels. Soc Netw 48:46–56

Copyright information

© Springer Science+Business Media LLC, part of Springer Nature 2018

Authors and Affiliations

  1. Department of Sociology/ICS, University of Groningen, Groningen, The Netherlands

Section editors and affiliations

  • V. S. Subrahmanian, University of Maryland, College Park, USA
  • Jeffrey Chan, RMIT (Royal Melbourne Institute of Technology), Melbourne, Australia