Missing data imputation in PLS-SEM

Quality & Quantity

Abstract

PLS-SEM is widely used in business research to assess causal-predictive relationships when developing and testing theories. Nevertheless, PLS-SEM researchers routinely ignore the less error-prone techniques for handling missing data. In this paper, we propose an imputation method, called EM PLS-SEM, to deal with missing values in PLS-SEM. The method takes advantage of the PLS-SEM estimation procedure to fill the missing elements with the values that are most likely to appear. Numerical studies verify that the proposed method outperforms the alternatives considered in data completion and model fitting.
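The general idea of alternating between estimating a model and re-imputing the missing values can be illustrated with a minimal Python sketch. This is not the authors' EM PLS-SEM procedure, whose details are not given in this preview: the PLS-SEM estimation step is swapped for an ordinary least squares fit with a single predictor, and the function names `fit_line` and `iterative_impute` are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = a + b * x; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def iterative_impute(xs, ys, tol=1e-10, max_iter=500):
    """Fill None entries of ys by alternating model fitting and re-imputation.

    Assumption: a stand-in regression model replaces the PLS-SEM estimation.
    """
    missing = [i for i, y in enumerate(ys) if y is None]
    observed = [y for y in ys if y is not None]
    # Start from mean imputation of the observed responses.
    start = sum(observed) / len(observed)
    filled = [start if y is None else y for y in ys]
    for _ in range(max_iter):
        # Estimation step: fit the model on the currently completed data.
        a, b = fit_line(xs, filled)
        # Imputation step: replace missing cells by the model's predictions.
        change = 0.0
        for i in missing:
            new_value = a + b * xs[i]
            change = max(change, abs(new_value - filled[i]))
            filled[i] = new_value
        if change < tol:
            break
    return filled


xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [3.0, 5.0, 7.0, None, 11.0, None]
completed = iterative_impute(xs, ys)
```

In this toy setting, where only the response has gaps, the loop settles on the line fitted to the complete cases, so the imputed values approach the predictions of that line. Observed entries are never modified.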

Data availability

Enquiries about data availability should be directed to the authors.

Notes

  1. KNN imputation and regression imputation are implemented with the R packages “DMwR” and “mice”, respectively.

  2. The generation is performed with the mvrnorm function in R.
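For readers who do not use R, the kind of generation that mvrnorm performs (drawing rows from a multivariate normal distribution) can be sketched in plain Python. This is a sketch under stated assumptions, not the authors' simulation code: the mean vector, covariance matrix, and sample size are made up, the helper names `cholesky`, `sample_mvn`, and `mask_at_random` are hypothetical, and masking cells independently at a fixed rate is an assumed missingness mechanism that this preview does not specify.

```python
import math
import random


def cholesky(cov):
    """Lower-triangular factor L with L L^T == cov (cov positive definite)."""
    n = len(cov)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(cov[i][i] - s)
            else:
                L[i][j] = (cov[i][j] - s) / L[j][j]
    return L


def sample_mvn(n, mean, cov, rng):
    """Draw n rows from N(mean, cov) as mean + L z, with z standard normal."""
    L = cholesky(cov)
    rows = []
    for _ in range(n):
        z = [rng.gauss(0.0, 1.0) for _ in mean]
        rows.append([m + sum(L[i][k] * z[k] for k in range(i + 1))
                     for i, m in enumerate(mean)])
    return rows


def mask_at_random(rows, rate, rng):
    """Replace each cell by None independently with probability `rate`."""
    return [[None if rng.random() < rate else v for v in row] for row in rows]


rng = random.Random(2022)            # fixed seed for reproducibility
cov = [[4.0, 2.0], [2.0, 3.0]]       # illustrative covariance matrix
data = sample_mvn(500, [1.0, -1.0], cov, rng)
incomplete = mask_at_random(data, 0.1, rng)  # assumed 10% missing rate
```

Keeping the complete `data` next to `incomplete` makes it possible to score any imputation method against the true values.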

Funding

This research was financially supported by the National Natural Science Foundation of China (Grant Nos. 72021001 and 72001222). S. Lu is a member of the Financial Sustainable Development Research Team at the Central University of Finance and Economics (CUFE). S. Lu also acknowledges support from the Emerging Interdisciplinary Project of CUFE, the Program for Innovation Research in Central University of Finance and Economics, and the Disciplinary Funding of Central University of Finance and Economics. Y. Liu acknowledges support from the iFRG Grant at Macau University of Science and Technology.

Author information

Corresponding author

Correspondence to Shan Lu.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Wang, H., Lu, S. & Liu, Y. Missing data imputation in PLS-SEM. Qual Quant 56, 4777–4795 (2022). https://doi.org/10.1007/s11135-022-01338-4

  • DOI: https://doi.org/10.1007/s11135-022-01338-4
