Abstract
As a useful tool for business research, PLS-SEM is widely adopted for the assessment of causal-predictive relationships of models when developing and testing theories. Nevertheless, the less error-prone techniques for handling missing data are routinely ignored by PLS-SEM researchers. In this paper, we propose an imputation method, called EM PLS-SEM, to deal with missing values in PLS-SEM. The method takes advantages of the estimation procedure of PLS-SEM to reach the goal of filling the missing elements with values that are most likely to appear. Numerical studies verify that the proposed method outperforms other alternatives in data completion and model fitting.
Similar content being viewed by others
Data availability
Enquiries about data availability should be directed to the authors.
Notes
The KNN imputation and regression imputation are implicated by R packages “DMwR” and “mice”, respectively.
The generation is completed by mvrnorm function in R software.
References
Bollen, K.A.: Structural Equations with Latent Variables. Wiley, New York (1989)
Carrión, G.C., Nitzl, C., Roldán, J.L.: Mediation analyses in partial least squares structural equation modeling: Guidelines and empirical examples. In: Partial Least Squares Path Modeling, pp. 173–195. Springer, New York (2017)
Cepeda-Carrion, G., Cegarra-Navarro, J.G., Cillo, V.: Tips to use partial least squares structural equation modelling (PLS-SEM) in knowledge management. J. Knowl. Manag. 23(1), 67–89 (2019)
Chin, W., Cheah, J.H., Liu, Y., Ting, H., Lim, X.J., Cham, T.H.: Demystifying the role of causal-predictive modeling using partial least squares structural equation modeling in information systems research. Ind. Manag. Data Syst. (2020). https://doi.org/10.1108/IMDS-10-2019-0529
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 39(1), 1–22 (1977)
Dijkstra, T.K., Henseler, J.: Consistent partial least squares path modeling. MIS Q. 39(2), 297–316 (2015)
Eberl, M., Schwaiger, M.: Corporate reputation: disentangling the effects on financial performance. Eur. J. Market. 39(7/8), 838–854 (2005)
Fornell, C., Bookstein, F.L.: Two structural equation models: Lisrel and pls applied to consumer exit-voice theory. J. Market. Res. 19(4), 440–452 (1982)
Grimm, M.S., Wagner, R.: The impact of missing values on PLS, ML and FIML model fit. Arch. Data Sci. Ser. 6(1), 04 (2019)
Hair Jr., J.F., Sarstedt, M.: Data, measurement, and causal inferences in machine learning: opportunities and challenges for marketing. J. Market. Theory Pract. (2021). https://doi.org/10.1080/10696679.2020.1860683
Hair, J.F., Sarstedt, M., Pieper, T.M., Ringle, C.M.: The use of partial least squares structural equation modeling in strategic management research: a review of past practices and recommendations for future applications. Long range planning 45(5–6), 320–340 (2012a)
Hair, J.F., Sarstedt, M., Ringle, C.M., Mena, J.A.: An assessment of the use of partial least squares structural equation modeling in marketing research. J. Acad. Market. Sci. 40(3), 414–433 (2012b)
Hair, Jr., J.F., Sarstedt, M., Hopkins, L., Kuppelwieser, V.G.: Partial least squares structural equation modeling (PLS-SEM): an emerging tool in business research. Eur. Bus. Rev. 26, 106–121 (2014)
Hair, J.F., Hult, G.T.M., Ringle, C., Sarstedt, M.: A Primer on Partial Least Squares Structural Equation Modeling (PLS-SEM). SAGE, Thousand Oaks (2017)
Hair, J.F., Risher, J.J., Sarstedt, M., Ringle, C.M.: When to use and how to report the results of PLS-SEM. Eur. Bus. Rev. 31(1), 2–24 (2019)
Helm, S., Eggert, A., Garnefeld, I.: Modeling the impact of corporate reputation on customer satisfaction and loyalty using partial least squares. In: Handbook of Partial Least Squares, pp. 515–534. Springer, Berlin (2010)
Jhun, M., Jeong, H.C., Koo, J.Y.: On the use of adaptive nearest neighbors for missing value imputation. Commun. Stat. Comput. 36(6), 1275–1286 (2007)
Josse, J., Pagès, J., Husson, F.: Multiple imputation in principal component analysis. Adv. Data Anal. Classif. 5(3), 231–246 (2011)
Kaufmann, L., Gaeckler, J.: A structured review of partial least squares in supply chain management research. J. Purchasing Supply Manag. 21(4), 259–272 (2015)
Kock, N.: Single missing data imputation in PLS-based structural equation modeling. J. Mod. Appl. Stat. Methods 17(1), 2 (2018)
Little, R.J., Rubin, D.B.: Statistical Analysis with Missing Data, vol. 793. Wiley, Hoboken (2019)
Lohmöller, J.B.: Latent Variable Path Modeling with Partial Least Squares. Springer, Cham (2013)
Newman, D.A.: Missing data: five practical guidelines. Organ. Res. Methods 17(4), 372–411 (2014)
Nitzl, C., Roldan, J.L., Cepeda, G.: Mediation analysis in partial least squares path modeling: helping researchers discuss more sophisticated models. Ind. Manag. Data Syst. 116(9), 1849–1864 (2016)
Olinsky, A., Chen, S., Harlow, L.: The comparative efficacy of imputation methods for missing data in structural equation modeling. Eur. J. Oper. Res. 151(1), 53–79 (2003)
Parwoll, M., Wagner, R.: The impact of missing values on pls model fitting. In: Challenges at the Interface of Data Analysis, Computer Science, and Optimization, pp. 537–544. Springer, Berlin (2012)
Rana, N.P., Chatterjee, S., Dwivedi, Y.K., Akter, S.: Understanding dark side of artificial intelligence (AI) integrated business analytics: assessing firm’s operational inefficiency and competitiveness. Eur. J. Inf. Syst. (2021). https://doi.org/10.1080/0960085X.2021.1955628
Ray, S., Danks, N.P., Calero Valdez, A.: SEMinR: building and estimating structural equation models. R Package version 2.3.0 (2022). https://CRAN.R-project.org/package=seminr
Richter, N.F., Sinkovics, R.R., Ringle, C.M., Schlägel, C.: A critical look at the use of SEM in international business research. Int. Market. Rev. 33(3), 376–404 (2016)
Rigdon, E.E., Sarstedt, M., Ringle, C.M.: On comparing results from CB-SEM and PLS-SEM: five perspectives and five recommendations. Market. ZFP J. Res. Manag. 39(3), 4–16 (2017)
Ringle, C.M., Sarstedt, M., Straub, D.W.: Editor’s comments: a critical look at the use of PLS-SEM. MIS Q. 36(3), iii–xiv (2012)
Ringle, C.M., Sarstedt, M., Mitchell, R., Gudergan, S.P.: Partial least squares structural equation modeling in HRM research. Int. J. Hum. Resour. Manag. 31(12), 1617–1643 (2020)
Rosseel, Y.: Lavaan: an R package for structural equation modeling and more. version 0.5–12 (beta). J. Stat. Softw. 48(2), 1–36 (2012)
Sarstedt, M., Wilczynski, P., Melewar, T.: Measuring reputation in global markets—a comparison of reputation measures’ convergent and criterion validities. J. World Bus. 48(3), 329–339 (2013)
Sarstedt, M., Hair, J.F., Ringle, C.M., Thiele, K.O., Gudergan, S.P.: Estimation issues with pls and cbsem: where the bias lies! J. Bus. Res. 69(10), 3998–4010 (2016)
Sarstedt, M., Ringle, C.M., Hair, J.F.: Partial least squares structural equation modeling. In: Handbook of Market Research, vol. 26(1), pp. 1–40. Springer, Cham (2017)
Sarstedt, M., Ringle, C.M., Cheah, J.H., Ting, H., Moisescu, O.I., Radomir, L.: Structural model robustness checks in PLS-SEM. Tour. Econ. 26(4), 531–554 (2020)
Schafer, J.L., Graham, J.W.: Missing data: our view of the state of the art. Psychol. Methods 7(2), 147 (2002)
Schwaiger, M.: Components and parameters of corporate reputation—an empirical study. Schmalenbach Bus. Rev. 56(1), 46–71 (2004)
Tenenhaus, M., Vinzi, V.E., Chatelin, Y.M., Lauro, C.: Pls path modeling. Comput. Stat. Data Anal. 48(1), 159–205 (2005)
Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Botstein, D., Altman, R.B.: Missing value estimation methods for DNA microarrays. Bioinformatics 17(6), 520–525 (2001)
Wilkinson, L.: Statistical methods in psychology journals: guidelines and explanations. Am. Psychol. 54(8), 594 (1999)
Funding
This research was financially supported by the National Natural Science Foundation of China (Grant Nos. 72021001 and 72001222). S. Lu is a member of Financial Sustainable Development Research Team in Central University of Finance and Economics. S. Lu also thanks the support from the Emerging Interdisciplinary Project of CUFE, Program for Innovation Research in Central University of Finance and Economics, the Disciplinary Funding of Central University of Finance and Economics. Y. Liu also thanks the support from iFRG Grant at Macau University of Science and Technology.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, H., Lu, S. & Liu, Y. Missing data imputation in PLS-SEM. Qual Quant 56, 4777–4795 (2022). https://doi.org/10.1007/s11135-022-01338-4
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11135-022-01338-4