Abstract
In many longitudinal and hierarchical epidemiological frameworks, observations regarding to each individual are recorded repeatedly over time. In these follow-ups, accurate measurements of time-dependent covariates might be invalid or expensive to be obtained. In addition, in the recording process, or as a result of other undetected reasons, miscategorization of the response variable might occur, that does not demonstrate the true condition of the response process. In contrast with binary outcome by which classification error occurs between two categories, disorderliness in categorical outcome has more intricate impacts, as a result of the increased number of categories and asymmetric miscategorization matrix. When no modification is made, insensitivity of errors in either covariate or response variable, results in potentially incorrect conclusion, tends to bias the statistical inference and eventually degrades the efficiency of the decision-making procedure. In this article, we provide an approach to simultaneously adjust for misclassification in the correlated nominal response and measurement error in the covariates, incorporating validation data in the estimation of misclassification probabilities, using the multivariate Gauss–Hermite quadrature technique for the approximation of the likelihood function. Simulation results demonstrate the effects of modifying covariate measurement error and response misclassification on the estimation procedure.
Similar content being viewed by others
REFERENCES
C. E. McCullach, S. R. Searle, and J. M. Neuhaus, Generalized, Linear, and Mixed Models (John Wiley & Sons, London, 2008).
C. E. McCullach, ‘‘Maximum likelihood variance components estimation for binary data,’’ Journal of the American Statistical Association 89, 330–335 (1994).
L. Wu, Mixed Effects Models for Complex Data (CRC Press, Boca Raton, 2009).
W. W. Stroup, Generalized Linear Mixed Models: Modern Concepts, Methods, and Applications (CRC Press,Boca Raton, 2012).
P. J. Diggle, K. Y. Liang, and S. L. Zeger, Analysis of Longitudinal Data (Oxford University Press, Oxford, 1994).
M. A. Tanner, Tools for Statistical Inference: Observed Data and Data Augmentation (Springer, New York, 1993).
N. Wang, X. Lin, and R. G. Guttierrez, ‘‘A bias correction regression calibration approach in generalized linear mixed measurement error models,’’ Communications in Statistics 28, 217–232 (1999).
N. Wang, X. Lin, R. G. Guttierrez, and R. J. Carroll, ‘‘ Bias analysis and SIMEX approach in generalized linear mixed measurement error models,’’ Journal of American Statistical Association 93, 249–261 (1998).
J. P. Buonaccorsi, G. Romeo, and M. Thoresen, ‘‘Model-based bootstapping when correcting for measurement error with application to logistic regression,’’ Biometrics 74, 135–144 (2018).
R. J. Carroll, D. Ruppert, L. A. Stefanski, and C. M. Crainiceanu, Measurement Error in Nonlinear Models: A Modern Perspective (CRC Press, Boca Raton, 2006).
M. Torabi, ‘‘Likelihood inference in generalized linear mixed measurement error models,’’ Computational Statistics and Data Analysis 57, 549–557 (2013).
X. Xie, X. Xue, and H. D. Strickler, ‘‘Generalized linear mixed model for binary outcomes when covariates are subject to measurement errors and detection limits,’’ Statistics in Medicine 37, 119–136 (2017).
G. Y. Yi, Statistical Analysis with Measurement Error or Misclassification (Springer, New York, 2017).
L. S. Magder and J. P. Hughes, ‘‘Logistic regression when the outcome is measured with uncertainty,’’ American Journal of Epidemiology 146 (2), 195–203 (1997).
J. M. Neuhaus, ‘‘Bias and efficiency loss due to misclassified responses in binary regression,’’ Biometrika 86(4), 843–855 (1999).
J. M. Neuhaus, ‘‘Analysis of clustered and longitudinal binary data subject to response misclassification,’’ Biometrics 58(3), 675–683 (2002).
C. D. Paulino, P. Soares, and J. Neuhaus, ‘‘Binomial regression with misclassification,’’ Biometrics 59 (3), 670–675 (2003).
R. Gerlach and J. Stamey, ‘‘Bayesian model selection for logistic regression with misclassified outcomes,’’ Statistical Modelling 7 (3), 255–273, (2007).
L. Tang, R. H. Lyles, C. C. King, J. W. Hogan, and Y. Lo, ‘‘Regression analysis for differentially misclassified correlated binary outcomes,’’ Journal of the Royal Statistical Society. Series C, Applied Statistics 64 (3), 433–449 (2015).
R. H. Lyles, L. Tang, H. M. Superak, C. C. King, D. D. Celentano, Y. Lo, and J. D. Sobel, ‘‘Validation data-based adjustments for outcome misclassification in logistic regression: An illustration, Epidemiology (Cambridge, Mass.) 22 (4), 589 (2011).
L. Naranjo, C. J. Prez, J. Martn, T. Mutsvari, and E. Lesaffre, ‘‘A Bayesian approach for misclassified ordinal response data,’’ Journal of Applied Statistics 46 (12), 2198-2215, (2019).
D. Cheng, A. J. Branscum, and J. D. Stamey, ‘‘Accounting for response misclassification and covariate measurement error improves power and reduces bias in epidemiologic studies,’’ Annals of Epidemiology 20(7), 562–567, (2010).
D. Shu and G. Y. Yi, ‘‘Weighted causal inference methods with mismeasured covariates and misclassified outcomes,’’ Statistics in Medicine 38 (10), 1835–1854 (2019).
S. Roy, ‘‘Accounting for response misclassification and covariate measurement error using a random effect logit model,’’ Communications in Statistics-Simulation and Computation 41(9), 1623–1636 (2012).
S. Roy, ‘‘Analysis of ordered probit model with surrogate response data and measurement error in covariates,’’ Communications in Statistics-Theory and Methods 45 (9), 2665–2678 (2016).
J. P. Buonaccorsi, Measurement Error, Models, Methods, and Applications (CRC Press, New York, 2010).
R. H. Keogh, P. A. Shaw, P. Gustafson, R. J. Carroll, V. Deffner, K. W. Dodd, H. Kchenhoff, J. A. Tooze, M. P. Wallace, V. Kipnis, and L. S. Freedman, ‘‘STRATOS guidance document on measurement error and misclassification of variables in observational epidemiology: Part 1-basic theory and simple methods of adjustment,’’ Statistics in Medicine 39 (16), 2197–2231 (2020).
P. Jaeckel, A Note on Multivariate Gauss-Hermite Quadrature (ABN-Amro, London, 2005).
A. Agresti, Categorical Data Analysis (John Wiley and Sons, New York, 2002).
A. Skrondal and S. Rabe-Hesketh, Generalized Latent Variable Modeling (CRC Press, Boca Raton, 2004).
G. Molenberghs and G. Verbeke, Models for Discrete Longitudinal Data (Springer, New York, 2006).
J. Pan and R. Thompson, ‘‘Gauss-hermite quadrature approximation estimation in generalized linear mixed models,’’ Computational Statistics 18, 57–78 (2003).
J. A. Nelder and R. Mead, ‘‘A simplex algorithm for function minimization,’’ Computer Journal 7, 308–313 (1965).
ACKNOWLEDGMENTS
Receiving support from the Center of Excellence in Analysis of Spatio-Temporal Correlated Data at Tarbiat Modares University is acknowledged.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
The authors declare that they have no conflicts of interest.
About this article
Cite this article
Ahangari, M., Golalizadeh, M. & Ghahroodi, Z.R. Validation Data-Located Modification for the Multilevel Analysis of Miscategorized Nominal Response with Covariates Subject to Measurement Error. Math. Meth. Stat. 32, 223–240 (2023). https://doi.org/10.3103/S1066530723040026
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.3103/S1066530723040026