Abstract
Missing data present an important challenge when dealing with high-dimensional data arranged in the form of an array. The main purpose of this article is to introduce methods for estimation of the parameters of array variate normal probability model from partially observed multiway data. The methods developed here are useful for missing data imputation, estimation of mean, and covariance parameters for multiway data. A review of array variate distributions is included. A multiway semi-parametric mixed-effects model that allows separation of multiway mean and covariance effects is also defined, and an efficient algorithm for estimation based on the spectral decompositions of the covariance parameters is recommended. We demonstrate our methods with simulations and real-life data involving the estimation of genotype and environment interaction effects on possibly correlated traits.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Akdemir, D., & Gupta, A. K. (2011). Array variate random variables with multiway kronecker delta covariance matrix structure. Journal of Algebraic Statistics, 2(1), 98–113.
Allen, G. I., & Tibshirani, R. (2010). Transposable regularized covariance models with an application to missing data imputation. The Annals of Applied Statistics, 4(2), 764–790.
Anderson, T. W. (1957). Maximum likelihood estimates for a multivariate normal distribution when some observations are missing. Journal of the American Statistical Association, 52(278), 200–203.
Anderson, T. W. (1984). An introduction to multivariate. Wiley.
Beale, E. M. L., & Little, R. J. A. (1975). Missing values in multivariate analysis. Journal of the Royal Statistical Society. Series B (Methodological), 129–145.
Blaha, G. (1977). A few basic principles and techniques of array algebra. Journal of Geodesy, 51(3), 177–202.
Bro, R. (1997). Parafac. Tutorial and applications. Chemometrics and Intelligent Laboratory Systems, 38(2), 149–171.
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 1–38.
Dempster, A. P., Rubin, D. B., & Tsutakawa, R. K. (1981). Estimation in covariance components models. Journal of the American Statistical Association, 76(374), 341–353.
Flury, B. (1997). A first course in multivariate statistics. Springer.
Pieter, G. F. (1921). Heredity of headform in man. Genetica, 3(3), 193–400.
Gianola, D., & Van Kaam, J. B. (2008). Reproducing kernel hilbert spaces regression methods for genomic assisted prediction of quantitative traits. Genetics, 178(4), 2289–2303.
Gupta, A. K., & Nagar, D. K. (2000). Matrix variate distributions. In: Chapman and Hall/CRC Monographs and Surveys in Pure and Applied Mathematics. London: Chapman and Hall.
Harshman, R. A. (1970). Foundations of the parafac procedure: models and conditions for an “explanatory” multimodal factor analysis. UCLA Working Papers in Phonetics.
Hartley, H. O., & Hocking, R.R. (1971). The analysis of incomplete data. Biometrics, 783–823.
Harville, D. A. (1983). Discussion on a section on interpolation and estimation. In: DHA and HT David (ed.), Statistics an Appraisal (pp. 281–286). Ames: The Iowa State University Press.
Henderson, C. R., & Quaas, R. L. (1976). Multiple trait evaluation using relatives’ records. Journal of Animal Science, 43(6), 1188–1197.
Hoff, P. D. (2011). Hierarchical multilinear models for multiway data. Computational Statistics and Data Analysis, 55(1), 530–543.
Jørgensen, B., & Petersen, H. C. (2012). Efficient estimation for incomplete multivariate data. Journal of Statistical Planning and Inference, 142(5), 1215–1224.
Kang, H. M., Zaitlen, N. A., Wade, C. M., Kirby, A., Heckerman, D., Daly, M. J., et al. (2008). Efficient control of population structure in model organism association mapping. Genetics, 178(3), 1709–1723.
Kimeldorf, G. S., & Wahba, G. (1970). A correspondence between bayesian estimation on stochastic processes and smoothing by splines. The Annals of Mathematical Statistics, 495–502.
Lu, N., & Zimmerman, D. L. (2005). The likelihood ratio test for a separable covariance matrix. Statistics and Probability Letters, 73(4), 449–457.
Meng, X. L., & Rubin, D. B. (1993). Maximum likelihood estimation via the ecm algorithm: A general framework. Biometrika, 80(2), 267–278.
Ohlson, M., Ahmad, M. R., & von Rosen D. (2011). The multilinear normal distribution: Introduction and some basic properties. Journal of Multivariate Analysis.
Orchard, T., & Woodbury, M. A. (1972). A missing information principle: theory and applications. In: Proceedings of the 6th Berkeley Symposium on Mathematical Statistics and Probability (vol. 1, pp. 697–715).
Rauhala, U. A. (1974). Array Algebra with Applications in Photogrammetry and Geodesy. Division of Photogrammetry, Royal Institute of Technology.
Robinson, G. K. (1991). That blup is a good thing: The estimation of random effects. Statistical Science, 6(1), 15–32.
Roy, A., & Khattree, R. (2003). Tests for mean and covariance structures relevant in repeated measures based discriminant analysis. Journal of Applied Statistical Science, 12(2), 91–104.
Roy, A., & Leiva, R. (2008). Likelihood ratio tests for triply multivariate data with structured correlation on spatial repeated measurements. Statistics and Probability Letters, 78(13), 1971–1980.
Schölkopf, B., & Smola, A. (2005). Learning with Kernels. Cambridge: MIT Press.
Sorensen, D., & Gianola, D. (2002). Likelihood, Bayesian, and MCMC methods in quantitative genetics. Springer.
Speed, T. (1991). Comment on “That blup is a good thing: The estimation of random effects”. Statistical Science, 6(1), 42–44.
Srivastava, M. S., Nahtman, T., & Von Rosen, D. (2008). Estimation in general multivariate linear models with kronecker product covariance structure. Report: Research Report Centre of Biostochastics, Swedish University of Agriculture science. 1.
Srivastava, M. S., Von Rosen, T., & Von Rosen, D. (2008). Models with a kronecker product covariance structure: Estimation and testing. Mathematical Methods of Statistics, 17(4), 357–370.
Trawinski, I. M., & Bargmann, R. E. (1964). Maximum likelihood estimation with incomplete multivariate data. The Annals of Mathematical Statistics, 35(2), 647–657.
Acknowledgments
This research was supported by the USDA-NIFA-AFRI Triticeae Coordinated Agricultural Project, award number 2011-68002-30029.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 The Author(s)
About this chapter
Cite this chapter
Akdemir, D. (2016). Array Normal Model and Incomplete Array Variate Observations. In: Sakata, T. (eds) Applied Matrix and Tensor Variate Data Analysis. SpringerBriefs in Statistics(). Springer, Tokyo. https://doi.org/10.1007/978-4-431-55387-8_5
Download citation
DOI: https://doi.org/10.1007/978-4-431-55387-8_5
Published:
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-55386-1
Online ISBN: 978-4-431-55387-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)