Array Normal Model and Incomplete Array Variate Observations

Akdemir, Deniz

doi:10.1007/978-4-431-55387-8_5

Deniz Akdemir²

Part of the book series: SpringerBriefs in Statistics ((JSSRES))

2340 Accesses

Abstract

Missing data present an important challenge when dealing with high-dimensional data arranged in the form of an array. The main purpose of this article is to introduce methods for estimation of the parameters of array variate normal probability model from partially observed multiway data. The methods developed here are useful for missing data imputation, estimation of mean, and covariance parameters for multiway data. A review of array variate distributions is included. A multiway semi-parametric mixed-effects model that allows separation of multiway mean and covariance effects is also defined, and an efficient algorithm for estimation based on the spectral decompositions of the covariance parameters is recommended. We demonstrate our methods with simulations and real-life data involving the estimation of genotype and environment interaction effects on possibly correlated traits.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 16.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Akdemir, D., & Gupta, A. K. (2011). Array variate random variables with multiway kronecker delta covariance matrix structure. Journal of Algebraic Statistics, 2(1), 98–113.
Article MathSciNet Google Scholar
Allen, G. I., & Tibshirani, R. (2010). Transposable regularized covariance models with an application to missing data imputation. The Annals of Applied Statistics, 4(2), 764–790.
Article MathSciNet MATH Google Scholar
Anderson, T. W. (1957). Maximum likelihood estimates for a multivariate normal distribution when some observations are missing. Journal of the American Statistical Association, 52(278), 200–203.
Article MathSciNet MATH Google Scholar
Anderson, T. W. (1984). An introduction to multivariate. Wiley.
Google Scholar
Beale, E. M. L., & Little, R. J. A. (1975). Missing values in multivariate analysis. Journal of the Royal Statistical Society. Series B (Methodological), 129–145.
Google Scholar
Blaha, G. (1977). A few basic principles and techniques of array algebra. Journal of Geodesy, 51(3), 177–202.
MathSciNet Google Scholar
Bro, R. (1997). Parafac. Tutorial and applications. Chemometrics and Intelligent Laboratory Systems, 38(2), 149–171.
Article Google Scholar
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 1–38.
Google Scholar
Dempster, A. P., Rubin, D. B., & Tsutakawa, R. K. (1981). Estimation in covariance components models. Journal of the American Statistical Association, 76(374), 341–353.
Article MathSciNet MATH Google Scholar
Flury, B. (1997). A first course in multivariate statistics. Springer.
Google Scholar
Pieter, G. F. (1921). Heredity of headform in man. Genetica, 3(3), 193–400.
Google Scholar
Gianola, D., & Van Kaam, J. B. (2008). Reproducing kernel hilbert spaces regression methods for genomic assisted prediction of quantitative traits. Genetics, 178(4), 2289–2303.
Article Google Scholar
Gupta, A. K., & Nagar, D. K. (2000). Matrix variate distributions. In: Chapman and Hall/CRC Monographs and Surveys in Pure and Applied Mathematics. London: Chapman and Hall.
Google Scholar
Harshman, R. A. (1970). Foundations of the parafac procedure: models and conditions for an “explanatory” multimodal factor analysis. UCLA Working Papers in Phonetics.
Google Scholar
Hartley, H. O., & Hocking, R.R. (1971). The analysis of incomplete data. Biometrics, 783–823.
Google Scholar
Harville, D. A. (1983). Discussion on a section on interpolation and estimation. In: DHA and HT David (ed.), Statistics an Appraisal (pp. 281–286). Ames: The Iowa State University Press.
Google Scholar
Henderson, C. R., & Quaas, R. L. (1976). Multiple trait evaluation using relatives’ records. Journal of Animal Science, 43(6), 1188–1197.
Google Scholar
Hoff, P. D. (2011). Hierarchical multilinear models for multiway data. Computational Statistics and Data Analysis, 55(1), 530–543.
Article MathSciNet MATH Google Scholar
Jørgensen, B., & Petersen, H. C. (2012). Efficient estimation for incomplete multivariate data. Journal of Statistical Planning and Inference, 142(5), 1215–1224.
Article MathSciNet Google Scholar
Kang, H. M., Zaitlen, N. A., Wade, C. M., Kirby, A., Heckerman, D., Daly, M. J., et al. (2008). Efficient control of population structure in model organism association mapping. Genetics, 178(3), 1709–1723.
Article Google Scholar
Kimeldorf, G. S., & Wahba, G. (1970). A correspondence between bayesian estimation on stochastic processes and smoothing by splines. The Annals of Mathematical Statistics, 495–502.
Google Scholar
Lu, N., & Zimmerman, D. L. (2005). The likelihood ratio test for a separable covariance matrix. Statistics and Probability Letters, 73(4), 449–457.
Article MathSciNet MATH Google Scholar
Meng, X. L., & Rubin, D. B. (1993). Maximum likelihood estimation via the ecm algorithm: A general framework. Biometrika, 80(2), 267–278.
Article MathSciNet MATH Google Scholar
Ohlson, M., Ahmad, M. R., & von Rosen D. (2011). The multilinear normal distribution: Introduction and some basic properties. Journal of Multivariate Analysis.
Google Scholar
Orchard, T., & Woodbury, M. A. (1972). A missing information principle: theory and applications. In: Proceedings of the 6th Berkeley Symposium on Mathematical Statistics and Probability (vol. 1, pp. 697–715).
Google Scholar
Rauhala, U. A. (1974). Array Algebra with Applications in Photogrammetry and Geodesy. Division of Photogrammetry, Royal Institute of Technology.
Google Scholar
Robinson, G. K. (1991). That blup is a good thing: The estimation of random effects. Statistical Science, 6(1), 15–32.
Article MathSciNet MATH Google Scholar
Roy, A., & Khattree, R. (2003). Tests for mean and covariance structures relevant in repeated measures based discriminant analysis. Journal of Applied Statistical Science, 12(2), 91–104.
Google Scholar
Roy, A., & Leiva, R. (2008). Likelihood ratio tests for triply multivariate data with structured correlation on spatial repeated measurements. Statistics and Probability Letters, 78(13), 1971–1980.
Article MathSciNet MATH Google Scholar
Schölkopf, B., & Smola, A. (2005). Learning with Kernels. Cambridge: MIT Press.
Google Scholar
Sorensen, D., & Gianola, D. (2002). Likelihood, Bayesian, and MCMC methods in quantitative genetics. Springer.
Google Scholar
Speed, T. (1991). Comment on “That blup is a good thing: The estimation of random effects”. Statistical Science, 6(1), 42–44.
Article MathSciNet Google Scholar
Srivastava, M. S., Nahtman, T., & Von Rosen, D. (2008). Estimation in general multivariate linear models with kronecker product covariance structure. Report: Research Report Centre of Biostochastics, Swedish University of Agriculture science. 1.
Google Scholar
Srivastava, M. S., Von Rosen, T., & Von Rosen, D. (2008). Models with a kronecker product covariance structure: Estimation and testing. Mathematical Methods of Statistics, 17(4), 357–370.
Article MathSciNet MATH Google Scholar
Trawinski, I. M., & Bargmann, R. E. (1964). Maximum likelihood estimation with incomplete multivariate data. The Annals of Mathematical Statistics, 35(2), 647–657.
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

This research was supported by the USDA-NIFA-AFRI Triticeae Coordinated Agricultural Project, award number 2011-68002-30029.

Author information

Authors and Affiliations

Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY, USA
Deniz Akdemir

Authors

Deniz Akdemir
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Deniz Akdemir .

Editor information

Editors and Affiliations

Faculty of Design, Kyushu University, Fukuoka, Fukuoka, Japan
Toshio Sakata

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Akdemir, D. (2016). Array Normal Model and Incomplete Array Variate Observations. In: Sakata, T. (eds) Applied Matrix and Tensor Variate Data Analysis. SpringerBriefs in Statistics(). Springer, Tokyo. https://doi.org/10.1007/978-4-431-55387-8_5

Download citation

DOI: https://doi.org/10.1007/978-4-431-55387-8_5
Published: 03 February 2016
Publisher Name: Springer, Tokyo
Print ISBN: 978-4-431-55386-1
Online ISBN: 978-4-431-55387-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics