Skip to main content

Distance Based Association and Multi-Sample Tests for General Multivariate Data

  • Chapter
  • First Online:
Advances in Mathematical and Statistical Modeling

Part of the book series: Statistics for Industry and Technology ((SIT))

Abstract

Most multivariate tests are based on the hypothesis of multinormality. But often this hypothesis fails, or we have variables that are non quantitative. On the other hand we can deal with a large number of variables. Defining probabilistic models with mixed data is not easy. However, it is always possible to define a measure of distance between two observations. We prove that the use of distances can provide alternative tests for comparing several populations when the data are of general type. This approach is illustrated with three real data examples. We also define and study a measure of association between two data sets and make a Bayesian extension of the so-called distance-based discriminant rule.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Anderson, M. J. (2006). Distance-based tests for homogeneity of multivariate dispersions. Biometrics, 62:245–253.

    Article  MATH  MathSciNet  Google Scholar 

  • Arenas, C. and Cuadras, C. M. (2002). Recent statistical methods based on distances. Contributions to Science, 2:183–191.

    Google Scholar 

  • Arenas, C. and Cuadras, C. M. (2004). Comparing two methods for joint representation of multivariate data. Communications in Statistics, Simulation and Computation, 33:415–430.

    Article  MATH  MathSciNet  Google Scholar 

  • Cox, T. F. and Cox, M. A. A. (1994). Multidimensional Scaling. Chapman and Hall, London.

    MATH  Google Scholar 

  • Cuadras, C. M. (1989). Distance analysis in discrimination and classification using both continuous and categorical variables. In Statistical Data Analysis and Inference, (Ed. Y. Dodge), pp. 459–473. Elsevier Science Publishers B. V. (North–Holland), Amsterdam.

    Google Scholar 

  • Cuadras, C. M. (1992). Some examples of distance based discrimination. Biometrical Letters, 29:3–20.

    Google Scholar 

  • Cuadras, C. M. and Arenas, C. (1990). A distance based regression model for prediction with mixed data. Communications in Statistics, Theory and Methods, 19:2261–2279.

    Article  MathSciNet  Google Scholar 

  • Cuadras, C. M. and Fortiana, J. (1995). A continuous metric scaling solution for a random variable. Journal of Multivariate Analysis, 52:1–14.

    Article  MATH  MathSciNet  Google Scholar 

  • Cuadras, C. M. and Fortiana, J. (2004). Distance-based multivariate two sample tests. In Parametric and Semiparametric Models with Applications to Reliability, Survival Analysis and Quality of Life, (Eds. M. S. Nikulin, N. Balakrishnan, M. Mesbah, N. Limnios), 273–290. Birkhauser, Boston.

    Google Scholar 

  • Cuadras, C. M., Arenas, C., and Fortiana, J. (1996). Some computational aspects of a distance-based model for prediction. Communications in Statistics, Simulation and Computation, 25:593–609.

    Article  MATH  Google Scholar 

  • Cuadras, C. M., Atkinson, R. A., and Fortiana, J. (1997a). Probability densities from distances and discriminant analysis. Statistics and Probability Letters, 33:405–411.

    Article  MATH  MathSciNet  Google Scholar 

  • Cuadras, C. M., Fortiana, J., and Oliva, F. (1997b). The proximity of an individual to a population with applications in discriminant analysis. Journal of Classification, 14:117–136.

    Article  MATH  MathSciNet  Google Scholar 

  • Cuadras, C. M., Cuadras, D., and Lahlou, Y. (2006). Principal directions of the general Pareto distribution with applications. Journal of Statistical Planning and Inference, 136:2572–2583.

    Article  MATH  MathSciNet  Google Scholar 

  • Cuadras, C. M. and Lahlou, Y. (2000). Some orthogonal expansions for the logistic distribution. Communications in Statistics, Theory and Methods, 29:2643–2663.

    Article  MATH  MathSciNet  Google Scholar 

  • Escoufier, Y. (1973). Le trataiment des variables vectorielles. Biometrics, 29:751–760.

    Article  MathSciNet  Google Scholar 

  • Flury, B. (1997). A First Course in Multivariate Statistics. Springer-Verlag, New York.

    MATH  Google Scholar 

  • Gower, J. C. (1966). Some distance properties of latent roots and vector methods in multivariate analysis. Biometrika, 53:315–328.

    Google Scholar 

  • Gower, J. C. and Legendre, P. (1986). Metric and Euclidean properties of dissimilarity coefficients. Journal of Classification, 3:5–48.

    Article  MATH  MathSciNet  Google Scholar 

  • Liu, Z. J. and Rao, C. R. (1995). Asymptotic distribution of statistics based on quadratic entropy and bootstrapping. Journal of Statistical Planning and Inference, 43:1–18.

    Article  MATH  MathSciNet  Google Scholar 

  • Mardia, K. V, Kent, J. T. and Bibby, J. M. (1979). Multivariate Analysis. Academic Press, London.

    MATH  Google Scholar 

  • Rao, C. R. (1982). Diversity: its measurement, decomposition, apportionment and analysis. Sankhya A, 44:1–21.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Birkhäuser Boston

About this chapter

Cite this chapter

Cuadras, C.M. (2008). Distance Based Association and Multi-Sample Tests for General Multivariate Data. In: Advances in Mathematical and Statistical Modeling. Statistics for Industry and Technology. Birkhäuser Boston. https://doi.org/10.1007/978-0-8176-4626-4_5

Download citation

Publish with us

Policies and ethics