Skip to main content
Log in

A short history of algebraic statistics

  • Published:
Metrika Aims and scope Submit manuscript

Abstract

In algebraic statistics, computational techniques from algebraic geometry become tools to address statistical problems. This, in turn, may prompt research in algebraic geometry. The basic ideas at the core of algebraic statistics will be presented. In particular, we shall consider application to contingency tables and to design of experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Abbott J, Bigatti A, Kreutzer M, Robbiano L (2000) Computing ideals of points. J Symbol Comput 30(4): 341–356

    Article  MATH  Google Scholar 

  • Allman ES, Rhodes JA (2008) Identifying evolutionary trees and substitution parameters for the general Markov model with invariable sites. Math Biosci 211(1): 18–33

    Article  MATH  MathSciNet  Google Scholar 

  • Aoki S, Takemura A (2008) Markov bases for design of experiments with three-level factors, In: Gibilisco P, Riccomagno E, Rogantin MP, Wynn HP (eds) Geometric and algebraic methods in statistics. Cambridge University Press, Cambridge (forthcoming)

  • Aoki S, Takemura A, Yoshida R (2008) Indispensable monomials of toric ideals and Markov bases. J Symbol Comput 43: 490–507

    MATH  MathSciNet  Google Scholar 

  • Bailey RA (1982) The decomposition of treatment degrees of freedom in quantitative factorial experiments. J R Statist Soc B 44(1): 63–70

    MATH  MathSciNet  Google Scholar 

  • Bailey RA, Cameron PJ, Connelly R (2008) Sudoku, Gerechte Designs, Resolutions, Affine Space, Spreads, Reguli, and Hamming codes. Am Math Mon 115(5): 383–404

    MATH  MathSciNet  Google Scholar 

  • Becker T, Weispfenning V (1991) The Chinese remainder problem, multivariate interpolation, and Gröbner bases. In: ISSAC91: Proceedings of the 1991 international symposium on symbolic and algebraic computation, pp 64–69

  • Beerenwinkel N, Eriksson N, Sturmfels B (2007) Conjunctive Bayesian networks. Bernoulli 13(4): 893–909

    Article  MATH  MathSciNet  Google Scholar 

  • Berstein Y, Lee J, Maruri-Aguilar H, Onn S, Riccomagno E, Weismantel R, Wynn HP (2008) Nonlinear matroid optimization and experimental design. SIAM J Discrete Math 22(3): 901–919

    Article  MathSciNet  Google Scholar 

  • Berstein Y, Maruri-Aguilar H, Onn S, Riccomagno E, Wynn HP (2007) Minimal average degree aberration and the state polytope for experimental design, MUCM report no. 07/07

  • Besag J, Clifford P (1989) Generalized Monte Carlo significance tests. Biometrika 76: 633–642

    Article  MATH  MathSciNet  Google Scholar 

  • Bishop YM, Fienberg SE, Holland PW (1977) Discrete multivariate analysis: theory and practice, x+557. MIT Press, Cambridge

    Google Scholar 

  • Bochnak J, Coste M, Roy MF (1998) Real algebraic geometry, x+430. Springer, Berlin

    Google Scholar 

  • CoCoA Team (2004) CoCoA a system for doing Computations in Commutative Algebra. Available at http://cocoa.dima.unige.it

  • Caboara M, Riccomagno E (1998) An algebraic computational approach to the identifiability of Fourier models. J Symbol Comput 26(2): 245–260

    Article  MATH  MathSciNet  Google Scholar 

  • Carlini E, Rapallo F (2008) Algebraic modelling of category distinguishability. In: Gibilisco P, Riccomagno E, Rogantin MP, Wynn HP (eds) Geometric and algebraic methods in statistics. Cambridge University Press, Cambridge (forthcoming)

  • Casanellas M, Fernndez-Snchez J (2007) Performance of a new invariants method on homogeneous and non-homogeneous quartet trees. Mol Biol Evol 24(1): 288–293

    Article  Google Scholar 

  • Catanese F, Hoşten C, Khetan A, Sturmfels B (2006) The maximum likelihood degree. Am J Math 128(3): 671–697

    Article  MATH  Google Scholar 

  • Chen Y, Dinwoodie IH, Sullivant S (2006) Sequential importance sampling for multiway tables. Ann Stat 34(1): 523–545

    Article  MATH  MathSciNet  Google Scholar 

  • Cox D, Little J, O’Shea D (2008) Ideal, varieties, and algorithms, xvi+551, 3rd edn. Springer, New York

    Google Scholar 

  • Diaconis P, Eriksson N (2006) Markov bases for noncommutative Fourier analysis of ranked data. J Symbol Comput 41(2): 182–195

    Article  MATH  MathSciNet  Google Scholar 

  • Diaconis P, Sturmfels B (1998) Algebraic algorithms for sampling from conditional distributions. Ann Stat 26(1): 363–397

    Article  MATH  MathSciNet  Google Scholar 

  • Dimitrova E, Jarrah A, Laubenbacher R, Stigler B (2007) A Gröbner fan method for biochemical network modeling. ISSAC Proceedings, pp 122–126

  • Dinwoodie IH (1998) The Diaconis–Sturmfels algorithm and rules of succession. Bernoulli 4(3): 401–410

    Article  MATH  MathSciNet  Google Scholar 

  • Dobra A (2003) Markov Bases for decomposable graphical models. Bernoulli 9: 1093–1108

    Article  MATH  MathSciNet  Google Scholar 

  • Drton M (2006) Algebraic techniques for Gaussian models. In: Huskova M, Janzura M (eds) Prague stochastics, pp 81–90

  • Drton M (2008) Likelihood ratio tests and singularities. Ann Stat (to appear)

  • Drton M, Sullivant S (2007) Algebraic statistical models. Statistica Sinica 17(4): 1273–1297

    MATH  MathSciNet  Google Scholar 

  • Eriksson N, Fienberg SE, Rinaldo A, Sullivant S (2006) Polyhedral conditions for the non-existence of the MLE for hierarchical log-linear models. J Symbol Comput 41(2): 222–233

    Article  MATH  MathSciNet  Google Scholar 

  • Fienberg SE (2007) Expanding the statistical toolkit with algebraic statistics, editorial. Statistica Sinica 17(4): 1261–1272

    MATH  MathSciNet  Google Scholar 

  • Fienberg SE, Hersh P, Rinaldo A, Zhou Y (2008) Maximum likelihood estimation in latent class models For contingency table data. In: Gibilisco P, Riccomagno E, Rogantin MP, Wynn HP (eds) Geometric and algebraic methods in statistics. Cambridge University Press, Cambridge (forthcoming)

  • Fienberg SE, Slavkovic A (2005) Preserving the confidentiality of categorical statistical data bases when releasing information for association rules. Data Min Knowl Discov 11(2): 155–180

    Article  MathSciNet  Google Scholar 

  • Fontana R, Pistone G, Rogantin MP (2000) Classification of two-level factorial fractions. J Stat Planning Inference 87(1): 149–172

    Article  MATH  MathSciNet  Google Scholar 

  • Fontana R, Rogantin MP (2008) Indicator function and sudoku designs. In: Gibilisco P, Riccomagno E, Rogantin MP, Wynn HP (eds) Geometric and algebraic methods in statistics. Cambridge University Press, Cambridge (forthcoming)

  • Garcia LD, Stillman M, Sturmfels B (2005) Algebraic geometry of Bayesian networks. J Symbol Comput 39(3–4): 331–355

    Article  MATH  MathSciNet  Google Scholar 

  • Geiger D, Meek C, Sturmfels B (2006) On the toric algebra of graphical models. Ann Stat 34(3): 1463–1492

    Article  MATH  MathSciNet  Google Scholar 

  • Gibilisco P, Riccomagno E, Rogantin MP, Wynn HP (eds) (2008) Geometric and algebraic methods in statistics. Cambridge University Press, Cambridge (forthcoming)

  • Hara H, Takemura A, Yoshida R (2008) A Markov basis for conditional test of common diagonal effect in quasi-independence model for two-way contingency tables, arXiv:0802.2603

  • Journal of Symbolic Computation: special issue on Computational Algebraic Statistics 41:2 (2006)

    Google Scholar 

  • Kang C, Tian J (2007) Polynomial constraints in causal Bayesian networks. In: Proceedings of the conference on uncertainty in artificial intelligence (UAI)(2006)

  • Krampe A, Kuhnt S (2007) Bowker’s test for symmetry and modifications within the algebraic framework. Comput Stat Data Anal 51(9): 4124–4142

    Article  MathSciNet  Google Scholar 

  • Krampe A, Kuhnt S (2008) Model selection for contingency tables with algebraic statistics. In: Gibilisco P, Riccomagno E, Rogantin MP, Wynn HP (eds) Geometric and algebraic methods in statistics. Cambridge University Press, Cambridge (forthcoming)

  • Kreuzer M, Robbiano L (2000) Computational commutative algebra, 1, x+321. Springer, Berlin

    Google Scholar 

  • Kreuzer M, Robbiano L (2008) Computational commutative algebra, 2, x+586. Springer, Berlin

    Google Scholar 

  • Lin Z, Xu L, Wu Q (2004) Applications of Gröbner bases to signal and image processing: a survey. Linear Algebra Appl 391: 169–202

    Article  MATH  MathSciNet  Google Scholar 

  • Malkin P (2007) Computing Markov bases, Gröbner bases, and extreme rays, x+223. Ph.D. thesis, Université Catholique de Louvain

  • Maruri-Aguilar H (2007) Algebraic statistics in experimental design. Department of Statistics, University of Warwick, UK

    Google Scholar 

  • Maruri-Aguilar H, Notari R, Riccomagno E (2007) On the description and identifiability analysis of mixture designs. Statistica Sinica 17(4): 1417–1440

    MATH  MathSciNet  Google Scholar 

  • Maruri-Aguilar H, Riccomagno E (2007) A model selection algorithm for mixture experiments including process variables. In: Lopez-Fidalgo J, Rodrguez-Daz J, Torsney B (eds) Proceedings of Moda8, pp 107–114

  • Maruri-Aguilar H, Wynn HP (2008) Generalised design: interpolation and statistical modelling over varieties. In: Gibilisco P, Riccomagno E, Rogantin MP, Wynn HP (eds) Geometric and algebraic methods in statistics. Cambridge University Press, Cambridge (forthcoming)

  • Mora T, Robbiano L (1988) The Gröbner fan of an ideal. J Symbol Comput 6: 183–208

    Article  MATH  MathSciNet  Google Scholar 

  • Onn S, Sturmfels B (1999) Cutting corners. Adv Appl Math 23(1): 29–48

    Article  MATH  MathSciNet  Google Scholar 

  • Pachter L, Sturmfels B (2005) Algebraic statistics for computational biology, vol 420. Cambridge University Press, Cambridge

    Google Scholar 

  • Pearl J (2000) Causality Models, reasoning, and inference, xvi+384. Cambridge University Press, Cambridge

    Google Scholar 

  • Pistone G, Riccomagno E, Wynn HP (2001) Algebraic statistics, xvii+160. Chapman & Hall/CRC, Boca Raton

    Google Scholar 

  • Pistone G, Rogantin MP (2008) Indicator function and complex coding for mixed fractional factorial designs. J Stat Planning Inference 138:3:1, 787–802

    Google Scholar 

  • Pistone G, Wynn HP (1996) Generalised confounding with Gröbner bases. Biometrika 83(3): 653–666

    Article  MATH  MathSciNet  Google Scholar 

  • Rapallo F (2003) Algebraic Markov bases and MCMC for two-way contingency tables. Scand J Stat 30: 385–397

    Article  MATH  MathSciNet  Google Scholar 

  • Rapallo F (2005) Algebraic exact inference for rater agreement models. Stat Methods Appl 14(1): 45–66

    Article  MATH  MathSciNet  Google Scholar 

  • Rapallo F (2006) Markov bases and structural zeros. J Symbol Comput 41(2): 164–172

    Article  MATH  MathSciNet  Google Scholar 

  • Riccomagno E, Smith JQ (2004) Identifying a cause in models which are not simple Bayesian networks. In: Proc IPMU, pp 1345–1322

  • Riccomagno E, Smith JQ (2008) The geometry of causal probability trees that are algebraically constrained. In: Pronzato L, Zigljavsky AA (eds) Search for optimality in design and statistics: algebraic and dynamical system methods, pp 95–129

  • Riccomagno E, Smith JQ (2005) The causal manipulation of chain event graphs. http://arxiv.org/abs/0709.3380

  • Riccomagno E, Smith JQ (2007) Algebraic causality: Bayes nets and beyond. CRiSM Paper No. 07-3

  • Scheffé H (1958) Experiments with mixtures. J Roy Statist Soc Ser B 20: 344–360

    MATH  MathSciNet  Google Scholar 

  • Scheffé H (1963) The simplex-centroid design for experiments with mixtures. J Roy Statist Soc Ser B 25: 235–263

    MathSciNet  Google Scholar 

  • Shafer G (1996) The art of causal conjecture, vol 552. MIT Press, Cambridge

    Google Scholar 

  • Statistica Sinica: special issue on Algebraic Statistics and Computational Biology 17:4 (2007)

    Google Scholar 

  • Sturmfels B (2002) Solving systems of polynomial equations. CBMS Reg Conf Ser Math, Washington DC

    MATH  Google Scholar 

  • Sullivant S (2008) Algebraic geometry of Gaussian Bayesian networks. Adv Appl Math (to appear)

  • Ye KQ (2003) Indicator function and its application in two level factorial designs. Ann Stat 31(3): 984–994

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eva Riccomagno.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Riccomagno, E. A short history of algebraic statistics. Metrika 69, 397–418 (2009). https://doi.org/10.1007/s00184-008-0222-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00184-008-0222-3

Keywords

Navigation