Skip to main content

Computational Algebraic Methods in Efficient Estimation

  • Chapter
  • First Online:
Geometric Theory of Information

Part of the book series: Signals and Communication Technology ((SCT))

  • 1741 Accesses

Abstract

A strong link between information geometry and algebraic statistics is made by investigating statistical manifolds which are algebraic varieties. In particular it is shown how first and second order efficient estimators can be constructed, such as bias corrected Maximum Likelihood and more general estimators, and for which the estimating equations are purely algebraic. In addition it is shown how Gröbner basis technology, which is at the heart of algebraic statistics, can be used to reduce the degrees of the terms in the estimating equations. This points the way to the feasible use, to find the estimators, of special methods for solving polynomial equations, such as homotopy continuation methods. Simple examples are given showing both equations and computations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Adler, R.J., Taylor, J.E.: Random Fields and Geometry. Springer Monographs in Mathematics. Springer, New York (2007)

    Google Scholar 

  2. Amari, S., Nagaoka, H.: Methods of Information Geometry, vol. 191. American Mathematical Society, Providence (2007)

    Google Scholar 

  3. Amari, S.: Differential geometry of curved exponential families-curvatures and information loss. Ann. Stat. 10, 357–385 (1982)

    Google Scholar 

  4. Amari, S.: Differential-Geometrical Methods in Statistics. Springer, New York (1985)

    Google Scholar 

  5. Amari, S., Kumon, M.: Differential geometry of Edgeworth expansions in curved exponential family. Ann. Inst. Stat. Math. 35(1), 1–24 (1983)

    MATH  MathSciNet  Google Scholar 

  6. Andersson, S., Madsen, J.: Symmetry and lattice conditional independence in a multivariate normal distribution. Ann. Stat. 26(2), 525–572 (1998)

    MATH  MathSciNet  Google Scholar 

  7. Bergsma, W.P., Croon, M., Hagenaars, J.A.: Marginal Models for Dependent, Clustered, and Longitudinal Categorical Data. Springer, New York (2009)

    Google Scholar 

  8. Buchberger, B.: Bruno Buchberger’s PhD thesis 1965: an algorithm for finding the basis elements of the residue class ring of a zero dimensional polynomial ideal. J. Symbol. Comput. 41(3), 475–511 (2006)

    Google Scholar 

  9. Cox, D.A., Little, J., O’Shea, D.: Ideals, Varieties, and Algorithms: An Introduction to Computational Algebraic Geometry and Commutative Algebra, 3/e (Undergraduate Texts in Mathematics). Springer, New York (2007)

    Google Scholar 

  10. Dawid, A.P.: Further comments on some comments on a paper by Bradley Efron. Ann. Stat. 5(6), 1249 (1977)

    Article  MATH  MathSciNet  Google Scholar 

  11. Drton, M., Sturmfels B., Sullivant, S.: Lectures on Algebraic Statistics. Springer, New York (2009)

    Google Scholar 

  12. Drton, M.: Likelihood ratio tests and singularities. Ann. Stat. 37, 979–1012 (2009)

    Google Scholar 

  13. Efron, B.: Defining the curvature of a statistical problem (with applications to second order efficiency). Ann. Stat. 3, 1189–1242 (1975)

    Article  MATH  MathSciNet  Google Scholar 

  14. Gehrmann, H., Lauritzen, S.L.: Estimation of means in graphical Gaussian models with symmetries. Ann. Stat. 40(2), 1061–1073 (2012)

    Article  MATH  MathSciNet  Google Scholar 

  15. Gibilisco, P., Riccomagno, E., Rogantin, M.P., Wynn, H.P.: Algebraic and Geometric Methods in Statistics. Cambridge University Press, Cambridge (2009)

    Google Scholar 

  16. Huber, B., Sturmfels, B.: A polyhedral method for solving sparse polynomial systems. Math. Comput. 64, 1541–1555 (1995)

    Article  MATH  MathSciNet  Google Scholar 

  17. Kuriki, S., Takemura, A.: On the equivalence of the tube and Euler characteristic methods for the distribution of the maximum of Gaussian fields over piecewise smooth domains. Ann. Appl. Probab. 12(2), 768–796 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  18. Lee, T.L., Li, T.Y., Tsai, C.H.: HOM4PS2.0: a software package for solving polynomial systems by the polyhedral homotopy continuation method. Computing 83(2–3), 109–133 (2008)

    Article  MATH  MathSciNet  Google Scholar 

  19. Li, T.Y.: Numerical solution of multivariate polynomial systems by homotopy continuation methods. Acta Numer 6(1), 399–436 (1997)

    Article  Google Scholar 

  20. Naiman, D.Q.: Conservative confidence bands in curvilinear regression. Ann. Stat. 14(3), 896–906 (1986)

    Article  MATH  MathSciNet  Google Scholar 

  21. Pistone, G., Sempi, C.: An infinite-dimensional geometric structure on the space of all the probability measures equivalent to a given one. Ann. Stat. 23, 1543–1561 (1995)

    Google Scholar 

  22. Pistone, G., Wynn, H.P.: Generalised confounding with Gröbner bases. Biometrika 83, 653–666 (1996)

    Article  MATH  MathSciNet  Google Scholar 

  23. Rao, R.C.: Information and accuracy attainable in the estimation of statistical parameters. Bull. Calcutta Math. Soc. 37(3), 81–91 (1945)

    MATH  MathSciNet  Google Scholar 

  24. Verschelde, J.: Algorithm 795: PHCpack: a general-purpose solver for polynomial systems by homotopy continuation. ACM Trans. Math. Softw. (TOMS) 25(2), 251–276 (1999)

    Article  MATH  Google Scholar 

  25. Watanabe, S.: Algebraic analysis for singular statistical estimation. In: Watanabe, O., Yokomori, T. (eds.) Algorithmic Learning Theory. Springer, Berlin (1999)

    Google Scholar 

  26. Weyl, H.: On the volume of tubes. Am. J. Math. 61, 461–472 (1939)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgments

This paper has benefited from conversations with and advice from a number of colleagues. We should thank Satoshi Kuriki, Tomonari Sei, Wicher Bergsma and Wilfred Kendall. The first author acknowledges support by JSPS KAKENHI Grant 20700258, 24700288 and the second author acknowledges support from the Institute of Statistical Mathematics for two visits in 2012 and 2013 and from UK EPSRC Grant EP/H007377/1. A first version of this paper was delivered at the WOGAS3 meeting at the University of Warwick in 2011. We thank the sponsors. The authors also thank the referees of the short version in GSI2013 and the referees of the first long version of the paper for insightful suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kei Kobayashi .

Editor information

Editors and Affiliations

Appendices

A Normal Forms

A basic text for the materials in this section is [9]. The rapid growth of modern computational algebra can be credited to the celebrated Buchberger’s algorithm [8].

A monomial ideal \(I\) in a polynomial ring \(K[x_1,\ldots ,x_n]\) over a field \(K\) is an ideal for which there is a collection of monomials \(f_1,\ldots , f_m\) such that any \(g \in I\) can be expressed as a sum

$$ g = \sum _{i=1}^m g_i(x)f_i(x) $$

with some polynomials \(g_i\in K[x_1,\ldots ,x_n]\). We can appeal to the representation of a monomial \(x^{\alpha }=x_1^{\alpha _1}\ldots x_n^{\alpha _n}\) by its exponent \(\alpha =(\alpha _1,\ldots ,\alpha _n)\). If \(\beta \ge 0\) is another exponent then

$$x^{\alpha } x^{\beta } = x^{\alpha + \beta },$$

and \(\alpha + \beta \) is in the positive (shorthand for non-negative) “orthant” with corner at \(\alpha \). The set of all monomials in a monomial ideal is the union of all positive orthants whose “corners” are given by the exponent vectors of the generating monomial \(f_1, \ldots , f_m\). A monomial ordering written \(x^{\alpha } \prec x^{\beta }\) is a total (linear) ordering on monomials such that for \(\gamma \ge 0\), \(x^{\alpha } \prec x^{\beta } \Rightarrow x^{\alpha +\gamma } \prec x^{\beta + \gamma }\). Any polynomial \(f(x)\) has a leading terms with respect to \(\prec \), written \(LT(f)\).

There are, in general, many ways to express a given ideal \(I\) as being generated from a basis \(I = \langle f_1,\ldots , f_m\rangle \). That is to say, there are many choices of basis. Given an ideal \(I\) a set \(\{g_1, \ldots g_m\}\) is called a Gröbner basis (G-basis) if:

$$\langle LT(g_1), \ldots , LT(g_m)\rangle \; = \; \langle LT(I)\rangle ,$$

where \(\langle LT(I)\rangle \) is the ideal generated by all the monomials in \(I\). We sometimes refer to \(\langle LT(I) \rangle \) as the leading term ideal. Any ideal \(I\) has a Gröbner basis and any Gröbner basis in the ideal is a basis of the ideal.

Given a monomial ordering and an ideal expressed in terms of the G-basis, \(I\;=\; \langle g_1,\ldots , g_m\rangle \), any polynomial \(f\) has a unique remainder with respect the quotient operation \(K[x_1, \ldots , x_k]/I\). That is

$$f = \sum _{i=1}^m s_i(x)g_i(x) + r(x).$$

We call the remainder \(r(x)\) the normal form of \(f\) with respect to \(I\) and write \(NF(f)\). Or, to stress the fact that it may depend on \(\prec \), we write \(NF(f, \prec )\). Given a monomial ordering \(\prec \), a polynomial \(f=\sum _{\alpha \in L} \theta _{\alpha } x^{\alpha }\) for some \(L\) is a normal form with respect to \(\prec \) if \(x^{\alpha } \notin \langle LT(f) \rangle \) for all \(\alpha \in L\). An equivalent way of saying this is: given an ideal \(I\) and a monomial ordering \(\prec \), for every \(f \in K[x_1,\ldots ,x_k]\) there is a unique normal form \(NF(f)\) such that \(f-NF(f) \in I\).

B Homotopy Continuation Method

Homotopy continuation method is an algorithm to find the solutions of simultaneous polynomial equations numerically. See, for example, [19, 24] for more details of the algorithm and theory.

We will explain the method briefly by a simple example of 2 equations with 2 unknowns

$$ \text {Input:} f, g \in \mathbb {R}[x,y] $$
$$ \text {Output: The solutions of} \,f(x,y)=g(x,y)=0. $$

 

Step 1 :

Select arbitrary polynomials of the form:

$$\begin{aligned} f_0(x,y):=f_0(x):=a_1x^{d_1}-b_1&=0,\nonumber \\ g_0(x,y):=g_0(y):=a_2y^{d_2}-b_2&=0 \end{aligned}$$
(6.8)

where \(d_1= \deg (f)\) and \(d_2=\deg (g)\). Polynomial equations in this form are easy to solve.

Step 2 :

Take the convex combinations:

$$\begin{aligned} f_t(x,y):=\,&t f(x,y)+ (1-t) f_0(x,y),\\ g_t(x,y):=\,&t g(x,y)+ (1-t) g_0(x,y) \end{aligned}$$

then our target becomes the solution for \(t=1\).

Step 3 :

Compute the solution for \(t=\delta \) for small \(\delta \) by the solution for \(t=0\) numerically.

Step 4 :

Repeat this until we obtain the solution for \(t=1\).

 

Figure 6.4 shows a sketch of the algorithm. This algorithm is called the (linear) homotopy continuation method and justified if the path connects \(t=0\) and \(t=1\) continuously without an intersection. That can be proved for almost all \(a\) and \(b\). See [19].

Fig. 6.4
figure 4

Paths for the homotopy continuation method

For each computation for the homotopy continuation method, the number of the paths is the number of the solutions of (6.8). In this case, the number of paths is \(d_1 d_2\). In general case with \(m\) unknowns, it becomes \(\prod _{i=1}^m d_i\) and this causes a serious problem for computational cost. Therefore decreasing the degree of second-order efficient estimators plays an important role for the homotopy continuation method.

Note that in order to solve this computational problem, the authors of [16] proposed the nonlinear homotopy continuation methods (or the polyhedral continuation methods). But as we can see in Sect. 6.5.2, the degree of the polynomials still affects the computational costs.

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Kobayashi, K., Wynn, H.P. (2014). Computational Algebraic Methods in Efficient Estimation. In: Nielsen, F. (eds) Geometric Theory of Information. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-319-05317-2_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05317-2_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05316-5

  • Online ISBN: 978-3-319-05317-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics