Computational Algebraic Methods in Efficient Estimation

Kobayashi, Kei; Wynn, Henry P.

doi:10.1007/978-3-319-05317-2_6

Kei Kobayashi² &
Henry P. Wynn³

Part of the book series: Signals and Communication Technology ((SCT))

1741 Accesses

Abstract

A strong link between information geometry and algebraic statistics is made by investigating statistical manifolds which are algebraic varieties. In particular it is shown how first and second order efficient estimators can be constructed, such as bias corrected Maximum Likelihood and more general estimators, and for which the estimating equations are purely algebraic. In addition it is shown how Gröbner basis technology, which is at the heart of algebraic statistics, can be used to reduce the degrees of the terms in the estimating equations. This points the way to the feasible use, to find the estimators, of special methods for solving polynomial equations, such as homotopy continuation methods. Simple examples are given showing both equations and computations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Adler, R.J., Taylor, J.E.: Random Fields and Geometry. Springer Monographs in Mathematics. Springer, New York (2007)
Google Scholar
Amari, S., Nagaoka, H.: Methods of Information Geometry, vol. 191. American Mathematical Society, Providence (2007)
Google Scholar
Amari, S.: Differential geometry of curved exponential families-curvatures and information loss. Ann. Stat. 10, 357–385 (1982)
Google Scholar
Amari, S.: Differential-Geometrical Methods in Statistics. Springer, New York (1985)
Google Scholar
Amari, S., Kumon, M.: Differential geometry of Edgeworth expansions in curved exponential family. Ann. Inst. Stat. Math. 35(1), 1–24 (1983)
MATH MathSciNet Google Scholar
Andersson, S., Madsen, J.: Symmetry and lattice conditional independence in a multivariate normal distribution. Ann. Stat. 26(2), 525–572 (1998)
MATH MathSciNet Google Scholar
Bergsma, W.P., Croon, M., Hagenaars, J.A.: Marginal Models for Dependent, Clustered, and Longitudinal Categorical Data. Springer, New York (2009)
Google Scholar
Buchberger, B.: Bruno Buchberger’s PhD thesis 1965: an algorithm for finding the basis elements of the residue class ring of a zero dimensional polynomial ideal. J. Symbol. Comput. 41(3), 475–511 (2006)
Google Scholar
Cox, D.A., Little, J., O’Shea, D.: Ideals, Varieties, and Algorithms: An Introduction to Computational Algebraic Geometry and Commutative Algebra, 3/e (Undergraduate Texts in Mathematics). Springer, New York (2007)
Google Scholar
Dawid, A.P.: Further comments on some comments on a paper by Bradley Efron. Ann. Stat. 5(6), 1249 (1977)
Article MATH MathSciNet Google Scholar
Drton, M., Sturmfels B., Sullivant, S.: Lectures on Algebraic Statistics. Springer, New York (2009)
Google Scholar
Drton, M.: Likelihood ratio tests and singularities. Ann. Stat. 37, 979–1012 (2009)
Google Scholar
Efron, B.: Defining the curvature of a statistical problem (with applications to second order efficiency). Ann. Stat. 3, 1189–1242 (1975)
Article MATH MathSciNet Google Scholar
Gehrmann, H., Lauritzen, S.L.: Estimation of means in graphical Gaussian models with symmetries. Ann. Stat. 40(2), 1061–1073 (2012)
Article MATH MathSciNet Google Scholar
Gibilisco, P., Riccomagno, E., Rogantin, M.P., Wynn, H.P.: Algebraic and Geometric Methods in Statistics. Cambridge University Press, Cambridge (2009)
Google Scholar
Huber, B., Sturmfels, B.: A polyhedral method for solving sparse polynomial systems. Math. Comput. 64, 1541–1555 (1995)
Article MATH MathSciNet Google Scholar
Kuriki, S., Takemura, A.: On the equivalence of the tube and Euler characteristic methods for the distribution of the maximum of Gaussian fields over piecewise smooth domains. Ann. Appl. Probab. 12(2), 768–796 (2002)
Article MATH MathSciNet Google Scholar
Lee, T.L., Li, T.Y., Tsai, C.H.: HOM4PS2.0: a software package for solving polynomial systems by the polyhedral homotopy continuation method. Computing 83(2–3), 109–133 (2008)
Article MATH MathSciNet Google Scholar
Li, T.Y.: Numerical solution of multivariate polynomial systems by homotopy continuation methods. Acta Numer 6(1), 399–436 (1997)
Article Google Scholar
Naiman, D.Q.: Conservative confidence bands in curvilinear regression. Ann. Stat. 14(3), 896–906 (1986)
Article MATH MathSciNet Google Scholar
Pistone, G., Sempi, C.: An infinite-dimensional geometric structure on the space of all the probability measures equivalent to a given one. Ann. Stat. 23, 1543–1561 (1995)
Google Scholar
Pistone, G., Wynn, H.P.: Generalised confounding with Gröbner bases. Biometrika 83, 653–666 (1996)
Article MATH MathSciNet Google Scholar
Rao, R.C.: Information and accuracy attainable in the estimation of statistical parameters. Bull. Calcutta Math. Soc. 37(3), 81–91 (1945)
MATH MathSciNet Google Scholar
Verschelde, J.: Algorithm 795: PHCpack: a general-purpose solver for polynomial systems by homotopy continuation. ACM Trans. Math. Softw. (TOMS) 25(2), 251–276 (1999)
Article MATH Google Scholar
Watanabe, S.: Algebraic analysis for singular statistical estimation. In: Watanabe, O., Yokomori, T. (eds.) Algorithmic Learning Theory. Springer, Berlin (1999)
Google Scholar
Weyl, H.: On the volume of tubes. Am. J. Math. 61, 461–472 (1939)
Article MathSciNet Google Scholar

Download references

Acknowledgments

This paper has benefited from conversations with and advice from a number of colleagues. We should thank Satoshi Kuriki, Tomonari Sei, Wicher Bergsma and Wilfred Kendall. The first author acknowledges support by JSPS KAKENHI Grant 20700258, 24700288 and the second author acknowledges support from the Institute of Statistical Mathematics for two visits in 2012 and 2013 and from UK EPSRC Grant EP/H007377/1. A first version of this paper was delivered at the WOGAS3 meeting at the University of Warwick in 2011. We thank the sponsors. The authors also thank the referees of the short version in GSI2013 and the referees of the first long version of the paper for insightful suggestions.

Author information

Authors and Affiliations

The Institute of Statistical Mathematics, 10-3, Midori-cho, Tachikawa, Tokyo, Japan
Kei Kobayashi
London School of Economics, London, UK
Henry P. Wynn

Authors

Kei Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Henry P. Wynn
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kei Kobayashi .

Editor information

Editors and Affiliations

Laboratoire d'Informatique (LIX), Polytechnique School, Palaiseau Cedex, France
Frank Nielsen

Appendices

A Normal Forms

A basic text for the materials in this section is [9]. The rapid growth of modern computational algebra can be credited to the celebrated Buchberger’s algorithm [8].

A monomial ideal $I$ in a polynomial ring $K[x_1,\ldots ,x_n]$ over a field $K$ is an ideal for which there is a collection of monomials $f_1,\ldots , f_m$ such that any $g \in I$ can be expressed as a sum

$$ g = \sum _{i=1}^m g_i(x)f_i(x) $$

with some polynomials $g_i\in K[x_1,\ldots ,x_n]$. We can appeal to the representation of a monomial $x^{\alpha }=x_1^{\alpha _1}\ldots x_n^{\alpha _n}$ by its exponent $\alpha =(\alpha _1,\ldots ,\alpha _n)$. If $\beta \ge 0$ is another exponent then

$$x^{\alpha } x^{\beta } = x^{\alpha + \beta },$$

and $\alpha + \beta $ is in the positive (shorthand for non-negative) “orthant” with corner at $\alpha $. The set of all monomials in a monomial ideal is the union of all positive orthants whose “corners” are given by the exponent vectors of the generating monomial $f_1, \ldots , f_m$. A monomial ordering written $x^{\alpha } \prec x^{\beta }$ is a total (linear) ordering on monomials such that for $\gamma \ge 0$, $x^{\alpha } \prec x^{\beta } \Rightarrow x^{\alpha +\gamma } \prec x^{\beta + \gamma }$. Any polynomial $f(x)$ has a leading terms with respect to $\prec $, written $LT(f)$.

There are, in general, many ways to express a given ideal $I$ as being generated from a basis $I = \langle f_1,\ldots , f_m\rangle $. That is to say, there are many choices of basis. Given an ideal $I$ a set $\{g_1, \ldots g_m\}$ is called a Gröbner basis (G-basis) if:

$$\langle LT(g_1), \ldots , LT(g_m)\rangle \; = \; \langle LT(I)\rangle ,$$

where $\langle LT(I)\rangle $ is the ideal generated by all the monomials in $I$. We sometimes refer to $\langle LT(I) \rangle $ as the leading term ideal. Any ideal $I$ has a Gröbner basis and any Gröbner basis in the ideal is a basis of the ideal.

Given a monomial ordering and an ideal expressed in terms of the G-basis, $I\;=\; \langle g_1,\ldots , g_m\rangle $, any polynomial $f$ has a unique remainder with respect the quotient operation $K[x_1, \ldots , x_k]/I$. That is

$$f = \sum _{i=1}^m s_i(x)g_i(x) + r(x).$$

We call the remainder $r(x)$ the normal form of $f$ with respect to $I$ and write $NF(f)$. Or, to stress the fact that it may depend on $\prec $, we write $NF(f, \prec )$. Given a monomial ordering $\prec $, a polynomial $f=\sum _{\alpha \in L} \theta _{\alpha } x^{\alpha }$ for some $L$ is a normal form with respect to $\prec $ if $x^{\alpha } \notin \langle LT(f) \rangle $ for all $\alpha \in L$. An equivalent way of saying this is: given an ideal $I$ and a monomial ordering $\prec $, for every $f \in K[x_1,\ldots ,x_k]$ there is a unique normal form $NF(f)$ such that $f-NF(f) \in I$.

B Homotopy Continuation Method

Homotopy continuation method is an algorithm to find the solutions of simultaneous polynomial equations numerically. See, for example, [19, 24] for more details of the algorithm and theory.

We will explain the method briefly by a simple example of 2 equations with 2 unknowns

$$ \text {Input:} f, g \in \mathbb {R}[x,y] $$

$$ \text {Output: The solutions of} \,f(x,y)=g(x,y)=0. $$

Step 1 :

Select arbitrary polynomials of the form:

$$\begin{aligned} f_0(x,y):=f_0(x):=a_1x^{d_1}-b_1&=0,\nonumber \\ g_0(x,y):=g_0(y):=a_2y^{d_2}-b_2&=0 \end{aligned}$$

(6.8)

where $d_1= \deg (f)$ and $d_2=\deg (g)$. Polynomial equations in this form are easy to solve.

Step 2 :

Take the convex combinations:

$$\begin{aligned} f_t(x,y):=\,&t f(x,y)+ (1-t) f_0(x,y),\\ g_t(x,y):=\,&t g(x,y)+ (1-t) g_0(x,y) \end{aligned}$$

then our target becomes the solution for $t=1$.

Step 3 :

Compute the solution for $t=\delta $ for small $\delta $ by the solution for $t=0$ numerically.

Step 4 :

Repeat this until we obtain the solution for $t=1$.

Figure 6.4 shows a sketch of the algorithm. This algorithm is called the (linear) homotopy continuation method and justified if the path connects $t=0$ and $t=1$ continuously without an intersection. That can be proved for almost all $a$ and $b$. See [19].

For each computation for the homotopy continuation method, the number of the paths is the number of the solutions of (6.8). In this case, the number of paths is $d_1 d_2$. In general case with $m$ unknowns, it becomes $\prod _{i=1}^m d_i$ and this causes a serious problem for computational cost. Therefore decreasing the degree of second-order efficient estimators plays an important role for the homotopy continuation method.

Note that in order to solve this computational problem, the authors of [16] proposed the nonlinear homotopy continuation methods (or the polyhedral continuation methods). But as we can see in Sect. 6.5.2, the degree of the polynomials still affects the computational costs.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kobayashi, K., Wynn, H.P. (2014). Computational Algebraic Methods in Efficient Estimation. In: Nielsen, F. (eds) Geometric Theory of Information. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-319-05317-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-05317-2_6
Published: 09 May 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05316-5
Online ISBN: 978-3-319-05317-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics