Abstract
This paper concerns models for a vector of probabilities whose elements must have a multiplicative structure and, at the same time, sum to 1; in certain applications, such as basket analysis, these models may be seen as a constrained version of quasi-independence. After reviewing the basic properties of the models, their geometric features as a curved exponential family are investigated. An improved algorithm for computing maximum likelihood estimates is introduced, and new insights are provided on the underlying geometry. The asymptotic distributions of three statistics for hypothesis testing are derived, and a small simulation study is presented to investigate the accuracy of the asymptotic approximations.
Acknowledgements
The author would like to thank A. Klimova and T. Rudas for sharing ideas concerning relational models and for several very enlightening discussions, A. Salvan for comments on the nature of the curved exponential family, and P. Giudici for providing the basket data.
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Appendix
1.1 Multinomial and Poisson as exponential families
Let \({\varvec{v}}\sim \) Mn\((n,{\varvec{\pi }})\), where \({\varvec{\pi }}\) has dimension q; a multivariate logistic transform of \({\varvec{\pi }}\) may be defined as \(\log {\varvec{\pi }}\) = \({\varvec{G}}{\varvec{\lambda }}-{\varvec{1}}_q\log [{\varvec{1}}_q^{\prime }\exp ({\varvec{G}}{\varvec{\lambda }})]\), where \({\varvec{\lambda }}\) is a vector of canonical parameters determined by \({\varvec{G}}\), an arbitrary \(q \times (q-1)\) matrix of full rank whose column space does not contain the vector of ones. The kernel of the log of the probability distribution may be written as
$$ {\varvec{v}}^{\prime }\log {\varvec{\pi }} = {\varvec{t}}^{\prime }{\varvec{\lambda }} - K({\varvec{\lambda }}); $$
both \({\varvec{\lambda }}\) and \({\varvec{t}}\) = \({\varvec{G}}^{\prime }{\varvec{v}}\), the vector of sufficient statistics, have size \(q-1\), and \(K({\varvec{\lambda }})\) = \(n\log [{\varvec{1}}^{\prime }\exp ({\varvec{G}}{\varvec{\lambda }})]\).
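As a numerical illustration, the transform and the kernel identity \({\varvec{v}}^{\prime }\log {\varvec{\pi }} = {\varvec{t}}^{\prime }{\varvec{\lambda }}-K({\varvec{\lambda }})\) can be checked directly; the particular \({\varvec{G}}\) below is a hypothetical choice made only for the sketch:

```python
import numpy as np

q = 4
rng = np.random.default_rng(0)
# A simple (hypothetical) choice of G: its q-1 columns are the first
# q-1 unit vectors of R^q, so G has full rank and its column span
# does not contain the vector of ones.
G = np.vstack([np.eye(q - 1), np.zeros((1, q - 1))])
lam = rng.normal(size=q - 1)                      # canonical parameters
log_pi = G @ lam - np.log(np.exp(G @ lam).sum())  # multivariate logistic transform
pi = np.exp(log_pi)                               # probabilities, summing to 1

n = 50
v = rng.multinomial(n, pi)                        # v ~ Mn(n, pi)
t = G.T @ v                                       # sufficient statistics, size q-1
K = n * np.log(np.exp(G @ lam).sum())
# kernel identity: v' log(pi) = t' lambda - K(lambda)
assert np.isclose(v @ log_pi, t @ lam - K)
```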
To derive an explicit expression for \({\varvec{\lambda }}\), let \({\varvec{R}}\) = \({\varvec{I}}_q-{\varvec{1}}_q{\varvec{1}}_q^{\prime }/q\) and
$$ {\varvec{D}} = ({\varvec{G}}^{\prime }{\varvec{R}}{\varvec{G}})^{-1}{\varvec{G}}^{\prime }{\varvec{R}}; $$
then \({\varvec{\lambda }}\) = \({\varvec{D}}\log {\varvec{\pi }}\) is a vector of \(q-1\) canonical parameters. To see why the coefficients of any linear constraint on the canonical parameters must sum to 0, note that \({\varvec{D}}{\varvec{1}}_q\) = \({\varvec{0}}_{q-1}\). To introduce linear restrictions on \({\varvec{\lambda }}\), assume that \({\varvec{G}}\) is partitioned as \(({\varvec{X}}\,\, {\varvec{Z}})\), where \({\varvec{Z}}\) is such that \({\varvec{Z}}^{\prime }{\varvec{R}}{\varvec{X}}\) = \({\varvec{0}}\); let also \({\varvec{H}}\) = \(({\varvec{Z}}^{\prime }{\varvec{R}}{\varvec{Z}})^{-1}{\varvec{Z}}^{\prime }{\varvec{R}}\) and define \({\varvec{\eta }}\) = \({\varvec{H}}\log {\varvec{\pi }}\). Then the model \({\varvec{\lambda }}\) = \({\varvec{X}}{\varvec{\theta }}\) is equivalent to assuming that \({\varvec{\eta }}\) = \({\varvec{0}}\).
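A numerical check of the two properties just stated, taking \({\varvec{D}} = ({\varvec{G}}^{\prime }{\varvec{R}}{\varvec{G}})^{-1}{\varvec{G}}^{\prime }{\varvec{R}}\) (an assumed form consistent with \({\varvec{\lambda }} = {\varvec{D}}\log {\varvec{\pi }}\) and \({\varvec{D}}{\varvec{1}}_q = {\varvec{0}}_{q-1}\)) and the same hypothetical \({\varvec{G}}\) as before:

```python
import numpy as np

q = 4
rng = np.random.default_rng(1)
G = np.vstack([np.eye(q - 1), np.zeros((1, q - 1))])  # hypothetical G, as above
R = np.eye(q) - np.ones((q, q)) / q                   # centring matrix
D = np.linalg.solve(G.T @ R @ G, G.T @ R)             # assumed D = (G'RG)^{-1} G'R

lam = rng.normal(size=q - 1)
log_pi = G @ lam - np.log(np.exp(G @ lam).sum())
assert np.allclose(D @ log_pi, lam)      # recovers the canonical parameters
assert np.allclose(D @ np.ones(q), 0.0)  # coefficients in each row sum to 0
```

The second assertion holds because \({\varvec{R}}{\varvec{1}}_q = {\varvec{0}}\), which is exactly why linear constraints on canonical parameters must have coefficients summing to 0.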
If, instead, the elements of \({\varvec{v}}\) were distributed as q independent Poisson variables with mean vector \({\varvec{\mu }}\), the kernel of the log of the probability distribution would be
$$ {\varvec{v}}^{\prime }{\varvec{\lambda }} - K({\varvec{\lambda }}), $$
where \({\varvec{\lambda }}\) = \(\log {\varvec{\mu }}\) and \(K({\varvec{\lambda }})\) = \({\varvec{1}}^{\prime }\exp ({\varvec{\lambda }})\).
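The Poisson kernel can be verified against the full log probability mass function, which differs from the kernel only by the \(-\log v_i!\) terms; the means and counts below are illustrative:

```python
import numpy as np
from math import lgamma

mu = np.array([2.0, 5.0, 1.5])         # illustrative Poisson means
v = np.array([3, 4, 0])                # observed counts
lam = np.log(mu)                       # canonical parameters
kernel = v @ lam - np.exp(lam).sum()   # v'lambda - K(lambda)

# full log-pmf of q independent Poisson variables
log_pmf = sum(vi * np.log(mi) - mi - lgamma(vi + 1) for vi, mi in zip(v, mu))
# kernel and log-pmf differ only by the -log(v_i!) terms
assert np.isclose(log_pmf, kernel - sum(lgamma(vi + 1) for vi in v))
```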
1.2 Proof of Lemma 1
Point (i) follows because \({\varvec{\theta }}\in \mathcal{F}({\varvec{X}})\) implies \(-{\varvec{X}}{\varvec{\theta }}>0\). Concerning (ii), let \({\varvec{C}}\) be a matrix whose columns are the generators of \(\mathcal{C}\), then any element in the interior of \(\mathcal{C}\) may be written as \({\varvec{c}}\) = \(x {\varvec{C}}{\varvec{w}}\), where \(x>0\) and the elements of \({\varvec{w}}\) are strictly positive and sum to 1. The derivative of c(x) with respect to x, computed by the chain rule, equals
To prove that d(x) is negative everywhere, note that the expression in square brackets is positive; the fact that the elements of the vector \((-{\varvec{X}}){\varvec{C}}{\varvec{w}}\) are also strictly positive follows from basic results on convex cones: the columns of \({\varvec{X}}^{\prime }\) are the generators of \(\mathcal{C}^0\), the dual cone, where an edge of \(\mathcal{C}^0\) can be orthogonal to, at most, \(k-1\) edges of \(\mathcal{C}\) and forms an obtuse angle with all the others. Because c(x) is continuous, strictly decreasing, positive for x close to 0 and negative for sufficiently large x, the value of x that satisfies (3) must be unique.
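The uniqueness argument rests only on continuity, strict monotonicity, and a sign change, so the root can be located by bisection; the function below is a purely illustrative stand-in for c(x), not the function of the lemma:

```python
# A hypothetical stand-in for c(x): continuous, strictly decreasing,
# positive near 0 and negative for large x -- the conditions used in the proof.
def c(x):
    return 1.0 - x - x**3

lo, hi = 1e-9, 10.0
assert c(lo) > 0 > c(hi)          # sign change on the bracketing interval
for _ in range(80):               # bisection; monotonicity makes the root unique
    mid = 0.5 * (lo + hi)
    if c(mid) > 0:
        lo = mid
    else:
        hi = mid
root = 0.5 * (lo + hi)
assert abs(c(root)) < 1e-9
```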
1.3 Proof of Lemma 2
To differentiate \(f(\gamma )\) = \(\log [{\varvec{1}}^{\prime }\exp ({\varvec{X}}{\varvec{\theta }}({\varvec{\gamma }}))]\) note that (4) implies \({\varvec{\tau }}(\gamma )\) = \({\varvec{X}}^{\prime }{\varvec{\pi }}(\gamma )\) = \(\gamma {\varvec{X}}^{\prime }{\varvec{p}}\). By the chain rule
The result follows because, by construction, \({\varvec{X}}^{\prime }\exp ({\varvec{X}}{\varvec{\theta }}(\gamma )) /[{\varvec{1}}^{\prime }\exp ({\varvec{X}}{\varvec{\theta }}(\gamma ))]\) = \({\varvec{\tau }}(\gamma )\) = \(\gamma {\varvec{X}}^{\prime }{\varvec{p}}\) and
Differentiation of the function \(g(\gamma )\) is similar, except that, because \({\varvec{\tau }}(\gamma )\) = \({\varvec{s}}/\gamma \), the last component in the derivative is \(-{\varvec{s}}/\gamma ^2\).
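Combining the identities stated above, the chain-rule step for \(f(\gamma )\) can be written out as follows (a sketch reconstructed from those identities, not the published display):

```latex
f'(\gamma)
  = \left[\frac{{\varvec{X}}^{\prime}\exp({\varvec{X}}{\varvec{\theta}}(\gamma))}
               {{\varvec{1}}^{\prime}\exp({\varvec{X}}{\varvec{\theta}}(\gamma))}\right]^{\prime}
    \frac{\mathrm{d}{\varvec{\theta}}}{\mathrm{d}\gamma}
  = {\varvec{\tau}}(\gamma)^{\prime}\,
    \frac{\mathrm{d}{\varvec{\theta}}}{\mathrm{d}\gamma}
  = \gamma\,{\varvec{p}}^{\prime}{\varvec{X}}\,
    \frac{\mathrm{d}{\varvec{\theta}}}{\mathrm{d}\gamma}
```

For \(g(\gamma )\), with \({\varvec{\tau }}(\gamma ) = {\varvec{s}}/\gamma \), the middle factor is replaced accordingly and the extra term \(-{\varvec{s}}/\gamma ^2\) appears, as noted above.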
Forcina, A. Estimation and testing of multiplicative models for frequency data. Metrika 82, 807–822 (2019). https://doi.org/10.1007/s00184-019-00709-6