Hypothesis Testing Theory

Mittelhammer, Ron C.

doi:10.1007/978-1-4614-5022-1_9

Ron C. Mittelhammer²

6528 Accesses

Abstract

A primary goal of scientific research often concerns the verification or refutation of assertions, conjectures, currently accepted laws, or descriptions relating to a given economic, sociological, psychological, physical, or biological process or population. Statistical hypothesis testing concerns the use of probability samples of observations from processes or populations of interest, together with probability and mathematical statistics principles, to judge the validity of stated assertions, conjectures, laws, or descriptions in such a way that the probability of falsely rejecting a correct hypothesis can be controlled, while the probability of rejecting false hypotheses is made as large as possible. The precise nature of the types of errors that can be made, how the probabilities of such errors can be controlled, and how one designs a test so that the probability of rejecting false hypotheses is as large as possible is the subject of this chapter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Hardcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Compare this set to the range of X over Ω introduced in our discussion of minimal sufficient statistics, Section 7.4.
2.
The terminology “acceptable region” is sometimes replaced by “acceptance region” in the literature. We will see later that while the behavior of sample outcomes may be “acceptable” to H given the characteristics of the probability space implied by H, there are statistical reasons why one might not want to literally conclude acceptance of H on the basis of this “acceptable” behavior. We will clarify this subtle but important distinction with additional rigorous rationale later, which will motivate further why we choose to use the terminology “acceptable”.
3.
For convenience, we have chosen to “connect the dots” and display the graph as a continuous curve. We will continue with this practice wherever it is convenient and useful.
4.
If the magnitudes of the costs or losses incurred when errors are omitted can be expressed in terms of a loss function, then a formal analysis of expected losses can lead to a choice of type I and type II error probabilities. For an introduction to the ideas involved, see Mood, A., F. Graybill, and D. Boes, (1974). Introduction to the Theory of Statistics, 3rd Ed., New York: McGraw-Hill, pp. 414–418.
5.
In parametric models, sets of fully specified probability distributions will be identified, whereas in semiparametric models, only a subset of moments or other characteristics of the underlying probability distributions are generally identified by the hypotheses.
6.
The properties we will examine do not exhaust the possibilities. See E. Lehmann, (1986), Testing Statistical Hypotheses, John Wiley, NY.
7.
Recall that $ {{\sup}_{{{\mathbf{\Theta}} \in H}}}\{\pi \left( {\mathbf{\Theta}} \right)\} $ denotes the smallest upper bound to the values of π(Θ) for Θ ∈ H (i.e., the supremum). If the maximum of π(Θ) for Θ ∈ H exists, then sup is the same as max.
8.
Recall the previous footnote, and the fact that $ {{\inf}_{{\Theta \in \bar{H}}}} $ {π(Θ)} denotes the largest lower bound to the values of π(Θ) for Θ ∈ $ \bar{H} $ (i.e., the infimum). The sup and inf of π(Θ) are equivalent to max and min, respectively, when the maximum and/or minimum exists.
9.
In the case of continuous X, the choice of size is generally a continuous interval contained in [0,1]. If X is discrete, the set of choices for size is generally finite, as previous examples have illustrated.
10.
The maximum is achievable in this case, and equals .05 when μ = 15.
11.
Note that $ \mathop{{\min }}\limits_{{\mu > 15}} \left\{ {{{\pi}_{\rm{n}}}(\mu )} \right\} $does not exist in this case. The largest possible lower bound (i.e., the infimum) is .05, which is < π_n(μ), ∀ μ > 15.
12.
Neyman, J. and E.S. Pearson, “On the Problem of the Most Efficient Tests of Statistical Hypotheses,” Phil. Trans., A, vol. 231, 1933, p. 289.
13.
Recall that the support of a density function is the set of x-values for which f(x;Θ ₀) > 0, i.e., {x:f(x;Θ ₀) > 0} is the support of the density f(x;Θ ₀).
14.
This limitation can be overcome, in principle, by utilizing what are known as randomized tests. Essentially, the test rule is made to depend not only on the outcomes of X but also on auxiliary random variables that are independent of X, so as to allow any level of test size to be achieved. However, the fact that the test outcome can depend on random variables that are independent of the experiment under investigation has discouraged its use in practice. For an introduction to the ideas involved, see Kendall, M. and A. Stuart, (1979) The Advanced Theory of Statistics, Vol. 2, 4th Edition, New York: MacMillan, 1979, p. 180–181. Also, see problem 9.8.
15.
This can be shown via the MGF approach, since the MGF of $ {\sum\nolimits_{{i = 1}}^{{100}} {{{X}_i}} = \prod\nolimits_{{i = 1}}^{{100}} {{{M}_{{{{X}_i}}}}} (t) = \prod\nolimits_{{i = 1}}^{{100}} {{{\left( {1 - \theta t} \right)}}^{{ - 1}}} = {{{\left( {1 - \theta t} \right)}}^{{ - 100}}} \ \rm{for} \ {\it t} < {{\theta}^{{ - 1}}}} $, which is of the gamma form with β = θ, α = 100.
16.
As we noted previously, it is possible to use a randomized test to achieve a size of .05 exactly, but the test can depend on the outcome of a random variable that has nothing to do with the experiment being analyzed. See problem 9.8 for an example of this approach. Randomized tests are not often used in practice.
17.
To this point, we have established UMP tests in the class of all tests of a certain level α. The reader should note that in all cases examined heretofore, we have shown that UMP level α tests were also unbiased. This is clearly different than examining only unbiased tests of a certain level α, and within this restricted set of tests, attempting to find one that is UMP.
18.
Results are available for a more general class of densities referred to as Polya distributions, which subsumes the exponential class densities as a special case. However, the mathematics involved in analyzing the more general distributions is beyond the scope of our study. Interested readers can consult the work of S. Karlin, (1957), “Polya Type Distributions II,” Ann. Math. Stat., 28, pp. 281–308.
19.
We are assuming that C _r is such that a power function is defined, i.e., C _r can be assigned probability by f(x;Θ), Θ∈Ω.
20.
One can show using the monotone likelihood ratio approach that a UMP level α test of H ₀ versus H _a does not exist. Recall Theorem 9.9.
21.
The algorithm actually used was the NLSYS procedure in the GAUSS Matrix language.
22.
The algorithm actually used was the NLSYS procedure in the GAUSS matrix language.
23.
We are assuming that C _r is such that a power function is defined, i.e., C _r can be assigned probability by f(x;Θ), Θ ∈ Ω.
24.
Note that c = (c ₁,…,c _k) could be viewed as an alternative parameterization of the exponential class of densities, where
$$ {{f}_{*}}\left( {{\bf x};{\bf c}} \right) = \exp \left( {\sum\limits_{{i = 1}}^k {{{c}_i}} {{g}_i}\left( {\bf x} \right) + {{d}_{*}}\left( {\bf c} \right) + z\left( {\bf x} \right)} \right){{I}_A}\left( {\bf x} \right),{c}\in {{\Omega}_c}, $$
with $ {{d}_{*}}\left( {c} \right) = \ln {{\left( {\int_{{ - \infty }}^{\infty } \cdots \int_{{ - \infty }}^{\infty } {\exp } \left( {\sum\nolimits_{{i = 1}}^k {{{c}:g:}} \left( {\bf x} \right) + z\left( {\bf x} \right)} \right){{I}_A}\left( {\bf x} \right)d{\bf x}} \right)}^{{ - 1}}} $(use summation in the discrete case). This parameterization is referred to as the natural parameterization of the exponential class of densities. Note that the definition of d _*(c) is a direct result of the fact that the density must integrate (or sum) to 1.
25.
Except, perhaps, on a set having probability zero.
26.
The surface area of an n-dimensional hypersphere is given by $ {A = \left( {n{{r}^{{n - 1}}}{{\pi}^{{n/2}}}} \right)/\Gamma \left( {\left( {n/2} \right) + 1} \right),} $ where r is the radius of the hypersphere. See R.G. Bartle, (1976), The Elements of Real Analysis, 2nd Ed., John Wiley, pp. 454–455, and note that the surface area can be defined by differentiating the volume of the hypersphere with respect to r. For n = 2, the bracketed expression becomes simply 2π(s*₂)^1/2, which is the familiar 2πr.
27.
Expanding $ {\sum\nolimits_{{i = 1}}^n {{{{\left[ {\left( {{{x}_i} - \bar{x}} \right) + \left( {\bar{x} - {{\mu}_0}} \right)} \right]}}^2}} } $ leads to the result.
28.
If the likelihood ratio is strictly increasing in t(x), then c can be chosen to satisfy g(c) = k ⁻¹.

Author information

Authors and Affiliations

School of Economic Sciences, Washington State University, Pullman, Washington, USA
Ron C. Mittelhammer

Authors

Ron C. Mittelhammer
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Mittelhammer, R.C. (2013). Hypothesis Testing Theory. In: Mathematical Statistics for Economics and Business. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-5022-1_9

Download citation

DOI: https://doi.org/10.1007/978-1-4614-5022-1_9
Published: 06 October 2012
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-5021-4
Online ISBN: 978-1-4614-5022-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics