Abstract
In this chapter we review some popular methods for estimating the statistical significance of various conclusions that can be drawn from experimental data. These include the \(\chi ^2\)-text, the Kolmogorov–Smirnov (K–S) test for goodness of fit, the ‘student’ \(t\)-test for testing the null hypothesis that two sets of data have the same mean, Significance Analysis for Microarrays (SAM), Pattern Analysis for Microarrays (PAM) and Gene Set Enhancement Analysis (GSEA).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Note that the Glivenko-Cantelli lemma does not require \(\varPhi _X(\cdot )\) to be continuous.
- 2.
Strictly speaking, we should first define a \(\sigma \)-algebra \(\mathcal{S}\) of subsets of \(X\) and assume that \(\mathcal{A}\subseteq \mathcal{S}\). Such details are glossed over here but the treatment in [9] is quite precise.
- 3.
We mostly follow the notation in [11], in which the letter S in various fonts is used to denote various quantities. The reader is therefore urged to pay careful attention.
References
Glivenko, V.I.: Sulla determinazione empirica delle legge di probabilità. Giorn. dell’Ist. Italia degli Attuari 4, 92–99 (1933)
Cantelli, F.P.: Sulla determinazione empirica delle legge di probabilità. Giorn. dell’Ist. Italia degli Attuari 4, 421–424 (1933)
Gnedenko, B.V.: Theory of probability, 4th edn. Chelsea, New York (1968)
Loève, M.: Probability Theory I, 4th edn. Springer, Heidelberg (1977)
Kolmogoroff, A.: Sulla determinazione empirica di una legge di probabilita. Giorn. dell’Ist. Italia degli Attuari 4, 83–91 (1933)
Smirnov, N.: Ob uknonenijah empiriceskoi krivoi raspredelenija. Recueil Mathematique (Matematiceskii Sbornik) 6(48), 3–26 (1939)
Feller, W.: On the kolmogorov-smirnov limit theorems for empirical distributions. Ann. Math. Stat. 19(2), 177–189 (1948)
Justel, A., Pena, D., Zamar, R.: A multivariable kolmogorov-smirnov test of goodness of fit. Stat. Probab. Lett. 35, 251–259 (1997)
Vidyasagar, M.: Learning and generalization: with applications to neural networks and control systems. Springer, London (2003)
Virginia Goss Tusher, R.T., Chu, G.: Significance analysis of microarrays applied to the ionizing radiation responses. Proc. Natl. Acad. Sci 98(9), 5116–5121 (2001)
Efron, B., Tibshirani, R.: On testing the significance of a set of genes. Ann. Appl. Stat. 1(1), 107–129 (2007)
Subramanian, A., et al.: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. 102(43), 15,545–15,550 (2005)
Tibshirani, R., et al.: Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc. Natl. Acad. Sci. 99(10), 6567–6572 (2002)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2012 The Author(s)
About this chapter
Cite this chapter
Vidyasagar, M. (2012). Analyzing Statistical Significance. In: Computational Cancer Biology. SpringerBriefs in Electrical and Computer Engineering(). Springer, London. https://doi.org/10.1007/978-1-4471-4751-0_2
Download citation
DOI: https://doi.org/10.1007/978-1-4471-4751-0_2
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4750-3
Online ISBN: 978-1-4471-4751-0
eBook Packages: Computer ScienceComputer Science (R0)