Skip to main content

Association Measures and Statistical Significance Measures

  • 3182 Accesses

Abstract

An association measure can be used to measure the relationships between two random variables. These variables may be numeric, categorical, or binary. Statistical test statistics can often be defined for deriving association measures. For example, several statistical test statistics (Fisher Z, Student t-test, Hotelling) can be used to calculate a statistical significance level (p value) for a correlation coefficient. Multiple comparison correction (MCC) procedures are needed to protect against false positives due to multiple comparisons. The Bonferroni- and Sidak correction are very conservative MCC procedures. The q-value (local false discovery rate) MCC is often advantageous since it allows one to detect more significant variables. To calculate the false discovery rate (FDR), one considers the shape of the histogram of p values. MCC procedures can often be interpreted as transformations that increase the p value to account for the fact that multiple comparisons have been carried out. For example, the Bonferroni correlation multiplies each p value by the number of comparisons. The q-value transformation is sometimes improper, i.e., it decreases significant p values. p values and q-values can be used to screen for significant variables. The WGCNA library contains several R functions that implement standard screening criteria for finding variables (e.g., gene expression profiles) associated with a sample trait y. In practice, many seemingly different gene screening methods turn out to be significant. p values (or q-values) can be used to formulate a statistical criterion for choosing the (hard) threshold τ when defining an unweighted correlation network. Many methods for defining unweighted networks on the basis of pairwise linear relationships between variables turn out to be equivalent.

Keywords

  • False Discovery Rate
  • Significance Measure
  • Correlation Network
  • Association Measure
  • Multiple Comparison Correction

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-1-4419-8819-5_10
  • Chapter length: 29 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   169.00
Price excludes VAT (USA)
  • ISBN: 978-1-4419-8819-5
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   219.99
Price excludes VAT (USA)
Hardcover Book
USD   299.99
Price excludes VAT (USA)
Fig. 10.1
Fig. 10.2

References

  • Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc Ser B 57:289–300

    Google Scholar 

  • Hawkins DL (1989) Using U statistics to derive the asymptotic distribution of Fisher’s Z statistic. Am Stat 43(4):235–237

    Google Scholar 

  • Horvath S, Zhang B, Carlson M, Lu KV, Zhu S, Felciano RM, Laurance MF, Zhao W, Shu Q, Lee Y, Scheck AC, Liau LM, Wu H, Geschwind DH, Febbo PG, Kornblum HI, Cloughesy TF, Nelson SF, Mischel PS (2006) Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a novel molecular target. Proc Natl Acad Sci USA 103(46):17402–17407

    PubMed  CrossRef  CAS  Google Scholar 

  • Li A, Horvath S (2007) Network neighborhood analysis with the multi-node topological overlap measure. Bioinformatics 23(2):222–231

    PubMed  CrossRef  Google Scholar 

  • Sokal RR, Rohlf FJ (1981) Biometry: The principles and practice of statistics in biological research, 3rd edn. WH Freeman, New York

    Google Scholar 

  • Storey JD (2002) A direct approach to false discovery rates. J R Stat Soc Ser B 64:479–498

    CrossRef  Google Scholar 

  • Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. Proc Natl Acad Sci USA 100(16):9440–9445

    PubMed  CrossRef  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Steve Horvath .

Rights and permissions

Reprints and Permissions

Copyright information

© 2011 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Horvath, S. (2011). Association Measures and Statistical Significance Measures. In: Weighted Network Analysis. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-8819-5_10

Download citation