Skip to main content

Probability and Likelihood

  • Chapter
  • First Online:
Bioinformatics

Part of the book series: Computational Biology ((COBO,volume 21))

  • 5920 Accesses

Abstract

This chapter begins with a concise primer, or aide mémoire, of probability theory. The fundamentals are recapitulated, drawing on the material of Chap. 4. Moments of distributions are reviewed. The theory of runs (successions of similar events preceded and succeeded by different events), which is useful for analysing nucleic acid sequences and series of events, is reviewed, and it leads naturally onto the hypergeometric distribution. The chapter then moves on to likelihood as a valuable means of determining the degree of support for a proposition. A reliable method of assessing the support of any proposition basing its validity on data as the evidence for it is required for a great deal of work in bioinformatics. The chapter closes with a brief review of the method of maximum entropy, which is already used for image restoration but which has the potential for a great many more applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Called Merkmalraum (“label space”) in R. von Mises’ (1931) treatise Wahrscheinlichkeitsrechnung.

  2. 2.

    Its protagonists include Laplace, Keynes, and Jeffreys.

  3. 3.

    According to J.M. Keynes, probability is to be regarded as “the degree of our rational belief in a proposition”.

  4. 4.

    Made during the 17th Guthrie Lecture to the Physical Society in London.

  5. 5.

    Notation: in this chapter, \(P\{X\}\) denotes the probability of event X; \(N\{X\}\) is the number of simple events in (compound) event X. S denotes the certain event that contains all possible events. Sample space and events are primitive (undefined) notions (cf. line and point in geometry).

  6. 6.

    The proof is given in Feller (1967), Chap. 4.

  7. 7.

    Indeed, Reichenbach, Popper, and others have taken the view that conditional probability may and should be chosen as the basic concept of probability theory. We should in any case note that most of the results derived for unconditional probabilities are also valid for conditional probabilities.

  8. 8.

    Stochastic independence is formally defined via the condition

    figure a

    which must hold if the two events A and H are stochastically (sometimes called statistically) independent.

  9. 9.

    If and only if.

  10. 10.

    Due to P.V. Sukhatme and V.G. Panse, quoted by Feller (1967), Chap. 6.

  11. 11.

    \(\mathbf {X}\) may assume the values \(x_1,x_2,\ldots \) (i.e., the range of \(\mathbf {X}\)).

  12. 12.

    The distribution function F(x) of \(\mathbf {X}\) is defined by

    figure b

    (i.e., a nondecreasing function tending to 1 as \(x\rightarrow \infty \)).

  13. 13.

    Also denoted by angular brackets or a bar.

  14. 14.

    Notice the mechanical analogies: centre of gravity as the mean of a mass and moment of inertia as its variance.

  15. 15.

    Older literature uses the term “dispersion”.

  16. 16.

    Strictly speaking, one should instead refer to propositions. A hypothesis is an asserted proposition, whereas at the beginning of an investigation it would be better to start with considered propositions, to avoid prematurely asserting what one wishes to find out. Unfortunately, the use of the term “hypothesis” seems to have become so well established that we may risk confusion if we avoid using the word.

  17. 17.

    As Fisher and others have pointed out, it is not strictly correct to associate Bayes with the inverse probability method. Bayes’ doubts as to its validity led him to withhold publication of his work (it was published posthumously).

  18. 18.

    Sometimes brevity is taken as the main criterion. This is the minimum description length (MDL) approach. See also the discussion in Sects. 3.4 and 6.5.

  19. 19.

    Implicitly, Platonic reality is meant here.

References

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jeremy Ramsden .

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag London

About this chapter

Cite this chapter

Ramsden, J. (2015). Probability and Likelihood. In: Bioinformatics. Computational Biology, vol 21. Springer, London. https://doi.org/10.1007/978-1-4471-6702-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-6702-0_5

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-6701-3

  • Online ISBN: 978-1-4471-6702-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics