Bits and Pieces

  • Lloyd Allison


It is one thing to have an awareness of a subject but it is another to be an active and able worker in it. This chapter describes experiences, pitfalls, hints and tricks that may help the reader to get started at putting MML into practice. “Probability theory is nothing but common sense reduced to calculation” (Laplace) but data analysis software is numerical software and the results of computations need to be checked with scepticism, common sense and cunning.


  1. 1.
    L. Allison, Models for machine learning and data mining in functional programming. J. Funct. Program. 15(1), 15–32 (2005). MathSciNetCrossRefGoogle Scholar
  2. 10.
    A.J. Bleasby, J.C. Wootton, Constructing validated, non-redundant composite protein sequence databases. Protein Eng. 3(3), 153–159 (1990).
  3. 13.
    G.E.P. Box, Robustness in the strategy of scientific model building. Robustness Stat. 1, 201–236 (1979). Includes, “All models are wrong but some are useful”
  4. 29.
    S. Fienberg, When did Bayesian inference become “Bayesian”? Bayesian Anal. 1(1), 1–40 (2006). MathSciNetCrossRefGoogle Scholar
  5. 42.
    J. Gosling, B. Joy, G. Steele, G. Bracha, A. Buckley, The Java Language Specification, Java SE 8 (Addison Wesley, Reading, MA, 2014). ISBN 978-0133900699Google Scholar
  6. 50.
    A.S. Konagurthu, L. Allison, D. Abramson, P.J. Stuckey, A.M. Lesk, How precise are reported protein coordinate data? Acta Crystallogr. D70(3), 904–906 (2014). CrossRefGoogle Scholar
  7. 51.
    S. Kullback, R.A. Leibler, On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951). MathSciNetCrossRefGoogle Scholar
  8. 60.
    M. Nelson, Arithmetic coding + statistical modeling = data compression. Dr Dobbs J. (1991).
  9. 64.
    R.C. Pasco, Source coding algorithms for fast data compression. PhD thesis, Stanford University CA, 1976Google Scholar
  10. 67.
    S. Peyton Jones et al., Haskell 98 Language and Libraries, the Revised Report (Cambridge University Press, Cambridge, 2003). ISBN 978-0521826143zbMATHGoogle Scholar
  11. 70.
    J. Rissanen, Generalized kraft inequality and arithmetic coding. IBM J. Res. Dev. 20(3), 198–203 (1976). MathSciNetCrossRefGoogle Scholar
  12. 75.
    P.H. Sellers, On the theory and computation of evolutionary distances. SIAM J. Appl. Math. 26(4), 787–793 (1974). MathSciNetCrossRefGoogle Scholar
  13. 82.
    B. Stroustrup, The C++ Programming Language: Special Edition (Addison Wesley, Reading, MA, 2000). ISBN 978-0201700732zbMATHGoogle Scholar
  14. 88.
    C.S. Wallace, How to do inductive inference using information theory. Seminar, Department of Computer Science, Monash University, 1993. Google Scholar
  15. 91.
    C.S. Wallace, A brief history of MML. Seminar, Faculty of Information Technology, Monash University, 2003. Google Scholar
  16. 96.
    C.S. Wallace, P.R. Freeman, Estimation and inference by compact coding. J. R. Stat. Soc. Ser. B Methodol. 240–265 (1987).
  17. 98.
    C.S. Wallace, J.D. Patrick, Coding decision trees. Mach. Learn. 11(1), 7–22 (1993). CrossRefGoogle Scholar
  18. 102.
    C.N. Yee, L. Allison, Reconstruction of strings past. J. Bioinformatics 9(1), 1–7 (1993). CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • Lloyd Allison
    • 1
  1. 1.Faculty of Information TechnologyMonash UniversityMelbourneAustralia

Personalised recommendations