Human-Computer Interaction in a Computational Evolution System for the Genetic Analysis of Cancer

  • Jason H. Moore
  • Douglas P. Hill
  • Jonathan M. Fisher
  • Nicole Lavender
  • La Creis Kidd
Chapter

Abstract

The paradigm of identifying genetic risk factors for common human diseases by analyzing one DNA sequence variation at a time is quickly being replaced by research strategies that embrace the multivariate complexity of the genotype-to-phenotype mapping relationship, which is likely due, in part, to nonlinear interactions among many genetic and environmental factors. Embracing the complexity of common diseases such as cancer requires powerful computational methods that are able to model nonlinear interactions in high-dimensional genetic data. Previously, we have addressed this challenge with the development of a computational evolution system (CES) that incorporates greater biological realism than traditional artificial evolution methods, such as genetic programming. Our results have demonstrated that CES is capable of efficiently navigating these large and rugged fitness landscapes toward the discovery of biologically meaningful genetic models of disease predisposition. Further, we have shown that the efficacy of CES is improved dramatically when the system is provided with statistical expert knowledge, derived from a family of machine learning techniques known as Relief, or biological expert knowledge, derived from sources such as protein-protein interaction databases. The goal of the present study was to apply CES to the genetic analysis of prostate cancer aggressiveness in a large sample of European Americans. We introduce here the use of 3D visualization methods to identify interesting patterns in CES results. Information extracted from the visualization through human-computer interaction is then provided as expert knowledge to new CES runs in a cascading framework. We present a CES-derived multivariate classifier and provide a statistical and biological interpretation in the context of prostate cancer prediction.
The incorporation of human-computer interaction into CES provides a first step towards an interactive discovery system where the experts can be embedded in the computational discovery process. Our working hypothesis is that this type of human-computer interaction will provide more useful results for complex problem solving than the traditional black box machine learning approach.

Keywords

Computational Evolution; Genetic Epidemiology; Epistasis; Prostate Cancer; Visualization

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  • Jason H. Moore
    • 1
  • Douglas P. Hill
    • 1
  • Jonathan M. Fisher
    • 1
  • Nicole Lavender
    • 1
  • La Creis Kidd
    • 1
  1. Dartmouth Medical School, Lebanon, USA
