Skip to main content

Identification of Novel Genetic Models of Glaucoma Using the “EMERGENT” Genetic Programming-Based Artificial Intelligence System

Part of the Genetic and Evolutionary Computation book series (GEVO)

Abstract

The genetic basis for primary open-angle glaucoma (POAG) is not yet understood but is likely the result of many interacting genetic variants that influence risk in the context of our local ecology. The complexity of the genotype to phenotype mapping relationship for common diseases like POAG necessitates analytical approaches that move beyond parametric statistical methods such as logistic regression that assume a particular mathematical model. This is particularly important in the era of big data where it is routine to collect and analyze data sets with hundreds of thousands of measured genetic variants in thousands of human subjects. We introduce here the Exploratory Modeling for Extracting Relationships using Genetic and Evolutionary Navigation Techniques (EMERGENT) algorithm as an artificial intelligence approach to the genetic analysis of common human diseases. EMERGENT builds models of genetic variation from lists of mathematical functions using a form of genetic programming called computational evolution. A key feature of the system is the ability to utilize pre-processed expert knowledge giving it the ability to explore model space much as a human would. We describe this system in detail and then apply it to the genetic analysis of POAG in the Glaucoma Gene Environment Initiative (GLAUGEN) study that included approximately 1,272 subjects with the disease and 1057 healthy controls. A total of 657,366 single-nucleotide polymorphisms (SNPs) from across the human genome were measured in these subjects and available for analysis. Analysis using the EMERGENT framework revealed a best model consisting of six SNPs that map to at least six different genes. Two of these genes have previously been associated with POAG in several studies. The others represent new hypotheses about the genetic basis of POAG. All of the SNPs are involved in non-additive gene-gene interactions. Further, the six genes are all directly or indirectly related through biological interactions to the vascular endothelial growth factor (VEGF) gene that is an actively investigated drug target for POAG. This study demonstrates the routine application of an artificial intelligence-based system for the genetic analysis of complex human diseases.

Keywords

  • Exploratory modeling for extracting relationships using genetic and evolutionary navigation techniques
  • Artificial intelligence
  • Genetic programming
  • Glaucoma

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-319-16030-6_2
  • Chapter length: 19 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   84.99
Price excludes VAT (USA)
  • ISBN: 978-3-319-16030-6
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Hardcover Book
USD   119.99
Price excludes VAT (USA)
Fig. 2.1
Fig. 2.2
Fig. 2.3
Fig. 2.4
Fig. 2.5

References

  • Banzhaf W, Francone FD, Keller RE, Nordin P (1998) Genetic programming: an introduction: on the automatic evolution of computer programs and its applications. Morgan Kaufmann Publishers Inc., San Francisco

    MATH  Google Scholar 

  • Banzhaf W, Beslon G, Christensen S, Foster J, Kepes F, Lefort V, Miller J, Radman M, Ramsden J (2006) From artificial evolution to computational evolution: a research agenda. Nature Rev Genet 7:729–735

    CrossRef  Google Scholar 

  • Bush WS, Moore JH (2012) Chapter 11: Genome-wide association studies. PLoS Comput Biol 8(12):e1002822. doi:10.1371/journal.pcbi.1002822

    CrossRef  Google Scholar 

  • Clark AF, Yorio T (2003) Ophthalmic drug discovery. Nat Rev Drug Discov 2(6):448–459. doi:10.1038/nrd1106

    CrossRef  Google Scholar 

  • Cooke Bailey JN, Sobrin L, Pericak-Vance MA, Haines JL, Hammond CJ, Wiggs JL (2013) Advances in the genomics of common eye diseases. Hum Mol Genet 22(R1):R59–65. doi:10.1093/hmg/ddt396

    CrossRef  Google Scholar 

  • Cornelis MC, Agrawal A, Cole JW, Hansel NN, Barnes KC, Beaty TH, Bennett SN, Bierut LJ, Boerwinkle E, Doheny KF, Feenstra B, Feingold E, Fornage M, Haiman CA, Harris EL, Hayes MG, Heit JA, Hu FB, Kang JH, Laurie CC, Ling H, Manolio TA, Marazita ML, Mathias RA, Mirel DB, Paschall J, Pasquale LR, Pugh EW, Rice JP, Udren J, van Dam RM, Wang X, Wiggs JL, Williams K, Yu K (2010) The gene, environment association studies consortium (geneva): maximizing the knowledge obtained from gwas by collaboration across studies of multiple conditions. Genet Epidemiol 34(4):364–372. doi:10.1002/gepi.20492

    CrossRef  Google Scholar 

  • Dewey FE, Grove ME, Pan Cea (2014) Clinical interpretation and implications of whole-genome sequencing. JAMA 311(10):1035–1045. doi:10.1001/jama.2014.1717. http://dx.doi.org/10.1001/jama.2014.1717,/data/Journals/JAMA/929845/joi140017.pdf

    CrossRef  Google Scholar 

  • Fan R, Zhong M, Wang S, Zhang Y, Andrew A, Karagas M, Chen H, Amos CI, Xiong M, Moore JH (2011) Entropy-based information gain approaches to detect and to characterize gene-gene and gene-environment interactions/correlations of complex diseases. Genet Epidemiol 35(7):706–721. doi:10.1002/gepi.20621

    CrossRef  Google Scholar 

  • Fogel GB, Corne DW (eds) (2003) Evolutionary computation in bioinformatics. Morgan Kaufmann Publishers Inc.

    Google Scholar 

  • Greene CS, Hill DP, Moore JH (2009a) Environmental noise improves epistasis models of genetic data discovered using a computational evolution system. In: Proceedings of the 11th annual conference on genetic and evolutionary computation, ACM, New York, NY, GECCO '09, pp 1785–1786. doi:10.1145/1569901.1570160. http://doi.acm.org/10.1145/1569901.1570160

  • Greene CS, Hill DP, Moore JH (2009b) Environmental sensing of expert knowledge in a computational evolution system for complex problem solving in human genetics. In: Riolo RL, O’Reilly UM, McConaghy T (eds) Genetic programming theory and practice VII, genetic and evolutionary computation. Springer, Ann Arbor, pp 19–36

    Google Scholar 

  • Hahn LW, Ritchie MD, Moore JH (2003) Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions. Bioinformatics 19(3):376–382

    CrossRef  Google Scholar 

  • Hirschhorn JN, Daly MJ (2005) Genome-wide association studies for common diseases and complex traits. Nat Rev Genet 6(2):95–108. doi:10.1038/nrg1521

    CrossRef  Google Scholar 

  • Horn J, Nafpliotis N, Goldberg DE (1994) A niched pareto genetic algorithm for multiobjective optimization. In: Evolutionary Computation, 1994. IEEE world congress on computational intelligence, proceedings of the first IEEE conference on, pp 82–87 vol.1. doi:10.1109/ICEC.1994.350037

    Google Scholar 

  • Hornby GS (2006) Alps: the age-layered population structure for reducing the problem of premature convergence. In: Proceedings of the 8th annual conference on Genetic and evolutionary computation, ACM, New York, NY, GECCO '06, pp 815–822. doi:10.1145/1143997.1144142

    Google Scholar 

  • Horsley MB, Kahook MY (2010) Anti-vegf therapy for glaucoma. Curr Opin Ophthalmol 21(2):112–117. doi:10.1097/ICU.0b013e3283360aad

    CrossRef  Google Scholar 

  • Hu T, Sinnott-Armstrong NA, Kiralis JW, Andrew AS, Karagas MR, Moore JH (2011) Characterizing genetic interactions in human disease association studies using statistical epistasis networks. BMC Bioinform 12:364. doi:10.1186/1471-2105-12-364

    CrossRef  Google Scholar 

  • Hu T, Chen Y, Kiralis JW, Moore JH (2013) Visen: methodology and software for visualization of statistical epistasis networks. Genet Epidemiol 37(3):283–285. doi:10.1002/gepi.21718

    CrossRef  Google Scholar 

  • Koza JR (1992) Genetic programming: on the programming of computers by means of natural selection (Complex Adaptive Systems), 1st edn. A Bradford Book. http://www.amazon.com/exec/obidos/redirect?tag=citeulike07-20&path=ASIN/0262111705

  • Mailman MD, Feolo M, Jin Y, Kimura M, Tryka K, Bagoutdinov R, Hao L, Kiang A, Paschall J, Phan L, Popova N, Pretel S, Ziyabari L, Lee M, Shao Y, Wang ZY, Sirotkin K, Ward M, Kholodov M, Zbicz K, Beck J, Kimelman M, Shevelev S, Preuss D, Yaschenko E, Graeff A, Ostell J, Sherry ST (2007) The ncbi dbgap database of genotypes and phenotypes. Nat Genet 39(10):1181–1186. doi:10.1038/ng1007-1181

    CrossRef  Google Scholar 

  • Moore J (2003) The ubiquitous nature of epistasis in determining susceptibility to common human diseases. Human Heredity 56:73–82

    CrossRef  Google Scholar 

  • Moore JH, White BC (2007) Tuning relieff for genome-wide genetic analysis. Proceedings of the 5th European conference on Evolutionary computation, machine learning and data mining in bioinformatics. Springer-Verlag, Berlin, pp 166–175. http://dl.acm.org/citation.cfm?id=1761486.1761502

  • Moore JH, Williams SM (2005) Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis. Bioessays 27(6):637–646. doi:10.1002/bies.20236

    CrossRef  Google Scholar 

  • Moore JH, Williams SM (2009) Epistasis and its implications for personal genetics. Am J Hum Genet 85(3):309–320. doi:10.1016/j.ajhg.2009.08.006

    CrossRef  Google Scholar 

  • Moore JH, Parker JS, Olsen NJ, Aune TM (2002) Symbolic discriminant analysis of microarray data in autoimmune disease. Genet Epidemiol 23(1):57–69

    CrossRef  Google Scholar 

  • Moore JH, Gilbert JC, Tsai CT, Chiang FT, Holden T, Barney N, White BC (2006) A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility. J Theor Biol 241(2):252–261. doi:10.1016/j.jtbi.2005.11.036. http://dx.doi.org/10.1016/j.jtbi.2005.11.036

    MathSciNet  CrossRef  Google Scholar 

  • Moore JH, Barney N, Tsai CT, Chiang FT, Gui J, White BC (2007) Symbolic modeling of epistasis. Hum Hered 63(2):120–133. doi:10.1159/000099184

    CrossRef  Google Scholar 

  • Moore JH, Andrews PC, Barney N, White BC (2008a) Development and evaluation of an open-ended computational evolution system for the genetic analysis of susceptibility to common human diseases. In: Marchiori E, Moore JH (eds) EvoBIO, Springer, lecture notes in computer science, vol 4973, pp 129–140

    Google Scholar 

  • Moore JH, Barney N, White BC (2008b) Solving complex problems in human genetics using genetic programming: the importance of theorist-Practitionercomputer interaction. Springer US, pp 69–85

    Google Scholar 

  • Moore JH, Greene CS, Andrews PC, White BC (2008c) Does complexity matter? artificial evolution, computational evolution and the genetic analysis of epistasis in common human diseases. In: Riolo RL, Soule T, Worzel B (eds) Genetic programming theory and practice VI, genetic and evolutionary computation. Springer, Ann Arbor, pp 125–145. doi:10.1007/978-0-387-87623-8-9

    Google Scholar 

  • Moore JH, Asselbergs FW, Williams SM (2010) Bioinformatics challenges for genome-wide association studies. Bioinformatics 26(4):445–455. doi:10.1093/bioinformatics/btp713

    CrossRef  Google Scholar 

  • Moore JH, Hill DP, Fisher JM, Lavender N, Kidd LC (2011) Human-computer interaction in a computational evolution system for the genetic analysis of cancer. In: Riolo R, Vladislavleva E, Moore JH (eds) Genetic programming theory and practice IX, genetic and evolutionary computation, Springer, Ann Arbor, pp 153–171. doi:10.1007/978-1-4614-1770-5-9

    Google Scholar 

  • Moore JH, Hill DP, Sulovari A, Kidd L (2012) Genetic analysis of prostate cancer using computational evolution, pareto-optimization and post-processing. In: Riolo R, Vladislavleva E, Ritchie MD, Moore JH (eds) Genetic programming theory and practice X, genetic and evolutionary computation. Springer, Ann Arbor, pp 87–101. doi:10.1007/978-1-4614-6846-2-7. http://dx.doi.org/10.1007/978-1-4614-6846-2-7

  • Moore JH, Hill DP, Saykin A, Shen L (2013) Exploring interestingness in a computational evolution system for the genome-wide genetic analysis of alzheimer’s disease. In: Riolo R, Moore JH, Kotanchek M (eds) Genetic programming theory and practice XI, genetic and evolutionary computation. Springer, pp 31–45. doi:10.1007/978-1-4939-0375-7-2

    Google Scholar 

  • Osaadon P, Fagan XJ, Lifshitz T, Levy J (2014) A review of anti-vegf agents for proliferative diabetic retinopathy. Eye (Lond) 28(5):510–520. doi:10.1038/eye.2014.13

    CrossRef  Google Scholar 

  • Pattin KA, Payne JL, Hill DP, Caldwell T, Fisher JM, Moore JH (2010) Exploiting expert knowledge of protein-protein interactions in a computational evolution system for detecting epistasis. In: Riolo R, McConaghy T, Vladislavleva E (eds) Genetic programming theory and practice VIII, genetic and evolutionary computation, vol 8. Springer, Ann Arbor. (Chap 12, pp 195–210, http://www.springer.com/computer/ai/book/978-1-4419-7746-5)

  • Payne J, Greene C, Hill D, Moore J (2010) Exploitation of linkage learning in evolutionary algorithms. Springer. (chap 10: Sensible initialization of a computational evolution system using expert knowledge for epistasis analysis in human genetics), pp 215–226

    Google Scholar 

  • Ritchie M, Hahn L, Roodi N, Bailey L, Dupont W, Parl F, Moore J (2001) Multifactor dimensionality reduction reveals high-order interactions among estrogen metabolism genes in sporadic breast cancer. Am J Hum Genetics 69:138–147

    CrossRef  Google Scholar 

  • Ritchie MD, White BC, Parker JS, Hahn LW, Moore JH (2003) Optimization of neural network architecture using genetic programming improves detection and modeling of gene-gene interactions in studies of human diseases. BMC Bioinform 4:28. doi:10.1186/1471-2105-4-28

    CrossRef  Google Scholar 

  • SooHoo JR, Seibold LK, Kahook MY (2014) The link between intravitreal antivascular endothelial growth factor injections and glaucoma. Curr Opin Ophthalmol 25(2):127–133. doi:10.1097/ICU.0000000000000036

    CrossRef  Google Scholar 

  • Spector L, Robinson A (2002) Genetic programming and autoconstructive evolution with the push programming language. Genet Program Evolvable Mach pp 7–40

    Google Scholar 

  • Tyler AL, Asselbergs FW, Williams SM, Moore JH (2009) Shadows of complexity: what biological networks reveal about epistasis and pleiotropy. Bioessays 31(2):220–227. doi:10.1002/bies.200800022

    CrossRef  Google Scholar 

  • Wang WYS, Barratt BJ, Clayton DG, Todd JA (2005) Genome-wide association studies: theoretical and practical concerns. Nat Rev Genet 6(2):109–118. doi:10.1038/nrg1522

    CrossRef  Google Scholar 

  • Wang X, Harmon J, Zabrieskie N, Chen Y, Grob S, Williams B, Lee C, Kasuga D, Shaw PX, Buehler J, Wang N, Zhang K (2010) Using the utah population database to assess familial risk of primary open angle glaucoma. Vision Res 50(23):2391–2395. doi:10.1016/j.visres.2010.09.018

    CrossRef  Google Scholar 

  • Welter D, MacArthur J, Morales J, Burdett T, Hall P, Junkins H, Klemm A, Flicek P, Manolio T, Hindorff L, Parkinson H (2014) The nhgri gwas catalog, a curated resource of snp-trait associations. Nucleic Acids Res 42(Database issue):D1001–6. doi:10.1093/nar/gkt1229

    CrossRef  Google Scholar 

  • Wong AK, Park CY, Greene CS, Bongo LA, Guan Y, Troyanskaya OG (2012) Imp: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks. Nucleic Acids Res 40(Web Server issue):W484–90. doi:10.1093/nar/gks458

    CrossRef  Google Scholar 

Download references

Acknowledgements

The work on glaucoma was supported by NIH grant EY022300. Aspects of the algorithm development were also supported by NIH grants LM009012, LM010098, and AI59694. Computational infrastructure was partially supported by NIH grants P20 GM103506 and P20 GM103534. We would like to thank the participants of present and past Genetic Programming Theory and Practice Workshops (GPTP) for their stimulating feedback and discussion that helped formulate some of the ideas in this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jason H. Moore .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Moore, J., Greene, C., Hill, D. (2015). Identification of Novel Genetic Models of Glaucoma Using the “EMERGENT” Genetic Programming-Based Artificial Intelligence System. In: Riolo, R., Worzel, W., Kotanchek, M. (eds) Genetic Programming Theory and Practice XII. Genetic and Evolutionary Computation. Springer, Cham. https://doi.org/10.1007/978-3-319-16030-6_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16030-6_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16029-0

  • Online ISBN: 978-3-319-16030-6

  • eBook Packages: Computer ScienceComputer Science (R0)