Skip to main content
Log in

Incorporating heuristic information into ant colony optimization for epistasis detection

  • Research Article
  • Published:
Genes & Genomics Aims and scope Submit manuscript

Abstract

Epistasis has been receiving increasing attention in understanding the mechanism underlying susceptibility to complex diseases. Though many works have been done for epistasis detection, genome-wide association study remains a challenging task: it makes the search space excessively huge while solution quality is excessively demanded. In this study, we introduce an ant colony optimization based algorithm, AntMiner, by incorporating heuristic information into ant-decision rules. The heuristic information is used to direct ants in the search process for improving computational efficiency and solution accuracy. During iterations, chi-squared test is conducted to measure the association between an interaction and the phenotype. At the completion of the iteration process, statistically significant epistatic interactions are ordered and then screened by a post-procedure. Experiments of AntMiner and its comparison with existing algorithms epiMODE, TEAM and AntEpiSeeker are performed on both simulation data sets and real age-related macular degeneration data set, under the criteria of detection power and sensitivity. Results demonstrate that AntMiner is promising for epistasis detection. In terms of detection power, AntMiner performs best among all the other algorithms on all cases regardless of epistasis models and single nucleotide polymorphism size; compared with AntEpiSeeker, AntMiner can obtain better detection power but with less ants and iterations. In terms of sensitivity, AntMiner is better than AntEpiSeeker in detecting epistasis models displaying marginal effects but it has moderate sensitivity on epistasis models displaying no marginal effects. The study may provide clues on heuristics for further epistasis detection. The software package is available online at https://sourceforge.net/projects/antminer/files/.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Balding DJ (2006) A tutorial on statistical methods for population association studies. Nat. Rev. Genet. 7: 781–791.

    Article  PubMed  CAS  Google Scholar 

  • Cardon LR and Bell JI (2001) Association study designs for complex diseases. Nat. Rev. Genet. 2: 91–99.

    Article  PubMed  CAS  Google Scholar 

  • Christmas J, Keedwell E and Frayling TM (2010) Ant colony optimisation to identify genetic variant association with type 2 diabetes. Inform. Sciences 181: 1609–1622.

    Article  Google Scholar 

  • Culverhouse R, Klein T and Shannon W (2004) Detecting epistatic interactions contributing to quantitative traits. Genet. Epidemiol. 27: 141–152.

    Article  PubMed  Google Scholar 

  • Das S, Abraham A and Konar A (2008) Swarm intelligence algorithms in bioinformatics. Stud. Comp. Intell. 94: 113–147.

    Article  Google Scholar 

  • Fontanarosa J and Dai Y (2010) A block-based evolutionary optimization strategy to investigate gene-gene interactions in genetic association studies. IEEE Internat. Confer. Bioinform. Biomed. Workshop: 330–335.

  • Frankel WN and Schork NJ (1996) Who’s afraid of epistasis? Nat. Genet. 14: 371–373.

    Article  PubMed  CAS  Google Scholar 

  • Gilmore J, Greene C, Andrews P, Kiralis J and Moore J (2011) An Analysis of New Expert Knowledge Scaling Methods for Biologically Inspired Computing. Lect. Notes Comput. Sci. 5778: 286–293.

    Article  Google Scholar 

  • Greene C, Gilmore J, Kiralis J, Andrews P and Moore J (2009a) Optimal use of expert knowledge in ant colony optimization for the analysis of epistasis in human disease. Lect. Notes Comput. Sci. 5483: 92–103.

    Article  Google Scholar 

  • Greene CS, Penrod NM, Kiralis J and Moore JH (2009b) Spatially uniform relieff (SURF) for computationally-efficient filtering of gene-gene interactions. BioData Min. 2: 5.

    Article  PubMed  CAS  Google Scholar 

  • Greene CS, White BC and Moore JH (2008) Ant Colony Optimization for Genome-Wide Genetic Analysis. Lect Notes Comput. Sci. 5217: 37–47.

    Article  Google Scholar 

  • Jiang R, Tang W, Wu X and Fu W (2009) A random forest approach to the detection of epistatic interactions in case-control studies. BMC Bioinformatics 10Suppl 1: S65.

    Article  PubMed  CAS  Google Scholar 

  • Klein RJ, Zeiss C, Chew EY, Tsai JY, Sackler RS, Haynes C, Henning AK, SanGiovanni JP, Mane SM, Mayne ST et al. (2005) Complement factor H polymorphism in age-related macular degeneration. Science 308: 385–389.

    Article  PubMed  CAS  Google Scholar 

  • Li W and Reich J (2000) A complete enumeration and classification of two-locus disease models. Hum. Hered. 50: 334–349.

    Article  PubMed  CAS  Google Scholar 

  • Maher B (2008) Personal genomes: The case of the missing heritability. Nature 456: 18–21.

    Article  PubMed  CAS  Google Scholar 

  • Moore JH, Asselbergs FW and Williams SM (2010) Bioinformatics challenges for genome-wide association studies. Bioinformatics 26: 445–455.

    Article  PubMed  CAS  Google Scholar 

  • Quevedo J, Bahamonde A, Prez-Enciso M and Luaces O (2011) Disease liability prediction from large scale genotyping data using classifiers with a reject option. IEEE/ACM T. Comput. Bi. 9: 88–97.

    Article  Google Scholar 

  • Rekaya R and Robbins K (2009) Ant colony algorithm for analysis of gene interaction in high-dimensional association data. Rev. Bras. Zootecn. 38: 93–97.

    Article  Google Scholar 

  • Risch N and Merikangas K (1996) The future of genetic studies of complex human diseases. Science 273: 1516–1517.

    Article  PubMed  CAS  Google Scholar 

  • Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, Parl FF and Moore JH (2001) Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am. J. Hum. Genet. 69: 138–147.

    Article  PubMed  CAS  Google Scholar 

  • Shang J, Zhang J, Sun Y, Liu D, Ye D and Yin Y (2011) Performance analysis of novel methods for detecting epistasis. BMC Bioinformatics 12: 475.

    Article  PubMed  Google Scholar 

  • Shen Y, Liu Z and Ott J (2010) Systematic removal of outliers to reduce heterogeneity in case-control association studies. Hum. Hered. 70: 227–231.

    Article  PubMed  Google Scholar 

  • Tang W, Wu X, Jiang R and Li Y (2009) Epistatic module detection for case-control studies: a Bayesian model with a Gibbs sampling strategy. PLoS Genet. 5: e1000464.

    Article  PubMed  CAS  Google Scholar 

  • Wan X, Yang C, Yang Q, Xue H, Tang NL and Yu W (2010) Detecting two-locus associations allowing for interactions in genome-wide association studies. Bioinformatics 26: 2517–2525.

    Article  PubMed  CAS  Google Scholar 

  • Wang Y (2010) Computation Bioinformatics and Bioimaging Laboratory. http://www.cbil.ece.vt.edu/software.htm.

  • Wang Y, Liu X, Robbins K and Rekaya R (2010) AntEpiSeeker: detecting epistatic interactions for case-control studies using a two-stage ant colony optimization algorithm. BMC Res. Notes 3: 117.

    Article  PubMed  CAS  Google Scholar 

  • Wei B, Peng Q, Zhang Q and Li C (2011) Identification of a combination of SNPs associated with Graves’ disease using swarm intelligence. Sci. China Life Sci. 54: 139–145.

    Article  PubMed  Google Scholar 

  • WTCCC (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447: 661–678.

    Article  CAS  Google Scholar 

  • Yuan X, Zhang J and Wang Y (2011) Mutual information and linkage disequilibrium based SNP association study by grouping case-control. Genes Genom. 33: 65–73.

    Article  Google Scholar 

  • Zhang X, Huang S, Zou F and Wang W (2010) TEAM: efficient two-locus epistasis tests in human genome-wide association study. Bioinformatics 26: 217–227.

    Article  CAS  Google Scholar 

  • Zhang Y and Liu JS (2007) Bayesian inference of epistatic interactions in case-control studies. Nat. Genet. 39: 1167–1173.

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Junliang Shang or Junying Zhang.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shang, J., Zhang, J., Lei, X. et al. Incorporating heuristic information into ant colony optimization for epistasis detection. Genes Genom 34, 321–327 (2012). https://doi.org/10.1007/s13258-012-0003-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13258-012-0003-2

Keywords

Navigation