Bioinformatics Research and Applications

Volume 6674 of the series Lecture Notes in Computer Science pp 404-415

An Integrative Approach for Genomic Island Prediction in Prokaryotic Genomes

  • Han Wang, Department of Computer Science, East Stroudsburg University
  • John Fazekas, Department of Computer Science, East Stroudsburg University
  • Matthew Booth, Department of Computer Science, East Stroudsburg University
  • Qi Liu, College of Life Science and Biotechnology, Tongji University
  • Dongsheng Che, Department of Computer Science, East Stroudsburg University

Abstract

A genomic island (GI) is a segment of genomic sequence that has been horizontally transferred from another genome. The detection of genomic islands is extremely important to medical research. Most current computational approaches that use sequence composition to predict genomic islands suffer from low prediction accuracy. In this paper, we report, for the first time, that gene information and intergenic distance differ between genomic islands and non-genomic islands. Using these two sources together with sequence information, we trained on genomic island datasets from 113 genomes and developed a decision-tree-based bagging model for genomic island prediction. To test the performance of our approach, we applied it to three genomes: Salmonella typhimurium LT2, Streptococcus pyogenes MGAS315, and Escherichia coli O157:H7 str. Sakai. The performance metrics show that our approach outperforms other sequence-composition-based approaches. We conclude that the incorporation of gene information and intergenic distance could improve genomic island prediction accuracy. Our prediction software, Genomic Island Hunter (GIHunter), is available at http://www.esu.edu/cpsc/che_lab/software/GIHunter.
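
The general idea of a bagged tree classifier over per-region features can be sketched as follows. This is a minimal illustration and not the GIHunter implementation: it uses one-level decision trees (stumps) in place of full decision trees, the three feature columns (a composition deviation score, a gene-related score, and a mean intergenic distance) are hypothetical placeholders, and the numbers are synthetic values invented only so the code runs. They carry no biological meaning.

```python
import random
from collections import Counter

# Synthetic toy data (assumption): each row is one genomic region described by
# (composition_deviation, gene_score, mean_intergenic_distance).
# Label 1 = genomic island, 0 = non-island. Values are made up for illustration.
ROWS = [
    (0.01, 0.85, 110.0),
    (0.02, 0.90, 95.0),
    (0.03, 0.80, 130.0),
    (0.04, 0.88, 120.0),
    (0.10, 0.40, 600.0),
    (0.12, 0.35, 750.0),
    (0.13, 0.30, 820.0),
    (0.15, 0.25, 900.0),
]
LABELS = [0, 0, 0, 0, 1, 1, 1, 1]


def train_stump(rows, labels):
    """Fit a one-level decision tree by exhaustive search.

    Tries every (feature, threshold, left-side label) combination and keeps
    the first one with the fewest training errors.
    """
    best = None
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            for left_label in (0, 1):
                errors = sum(
                    (left_label if r[f] <= t else 1 - left_label) != y
                    for r, y in zip(rows, labels)
                )
                if best is None or errors < best[0]:
                    best = (errors, f, t, left_label)
    _, f, t, left_label = best
    return (f, t, left_label)


def predict_stump(stump, row):
    """Predict with a single stump: left label if value <= threshold."""
    f, t, left_label = stump
    return left_label if row[f] <= t else 1 - left_label


def train_bagging(rows, labels, n_trees=25, seed=0):
    """Train n_trees stumps, each on a bootstrap sample drawn with replacement."""
    rng = random.Random(seed)
    n = len(rows)
    ensemble = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        ensemble.append(
            train_stump([rows[i] for i in idx], [labels[i] for i in idx])
        )
    return ensemble


def predict_bagging(ensemble, row):
    """Aggregate the ensemble by majority vote."""
    votes = Counter(predict_stump(s, row) for s in ensemble)
    return votes.most_common(1)[0][0]
```

A call such as `predict_bagging(train_bagging(ROWS, LABELS), (0.14, 0.30, 900.0))` classifies a new region by letting every bootstrap-trained stump vote. Averaging over resampled training sets is what lets bagging reduce the variance of individual trees.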

Keywords

Genomic islands; gene information; intergenic distance; sequence composition