Journal of Computer Science and Technology

, Volume 20, Issue 4, pp 446–453

Test Data Sets and Evaluation of Gene Prediction Programs on the Rice Genome

  • Heng Li
  • Jin-Song Liu
  • Zhao Xu
  • Jiao Jin
  • Lin Fang
  • Lei Gao
  • Yu-Dong Li
  • Zi-Xing Xing
  • Shao-Gen Gao
  • Tao Liu
  • Hai-Hong Li
  • Yan Li
  • Li-Jun Fang
  • Hui-Min Xie
  • Wei-Mou Zheng
  • Bai-Lin Hao
Regular Paper

DOI: 10.1007/s11390-005-0446-x

Cite this article as:
Li, H., Liu, JS., Xu, Z. et al. J Comput Sci Technol (2005) 20: 446. doi:10.1007/s11390-005-0446-x

Abstract

With several rice genome projects approaching completion gene prediction/finding by computer algorithms has become an urgent task. Two test sets were constructed by mapping the newly published 28,469 full-length KOME rice cDNA to the RGP BAC clone sequences of Oryza sativa ssp. japonica: a single-gene set of 550 sequences and a multi-gene set of 62 sequences with 271 genes. These data sets were used to evaluate five ab initio gene prediction programs: RiceHMM, GlimmerR, GeneMark, FGENSH and BGF. The predictions were compared on nucleotide, exon and whole gene structure levels using commonly accepted measures and several new measures. The test results show a progress in performance in chronological order. At the same time complementarity of the programs hints on the possibility of further improvement and on the feasibility of reaching better performance by combining several gene-finders.

Keywords

gene predictionrice genometest setsaccuracy measureshidden Markov modelsdynamic programming

Copyright information

© Springer Science + Business Media, Inc. 2005

Authors and Affiliations

  • Heng Li
    • 1
    • 2
  • Jin-Song Liu
    • 1
  • Zhao Xu
    • 1
  • Jiao Jin
    • 1
    • 3
  • Lin Fang
    • 1
  • Lei Gao
    • 1
    • 2
  • Yu-Dong Li
    • 1
  • Zi-Xing Xing
    • 1
    • 3
  • Shao-Gen Gao
    • 1
    • 4
  • Tao Liu
    • 1
  • Hai-Hong Li
    • 1
  • Yan Li
    • 5
  • Li-Jun Fang
    • 5
  • Hui-Min Xie
    • 6
  • Wei-Mou Zheng
    • 1
    • 2
  • Bai-Lin Hao
    • 2
    • 5
    • 7
  1. 1.Beijing Genomics Institute (BGI)Academia SinicaBeijingP.R. China
  2. 2.Institute of Theoretical PhysicsAcademia SinicaBeijingP.R. China
  3. 3.Department of MathematicsBeijing Normal UniversityBeijingP.R. China
  4. 4.Institute of Systems ScienceAcademia SinicaBeijingP.R. China
  5. 5.Hangzhou Branch of BGIHangzhouP.R. China
  6. 6.Department of MathematicsSuzhou UniversitySuzhouP.R. China
  7. 7.T-Life Research CenterFudan UniversityShanghaiP.R. China