Predicting Gene Structures from Multiple RT-PCR Tests

(Extended Abstract)
  • Jakub Kováč
  • Tomáš Vinař
  • Broňa Brejová
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5724)


It has been demonstrated that the use of additional information such as ESTs and protein homology can significantly improve accuracy of gene prediction. However, many sources of external information are still being omitted from consideration. Here, we investigate the use of product lengths from RT-PCR experiments in gene finding. We present hardness results and practical algorithms for several variants of the problem and apply our methods to a real RT-PCR data set in the Drosophila genome. We conclude that the use of RT-PCR data can improve the sensitivity of gene prediction and locate novel splicing variants.


gene finding RT-PCR NP-completeness dynamic programming splicing graph 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Agrawal, R., Stormo, G.D.: Using mRNAs lengths to accurately predict the alternatively spliced gene products in Caenorhabditis elegans. Bioinformatics 22(10), 1239–1244 (2006)CrossRefPubMedGoogle Scholar
  2. Biedl, T., Brejova, B., Demaine, E., Hamel, A., Lopez-Ortiz, A., Vinar, T.: Finding hidden independent sets in interval graphs. Theoretical Computer Science 310(1-3), 287–307 (2004)CrossRefGoogle Scholar
  3. Brent, M., Langton, L., Comstock, C.L., van Baren, J.: Exhaustive RT-PCR and sequencing of all novel NSCAN predictions in Drosophila melanogaster. Personal communication (2007)Google Scholar
  4. Burge, C., Karlin, S.: Prediction of complete gene structures in human genomic DNA. Journal of Molecular Biology 268(1), 78–94 (1997)CrossRefPubMedGoogle Scholar
  5. Chen, T., Kao, M.Y., Tepel, M., Rush, J., Church, G.M.: A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry. Journal of Computational Biology 8(3), 325–327 (2001)CrossRefPubMedGoogle Scholar
  6. Gabow, H.N., Maheswari, S.N., Osterweil, L.J.: On two problems in the generation of program test paths. IEEE Trans. Soft. Eng. 2(3), 227–231 (1976)CrossRefGoogle Scholar
  7. Garey, M.R., Johnson, D.S.: Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman, New York (1979)Google Scholar
  8. Gross, S.S., Do, C.B., Sirota, M., Batzoglou, S.: CONTRAST: a discriminative, phylogeny-free approach to multiple informant de novo gene prediction. Genome Biology 8(12), R269 (2007)Google Scholar
  9. Guigo, R., et al.: EGASP: the human ENCODE genome annotation assessment project. Genome Biology 7(suppl. 1), S2 (2006)CrossRefGoogle Scholar
  10. Heber, S., Alekseyev, M., Sze, S.H., Tang, H., Pevzner, P.A.: Splicing graphs and EST assembly problem. Bioinformatics 18(suppl. 1), S181–S188 (2002)Google Scholar
  11. Kolman, P., Pankrác, O.: On the complexity of paths avoiding forbidden pairs. Discrete Applied Mathematics 157(13), 2871–2876 (2009)CrossRefGoogle Scholar
  12. Krause, K.W., Smith, R.W., Goodwin, M.A.: Optional software test planning through automated network analysis. In: Proceedings 1973 IEEE Symposium on Computer Software Reliability, pp. 18–22 (1973)Google Scholar
  13. Siepel, A., Diekhans, M., Brejova, B., Langton, L., Stevens, M., Comstock, C.L., Davis, C., Ewing, B., Oommen, S., Lau, C., Yu, H.C., Li, J., Roe, B.A., Green, P., Gerhard, D.S., Temple, G., Haussler, D., Brent, M.R.: Targeted discovery of novel human exons by comparative genomics. Genome Research 17(12), 1763–1763 (2007)CrossRefPubMedPubMedCentralGoogle Scholar
  14. Srimani, P.K., Sinha, B.P.: Impossible pair constrained test path generation in a program. Information Sciences 28(2), 87–103 (1982)CrossRefGoogle Scholar
  15. Stanke, M., Diekhans, M., Baertsch, R., Haussler, D.: Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics 24(5), 637–644 (2008)CrossRefPubMedGoogle Scholar
  16. Stanke, M., Keller, O., Gunduz, I., Hayes, A., Waack, S., Morgenstern, B.: AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Research 34(Web Server issue), W435–W439 (2006)CrossRefGoogle Scholar
  17. Wei, C., Brent, M.R.: Using ESTs to improve the accuracy of de novo gene prediction. BMC Bioinformatics 7, 327 (2006)CrossRefPubMedPubMedCentralGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Jakub Kováč
    • 1
  • Tomáš Vinař
    • 2
  • Broňa Brejová
    • 1
  1. 1.Department of Computer ScienceComenius UniversityBratislavaSlovakia
  2. 2.Department of Applied InformaticsComenius UniversityBratislavaSlovakia

Personalised recommendations