A Database Application to Manage DNA Sequence and Gene Expression Data


BioCloneDB is a user-friendly database with a web interface to assist molecular genetics laboratories in managing a local repository of sequence information linked to DNA clones. This tool is designed to assist in high-throughput sequence and gene expression projects, providing a link between both types of information. The unique feature of the application is the automation of batch sequence annotation following BLAST® searches, which is supported by easy-to-use web interfaces. Furthermore, any set of sequences can be annotated against any sequence database. This replaces the need to perform and analyse individual web BLAST® searches or the need to learn how to produce batch searches and perform analysis in a UNIX® operating system. BioCloneDB is open-source software that can be installed on Linux or UNIX® operating systems. To test the application, we used 1400 expressed sequence tags obtained from the filamentous fungus Neurospora crassa. The results were analysed and compared with published results and they show a significant change due to the accumulation of the data in the nr database (

This is a preview of subscription content, log in to check access.

Fig. 1
Table I
Table II


  1. 1.

    Altschul SF, Gish W, Miller W, et al. Basic local alignment search tool. J Mol Biol 1990; 215: 403–10

    PubMed  CAS  Google Scholar 

  2. 2.

    Altschul SF, Madden TL, Schaffer AA, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997; 25: 3389–402

    PubMed  Article  CAS  Google Scholar 

  3. 3.

    Dudoit S, Gentleman RC, Quackenbush J. Open source software for the analysis of microarray data. Biotechniques 2003 Mar; Suppl.: 45–51

    Google Scholar 

  4. 4.

    Kaminski M, editor. Microarray resources on the Web (second of many sections) [online]. Available from URL: [Accessed 2003 Aug]

  5. 5.

    Hoersch S, Leroy C, Brown NP, et al. The GeneQuiz web server: protein functional analysis through the Web. Trends Biochem Sci 2000; 25: 33–5

    PubMed  Article  CAS  Google Scholar 

  6. 6.

    Soanes DM, Skinner W, Keon J, et al. Genomics of phytopathogenic fungi and the development of bioinformatic resources. Mol Plant Microbe Interact 2002; 15: 421–7

    PubMed  Article  CAS  Google Scholar 

  7. 7.

    Moller S, Leser U, Fleischmann W, et al. EDITtoTrEMBL: a distributed approach to high-quality automated protein sequence annotation. Bioinformatics 1999; 15: 219–27

    PubMed  Article  CAS  Google Scholar 

  8. 8.

    Meskauskas A, Lehmann-Horn F, Jurkat-Rott K. Sight: automating genomic data-mining without programming skills. Bioinformatics 2004; 20: 1718–20

    PubMed  Article  CAS  Google Scholar 

  9. 9.

    Goesmann A, Haubrock M, Meyer F, et al. PathFinder: reconstruction and dynamic visualization of metabolic pathways. Bioinformatics 2002; 18: 124–9

    PubMed  Article  CAS  Google Scholar 

  10. 10.

    Dennis Jr G, Sherman BT, Hosack DA, et al. DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol 2003; 4(5): P3

    PubMed  Article  Google Scholar 

  11. 11.

    Bard JL, Rhee SY. Ontologies in biology: design, applications and future challenges. Nat Rev Genet 2004; 5: 213–22

    PubMed  Article  CAS  Google Scholar 

  12. 12.

    Galagan JE, Calvo SE, Borkovich KA, et al. The genome sequence of the filamentous fungus Neurospora crassa. Nature 2003; 422: 859–68

    PubMed  Article  CAS  Google Scholar 

  13. 13.

    Borkovich KA, Alex LA, Yarden O, et al. Lessons from the genome sequence of Neurospora crassa: tracing the path from genomic blueprint to multicellular organism. Microbiol Mol Biol Rev 2004; 68: 1–108

    PubMed  Article  CAS  Google Scholar 

  14. 14.

    Benson DA, Boguski MS, Lipman DJ, et al. GenBank. Nucleic Acids Res 1998; 26: 1–7

    Article  CAS  Google Scholar 

  15. 15.

    Nelson MA, Kang S, Braun EL, et al. Expressed sequences from conidial, mycelial and sexual stages of Neurospora crassa. Fungal Genet Biol 1997; 21: 348–63

    PubMed  Article  CAS  Google Scholar 

Download references


This research was supported by BARD, the US — Israel Binational Agricultural Research and Development fund. DL was supported by Israel Ministry of Science grant no. 1424 to Center of Knowledge for Bioinformatics Infrastructure (COBI).

We thank Dvorah Weisman and Arye Harel for helpful advice and Zahi Paz for assistance in web design.

he authors have no conflicts of interest that are directly relevant to the content of this article.

Author information



Corresponding author

Correspondence to Dr Oded Yarden.

Additional information

Availability: BioCloneDB is available for academic use along with documentation, Screenshots, database scheme and readme files at

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Reuveni, E., Leshkowitz, D. & Yarden, O. BioCloneDB. Appl-Bioinformatics 4, 277–280 (2005).

Download citation


  • Neurospora Crassa
  • Structure Query Language
  • Annotation Procedure
  • Annotation Module
  • Common Gateway Interface