- 6 Downloads
The widely used programs BLAST (in this article, ‘BLAST’ includes both the National Center for Biotechnology Information [NCBI] BLAST® and the Washington University version WU BLAST) and FASTA for similarity searches in nucleotide and protein databases usually result in copious output. However, when large query sets are used, human inspection rapidly becomes impractical. BioParser is a Perl program for parsing BLAST and FASTA reports. Making extensive use of the BioPerl toolkit, the program filters, stores and returns components of these reports in either ASCII or HTML format. BioParser is also capable of automatically feeding a local MySQL® database with the parsed information, allowing subsequent filtering of hits and/or alignments with specific attributes. For this reason, BioParser is a valuable tool for large-scale similarity analyses by improving the access to the information present in BLAST or FASTA reports, facilitating extraction of useful information of large sets of sequence alignments, and allowing for easy handling and processing of the data.
We wish to thank Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Programa de Apoio à Pesquisa Estratégica em Saúde — Fiocruz (PAPES-Fiocruz), World Health Organization — Special Programme for Research and Training in Tropical Diseases (WHO/TDR), United Nations University — Biotechnology for Latin America and the Caribbean — Bioinformatics Network for Latin-America and Caribbean (UNU-BIOLAC LacBioNet) and Ciencia y Tecnología para el Desarrollo— Red Iberoamericana de Bioinformática (CYTED-RIB) for support.
The authors have no conflicts of interest that are directly relevant to the content of this article.
- 1.Yona G, Brenner SE. Comparison of protein sequences and practical database searching. In: Higgins D, Taylor W, editors. Bioinformatics: sequence, structure and databanks: a practical approach. Oxford: Oxford University Press, 2000: 167–90Google Scholar
- 10.BioPerl [online]. Available from URL: http://www.bioperl.org/ [Accessed 2005 June]
- 11.Henriques C, Otto TD, Catanho M, et al. Classification of transporter families in Trypanosoma cruzi [abstract no. BM128]. XXI Annual meeting of the Brazilian Society of Protozoology/XXXII Meeting of Basic Research in Chagas Disease; 2005 Nov 7–9; Caxambú, Brazil; 119Google Scholar
- 14.Catanho M, Mascarenhas D, Degrave W, et al. GenoMycDB: database for comparative analysis of mycobacterial genes and genomes. Genet Mol Res In pressGoogle Scholar