The widely used programs BLAST (in this article, ‘BLAST’ includes both the National Center for Biotechnology Information [NCBI] BLAST® and the Washington University version WU BLAST) and FASTA for similarity searches in nucleotide and protein databases usually result in copious output. However, when large query sets are used, human inspection rapidly becomes impractical. BioParser is a Perl program for parsing BLAST and FASTA reports. Making extensive use of the BioPerl toolkit, the program filters, stores and returns components of these reports in either ASCII or HTML format. BioParser is also capable of automatically feeding a local MySQL® database with the parsed information, allowing subsequent filtering of hits and/or alignments with specific attributes. For this reason, BioParser is a valuable tool for large-scale similarity analyses by improving the access to the information present in BLAST or FASTA reports, facilitating extraction of useful information of large sets of sequence alignments, and allowing for easy handling and processing of the data.
- 1.Yona G, Brenner SE. Comparison of protein sequences and practical database searching. In: Higgins D, Taylor W, editors. Bioinformatics: sequence, structure and databanks: a practical approach. Oxford: Oxford University Press, 2000: 167–90Google Scholar
- 10.BioPerl [online]. Available from URL: http://www.bioperl.org/ [Accessed 2005 June]
- 11.Henriques C, Otto TD, Catanho M, et al. Classification of transporter families in Trypanosoma cruzi [abstract no. BM128]. XXI Annual meeting of the Brazilian Society of Protozoology/XXXII Meeting of Basic Research in Chagas Disease; 2005 Nov 7–9; Caxambú, Brazil; 119Google Scholar
- 14.Catanho M, Mascarenhas D, Degrave W, et al. GenoMycDB: database for comparative analysis of mycobacterial genes and genomes. Genet Mol Res In pressGoogle Scholar