BADASS: BActeriocin-Diversity ASsessment Software

Costa, Sávio S.; da Silva Moia, Gislenne; Silva, Artur; Baraúna, Rafael A.; de Oliveira Veras, Adonney Allan

doi:10.1186/s12859-022-05106-x

BADASS: BActeriocin-Diversity ASsessment Software

Software
Open access
Published: 20 January 2023

Volume 24, article number 24, (2023)
Cite this article

Download PDF

You have full access to this open access article

BMC Bioinformatics Aims and scope Submit manuscript

BADASS: BActeriocin-Diversity ASsessment Software

Download PDF

Sávio S. Costa ORCID: orcid.org/0000-0001-7288-4157¹,
Gislenne da Silva Moia²,
Artur Silva¹,
Rafael A. Baraúna¹ &
…
Adonney Allan de Oliveira Veras³

3117 Accesses
4 Citations
4 Altmetric
Explore all metrics

Abstract

Background

Bacteriocins are defined as thermolabile peptides produced by bacteria with biological activity against taxonomically related species. These antimicrobial peptides have a wide application including disease treatment, food conservation, and probiotics. However, even with a large industrial and biotechnological application potential, these peptides are still poorly studied and explored. BADASS is software with a user-friendly graphical interface applied to the search and analysis of bacteriocin diversity in whole-metagenome shotgun sequencing data.

Results

The search for bacteriocin sequences is performed with tools such as BLAST or DIAMOND using the BAGEL4 database as a reference. The putative bacteriocin sequences identified are used to determine the abundance and richness of the three classes of bacteriocins. Abundance is calculated by comparing the reads identified as bacteriocins to the reads identified as 16S rRNA gene using SILVA database as a reference. BADASS has a complete pipeline that starts with the quality assessment of the raw data. At the end of the analysis, BADASS generates several plots of richness and abundance automatically as well as tabular files containing information about the main bacteriocins detected. The user is able to change the main parameters of the analysis in the graphical interface. To demonstrate how the software works, we used four datasets from WMS studies using default parameters. Lantibiotics were the most abundant bacteriocins in the four datasets. This class of bacteriocin is commonly produced by Streptomyces sp.

Conclusions

With a user-friendly graphical interface and a complete pipeline, BADASS proved to be a powerful tool for prospecting bacteriocin sequences in Whole-Metagenome Shotgun Sequencing (WMS) data. This tool is publicly available at https://sourceforge.net/projects/badass/.

Metagenomic profiling pipelines improve taxonomic classification for 16S amplicon sequencing data

Article Open access 26 August 2023

Analysis of sequencing strategies and tools for taxonomic annotation: Defining standards for progressive metagenomics

Article Open access 13 August 2018

Mock community taxonomic classification performance of publicly available shotgun metagenomics pipelines

Article Open access 17 January 2024

Background

Characterization of bioactive molecules produced by free-living microorganisms has been very important in recent years because of their biotechnological applications. It is well known that the overwhelming majority of free-living microorganisms are not capable of being grown in laboratory conditions [1], which is a bottleneck to the identification and isolation of bioactive compounds. Thus, an alternative to search for new compounds is the well-established method of WMS, where the nucleic acids of the microbial community are extracted and sequenced directly from environmental samples [2]. Thus, genes involved in the synthesis of peptides or non-peptides bioactive compounds can be assessed. The main bottleneck lies in the development of user-friendly tools that allow the user to analyze a large amount of data in a simple and interactive way.

Several genes and molecules were prospected by WMS such as amylolytic or cellulolytic enzymes [3], antimicrobial compounds [4], antibiotic resistance genes (ARGs) [5,6,7] and bacteriocins [8]. Bacteriocins are small cationic thermostable bacterial peptides with a narrow spectrum of activity [9,10,11]. Unlike antibiotics, bacteriocins are produced by ribosomal activity and therefore are protease-sensitive. They are divided into three classes according to their synthesis mechanism.

Class I consists of peptides that after their translation, undergo structural changes. This class is also called lantibiotics. They have a molecular weight below 5 kDa and a size smaller than 28 amino acids [12, 13]. Class II is characterized by peptides that do not undergo post-translational modifications. They are larger than class I bacteriocins and have a molecular weight below 10 kDa [14]. Class III is composed of peptides with a molecular weight higher than 30 kDa. Bacteriocins of this class have a mechanism of action different from the other two classes, eliminating bacterial cells through cell wall hydrolysis [12, 15, 16].

A variety of software has been developed to search for ARGs or secondary metabolites such as nonribosomal peptide synthase (NRPS) or polyketide synthase (PKS) in WMS data [6, 17]. However, none of this software is focused on the prospecting of bacteriocins. Anti-SMASH [18] is an excellent tool to analyze genomes while RiPPER [19] works better for pan-genome data. BAGEL web tool (BActeriocin GEnome mining tooL) [17] is one of the first tools developed for the identification of peptides and bacteriocins in genome data. However, the tool has a maximum size for the input file, making difficult the analysis of WMS data.

In this article, we present BADASS software (BActeriocin-Diversity ASSessment Software), an automated pipeline with an intuitive graphical interface that allows users to analyze the diversity of bacteriocins using WMS raw data. Diversity measurement is based on the abundance and richness of the three bacteriocin classes currently described. The software is available at https://sourceforge.net/projects/badass/.

Implementation

Pipeline

The pipeline of BADASS (Fig. 1) starts with the automatic loading of bacteriocin sequences to the database. It is worth noting that this process needs to be executed only on the first use of the software. The input file consists of a WMS sequencing sample in FASTQ format. After saving the project the following steps are performed.

(a)
Quality assessment The user can choose to evaluate the raw data with a boxplot chart that correlates the Phred score of a base (y axis) with base position (x axis). This is an optional step that helps users to decide about the quality filter values that will be used in the next step. A boxplot with the result of the FastQC analysis is produced at this stage and displayed to the user.
(b)
Trimming and quality filter Raw data is trimmed to remove bases at the end of the reads with a Phred score below the cut-off value provided by the user. Subsequently, sequences are filtered according to parameters such as alignment score and e-value. The Fastx Toolkit software is used in this step.
(c)
Parser fastq2fasta The trimmed and filtered file is converted into FASTA format.
(d)
Mapping against the bacteriocin database In order to identify bacteriocins, we adapted a search method used in several works [20, 21] which consists of: Firstly, a database is built using non-redundant BAGEL4 sequences [22]. Subsequently, the BLAST+ tool [22, 23] is used to compare the translated reads against the BAGEL4 database with blastx. The best hit for each read identified as bacteriocin is retrieved. The user can define an e-value cut-off for the homology analysis.
(e)
Mapping against the 16S rRNA SILVA database The same file of the previous step is used to align the reads against the SILVA database using DIAMOND [24]. Two files in.csv format are generated. The first contains the list of subject nucleotide sequences with their respective identity values. The other file contains the best hit for each query sequence based on an identity cut-off value provided by the user. Cut-off values are adjustable in the graphical interface of BADASS.

The number of reads identified as bacteriocins and 16S rRNA are used to calculate the richness and abundance of bacteriocins in the WMS dataset as mentioned later.

Programming language and database

BADASS was developed in JAVA (https://www.oracle.com) and used the Maven tool (https://maven.apache.org/) to build and manage the project. Maven was used due to the automated management and generation of the JAR package containing the software dependencies. Swing library was used to produce the graphical interface. The database management system used to control the steps and manage the project was SQLite v.3 (https://www.sqlite.org/).

Data source and software validation

The software validation was performed using four whole-metagenome shotgun sequencing datasets. Samples were obtained in the Tucuruí Hydroelectric Power Plant water reservoir submitted in EBI database under the accession numbers ERS1560860, ERS1560861, and ERS1562591 [8] and a sample obtained from Unai's Hot Spring from the ENA (European Nucleotide Archive) database with the accession number PRJEB8864. The following parameters were used: Quality threshold: 20, minimum length: 100, minimum quality score to keep: 16, minimum percent: 80%, e-value: 10, threads: 6, identity: 50.

Quality assessment and BLAST

BADASS uses the statistical platform R v.4 (https://www.r-project.org) for quality assessment of the raw data, through the FastQC package (https://github.com/kassambara/fastqcr). Reads are trimmed and quality filtered using the FASTX-Toolkit (http://hannonlab.cshl.edu/fastx_toolkit/). The user is able to adjust the parameters of quality filtering in the graphical interface of BADASS. BLAST v.2.0.9-3 and Diamond v.2.0.7.145 software are used to map the reads against the BAGEL4 [22, 25] and SILVA [26] databases, respectively. BAGEL4 was updated and now has about 500 Ribosomally synthesized and post-translationally modified peptides (RiPPs) (class 1), 230 non-modified bacteriocins (class 2), and 90 bacteriocins with more than 30 kDa (class 3). RippMIner [27] and the data repository MIBiG [28] were used to build the BAGEL4 database.

Abundance analysis

Diversity of bacteriocins was analyzed in terms of abundance and richness. In order to calculate the abundance of bacteriocins we adapted the formula proposed formula in studies involving the search of abundance of resistance genes in WMS data. [20]. Where: (1) n is the amount of bacteriocins that were found in sample; (2).N_{bacteriocin sequences} is the number of reads that mapped to a specific bacteriocin; (3) T_read is the average size of reads; (4) T_bacteriocin is the average size of the bacteriocin; (5) N_{16S rRNA sequences} is the number of reads that mapped to 16S rRNA sequences; and (6) T_{16S rRNA sequence} is the average size of the 16S rRNA sequence.

$$Abundance = \mathop \sum \limits_{1}^{n} \frac{{N_{bacteriocinsequences} \times \frac{{T_{read} }}{{T_{bacteriocin} }}}}{{N_{16SrRNAsequences} \times \frac{{T_{read} }}{{T_{16SrRNA} }}}}.$$

Workstation

Analyses were performed in a Desktop equipment Intel® Core™ i7-10510U CPU @ 1.80 GHz with 8 processing cores, 16 GB of RAM memory, and tests were run on Ubuntu 21.10, 64-bit, Windows 11 and macOS Ventura 13.0 operating systems.

Results and discussion

BADASS was developed using the BLAST and DIAMOND alignment tools to identify bacteriocin and 16S rRNA sequences in WMS raw data. The choice of tool can be defined in the BADASS GUI. Other studies have used similar methodologies for prospecting relevant genes such as ARGs [7, 20, 29, 30]. For example, ARGs-OAP is an online pipeline for antibiotic resistance genes detection in metagenomic data through similarity sequence analysis [5]. In addition to the homology search, BADASS calculates the abundance of each bacteriocin class by taking into account the size of the genes and the size of the reads produced by the sequencing library [22]. The number of reads identified as bacteriocins are compared to the number of reads identified as the gene of the 16S rRNA, which is present in a few copies per cell. This approach makes the size of the reads as well as the size of the genes not interfere with the analysis of abundance. Thus, this pipeline is a powerful tool for rapid and comprehensive evaluation of bacteriocin diversity using WMS raw data.

The main results of BADASS are the description of richness and the values of the abundance of bacteriocins using WMS raw data as input file in a simple and intuitive way. The software provides a set of adjustable parameters in the graphical interface. Users can also choose to process the samples using default parameters. In more detail, the results obtained by the software include: (1) quality assessment box plots of the raw data directly in the graphical interface or even in the results folder; (2) spreadsheets in.xlsx or.csv format containing information about the identified bacteriocins (richness) including their frequency (ratio between the number of reads identified as bacteriocin and the total number of reads in a sample) and abundance (calculated using the formula previously mentioned), organized by class; (3) bar plots of abundance based on the.csv files; (4) a.csv file containing the list of 16S rRNA sequences identified in the dataset including the value of percentage identity; (5) Trimmed.fastq and QV.fastq files containing the trimmed reads and the reads filtered by minimum size, respectively.

It is also possible to detect bacteriocin in genome sequences using other software. Table 1 presents several computational tools and databases developed to help in the identification of these antimicrobial peptides. The main features of each software or database are compared in the table. It is worth noting that BADASS is the only who has a graphical interface, supports WMS data, and performs diversity analysis (Table 1). BAGEL4 stands out for having one of the most complete databases containing a large number of annotated and experimentally verified bacteriocin sequences. In addition, the database is divided into three classes according to the genetic information and mechanism of action. Because of these features, BAGEL4 was used as a reference bacteriocin database in BADASS. BACTIBASE [30] is a database containing detailed information about the physicochemical properties of bacteriocins. This information allows a fast and accurate prediction of the structure–function relationship and possible target organisms of the antimicrobial peptides. Other relevant software includes BOA (Bacteriocin Operon Associator) [31] which uses Hidden Markov Models to predict bacteriocin clusters, Neubi [32] which identifies bacteriocins using a word embedding approach, and Anti-SMASH (Antibiotics and Secondary Metabolite Analysis Shell) which was launched in 2011 and is used not only for bacteriocin prediction but for a number of other secondary metabolites [18, 33].

Table 1 Comparison of the main features of software used for bacteriocin gene mining

Full size table

The identification of bacteriocins, however, is still quite challenging due to the limited number of known and experimentally analyzed sequences. Choosing the most appropriate and up-to-date tool is essential for the search and identification of bacteriocin genes. BADASS is a user-friendly software, with a robust pipeline that starts with the quality assessment of the raw data and ends with the analysis of the richness and abundance of bacteriocins.

A pilot analysis was performed using four datasets with default parameters. The results of the dataset ERR1816708 are presented in Table 2. First column of the table shows the BAGEL4 accession number and name of the bacteriocins. Other columns correspond to frequency, abundance and class, respectively. Thus, users are able to identify the diversity of bacteriocins in the dataset. Additionally, complementary analyses such as taxonomic affiliation of the microbial community are important to determine the ecological context of the described bacteriocins [8].

Table 2 CSV file generated by the BADASS presenting the names and other information of the detected bacteriocins

Full size table

Two bar plots were generated by the software containing an overview of the bacteriocin diversity. The first plot (Fig. 2) is designed based on a.csv file similar to Table 2. The plot presents the top ten most abundant bacteriocins in the dataset. Information about the bacterial species that commonly produce the peptides are presented in the legend. The second plot (Fig. 3) presents the abundance of bacteriocins by class. The best parameters for each study should be carefully chosen by the user according to it dataset characteristics. A variation in the results is expected since the parameters adjust the analysis performed by the software. The choice of parameters by the user will result in changes in the result, being able to restrict to stricter or looser parameters [34].

The time required for similarity search and post-alignment analysis has become a bottleneck as sequence costs decrease and the size of the datasets increases [35]. We also highlight that all the analysis, starting from the filtering of the raw data can be done in the BADASS pipeline. The software allows users to modify most of the parameters such as e-value, identity cut-off, and others.

Conclusions

In the environment, a countless number of microbial species coexist and, in order to succeed in colonize their ecological niches, many have developed mechanisms to eliminate other species through the production of antimicrobial molecules. In this chemical warfare, bacteriocins are narrow-spectrum antimicrobial peptides synthesized by ribosomal activity that are widely distributed in bacterial species. Thus, the development of computational tools to identify, classify and quantify bacteriocins in WMS datasets is of great importance for microbial ecology and biotechnology.

BADASS provides the user with a robust and automated computational tool with a simple and intuitive graphical interface, where the parameters can be adjusted by the user, allowing greater independence in the analysis of different samples. The integration of the software with the R statistical platform allows the generation of plots that helps in data interpretation. For those looking to prospect antimicrobial peptides in WMS raw data, BADASS is a powerful solution.

Availability and requirements

Project name: BADASS

Project home page: https://sourceforge.net/projects/badass/

Operating system(s): platform independent

Programming language: Java

Other requirements: e.g. Java 19.0.1 or higher

License: GNU GPL

Any restrictions to use by non-academics: license needed.

Availability of data and materials

The software is available at https://sourceforge.net/projects/badass/ and samples were obtained in the Tucuruí Hydroelectric Power Plant water reservoir submitted in EBI database under the accession numbers ERS1560860, ERS1560861, and ERS1562591 and a sample obtained from Unai's Hot Spring from the ENA (European Nucleotide Archive) database with the accession number PRJEB8864.

Abbreviations

BADASS:: BActeriocin-Diversity ASsessment Software
WMS:: Whole-Metagenome Shotgun Sequencing
ARGs:: Antibiotic resistance genes
NRPS:: Nonribosomal peptide synthase
PKS:: Polyketide synthase
BAGEL:: BActeriocin GEnome mining tool
ENA:: European Nucleotide Archive
RiPPs:: Ribosomally synthesized and post-translationally modified peptides

References

Stefani FOP, Bell TH, Marchand C, De La Providencia IE, El Yassimi A, St-Arnaud M, et al. Culture-dependent and -independent methods capture different microbial community fractions in hydrocarbon-contaminated soils. PLoS ONE. 2015;10:1–16.
Article Google Scholar
Handelsman J, Rondon MR, Brady SF, Clardy J, Goodman RM. Molecular biological access to the chemistry of unknown soil microbes: a new frontier for natural products. Chem Biol. 1998;5:R245–9.
Article CAS Google Scholar
Vester JK, Glaring MA, Stougaard P. An exceptionally cold-adapted alpha-amylase from a metagenomic library of a cold and alkaline environment. Appl Microbiol Biotechnol. 2015;99:717–27.
Article CAS Google Scholar
Arivaradarajan P, Paramasamy G, Nair SP, Allan E, Mullany P. Identification of an antibacterial protein by functional screening of a human oral metagenomic library. FEMS Microbiol Lett. 2015;362:1–6.
Article Google Scholar
Yang Y, Jiang X, Chai B, Ma L, Li B, Zhang A, et al. ARGs-OAP: Online analysis pipeline for antibiotic resistance genes detection from metagenomic data using an integrated structured ARG-database. Bioinformatics. 2016;32:2346–51.
Article CAS Google Scholar
Yang S-C, Lin C-H, Sung CT, Fang J-Y. Antibacterial activities of bacteriocins: application in foods and pharmaceuticals. Front Microbiol. 2014;5:241.
Google Scholar
Li H, Zheng X, Tan L, Shao Z, Cao H, Xu Y. The vertical migration of antibiotic-resistant genes and pathogens in soil and vegetables after the application of different fertilizers. Environ Res. 2021;2022(203):111884.
Google Scholar
Costa SS, Lago LAB, Silva A, Das Graças DA, Lameira J, Baraúna RA. Diversity of bacteriocins in the microbiome of the Tucuruí Hydroelectric Power Plant water reservoir and three-dimensional structure prediction of a zoocin. Genet Mol Biol. 2022;45:1–9.
Article Google Scholar
Cotter PD, Ross RP, Hill C. Bacteriocins-a viable alternative to antibiotics? Nat Rev Microbiol. 2013;11:95–105.
Article CAS Google Scholar
Yang Y, Jiang XT, Zhang T. Evaluation of a hybrid approach using UBLAST and BLASTX for metagenomic sequences annotation of specific functional genes. PLoS ONE. 2014;9:e110947.
Article Google Scholar
Montville TJ, Winkowski K, Ludescher RD. Models and mechanisms for bacteriocin action and application. Int Dairy J. 1995;5:797–814.
Article CAS Google Scholar
Riley MA, Wertz JE. Bacteriocins: evolution, ecology, and application. Annu Rev Microbiol. 2002;56:117–37.
Article CAS Google Scholar
Dobson A, Cotter PD, Ross RP, Hill C. Bacteriocin production: a probiotic trait? Appl Environ Microbiol. 2012;78:1–6.
Article CAS Google Scholar
Cotter PD, Ross RP, Hill C. Bacteriocins - a viable alternative to antibiotics? Nat Rev Microbiol. 2013;11:95–105.
Article CAS Google Scholar
Cotter PD, Hill C, Ross RP. Bacteriocins: developing innate immunity for food. Nat Rev Microbiol. 2005;3:777–88.
Article CAS Google Scholar
Huang F, Teng K, Liu Y, Cao Y, Wang T, Ma C, et al. Bacteriocins: potential for human health. Oxid Med Cell Longev. 2021;2021:1–17.
CAS Google Scholar
de Jong A, van Hijum SAFT, Bijlsma JJE, Kok J, Kuipers OP. BAGEL: a web-based bacteriocin genome mining tool. Nucleic Acids Res. 2006;34(Web Server Issue):W273–9.
Article Google Scholar
Medema MH, Blin K, Cimermancic P, De Jager V, Zakrzewski P, Fischbach MA, et al. AntiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39(SUPPL. 2):339–46.
Article Google Scholar
Santos-Aberturas J, Chandra G, Frattaruolo L, Lacret R, Pham TH, Vior NM, et al. Uncovering the unexplored diversity of thioamidated ribosomal peptides in Actinobacteria using the RiPPER genome mining tool. Nucleic Acids Res. 2019;47:4624–37.
Article CAS Google Scholar
Li B, Yang Y, Ma L, Ju F, Guo F, Tiedje JM, et al. Metagenomic and network analysis reveal wide distribution and co-occurrence of environmental antibiotic resistance genes. ISME J. 2015;9:2490–502.
Article CAS Google Scholar
Port JA, Cullen AC, Wallace JC, Smith MN, Faustman EM. Metagenomic frameworks for monitoring antibiotic resistance in aquatic environments. Environ Health Perspect. 2014;122(3):222–8.
Article CAS Google Scholar
van Heel AJ, de Jong A, Song C, Viel JH, Kok J, Kuipers OP. BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins. Nucleic Acids Res. 2018;46:W278–88.
Article Google Scholar
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
Article CAS Google Scholar
Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–1.
Article CAS Google Scholar
Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2014;12:59–60.
Article Google Scholar
Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41(Database Issue):D590–6.
CAS Google Scholar
Agrawal P, Khater S, Gupta M, Sain N, Mohanty D. RiPPMiner: A bioinformatics resource for deciphering chemical structures of RiPPs based on prediction of cleavage and cross-links. Nucleic Acids Res. 2017;45:W80–8.
Article CAS Google Scholar
Kautsar SA, Blin K, Shaw S, Navarro-Muñoz JC, Terlouw BR, Van Der Hooft JJJ, et al. MIBiG 2.0: a repository for biosynthetic gene clusters of known function. Nucleic Acids Res. 2020;48:D454–8.
Google Scholar
Manoharan RK, Srinivasan S, Shanmugam G, Ahn YH. Shotgun metagenomic analysis reveals the prevalence of antibiotic resistance genes and mobile genetic elements in full scale hospital wastewater treatment plants. J Environ Manag. 2020;2021(296):113270.
Google Scholar
Hammami R, Zouhir A, Le Lay C, Ben Hamida J, Fliss I. BACTIBASE second release: a database and tool platform for bacteriocin characterization. BMC Microbiol. 2010;10:22.
Article Google Scholar
Morton JT, Freed SD, Lee SW, Friedberg I. A large scale prediction of bacteriocin gene blocks suggests a wide functional spectrum for bacteriocins. BMC Bioinform. 2015;16:1–9.
Article Google Scholar
Hamid MN, Friedberg I. Identifying antimicrobial peptides using word embedding with deep recurrent neural networks. Bioinformatics. 2019;35:2009–16.
Article CAS Google Scholar
Blin K, Shaw S, Kloosterman AM, Charlop-Powers Z, Van Wezel GP, Medema MH, et al. AntiSMASH 6.0: improving cluster detection and comparison capabilities. Nucleic Acids Res. 2021;49:W29–35.
Article CAS Google Scholar
Dash S, Rahman S, Hines H, Feng W. Incremental BLAST: incremental addition of new sequence databases through e-value correction. bioRxiv. 2018:476218.
Mardis ER. A decade’s perspective on DNA sequencing technology. Nature. 2011;470:198–203.
Article CAS Google Scholar

Download references

Acknowledgements

The authors would like to thank to Universidade Federal do Pará (UFPA) for the structure to the development for this research.

Funding

The authors would like to thank to Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) for the scholarship, Centrais Elétricas do Norte do Brasil (Eletronorte) for the research funding and assistance during environmental sampling, and Pró-Reitoria de Pesquisa e Pós-Graduação (PROPESP) from Universidade Federal do Pará (UFPA) for the assistance in the payment of publication fees through the PAPQ program. The funding body did not interfere with the results achieved by the research.

Author information

Authors and Affiliations

Parque de Ciência e Tecnologia Guamá, Laboratório de Engenharia Biológica, Belém, Pará, Brazil
Sávio S. Costa, Artur Silva & Rafael A. Baraúna
Faculty of Computer Engineering, Federal University of Pará, Campus Tucuruí (CAMTUC-UFPA), Belém, Pará, Brazil
Gislenne da Silva Moia
Faculty of Computing, Federal University of Pará, Campus Castanhal (FACOMP/CCAST), Belém, Pará, Brazil
Adonney Allan de Oliveira Veras

Authors

Sávio S. Costa
View author publications
You can also search for this author in PubMed Google Scholar
Gislenne da Silva Moia
View author publications
You can also search for this author in PubMed Google Scholar
Artur Silva
View author publications
You can also search for this author in PubMed Google Scholar
Rafael A. Baraúna
View author publications
You can also search for this author in PubMed Google Scholar
Adonney Allan de Oliveira Veras
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

SSC, RAB and AS participated in the design and implementation of the pipeline. GM and AOV participated in the coding of the software. All authors contributed to the adaptation of the pipeline. All authors wrote the manuscript and all authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Sávio S. Costa.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Costa, S.S., da Silva Moia, G., Silva, A. et al. BADASS: BActeriocin-Diversity ASsessment Software. BMC Bioinformatics 24, 24 (2023). https://doi.org/10.1186/s12859-022-05106-x

Download citation

Received: 01 August 2022
Accepted: 07 December 2022
Published: 20 January 2023
DOI: https://doi.org/10.1186/s12859-022-05106-x

BADASS: BActeriocin-Diversity ASsessment Software