Introduction

Mycobacterium avium subsp. paratuberculosis (MAP) is a slow-growing, acid-fast, mycobactin dependent pathogen.

Infection with this bacterium ends up in a chronic inflammation in cattle and other wild and domestic species, referred to as paratuberculosis (PTB) or Johne’s disease (JD) (Clarke 1997; Harris and Barletta 2001). Clinical signs of PTB include diarrhea, weight loss, decreased milk yield, and mortality, leading to important economic losses (McAloon et al. 2016). A major concern on MAP is the effortlessness with which the bacterium is spread, since subclinical or clinically infected animals shed the bacterium in feces and milk, empowering dissemination to susceptible calves, the environment, and dairy by-products for human consumption (Sweeney 1996, 2011; Adhikari et al. 2017; Lorencova et al. 2019). MAP-containing milk is of particular concern because the bacterium has been suggested as a possible link to Crohn’s disease (CD) in humans (Kuenstner et al. 2017).

Most infected ruminants never progress toward the clinical stage, remaining sub-clinically infected by MAP. Such epidemiological particularity possibly involves several factors including host-susceptibility, environment, and pathogenic differences between MAP strains (Ahlstrom et al. 2015; Bryant et al. 2016). MAP genotyping is a valuable tool in epidemiological exploration, leading to a better understanding of MAP-infection dynamics, associated risk factors, transmission behavior, and pathogenesis. Consequently, genotyping-based findings can support the implementation of rational and well-adapted control measures, diagnosis improvement, and vaccine development.

MAP strain K-10 complete genome by Li et al. (2005) was the first step to clarifying the molecular basis of MAP’s physiology and virulence, providing the basis for the development of further generations of diagnostic tests and vaccines.

Besides all investigation efforts worldwide, little is known about MAP genetic diversity among susceptible species and specifically, among different geographical locations —a prerequisite for a well-defined understanding of the epidemiology and biology of this bacterium (Stevenson 2015; Bryant et al. 2016). Therefore, we aimed to systematically collect and appraise the scientific evidence to answer the research question: What MAP genotypes have been isolated from cattle, sheep, and goats in Latin America and the Caribbean?

Materials and methods

This review was designed, performed, and reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Moher et al. 2009). Through it, we will refer to a citation as a title of an article, detected during the stage of title screening, to an article as an entire paper, publication, or research report.

Search strategy

The process of identifying relevant articles considered a specific research question: What MAP genotypes have been isolated from cattle, sheep, and goats in Latin America and the Caribbean?

The following search procedures were implemented on August 2017 and updated on August 2020. The topic was broken down into components and the search terms use to find relevant citations in databases were [Mycobacterium avium paratuberculosis OR paratuberculosis OR Johne*] AND [ruminant? OR cattle OR bovine? OR cow? OR goat? OR caprine? OR sheep OR ovine? OR livestock] AND [characterization OR molecular OR epidemiol* OR fingerprint* OR phylogen* OR genetic* OR genomic* OR genotype* OR diversi* OR dna OR adn OR isolate* OR strain? OR typ* OR subtype*] AND [vntr? OR variable number tandem repeat? OR miru OR mycobacterial interspersed repetitive unit? OR ssr? OR short sequence repeat? OR mlssr OR ml-ssr OR multi-locus short sequence repeat? OR multilocus short sequence repeat? OR rflp OR restriction fragment length polymorphism OR pfge OR pulsed-field gel electrophoresis OR rda OR redundancy analys* OR ghc OR generalized higher criticism OR dgge OR denaturing gradient gel electrophoresis OR snp* OR single nucleotide polymorphism? OR aflp* OR amplified fragment length polymorphism? OR hrm OR high resolution melt* OR rea OR restriction enzymes analys* OR rapd OR random amplification of polymorphic dna OR mlva OR multiple locus variable-number tandem repeat analys*].

Three search platforms (i.e., OVID®, Web of Science®, SciELO) including seven databases (i.e., MEDLINE, CAB Abstracts, Biological Abstracts®, Embase, Web of Science™ Core Collection, Biological Abstracts®, SciELO Citation Index®) were searched. The proceedings of the 3rd to the 12th International Colloquium on Paratuberculosis (ICP) —held between 1991 and 2014, were available from the platforms explored. The 13th and 14th ICP proceedings (2016 and 2018, respectively) were available at the International Association for Paratuberculosis web site (http://www.paratuberculosis.net/) and were hand-searched for existing published primary studies derived from the abstracts by two authors.

Eligibility screening

The inclusion criteria considered all languages. Nevertheless, only those articles, published in peer-reviewed journals, with an available abstract in English, French, German, Portuguese, or Spanish were initially considered. No publishing year or country limitation was considered. In the case of the ICP proceedings and other abstracts found, studies were included in the search strategy only to identify further citations in peer-reviewed journals.

The first selection of citations was done according to the information contained only in the title. Two of the authors made the selection. The exclusion of the citations was done considering the possibility to answer the research question. The reasons for exclusion were (i) not-related topics (e.g., prevalence, Crohn’s disease, economic impact, Mycobacterium bovis); (ii) not the species of interest (e.g., humans, buffaloes, wildlife, camelids); (iii) not in the region of interest (e.g., Egypt, Europe, India, USA); or (iv) not original articles (e.g., review, book). Duplicated articles were not considered. All citations selected by at least one of the two authors were considered eligible to continue in the process.

The eligible citations were screened using the abstract by two of the authors. Inclusion and exclusion criteria were the same as for the title screening. Conflicting was resolved through consensus between authors and if necessary, by a third author.

The remaining articles were studied using the entire paper by two authors to ensure that they contained relevant information (evidence) to answer the research question. Conflicts were resolved through consensus between authors and if necessary, by a third author.

A kappa coefficient was estimated at each step (i.e., title, abstract, entire paper) as previously reported (Landis and Koch 1977), to measure the inter-rater reliability (and also intra-rater reliability) among the number of yes/no in response to the acceptance of each title in each step and by each author.

Two of the authors searched the reference lists of relevant citations identified by the full-text screening for additional published primary articles (“snowballing”). A second snowballing was done considering the reference list of the articles found by the first snowballing procedure.

Data extraction and descriptive summary

After all available articles were compiled, a descriptive summary was presented by one of the authors, considering bibliographic information, country of isolation/detection, source population, matrix, sample size, genotyping method used (including restriction enzyme, when applies), locus, and results (of typing and subtyping). A second author reviewed the data extraction product.

Epidemiological diagrams

The relation among the BstEII restriction profiles was presented in a dendrogram and the different INMV profiles were presented using probable patterns of evolutionary descent between allelic profiles (clonal relationship), inferred by the goeBURST algorithm (Feil et al. 2004), through the minimum spanning tree (MST). It is assumed that the genetic distance between two INMV patterns is proportional to the difference in the number of repeats at each locus. These relations were established using PHYLOViZ 2 software (http://www.phyloviz.net/; Ribeiro-Gonçalves et al. 2016).

Results

Relevant articles accomplishment

The electronic search found 544 eligible citations (after de-duplication), potentially related to the subject of this systematic review. Manual searching of the 13th and 14th ICP proceedings (2016 and 2018, respectively) provided nine papers. The final number of citations to be furtherly filtered was 553. After reading the titles of the articles, 379 were considered irrelevant (agreed by two authors). The final number of citations based on title screening was 174 (retained by at least one author). After reading the abstracts of the citations, 107 were excluded (by both authors) and 67 original articles remained for the full-text review. Forty-three were excluded because of the criteria already described by title and abstract screening. The snowballing strategy was applied through the reference lists of the 24 definitive articles, and then a second snowballing to the two additional articles kept, leading no new results. Finally, 26 articles met the inclusion criteria of the systematic review (Fig. 1). All were published in English published between July 1989 and August 2020.

Fig. 1
figure 1

Flow chart of selection of relevant articles (PRISMA), describing the progress of the citation trough the systematic review. *Some titles content more than one exclusion criteria

The relevant articles reported the use of six different genotyping techniques (i.e., polymerase chain reaction-restriction endonuclease analysis —PCR-REA, restriction fragment length polymorphism —RFLP, type-specific-PCR, mycobacterial interspersed repetitive units-variable number of tandem repeats —MIRU-VNTR, multi-locus short sequence repeat —MLSSR, single nucleotide polymorphism —SNP) in isolates from seven countries of the region of study. One article reported the use of hsp65 PCR-REA, seven of IS1311 PCR-REA, and 11 of IS900-RFLP, using the enzymes BstEII, PstI, HaeIII, HinfI, MseI, PvuII, and BclI. MIRU-VNTR was used in eight articles, considering 17 loci, where the most frequently reported were 1658 (X3), 32, 292, 10, 25, 47, 7, and 3. MLSSR was reported in five articles, mainly considering loci 1, 2, 8, and 9. IS1311 SNP and IS900 type-specific-PCR techniques were reported by one and four articles, respectively. Genotypes found so far in the region using typing techniques were mainly C type. MIRU-VNTR mostly reported INMV 1, INMV 2, and INMV 11 subtypes, among others. MLSSR reported genotypes from four different countries, reporting nine different subtypes of which 7g–10g4ggt was the most common for loci 1, 2, and 8, respectively. Detailed information extracted from the 26 articles included in this systematic review is presented in Tables 1 and 2.

Table 1 Detailed bibliographic information extracted from the 26 relevant articles
Table 2 Detailed information extracted from the 26 relevant articles, concerning the answer to the research question

Figure 2 shows epidemiological diagrams obtained and defined over a map of the region of interest with the different MAP animal hosts of study, using the MST diagram for the INMV profiles (A) and a hierarchical clustering diagram, based in BstEII restriction profiles (B).

Fig. 2
figure 2

A goeBURST full minimum spanning tree diagram of INMV profiles, based in the MIRU-VNTRs repeats reported in cattle, sheep, and goats in Latin America and the Caribbean (July 1989–August 2020). B Hierarchical clustering diagram, based in in BstEII restriction profiles reported in cattle, sheep, and goats in Latin America and the Caribbean (July 1989–August 2020)

Discussion

Studies on MAP molecular diversity in Latin America and the Caribbean were reviewed herein, using a systematic approach for the first time on the topic. This report aimed to collect and present all the available data over the available scientific documentation on MAP genotypes isolated from cattle, sheep, and goats in the region of interest so far, expecting to be similar among countries and MAP animal hosts since the countries of the region share some conditions that could define such outcome.

Results obtained in the present review provide important advances in the knowledge of the disease and its etiological agent. Nevertheless, much remains to be accomplished as it has been described in other countries under different productive conditions (Barkema et al. 2017). Such epidemiological evidence could be valuable in the estimation of the transmission of MAP in a herd, among infected facilities from different areas (even countries) or between different animal hosts, and help to improve strategies on the control of the disease.

The number of articles on MAP molecular approaches in the region of interest suggests an increasing trend in PTB research and therefore, an increasing interest in the disease and its negative impacts. In this regard, only 10 articles were published between 1989 and 2005 (during 16 years). As of 2005, with the publication of Li et al. (2005) on the complete genome sequence of MAP, 16 more articles were published in almost 15 years from the seven different countries. It is important to mention that all articles meeting the criteria of the present systematic review were published in English —considered as the language of science, and remarkably, in non-Latin American journals. These results could be founded on the greater visibility that the researchers wanted to give to their findings on the scientific community.

The final number of articles included in the present review (n = 26) is not negligible, but it does represent a significant dilution over time. This could be a result of the interaction of different aspects, including budget limitations (which is a Latin American reality), a low frequency of collaborative research within the region and with other regions, lack of knowledge of the disease plus a low interest in research or screening for it, and because of —in most of the cases, the research front is assumed by the universities, compromising the haste in its development. In addition, since PTB is not a prioritized disease or of mandatory reporting in the region of impact (Whittington et al. 2019), and due to its prolonged incubation period and culturing and testing special features, the expectation of the research product —mainly typing, is not achieved at a short or medium term, since the molecular identification of MAP requires a considerable progress in exploration on the subject, and it is a final step on the research line.

Another aspect that could be analyzed given the results of the present systematic review is related to the evolution in the molecular analysis of MAP, where it can be seen how some countries progressed to techniques that were marking the forefront, while others preserved traditional techniques, which, added to the low number of isolates (due to the factors previously discussed), could explain the limited number of findings for a region as wide and diverse as Latin America and the Caribbean. In this respect another aspect presenting one more limitation is the productive potential or tendency of each country, which explains the lack of information for one country or another on a specific MAP animal host. For example, an interesting question would be if Venezuela does not have MAP molecular findings, since no studies on this topic have been reported in cattle or other susceptible species in this country so far.

According to our results, it can be observed that the Argentinean and Colombian isolates present an observable epidemiological relation, both in the analysis based on MIRU-VNTR profiles and in those of BstEII restriction profiles. For the rest of the isolates, corresponding to the other countries, such similarity was not observed. Any conclusion in this regard should be considered with caution, given the limited number of isolates in some countries, as previously discussed.

The different genotyping techniques presented herein are based on diverse study designs and approaches (e.g., different markers, RFLP-enzymes, matrices, sample size). Therefore, the interpretation derived —related to MAP molecular diversity in the region of interest, is not definitive. The present systematic review is an exploratory study, which allows to visualize the results that, in turn, are not definitive but can contribute with an interesting and well-based overview. In definitive, important limitations are related to the high diversity of matrices and techniques and the low number of studies. For example, Chilean isolates using MIRU-VNTR reported a different set of loci, which does not allow comparisons with the results for the same technique (and even animal species) with those reported in a different country or region of Chile. This is the reason why Chilean findings were not included in Fig. 2.

In definitive, the high variety of methods used so far to genotype Latin American and Caribbean MAP isolates and the limited number of reports makes difficult to analyze the results obtained from the original question of this systematic review. However, we identified a genetic similarity between MAP strains from cattle, goats, and sheep, unrelatedly of the matrix and the geographic origin.

Specific MIRU-VNTR and MLSSR MAP subtypes in the region of study were determined. MIRU-VNTR mostly reported INMV 1, INMV 2, and INMV 11 subtypes. Although comparisons among studies are very difficult because of the use of different analytic loci, it has to be mentioned that MIRU-VNTR profiles 1 (INMV 1) and 2 (INMV 2) have been reported as the most common genotypes found in isolates from European isolates (Stevenson et al. 2009; Douarre et al. 2011; Biet et al. 2012; Imperiale et al. 2017). INMV 11 subtypes seem to be exclusive from the region of study, since no reports on different latitudes have been published so far. MLSSR subtypes were reported from four different countries, leading to nine different subtypes of which 7g–10g–4ggt was the most common one. According to such profile, the subtypes isolated in our study are partially identified by eight codes (i.e., MLSSR 2, MLSSR 5, MLSSR 28, MLSSR 30, MLSSR 31, MLSSR 33, MLSSR 49), according to a web search on http://mac-inmv.tours.inra.fr/macinmv/index.php?p=fa_ident_ssr. Therefore, epidemiological links or relative relation of them with other results in the world is not feasible to establish.

According to the information presented herein, a change to most commonly accepted protocols must be proposed. It is well-known that MAP isolates reveal a restricted diversity and a substitution frequency consistent with a monomorphic pathogen, so it presents a challenge to genotyping (Bryant et al. 2016). Therefore, the leadership of an international entity is proposed to recommend and follow up a conserved classification that allows the tracking of findings all around the world, available from international and freely accessible repositories. This kind of approaches could improve cooperative research among universities, regions, and countries, and of course, would allow to define a differential behavior or dynamic of the study region with respect to the world (which is complicated to the date).

Regarding the methodological process followed in the present systematic review, it included a well-defined protocol based on a clearly defined research question. Two of the authors independently followed selection and results were always by consensus. In a more detailed way, an a priori written protocol (previously reported and endorsed by health-related systematic reviews) based on a clearly stated and delimited research question was followed. A comprehensive search from several databases and sources to identify relevant studies including general-purpose databases, search engines, journals, conference proceedings, book chapters, and books from 1910 (CAB Abstracts) to the date was carried out and the eligibility of included studies by using pre-established and explicit inclusion/exclusion criteria all along the process was assessed. No geographic or temporal constraints were considered, so no biases-related results are thought to be yielded. Two of the authors independently followed selection principles, and results from each searching step were always accomplished by consensus. Agreement measures were reported all along the process to assure reliability of the results. By last, data extracted from the original studies was clearly delineated. Because relevant studies varied in quality and in methodology, one of the authors constructed a matrix of findings, which were furtherly revised by a second author to assure consistency of the information extracted.

We recognize that our SR contains a modest sample of 26 original articles. However, our findings represent the most comprehensive summary of MAP genotypes isolated from cattle, sheep, and goats in Latin America and the Caribbean so far.

Gray literature was not fully considered, because many of these documents are difficult to find and acquire. Therefore, the authors are aware of missing information that could be reporting MAP genotypes in the hosts and regions of interest that may have not been identified. Snowballing (at different levels) strategy was carried out to diminished the effect of such inadvertence, plus to a hand searching of the 14th ICP proceedings, leading to a maximum yield of relevant articles. Bias may have been introduced by the exclusion of non-English, French, German, Portuguese, or Spanish language studies.

In light of the currently available data, systematically reviewing the literature on this subject is difficult as there is considerable variability in study designs and approaches among studies, since individual study designs and genotyping protocols’ characteristics varied considerably among reports. Therefore, it is important to consider the information with the prudence that it deserves and to consider that the outcome of interest (i.e., MAP isolation molecular identity) may vary from case to case. Information from scientific literature is limited, because of studies objectives, designs, and reporting. Authors consider that further studies in this area should follow standardized guidelines when designing and implementing MAP molecular research approaches and reporting their results.