Escherichia coli phylogenetic group determination and its application in the identification of the major animal source of fecal contamination

Carlos, Camila; Pires, Mathias M; Stoppe, Nancy C; Hachich, Elayse M; Sato, Maria IZ; Gomes, Tânia AT; Amaral, Luiz A; Ottoboni, Laura MM

doi:10.1186/1471-2180-10-161

Escherichia coli phylogenetic group determination and its application in the identification of the major animal source of fecal contamination

Research article
Open access
Published: 01 June 2010

Volume 10, article number 161, (2010)
Cite this article

Download PDF

You have full access to this open access article

BMC Microbiology Aims and scope Submit manuscript

Escherichia coli phylogenetic group determination and its application in the identification of the major animal source of fecal contamination

Download PDF

Camila Carlos¹,
Mathias M Pires²,
Nancy C Stoppe¹,
Elayse M Hachich³,
Maria IZ Sato³,
Tânia AT Gomes⁴,
Luiz A Amaral⁵ &
…
Laura MM Ottoboni¹

20k Accesses
197 Citations
Explore all metrics

Abstract

Background

Escherichia coli strains are commonly found in the gut microflora of warm-blooded animals. These strains can be assigned to one of the four main phylogenetic groups, A, B1, B2 and D, which can be divided into seven subgroups (A₀, A₁, B1, B2₂, B2₃, D₁ and D₂), according to the combination of the three genetic markers chuA, yjaA and DNA fragment TspE4.C2. Distinct studies have demonstrated that these phylo-groups differ in the presence of virulence factors, ecological niches and life-history. Therefore, the aim of this work was to analyze the distribution of these E. coli phylo-groups in 94 human strains, 13 chicken strains, 50 cow strains, 16 goat strains, 39 pig strains and 29 sheep strains and to verify the potential of this analysis to investigate the source of fecal contamination.

Results

The results indicated that the distribution of phylogenetic groups, subgroups and genetic markers is non-random in the hosts analyzed. Strains from group B1 were present in all hosts analyzed but were more prevalent in cow, goat and sheep samples. Subgroup B2₃ was only found in human samples. The diversity and the similarity indexes have indicated a similarity between the E. coli population structure of human and pig samples and among cow, goat and sheep samples. Correspondence analysis using contingence tables of subgroups, groups and genetic markers frequencies allowed the visualization of the differences among animal samples and the identification of the animal source of an external validation set. The classifier tools Binary logistic regression and Partial least square -- discriminant analysis, using the genetic markers profile of the strains, differentiated the herbivorous from the omnivorous strains, with an average error rate of 17%.

Conclusions

This is the first work, as far as we are aware, that identifies the major source of fecal contamination of a pool of strains instead of a unique strain. We concluded that the analysis of the E. coli population structure can be useful as a supplementary bacterial source tracking tool.

Correlation between the genomic o454-nlpD region polymorphisms, virulence gene equipment and phylogenetic group of extraintestinal Escherichia coli (ExPEC) enables pathotyping irrespective of host, disease and source of isolation

Article Open access 16 September 2014

Phylogenomic approaches to determine the zoonotic potential of Shiga toxin-producing Escherichia coli (STEC) isolated from Zambian dairy cattle

Article Open access 25 May 2016

Genotype variation and genetic relationship among Escherichia coli from nursery pigs located in different pens in the same farm

Article Open access 05 January 2017

Background

Escherichia coli, a bacterium widely spread among warm-blooded animals, has been used as an indicator of water fecal contamination. Fecal pollution in water can indicate the presence of waterborne pathogens, such as Salmonella and Giardia [1]. The identification of the major animal source of fecal contamination is extremely important for the effective management of water systems [2]. Therefore, several methods of bacterial source tracking (BST), using E. coli strains, have been developed to identify the animal source of fecal contamination. Among these methods are ribotyping, rep-PCR, antibiotic resistance profiles, among others [3]. However, until now, only one putative human-specific strain [4] and one putative animal-specific strain have been found [5].

Escherichia coli strains can be assigned to one of the main phylogenetic groups: A, B1, B2 or D [6–8]. According to Lecointre et al. [9], groups A and B1 are sister groups whereas group B2 is included in an ancestral branch. These phylo-groups apparently differ in their ecological niches, life-history [10] and some characteristics, such as their ability to exploit different sugar sources, their antibiotic-resistance profiles and their growth rate [11]. Walk et al. [12] demonstrated that the majority of the E. coli strains that are able to persist in the environment belong to the B1 phylogenetic group. Furthermore, genome size differs among these phylo-groups, with A and B1 strains having smaller genomes than B2 or D strains [13]. Johnson et al. [14] found that strains from phylo-groups B2 and D contained more virulence factors than strains from the phylo-groups A and B1.

The extraintestinal pathogenic strains usually belong to groups B2 and D [15, 16], the commensal strains to groups A and B1 [17], whilst the intestinal pathogenic strains belong to groups A, B1 and D [18]. Clermont et al. [19] have developed a PCR based method to characterize the phylo-groups using the genetic markers chuA, yjaA and the DNA fragment TspE4.C2. To increase the discrimination power of E. coli population analyses, it has been proposed the use of subgroups A₀, A₁, B1, B2₂, B2₃, D₁ and D₂, that are determined by the combination of the genetic markers [5].

Some authors analyzed the distribution of the main phylogenetic groups among E. coli strains isolated from human and animal feces. Gordon and Cowling [10] observed that the relative abundance of phylogenetic groups among mammals is dependent on the host diet, body mass and climate. Escobar-Páramo et al. [5] analyzing fecal strains isolated from birds, non-human mammals and humans, observed the prevalence of groups D and B1 in birds, A and B1 in non-human mammals, and A and B2 in humans. These authors concluded that one of the main forces that shapes the genetic structure of E. coli populations among the hosts is domestication. Baldy-Chudzik et al. [20] analyzed feces from zoo animals and found a prevalence of group B1 in herbivorous animals and a prevalence of group A in carnivorous and omnivorous animals.

The aim of this work was to analyze the distribution of phylogenetic groups and subgroups in feces from different animals and to assess the potential application of this analysis in identifying the major source of fecal contamination in the environment.

Results

In this work, 241 E. coli strains isolated from feces of different animals and 12 strains isolated from a sewage source were allocated into four phylogenetic groups (i.e. A, B1, B2 and D) and seven subgroups (i.e. A₀, A₁, B1, B2₂, B2₃, D₁ and D₂). As shown in Table 1, the strains analyzed were distributed among the seven subgroups, and the prevalence indexes calculated for the subgroups were: A₀ = 83.33%, A₁ = 83.33%, B1 = 100%, B2₂ = 50%, B2₃ = 16.67%, D₁ = 66.67 and D₂ = 66.67%. It is interesting to note that strains from group B1 were found among all the analyzed hosts, whereas strains from subgroup B2₃ were found only in humans.

Table 1 Distribution of the E. coli phylogenetic subgroups among the hosts analyzed

Full size table

The graphic representation shown in Figure 1 allowed the identification of remarkable trends among the E. coli strains from the different hosts. Humans are the only host bearing strains from all the phylo-groups, except for subgroup A₀. The strains found in the pig samples were also distributed among all phylo-groups, except for subgroup B2₃, which contains only strains from the human samples. Most of the strains from the chicken samples were included in subgroup A₀, that is, these strains did not reveal the presence of the genetic markers investigated. Most of the strains of cows, goats and sheep fell within group B1, despite the fact that four strains of cows and three of chickens were assigned to subgroup D₁ and two strains of goats and two of cows were assigned to group A₁.

A Chi-square value of 97.611, 15 degrees of freedom (D.F.), p < 0.0001, was obtained from a contingency table with the phylogenetic groups distribution among the hosts, allowing the null hypothesis, which states that there is no association between the hosts and the groups, to be rejected (p < 0.0001). This result suggests a significant difference in the E. coli population structure among the animals analyzed. A Chi-square test at the subgroup level was performed to verify the existence of an association between the hosts and the phylogenetic subgroup. The calculated 155.251 Chi-square value (30 D.F.), leads to the rejection of the null hypothesis (p < 0.0001). A Chi-square test was also performed to verify the association between the hosts and the genetic markers (chuA, yjaA and TspE4.C2). The result (Chi-square value = 87.563, 10 D.F., p < 0.0001) indicated that the genetic markers are differently distributed among the hosts (Table 2).

Table 2 Distribution of the E. coli genetic markers among the hosts analyzed

Full size table

The Shannon and Simpson diversity indexes [21, 22] were used to analyze the phylogenetic subgroup data. As shown in Table 3, the largest diversity indexes were observed for humans (Shannon index = 0.6598, Simpson index = 0.7331) and pigs (Shannon index = 0.6523, Simpson index = 0.7245), whilst the smallest diversity was observed for goats (Shannon index = 0.2614, Simpson index = 0.3203). The Pianka's similarity index was calculated using the phylogenetic subgroup distribution for each pair of hosts (Table 4). The results indicated that humans and pigs exhibited a similarity of 88.3%, whereas cows, goats and sheep exhibited an average similarity of 96%.

Table 3 Shannon's and Simpson's diversity index of each host analyzed

Full size table

Table 4 Pairwise Pianka's index of similarity among the hosts analyzed

Full size table

A Correspondence Analysis (CA) was performed using the phylogenetic groups and subgroups distribution and the genetic markers distribution (Tables 1 and 2). The bidimensional representation of subgroups distribution in each host is shown in Figure 2. This bidimensional representation can explain 93.74% of the total inertia. The horizontal axis represents 73.85% of the total inertia, which is responsible for the major separation. According to this analysis, the subgroup distribution was similar for cows, goats and sheep and for pigs and humans (Figure 2). A sewage sample was included in the CA (Figure 2). This sample included the following subgroups: A₀ (one strain), A₁ (five strains), D₁ (four strains) and D₂ (two strains). As expected, this subgroup distribution was similar to the one found for humans (Figure 2).

The CA using the genetic markers distribution resulted in a bidimensional representation that can explain 100% of the total inertia (Figure 3), being the horizontal axis responsible for 92.04% of it. According to this analysis, the genetic markers distribution was similar for cows, goats and sheep and for humans, chickens and pigs. The sewage sample, in which six strains presented the chuA gene, five the yjaA gene and two the TspE4.C2 fragment, was plotted near the human sample (Figure 3).

The discrimination power of the phylogenetic groups A, B1, B2 and D was also tested using CA (Figure 4). According to this analysis, the bidimensional representation of the phylo-groups relative abundance can explain 99.1% of the total inertia, being the horizontal axis responsible for 82.54% of it. This analysis revealed that the phylo-group distribution among cows, goats and sheep, which presented a predominance of strains of the B1 group, was similar. Humans, chickens and pigs remained separated. E. coli strains isolated from two Rivers, Jaguari and Sorocaba, located in the State of São Paulo, Brazil, and previously analyzed by Orsi et al. [23], were also included in this CA analysis (data not shown). The strain composition of the Jaguari River included 42 strains of group A, 13 strains of group B1 and six strains of group D. The Sorocaba River included 45 strains of group A, 14 strains of group B1, one strain of group B2 and eight strains of group D. The strains distribution among the phylo-groups, from both rivers, was similar to the one observed for chickens and pigs. The sewage sample was also included in this CA and once again, this sample was similar to humans (Figure 4).

The results obtained with the classifier tools BLR and PLS-DA using the genetic markers are summarized in Table 5. The separation between E. coli strains of omnivorous and herbivorous mammals presented the lowest classification error rate (17% on average), while the highest classification error rate (25% on average) was observed between E. coli strains of humans and non-humans. Both classifier tools demonstrated that the chuA and the yjaA genes were more informative to discriminate between E. coli strains of human and non-human sources (data not shown). The PLS-DA tool showed that the yjaA gene and the TspE4.C2 DNA fragment were more informative to discriminate between E. coli strains of herbivorous and omnivorous mammals. The error rate for BLR and PLS-DA was higher in the prediction of human than in non-human samples (data not shown). However, when the feeding habit of mammals was considered in the separation, the error rate for both tools was higher in the prediction of the herbivorous samples.

Table 5 Classification error rates obtained by validation of supervised learning classifier tools (BLR and PLS-DA)

Full size table

Discussion and Conclusions

This study demonstrated that phylogenetic subgroup, group and genetic markers distribution are not randomly distributed among the hosts analyzed. The results showed a similarity between the E. coli population structure of humans and pigs (omnivorous mammals) and of cows, goats and sheep (herbivorous mammals). Humans and pigs exhibited the highest diversity indexes, while goats and sheep exhibited the lowest ones. Using the simulations of the EcoSim software [24], it was possible to conclude that the diversity indexes are significantly different among the herbivorous and omnivorous mammals. The Pianka's similarity index showed that the human sample was more similar to the pig sample (88.3% of overlap). Cows, goats and sheep also presented a high overlap (96% on average), while chickens presented the lowest values.

Cows, goats and sheep are ruminant mammals which differ in many gut characteristics from other animals. Humans and pigs present common gut characteristics because they are monogastric animals (reviewed in [25]). Besides the gut characteristics, the diet of the host appears to have selected the phylo-group profile in the Brazilian mammals analyzed in this work. Omnivorous mammals presented a prevalence of phylo-group A, while the herbivorous mammals presented a prevalence of phylo-group B1. Previous research by Gordon and Cowling [10] revealed a different result from ours, identifying a prevalence of strains of phylo-group B2 among herbivorous and omnivorous mammals and a prevalence of phylo-goup B1 among birds and carnivorous mammals, which supports their hypothesis of geographic effects in the E. coli population structure among hosts. However, they also concluded that phylo-groups A and B1 are "generalists" and B2 and D are "specialists", which is in agreement with our data since strains of group B1 were found in all the hosts analyzed, followed by subgroups A₀ and A₁. On the other hand, subgroup B2₃ was present only in the human sample. Therefore, our results suggest that B2 strains, especially subgroup B2₃, could be a good indicator of human feces contamination.

Group B1 was prevalent among the herbivorous hosts. However, this phylo-group is not a promising indicator of herbivorous feces contamination because it was found in all the hosts analyzed, and, apparently, most E. coli strains that are able to survive in the environment, belong to this group [12]. According to our data, the distribution analysis of phylo-groups A and D is a powerful discriminating tool since both groups presented a considerable contribution to the Chi-square values (data not shown).

The chuA and yjaA genes were rarely found in strains of cows, goats and sheep but were commonly found in human, chicken and pig strains. Sobieszczańska [26] showed that 95.5% of the enteroaggregative E. coli strains carried the chuA gene, which encodes for a haem receptor. Strains belonging to group B2 were not found in cows, goats and sheep. Other studies have demonstrated that B2 and D strains are usually more pathogenic than A and B1 strains [16, 17, 27, 28]. In fact, verocytotoxin-producing E. coli, like O157:H7, belongs to group D [29] and cattle are the main reservoirs of this pathogen. The prevalence of groups B2 and D and of the chuA and yjaA genes in humans and pigs might suggest that fecal contamination by these animals can present a high risk of extra-intestinal pathogenic E. coli. Thus, the correct identification of this kind of fecal contamination can also be useful to the appropriate management of environmental pollution.

Correspondence analysis is a descriptive/exploratory technique, based on Chi-square values, that allows the exploration of the structure of the data. In the three CA models performed, similar distribution patterns were observed among E. coli strains of herbivorous mammals and among strains of omnivorous mammals. Furthermore, the CA of subgroup distribution allowed the discrimination of omnivorous mammals. Similar results were observed by Baldy-Chudzik et al. [20]. These authors suggested that the E. coli strains of group B1 are best adapted to herbivorous, whereas strains of group A are best adapted to omnivorous mammals. The three CA models correctly predicted the animal/human source of the external validation sample (sewage), indicating that a significant part of the E. coli phylo-group diversity was covered by the strains database, which reveals the stability of the models. E. coli samples from the Jaguari and Sorocaba Rivers [23] were also used to test the CA model based on phylo-group distribution. Our analysis suggested that pigs were the major source of fecal contamination in both rivers, which is in agreement with Orsi et al. [23], confirming that the major source of fecal contamination of these rivers was non-human. Therefore, these results indicate that the CA model can be efficiently applied in the discrimination of E. coli strains from different animal sources.

Both classifier tools (BLR and PLS-DA) and both validation methods (cross-validation and train-test) exhibited similar overall error rates for each strain separation analyzed. This way, the statistical method used did not show a significant interference in the obtained results. Excluding the chicken sample, the best classification was obtained when the E. coli strains were separated according to the feeding habits of the hosts (omnivorous and herbivorous mammals). Although the classification error rates found could be considered high, similar error rates were observed in other BST studies [30, 31].

Since it is very difficult to find host-specific strains or genetic markers [4, 32], in this work we propose a new approach to identify the animal source of fecal contamination in water systems. This approach is based on the specificity of the E. coli population structure instead of host-specific strains. Geographic variation of the E. coli population structure was reported in the literature [10, 32] and since the relative abundance of phylo-groups among hosts can be easily characterized, this approach can be implemented in different regions of the world as a supplementary bacterial source tracking tool. Although our data is consistent in showing the potential applicability of this approach, we are aware that there might be some limitations due to the limited number of fecal pollution sources analyzed.

Methods

The present study has been approved by the Research Ethics Committee of the State University of Campinas School of Medical Sciences.

Escherichia coli Strains

Two hundred and forty one strains of E. coli were isolated (collected with sterile swabs) from fecal samples of a variety of hosts (Table 6). Each strain was isolated from a single animal. These strains were used to build the calibration set for further statistical analysis.

Table 6 Source and number of E. coli strains used in this study

Full size table

Twelve sewage strains isolated by CETESB (Table 6), the organization responsible for the control of environmental pollution, sewage, and water quality in the State of São Paulo, Brazil, were used as the external validation set. The sewage samples were collected in 2008 at the Jesus Neto sewage treatment plant.

The strains were isolated as described by Orsi et al. [23], with modifications. Samples were analyzed using the membrane filter technique with modified mTEC agar (Difco) and incubated for 2 h at 35 ± 0.5°C and 22--24 h at 44.5 ± 0.2°C. Typical colonies were streaked on EMB agar (Merck). Isolated colonies were tested for citrate utilization, lactose fermentation, oxidase, l-lysine decarboxylase, motility, glucose and sucrose fermentation, tryptophan deamination, indole production, urea hydrolysis and sulfide production. Isolates with an E. coli profile were inoculated into LB broth at 37°C overnight. One isolated colony from each EC positive sample was selected for further analyses.

Phylogenetic group determination

The phylogenetic group of each strain was determined according to Clermont et al. [19], by multiplex PCR of the genes chuA and yjaA and the DNA fragment TspE4.C2. The amplification products were separated in 2% agarose gels containing ethidium bromide [33]. After electrophoresis, the gel was photographed under UV light, and the strains were assigned to the phylogenetic groups B2 (chuA+, yjaA+), D (chuA+, yjaA-), B1 (chuA-, TspE4.C2+) or A (chuA-, TspE4.C2-).

To increase the strains discrimination, subgroups or phylotypes were determined as follows: subgroup A₀ (group A), chu A-, yja A-, TspE4.C2-; subgroup A₁ (group A), chu A-, yj aA+ TspE4.C2-; group B1, chu A-, yja A-, TspE4.C2+; subgroup B2₂ (group B2), chu A+, yja A+, TspE4.C2-; subgroup B2₃ (group B2), chu A+, yja A+, TspE4.C2+; subgroup D₁ (group D), chu A+, yja A-, TspE4.C2- and subgroup D₂ (group D), chu A+, yjA-, TspE4.C2+ [5].

Bioinformatic and statistical analysis

A graphic representation was used to map the occurrence of the genetic markers chuA, yjaA and TspE4.C2 in the E. coli strains isolated from the different hosts. For this, the genetic markers were scored as present/absent in each strain analyzed, and the graphic was drawn with the software Pajek v. 1.22 http://vlado.fmf.uni-lj.si/pub/networks/pajek/. This graphic provides a useful representation of the E. coli phylo-groups among the different hosts. It contains two sets of nodes -- genetic markers and samples -- and edges between them. An edge between two nodes means that the genetic marker was detected for a given strain.

The prevalence index (P) was calculated by dividing the number of hosts exhibiting a particular subgroup by the total number of hosts analyzed. The results were expressed as percentages [34].

The Pianka's index was calculated to evaluate the subgroup overlap between two hosts by using the formula: O = ∑p_jp_k/√∑p_j²∑p_k², where pj and pk are subgroups proportions in the hosts j and k, respectively. The results were expressed as percentages [35].

The Chi-square test, the Simpson's diversity index and the Shannon's index were performed with the BioEstat v. 5.0 software [36], using the phylogenetic subgroup data. The EcoSim software [24] was used to test the differences among the diversity indexes by using resampling. The frequencies of phylogenetic groups, subgroups and genetic markers were compared among the hosts by using the CA, which was performed by using STATISTICA 6.0 [37]. The sewage sample was used to challenge the CA models as an external validation sample.

The classifier tools Binary Logistic Regression (BLR) and Partial Least Saquares -- Discriminant Analysis (PLS-DA) were performed with the software TANAGRA 1.4 [38]. For these analyses, the hosts were separated into humans and non-humans, human and non-human mammals, omnivorous and herbivorous mammals. The genetic markers were scored as present/absent. The cross-validation of these analyses was carried out by using five repetitions and ten fold parameters, and the train-test was carried out using 70% of the samples as a training set and ten repetitions of assessment.

References

Field KG, Samadpour M: Fecal source tracking, the indicator paradigm, and managing water quality. Water Research. 2007, 41: 3517-3538. 10.1016/j.watres.2007.06.056.
Article CAS PubMed Google Scholar
United States Environmental Protection Agency: Microbial source tracking guide document. EPA/600/R-05/064. 2005, U.S. Environmental Protection Agency
Google Scholar
Meays CL, Broersma K, Nordin R, Mazumder A: Source tracking fecal bacteria in water: a critical review of current methods. J Environ Manage. 2004, 73: 71-79. 10.1016/j.jenvman.2004.06.001.
Article PubMed Google Scholar
Clermont O, Lescat M, O'Brien CL, Gordon DM, Tenaillon O, Denamur E: Evidence for a human-specific Escherichia coli clone. Environ Microbiol. 2008, 10: 1000-1006. 10.1111/j.1462-2920.2007.01520.x.
Article CAS PubMed Google Scholar
Escobar-Páramo P, Le Menac'h A, Le Gall T, Amorin C, Gouriou S, Picard B, Skurnik D, Denamur E: Identification of forces shaping the commensal Escherichia coli genetic structure by comparing animal and human isolates. Environ Microbiol. 2006, 8: 1975-1984. 10.1111/j.1462-2920.2006.01077.x.
Article PubMed Google Scholar
Herzer PJ, Inouye S, Inouye M, Whittan TS: Phylogenetic distribution of branched RNS-linked multicopy single-stranded DNA among natural isolates of Escherichia coli. J Bacteriol. 1990, 172: 6175-6181.
PubMed Central CAS PubMed Google Scholar
Ochman H, Selander RK: Standard reference strains of Escherichia coli from natural populations. J Bacteriol. 1984, 157: 690-693.
PubMed Central CAS PubMed Google Scholar
Selander RK, Caugant DA, Ochman H, Musser M, Gilmour MN, Whittam TS: Methods of multilocus enzyme electrophoresis for bacterial population genetics and systematics. Appl Environ Microbiol. 1986, 51: 873-884.
PubMed Central CAS PubMed Google Scholar
Lecointre G, Rachdi L, Darlu P, Denamur E: Escherichia coli molecular phylogeny using the incongruence length difference test. Mol Biol Evol. 1998, 15: 1685-1695.
Article CAS PubMed Google Scholar
Gordon DM, Cowling A: The distribution and genetic structure of Escherichia coli in Australian vertebrates: host and geographic effects. Microbiology. 2003, 149: 3575-3586. 10.1099/mic.0.26486-0.
Article CAS PubMed Google Scholar
Gordon DM: The influence of ecological factors on the distribution and genetic structure of Escherichia coli. In Escherichia coli and Salmonella typhimurium. American Society for Microbiology. 2004,http://www.ecosal.org/ecosal/index.jsp
Google Scholar
Walk ST, Alm EW, Calhoun LM, Mladonicky JM, Whittam TS: Genetic diversity and population structure of Escherichia coli isolated from freshwater beaches. Environ Microbiol. 2007, 9: 2274-2288. 10.1111/j.1462-2920.2007.01341.x.
Article PubMed Google Scholar
Bergthorsson U, Ochman H: Distribution of chromosome length variation in natural isolates of Escherichia coli. Mol Biol Evol. 1998, 15: 6-16.
Article CAS PubMed Google Scholar
Johnson JR, Delavari P, Kuskowski M, Stell AL: Phylogenetic distribution of extraintestinal virulence-associated traits in Escherichia coli. J Infect Dis. 2001, 183: 78-88. 10.1086/317656.
Article CAS PubMed Google Scholar
Johnson JR, Stell AL: Extended virulence genotypes of Escherichia coli strains from patients with urosepsis in relation to phylogeny and host compromise. J Infect Dis. 2000, 181: 261-272. 10.1086/315217.
Article CAS PubMed Google Scholar
Picard B, Garcia JS, Gouriou S, Duriez P, Brahimi N, Bingen E, Elion J, Denamur E: The link between phylogeny and virulence in Escherichia coli extraintestinal infection. Infect Immun. 1999, 67: 546-553.
PubMed Central CAS PubMed Google Scholar
Bingen E, Picard B, Brahimi N, Mathy S, Desjardins P, Elion J, Denamur E: Phylogenetic analysis of Escherichia coli strains causing neonatal meningitis suggests horizontal gene transfer from a predominant pool of highly virulent B2 group strain. J Infect Dis. 1998, 177: 642-650. 10.1086/514217.
Article CAS PubMed Google Scholar
Pupo GM, Karaolis DKR, Lan R, Reeves PR: Evolutionary relationships among pathogenic and nonpathogenic Escherichia coli strains inferred from multilocus enzyme electrophoresis and mdh sequence studies. Infect Immun. 1997, 65: 2685-2692.
PubMed Central CAS PubMed Google Scholar
Clermont O, Bonacorsi S, Bingen E: Rapid and simple determination of the Escherichia coli phylogenetic group. Appl Environ Microbiol. 2000, 66: 4555-4558. 10.1128/AEM.66.10.4555-4558.2000.
Article PubMed Central CAS PubMed Google Scholar
Baldy-Chudzik K, Mackiewics P, Stosik M: Phylogenetic background, virulence gene profiles, and genomic diversity in commensal Escherichia coli isolated from ten mammal species living in one zoo. Vet Microbiol. 2008, 131: 173-184. 10.1016/j.vetmic.2008.02.019.
Article CAS PubMed Google Scholar
Pielou EC: Ecological diversity. 1975, New York Wiley, 165-
Google Scholar
Simpson EH: Measurement of diversity. Nature. 1949, 163: 688-10.1038/163688a0.
Article Google Scholar
Orsi RH, Stoppe NC, Sato MIZ, Prado PL, Ottoboni LMM: Phylogenetic group distribution among Escherichia coli isolated from rivers in São Paulo State, Brazil. World J Microbiol Biotechnol. 2008, 24: 1573-1577. 10.1007/s11274-007-9644-8.
Article Google Scholar
Gotelli NJ, Entsminger GL: EcoSim: Null models software for ecology. Version 5.0. 2000, Acquired Intelligence Inc. & Kesey-Bear,http://homepages.together.net/~gentsmin/ecosim.htm
Google Scholar
Apajalahti j: Comparative gut microflora, metabolic challenges, and potential opportunities. J Appl Poult Res. 2005, 14: 444-453.
Article CAS Google Scholar
Sobieszczańska BM: Distribution of genes encoding iron uptake systems among enteroaggregative Escherichia coli strains isolated from adults with irritable bowel syndrome. Clin Microbiol Infect. 2008, 14: 1083-1086. 10.1111/j.1469-0691.2008.02093.x.
Article PubMed Google Scholar
Boyd EF, Hartl DL: Chromosomal regions specific to pathogenic isolates of Escherichia coli have a phylogenetically clustered distribution. J Bacteriol. 1998, 180: 1159-1165.
PubMed Central CAS PubMed Google Scholar
Le Gall T, Clermont O, Gouriou S, Picard B, Nassif X, Denamur E, Tenaillon O: Extraintestinal virulence is a coincidental by-product of commensalism in B2 phylogenetic group Escherichia coli strains. Mol Biol Evol. 2007, 24: 2373-2384. 10.1093/molbev/msm172.
Article CAS PubMed Google Scholar
Bidet P, Mariani-Kurkdjian P, Grimont F, Brahimi N, Courroux C, Grimont P, Bingen E: Characterization of Escherichia coli O157: H7 isolates causing haemolytic uraemic syndrome in France. J Med Microbiol. 2005, 54: 71-75. 10.1099/jmm.0.45841-0.
Article CAS PubMed Google Scholar
Hassan WM, Ellender RD, Wang SY: Fidelity of bacterial source tracking: Escherichia coli vs Enterococcus spp and minimizing assignment of isolates from nonlibrary sources. J Appl Microbiol. 2007, 102: 591-598. 10.1111/j.1365-2672.2006.03077.x.
CAS PubMed Google Scholar
Mohapatra B, Broersma K, Nordin R, Mazumder A: Evaluation of repetitive extragenic palindromic-PCR for discrimination of fecal Escherichia coli from humans, and different domestic- and wild-animals. Microbiol Immunol. 2007, 51: 733-740.
Article CAS PubMed Google Scholar
Gordon DM: Geographical structure and host specificity in bacteria and the implications for tracing the source of coliform contamination. Microbiology. 2001, 147: 1079-1085.
Article CAS PubMed Google Scholar
Sambrook J, Fritsch EF, Maniatis T: Molecular Cloning: a laboratory manual. 1998, N.Y., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, 2
Google Scholar
Bush AO, Lafferty KD, Lotz JM, Shostak AW: Parasitology meets ecology on its own terms: Margolis et al. revisited. J Parasitol. 1997, 83: 575-583. 10.2307/3284227.
Article CAS PubMed Google Scholar
Pianka ER: The structure of lizard communities. Ann Rev Ecol Syst. 1973, 4: 53-74. 10.1146/annurev.es.04.110173.000413.
Article Google Scholar
Ayres M, Ayres JRM, Ayres DL, Santos AS: BioEstat 4.0: Aplicações estatísticas nas áreas das ciências biológicas e médicas. 2005, Belém: Sociedade Civil Mamirauá, CNPq
Google Scholar
StatSoft Inc: Electronic Statistics Textbook. StatSoft. 2007,http://www.statsoft.com/textbook/stathome.html
Google Scholar
Rakotomalala R: TANAGRA: un logiciel gratuit pour l'enseignement et la recherché. In: Actes de EGC. 2005, 2: 697-702.
Google Scholar
Gomes TAT, Ramos SRTS, Rodrigues D, Rassi V, Toledo MRF, Vieira MAM, Gatti SV, Holck P, Griffin PM: Etiology of acute diarrhea in children 1 to 5 years old in São Paulo, Brazil. In Abstracts of the 94th General Meeting of the American Society for Microbiology. abstr. C-299. 1994, 543-
Google Scholar
Vicente HIG, Amaral LA, Cerqueira AMF: Shigatoxigenic Escherichia coli serogroup O157, O111 and O113 in feces, water and milk samples from dairy farms. Braz J Microbiol. 2005, 36: 217-222. 10.1590/S1517-83822005000300003.
Article CAS Google Scholar
Silveira WD, Ferreira A, Brocchi M, Hollanda LM, Castro AFP, Yamada AT, Lancelloti M: Biological characteristics and pathogenicity of avian Escherichia coli strains. Vet Microbiol. 2002, 85: 47-83. 10.1016/S0378-1135(01)00482-5.
Article Google Scholar

Download references

Acknowledgements

This work was supported by a grant from Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP 2007/55312-6). CC received a fellowship from FAPESP (FAPESP 2007/57025-4). LMMO received a research fellowship from Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq). The authors thank Dr. Wanderley Dias da Silveira for providing the E. coli strains from chicken feces. We are indebted to Dr. Ricardo Antunes de Azevedo for a critical reading of the manuscript.

Author information

Authors and Affiliations

Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas - UNICAMP, C. P. 6010, 13083-875, Campinas, S. P., Brasil
Camila Carlos, Nancy C Stoppe & Laura MM Ottoboni
Programa de Pós Graduação em Ecologia, Instituto de Biologia, Universidade Estadual de Campinas - UNICAMP, 13083-970, Campinas, S. P., Brasil
Mathias M Pires
Departamento de Análises Ambientais, Companhia Ambiental do Estado de São Paulo - CETESB, 05459-900, São Paulo, S. P., Brasil
Elayse M Hachich & Maria IZ Sato
Departamento de Microbiologia, Imunologia e Parasitologia, Universidade Federal de São Paulo - UNIFESP, 04023-062, São Paulo, S. P., Brasil
Tânia AT Gomes
Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista - UNESP, 14884-900, Jaboticabal, S. P., Brasil
Luiz A Amaral

Authors

Camila Carlos
View author publications
You can also search for this author in PubMed Google Scholar
Mathias M Pires
View author publications
You can also search for this author in PubMed Google Scholar
Nancy C Stoppe
View author publications
You can also search for this author in PubMed Google Scholar
Elayse M Hachich
View author publications
You can also search for this author in PubMed Google Scholar
Maria IZ Sato
View author publications
You can also search for this author in PubMed Google Scholar
Tânia AT Gomes
View author publications
You can also search for this author in PubMed Google Scholar
Luiz A Amaral
View author publications
You can also search for this author in PubMed Google Scholar
Laura MM Ottoboni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Laura MM Ottoboni.

Additional information

Authors' contributions

CC and LMMO conceived and designed the study. CC performed the experiments, the statistical analysis and wrote the manuscript. MMP performed the bioinformatic analysis. NCS, TATG and LAA isolated the majority of the E. coli strains used in the work. MIZS and EMH participated in the discussion of the experimental results. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Carlos, C., Pires, M.M., Stoppe, N.C. et al. Escherichia coli phylogenetic group determination and its application in the identification of the major animal source of fecal contamination. BMC Microbiol 10, 161 (2010). https://doi.org/10.1186/1471-2180-10-161

Download citation

Received: 03 February 2010
Accepted: 01 June 2010
Published: 01 June 2010
DOI: https://doi.org/10.1186/1471-2180-10-161

Escherichia coli phylogenetic group determination and its application in the identification of the major animal source of fecal contamination