Abstract
Buffalo is an important livestock species. Here, we present a comprehensive metagenomic survey of the microbial communities along the buffalo digestive tract. We analysed 695 samples covering eight different sites in three compartments (four-chambered stomach, intestine, and rectum). We mapped ~85% of the raw sequence reads to 4,960 strain-level metagenome-assembled genomes (MAGs) and 3,255 species-level MAGs, 90% of which appear to correspond to new species. In addition, we annotated over 5.8 million nonredundant proteins from the MAGs. In comparison with the rumen microbiome of cattle, the buffalo microbiota seems to present greater potential for fibre degradation and less potential for methane production. Our catalogue of microbial genomes and the encoded proteins provides insights into microbial functions and interactions at distinct sites along the buffalo digestive tract.
Similar content being viewed by others
Introduction
Buffalo (Bubalus bubalis, also known as water buffalo) is a globally important domestic animal of immense value to humans, providing economic value from milk, meat, and leather production and draft power1. The worldwide estimated population of more than 200 million buffaloes is relied upon by more than two billion people—a greater number than for any other domesticated animals2. Buffaloes are even-toed, hoofed mammals of the Bovidae family, genus Bubalus, and tribe Bovini. There are two subspecies of the domesticated water buffalo: swamp buffalo (Bubalus bubalis carabanesis, 2 N = 48) and river buffalo (Bubalus bubalis bubalis, 2 N = 50)3,4.
The distribution of swamp buffaloes overlaps closely with that of rice agriculture in East and Southeast Asian countries (e.g. China, Vietnam, and Thailand), where they have served as the primary draft animals for rice cultivation for thousands of years5. Buffaloes are herbivores, and most of their diet is composed of plant fibre such as forage grasses. They possess a characteristic four-chambered stomach (FC stomach, including the rumen, reticulum, omasum, and abomasum for short) that is specialized for transforming the forage with low nutritional value into high-quality animal protein and adapting to the life cycle of forage grass6. The digestibility of organic matter was greatest by buffaloes is greatest when the animals are fed hay7.
The importance of the microbiota of different sites in the digestive tract is now well recognized and has been linked to the functions and physiology of the corresponding DT sites8. For example, the rumen, the first and most important stomach site in ruminants, contains abundant bacteria capable of digesting cellulose (the main component of plant cell walls), including Fibrobacter9,10,11,12, Ruminococcus13,14, and Butyrivibrio15,16 bacteria; similarly, Prevotella, a group of bacteria capable of degrading noncellulose plant fibres, is abundant in the rumen17,18,19,20. Archaea are also abundant in the rumen and other sites of the FC stomach; however, they are the main producers of methane, a major greenhouse gas21,22,23 and are thus targets for elimination24. In addition to their roles in food digestion and nutrient absorption, the rumen and gut microbiota have been linked to more pronounced phenotypes, such as the milk production and quality of cattle25,26. To date, research on the microbial ecology along the DT of ruminants has mostly focused on the rumen27,28,29,30,31,32,33,34,35,36, along with a few studies on the gut/faeces25,37, while other sites have been largely overlooked, despite accumulating evidence supporting their important roles. For example, in humans, the gut microbiota has been linked to many aspects of human life, including health38,39 and diseases40,41,42,43,44, development45,46, responses to drugs and treatments47,48,49. In addition, recent studies on the buffalo DT microbiota have mostly used 16 S rRNA gene sequencing35, while whole-genome metagenomics has been applied to only a few samples25.
The application of metagenomic next-generation sequencing (mNGS) has greatly facilitated the reconstruction of large numbers of metagenome-assembled genomes (MAGs) in model organisms, including cows30,36,50,51,52, goats37, pigs53,54, mice55,56, and chickens57,58,59,60, and revealed their associations with the health and diseases of their hosts61,62. In this study, we set out to fill the gaps in ruminant research by performing a comprehensive survey of the microbial ecology of the buffalo’s DT. We collected in total 695 samples from eight DT sites in three sections: the FC stomach, intestine, and rectum. We performed metagenomic sequencing and obtained 4960 MAGs, which covered in total 85% of the raw sequencing reads. We identified known interactions between microbes and the rumen, where particular enrichment of fibre-digesting and methane-producing microbes was observed, in addition to novel and important interactions in other FC stomach sites and DT sections. We annotated 5,862,748 non-redundant proteins from the MAGs, many of which showed differences in abundances and were related to site-specific functions. Thus, our taxonomic and functional characterization of the microbial of the buffalo’s DT provides insight into microbiota functions and interactions with distinct DT sites and will contribute to overcoming the barriers to manipulating the gastrointestinal microbiota to improve animal productivity in buffalo as well as the related ruminant animals.
Results
Generation and quality assessment of MAGs along the digestive tract of buffalo
To provide a comprehensive collection of microbes associated with the digestive systems of buffalo, we collected in total 695 samples from different sites along with the DTs of buffaloes from six provinces of China (Supplementary Data 1), including 211 samples from four distinct sites of the FC stomach (rumen, reticulum, omasum, and abomasum), 85 samples from three sites in the intestine (caecum, colon, and jejunum) and 399 samples from the rectum (faeces) (Fig. 1a). The buffaloes varied in terms of their breeds (three main breeds: river buffalo, swamp buffalo, and hybrid buffalo), developmental stages (calf, breeding, and adult), and sexes (oxen and cows) (Supplementary Data 1).
We subjected the 695 samples to mNGS using the Illumina NovaSeq 6000 platform for paired-ended sequencing with a read-length of 150 bp. After removing vector and low-quality sequences and contamination from the host and food genomic sequences, we obtained a total of 11 Tb of clean data for further analyses (see Methods for details). On average, we obtained 41,842,231 pairs of clean reads and 6,244,074,222 bases from each sample.
To obtain metagenome-assembled genomes (MAGs), we adopted a customized bioinformatics analysis workflow (see Supplementary Fig. 1 for a graphical representation; Methods). Briefly, the clean reads were assembled by using metaSPAdes63 and MEGAHIT;64 the resulting 109,471,448 contigs were grouped into 58,094 bins using metaBAT265 with the default parameters. All bins were aggregated and dereplicated using dRep66 (v.2.3.2) with the default overlap threshold (-nc 0.1) and were clustered into strain-level and species-level genome bins at 99% and 95% of the average nucleotide identity (ANI), respectively, followed by the application of CheckM67 (v.1.0.18) for quality assessment. Finally, we obtained a non-redundant set of 4960 bins (MAGs) at the strain level with completeness ≥80% and contamination ≤10% and 3255 MAGs at the species level. Rarefaction analysis indicated that the curves could reach a plateau when using samples from the FC stomach or rectum or from sites all combined, although not when using intestinal samples, as fewer of these samples were available (Fig. 1b).
Among the resulting MAGs at the strain level, 1575 (31.75%) were high-quality draft genomes as defined by Bowers et al.68 with ≥90% completeness, ≤5% contamination, and the inclusion of the 23 S, 16 S, and 5 S rRNA genes and at least 18 tRNAs, while 3228 (65.08%) met the score criterion defined by Parks et al.69 (completeness − (5 × contamination) ≥50) (Fig. 1c, d). The sizes of the MAGs ranged from 402 kilobases (Kb) to 6.1 megabases (Mb), with a median of 2.1 Mb (Fig. 1f). Moreover, a similar proportion of the quality assessment results (e.g. 1214 (37.3%) and 1951 (59.9%)) corresponded to high- and medium-quality MAGs (Supplementary Fig. 2a, b), and the approximate distributions of N50 values, genome sizes, and the numbers of contigs per genome were obtained among the species-level MAGs (Supplementary Fig. 2c–e). Compared with the complete genomes in the NCBI RefSeq prokaryotic genome database (457 archaeal and 28,011 bacterial genomes at the complete and chromosomal assembly levels; downloaded as of June 28, 2021), our strain-level MAGs were indeed shorter (Supplementary Fig. 3a). However, due to the lack of representativeness of buffalo microbial genomes in public databases, the differences could in part be due to the distinct characteristics of buffalo MAGs. In addition, we believe that the completeness as measured by the presence of key genes is a better index than genome sizes since the latter parameter varies significantly even within species. Hence, we also compared the completeness and genome sizes of the buffalo MAGs with those of 4941 cattle rumen MAGs from Stewart et al.30, and we found that although the buffalo MAGs were also shorter than the cattle rumen MAGs (Supplementary Fig. 3b), the former presented significantly higher completeness values than the latter (Supplementary Fig. 3c). These results may suggest that buffalo-associated microbes tend to have compact genomes; however, further validation is needed to verify this conclusion.
The MAGs contained 1 to 679 contigs, with a median of 102. The contig sizes of the MAGs ranged from 2.5 Kb to 1.5 Mb, with N50 values (50% of assembled bases in contigs larger than the N50 value) ranging from 5.4 kb to 1.4 Mb (Fig. 1e, g). To further access the quality of the resulting MAGs, we annotated tRNA and 16 S rRNA genes encoded by these MAGs. All MAGs encoded tRNA genes with an average of 18.6 tRNA genes per MAG; in total, 3899 of the strain-level MAGs and 3233 of the species-level MAGs encoded at least 18 tRNAs, indicating the high quality of the MAGs. However, according to the strain- and species-level MAGs, respectively, only 285 and 210 MAGs encoded at least one complete 16 S rRNA gene, while an additional 394 and 314 MAGs contained a partial 16 S rRNA gene (Supplementary Data 2); these results are consistent those of Stewart et al.30.
To determine whether our MAGs could improve the coverage of microbial genomes associated with the buffalo DT relative to NCBI RefSeq genomes and other public datasets, we mapped all the metagenomic clean reads to the resulting strain-level MAGs and compared the overall mapping rates against the MAGs of selected organisms (human70, chicken57, pig53, and cattle30) and reference microbial genomes from public databases (NCBI RefSeq genomes plus Hungate collection genomes71, referred to as the BFAP dataset). A high mapping rate indicates that the majority of the microbial genomes in a sample can be represented by the target dataset, while a lower mapping rate indicates the existence of novel genomes in the sample that is not covered by the target dataset. This strategy has been extensively used in metagenomic analyses30,72. As shown in Fig. 1h, the buffalo raw reads showed the highest mapping rate to our MAGs with an average of 85%, followed by the cattle Rumen Uncultured Genomes (RUGs) of Stewart et al.30, the BFAP dataset, and the MAGs of humans, pigs, and chickens.
Our FC stomach samples showed lower mapping rates to the reference genomes and MAGs than the samples from the intestines and rectum (Fig. 1h), Similar results could be observed in gut microbial datasets from humans, pigs, and chickens. However, the mapping rates against the cattle dataset were an exception because this dataset consisted of MAGs from rumen metagenomes30. These results were consistent with the fact that FC stomach metagenomes are less well in public databases, and most metagenomic studies have focused on the gut. The overall mapping rates of the FC stomach samples against cattle and buffalo MAGs were close (Fig. 1h, orange boxes), suggesting that their FC stomach metagenomes were similar; however, the mapping rates of the buffalo intestine and rectum samples against cattle RUGs were significantly lower (less than 65%), suggesting that gut metagenomes were underrepresented in Stewart et al. ‘s data30. Since the cattle and buffalo are closely related, we also mapped raw sequencing reads of cattle metagenomes obtained from30 from our MAGs. On average, 71% of the cattle reads could be mapped to our MAGs, compared to 82.5% mapped to cattle RUGs (Supplementary Fig. 4).
Together, we obtained a total of 4960 strain-level MAGs and 3255 species-level MAGs indicating potential new strains and species, respectively, which significantly improved the coverage of the microbes in the buffalo digestive tract, especially those in sections of the intestine and rectum.
Taxonomic annotation of buffalo MAGs
We next assigned taxonomic classifications to the MAGs using the Genome Taxonomy Databasse Toolkit (GTDB-TK)73. Almost of the MAGs could be classified into known taxonomic categories at the higher levels, such as the kingdom, phylum, class, and order levels; however, at more refined levels especially the species level, only 460 (9.3%) of the strain-level MAGs and 415 (12.7%) of the species-level MAGs could be classified as known species (Fig. 2a and Supplementary Fig. 2f), indicating that most of the MAGs were novel (i.e. not present in GTDB-TK). Bacteria and archaea showed similar classification results among the strain- and species-level MAGs (Fig. 2b, c and Supplementary Fig. 2g). For the following analysis, we used the strain-level MAGs unless stated otherwise.
As shown in Fig. 2, the dominant phyla according to the numbers of MAGs that could be assigned were Bacteroidota (n = 2200) and Firmicutes_A (n = 1736) (dominated by classes Bacteroidia and Clostridia), followed by Verrucomicobiota (n = 191), Firmicutes (n = 172), Spirochaetota (n = 164), and Proteobacteria (n = 147). The dominant orders included Bacteroidales (n = 2194) and Oscillospirales (n = 1101), while the dominant families included Bacteroidaceae (n = 676), CAG-272 (n = 588), Rikenellaceae (n = 483), and UBA932 (n = 368). At the genus level, the dominant groups included Alistipes, RC9, and Prevotella (Supplementary Fig. 5). Seventy-four archaeal MAGs belonging to three phyla, Halobacterota (n = 42), Thermoplasmatota (n = 20), and Euryarchaeota (n = 12), were identified. The top three genera included Methanocorpusculum (n = 27), Methanomicrobium (n = 13), and Methanobrevibacter_A (n = 11), all of which are methanogens.
Taxonomic characteristics of MAGs in different sections along the digestive tract
To investigate the distributions of the 4960 MAGs in different sections along with the DT and their putative interactions with the host, we first determined the coverage and relative abundances of all the MAGs in each sample. We mapped the clean reads of a sample to all MAGs and calculated the coverage of a MAG as the total aligned bases divided by the total bases of the MAG74, and the relative abundance of a MAG as the percentage of reads mapped to the MAG among the total reads mapped to all MAGs (see Methods for details).
Using a coverage 1X coverage as the cut-off of presence/absence, we found that 3032, 3081, and 4141 MAGs were present in at least one sample from the three sections (Fig. 3a). According to this criterion, we found that 1692 (34% out 4960) MAGs were present in all three sections, 1910 MAGs (38.5%) were present in samples from two sections, and only a small proportion of the MAGs was section specific (499 in FC stomach and 859 in rectum; Fig. 3a). We did not find intestine-specific MAGs according to this criterion, likely because the intestine is positioned between the FC stomach and rectum (Figs. 1a and 3a). At the phylum level, five phyla were FC stomach specific, among which three (UBP3, UBP6, and Eremiobacterota) were designated candidatus, indicating that thet belonged to unknown phyla. Regarding the other two phyla, Euryarchaeota consists of species related to methane metabolism, while Synergistota consists of oral bacteria in humans, was first identified in the goat rumen, and is usually associated with toxic compound degradation75. The main rectum-specific was Verrucomicrobiota, which is found in soil, water, and faeces76.
We further analyzed the relative abundances of the MAGs along with the DT and found significantly different alpha diversities among all sections, measured according to both the Shannon and Simpson indexes (Supplementary Fig. 6). The rectum presented the highest diversity, followed by the FC stomach and intestine (Supplementary Fig. 6), which was partially consistent with the numbers of MAGs identified in the three sections (Fig. 3a). We then applied principal coordinates analysis (PCoA) to reveal the Bray-Curtis distances among the samples and found that the overall microbial profiles of the intestine and rectum were similar (R2 = 0.048; P = 0.001, pairwise nonparametric multiple analysis of variance (MANOVA)), while both were significantly different from that of the FC stomach (R2 = 0.192, 0.270, P = 0.001, 0.001, pairwise nonparametric MANOVA; Supplementary Fig. 7a). Interestingly, samples from the jejunum of the intestine formed their own cluster and were distant from the samples from other DT sites (Supplementary Fig. 7b), indicating a distinctive microbial structure at the site; this was also consistent with the alpha diversity of the jejunum, which was different from those of other parts of the DT (Supplementary Fig. 6).
To maximize the discovery of DT-associated microbial genomes, we collected samples from animals that differed in their geographical locations, breeds, sexes, and developmental stages, which are factors known to affect the microbial compositions. We thus also investigated the effects of these host and environmental factors on the microbiota using permutational multivariate analysis of variance (PERMANOVA) implemented in the R package ‘vegan’. As expected, the DT sections exerted the strongest effects in both the single- and multifactor analyses, followed by geography, breed, developmental stage, and sex (Fig. 3b). Since the sampling strategy applied in this study was optimized to compare microbial compositions across DT sections, we focused on the comparative analysis between DT sections and left the comparisons of other factors for future studies.
Distinctive patterns of MAGs along the digestive tract coincide with their functions
As shown in Fig. 3e, the overall microbial structures of the FC stomach, intestine, and rectum were different from each other (Fig. 3e), with Firmicutes and Bacteroidota being the two most abundant phyla. Firmicutes_all, including Firmicutes, Firmicutes_A, Firmicutes_B, and Firmicutes_C according to GTDB, accounted for 79%, 85%, and 90% of the total microbial abundances in three DT sections on average. Interestingly, Bacteroidota showed a decrease in abundances along the digestive tract (Fig. 4a), while Firmicutes_all showed the opposite pattern (Fig. 4b). Consequently, the Firmicutes to Bacteroidota ratio (F/B ratio, Fig. 4c) was lowest in the FC stomach and highest in the rectum. Previous results have linked an increased F/B ratio with an increased capacity for harvesting energy from the diet77,78, consistent with the physiological roles of the three sections. In addition, the F/B ratio in the rumen has been linked to the milk fat yield in cows79,80,81.
The decreasing abundances of Bacteroidota along the DT were partly due to lower counts of Prevotella, the main genus of Bacteroidota that was highly abundant in the FC stomach, especially in the rumen, whereas its counts were significantly lower in other sections and DT sites (Fig. 4h); this genus accounted for 33.1% of the total microbial abundances in the rumen. Prevotella species are associated with noncellulose plant fibre degradation and are the largest single bacterial group reported in the rumen of cattle and sheep under most dietary regimes82. Most of the species in this genus are unclassified, including the most abundant species (Supplementary Fig. 8). In addition to the role of Prevotella species in plant degradation, they also played an important role in preventing ruminal acidosis82,83,84.
Cellulose is the main component of the cell wall of plants, and its degradation by DT-associated microbes is crucial in buffalo. Fibrobacter, Ruminococcus, and Butyrivibrio85 have generally been considered as the main microbes responsible for cellulose digestion (the substrates of Buyrivibrio are diverse, including cellulose, hemicellulose, and proteins). We thus explored their distributions along with the DT. As expected, Fibrobacter_all (including Fibrobacter and Fibrobacter_A according to GTDB; Fig. 4e), Ruminococcus_all (including Ruminococcus_E, Ruminococcus_A, and Ruminococcus according to GTDB; Fig. 4f), and Butyrivibrio_all (including Butyrivibrio_A and Butyrivibrio according to GTDB; Fig. 4g) were all significantly abundant in the FC stomach86,87 and were less abundant in other DT sites (Fig. 4e–g). Among these taxa, Fibrobacter_all was the most abundant. Our results showed that the total abundances of Fibrobacter_all were significantly higher in all four FC stomach sites (i.e. rumen, reticulum, omasum, and abomasum) than in the other two sections; however, we found the highest abundance of Fibrobacter_all in the omasum rather than in the rumen (widely believed to the site with the highest abundance of this group), indicating that the omasum may play important roles in cellulolytic digestion.
All of the archaeal species that we identified were methanogens, and they were highly abundant in the FC stomach and intestine (Fig. 4d). These results contradicted the current understanding that only the FC stomach alone, especially the rumen, is the main organ of methane metabolism88 and highlight the importance of the intestine in methane metabolism. Strikingly, close examination revealed that the total abundance of these methanogens peaked in the omasum (Fig. 4d), coinciding with Fibrobacter_all (Fig. 4e). Interestingly, archaea and Fibrobacter_all showed significant correlations in their overall abundances in both the FC stomach and intestine (Fig. 4i). In addition, we performed correlation analysis between all bacterial genera and archaea (Supplementary Data 3), and found that Fibrobacter_all showed the highest correlation ecoefficiency with the methanogens (Supplementary Fig. 9). These results strongly suggested that Fibrobacter_all may play important roles in methane production. Previous studies showed that methanogens in the rumen potentially affect the metabolism of microbes that degrade cellulose by utilizing hydrogen, and our results suggested that they play similar roles in the omasum. Together, these results highlighted the important roles of the omasum in both methane metabolism and cellulose degradation and the possible functional link between these two processes89.
In total, we identified 359 differentially abundant taxa at the genus level using the linear discriminant analysis effect size (LEfSe)90 with linear discriminant analysis (LDA) scores >2. Among these taxa, 172, 42, and 145 showed significantly higher abundances in the FC stomach, intestine, and rectum, respectively (Supplementary Data 4; Supplementary Fig. 10). In addition to those mentioned above, the other differentially abundant taxa may also contribute to the physiological functions of the corresponding DT sections and should be investigated further.
Functional characteristics of MAGs in different sections along the digestive tract
We next explored the proteomic contents of the buffalo metagenome and their putative functions. We predicted in total 9,470,238 proteins based on 4960 MAGs; after clustering by CD-HIT91, we obtained a non-redundant proteome dataset of 5,862,748 proteins. We annotated these proteins by comparing the amino acid sequences with the eggNOG database92 using eggNOG-mapper93 and the CAZy94 database using dbCAN295. A total of, 4,787,680 proteins (81.7% out of total) could be annotated according to one or both methods; among which 114,989 and 4,672,691 were annotated by using CAZy and eggNOG, respectively.
Only a small proportion of the proteins (114,989; 1.96% out of total) were annotated as enzymes according to CAZy (Fig. 5a). The majority of these protenins were annotated as glycoside hydrolases (GH, 76,224, 66.29% out of 114,989), after which glycosyltransferases (GT, 21,402, 18.61%) and carbohydrate-binding modules (CBM, 11,645, 10.13%) were the most annotated groups; the remaining three categories including carbohydrate esterases (CE, 3804, 3.31%), polysaccharide lyases (PL, 1879, 1.63%) and predicted auxiliary activities (AA, 35; 0.03%), contained far fewer proteins. Overall, the annotated proteins showed decent sequence identities against the proteins in the CAZy database, with an average identity of 59.11% (Supplementary Fig. 11); however, only a small proportion (3.8% out of the total annotated proteins) showed identities higher than 90%, indicating a poor representation of our MAGs in public databases.
We then calculated the relative abundance of all proteins in each sample and compared the distributions of their abundance (see Methods). As shown in Fig. 5b–g, all six abovementioned CAZy families showed significant differences in terms of their relative abundances (the sum of the abundance of all proteins was 100%) among the three DT sections. Interestingly, all CAZy protein families except for the auxiliary activity (AA) family showed the highest abundances in the FC stomach, followed by the rectum and then the intestine, supporting the central role of the FC stomach in food digestion and processing (Fig. 5b–g); the rumen often showed the highest abundances of these CAZy families among all DT sites (except for the carbohydrate-binding module (CBM), for which the omasum displayed the highest abundance), although other FC stomach sites also contained significantly more abundant CAZy proteins than the other DT sections (Fig. 5b–g). Again, we identified the jejunum as an outlier site that contained the lowest percentages of all six CAZy families (Fig. 5b–g), likely due to its low microbial diversity (Supplementary Fig. 6).
Comparisons of rumen microbiota between buffalo and cattle
The rumen is often is considered the most important FC stomach site in ruminants. The rumen is the first section of the DT and the largest of the FC stomach sites; it governs the first steps of feedstuff degradation, and is the main site of methane production26. The microbiota of the rumen plays important role in its functioning and was the first microbiota of the FC stomach to be studied by researchers26,96. We thus compared the taxonomic and functional profiles of the buffalo rumen microbiota with those of cattle which was recently made available by Stewart et al.30. To perform a fair comparison, we obtained 4941 rumen uncultured genomes (RUGs, similar to MAGs; see ref. 30.), encoding 4,879,163 non-redundant proteins and calculated their abundances in each sample using the same methods employed in this study (Methods).
As shown in Fig. 6a, we found that buffalo and cattle showed significant differences in the two most dominant phyla (Firmicutes_all and Bacteroidota) and consequently in the F/B ratios (Fig. 6a). Bacteroidota species, especially those in the dominant genus of the phylum, Prevotella, can utilize lactate84,97,98 and are capable of degrading noncellulose plant fibres. Therefore, the higher abundances of Bacteroidota, Prevotella, and Butyrivibrio_all identified in the buffalo rumen than in that of cattle, as well as the similar levels of Fibrobacter_all (responsible for cellulolytic plant fibre digestion) between the two (Fig. 6a), suggested that buffalo is better adapted to coarse forage than cattle88. Conversely, we found significantly higher levels of Archaea and Ruminococcus_all in cattle, and these species all played important roles in methanogenesis through biohydrogenation and glycolysis (Fig. 6b), suggesting that buffalo may produce significantly less methane than cattle.
We also compared the abundance of rumen protein families (i.e. CAZy) between buffalo and cattle. Surprisingly, all six families were significantly abundant in buffalo (Fig. 6c). This result further suggested that buffalo has a greater carbohydrate metabolism capacity than cattle. In attention, comparison based on all CAZy proteins (Supplementary Fig. 12) revealed a significant abundance of GT and CBM in cattle, indicating that the rumen of cattle contained a higher proportion of enzymes related to the formation of glycosidic bonds than the rumen of buffalo.
We note that the differences observed here were likely due to the different diets of buffalo and cattle. However, due to limited dietary information available from the public dataset of cattle, we were unable to disentangle the effects of the diets on the differences in the microbial composition forom those of other factors. Future studies are thus needed for further elucidation of this issue.
Discussion
As an important livestock species, buffalo provides humans with milk, meat, leather, and draft power. Similar to other ruminants, the DT is the key to the quality and wellbeing of buffalo and shows intense interactions with microbes. However, the lack of microbial reference genomes from different sites of the DT has greatly hindered our understanding of the functional interactions between DT sites and their microbial ecology and our ability to modulate the physiology and economically important phenotypes of buffalo via the DT microbiota. More importantly, the comprehensive profiling of the methanogenic microbes that are presumably inhabited the rumen may provide us with insights into contribution to reducing the emission of methane, an important source of greenhouse gases. Recent studies on ruminant microbiota have mostly focused on the rumen, while explorations of the digestive tracts are still lacking. To fill in these gaps, we performed a comprehensive survey on the microbial ecology of the buffalo DT. We collected 695 samples from eight DT sites within three sections, including the FC stomach, intestine, and rectum. To further increase the representativeness of our study, we obtained samples from six different geographical (Guangxi, Henan, Anhui, Yunnan, Hainan, and Hubei; Supplementary Data 1, Supplementary Table 1; Supplementary Fig. 1a), three breeds (river, swamp, and hybrid buffalo; Supplementary Data 1, Supplementary Table 2), both sexes and two developmental stages (Supplementary Data 1, Supplementary Table 3).
We performed mNGS on these samples and obtained 4960 MAGs. These MAGs greatly improved the coverage of the raw sequencing reads from 64% in the public databases (the combination of reference genome datasets including BFAP genomes from NCBI RefSeq) to 85%. Taxonomic annotation revealed that all MAGs could be classified into known phyla; however, more than 90% of the MAGs were novel at the species level. Thus, our dataset represents a great expansion of the available buffalo microbiomes.
The samplings of different sites of the DT allowed us to better understand the functional associations between the DT sites and their microbial ecology. For example, we found that Firmicutes and Bacteroidota, the two most dominant phyla, showed distinct abundance patterns along with the DT: Firmicutes abundance increased along the digestive tract (Fig. 4a), while Bacteroidota showed the opposite pattern (Fig. 4b). Consequently, the Firmicutes to Bacteroidota ratio (F/B ratio, Fig. 4c) was lowest in the FC stomach and highest in the rectum. The F/B ratio is related to energy harvesting;77,78 its trend thus coincides with the physiological transition from food digestion (FC stomach) to energy harvesting (intestine) along with the DT. Our data also allowed us to validate known interactions between microbes and DT sites, including the enrichment of fibre-digesting and methane-producing microbes in the FC stomach. Surprisingly, we found that Fibrobacter, a group of cellulolytic bacteria reported to mainly colonize the rumen, showed higher relative abundance in the omasum; its distributions coincided with that of archaea, the main methane producers, whose abundances also peaked in the omasum. Archaea were also highly abundant in both the FC stomach and intestine and showed positive correlations with Fibrobacter, indicating their roles in methane production in both sections. Based on the abundance of the microbes at the phylum and genus levels, we identified several features that are abundant in the FC stomach, intestine, and rectum. These results highlighted the importance of obtaining samples from all DT sites, especially sites other than the rumen.
We also evaluated the functional capacity of the microbial ecology of the MAGs at the DT sites by annotating the protein-coding genes of the MAGs and comparing them against the CAZy and eggNOG databases. We found that all six CAZy families showed significant differences among the examined DT sections, suggesting that they play different roles associated with distinct sections of the DT. Our data showed poor sequence identity with public data, indicating the existence of previously unidentified protein sequences and, thus, novel functions encoded by the DT-associated MAGs.
The rumen is the most important section in the DT; its microbiota has recently been extensively explored in cattle. We thus took the opportunity to compare the rumen microbes between these two closely related model ruminants. Our results showed significant differences between buffalo and cattle. For example, we observed a higher abundance of microbes with the ability to degrade fibres in the buffalo rumen than in that of cattle. In addition, the relative abundance of methane-producing archaeal species in buffalo was significantly lower than that in cattle, indicating less production of methane and more complete use of energy from feedstuff.
The microbial composition is affected by both intrinsic factors, such as the DT section, sex, breeds, and developmental stages, and environmental factors, such as geographical locations. In this study, we found that all examined factors exerted significant effects on the DT microbiota of buffalo in both single- and multifactor analyses (see Materials and Methods); notably, the DT section exerted the strongest effects, followed by geography, breed, developmental stage, and sex. Since the sampling strategy applied in this study was optimized to compare the microbial compositions across DT sections, we focused on the comparative analysis between DT sections and left the comparisons of other factors for future studies.
Our catalogue of microbial genomes and their encoded proteins represents the largest effort thus far to characterize the microbial ecology of all major sections of the DT. Our study provides insight into the microbial functions and interactions with distinct DT sites, and has generated valuable resources for the community interested in microbial interventions for achieving better buffalo quality.
Methods
Sample collection
A total of 695 samples were collected for metagenome sequencing. To ensure the diversity of the samples, they were collected from three breeds (153, 430, and 112 samples from river, swamp, and hybrid buffaloes, respectively), six regions (Guangxi, Henan, Anhui, Yunnan, Hainan, and Hubei in China), two sexes (females and males, respectively) and two developmental stages (adult buffaloes and calves; see Supplementary Data 1 for details). Among the 695 samples, 497 were obtained from live buffalo, including 98 rumen samples collected via a stomach tube and 399 faecal samples collected from the rectum by using disposable soft, long-arm gloves. The remaining 198 samples were all collected from slaughtered buffalo, including 31 rumen samples, 28 reticulum samples, 32 omasum samples, 22 abomasum samples, 24 jejunum samples, 31 caecum samples, and 30 colon samples (Supplementary Data 1, Supplementary Table 4). The eight sampled DT sites were divided into three sections: the FC stomach (rumen, reticulum, omasum, and abomasum), intestine (jejunum, caecum, and colon), and rectum (see also Fig. 1). All samples were immediately frozen after collection in liquid nitrogen and stored at −80 °C until DNA extraction. Most samples were subjected to DNA extraction within a week after collection.
DNA extraction, library construction, and metagenomics sequencing
Three grams of each sample was employed for DNA extraction. DNA was extracted via a bead-beating method using a mini-bead beater (Biospec Products; Bartlesville, USA), followed by phenol-chloroform extraction. The solution was precipitated with ethanol, and the pellets were suspended in 50 μL of Tris-EDTA buffer. DNA was quantified using a NanoPhotometer® (IMPLEN, CA, USA) following staining using a Qubit® 2.0 Fluorometer (Life Technologies, CA, USA). DNA samples were stored at −80 °C until further processing.
Library preparation was performed according to the TruSeq DNA Sample Preparation Guide (Illumina, 15026486 Rev. C) method and procedure using 500 ng DNA as the template. Qualified libraries were selected for paired-ended sequencing on the Illumina NovaSeq 6000 platform with a read-length of 150 base pairs (PE150).
Quality control and removal of host- and food-associated genomes
Paired-end raw sequencing reads were first trimmed by using Trimmomatic99 (v.0.35) to remove vectors and low-quality bases. Sequences longer than 110 bases with an average base quality greater than 30 after trimming were retained for further analysis. To remove possible reads from the host or food, genomic sequences of buffalo100, Glycine max101, Zea Mays102, and Medicago truncatula103 were downloaded from NCBI and used as references in Bowtie2104(v.2.3.3) analysis with the options ‘-p 10 --very-sensitive’. Reads that aligned concordantly to references were removed as contamination. Thus 20.3% of the bases were removed on average (Supplementary Data 5, 6). The remaining “clean reads” were used for further analysis.
Generation and quality assessment of metagenomics-assembled genomes (MAGs)
MEGAHIT (v.1.2.8) and metaSPAdes (v.3.13.0) were used for single-sample assembly. MEGAHIT automatically determines the best k-mer value; however, metaSPAdes allows users to manually select a range of k-mers. To test the best k-mer parameter, we randomly selected 16 samples and performed their assembly using metaSPAdes with seven different k-mers ranging from 21 to 141 with a step size of 20. We then used two quality metrics, N50 and assembled reads, to test the assembly results. For each of the tested samples, we selected two sets of resulting scaffolds, one of which showed the best N50 (referred to as “Kmer_N50), while the other showed the greatest number of assembled reads (Kmer_MRate). Because modern assemblers including metaSPAdes, do not report the number of assembled reads, we used BWA-MEM105 (v.0.7.15) to align the reads to the scaffolds and calculated the mapping rate. As shown in Figure S11, we found significantly higher proportion of assembled reads (mapping rates) in the Kmer MRate group than in the Kmer_N50 group; this was expected because the Kmer_MRate group was optimized for higher mapping rates. For comparison, we also used MetaBAT265 (2.12.1) to group the scaffolds into bins using the generated scaffolds for each sample and calculated the mapped rate of the reads to the bins. We did not find a significant difference in the mapping rates between the two groups (Supplementary Fig. 13 and Supplementary Data 7). Although the Kmer_MRate method assembled more reads than the Kmer_N50 method at the scaffold level, the two did not show significant differences at the scaffold level. Furthermore, we tested the effects of the quality metrics (Kmer_MRate vs. Kmer_N50) on the assembled bins. All bins were aggregated and dereplicated using dRep66 (v.2.3.2), followed by CheckM (v.1.0.18) for quality assessment. When the -sa = 0.99 option of dRep was used (i.e. the MAGs were dereplicated at the strain level), we obtained more MAGs using the Kmer_MRate method; however, the Kmer_MRate method generated significantly fewer high-quality MAGs (69 vs. 85 under the Kmer_N50 method; Supplementary Fig. 14a). Additionally, the MAGs generated by Kmer_MRate presented significantly shorter N50s (Supplementary Fig. 14B) and genome lengths (Supplementary Fig. 14C), and were significantly fragmented (Supplementary Fig. 14D). We found similar results at the species level (i.e. MAGs dereplicated using –sa = 0.95). In addition, the Kmer_MRate required an additional 60 CPU hours per sample per k-mer on average to align the reads to the scaffolds; thus, we thus used the Kmer_N50 method to select the best k-mer in this study.
Then, for each sample, we compared the total length, N50, and contig numbers of the assembly results from the two assemblers, and selected the assembly with a relatively long total length and N50 as the final assembly of this sample. Coassembly was performed for each of the three sections by combining all samples as the input using only MEGAHIT because of its lower time and memory consumption relative to metaSPAdes. However, as shown in Supplementary Fig. 15, metaSPAdes could generate contigs with a longer N50 than MEGAHIT through comparison with N50 values among other measurements of assembly quality. Thus, the assembly results of the two tools were used to increase the coverage and quality of the resulting contigs (single-sample results were binned, and the resulting bins were combined).
MetaBAT265 (2.12.1) was used to group contigs into bins. First, BWA-MEM105 (v.0.7.15) was used to map reads to the contigs (MEGAHIT) and scaffolds (metaSPAdes) to obtain the depths of the contigs (MEGAHIT) or scaffolds (metaSPAdes) in each sample. The results were saved in SAM files. Then, SAMtools106 (v.1.8) was used to convert the SAM files to BAM format. Finally, MetaBAT2 was used to calculate coverage from the resulting BAM files and output the results of the bins. As a result, single-sample binning produced a total of 58,041 bins, while an additional 53 bins were obtained from coassembly binning. We referred to these bins as MAGs.
All bins were dereplicated using dRep66 (v.2.3.2) with the option ‘dereplicate_wf -p 16 -comp 80 -con 10 -str 100 -strW 0 –nc 0.1’. The ‘-nc 0.1’parameter is recommended by the authors of the dRep paper. However, we noted that different options were used in these publications(e.g. ‘-nc 0.3’ in Almeida et al.107 and ‘-nc 0.6’ in Almeida et al.70). We thus tested four different options of the ‘-nc’ parameter (i.e. -nc = 0.1, 0.2, 0.3, or 0.6) in selected samples at the strain (-sa 0.99) and species (-sa 0.95) levels. We selected four groups of samples to test the ‘-nc’ parameter. For each group, we obtained the same number of dereplicated bins regardless of the ‘-nc’ options (Supplementary Data 8). We also combined the bins from all samples and performed dereplication, and we again obtained the same number of dereplicated bins (Supplementary Data 9). Therefore, the default threshold (0.1) of the “-nc” option of dRep was used in this study. After binning, CheckM67 (v.1.0.18) was used to assess the quality of the resulting MAGs. After removing MAGs with < 80% completeness or > 10% contamination, the remaining MAGs were processed via two clustering steps to remove replicates with the default parameters. The first step involved a rapid primary algorithm (Mash, ANI = 0.9), and the second involved a more sensitive algorithm (ANI = 0.99 and 0.95). Genomes with <99% ANI belong to different strains, while those with <95% ANI belong to different species. Then, we removed MAGs larger than 10 Mb. Finally, 4960 non-redundant strain-level and 3255 species-level MAGs were obtained.
To calculate the coverage of each MAG in each sample, the clean reads of each sample were mapped to the 4960 MAGs using BWA-MEM with the default parameters. After converting the resulting SAM files to BAM format using SAMtools106 (v.1.8), BEDTools74 (v.2.27.1) was employed to calculate the coverage of the MAGs, which was defined as the total bases mapped to a MAG in a sample divided by its length.
Comparisons with reference microbial genomes and MAGs associated with model organisms
To determine whether our MAGs could improve the coverage of microbial genomes associated with the buffalo DT, MAGs of model organisms including humans70, chickens57, pigs53, and cattle30, were downloaded from their respective sources. In addition, a BFAP dataset including reference microbial genomes (bacterial, fungal, archaeal, and protozoan) was generated from the NCBI RefSeq genome database and the Hungate collection genomes71.
BWA-MEM was used to map the “clean reads” to the above datasets and our MAGs as references. The mapping rate was calculated for each sample as the percentage of clean reads mapped to each of the reference datasets.
To compare the sizes of the buffalo MAGs with public datasets, we also downloaded the NCBI RefSeq prokaryotic genomes with assembly levels of ‘chromosome’ and ‘complete’ assembly levels; as of June 28, 2021, this dataset consisted of 457 archaeal and 28,011 bacterial genomes.
Taxonomic assignments of buffalo MAGs
The taxonomic assignment of the MAGs was performed using the GTDB-TK (v.1.4.1)73 via its “classify_wf” workflow. The results were visualized in GraPhlAn (v.1.1.3))108 as a phylogenetic tree.
Annotation and functional characterization of MAG-encoded proteins
MAG-encoded proteins were predicted using Prodigal109 (v.2.6.3), tRNA genes were annotated using tRNAscan-SE (v 2.0) and 16 S rRNA genes were predicted using Barrnap (v 0.9). CD-HIT91 (v.4.8.1) was used to cluster predicted proteins with the option ‘-c 0.95 -n 10 -d 0 -M 16000 - T 8’. The resulting non-redundant proteins were searched against the CAZy database using dbCAN295 and the EggNOG database using eggnog-mapper (v.4.5)92.
Estimation of the relative abundance of MAGs and proteins in each sample
To enable the identification of differentially abundant MAGs and proteins in different sites/sections along with the DT, the relative abundance of MAGs and proteins was calculated based on their respective coverage in each sample. The coverage of each protein was calculated via the same method described aboce for the coverage of MAGs, and the relative abundance of a MAG (gene) in each sample was calculated as the percentage of the coverage of the MAGs (protein) relative to the sum of the coverage values of all MAGs (proteins).
Effects of host and environmental factors on the microbiota compositions
PERMANOVA implemented in the R package ‘vegan’ was used to investigate the effects of host and environmental factors, such as geography, DT sections, developmental stages, breed, and sex, on the microbiota composition. PERMANOVA is a permutation-based extension of multivariate analysis of variance to a matrix of pairwise distances that partitions within-group and between-group distances to permit the assessment of the effect of an exposure or intervention (grouping factor) on the sampled microbiome110. We performed both single- (i.e. consider one factor at a time) and multifactor (i.e. considering all factors at the same time) analyses.
Identification of differential taxa between groups
To identify microbes that showed significant differences in abundances between sample groups of interest, we first calculated the relative redundancy of the MAGs in all samples. We mapped the clean reads of a sample to all MAGs and calculated the relative abundance of a MAG as the percentage of reads mapped to the MAG among the total reads mapped to all MAGs in the sample; thus, the total relative abundance of all MAGs in a sample is always 100%. Relative abundances at higher taxonomic levels, such as genera, families, and orders, were also determined by summing the abundances of their daughter clades according to the phylogenetic tree provided by GTDB-TK.
LDA implemented in the LEfSe tool90 was used to identify differential taxa between groups of samples. We chose those with LDA scores >2 as differential taxa. The Wilcoxon test was used to reveal the statistical significance of the relative abundances of the differential taxa between groups. Then, we selected taxa that are known to play an important role in physiological functions related to the grouping context.
Identification of genera positively related to archaea
To identify the genera that were positively related to archaea, we calculated the correlations between archaea and all other major taxa at the phylum and genus levels. The relative abundance values of all the genera of the archaea phylum were combined, and the genera that were positively related to the archaea phylum were selected based on the significance of the correlations among all the relevant results. Then, a search was performed to obtain information on the genera, and the genera, whose functional information could not be retrieved were manually removed.
Ethics approval
The investigation was approved by the Experimental Animal Ethics Committee, College of Animal Science and Technology, Guangxi University, under reference number Gxu-2021-010.
Statistics
All processed data, unless otherwise stated, were loaded into R (v.3.6.3, https://www.r-project.org/), for analyzed or visualization.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The raw sequencing data used in this study are available in the NCBI SRA database under accession code PRJNA656389. The 4960 strain-level MAG data used in this study are available in the Figshare database under accession code 17000302. The data of the 3255 species-level MAG data used in this study are available in the Figshare database under accession code 17081366. Source data are provided with this paper and these data have been made public. Source data are provided with this paper.
Code availability
The software parameters and scripts used in the analysis have been uploaded on GitHub (https://github.com/fengtong-bio/MEABDT).
References
Cockrill, W. R., Fao, R. & AGA. The husbandry and health of the domestic buffalo. Trop. Animal Health Produc. 7, 123 (1975).
Scherf, B. & Scherf, B. in World Watch List for Domestic Animal Diversity (Food and Agriculture Organization of the United Nations, 2000).
Tanaka, K., Solis, C. D., Masangkay, J. S., Maeda, K. I. & Namikawa, T. Phylogenetic relationship among all living species of the genus Bubalus based on DNA sequences of the cytochromeb gene. Biochem. Genet. 34, 443–452 (1996).
Kumar, S. et al. analyses of Indian water buffalo support a distinct genetic origin of river and swamp buffalo. Animal Genet. https://doi.org/10.1111/j.1365-2052.2007.01602.x (2007).
Cockrill, W. R., Fao, R., Campaign, S. & Aga. The buffaloes of China. XF2006131266 (Food and Agriculture Organization of the United Nations, 1978).
Lei, Y., Zhang, K., Guo, M., Li, G. & Wang, X. Exploring the spatial-temporal microbiota of compound stomachs in a pre-weaned goat model. Front. Microbiol. 9, 1846 (2018).
Batista, H. A. M., Autrey, K. M. & Von Tiesenhausen, I. M. E. V. Comparative in vitro digestibility of forages by buffalo, zebu, and holstein cattle. J. Dairy Ence 65, 746–748 (1982).
Sommer, F. & Bckhed, F. The gut microbiota—masters of host development and physiology. Nat. Rev. Microbiol. https://doi.org/10.1038/nrmicro2974.
Burnet, M. C. et al. Evaluating models of cellulose degradation by fibrobacter succinogenes S85. PLoS ONE 10, e0143809 (2015).
Raut, M. P., Couto, N., Karunakaran, E., Biggs, C. A. & Wright, P. C. Deciphering the unique cellulose degradation mechanism of the ruminal bacterium Fibrobacter succinogenes S85. Sci. Rep. 9, 16542 (2019).
Neumann, A. P. & Suen, G. The phylogenomic diversity of herbivore-associated Fibrobacter spp. is correlated to lignocellulose-degrading potential. mSphere 3, e00593–18 (2018).
Palevich, N. et al. Comparative genomics of rumen Butyrivibrio spp. uncovers a continuum of polysaccharide-degrading capabilities. Appl. Environ. Microbiol. 86, e01993–01919 (2019).
Anju, K. et al. Impact of levels of total digestible nutrients on microbiome, enzyme profile and degradation of feeds in buffalo rumen. PLos ONE 12, e0172051 (2017).
Liu, H. et al. Microbial and metabolic alterations in gut microbiota of sows during pregnancy and lactation. FASEB J. https://doi.org/10.1096/fj.201801221RR (2019).
Derakhshani, H. et al. Linking peripartal dynamics of ruminal microbiota to dietary changes and production parameters. Front. Microbiol. 7, 2143 (2017).
Vargas, J. E. et al. Effect of sunflower and marine oils on ruminal microbiota, in vitro fermentation and digesta fatty acid profile. Front. Microbiol. 8, 1124 (2017).
Chen, T. et al. Fiber-utilizing capacity varies in Prevotella- versus Bacteroides-dominated gut microbiota. Sci. Rep. 7, 2594 (2017).
Stanislawski, M. A., Dabelea, D., Lange, L. A., Wagner, B. D. & Lozupone, C. A. Gut microbiota phenotypes of obesity. npj Biofilms Microbiomes 5, 18 (2019).
De Vadder, F. et al. Microbiota-produced succinate improves glucose homeostasis via intestinal gluconeogenesis. Cell Metab. 24, 151–157 (2016).
Rubino, F. et al. Divergent functional isoforms drive niche specialisation for nutrient acquisition and use in rumen microbiome. ISME J. 11, 932–944 (2017).
Zhang, Q. et al. Bayesian modeling reveals host genetics associated with rumen microbiota jointly influence methane emission in dairy cows. ISME J. 14, 2019–2033 (2020).
Kamke, J. et al. Rumen metagenome and metatranscriptome analyses of low methane yield sheep reveals a Sharpea -enriched microbiome characterised by lactic acid formation and utilisation. Microbiome 4, 1–16 (2016).
Borrel, G., Brugère, J.-F., Gribaldo, S., Schmitz, R. A. & Moissl-Eichinger, C. The host-associated archaeome. Nat. Rev. Microbiol. 18, 622–636 (2020).
Misiukiewicz, A. et al. Review: Methanogens and methane production in the digestive systems of nonruminant farm animals. Animal 15, 100060 (2020).
Zhang, J., Xu, C., Huo, D., Hu, Q. & Peng, Q. Comparative study of the gut microbiome potentially related to milk protein in Murrah buffaloes (Bubalus bubalis) and Chinese Holstein cattle. Sci. Rep. 7, 42189 (2017).
Matthews, C. et al. The rumen microbiome: a crucial consideration when optimising milk and meat production and nitrogen utilisation efficiency. Gut Microbes 10, 115–132 (2019).
Moraïs, S. & Mizrahi, I. The road not taken: the rumen microbiome, functional groups, and community states. Trends Microbiol. 27, 538–549 (2019).
Xue, M. et al. Assessment of rumen microbiota from a large cattle cohort reveals the pan and core bacteriome contributing to varied phenotypes. Appl. Environ. Microbiol. https://doi.org/10.1128/AEM.00970-18 (2018).
Franco-Lopez, J., Duplessis, M., Bui, A., Reymond, C. & Ronholm, J. Correlations between the Composition of the Bovine Microbiota and Vitamin B 12 Abundance. mSystems 5, e00107–20 (2020).
Stewart, R. D. et al. Compendium of 4,941 rumen metagenome-assembled genomes for rumen microbiome biology and enzyme discovery. Nat. Biotechnol. 37, 953–961 (2019).
Zebeli, Q., Ghareeb, K., Humer, E., Metzler-Zebeli, B. U. & Besenfelder, U. Nutrition, rumen health and inflammation in the transition period and their role on overall health and fertility in dairy cows. Res Vet. Sci. 103, 126–136 (2015).
Denman, S. E. et al. Metagenomic analysis of the rumen microbial community following inhibition of methane formation by a halogenated methane analog. Front. Microbiol. https://doi.org/10.3389/fmicb.2015.01087 (2015).
Cremonesi, P. et al. Evaluation of the effects of different diets on microbiome diversity and fatty acid composition of rumen liquor in dairy goat. Animal 12, 1–11 (2018).
Lee, K. T. et al. Metagenomic mining and functional characterization of a novel KG51 bifunctional cellulase/hemicellulase from black goat rumen. J. Agric. Food Chem. https://doi.org/10.1021/acs.jafc.8b01449 (2018).
Henderson, G. et al. Rumen microbial community composition varies with diet and host, but a core microbiome is found across a wide geographical range. Sci. Rep. 5, 14567 (2015).
Stewart, R. D. et al. Assembly of 913 microbial genomes from metagenomic sequencing of the cow rumen. Nat. Commun. 9, 1–11 (2018).
Peng, X. et al. Genomic and functional analyses of fungal and bacterial consortia that enable lignocellulose breakdown in goat gut microbiomes. Nat. Microbiol. https://doi.org/10.1038/s41564-020-00861-0 (2021).
Arumugam, M. et al. Enterotypes of the human gut microbiome. Nature 473, 174–180 (2011).
Wu, S. et al. GMrepo: a database of curated and consistently annotated human gut metagenomes. Nucleic Acids Res. 48, D545–D553 (2020).
Pedersen, H. K. et al. Human gut microbes impact host serum metabolome and insulin sensitivity. Nature 535, 376–381 (2016).
Qin, J. et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 490, 55–60 (2012).
Li, J. et al. Gut microbiota dysbiosis contributes to the development of hypertension. Microbiome 5, 14 (2017).
Wirbel, J. et al. Meta-analysis of fecal metagenomes reveals global microbial signatures that are specific for colorectal cancer. Nat. Med. 25, 679–689 (2019).
Dai, D., Wang, T., Wu, S., Gao, N. L. & Chen, W. H. Metabolic dependencies underlie interaction patterns of gut microbiota during enteropathogenesis. Front. Microbiol. 10, 1205 (2019).
Stewart, C. J. et al. Temporal development of the gut microbiome in early childhood from the TEDDY study. Nature 562, 583–588 (2018).
Pronovost, G. N. & Hsiao, E. Y. Perinatal interactions between the microbiome. Immunity 50, 18–36 (2019).
Forslund, K. et al. Disentangling type 2 diabetes and metformin treatment signatures in the human gut microbiota. Nature 528, 262–266 (2015).
Routy, B. et al. Gut microbiome influences efficacy of PD-1-based immunotherapy against epithelial tumors. Science 359, 91–97 (2018).
Jiang, P., Lai, S., Wu, S., Zhao, X.-M. & Chen, W.-H. Host DNA contents in fecal metagenomics as a biomarker for intestinal diseases and effective treatment. BMC Genomics 21, 348 (2020).
Glendinning, L., Genc, B., Wallace, R. J. & Watson, M. Metagenomic analysis of the cow, sheep, reindeer and red deer rumen. Sci. Rep. 11, 1990 (2021).
Wilkinson, T. et al. 1200 high-quality metagenome-assembled genomes from the rumen of African cattle and their relevance in the context of sub-optimal feeding. Genome Biol. 21, 229 (2020).
Watson, M. New insights from 33,813 publicly available metagenome-assembled-genomes (MAGs) assembled from the rumen microbiome. bioRxiv https://doi.org/10.1101/2021.04.02.438222 (2021).
Xiao, L. et al. A reference gene catalogue of the pig gut microbiome. Nat. Microbiol. 1, 16161 (2016).
Chen, C. et al. Expanded catalog of microbial genes and metagenome-assembled genomes from the pig gut microbiome. Nat. Commun. 12, 1106 (2021).
Xiao, L. et al. A catalog of the mouse gut metagenome. Nat. Biotechnol. 33, 1103–1108 (2015).
Lavrinienko, A., Tukalenko, E., Mousseau, T. A., Thompson, L. R. & Watts, P. C. Two hundred and fifty-four metagenome-assembled bacterial genomes from the bank vole gut microbiota. Sci. Data 7, 312 (2020).
Huang, P. et al. The chicken gut metagenome and the modulatory effects of plant-derived benzylisoquinoline alkaloids. Microbiome 6, 211 (2018).
Maki, J. J., Bobeck, E. A., Sylte, M. J. & Looft, T. Eggshell and environmental bacteria contribute to the intestinal microbiota of growing chickens. J. Anim. Sci. Biotechnol. 11, 60 (2020).
Gilroy, R. et al. Extensive microbial diversity within the chicken gut microbiome revealed by metagenomics and culture. PeerJ 9, 133 (2021).
Glendinning, L., Stewart, R. D., Pallen, M. J., Watson, K. A. & Watson, M. Assembly of hundreds of novel bacterial genomes from the chicken caecum. Genome Biol. 21, 1–16 (2020).
Jian, X., Zhu, Y., Ouyang, J., Lei, Q. & Zhou, W. Alterations of gut microbiome accelerate multiple myeloma progression by increasing the relative abundances of nitrogen recycling bacteria. Blood 134, 688–688 (2019).
Ridaura, V. K. et al. Gut microbiota from twins discordant for obesity modulate metabolism in mice. Science 341, 1241214 (2013).
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 27, 824–834 (2017).
Li, D., Liu, C. M., Luo, R., Sadakane, K. & Lam, T. W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31, 1674–1676 (2015).
Kang, D. D. et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019).
Olm, M. R., Brown, C. T., Brooks, B. & Banfield, J. F. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 11, 2864–2868 (2017).
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
Bowers, R. M. et al. Minimum information about a single amplified genome (MISAG) and a metagenome- assembled genome (MIMAG) of bacteria and archaea. Nat. Biotechnol. 36, 660–660 (2018).
Parks, D. H. et al. Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life. Nat. Microbiol. 3, 253–253 (2018).
Almeida, A. et al. A unified sequence catalogue of over 280,000 genomes obtained from the human gut microbiome. Nat Biotechnol. https://doi.org/10.1038/s41587-020-0603-3 (2019).
Seshadri, R. et al. Cultivation and sequencing of rumen microbiome members from the Hungate1000 Collection. Nat. Biotechnol. 36, 359–367 (2018).
Pasolli, E. et al. Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle. Cell 176, 649–662. e620 (2019).
Chaumeil, P. A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics https://doi.org/10.1093/bioinformatics/btz848 (2019).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Allison, M. J., Mayberry, W. R., Mcsweeney, C. S. & Stahl, D. A. Synergistes jonesii, gen. nov., sp.nov.: a Rumen bacterium that degrades toxic pyridinediols. Systemapplmicrobiol 15, 522–529 (1992).
Chiang, E. et al. Verrucomicrobia are prevalent in north-temperate freshwater lakes and display class-level preferences between lake habitats. PLoS ONE 13, e0195112 (2018).
Turnbaugh, P. J. et al. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature 444, 1027–1031 (2006).
Ley, R. E., Turnbaugh, P. J., Klein, S. & Gordon, J. I. Microbial ecology: human gut microbes associated with obesity. Nature 444, 1022–1023 (2006).
Jami, E., White, B. A. & Mizrahi, I. Potential role of the bovine rumen microbiome in modulating milk composition and feed efficiency. PLoS ONE 9, e85423 (2014).
Jewell, K. A., McCormick, C. A., Odt, C. L., Weimer, P. J. & Suen, G. Ruminal bacterial community composition in dairy cows is dynamic over the course of two lactations and correlates with feed efficiency. Appl. Environ. Microbiol. 81, 4697–4710 (2015).
Lima, F. S., Oikonomou, G., Lima, S. F., Bicalho, M. L. S. & Bicalho, R. C. Prepartum and postpartum Rumen fluid microbiomes: characterization and correlation with production traits in dairy cows. Appl Environ. Microbiol. 81, 1327–1337 (2014).
Flint, H. J. & Stewart, C. S. in Encyclopedia of Food Microbiology (ed. Robinson R. K.) (Elsevier, 1999).
Avgustin, G., Wallace, R. J. & Flint, H. J. Phenotypic diversity among ruminal isolates of Prevotella ruminicola: proposal of Prevotella brevis sp. nov., Prevotella bryantii sp. nov., and Prevotella albensis sp. nov. and redefinition of Prevotella ruminicola. Int J Syst Bacteriol. 284–288 https://doi.org/10.1099/00207713-47-2-284 (1997).
Bandarupalli, V. V. K. & St-Pierre, B. Identification of a candidate starch utilizing strain of Prevotella albensis from bovine Rumen. Microorganisms 8, 2005 (2020).
Palevich, N. et al. Comparative genomics of rumen Butyrivibrio spp. uncovers a continuum of polysaccharide-degrading capabilities. Appl. Environ. Microbiol. 86, e01993–01919 (2019).
Ransom-Jones, E., Jones, D. L., McCarthy, A. J. & McDonald, J. E. The Fibrobacteres: an important phylum of cellulose-degrading bacteria. Microb Ecol. 63, 267–281 (2012).
Brown, D. W. & Moore, W. E. C. Distribution of butyrivibrio fibrisolvens in nature. J. Dairy Ence 43, 1570–1574 (1960).
Wang, Q. et al. A comparative study on rumen ecology of water buffalo and cattle calves under similar feeding regime. Vet. Med. Sci. 6, 746–754 (2020).
Prins, R. A., Hungate, R. E. & Prast, E. R. Function of the omasum in several ruminant species. Comp. Biochem. Physiol. Part A Physiol. 43, 155–163 (1972).
Segata, N. et al. Metagenomic biomarker discovery and explanation. Genome Biol. 12, R60 (2011).
Fu, L., Niu, B., Zhu, Z., Wu, S. & Li, W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150–3152 (2012).
Huerta-Cepas, J. et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 44, D286–D293 (2016).
Huerta-Cepas, J. et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol. Biol. Evol. 34, 2115–2122 (2017).
Cantarel, B. L. et al. The carbohydrate-active enZymes database (CAZy): an expert resource for glycogenomics. Nucleic Acids Res. 37, D233–D238 (2009).
Zhang, H. et al. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 46, W95–W101 (2018).
McCann, J. C., Wickersham, T. A. & Loor, J. J. High-throughput methods redefine the Rumen microbiome and its relationship with nutrition and metabolism. Bioinform Biol. Insights 8, 109–125 (2014).
Rodríguez, F. Control of lactate accumulation in ruminants using Prevotella bryantii. 3085941 (ProQuest Dissertations Publishing, 2003).
Rodriguez, F., Rasmussen, M. & Allison, M. CONTROL OF LACTATE ACCUMULATION IN A GOAT ACIDOSIS MODEL USING PREVOTELLA BRYANTU. This manuscript has been reproduced from the microfilm master UMI films the text directly from the original or copy submitted Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer, 68 (American Society for Microbiology, 2002).
Bolger, A. M., Marc, L. & Bjoern, U. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Luo, X. et al. Understanding divergent domestication traits from the whole-genome sequencing of swamp- and river-buffalo populations. Natl Sci. Rev. 7, 686–701 (2020).
Shen, Y. et al. Update soybean Zhonghuang 13 genome to a golden reference. Sci. China Life Sci. 62, 1257–1260 (2019).
Jiao, Y. et al. Improved maize reference genome with single-molecule technologies. Nature 546, 524–527 (2017).
Tang, H. et al. An improved genome release (version Mt4.0) for the model legume Medicago truncatula. BMC Genomics 15, 312 (2014).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. ArXiv https://doi.org/10.1093/bioinformatics/btp698 (2013).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Almeida, A. et al. A unified catalog of 204,938 reference genomes from the human gut microbiome. Nat. Biotechnol. 39, 105–114 (2021).
Asnicar, F., Weingart, G., Tickle, T. L., Huttenhower, C. & Segata, N. Compact graphical representation of phylogenetic data and metadata with GraPhlAn. PeerJ 3, e1029 (2015).
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).
Kelly, B. J. et al. Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA. Bioinformatics 31, 2461–2468 (2015).
Acknowledgements
Q.Y.L. acknowledges support from the National Natural Science Fund (U20A2051, 31760648, and 31860638), the Guangxi Natural Science Foundation (AB18221120), and the Guangxi Distinguished Scholars Program (201835). We acknowledge Prof. Jiaxiang Huang, Dr. Guangsheng Qin, and Hui Li for providing partial buffalo samples. We also acknowledge Prof. Jianghua Shang, Assistant Prof. Chunyan Yang, Dr. Jieping Huang, Dr. Hui Li, and Miss Siwen Wu for manuscript polishing and editorial assistance.
Author information
Authors and Affiliations
Contributions
Q.L., W.H.C., and G.H. designed and directed the research; F.T., Z. Li, K.C., Y.L., Z. Liu, Y.X., B.L., C.Y., L.Y., D.S., and Q.L. participated in the sampling and background investigation of buffalo; T.W., N.L.G., and F.T. analyzed the data and wrote the paper with results from all authors; S.W. and Y.D. participated in the production of the chart; A.P. participated in the revision of the article. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tong, F., Wang, T., Gao, N.L. et al. The microbiome of the buffalo digestive tract. Nat Commun 13, 823 (2022). https://doi.org/10.1038/s41467-022-28402-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-28402-9
- Springer Nature Limited
This article is cited by
-
A compendium of ruminant gastrointestinal phage genomes revealed a higher proportion of lytic phages than in any other environments
Microbiome (2024)
-
Single-cell transcriptomics across 2,534 microbial species reveals functional heterogeneity in the rumen microbiome
Nature Microbiology (2024)
-
The multi-kingdom microbiome of the goat gastrointestinal tract
Microbiome (2023)
-
The unique gut microbiome of giant pandas involved in protein metabolism contributes to the host’s dietary adaption to bamboo
Microbiome (2023)
-
Differences in the luminal and mucosal gut microbiomes and metabolomes of oriental rat snake (Ptyas mucosus)
Applied Microbiology and Biotechnology (2023)