Evaluation of Full-Length Versus V4-Region 16S rRNA Sequencing for Phylogenetic Analysis of Mouse Intestinal Microbiota After a Dietary Intervention

Katiraei, Saeed; Anvar, Yahya; Hoving, Lisa; Berbée, Jimmy F. P.; van Harmelen, Vanessa; Willems van Dijk, Ko

doi:10.1007/s00284-022-02956-9

Evaluation of Full-Length Versus V4-Region 16S rRNA Sequencing for Phylogenetic Analysis of Mouse Intestinal Microbiota After a Dietary Intervention

Short Communication
Open access
Published: 30 July 2022

Volume 79, article number 276, (2022)
Cite this article

Download PDF

You have full access to this open access article

Current Microbiology Aims and scope Submit manuscript

Evaluation of Full-Length Versus V4-Region 16S rRNA Sequencing for Phylogenetic Analysis of Mouse Intestinal Microbiota After a Dietary Intervention

Download PDF

Saeed Katiraei^1,2,
Yahya Anvar^1,4,
Lisa Hoving^1,2,
Jimmy F. P. Berbée^2,3,
Vanessa van Harmelen^1,2 &
…
Ko Willems van Dijk^1,2,3

5243 Accesses
14 Citations
3 Altmetric
Explore all metrics

Abstract

The composition of microbial communities is commonly determined by sequence analyses of one of the variable (V) regions in the bacterial 16S rRNA gene. We aimed to assess whether sequencing the full-length versus the V4 region of the 16S rRNA gene affected the results and interpretation of an experiment. To test this, mice were fed a diet without and with the prebiotic inulin and from cecum samples, two primary data sets were generated: (1) a 16S rRNA full-length data set generated by the PacBio platform; (2) a 16S rRNA V4 region data set generated by the Illumina MiSeq platform. A third derived data set was generated by in silico extracting the 16S rRNA V4 region data from the 16S rRNA full-length PacBio data set. Analyses of the primary and derived 16S rRNA V4 region data indicated similar bacterial abundances, and α- and β-diversity. However, comparison of the 16S rRNA full-length data with the primary and derived 16S rRNA V4 region data revealed differences in relative bacterial abundances, and α- and β-diversity. We conclude that the sequence length of 16S rRNA gene and not the sequence analysis platform affected the results and may lead to different interpretations of the effect of an intervention that affects the microbiota.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The composition of gut microbiota have been associated with a variety of pathophysiological conditions, including obesity, low-grade inflammation, and overt disease [1,2,3]. We [4,5,6] and others [7, 8] have exploited possibilities to beneficially affect microbiota using probiotics or dietary compounds that affect the composition and/or activity of the gut bacteria. To determine the success of intervention, the composition of gut microbiota are commonly determined by massive parallel sequencing of one of the variable (V) regions of the bacterial 16S rRNA gene [9]. Sequence analysis of the V-region of 16S rRNA has proven to be a powerful tool to describe the composition of bacterial communities [10, 11]. However, the resolution of the taxonomic description of the communities is limited by the uniqueness of the V-region sequences and available reference databases [12]. Numerous different bacterial species have almost identical V-region sequences which makes distinguishing of these bacteria based on a single V-region impossible. The currently available 16S reference databases that are used for taxonomic classification of 16S sequencing data are still quite limited and do not contain a reference sequence for all experimentally obtained 16S sequences [13, 14]. Therefore, some 16S V-region sequences can only be assigned up to the family and/or genus level or cannot be assigned at all.

Massive parallel sequencing of 16S rRNA V-regions has been made possible by the development of next-generation sequencing technology (NGS). A typical NGS run on an Illumina MiSeq will provide several million 250 bp paired-end reads per flow cell. The advantage of high throughput is countered by the relatively short reads that are produced by NGS. Although many of the limitations of short-read sequencing can be addressed using computational approaches, it is extremely challenging, if not impossible, to assemble longer sequences composed of highly homologous parts. Examples of this are repeated sequences in the human genome, but also the repeated sequences in the genomes of various bacteria that constitute the microbiota. A number of so-called third-generation sequencing technologies have been developed to overcome these limitations by sequencing very long amplicons. One such approach is developed by Pacific BioSciences (PacBio) and is termed single-molecule real-time (SMRT) sequencing [15].

We aimed to assess whether sequencing the full-length 16S rRNA gene using SMRT sequencing affected the results and interpretation of a dietary intervention compared to sequencing only the V4 region of this gene. This study included two experimental conditions; a Western-type diet (WTD) and a WTD complemented with the fibre inulin. Inulin is a fructose polymer that can only be degraded by intestinal microbiota and therefore strongly favours the expansion of specific intestinal microbiota [16,17,18,19]. To compare the effects of the dietary intervention measured on either the PacBio or Illumina MiSeq platform, we performed taxonomic analysis and diversity analysis on primary and derived data sets.

Materials and Methods

Cecum Samples

Cecum content was collected for microbial analysis. The cecum samples used in study were obtained in the context of a larger study of which the results were published recently [5].

DNA Isolation

From cecum samples, genomic DNA was extracted using phenol: chloroform: isoamyl alcohol (25: 24: 1) (Invitrogen), precipitated with isopropanol, and washed with 70% ethanol.

PacBio Sequencing

16S rRNA full-length amplification was performed using degenerate primers containing 5’ M13 universal tail sequences (Table S1). The 16S locus was amplified using LA Taq polymerase (Takara) with 400 μM dNTPs, 50 ng DNA template, and 400 nM of each primer in 1 × LA buffer + magnesium with 30 cycles of PCR (20 s 94 °C, 30 s 48 °C, 2 min 68 °C). PCR reactions were size selected using 0.65 × AMPure XP beads (Beckman Coulter). Amplicons were barcoded in a second PCR reaction containing universal tail oligos complementary to the M13 universal tail sequences (Table S1). Barcodes were added using Herculase II Taq polymerase (Agilent) with 250 μM dNTPs, 2ul of purified PCR product, and 400 nM of each primer in a 1 × reaction buffer with 5 cycles of PCR (20 s 95 °C, 20 s 58 °C, 2 min 72 °C). The barcoded amplicons were size selected using 0.65 × AMPure XP beads (Beckman Coulter). 500 ng of barcoded amplicons were prepared for sequencing using the amplicon template preparation protocol, 2015 release (Pacific Biosciences) including DNA damage repair and SMRTbell adapter ligation. Libraries were sequenced on the Pacific Biosciences RSII using MagBead loading with 6 h of movie time and P6-C4 chemistry.

In Silico Isolation of V4 Regions from Full-Length 16S rRNA PacBio Sequencing Data

V4 regions from full-length 16S rRNA PacBio data set were in silico isolated by the V-ripper script [20] using forward primer (5′‐GTGCCAGCMGCCGCGGTAA‐3′) and the reverse primer (5′‐GGACTACHVGGGTWTCTAAT‐3′). Subsequently, isolated sequences with length between 100 and 300 bp were retained.

Illumina Sequencing

Genomic DNA was sent to the Broad Institute of MIT and Harvard (Cambridge, USA). Microbial 16S rRNA was amplified targeting the hyper‐variable V4 region using forward primer 515F (5′‐GTGCCAGCMGCCGCGGTAA‐3′) and the reverse primer 806R (5′‐GGACTACHVGGGTWTCTAAT‐3′). The cycling conditions consisted of an initial denaturation of 94 °C for 3 min, followed by 25 cycles of denaturation at 94 °C for 45 s, annealing at 50 °C for 60 s, extension at 72 °C for 5 min, and a final extension at 72 °C for 10 min. Sequencing was performed using the Illumina MiSeq platform generating paired‐end reads of 175 bp in length in each direction. Overlapping paired‐end reads were subsequently aligned. Details of this protocol have previously been described [21].

Sequencing Data Analysis

All three data sets were analysed using the operational taxonomic unit (OTU) approach. This was done by using the QIIME pipeline [22]. We used SILVA 132 QIIME release as reference OTU taxonomy database. Prior to OTU picking, each data set was quality filtered by sickle version 1.33 and low-quality reads were discarded. Open reference OTU picking strategy with 97% sequence similarity and minimum OTU size of two reads was used. The α-diversity metric based on observed OTUs was calculated continuously from 50 reads/sample up to 3300 reads/sample with increasing steps of 50 reads, with 10 × rarefaction. Unweighted UniFrac distances, with 10 jack-knifed replicates was measured at rarefaction depth of 3000 reads per sample, based on the unfiltered OTU table and relative bacterial abundance was determined. Prior to relative abundance visualization, rare taxa that were present at less than 0.1% were filtered. Sequence data are submitted to SRA database and are accessible with BioProject accession number PRJNA786882.

Results

Sequencing Depth

Cecum content from mice fed a WTD without or with 10% inulin for 11 weeks was collected (n = 2 per group) and genomic DNA was extracted. The full-length 16S rRNA gene was amplified for PacBio sequencing, and the V4 region of the bacterial 16S rRNA gene was PCR amplified for Illumina short-read sequencing. To determine platform bias in the data sets obtained from the PacBio and Illumina platforms, a 16S rRNA V4 region data set was generated in silico from the full-length 16S rRNA PacBio data set (V4 PacBio). Table S2 shows that the read count obtained by PacBio and Illumina sequencing are in range of a typical run for the platforms, and the reads have the correct mean read length for the full-length 16S rRNA (approx. 1500 bp) and V4 region (approx. 250 bp). Interestingly, the V4 PacBio read count for individual samples are approximately 50% of the read count for the full-length 16S rRNA PacBio data they were derived from (Table S2). The V-ripper script in combination with the used primer sequences, apparently, does not recognize 50% of the full-length 16S rRNA sequences.

Sequencing Data Analysis

For operational taxonomic unit (OTU) picking, open reference OTU picking strategy with 97% sequence similarity and minimum OTU size of two reads was used. The minimum OTU size of at least two sequences/OTU ensured that singletons are excluded from the data. Table 1 shows the number of OTUs for individual samples and the number of sequences that these OTUs contained. In the 16S rRNA full-length PacBio data set, proportionally more reads were discarded in the OTU picking step compared to both 16S rRNA V4 data sets. These discarded reads were singletons and sequences that failed to align with the reference database. Furthermore, sequencing full-length 16S rRNA resulted in a higher percentage of unassigned taxa (2.9–8.4% of total reads) compared to both V4 data sets (0.05–0.6% of total reads; Table 1). These were reads without any reference sequence available in the reference database. The number of unassigned reads in the full-length 16S rRNA data set was in particular higher for samples of inulin-fed mice compared to samples of control mice.

Table 1 Microbial community analysis

Full size table

Full-Length 16S rRNA Results into Higher α-Diversity

The OTU richness was assessed by plotting α-diversity versus sequencing depth. The α-diversity expressed as number of unique observed OTUs was calculated continuously from 50 reads/sample up to 3300 reads/sample with increasing steps of 50 reads, with 10 × rarefaction. Already at a sequencing depth of 300 reads/sample, α-diversity of 16S rRNA full-length PacBio samples was increased compared to both V4 PacBio and V4 Illumina data sets for control and inulin-fed samples (Fig. 1), while α-diversity of the V4 PacBio and V4 Illumina data sets were comparable. These data show that sequencing the full-length 16S rRNA resulted in a higher number of unique OTUs, already at a relatively low sequencing depth.

Using Full-Length 16S rRNA Reveals a Different Bacterial Phylogeny as Compared with V4 Region

The between sample diversity, or β-diversity, was determined by calculating unweighted UniFrac distances. This is a validated and widely used quantitative distance metric for studying microbial community clustering that takes the phylogeny of communities into account [23, 24]. Principal coordinate analysis was performed and the variation explained by the first two principal coordinates is plotted in (Fig. 2). Principal coordinate (PC)1, which explains 34.8% of the data, clearly separates the full-length 16S rRNA PacBio data from the V4 amplicon data. The unweighted UniFrac distance for the V4 PacBio data set was comparable with the UniFrac distance of Illumina V4 regions, indicating limited sequencing platform bias in determining β-diversity. In order to assess the robustness of the UniFrac distance 10 × jack-knifing at 3000 reads/sample was performed for all samples. The jack-knifing variance, indicated by the ellipsoids around the data points, was smaller for the full-length 16S rRNA sequenced samples compared to both V4 data sets (Fig. 2). This indicates that a longer amplicon length provided a more robust UniFrac distance assignment.

Using Full-Length 16S rRNA Gene Results in a Different Bacterial Composition and Relative Abundance

In addition to diversity analyses, we aimed to study if sequence length affected the taxonomic analysis outcome. We hypothesized that a longer amplicon length increased the resolution of the analysis by detecting additional taxa which would not be observed by sequencing the V4 region only. Therefore, we compared the full-length 16S rRNA PacBio samples with the V4 PacBio and V4 Illumina data sets. In this way, we could exclude platform bias and detect the effects of amplicon length on taxonomic analysis after a dietary intervention.

Genus level is considered as the maximum resolution of 16S sequencing. Therefore, we compared relative abundance of bacterial taxa in the three data sets at genus level. Sequencing the full-length 16S rRNA gene showed a different relative abundance at genus levels compared to both V4 data sets, both for samples of control and inulin-fed mice (Fig. 3). Bacterial relative abundances of V4 PacBio and V4 Illumina data sets were comparable for control samples. For inulin-fed mice, sample In2 showed variation in relative abundance for several taxa between the V4 PacBio and V4 Illumina data set (Fig. 3). Interestingly, relative abundance of the genus Faecalibaculum that blooms with inulin intervention was higher in the full-length 16S rRNA data set compared to both V4 data sets. Relative abundance of the uncultured genus of Muribaculaceae family that increases with inulin intervention was lower in the full-length 16S rRNA data set compared to both V4 data sets (Fig. 3). Relative abundance of the Bacteroides genus that decreases with inulin intervention was higher in full-length 16S rRNA data set compared to both V4 data sets (Fig. 3). Remarkably, the genus Lactobacillus was detected in the V4 PacBio and V4 Illumina data sets for both dietary conditions, but was completely absent inform the full-length 16S rRNA PacBio data set for both dietary conditions. After inulin intervention, other taxa like GCA-900066575, Lachnospiraceae-UCG006, Lachnospiraceae uncultured genus, Oscillibacter and Ruminiclostridium 9 were detected in both V4 data sets, and were also almost or completely absent in the full-length 16S rRNA PacBio data set. Taken together, this taxonomic analysis shows that sequencing the full-length 16S rRNA gene results in a different bacterial composition and relative abundance of bacterial species both for control and inulin-fed mice compared to determining the sequence of the V4 region only.

Discussion

We hypothesized that sequencing the full-length 16S rRNA gene would provide a higher resolution in terms of diversity and taxonomic analyses compared to sequencing a single short amplicon of the 16S rRNA marker gene such as the V4 region.

Our results show that in the in silico-extracted V4 PacBio data set, individual samples have approximately 50% of the read count of the full-length 16S rRNA PacBio data set. This reduction in read count after in silico isolation of the V4 sequences from the full-length 16S rRNA data set might be caused by variability in the primer sequences. It is known that primer choice for sequencing hypervariable regions of 16S rRNA influences sequencing outcome, due to the fact that primers do not cover the 16S rRNA V4 flanking region for all bacteria [25,26,27]. These data could indicate that a proportion of the taxa that are identified by full-length 16 s rRNA gene sequencing are not detected by sequencing the V4 region only. Alternatively, although the circular consensus sequencing approach of PacBio has a very low error rate, this could also explain a proportion of the V-regions that could not be extracted using the V-ripper script. However, since PacBio sequencing errors are random, this would have no consequences on the distribution and phylogenetic assignment of the extracted sequences.

In addition to primer choice, other factors including the DNA extraction method and choice of the 16S V-region may affect experimental outcome and introduce biases to the diversity and taxonomic analysis. DNA extraction method: Mackenzie et al. studied the effects of different DNA extraction methods, including commercially available DNA isolation kits and the phenol: chloroform: isoamyl alcohol method [28]. Different DNA isolation methods resulted in different DNA yield, DNA quality, and relative abundance of taxon-assigned OTUs. Other studies addressing microbial DNA extraction methods report similar issues [29, 30]. These results emphasize that it is important, if at all possible, to be consistent in the use of a DNA extraction method. Choice of 16S V-region: Sequencing the V4 region in combination with Illumina MiSeq platform has been widely used for taxonomic and diversity analysis [11, 31]. More recently, a combination of two regions like the V2–V3 or V3–V4 region have been used for this purpose [32]. Burkin et al. compared V2–V3 with V3–V4 regions in water samples and reported that V2–V3 sequencing has higher resolution for lower-rank taxa [32]. Abellan-Schneyder et al. conducted an extensive study including six different combinations of the V-regions on human gut and mock samples [33]. They recommended sequencing of V3–V4 regions for human gut samples, but also mentioned that primer choice has significant influence on the resulting microbial composition [33]. Since there seems no consensus on which V-regions provides the best results, investigators should consider the choice for their desired V-region carefully based on the experimental design and sample type. The cecum samples used in study were obtained in the context of a larger study of which the results were published as mentioned in the Materials and Methods section [5]. In order to maintain comparability with previously obtained data we have used the V4 region in this current study.

Diversity analyses and taxonomic analysis are based on OTUs. An OTU is described as a cluster of sequences with a minimum amount of sequence identity; in the case of genus level the threshold for sequence identity is set at 97% similarity [9]. Since OTU picking is based on sequence identity, sequence length can thus affect the number and composition of OTUs in a given data set. α-diversity metric observed that OTUs showed increased number of unique OTUs for the full-length 16A rRNA PacBio data set compared to both V4 data sets.

In addition, our results showed that β-diversity is affected by the sequence length. β-diversity analysis was performed by calculating the unweighted UniFrac distances. The unweighted UniFrac distance is a qualitative distance metric which takes the phylogeny of the sample into account [24]. The PCoA plot of unweighted UniFrac distance is based on the number of shared and unshared branches of the phylogenetic tree of the samples and is therefore a measure of heterogeneity of the bacterial population [23, 24]. Since 16S rRNA full-length PacBio and V4 Illumina sequenced samples are separated in the PCoA plot, we can conclude that these samples had different phylogenetic trees which reflected different bacterial compositions. As samples of the V4 PacBio data set and the V4 Illumina clustered together, we can conclude that the difference in phylogenetic trees and thus bacterial composition is not due to platform bias (PacBio vs Illumina), but caused by the difference in sequence length. Furthermore jack-knifing variance, which determines how often the cluster results are recovered using random subsets of the data, was smaller for the full-length 16S rRNA PacBio samples compared to both V4 data sets and shows that sequencing full-length 16S rRNA resulted in increased robustness of the data [24]. It has previously been shown that the PacBio platform can be used for studying microbiota communities [34, 35]. Based on our findings and the fact that β-diversity metric UniFrac can distinguish bacterial communities at a depth of 50 reads/sample [23], we suggest that the PacBio platform can be used to study intestinal microbial communities at a lower sequencing depth. This allows multiplexing multiple samples on a single-molecule real-time (SMRT) cell in order to reduce resources and sequencing costs.

In addition to diversity analysis, interpretation of experimental outcome requires insight into the bacterial composition of a sample to understand e.g. which bacterial species are able to convert a dietary compound. Taxonomic analysis of the three data sets showed that sequencing full-length 16S rRNA resulted in a different bacterial composition as relative abundances of taxa were increased or decreased with 16S rRNA full-length PacBio after inulin intervention compared to both V4 data sets. Interestingly, the genus Lactobacillus was completely absent in the full-length 16S rRNA PacBio data set, while being detected in both V4 data sets. This difference in taxa detection is of major importance for interpretation of biological data. It should be mentioned that in our previous article, exclusively relied on 16S rRNA V4 region sequencing by Illumina, we reported that the genus Allobaculum bloomed after inulin intervention [5]. However, here we report that Faecalibaculum bloomed after inulin intervention. Faecalibaculum is closely related to Allobaculum with 86.9% sequence similarity and was recently isolated from laboratory mice [36]. Microbial data of our initial article were analysed using the Greengenes 13.8 reference database and for the current work we used the SILVA 132 reference database which likely explains this discrepancy in annotation.

Sequencing the full-length 16S rRNA gene resulted in the detection of a higher percentage of unassigned reads compared to sequencing the V4 regions only. Interestingly, in our study the percentage of unassigned reads was higher in samples of inulin-fed mice. This finding might suggest that at least part of the bacterial taxa blooming on inulin are in this unassigned fraction of the data. Since we cannot assign these reads, we cannot fully utilize the advantage of full-length 16S rRNA gene sequencing compared to V4 sequencing.

Conclusion

Taken together, we conclude that sequencing the full-length 16S rRNA gene provides a different view regarding bacterial relative abundance, in-sample diversity, and in in-between-sample diversity, as compared to V4 sequencing regardless of sequence analysis platform. This clearly has implications for interpretation of biological data after a dietary intervention.

References

Turnbaugh PJ, Ley RE, Mahowald MA et al (2006) An obesity-associated gut microbiome with increased capacity for energy harvest. Nature 444:1027–1031. https://doi.org/10.1038/nature05414
Article PubMed Google Scholar
Scher JU, Abramson SB (2011) The microbiome and rheumatoid arthritis. Nat Rev Rheumatol 135:612–615. https://doi.org/10.1038/nrrheum.2011.121
Article CAS Google Scholar
Sharon G, Garg N, Debelius J et al (2014) Specialized metabolites from the microbiome in health and disease. Cell Metab 20:719–730. https://doi.org/10.1016/j.cmet.2014.10.016
Article CAS PubMed PubMed Central Google Scholar
Hoving LR, Katiraei S, Heijink M et al (2018) Dietary Mannan oligosaccharides modulate gut microbiota, increase fecal bile acid excretion, and decrease plasma cholesterol and atherosclerosis development. Mol Nutr Food Res. https://doi.org/10.1002/mnfr.201700942
Article PubMed Google Scholar
Hoving LR, Katiraei S, Pronk A et al (2018) The prebiotic inulin modulates gut microbiota but does not ameliorate atherosclerosis in hypercholesterolemic APOE*3-Leiden.CETP mice. Sci Rep 8:16515. https://doi.org/10.1038/s41598-018-34970-y
Article CAS PubMed PubMed Central Google Scholar
Katiraei S, de Vries MR, Costain AH et al (2020) Akkermansia muciniphila Exerts Lipid-Lowering and Immunomodulatory Effects without Affecting Neointima Formation in Hyperlipidemic APOE*3-Leiden.CETP Mice. Mol Nutr Food Res. https://doi.org/10.1002/mnfr.201900732
Article PubMed Google Scholar
Chen Z, Guo L, Zhang Y et al (2014) Incorporation of therapeutically modified bacteria into gut microbiota inhibits obesity. J Clin Investig 124:3391–3406. https://doi.org/10.1172/JCI72517
Article CAS PubMed PubMed Central Google Scholar
Plovier H, Everard A, Druart C et al (2017) A purified membrane protein from Akkermansia muciniphila or the pasteurized bacterium improves metabolism in obese and diabetic mice. Nat Med 23:107–113. https://doi.org/10.1038/nm.4236
Article CAS PubMed Google Scholar
Drancourt M, Bollet C, Carlioz A et al (2000) 16S ribosomal DNA sequence analysis of a large collection of environmental and clinical unidentifiable bacterial isolates. J Clin Microbiol 38:3623–3630. https://doi.org/10.1073/pnas.0504930102
Article CAS PubMed PubMed Central Google Scholar
Claesson MJ, Wang Q, O’Sullivan O et al (2010) Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regions. Nucleic Acids Res. https://doi.org/10.1093/nar/gkq873
Article PubMed PubMed Central Google Scholar
Caporaso JG, Lauber CL, Walters WA et al (2011) Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc Natl Acad Sci USA 108(Suppl):4516–4522. https://doi.org/10.1073/pnas.1000080107
Article PubMed Google Scholar
Yang B, Wang Y, Qian P-Y (2016) Sensitivity and correlation of hypervariable regions in 16S rRNA genes in phylogenetic analysis. BMC Bioinform 17:135. https://doi.org/10.1186/s12859-016-0992-y
Article CAS Google Scholar
Poretsky R, Rodriguez-R LM, Luo C et al (2014) Strengths and limitations of 16S rRNA gene amplicon sequencing in revealing temporal microbial community dynamics. PLoS ONE. https://doi.org/10.1371/journal.pone.0093827
Article PubMed PubMed Central Google Scholar
Knight R, Vrbanac A, Taylor BC et al (2018) Best practices for analysing microbiomes. Nat Rev Microbiol 16:410–422. https://doi.org/10.1038/s41579-018-0029-9
Article CAS PubMed Google Scholar
Fichot EB, Norman RS (2013) Microbial phylogenetic profiling with the Pacific Biosciences sequencing platform. Microbiome 1:10. https://doi.org/10.1186/2049-2618-1-10
Article PubMed PubMed Central Google Scholar
Moens F, Weckx S, De Vuyst L (2016) Bifidobacterial inulin-type fructan degradation capacity determines cross-feeding interactions between bifidobacteria and Faecalibacterium prausnitzii. Int J Food Microbiol 231:76–85. https://doi.org/10.1016/j.ijfoodmicro.2016.05.015
Article CAS PubMed Google Scholar
Roberfroid MB (2005) Introducing inulin-type fructans. Br J Nutr 93:S13. https://doi.org/10.1079/BJN20041350
Article CAS PubMed Google Scholar
Dewulf EM, Cani PD, Claus SP et al (2013) Insight into the prebiotic concept: lessons from an exploratory, double blind intervention study with inulin-type fructans in obese women. Gut 62:1112–1121. https://doi.org/10.1136/gutjnl-2012-303304
Article CAS PubMed Google Scholar
Catry E, Bindels LB, Tailleux A et al (2017) Targeting the gut microbiota with inulin-type fructans: preclinical demonstration of a novel approach in the management of endothelial dysfunction. Gut. https://doi.org/10.1136/gutjnl-2016-313316
Article PubMed Google Scholar
Allard G, Ryan FJ, Jeffery IB, Claesson MJ (2015) SPINGO: a rapid species-classifier for microbial amplicon sequences. BMC Bioinform 16:324. https://doi.org/10.1186/s12859-015-0747-1
Article CAS Google Scholar
Gevers D, Kugathasan S, Denson LA et al (2014) The treatment-naive microbiome in new-onset Crohn’s disease. Cell Host Microbe 15:382–392. https://doi.org/10.1016/j.chom.2014.02.005
Article CAS PubMed PubMed Central Google Scholar
Caporaso JG, Kuczynski J, Stombaugh J et al (2010) QIIME allows analysis of high- throughput community sequencing data Intensity normalization improves color calling in SOLiD sequencing. Nat Methods 7:335–336. https://doi.org/10.1038/nmeth0510-335
Article CAS PubMed PubMed Central Google Scholar
Lozupone C, Lladser ME, Knights D et al (2011) UniFrac: an effective distance metric for microbial community comparison. ISME J 5:169–172. https://doi.org/10.1038/ismej.2010.133
Article PubMed Google Scholar
Lozupone C, Knight R (2005) UniFrac: a new phylogenetic method for comparing microbial communities. Appl Environ Microbiol 71:8228–8235. https://doi.org/10.1128/AEM.71.12.8228-8235.2005
Article CAS PubMed PubMed Central Google Scholar
Takahashi S, Tomita J, Nishioka K et al (2014) Development of a prokaryotic universal primer for simultaneous analysis of Bacteria and Archaea using next-generation sequencing. PLoS ONE. https://doi.org/10.1371/journal.pone.0105592
Article PubMed PubMed Central Google Scholar
Klindworth A, Pruesse E, Schweer T et al (2013) Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res 41:1–11. https://doi.org/10.1093/nar/gks808
Article CAS Google Scholar
Martinez-Porchas M, Villalpando-Canchola E, Ortiz Suarez LE, Vargas-Albores F (2017) How conserved are the conserved 16S-rRNA regions? PeerJ 5:e3036. https://doi.org/10.7717/peerj.3036
Article CAS PubMed PubMed Central Google Scholar
Mackenzie BW, Waite DW, Taylor MW (2015) Evaluating variation in human gut microbiota profiles due to DNA extraction method and inter-subject differences. Front Microbiol 6:1–11. https://doi.org/10.3389/fmicb.2015.00130
Article CAS Google Scholar
Janabi AHD, Kerkhof LJ, McGuinness LR et al (2016) Comparison of a modified phenol/chloroform and commercial-kit methods for extracting DNA from horse fecal material. J Microbiol Methods 129:14–19. https://doi.org/10.1016/j.mimet.2016.07.019
Article CAS PubMed Google Scholar
Lim MY, Song EJ, Kim SH et al (2018) Comparison of DNA extraction methods for human gut microbial community profiling. Syst Appl Microbiol 41:151–157
Article CAS Google Scholar
Poli MC, Orange J (2017) Variation in microbiome LPS immunogenicity contributes to autoimmunity in humans. Pediatrics 140:S184–S185. https://doi.org/10.1542/peds.2017-2475W
Article Google Scholar
Bukin YS, Galachyants YP, Morozov IV et al (2019) The effect of 16S rRNA region choice on bacterial community metabarcoding results. Sci Data 6:190007. https://doi.org/10.1038/sdata.2019.7
Article CAS PubMed PubMed Central Google Scholar
Abellan-Schneyder I, Matchado MS, Reitmeier S et al (2021) Primer, pipelines, parameters: issues in 16S rRNA gene sequencing. mSphere. https://doi.org/10.1128/mSphere.01202-20
Article PubMed PubMed Central Google Scholar
Wagner J, Coupland P, Browne HP et al (2016) Evaluation of PacBio sequencing for full-length bacterial 16S rRNA gene classification. BMC Microbiol 16:274. https://doi.org/10.1186/s12866-016-0891-4
Article CAS PubMed PubMed Central Google Scholar
Pootakham W, Mhuantong W, Yoocha T et al (2017) High resolution profiling of coral-associated bacterial communities using full-length 16S rRNA sequence data from PacBio SMRT sequencing system. Sci Rep 7:2774. https://doi.org/10.1038/s41598-017-03139-4
Article CAS PubMed PubMed Central Google Scholar
Chang DH, Rhee MS, Ahn S et al (2015) Faecalibaculum rodentium gen. nov., sp. nov., isolated from the faeces of a laboratory mouse. Antonie van Leeuwenhoek, Int J Gen Mol Microbiol 108:1309–1318. https://doi.org/10.1007/s10482-015-0583-3
Article CAS Google Scholar

Download references

Funding

This research was financially supported by Rembrandt Institute of Cardiovascular Sciences (RICS) and Cardiovascular Research Netherlands (CVON IN-CONTROL).

Author information

Authors and Affiliations

Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
Saeed Katiraei, Yahya Anvar, Lisa Hoving, Vanessa van Harmelen & Ko Willems van Dijk
Einthoven Laboratory for Experimental Vascular Medicine, Leiden University Medical Center, Leiden, The Netherlands
Saeed Katiraei, Lisa Hoving, Jimmy F. P. Berbée, Vanessa van Harmelen & Ko Willems van Dijk
Division of Endocrinology, Department of Medicine, Leiden University Medical Center, Leiden, The Netherlands
Jimmy F. P. Berbée & Ko Willems van Dijk
Department of Human Genetics, Leiden Genome Technology Center (LGTC), Leiden University Medical Center (LUMC), Leiden, The Netherlands
Yahya Anvar

Authors

Saeed Katiraei
View author publications
You can also search for this author in PubMed Google Scholar
Yahya Anvar
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Hoving
View author publications
You can also search for this author in PubMed Google Scholar
Jimmy F. P. Berbée
View author publications
You can also search for this author in PubMed Google Scholar
Vanessa van Harmelen
View author publications
You can also search for this author in PubMed Google Scholar
Ko Willems van Dijk
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study concept and design: SK, YA, and KWvD. NGS data acquisition: SK and YA. Mouse studies and sample processing: SK and LH. Statistical analysis and interpretation of data: SK, YA, VvH, and KWvD. Writing of the manuscript: SK, VvH, and KWvD. Critical revision of the manuscript: YA, JFPB, and VvH. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Ko Willems van Dijk.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Research Involving Animals and their Data or Biological Material

Mouse experiments were performed in compliance with Dutch government guidelines and the Directive 2010/63/EU of the European Parliament and had received approval from the University Ethical Review Board (Leiden University Medical Center, The Netherlands).

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 16 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Katiraei, S., Anvar, Y., Hoving, L. et al. Evaluation of Full-Length Versus V4-Region 16S rRNA Sequencing for Phylogenetic Analysis of Mouse Intestinal Microbiota After a Dietary Intervention. Curr Microbiol 79, 276 (2022). https://doi.org/10.1007/s00284-022-02956-9

Download citation

Received: 09 August 2021
Accepted: 24 June 2022
Published: 30 July 2022
DOI: https://doi.org/10.1007/s00284-022-02956-9

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Evaluation of Full-Length Versus V4-Region 16S rRNA Sequencing for Phylogenetic Analysis of Mouse Intestinal Microbiota After a Dietary Intervention

Abstract

Introduction

Materials and Methods

Cecum Samples

DNA Isolation

PacBio Sequencing

In Silico Isolation of V4 Regions from Full-Length 16S rRNA PacBio Sequencing Data

Illumina Sequencing

Sequencing Data Analysis

Results

Sequencing Depth

Sequencing Data Analysis

Full-Length 16S rRNA Results into Higher α-Diversity

Using Full-Length 16S rRNA Reveals a Different Bacterial Phylogeny as Compared with V4 Region

Using Full-Length 16S rRNA Gene Results in a Different Bacterial Composition and Relative Abundance

Discussion

Conclusion

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Research Involving Animals and their Data or Biological Material

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 16 kb)

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation