Evaluation of PCR conditions for characterizing bacterial communities with full-length 16S rRNA genes using a portable nanopore sequencer

Fujiyoshi, So; Muto-Fujita, Ai; Maruyama, Fumito

doi:10.1038/s41598-020-69450-9

Evaluation of PCR conditions for characterizing bacterial communities with full-length 16S rRNA genes using a portable nanopore sequencer

Article
Open access
Published: 28 July 2020

Volume 10, article number 12580, (2020)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Evaluation of PCR conditions for characterizing bacterial communities with full-length 16S rRNA genes using a portable nanopore sequencer

Download PDF

So Fujiyoshi^1,2,
Ai Muto-Fujita³ &
Fumito Maruyama^1,2

12k Accesses
26 Citations
3 Altmetric
Explore all metrics

Abstract

MinION (Oxford Nanopore Technologies), a portable nanopore sequencer, was introduced in 2014 as a new DNA sequencing technology. MinION is now widely used because of its low initial start-up costs relative to existing DNA sequencers, good portability, easy-handling, real-time analysis and long-read output. However, differences in the experimental conditions used for 16S rRNA-based PCR can bias bacterial community assessments in samples. Therefore, basic knowledge about reliable experimental conditions is needed to ensure the appropriate use of this technology. Our study concerns the reliability of techniques for obtaining accurate and quantitative full-length 16S rRNA amplicon sequencing data for bacterial community structure assessment using MinION. We compared five PCR conditions using three independent mock microbial community standard DNAs and established appropriate, standardized, better PCR conditions among the trials. We then sequenced two mock communities and six environmental samples using Illumina MiSeq for comparison. Modifying the PCR conditions improved the sequencing quality; the optimized conditions were 35 cycles of 95 °C for 1 min, 60 °C for 1 min and 68 °C for 3 min. Our results provide important information for researchers to determine bacterial community using MinION accurately.

Species-level resolution of 16S rRNA gene amplicons sequenced through the MinION™ portable nanopore sequencer

Article Open access 28 January 2016

Direct 16S rRNA-seq from bacterial communities: a PCR-independent approach to simultaneously assess microbial diversity and functional activity potential of each taxon

Article Open access 31 August 2016

Using custom-built primers and nanopore sequencing to evaluate CO-utilizer bacterial and archaeal populations linked to bioH2 production

Article Open access 09 October 2023

Introduction

Most microbes in the natural environment have not yet been cultured, but recent molecular technological advances make it possible to study them without cultivation. New technologies have allowed breakthroughs to be made in the elucidation of roles of microbes in the natural environment and in the fields of human health; for example, in investigations of the human gut microbiome^1,2 and in bio-engineering for agriculture, bioremediation and industry³. Molecular techniques have provided researchers with various analytical procedures for understanding microbial communities using clone libraries^4,5, T-RFLP (terminal restriction fragment length polymorphism) analysis^6,7, and DGGE (denaturing gradient gel electrophoresis) techniques^8,9. Full-length bacterial 16S rRNA genes have historically been sequenced using conventional molecular cloning and Sanger sequencing, but this approach is time-consuming, expensive, and has low throughput¹⁰. Currently, MiSeq sequencing (Illumina, San Diego, CA, USA) is the most widely used platform for 16S rRNA gene amplicon sequencing for microbial community analysis. PCR is conducted on the variable regions (V2, V3, and V4) in bacteria, with the primers focusing on the conserved region of 16S rRNA¹¹. MiSeq, which has become popular for its high precision (99.9%), enables the PCR amplicon sequence determination by merging overlapped region of paired 300 nt facing reads¹². However, in terms of taxonomic resolution, comparative analyses have revealed the importance of the target region and the choice of the primer pair, as revealed by the following studies in this area. Cai et al.¹³ reported the effects of the 16S rRNA gene primer sets and recommended the use of V3 and V4 primer pairs for several environmental sample types. On the other hand, Guo et al.¹⁴ proposed the use of the V1 and V2 regions for analysing the functional bacterial groups in a sludge sample. However, Wang et al.¹⁵ recommended using the V5, V6, and V7 regions for ascertaining the bacterial community structure in aging flue-cured tobaccos because chloroplast and mitochondrial genes have lower co-amplification levels. Kindworth et al.¹⁶ showed that, based on the comparison of microbial community obtained using the multiple universal primer sets, each universal primer set generate significant differences in taxonomic spectrum. The short-read lengths (100–300-bp) inherent in the single universal primer set techniques also prevent species-level analyses in microbial ecology¹⁷.

In 2012, introduction of the high-throughput Pacific Biosciences (PacBio, Menlo Park, CA, USA) sequencer facilitated structural analysis of microbial communities¹⁸. The PacBio platform can obtain full-length 16S rRNA gene sequences, which increases taxonomic resolution by sequencing the number of the informative sites. Its primary limitations lie in its lack of versatility and exemplified by tedious sample preparation¹⁹. In 2014, the Nanopore MinION sequencer (Oxford Nanopore Technologies, London, UK), now regarded as breakthrough in DNA sequencing, was developed. It contains several intriguing features that enable real-time, on-site analyses of any genetic material. The device has been used in diverse ways in various fields, including drug-resistance gene analyses and assessment of the rapid gain in reptile and amphibian biodiversity in rainforests^20,21. MinION starts to be used more often and its sequencing quality has been improving with higher sequencing read accuracy in 1D sequencing (94%). Recently, an increasing number of studies have reported their concerns about on-site and real-time measurements using MinION^22,23,24. These MinION-based gene sequencing techniques have provided new insight into microbial community structures much more rapidly and easily than ever before. The optimization, establishment and standardization of methods for the quantitative evaluation of microbial composition in the environment are inevitable. This would allow scientists to accurately answer the fundamental question of microbial ecology: what kind of and how many microorganisms are present in the environment.

PCR-based 16S rRNA analysis of bacterial community structure is subject to biases from the PCR-related conditions. These include the template concentration, DNA polymerase choice, number of cycles used, amplification reaction time, and the reaction temperature^{25,26,27,28,29}. Bacterial communities can also be reconstructed by only collecting the 16S rRNA sequences obtained from metagenomes, thereby avoiding PCR bias; however, PCR-free libraries require relatively large amounts of input DNA, and are impractical for many sample types³⁰. Therefore, cost-effective marker gene amplicon sequencing is often preferred over metagenomic sequencing for microbial community analysis because it enables the assessment of uncultivable organisms.

With this background, the aim of this study was to evaluate MinION PCR conditions through three approaches: (1) sequencing the full-length bacterial 16S rRNA gene from a single bacterial species to examine our bioinformatics pipeline; (2) sequencing the amplicon of full-length bacterial 16S rRNA gene from three different types of bacterial mock community DNAs under five different PCR conditions; and (3) sequencing the amplicon of full-length 16S rRNA genes from six environmental samples to compare the results with those of bacterial 16S rRNA V3–V4 regions sequenced using MiSeq.

Results

MinION data filtering by length

We initially used the Ribosomal Database Project (RDP) classifier version 2.11 (https://rdp.cme.msu.edu/)³¹, and the RDP classifier 16S training set No:16 as database (https://sourceforge.net/projects/rdp-classifier/files/RDP_Classifier_TrainingData/RDPClassifier_16S_trainsetNo16_rawtrainingdata.zip/download). However, this tool erroneously assigned Vibrio as Allomonas. Analysis with another tool (mothur³²) required an excessively long run time. We eventually chose to use Burrows-Wheeler Aligner (BWA-MEM, v. 15. 0.7 or v. 0.7. 17)³³ with a database derived from the RDP³⁴ as described in Methods. The MinION sequence length distribution and species identification accuracy were investigated using a single bacterial species, Vibrio cholerae. A commercially available kit (16S Rapid Sequencing Kit and 16S Barcoding Kit; Oxford Nanopore Technologies) with primers for full-length 16S rRNA amplicon sequencing on the MinION platform was used. The distribution of sequencing read lengths showed the highest frequency at around 1,500-base reads. Both shorter (5-base) and longer (200,000-base) reads also appeared. Three-step filtering ranging from 1,000–2,000 bases, 1,200–1,800 bases, and 1,400–1,600 bases was used to include the highest frequency length (1,500-base) in each step. As shown in Fig. 1, the hit ratio (V. cholerae/total reads) increased from 75% (2,998/3,994 reads) without filtering to 86% (1,489/1,735 reads) after filtering with 1,400–1,600 bases. Hence, we filtered the reads by length, using those in the 1,400–1,600 base range thereafter.

PCR conditions based on mock communities

The results obtained from the ZymoBIOMICS mock community were compared for the five PCR conditions shown in Fig. 2, with detailed information for each condition being described in Table 1. The Goods coverage values were greater than 99% for all samples (Table 2). Bray–Curtis dissimilarity³⁵ was used as a measure for assessing the difference between the observed and theoretical communities structures for each PCR condition. Bray–Curtis dissimilarity is bounded between 0 and 1, where 0 means that the two compared samples have the same composition, and 1 means the two sites do not share any species³⁶. The dissimilarity value for PCR condition T0 was 0.28, whereas those for T1, T2, T3 and T4 were 0.40, 0.24, 0.25 and 0.24, respectively (in detail and species level data, see Supplemental Table S2). The initial T0 and prefered T4 conditions were also compared using two other mock communities (that is, an even mix of 10 strains; ATCC 10, and an even mix of 20 strains; ATCC 20) (Fig. 3). For ATCC 10, the dissimilarity values between the observed and theoretical values for T0 and T4 were almost the same, at 0.253 and 0.257, respectively. However, in the case of ATCC 20, the dissimilarity values for T0 and T4 were 0.338 and 0.233, respectively. This tendency was more pronounced at the species-level data (Supplemental Table S3). We obtained a bacterial community composition similar to the theoretical one under T4 conditions, compared with that obtained under the T0 conditions. Almost all the genera except for Bifidobacterium were detected under both conditions. When the ATCC 10 and ATCC 20 mock communities were analysed using MiSeq, the dissimilarity values were 0.184 and 0.216, respectively. These values are smaller than those for MinION under T0 and T4 conditions.

Table 1 PCR conditions used in this study.

Full size table

Table 2 Sequence reads and the proportions of “Unassigned” generated per sample.

Full size table

PCR conditions using environmental samples and MiSeq sequencing

We applied our optimized PCR condition (T4) to environmental samples comprising bathtub inlet biofilms, showerhead feed water and showerhead biofilms from a bathroom (n = 6 samples). Each extracted DNA prepared from these samples was used separately as a PCR template, and 16S rRNA gene amplicon libraries were sequenced on both MinION and MiSeq platforms. The Goods coverage values were greater than 96% for all samples (Table 2). Figure 4 shows the 15 most prevalent genera in the samples. Under MinION T4 conditions, the genus distribution was similar to those under MiSeq, but only for the B6 sample (Fig. 4B). The remaining five samples from MinION with T4 output data resemble those generated under MinION T0 conditions. Table 2 lists the read numbers from MiSeq, MinION T0 and MinION T4. Regarding the proportion of the “Unassigned” category, the values from MiSeq, MinION T0 and MinION T4 were 11.9% (SD 11.7), 5.5% (SD 6.3) and 0.0% (SD 0.0), respectively. The values did not differ significantly for MinION T0 to MinION T4 (p value = 0.45 > 0.05), or MinION T0 to MiSeq (p- value = 0.36 > 0.05). Conversely, the MinION T4 value was significantly lower than that of MiSeq (p value = 0.044). At the genus level, the bacterial compositions from MinION T4 showed greater taxonomic resolution than those from MiSeq.

Discussion

We investigated whether fractional changes in taxonomic assignment and bacterial community composition exist in the comparison of PCR conditions using the MinION sequencer (Oxford Nanopore Technologies) with mock community and environmental samples. We also compared the results from the bacterial community samples from MinION with those from the MiSeq sequencer (Illumina). The search for new analytical tools with shorter run times has progressed considerably with the third-generation MinION sequencing platform, because of its rapid and easy handling³⁷. The increased information content inherent from longer read lengths help researchers with alignment-based taxonomy assignment¹⁷. With the ability to generate longer read lengths, MinION analysis can target the entire 16S rRNA gene coding region to offer highly accurate, sensitive and rapid pathogen detection^20,38. Our goal was to determine the better conditions under which accurate bacterial community structuring data could be obtained using a nanopore sequencer. DNA amplification was performed for 35 cycles in all our PCR protocols. Several studies have shown that larger PCR cycle numbers cause chimera generation and interfere with bacterial community structure analysis^39,40. Hence, minimizing the number of PCR cycles by optimizing the starting template conditions and concentrations is important²⁶. However, in our situation, reducing the number of PCR cycles to less than 35 decreased the number of environmental DNA samples that were amplified. Tap water has relatively lower microbial density than that in other environmental samples such as sea water and soil⁴¹. Therefore, the procedures used in this study may also be applied to samples with low microbial cell densities, such as atmospheric (~ 10⁴ cells m⁻³)⁴² samples, too.

The sequencing data from the strictest filtration range (1,400–1,600 bases) provided 86% matching to V. cholerae in our database (Fig. 1). This indicates that high-resolution analysis at the species level is possible with MinION by eliminating extraneous read data. Jethro et al.¹¹ stated that by using full-length sequences it is possible to classify nearly all environmental sequences into correct species. The read number (1,735 reads) after this treatment (43% of 3,994 reads in total) was used for subsequent analyses. Decreasing the read number caused no problem in this experiment because the precise mapping of only one species (V. cholera) was the main objective. Mitsuhashi et al.²³ and Nakagawa et al.⁴³ reported that a 5-min and 3-min running time on MinION, respectively, were enough for detecting specific bacteria. However, deeper sequencing is required to obtain better estimates of bacterial community structure and higher Goods coverage values^44,45. We conducted a 48-h MinION operation for the mock communities and environmental samples to provide sufficient read numbers in our study (Table 2).

Optimal PCR conditions need to be established to obtain accurate bacterial community structure analyses using MinION. We therefore compared five different PCR conditions using mock communities in preliminary experiments (Table 1). The dissimilarity values within the communities were smaller with the T2 condition (longer annealing time than T0), the T3 condition (longer time for both annealing and extension than T0) and the T4 condition (longer time for all stages than T0), than that of the T0 condition (Fig. 2). These results suggest that the polymerase extension time does not affect the bacterial community structure analysis. Conversely, a shorter annealing time, as in the T3 condition, resulted in relatively higher dissimilarity values compared with those from the other cases (T2 and T4). Considering the higher dissimilarity achieved under T1 together with the results under T3, a longer annealing time was deemed necessary for the proper assessment of bacterial community structure using full-length 16S rRNA PCR analysis. As shown in Fig. 3, at the ATCC10, the difference between T0 and T4 condition is not significant. Whereas at the ATCC20, the bacterial composition obtained from T4 condition was closer to the theoretical values than those obtained from the T0 condition. This is more precise at the species-level (Supplemental Table S3). These results suggest that the T4 PCR conditions with longer reaction times provide better results than the T0 condition when the sample diversity is high. However, Bifidobacterium was not detected by MinION analysis using either the T0 or T4 conditions. Previous publications have shown that the universal primers commonly used for metagenomic analyses (such as the 27F primer) possess limitations related to amplification bias. The 27F forward primer used in the 16S Barcoding Kit (SQK-RAB204, Oxford Nanopore Technologies) contains three base-pair mismatches against Bifidobacterium (27F primer: 5′-AGAGTTTGATCMTGGCTCAG-3′); that is, the sequence of the B. adolescentis primer site is 5′-AGGGTTCGATTCTGGCTCA-3′ (the mismatched bases are underlined)⁴⁶. Conversely, Hu et al.⁴⁷, for example, detected Bifidobacterium species by sequencing with universal primers (384F and 806R) in MiSeq. Thus, primer sequence modifications are required to avoid preferential detection of particular taxa, so that a broad range of bacteria species is covered, as was the case here with our B. adolescentis.

MinION has lower read accuracy but can generate much longer read lengths than those from MiSeq. Nygaard et al.⁴⁸ analysed building-dust microbiomes using MinION and MiSeq and showed that, at the genus and species levels, MinION reported greater taxonomic resolution than MiSeq. Long reads help alignment-based assignment of taxonomy as well, because of their increasing taxonomical information content. In this study, under the MinION T4 condition, all the environmental samples showed better taxonomic resolution at the genus level than that under MiSeq, the same as previously reported^17,48. Many papers have been published on software developments and shorter running times with MinION. For example, Kai et al.³⁸ reported on the possibility of decreasing the sequencing time of MinION by direct PCR approaches and found that a 3-min sequencing run generated a sufficient number of reads for taxonomic assignment and less than two hours was required for identifying appropriate bacterial species.

Characterization of environmental bacterial communities requires both qualitative and quantitative information through appropriate sequence read filtering as well as experimental procedures. Here, we have demonstrated, for the first time, that the accurate data on bacterial communities using MinION can be generated by comparing and choosing appropriate PCR conditions. The reaction condition in this study are the longest among PCR conditions compared to previous studies on bacterial community structure analysis with 16S rRNA with the MinION system (see Supplemental Table S4); however, using this condition we were able to obtain bacterial community structures that were comparable in quality with MiSeq.

Methods

Sample and DNA preparation

The full-length bacterial 16S rRNA gene from V. cholerae DNA, obtained through the courtesy of Dr. Taichiro Takemura (Nagasaki University, Japan), was used to examine the bioinformatics pipeline. PCR conditions evaluation was initially performed using a reference genomic DNA (Zymo Research Corp., Irvine, CA, USA; https://www.zymoresearch.com). The ZymoBIOMICS microbial community DNA standard (ZymoBIOMICS catalog # D6305) contains a mixture of genomic DNAs isolated from the pure cultures of eight bacterial and two fungal strains, and an equal molar quantity of 16S rDNA from each organism is provided. PCR conditions were examined for the mock community DNA samples (10 Strain Even Mix Genomic Material (MSA-1000) and 20 Strain Even Mix Genomic Material (MSA-1002); American Type Culture Collection (ATCC), Manassas, VA, USA) as well as environmental biofilm and water samples (that is, the insides of showerheads, bathtub inlets and showerhead feed water) in Japan. The biofilm samples were collected as described previously with a swab⁴⁹. Two litter of showerhead feed water was filtered on-site using a 50-mL syringe (Terumo corporation, Tokyo, Japan) and 0.2-µm filter cartridge (Sterivex, Millipore, MA, USA). The samples were immediately put in a cool box, carried to the lab and kept at − 20 °C. DNA was extracted from these samples using the DNeasy PowerBiofilm Kit (QIAGEN, Germantown, MD, USA) with slight modification⁵⁰. The extracted DNA was kept at − 20 °C after purification and precipitation using Dr. GenTLE precipitation carrier (Takara Bio Inc.). The concentrations and purities of the extracted DNA preparations were determined using the Spectro/Fluorometer (DS-11FX+, DeNovix, Wilmington, USA) and QuantiFluor dsDNA System (Promega, Madison, WI, USA).

PCR conditions

Polymerase amplification efficiency was initially checked using 13 biofilm and water samples collected from bathrooms. Five different DNA polymerase enzymes were tested and MightyAmp DNA polymerase v. 2 (Takara Bio Inc.) provided the highest amplification efficiency among other polymerases used in our evaluation, as reported previously elsewhere⁵¹. The PCRs were conducted using a primer pair (27F and 1492R) specific for the 16S rRNA gene-targeting sequence contained in the library preparation kit (SQK-RAS201 or SQK-RAB204, Oxford Nanopore Technologies). Some samples were barcoded using a rapid amplicon barcoding kit (SQK-RAB201, Oxford Nanopore Technologies) according to the manufacturer’s protocol (see Supplemental Table S1 for details). The first PCR condition (T0) involved a pre-heating step at 98 °C for 2 min, 35 cycles at 98 °C for 10 s, 60 °C for 15 s and 68 °C for 2 min. Alternative PCR conditions (T1–T4) for the duration of each step are listed in Table 1. The PCR on V. cholerae DNA was performed using the T0 condition. Table 1 shows the four different PCR conditions (T1–T4) that were used with the ZymoBIOMICS mock community sample. The ATCC mock community and the environmentally-sourced samples were amplified using two different PCR conditions (T0 and T4). The amplified fragments were separated on 2% agarose gels, stained with Safelook Load-Green (Wako, Osaka, Japan), and visualized on the FAS Nano Gel Document System (Nippon Genetics, Tokyo, Japan).

Nanopore sequencing library construction

After purifying the PCR products (50 μl each) with 30 μl of Agencourt AMPure XP beads (Beckman Coulter, Tokyo, Japan), the amount and purity of DNA eluted with 10 μl of buffer solution (10 mM Tris-HCl pH 8.0, with 50 mM NaCl) was determined using a Spectro/Fluorometer (DS-11FX+, DeNovix) and QuantiFluor dsDNA system (Promega). Purified amplicon DNA (100 or 50 fmol) was used as input DNA for the MinION-compatible libraries. The amplicons were added to 1 μl of rapid adapter (Oxford Nanopore Technologies) and incubated at room temperature for the required time.

Nanopore sequencing and base-calling

The nanopore sequencing libraries were separately run on FLO-MIN106 R9.4 flow cells (Oxford Nanopore Technologies) after performing platform quality control analysis. The amplicon library (11 μl) was diluted with running buffer (35 μl) containing 3.5 μl of nuclease-free water and 25.5 μl of loading beads. A 48-h sequencing protocol was initiated using the MinION control software, MinKNOW from v. 1.6.1-1.10.23 (Supplemental Table S1 contains the detailed information). MinION sequence reads (that is, fast5 data) were converted into fastq files using Albacore v. 1.2.1 or v. 2.1.3 software (Oxford Nanopore Technologies) and filtered with the threshold of a mean quality score over 7. (Supplemental Table S1 contains the detailed information). Figure 5 shows the study’s workflow.

Nanopore sequencing data analysis

Sequence length distribution was examined for each base-called fastq file using FastQC (v 0.11. 2) (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/)⁵². SeqKit 0.8.0 (https://bioinf.shenwei.me/seqkit/)⁵³ and original ruby script were used to filter the sequence data by three lengths of 1,000–2,000, 1,200–1,800 and 1,400–1,600 to include the highest read length frequency (1,500 bases), with the filtering effects examined by calculating the hit ratio to V. cholerae in all the leads. All data except for V. cholerae were filtered with a read length of 1,400–1,600. After filtering, the sequence reads were mapped using BWA-MEM (v. 15. 0.7 or v. 0.7. 17; https://github.com/lh3/bwa)³³ with the MinION analysis option (-x ont2d)⁵⁴ to a database derived from the RDP (RDP Release 11, Update 5, Sept. 30. 2016; 3,356,809 aligned and annotated 16S rRNA sequences)³⁴ and the top hit was used for the genus and species assignment. The RDP hierarchy browser (https://rdp.cme.msu.edu/hierarchy/hb_intro.jsp) was used with the following filters: strain = “Type”; source = “isolates”; size “≥ 1,200”; quality = “Good”; taxonomy = “Nomenclatural” to generate a downloaded set of 12,227 sequences.

Illumina sequencing

The 16S rRNA sequencing library was constructed according to the Illumina 16S Metagenomic Sequencing Library Preparation protocol (Illumina) targeting the V3 and V4 hypervariable regions of the 16S rRNA genes using primers 341F (5′-CCTACGGGNGGCWGCAG-3′) and 805R (5′-GACTACHVGGGTATCTAATCC-3′)¹⁶. MightyAmp DNA Polymerase v. 2 (Takara Bio Inc.) was used for the PCRs. The initial PCR was performed using region-specific primers to ensure compatibility with the Illumina index and sequencing multiplex adapters. The amplified fragments were separated on 2% agarose gels, stained with Safelook Load-Green (Wako), and visualized on the FAS Nano Gel Document System (Nippon Genetics). The amount of purified DNA recovered was quantified using a Spectro/Fluorometer (DS-11FX+, DeNovix). An equimolar mixture of all PCR products was sent to a commercial company for 2 × 300 bp paired-end sequencing on the MiSeq platform using Illumina MiSeq v3 Reagent Kit (Fasmac, Kanagawa, Japan).

Illumina sequencing data analysis

Illumina 16S rRNA amplicon sequence data were demultiplexed, and index sequences were removed using MiSeq Control Software (MCS) v2.6. Paired forward and reverse sequences were merged using ‘make.contings’ with the default parameter of mothur³² (v. 1.39.5). The merged sequence reads were assigned taxonomy using BWA-MEM³³ against RDP³⁴, using the same database and parameters without ‘ont2d’ option as our nanopore sequence data.

Data analysis

All data analysis was carried out with R (v. 3.3.1)⁵⁵. Bacterial community dissimilarities for the different PCR conditions were calculated by the Bray–Curtis index with the ‘vegan’ package (v. 2.5-5)³⁵. Initially, MiSeq reads were randomly sampled to eliminate read number differences when comparing of unassigned percentages in the MiSeq, MinION T0 and MinION T4 runs. After normalization and confirming there were no significant differences between the number of reads (p value = 0.6758 > 0.05), the unassigned ratio was compared using Tukey’s honest significant difference test, and the variances in the data from the three groups were found not to be equal (F value < 0.01).

Data availability

The mock microbial community DNA standards can be obtained from ZymoBIOMICS (catalog # D6305) and ATCC (catalog # MSA-1000 and MSA-1002). All the DNA sequences generated in the present study have been deposited in the DNA Data Bank of Japan (DDBJ) under the BioProject number PRJDB9684.

References

Turnbaugh, P. et al. The human microbiome project. Nature 449, 804–810 (2007).
ADS CAS PubMed PubMed Central Google Scholar
Yatsunenko, T. et al. Human gut microbiome viewed across age and geography. Nature 486, 222–227 (2012).
ADS CAS PubMed PubMed Central Google Scholar
Sivasubramaniam, D. & Franks, A. Bioengineering microbial communities: their potential to help, hinder and disgust. Bioengineered 7, 137–144 (2016).
CAS PubMed PubMed Central Google Scholar
Dunbar, J. et al. Levels of bacterial community diversity in four arid soils compared by cultivation and 16S rRNA gene cloning. Appl. Environ. Microbiol. 65, 1662–1669 (1999).
CAS PubMed PubMed Central Google Scholar
Cottrell, M. T. & Kirchman, D. L. Community composition of marine bacterioplankton determined by 16S rRNA gene clone libraries and fluorescence in situ hybridization. Appl. Environ. Microbiol. 66, 5116–5122 (2000).
CAS PubMed PubMed Central Google Scholar
Clement, B. G., Kehl, L. E., DeBord, K. L. & Kitts, C. L. Terminal restriction fragment patterns (TRFPs), a rapid, PCR-based method for the comparison of complex bacterial communities. J. Microbiol. Methods 31, 135–142 (1998).
CAS Google Scholar
Brunk, C. F., Avaniss-Aghajani, E. & Brunk, C. A. A computer analysis of primer and probe hybridization potential with bacterial small-subunit rRNA sequences. Appl. Environ. Microbiol. 62, 872–879 (1996).
CAS PubMed PubMed Central Google Scholar
Ferris, M. J., Muyzer, G. & Ward, D. M. Denaturing gradient gel electrophoresis profiles of 16S rRNA-defined populations inhabiting a hot spring microbial mat community. Appl. Environ. Microbiol. 62, 340–346 (1996).
CAS PubMed PubMed Central Google Scholar
Muyzer, G., de Waal, E. C. & Uitterlinden, A. G. Profiling of complex microbial populations by denaturing gradient gel electrophoresis analysis of polymerase chain reaction-amplified genes coding for 16S rRNA. Appl. Environ. Microbiol. 59, 695–700 (1993).
CAS PubMed PubMed Central Google Scholar
Liu, L. et al. Comparison of next-generation sequencing systems. J. Biomed. Biotechnol. https://doi.org/10.1155/2012/251364 (2012).
Article PubMed PubMed Central Google Scholar
Johnson, J. S. et al. Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nat. Commun. 10, 5029 (2019).
ADS PubMed PubMed Central Google Scholar
Quail, M. A. et al. A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers. BMC Genom. 13, 341 (2012).
CAS Google Scholar
Cai, L. et al. Biased diversity metrics revealed by bacterial 16S pyrotags derived from different primer sets. PLoS ONE 8, e53649 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Guo, F., Ju, F., Cai, L. & Zhang, T. Taxonomic precision of different hypervariable regions of 16S rRNA gene and annotation methods for functional bacterial groups in biological wastewater treatment. PLoS ONE 8, e76185 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Wang, F. et al. Assessment of 16S rRNA gene primers for studying bacterial community structure and function of aging flue-cured tobaccos. AMB Express 8, 182–182 (2018).
PubMed PubMed Central Google Scholar
Klindworth, A. et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 41, e1 (2013).
CAS PubMed Google Scholar
Wommack, K. E., Bhavsar, J. & Ravel, J. Metagenomics: read length matters. Appl. Environ. Microbiol. 74, 1453–1463 (2008).
CAS PubMed PubMed Central Google Scholar
Earl, J. P. et al. Species-level bacterial community profiling of the healthy sinonasal microbiome using pacific biosciences sequencing of full-length 16S rRNA genes. Microbiome 6, 190 (2018).
PubMed PubMed Central Google Scholar
Whon, T. W. et al. The effects of sequencing platforms on phylogenetic resolution in 16 S rRNA gene profiling of human feces. Sci. Data 5, 180068 (2018).
CAS PubMed PubMed Central Google Scholar
Schmidt, K. et al. Identification of bacterial pathogens and antimicrobial resistance directly from clinical urines by nanopore-based metagenomic sequencing. J. Antimicrob. Chemother. 72, 104–114 (2016).
PubMed Google Scholar
Pomerantz, A. et al. Real-time DNA barcoding in a rainforest using nanopore sequencing: opportunities for rapid biodiversity assessments and local capacity building. Gigascience. https://doi.org/10.1093/gigascience/giy033 (2018).
Article PubMed PubMed Central Google Scholar
Benítez-Páez, A., Portune, K. J. & Sanz, Y. Species-level resolution of 16S rRNA gene amplicons sequenced through the MinION^TM portable nanopore sequencer. Gigascience https://doi.org/10.1186/s13742-016-0111-z (2016).
Article PubMed PubMed Central Google Scholar
Mitsuhashi, S. et al. A portable system for rapid bacterial composition analysis using a nanopore-based sequencer and laptop computer. Sci. Rep. 7, 5657 (2017).
ADS PubMed PubMed Central Google Scholar
Srivathsan, A. et al. Rapid, large-scale species discovery in hyperdiverse taxa using 1D MinION sequencing. BMC Biol. 17, 96 (2019).
CAS PubMed PubMed Central Google Scholar
Smyth, R. P. et al. Reducing chimera formation during PCR amplification to ensure accurate genotyping. Gene 469, 45–51 (2010).
CAS PubMed Google Scholar
Gołębiewski, M. & Tretyn, A. Generating amplicon reads for microbial community assessment with next-generation sequencing. J. Appl. Microbiol. 128, 330–354 (2020).
PubMed Google Scholar
Hongoh, Y., Yuzawa, H., Ohkuma, M. & Kudo, T. Evaluation of primers and PCR conditions for the analysis of 16S rRNA genes from a natural environment. FEMS Microbiol. Lett. 221, 299–304 (2003).
CAS PubMed Google Scholar
D’Amore, R. et al. A comprehensive benchmarking study of protocols and sequencing platforms for 16S rRNA community profiling. BMC Genom. 17, 55 (2016).
Google Scholar
Wu, J. Y. et al. Effects of polymerase, template dilution and cycle number on PCR based 16 S rRNA diversity analysis using the deep sequencing method. BMC Microbiol. 10, 255 (2010).
PubMed PubMed Central Google Scholar
Aird, D. et al. Analyzing and minimizing PCR amplification bias in illumina sequencing libraries. Genome Biol. 12, R18 (2011).
CAS PubMed PubMed Central Google Scholar
Wang, Q., Garrity, G. M., Tiedje, J. M. & Cole, J. R. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 73, 5261–5267 (2007).
CAS PubMed PubMed Central Google Scholar
Schloss, P. D. et al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl. Environ. Microbiol. 75, 7537–7541 (2009).
CAS PubMed PubMed Central Google Scholar
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv preprint, arXiv:1303.3997 (2013).
Cole, J. R. et al. Ribosomal database project: data and tools for high throughput rRNA analysis. Nucleic Acids Res. 42, D633–D642 (2014).
CAS PubMed Google Scholar
Dixon, P. VEGAN, a package of R functions for community ecology. J. Veg. Sci. 14, 927–930 (2003).
Google Scholar
Hill, M. O. Diversity and evenness: a unifying notation and its consequences. Ecology 54, 427–432 (1973).
Google Scholar
Nicholls, S. M., Quick, J. C., Tang, S. & Loman, N. J. Ultra-deep, long-read nanopore sequencing of MOCK microbial community standards. Gigascience https://doi.org/10.1093/gigascience/giz043 (2019).
Article PubMed PubMed Central Google Scholar
Kai, S. et al. Rapid bacterial identification by direct PCR amplification of 16S rRNA genes using the MinION^TM nanopore sequencer. FEBS Open Bio. 9, 548–557 (2019).
CAS PubMed PubMed Central Google Scholar
Suzuki, M. T. & Giovannoni, S. J. Bias caused by template annealing in the amplification of mixtures of 16S rRNA genes by PCR. Appl. Environ. Microbiol. 62, 625–630 (1996).
CAS PubMed PubMed Central Google Scholar
Gohl, D. M. et al. Systematic improvement of amplicon marker gene methods for increased accuracy in microbiome studies. Nat. Biotechnol. 34, 942–949 (2016).
CAS PubMed Google Scholar
Penna, V. T., Martins, S. A. & Mazzola, P. G. Identification of bacteria in drinking and purified water during the monitoring of a typical water purification system. BMC Public Health 2, 13 (2002).
PubMed PubMed Central Google Scholar
Burrows, S. M. et al. Bacteria in the global atmosphere: part 2—modeling of emissions and transport between different ecosystems. Atmos. Chem. Phys. 9, 9281–9297 (2009).
ADS CAS Google Scholar
Nakagawa, S. et al. Rapid sequencing-based diagnosis of infectious bacterial species from meningitis patients in Zambia. Clin. Transl. Immunol. 8, e01087 (2019).
Google Scholar
Good, I. J. The population frequencies of species and the estimation of population parameters. Biometrika 40, 237–264 (1953).
MathSciNet MATH Google Scholar
Lemos, L. N., Fulthorpe, R. R., Triplett, E. W. & Roesch, L. F. Rethinking microbial diversity analysis in the high throughput sequencing era. J. Microbiol. Methods 86, 42–51 (2011).
CAS PubMed Google Scholar
Walker, A. W. et al. 16S rRNA gene-based profiling of the human infant gut microbiota is strongly influenced by sample processing and PCR primer choice. Microbiome 3, 26 (2015).
PubMed PubMed Central Google Scholar
Hu, L. et al. Assessment of Bifidobacterium species using groEL gene on the basis of Illumina MiSeq high-throughput sequencing. Genes 8, 336 (2017).
PubMed Central Google Scholar
Nygaard, A. B., Tunsjø, H. S., Meisal, R. & Charnock, C. A preliminary study on the potential of nanopore MinION and illumina MiSeq 16S rRNA gene sequencing to characterize building-dust microbiomes. Sci. Rep. 10, 3209 (2020).
ADS CAS PubMed PubMed Central Google Scholar
Ichijo, T. et al. Distribution and respiratory activity of mycobacteria in household water system of healthy volunteers in Japan. PLoS ONE 9, e110554 (2014).
ADS PubMed PubMed Central Google Scholar
Arai, S. et al. Assessment of pig saliva as a Streptococcus suis reservoir and potential source of infection on farms by use of a novel quantitative polymerase chain reaction assay. Am. J. Vet. Res. 79, 941–948 (2018).
CAS PubMed Google Scholar
Lu, Q., Hu, H., Mo, J. & Shu, L. Enhanced amplification of bacterial and fungal DNA using a new type of DNA polymerase. Aust. Plant Pathol. 41, 661–663 (2012).
CAS Google Scholar
Andrews, S. FastQC: a quality control tool for high throughput sequence data. Available online at: https://www.bioinformatics.babraham.ac.uk/projects/fastqc.
Shen, W., Le, S., Li, Y. & Hu, F. SeqKit: a cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLoS ONE 11, e0163962 (2016).
PubMed PubMed Central Google Scholar
Jain, M. et al. Improved data analysis for the MinION nanopore sequencer. Nat. Methods 12, 351–356 (2015).
CAS PubMed PubMed Central Google Scholar
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org (2018).

Download references

Acknowledgements

This work was supported by the Kyoto University Research Funds for Young Scientists to SF, the Japan Society for the Promotion of Science under Grants-in-Aid for Scientific Research (KAKENHI) (grant numbers 16H01782/18K19674/18KK0436, awarded to F.M.) Grant-in-Aid for Young Scientists (KAKENHI) (grant number 17K17885, awarded to A.M.) and the Japan Agency for Medical Research and Development (project number JP20fk0108129h0501, awarded to F.M.). This work was done as a part of the research activity of NAIST Data Science Center. We also thank Sandra Cheesman, PhD, from Edanz Group (www.edanzediting.com/ac) for editing a draft of this manuscript.

Author information

Authors and Affiliations

Office of Industry-Academia-Government and Community Collaboration, Hiroshima University, 1-3-2 Kagamiyama, Higashi-Hiroshima, Hiroshima, 739-8511, Japan
So Fujiyoshi & Fumito Maruyama
Laboratorio de Ecología Microbiana Aplicada, Departamento de Ciencias Químicas y Recursos Naturales, Scientific and Biotechnological Bioresource Nucleus (BIOREN-UFRO), Universidad de La Frontera, Ave. Francisco Salazar 01145, Temuco, Chile
So Fujiyoshi & Fumito Maruyama
Data Science Center, Nara Institute of Science and Technology, Ikoma, Nara, Japan
Ai Muto-Fujita

Authors

So Fujiyoshi
View author publications
You can also search for this author in PubMed Google Scholar
Ai Muto-Fujita
View author publications
You can also search for this author in PubMed Google Scholar
Fumito Maruyama
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.F. and F.M. conceived and designed the experiments. S.F. performed the experiments. S.F. and A.F. analyzed and interpreted the data. S.F. wrote the paper. All the authors, S.F., A.F. and F.M. reviewed drafts of the manuscript.

Corresponding authors

Correspondence to So Fujiyoshi or Fumito Maruyama.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This artiãcle is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fujiyoshi, S., Muto-Fujita, A. & Maruyama, F. Evaluation of PCR conditions for characterizing bacterial communities with full-length 16S rRNA genes using a portable nanopore sequencer. Sci Rep 10, 12580 (2020). https://doi.org/10.1038/s41598-020-69450-9

Download citation

Received: 16 April 2020
Accepted: 29 June 2020
Published: 28 July 2020
DOI: https://doi.org/10.1038/s41598-020-69450-9
Springer Nature Limited

Associated content

Top 100 in Microbiology

Collection 31 March 2021

Evaluation of PCR conditions for characterizing bacterial communities with full-length 16S rRNA genes using a portable nanopore sequencer

Abstract

Similar content being viewed by others

Species-level resolution of 16S rRNA gene amplicons sequenced through the MinION™ portable nanopore sequencer

Direct 16S rRNA-seq from bacterial communities: a PCR-independent approach to simultaneously assess microbial diversity and functional activity potential of each taxon

Using custom-built primers and nanopore sequencing to evaluate CO-utilizer bacterial and archaeal populations linked to bioH2 production

Introduction