Quality-filtering vastly improves diversity estimates from Illumina amplicon sequencing

  • Brief Communication

From Nature Methods

Abstract

High-throughput sequencing has revolutionized microbial ecology, but read quality remains a considerable barrier to accurate taxonomy assignment and α-diversity assessment for microbial communities. We demonstrate that high-quality read length and abundance are the primary factors differentiating correct from erroneous reads produced by Illumina GAIIx, HiSeq and MiSeq instruments. We present guidelines for user-defined quality-filtering strategies, enabling efficient extraction of high-quality data and facilitating interpretation of Illumina sequencing results.
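
The two factors named above, high-quality read length and abundance, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the parameter names `q`, `r`, `p` and `c`, the default thresholds and the Phred+33 quality encoding are all assumptions made for illustration. The sketch truncates each read before its first long run of low-quality base calls, discards reads that lose too much of their length, and then drops sequence clusters whose relative abundance is very low.

```python
PHRED_OFFSET = 33  # assumption: Phred+33 ASCII quality encoding


def phred_scores(quality_string, offset=PHRED_OFFSET):
    """Decode an ASCII quality string into integer Phred scores."""
    return [ord(ch) - offset for ch in quality_string]


def truncate_read(seq, qual, q=3, r=3):
    """Cut a read before the first run of more than r consecutive
    base calls whose Phred score is <= q (hypothetical parameters)."""
    run_start, run_len = 0, 0
    for i, score in enumerate(phred_scores(qual)):
        if score <= q:
            if run_len == 0:
                run_start = i  # remember where the low-quality run began
            run_len += 1
            if run_len > r:
                return seq[:run_start]
        else:
            run_len = 0  # a good base call resets the run
    return seq


def quality_filter(reads, q=3, r=3, p=0.75):
    """Yield truncated reads that keep at least fraction p of their
    original length; reads is an iterable of (sequence, quality) pairs."""
    for seq, qual in reads:
        kept = truncate_read(seq, qual, q=q, r=r)
        if seq and len(kept) / len(seq) >= p:
            yield kept


def abundance_filter(counts, c=0.00005):
    """Keep clusters whose share of all reads is at least c.
    counts maps a cluster identifier to its read count."""
    total = sum(counts.values())
    return {name: n for name, n in counts.items() if n / total >= c}
```

With these placeholder thresholds, a read whose last quarter is low quality is kept in truncated form, while a read that degrades after its first two bases is discarded. The values shown are not recommendations; suitable settings depend on the instrument and data set.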

Figure 1
Figure 2: The α and β diversity comparisons of mock community reads filtered using select phred_quality_score (q) settings (data set 1).

Acknowledgements

We thank G. Giannoukos (Broad Institute of MIT and Harvard), I. Rasolonjatovo (Illumina), M. Gebert (University of Colorado, Boulder) and L. Wegener Parfrey (University of Colorado, Boulder) for contributing mock community sequencing data used in this study, and S. Huse and A. Gonzalez for useful feedback and discussions of this manuscript. This work was supported in part by grants from the US National Institutes of Health (NIH DK78669 to J.I.G., NIH R01HD059127 to D.A.M. and NIH U54HG004969 to D.G.), the Juvenile Diabetes Research Fund (D.G.), the Crohn's and Colitis Foundation of America (J.I.G. and D.G.), and the Howard Hughes Medical Institute. N.A.B. was supported by the 2012–2013 Dannon Probiotics Fellow Program (The Dannon Company) and a Wine Spectator scholarship.

Author information

Contributions

N.A.B., J.G.C., D.A.M. and R.K. conceived and designed the experiments; N.A.B. performed the experiments and data analysis. All authors contributed sequencing data sets and wrote the manuscript.

Corresponding author

Correspondence to J Gregory Caporaso.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–16, Supplementary Tables 1–9, Supplementary Note (PDF 21952 kb)

About this article

Cite this article

Bokulich, N., Subramanian, S., Faith, J. et al. Quality-filtering vastly improves diversity estimates from Illumina amplicon sequencing. Nat Methods 10, 57–59 (2013). https://doi.org/10.1038/nmeth.2276

  • DOI: https://doi.org/10.1038/nmeth.2276

  • Springer Nature America, Inc.
