Abstract
Microbial samples taken from an environment often represent mixtures of communities, where each community is composed of overlapping assemblages of species. Such data represent a serious analytical challenge, as the community structures will be present as complex mixtures, there will be very large numbers of component species, and the species abundance will often be sparse over samples. The structure and complexity of these samples will vary according to both biotic and abiotic factors, and classical methods of data analysis will have a limited value in this setting. A novel Bayesian modeling framework, called BioMiCo, was developed to meet this challenge. BioMiCo takes abundance data derived from environmental DNA, and models each sample by a two-level mixture, where environmental OTUs contribute community structures, and those structures are related to the known biotic and abiotic features of each sample. The model is constrained by Dirichlet priors, which induces compact structures, minimizes variance, and maximizes model interpretability. BioMiCo is trained on a portion of the data, and once trained a BioMiCo model can be employed to make predictions about the features of new samples. This chapter provides a set of protocols that illustrate the application of BioMiCo to real inference problems. Each protocol is designed around the analysis of a real dataset, which was carefully chosen to illustrate specific aspects of real data analysis. With these protocols, users of BioMiCo will be able to undertake basic research into the properties of complex microbial systems, as well as develop predictive models for applied microbiomics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Shade A, Caporaso JG, Handelsman J et al (2013) A meta-analysis of changes in bacterial and archaeal communities with time. ISME J 7(8):1493–1506
Boon E, Meehan CJ, Whidden C et al (2014) Interactions in the microbiome: communities of organisms and communities of genes. FEMS Microbiol Rev 38(1):90–118
El-Swais H, Dunn KA, Bielawski JP et al (2015) Seasonal assemblages and short-lived blooms in coastal north-west Atlantic Ocean bacterioplankton. Environ Microbiol 17(10):3642–3661
Dunn KA, Moore-Connors J, MacIntyre B et al (2016) Early changes in microbial community structure are associated with sustained remission after nutritional treatment of pediatric Crohn’s disease. Inflamm Bowel Dis 22(12):2853–2862
Dunn KA, Moore-Connors J, MacIntyre B et al (2016) The gut microbiome of pediatric Crohn’s disease patients differs from healthy controls in genes that can influence the balance between a healthy and dysregulated immune response. Inflamm Bowel Dis 22(11):2607–2618
Tromas N, Fortin N, Bedrani L et al (2017) Characterising and predicting cyanobacterial blooms in an 8-year amplicon sequencing time course. ISME J 11(8):1746–1763
Leibold MA, Holyoak M, Mouquet N et al (2004) The metacommunity concept: a framework for multi-scale community ecology. Ecol Lett 7(7):601–613
Burke C, Steinberg P, Rusch D et al (2011) Bacterial community assembly based on functional genes rather than species. Proc Natl Acad Sci U S A 108(34):14288–14293
Friedman J, Hastie T, Tibshirani R (2001) The elements of statistical learning. Data mining, inference, and prediction. In: Springer series in statistics, New York
Knights D, Costello EK, Knight R (2011) Supervised classification of human microbiota. FEMS Microbiol Rev 35(2):343–359
Weiss S, Xu ZZ, Peddada S et al (2017) Normalization and microbial differential abundance strategies depend upon data characteristics. Microbiome 5(1):27
Shafiei M, Dunn KA, Boon E et al (2015) BioMiCo: a supervised Bayesian model for inference of microbial community structure. Microbiome 3(1):8
Zarraonaindia I, Smith DP, Gilbert JA (2013) Beyond the genome: community-level analysis of the microbial world. Biol Philos 28(2):261
Moore-Connors JM, Dunn KA, Bielawski JP et al (2016) Novel strategies for applied metagenomics. Inflamm Bowel Dis 22(3):709–718
Gajer P, Brotman RM, Bai G et al (2012) Temporal dynamics of the human vaginal microbiota. Sci Transl Med 4(132):132ra52
Nugent RP, Krohn MA, Hillier SL (1991) Reliability of diagnosing bacterial vaginosis is improved by a standardized method of gram stain interpretation. J Clin Microbiol 29(2):297–301
Bashwih RO (2016) Inference and investigation of marine microbial community structures in the global oceans (Masters Thesis). Dalhousie University Thesis database (http://hdl.handle.net/10222/72144)
Shafiei M, Dunn KA, Chipman H et al (2014) BiomeNet: A Bayesian model for inference of metabolic divergence among microbial communities. PLoS Comput Biol 10(11):e1003918
Tang C (2016) Statistical approaches for matching the components of complex microbial communities (Masters Thesis). Dalhousie University Thesis database (http://hdl.handle.net/10222/71708)
Acknowledgments
This work was supported by NSERC Discovery Grant (DG3645-2015) and a Schulich Joint Research Project (JRP 48677) to JPB. We thank Joseph R. Mingrone for helpful discussions, and for direct assistance with the computational resources. We thank Noor Youssef, Christopher Jones and Hong Gu for helpful discussions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Dunn, K.A., Andrews, K., Bashwih, R.O., Bielawski, J.P. (2018). Bayesian Inference of Microbial Community Structure from Metagenomic Data Using BioMiCo. In: Beiko, R., Hsiao, W., Parkinson, J. (eds) Microbiome Analysis. Methods in Molecular Biology, vol 1849. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8728-3_17
Download citation
DOI: https://doi.org/10.1007/978-1-4939-8728-3_17
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-8726-9
Online ISBN: 978-1-4939-8728-3
eBook Packages: Springer Protocols