From molecules to dynamic biological communities

McDonald, Daniel; Vázquez-Baeza, Yoshiki; Walters, William A.; Caporaso, J. Gregory; Knight, Rob

doi:10.1007/s10539-013-9364-4

From molecules to dynamic biological communities

Open access
Published: 05 February 2013

Volume 28, pages 241–259, (2013)
Cite this article

Download PDF

You have full access to this open access article

Biology & Philosophy Aims and scope Submit manuscript

From molecules to dynamic biological communities

Download PDF

Daniel McDonald^1,2,
Yoshiki Vázquez-Baeza³,
William A. Walters⁴,
J. Gregory Caporaso^5,6 &
…
Rob Knight^1,2,3,7

3238 Accesses
12 Citations
1 Altmetric
Explore all metrics

Abstract

Microbial ecology is flourishing, and in the process, is making contributions to how the ecology and biology of large organisms is understood. Ongoing advances in sequencing technology and computational methods have enabled the collection and analysis of vast amounts of molecular data from diverse biological communities. While early studies focused on cataloguing microbial biodiversity in environments ranging from simple marine ecosystems to complex soil ecologies, more recent research is concerned with community functions and their dynamics over time. Models and concepts from traditional ecology have been used to generate new insight into microbial communities, and novel system-level models developed to explain and predict microbial interactions. The process of moving from molecular inventories to functional understanding is complex and challenging, and never more so than when many thousands of dynamic interactions are the phenomena of interest. We outline the process of how epistemic transitions are made from producing catalogues of molecules to achieving functional and predictive insight, and show how those insights not only revolutionize what is known about biological systems but also about how to do biology itself. Examples will be drawn primarily from analyses of different human microbiota, which are the microbial consortia found in and on areas of the human body, and their associated microbiomes (the genes of those communities). Molecular knowledge of these microbiomes is transforming microbiological knowledge, as well as broader aspects of human biology, health and disease.

Introduction to Genetic, Genomic, and System Analyses for Communities

Bioinformatics for Microbiome Research: Concepts, Strategies, and Advances

Best practices for analysing microbiomes

Article 23 May 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction: the revolution in DNA sequencing provides new insight into a range of microbial phenomena

Microbial ecology used to be a small and specialized field that struggled to identify more than a tiny proportion of the Earth’s microbial biodiversity. Part of the problem was due to the prevalence of pure-culture methods, in which microorganisms had to be removed from their natural environments (which included communities of other organisms) and cultured in laboratories. Recent advances in molecular techniques, sequencing technologies and computational methods have enabled researchers to explore the microbial world at unprecedented levels, with a focus on the natural habitats of microorganisms. The combination of these advances has so far produced remarkable insight into the role of microorganisms in human health and their powerful effects on the natural world, while at the same time developing novel evidence about the evolution and diversification of life on Earth. In this article, we discuss how these advances have allowed researchers to create new lines of inquiry, we summarize important biological and philosophical results from recent publications, and we discuss how our improved understanding of microbial ecology may affect our lives in the coming years.

The last decade has seen a transformation and democratization of DNA sequencing (Shendure and Ji 2008). High-throughput sequencing, of the type necessary to characterize the complex microbial communities that inhabit our bodies, used to be the exclusive province of a few large sequencing centers: only research groups with access to substantial resources could engage in sequencing projects. Now, a benchtop machine that fits in an individual investigator’s laboratory can produce billions of 100-nucleotide sequences per month. For comparison, a bacterial genome from the gut is typically about three million nucleotides and the human genome is about three billion nucleotides. However, the number of bacterial genomes that inhabit a human implies that they contribute far more genes than does our human genome (Turnbaugh et al. 2007). Playing music from a digital file once required a high-end workstation but can now be performed on a handheld device because transistors can now be packed more densely onto a microchip. In the same way, characterizing the types (e.g., the strains, species or phyla) of microbes present in a given sample (the microbiota) or the genes present in these microbes (the microbiome) are problems that can be addressed with a fixed amount of sequencing that is rapidly becoming cheaper and more accessible.

These transformations in sequencing technology have correspondingly changed what it means to undertake a sequencing project. When sequences were very expensive (in the late 1970s and early 1980s), it was a substantial accomplishment to sequence even one gene from one species. Correspondingly, the focus was on identifying genes that acted as the best phylogenetic markers. These were short fragments of sequences from which inferences about the patterns of evolution were likely to match the inferred patterns of evolution of the corresponding species. These markers therefore provided efficient readouts of evolutionary history while minimizing sequencing costs. For example, ribosomal RNA genes, which play essential structural and catalytic roles in the ribosome and are thought to be almost exclusively vertically transmitted (Lawrence 1999; Amann et al. 1995), have been especially useful for reconstructing phylogenetic trees, including phylogenetic trees of organisms that have not been isolated in pure culture (Pace 1997). Initial studies focused on the 5S rRNA gene (Woese and Fox 1977), although expansion to longer rRNA genes, notably the small subunit rRNA, has allowed substantially greater phylogenetic resolution (Lane et al. 1985; Winker and Woese 1991). Here we describe several conceptual changes deeper sequencing has led to already, and will refine in the future.

From catalogs to robust, reproducible community patterns

The initial focus on cataloging the rRNA genes in individual species allowed phylogenies of known taxonomic groups to be reconstructed. This work provided the framework for our initial understanding that life on Earth falls into at least three distinct lineages: the Archaea, the Bacteria, and the Eukarya (initially described as the archaebacteria, the eubacteria, and the urkaryotes, respectively) (Woese and Fox 1977). These findings, which focused on sequencing DNA from known species, were soon complemented by a radical idea: that these phylogenetic marker genes could be isolated from unknown species via bulk DNA extraction directly from the environment. This technique, pioneered by the Pace lab (Pace et al. 1986), allowed researchers to start building catalogs of the known and unknown organisms, the DNA of which was present in any given environment. As the cost of sequencing DNA declined, the focus on sequencing single marker genes such as the 16S rRNA gene expanded to include shotgun metagenomic surveys, in which total DNA extracted from a sample is fragmented and sequenced. Both approaches are widely employed today. Marker-gene surveys are used to investigate the microbiota of a sample, and metagenomic surveys are used to investigate both the microbiota and the microbiome of a sample. These two views of microbial communities can yield different findings, because functional genes are frequently transferred horizontally (i.e., between different lineages). In contrast, rRNA genes are almost always transferred vertically. However, several recent studies have shown similar patterns emerging from studies involving both types of data (Turnbaugh et al. 2009a; Fierer et al. 2012b; Harris et al. 2013).

The 26 years of sequencing since Pace’s first community sequencing efforts have revealed a picture of 85+ phyla within the bacteria alone, and in some cases as many as 15 new candidate phyla have been detected in a single study (Ley et al. 2006; Harris et al. 2013). The bacterial and archaeal census has been estimated to reach as many as 10⁶–10⁹ species (Schloss and Handelsman 2004), when calculated using sequence similarity criteria. Robust patterns of microbial community composition have now been observed, in a wide range of host-associated and free-living contexts. For example, human body sites are highly distinct from one another and highly diverse among individuals (Costello et al. 2009; HMP-Consortium 2012). Although any two humans are >99 % identical in their genome composition (Venter et al. 2001), there are no species-level OTUs (operational taxonomic units) shared across the gut microbial communities of all humans (Yatsunenko et al. 2012). This lack of shared OTUs suggests that many of the phenotypic differences that we see between humans may arise from differences in our microbiota, rather than differences in our genomes. We suspect that this observation will drive many advances in medicine in the coming years. For example, lean and obese individuals differ systematically in their gut microbial communities (Ley et al. 2006; Turnbaugh et al. 2009a; Knights et al. 2011) but much less so in their genomic composition. Obesity can be identified 90 % of the time using the bacteria in the feces alone (Knights et al. 2011), but with only 58 % accuracy from variations in the genomes of different individuals (Sandholt et al. 2010). Similarly surprising insights have arisen in environmental microbiology. For example, pH has been found to be the main driver of microbial communities in soil (Lauber et al. 2009; Rousk et al. 2010; Chu et al. 2010; Fierer et al. 2012a), and salinity plays a crucial role in structuring both free-living bacterial and archaeal communities across many environments (Lozupone and Knight 2007; Caporaso et al. 2011b; Tamames et al. 2010; Auguet et al. 2010). These patterns can be striking: for example, seasonal patterns in marine water microbial diversity are highly reproducible in the Western English Channel (Gilbert et al. 2012), with the same organisms dominating microbial communities in different seasons annually. However, most of the organisms present in any given season are found even at just a single time-point if more sequences (millions rather than thousands) are collected from the sample (Caporaso et al. 2012). These results suggest that seasonal differences do not arise from the presence or absence of community members, but rather from variations in the abundance of organisms that are always present. This finding reinforces the point that much of what we think we know about the microbial world may be limited by the amount of sequencing that it is cost-effective to perform. The work to catalog Earth’s microbial diversity has thus produced a compendium of rich and detailed observations, and efforts such as the Earth Microbiome Project (Gilbert et al. 2010; Knight et al. 2012) will round out our encyclopedia of our microbial world. But cataloging alone is insufficient: a list of the species present in a rainforest, for example, speaks little to the interactions, functions or potential of the organisms so listed.

The problem with phylogenetic marker gene surveys, such as the 16S rRNA gene sequencing projects described above, is that they tell us the ‘who’, without the ‘how’, thus failing to answer the most pressing questions. For instance, how can an organism live at pH 0 (Edwards et al. 2000), and what can such capacities teach us about the potential for pollution mitigation or for life on other planets? Endeavors such as the Genomic Encyclopedia of Bacteria and Archaea (GEBA) (Wu et al. 2009) perform whole-genome sequencing on organisms that are as phylogenetically divergent as possible from previously sequenced organisms. Even a small amount of this phylogenetically targeted genome sequencing provides novel gene discovery that greatly outpaces gene discovery from organisms chosen arbitrarily or at random. Targeted sequencing can inform us about the reproducibility of the evolutionary process among organisms from different lineages that adapt to similar environments. For example, comparative genomics based on whole-genome data, and linked to rich evolutionary history and detailed environmental information (derived from marker gene databases and marker gene surveys, respectively), can offer insights into which types of biochemical or regulatory functions are necessary to survive in a given environment. These results enable an understanding of the systems biology of microbial communities, which can ultimately be applied to engineer microbial communities to treat disease, generate electricity, or clean up hazardous waste sites. However, marker gene surveys still improve our understanding of microbial ecology and enable novel findings and technological applications. We will focus on this technique for the remainder of the paper to show how this is the case.

How do we know which microbes are present?

A key problem with studies of the microbiome lies in determining which organisms are present. All stages of the process, including DNA extraction, amplification of specific target genes, clustering of sequences, and identification of taxonomic group are prone to both error and bias (Hamady and Knight 2009). As the number of sequences involved in a given study has grown, reliance on advanced computational methods has increased (Gonzalez and Knight 2012). However, the algorithm that is chosen can have large impacts both on beliefs about what organisms are present in a given environment (Liu et al. 2008) and how many kinds of organisms are present (Kunin et al. 2010; Quince et al. 2009). Even defining kinds of organisms is complicated at the microbial level. In lieu of a robust definition of a microbial species (Cohan 2002), the percentage of sequence identity of a marker gene is often used to define operational taxonomic units or OTUs. For example, most 16S rRNA gene-based studies treat a cluster of sequence fragment ‘reads’ (the output of a DNA sequencing instrument, and thus the typical observation in studies of microbial communities) that are >97 % identical to one another as members of the same OTU. 97 % identity is treated as a proxy for species-level groupings of organisms, although this definition is known to be problematic for several reasons. One is that the rate of evolution of the 16S rRNA gene differs among taxonomic lineages, so the same number of differences in the sequence may represent different times since divergence from a most recent common ancestor. The choice of algorithm for assigning sequences to OTUs can also have a large impact on which sequences are clustered into the same OTU and on how many OTUs are observed in a study. For example, it is not clear whether a 97 % sequence identity threshold means that each sequence added to an OTU must be 97 % similar to all other sequences in the OTU cluster, or whether each sequence should be 97 % similar to the sequence that defines the center of the cluster (i.e. the cluster centroid) (Schloss and Handelsman 2005; Schloss and Westcott 2011). Because neither laboratory nor computational protocols are standardized, reported differences among studies often stem from differences in methodologies rather than from differences in the underlying biology. And because techniques for performing meta-analyses of microbiome data are still only emerging, it is often difficult to standardize a reanalysis, and comparisons of results across studies and especially among laboratories must be performed with caution.

Modern marker-gene-based studies often investigate the composition of microbial communities at the OTU level, due to difficulties in relating counts of short DNA sequence fragments to named species. Although short reads of sequences (100–400 bases is currently typical, depending on sequencing platform) from the genomes of well-studied organisms can often be assigned at least to the family level, and sometimes at the genus or species level, many sequences cannot confidently be assigned to known named taxonomic groups. The limitation here is primarily the amount of information present in short reads of marker genes for differentiating closely related taxa. Figure One shows that when working with the most informative region of the 16S rRNA gene for broad analyses of bacterial and archaeal communities, the fraction of reads that can be assigned to taxonomic groups increases as expected with the length of the sequence. In real-world experiments (as opposed to the simulation presented in Fig. 1) this effect is exacerbated by PCR and sequencing biases and errors.

Our inability to assign detailed taxonomy to short reads is often not important for many of the questions that are interesting to address at the community level. Phylogenetic diversity calculations allow us to determine the relative similarity of microbial communities, using similarity of the fragment of the marker gene as a proxy for the relatedness of the organisms represented by those marker genes. Although in principle horizontal gene transfer, the movement of genes among different genomes, could obscure the phylogenetic pattern, in practice the difference in gene content between two organisms closely tracks the differences in marker genes such as the 16S rRNA gene (Zaneveld et al. 2010; Konstantinidis and Tiedje 2005). However, there are cases in which genomes with identical 16S rRNA genes have markedly different properties (e.g., Bacillus cereus, a harmless soil bacterium, and Bacillus anthracis, the causative agent of anthrax, are almost indistinguishable except for a plasmid that confers pathogenicity (Ivanova et al. 2003)). Additionally, our conclusions are limited by our depth of sequencing (i.e., the number of marker gene sequence reads collected from a sample). A study that collects 1,000 sequences per sample will miss species that are only present at an abundance of one cell in a million. These limitations to knowledge are widely appreciated by specialists, but are often omitted in popular accounts and in descriptions for non-specialists.

Is there a core human microbiome?

Our initial expectations of the microbial diversity living within and on human beings were limited and biased because relatively few microbes can be grown in culture (Rappé and Giovannoni 2003; Staley and Konopka 1985) and because many phylogenetically and functionally distinct kinds of microbes are difficult to distinguish by morphological or biochemical characteristics. For instance, Escherichia coli was believed to be a common and abundant gut microorganism inhabiting most members of the human population. However, culture-independent surveys based on 16S rRNA gene sequencing and/or shotgun metagenomic sequencing (in which all the DNA from a given community is extracted and analyzed) typically find it at less than 1 % abundance in the gut of healthy adults (Eckburg et al. 2005; Turnbaugh et al. 2009a; Costello et al. 2009; Qin et al. 2010). The scientific and medical community sought to determine the “core” microbiome of humans at the level of microbial species shared by everyone (Turnbaugh et al. 2007). Surprisingly, such a core does not seem to exist at the level of species; instead what appears to be shared are microbial functions (Turnbaugh et al. 2009a; Qin et al. 2010). One suggestion is that there might be a few types of common but only partially overlapping (or perhaps non-overlapping) microbial communities. One study found just three “enterotypes” or types of gut bacterial communities in human populations (Arumugam et al. 2011), although this simplistic picture appears not to be true when additional subjects and populations are considered (Wu et al. 2011; MacDonald et al. 2012; Jeffery et al. 2012; Claesson et al. 2012; Yatsunenko et al. 2012; HMP-Consortium 2012). However, the idea that human gut microbial communities might be classified into just a few types is conceptually appealing and has received much media attention (Brandon 2011; Yong 2012; Zimmer 2011), so debate on this topic is likely to continue. The microbial diversity revealed due to improvements in culture-independent techniques, in part due to the vast decrease in sequencing costs noted above, has been remarkable. There are no shared OTUs across the gut communities of all humans, even at a depth of coverage of one million sequences per sample (HMP-Consortium 2012). This unexpected finding has given rise to the idea of microbes as personal identification markers (Fierer et al. 2010). In addition, because monozygotic twins differ in their microbiota (Turnbaugh et al. 2009a; Yatsunenko et al. 2012), it could be argued that our microbiota are more personally unique than our own genomes.

In some sense, whether or not there is a core microbiome is a purely definitional issue. Finding a core depends on the level at which sequences are aggregated (grouping together more similar or more distantly related groups of organism, for example), the abundance threshold that may be set deliberately or may be intrinsically limited by technology or study design (for example, if only 1,000 sequences per sample are collected, organisms that are as rare as one in a million microbes will be missed), and the fraction of individuals that a taxon must appear into be considered “core” (for example, the MetaHIT consortium used a 50 % threshold (Qin et al. 2010)). Some kind of core can always be defined. A more productive research direction is to ask whether there are systematic differences among the microbial communities of every human that can be correlated with the physiological state of each individual.

Microbial community states associated with disease

Much attention has focused on testing whether differences in microbial diversity correlate with physiological states, especially disease states. For example, Ruth Ley, Peter Turnbaugh and colleagues in the laboratory of Jeffrey Gordon embarked on an exploration of changes in the microbiota associated with obesity in different mouse models. This seminal work revealed robust differences in the gut communities of these mice compared with lean mice, both in the case of genetically induced obesity in the ob/ob leptin model (Ley et al. 2005) and in diet-induced obesity (Turnbaugh et al. 2008). Remarkably, increased adiposity was transmissible to genetically normal mice on a standard, calorie-controlled diet by transferring these microbial communities from the obese mice to the normal mice (Turnbaugh et al. 2006, 2008). The major taxonomic difference between the mice microbiota was the relative abundance of the phyla Bacteroidetes and Firmicutes. This finding has been shown to hold for human hosts as well (Ley et al. 2006), although the same pattern has not been replicated in all human studies (Duncan et al. 2008; Schwiertz et al. 2010). As mentioned above, we can now predict—based on the microbial community composition alone—whether an individual is obese or lean at 90 % accuracy (Knights et al. 2011) while predictions based on host genomic markers perform little better than chance (Sandholt et al. 2010). Interestingly, these predictions work best when the microbes are classified into broad groups. Clustering the sequences into groups at the 80 % sequence identity level (corresponding approximately to bacterial phyla) actually works better than clustering the sequences into groups at the 97 % sequence identity level (corresponding approximately to bacterial species) for classifying people as lean or obese. These more detailed analyses at the species-proxy level do, however, provide better resolution when classifying multiple samples from the same site (Knights et al. 2011). A possible explanation for the improved predictability using phylum-level classification could be that differences in biochemical pathways are differentially represented across phyla but conserved across OTUs within phyla. These biochemical pathways are the primary features that differentiate obese from lean individuals. Models trained on data that are too specific (i.e., clustered at 97 % identity rather than a lower percent identity) are prone to overfitting, and have reduced predictive capacity. But it is important to bear in mind that the phylogenetic levels at which bacteria are associated with particular states may vary considerably, depending on the ecology of the particular phenotype or disease.

Recent large-scale endeavors, such as the Human Microbiome Project (NIH 2012), the American Gut (Human-Food-Project 2012) and the Personal Genome Project (Personal-Genome-Project 2012) are opening up new opportunities for analysis because they are building a base of healthy microbiomic data against which disease states (collected by some of these projects) can be contrasted. This is important because of the breadth of diseases associated with the microbiome. Disease states that have been found to be associated with features of the microbiome include inflammatory bowel disease (Frank et al. 2007; Michail et al. 2012), wasting diseases (Gordon et al. 2012), obesity (Kallus and Brandt 2012), halitosis (Kazor et al. 2003), dental caries (Yang et al. 2012), and perhaps even autism (Finegold et al. 2010). The gut microbiome appears to be causal for certain disease states, and is not just a biomarker. Causality can be inferred when, for example, fecal transplantation (and thus microbiota inoculation) in human subjects is used successfully to treat inflammatory bowel disease (IBD—primarily ulcerative colitis) (Landy et al. 2011) and insulin sensitivity associated with metabolic syndrome (Vrieze et al. 2012). These results indicate that gut microbes play an active role in these disease states and are not merely effects of the host’s condition. It is possible that in the not-to-distant future a microbiome sample will become a normal component of a health checkup. Microbiome analyses may be used to diagnose disease and could provide possible avenues for the prevention of disease through predictive tests. As we mentioned above, molecular samples from microbial communities may track or predict disease states better than does the human genome.

Changes in the microbiome over time

Microbial ecology shares similarities with traditional ecology, yet there are some important differences. In the ecology of macroorganisms, it is often possible to observe interactions directly, such as predation or competition for resources. Such observations are much more difficult in the microbial world, and ecological interactions must often be inferred from statistical variations in sequence data instead. Species definitions, although notoriously problematic even for macroorganisms, are even more difficult in microbes, and operational definitions based on similarities in DNA sequences must be used instead (as already discussed). Additionally, the cost of DNA sequencing posed a barrier until recently to collecting the detailed time-series and spatial datasets that are necessary for ecological modeling in microorganisms. However, some aspects of microbial ecology are substantially easier than in large-organism ecology. For example, the reliance on DNA sequence data means that with advances in technology, even a deep sampling of the population (millions of individuals) can be performed rapidly, and observation biases are likely to be less profound than when attempting to glimpse rare and elusive insects or mammals. The ability to collect large-scale information about microbial populations is likely to allow classical ecological models to be applied to the microbial world far more effectively than has been possible in macroecology, because more types of microbes can be simultaneously observed with large population sizes and with replicated sampling.

Ecological principles offer more than just ways to stratify the human population (e.g., by disease state). At infancy, our microbial populations go through remarkable changes in structure prior to reaching a resemblance to most adult communities. Inoculation is not necessarily from our mothers, and is substantially influenced by delivery mode. Microbial communities of children delivered vaginally initially tend to resemble their mother’s vaginal communities, while the microbial communities of children delivered by C-section initially tend to resemble human skin communities. Skin inoculations may be obtained from the mother, the medical staff involved in the delivery, or hospital surroundings (many of which harbor communities resembling human skin) (Biasucci et al. 2010; Dominguez-Bello et al. 2010). Stabilization of the microbiota of human children occurs around the third year of life (Yatsunenko et al. 2012), but routine disruptions, adjustments and fluctuations appear to be normal in healthy individuals (Costello et al. 2009; Caporaso et al. 2011a). While in general, the intra-individual microbiome variation is less than inter-individual, the amount of variability over long time periods (Caporaso et al. 2011a) gives rise to the idea of microbial “weather” in which microbial communities react to dietary and health conditions (even as they causally affect them). This phenomenon may be especially important in determining the health of the elderly (Claesson et al. 2012).

A revelatory aspect to studies of the microbiome is that classical ecological models and datasets previously only obtainable for a few economically important systems, such as fisheries, are now testable on the microbial scale because of the ability to assess simultaneously the relative abundance of thousands of species in thousands of samples (Gonzalez et al. 2011). However, this move towards accounts of microbial communities in terms of alternative stable states and dynamical systems (Costello et al. 2012; Lozupone et al. 2012; Gajer et al. 2012) is not entirely without peril. In the absence of theories of underlying causes, defining the number and boundaries of these states can be technique-dependent and implicitly theory-laden in ways difficult to identify—especially by investigators who are not specialists in the relevant mathematical techniques. With the availability of larger datasets and the ability to track communities over time, key ecological concepts such as resilience, alternative stable states, predator–prey cycling, and bottom-up versus top-down regulation of ecosystems will be increasingly important. However, it is equally important not to forget the lessons learned from past applications of these techniques, especially in traditional ecological modeling. For example, it has been known for almost four decades that Lotka-Volterra predator–prey dynamics with time lags produce patterns that would appear as completely uncorrelated between two species that in fact do interact deterministically (Fig. 2) (Holling 1973). However, this fact is routinely ignored in network analyses that seek to find connections among organisms by building a network in which nodes correspond to organisms, and edges correspond to pairs of organisms that are correlated. Correlation is usually assessed by determining whether the abundances of two taxa are correlated across a set of samples, typically using the Pearson correlation coefficient that assumes that all interactions are linear. In other words, taxa are linked if their correlation coefficient exceeds an arbitrary researcher-defined threshold. These networks are often used to find groups of organisms that “co-occur”, presumably because of shared environmental preferences or because of mutualistic ecological interactions. Hence these network methods, which often rely on linear correlations among organisms to detect relationships (Qin et al. 2010; Steele et al. 2011; Barberán et al. 2012), would incorrectly assert organisms to lack ecological connections even when these connections are fully deterministic. This happens simply because the inference procedure requires an understanding of the time-evolution of the system in order to find these causal links.

The analysis of time-series in microbial ecology has also been limited because the performance of standard signal processing methods is degraded with uneven sampling periods and small numbers of data points (Moller-Levet et al. 2003; Mason 1978; Mallat 1989). Such degradations have historically been common in microbial ecology datasets due to the cost of obtaining the data. However, we have already obtained valuable information about the temporal dynamics of a few microbial communities, such as the assembly of an infant’s gut microbiome and its transition towards a healthy human adult gut microbiome (Koenig et al. 2011). In the few cases in which even sampling has been performed or can be assumed, techniques exist to detect abrupt disruptions (Beltran et al. 1994; Mallat and Zhong 1992). In these contexts, such disruptions could mean one of the interventions that has been shown to have large effects in mice or humans such as diet change (Turnbaugh et al. 2009b) or antibiotic administration (Dethlefsen et al. 2008; Dethlefsen and Relman 2011). Therefore, as in disease surveillance, choosing a specific analytical approach (for example co-occurrence analysis, clustering analysis, and control systems analysis) depends to a large extent on whether the goal is to monitor a trend, detect an outbreak or provide general awareness of the possibility of change (Robertson and Nelson 2010).

Conclusions and outlook

Overall, the ability to collect far larger amounts of sequence data has led to much broader and deeper characterizations of the human microbiome and microbial communities in other habitats, especially when linked to rich contextual information about the provenance and status of each sample (Knight et al. 2012). In particular, the increased use of time-series studies (enabled by the decline in the cost of sequencing) allows us to apply for the first time a wide range of ecological models to the microbial world. Perturbation experiments are especially important for understanding how microbial communities change and for understanding groups of species that change together and interact in complex ways. However, this expanded body of ecological data introduces substantial epistemic issues, especially in regard to how data are interpreted via models and concepts. For example, the definition of OTUs at both the organism and the gene level (e.g. in the construction of “gene catalogs” (Qin et al. 2010)) is in many respects a return to phenetic methods, which have been criticized due to their lack of theoretical justification and their instability when more data are added (de Quieroz and Good 1997). The methodological principle of clustering sequences at some threshold before analysis is also not well grounded theoretically. One example would be if a single nucleotide change in the 16S rRNA gene of a single species distinguished exactly lean from obese humans, or co-varied perfectly with disease severity in IBD. Such findings would be of enormous importance yet would be missed completely by current techniques. Similarly, we know that because of factors such as horizontal gene transfer, gene- and taxon-level analysis will not map precisely on to each another, yet the data to perform such analysis and the theoretical framework for reconciling differences is at this point largely lacking.

Some of the solutions to these problems are being sought in large-scale projects such as the Earth Microbiome Project (Gilbert et al. 2010; Knight et al. 2012). These research consortia are working towards understand relationships among microbial processes across different systems and timescales. They will be especially important for identifying which theoretical constructs across different scales and levels of analysis are especially useful both for understanding and predicting microbial community responses. And as this article has made clear, the availability of large datasets and the development of new methods with which to analyze them have already produced dramatic changes in how the microbial world is understood, and its relationship to the rest of the biological world. As the many human microbiome studies discussed above show, microbial ecology—especially molecular microbial ecology, even at its relatively crude stage of development—is transforming how human biology itself is understood. This transformation, which we expect to occur not just in human biology but in traditional ecology and biology more broadly, will raise philosophical issues that require the attention of scientists and philosophers. We have indicated just some of these issues, dealing with the units of analysis and the causal powers associated with them, and how imperfect methods and models become more refined and effective in the process of inquiry. Philosophy of biology itself can learn a great deal from these recent and future developments in microbial ecology, as other papers in this special issue demonstrate.

References

Amann RI, Ludwig W, Schleifer KH (1995) Phylogenetic identification and in situ detection of individual microbial cells without cultivation. Microbiol Rev 59(1):143–169
Google Scholar
Arumugam M, Raes J, Pelletier E, Le Paslier D, Yamada T, Mende DR, Fernandes GR, Tap J, Bruls T, Batto JM, Bertalan M, Borruel N, Casellas F, Fernandez L, Gautier L, Hansen T, Hattori M, Hayashi T, Kleerebezem M, Kurokawa K, Leclerc M, Levenez F, Manichanh C, Nielsen HB, Nielsen T, Pons N, Poulain J, Qin J, Sicheritz-Ponten T, Tims S, Torrents D, Ugarte E, Zoetendal EG, Wang J, Guarner F, Pedersen O, de Vos WM, Brunak S, Dore J, Antolin M, Artiguenave F, Blottiere HM, Almeida M, Brechot C, Cara C, Chervaux C, Cultrone A, Delorme C, Denariaz G, Dervyn R, Foerstner KU, Friss C, van de Guchte M, Guedon E, Haimet F, Huber W, van Hylckama-Vlieg J, Jamet A, Juste C, Kaci G, Knol J, Lakhdari O, Layec S, Le Roux K, Maguin E, Merieux A, Melo Minardi R, M’Rini C, Muller J, Oozeer R, Parkhill J, Renault P, Rescigno M, Sanchez N, Sunagawa S, Torrejon A, Turner K, Vandemeulebrouck G, Varela E, Winogradsky Y, Zeller G, Weissenbach J, Ehrlich SD, Bork P (2011) Enterotypes of the human gut microbiome. Nature 473(7346):174–180. doi:10.1038/nature09944
Article Google Scholar
Auguet JC, Barberan A, Casamayor EO (2010) Global ecological patterns in uncultured Archaea. ISME J 4(2):182–190. doi:10.1038/ismej.2009.109
Article Google Scholar
Barberán A, Bates ST, Casamayor EO, Fierer N (2012) Using network analysis to explore co-occurrence patterns in soil microbial communities. ISME J 6(2):343–351. doi:10.1038/ismej.2011.119
Article Google Scholar
Beltran JR, Garcia-Lucia J, Navarro J (1994) Edge detection and classification using Mallat’s wavelet. In: Image Processing, 1994. Proceedings. ICIP-94, IEEE, 1994. IEEE Conference Publications, pp 293–297
Biasucci G, Rubini M, Riboni S, Morelli L, Bessi E, Retetangos C (2010) Mode of delivery affects the bacterial community in the newborn gut. Early Hum Dev 86(Suppl 1):13–15. doi:10.1016/j.earlhumdev.2010.01.004
Article Google Scholar
Brandon K (2011) Gut-bacteria mapping finds three global varieties. WIRED. http://www.wired.com/wiredscience/2011/04/gut-bacteria-types/. Accessed 16 Jan 2013
Caporaso JG, Lauber CL, Costello EK, Berg-Lyons D, Gonzalez A, Stombaugh J, Knights D, Gajer P, Ravel J, Fierer N, Gordon JI, Knight R (2011a) Moving pictures of the human microbiome. Genome Biol 12(5):R50. doi:10.1186/gb-2011-12-5-r50
Article Google Scholar
Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Lozupone CA, Turnbaugh PJ, Fierer N, Knight R (2011b) Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc Natl Acad Sci USA 108(Suppl 1):4516–4522. doi:10.1073/pnas.1000080107
Article Google Scholar
Caporaso JG, Paszkiewicz K, Field D, Knight R, Gilbert JA (2012) The Western English Channel contains a persistent microbial seed bank. ISME J 6(6):1089–1093. doi:10.1038/ismej.2011.162
Article Google Scholar
Chu H, Fierer N, Lauber CL, Caporaso JG, Knight R, Grogan P (2010) Soil bacterial diversity in the Arctic is not fundamentally different from that found in other biomes. Environ Microbiol 12(11):2998–3006. doi:10.1111/j.1462-2920.2010.02277.x
Article Google Scholar
Claesson MJ, Jeffery IB, Conde S, Power SE, O’Connor EM, Cusack S, Harris HM, Coakley M, Lakshminarayanan B, O’Sullivan O, Fitzgerald GF, Deane J, O’Connor M, Harnedy N, O’Connor K, O’Mahony D, van Sinderen D, Wallace M, Brennan L, Stanton C, Marchesi JR, Fitzgerald AP, Shanahan F, Hill C, Ross RP, O’Toole PW (2012) Gut microbiota composition correlates with diet and health in the elderly. Nature 488(7410):178–184. doi:10.1038/nature11319
Article Google Scholar
Cohan FM (2002) What are bacterial species? Annu Rev Microbiol 56:457–487. doi:10.1146/annurev.micro.56.012302.160634
Article Google Scholar
Costello EK, Lauber CL, Hamady M, Fierer N, Gordon JI, Knight R (2009) Bacterial community variation in human body habitats across space and time. Science 326(5960):1694–1697. doi:10.1126/science.1177486
Article Google Scholar
Costello EK, Stagaman K, Dethlefsen L, Bohannan BJ, Relman DA (2012) The application of ecological theory toward an understanding of the human microbiome. Science 336(6086):1255–1262. doi:10.1126/science.1224203
Article Google Scholar
de Quieroz K, Good DA (1997) Phenetic clustering in biology: a critique. Q Rev Biol 72(1):3–30
Article Google Scholar
DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Anderson GL (2006) Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol 72(7):5069–5072. doi:10.1128/AEM.03006-05
Google Scholar
Dethlefsen L, Relman DA (2011) Incomplete recovery and individualized responses of the human distal gut microbiota to repeated antibiotic perturbation. Proc Natl Acad Sci USA 108(Suppl 1):4554–4561. doi:10.1073/pnas.1000087107
Article Google Scholar
Dethlefsen L, Huse S, Sogin ML, Relman DA (2008) The pervasive effects of an antibiotic on the human gut microbiota, as revealed by deep 16S rRNA sequencing. PLoS Biol 6(11):e280. doi:10.1371/journal.pbio.0060280
Article Google Scholar
Dominguez-Bello MG, Costello EK, Contreras M, Magris M, Hidalgo G, Fierer N, Knight R (2010) Delivery mode shapes the acquisition and structure of the initial microbiota across multiple body habitats in newborns. Proc Natl Acad Sci USA 107(26):11971–11975. doi:10.1073/pnas.1002601107
Article Google Scholar
Duncan SH, Lobley GE, Holtrop G, Ince J, Johnstone AM, Louis P, Flint HJ (2008) Human colonic microbiota associated with diet, obesity and weight loss. Int J Obes 32(11):1720–1724. doi:10.1038/ijo.2008.155
Article Google Scholar
Eckburg PB, Bik EM, Bernstein CN, Purdom E, Dethlefsen L, Sargent M, Gill SR, Nelson KE, Relman DA (2005) Diversity of the human intestinal microbial flora. Science 308(5728):1635–1638. doi:10.1126/science.1110591
Article Google Scholar
Edwards KJ, Bond PL, Gihring TM, Banfield JF (2000) An archaeal iron-oxidizing extreme acidophile important in acid mine drainage. Science 287(5459):1796–1799
Article Google Scholar
Fierer N, Lauber CL, Zhou N, McDonald D, Costello EK, Knight R (2010) Forensic identification using skin bacterial communities. Proc Natl Acad Sci USA 107(14):6477–6481. doi:10.1073/pnas.1000162107
Article Google Scholar
Fierer N, Lauber CL, Ramirez KS, Zaneveld J, Bradford MA, Knight R (2012a) Comparative metagenomic, phylogenetic and physiological analyses of soil microbial communities across nitrogen gradients. ISME J 6(5):1007–1017. doi:10.1038/ismej.2011.159
Article Google Scholar
Fierer N, Leff JW, Adams BJ, Nielsen UN, Bates ST, Lauber CL, Owens S, Gilbert JA, Wall DH, Caporaso JG (2012b) Cross-biome metagenomic analyses of soil microbial communities and their functional attributes. Proc Natl Acad Sci USA 109(52):21390–21395. doi:10.1073/pnas.1215210110
Article Google Scholar
Finegold SM, Dowd SE, Gontcharova V, Liu C, Henley KE, Wolcott RD, Youn E, Summanen PH, Granpeesheh D, Dixon D, Liu M, Molitoris DR, Green JA 3rd (2010) Pyrosequencing study of fecal microflora of autistic and control children. Anaerobe 16(4):444–453. doi:10.1016/j.anaerobe.2010.06.008
Article Google Scholar
Frank DN, St Amand AL, Feldman RA, Boedeker EC, Harpaz N, Pace NR (2007) Molecular-phylogenetic characterization of microbial community imbalances in human inflammatory bowel diseases. Proc Natl Acad Sci USA 104(34):13780–13785. doi:10.1073/pnas.0706625104
Article Google Scholar
Gajer P, Brotman RM, Bai G, Sakamoto J, Schutte UM, Zhong X, Koenig SS, Fu L, Ma ZS, Zhou X, Abdo Z, Forney LJ, Ravel J (2012) Temporal dynamics of the human vaginal microbiota. Sci Transl Med 4(132):132ra152. doi:10.1126/scitranslmed.3003605
Article Google Scholar
Gilbert JA, Meyer F, Jansson J, Gordon J, Pace N, Tiedje J, Ley R, Fierer N, Field D, Kyrpides N, Glockner FO, Klenk HP, Wommack KE, Glass E, Docherty K, Gallery R, Stevens R, Knight R (2010) The earth microbiome project: meeting report of the “1 EMP meeting on sample selection and acquisition” at Argonne National Laboratory October 6. Stand Genomic Sci 3(3):249–253. doi:10.4056/aigs.1443528
Article Google Scholar
Gilbert JA, Steele JA, Caporaso JG, Steinbruck L, Reeder J, Temperton B, Huse S, McHardy AC, Knight R, Joint I, Somerfield P, Fuhrman JA, Field D (2012) Defining seasonal marine microbial community dynamics. ISME J 6(2):298–308. doi:10.1038/ismej.2011.107
Article Google Scholar
Gonzalez A, Knight R (2012) Advancing analytical algorithms and pipelines for billions of microbial sequences. Curr Opin Biotechnol 23(1):64–71. doi:10.1016/j.copbio.2011.11.028
Article Google Scholar
Gonzalez A, Clemente JC, Shade A, Metcalf JL, Song S, Prithiviraj B, Palmer BE, Knight R (2011) Our microbial selves: what ecology can teach us. EMBO Rep 12(8):775–784. doi:10.1038/embor.2011.137
Article Google Scholar
Gordon JI, Dewey KG, Mills DA, Medzhitov RM (2012) The human gut microbiota and undernutrition. Sci Transl Med 4(137):137ps112. doi:10.1126/scitranslmed.3004347
Article Google Scholar
Hamady M, Knight R (2009) Microbial community profiling for human microbiome projects: tools, techniques, and challenges. Genome Res 19(7):1141–1152. doi:10.1101/gr.085464.108
Article Google Scholar
Harris JK, Gregory Caporaso J, Walker JJ, Spear JR, Gold NJ, Robertson CE, Hugenholtz P, Goodrich J, McDonald D, Knights D, Marshall P, Tufo H, Knight R, Pace NR (2013) Phylogenetic stratigraphy in the Guerrero Negro hypersaline microbial mat. ISME J 7(1):50–60. doi:10.1038/ismej.2012.79
Article Google Scholar
HMP-Consortium (2012) Structure, function and diversity of the healthy human microbiome. Nature 486(7402):207–214. doi:10.1038/nature11234
Article Google Scholar
Holling CS (1973) Resilience and stability of ecological systems L. Annu Rev Ecol Syst 4:1–23
Article Google Scholar
Human-Food-Project (2012) American Gut—what’s in your gut? Indiegogo. http://humanfoodproject.com/american-gut. Accessed 16 Jan 2013
Ivanova N, Sorokin A, Anderson I, Galleron N, Candelon B, Kapatral V, Bhattacharyya A, Reznik G, Mikhailova N, Lapidus A, Chu L, Mazur M, Goltsman E, Larsen N, D’Souza M, Walunas T, Grechkin Y, Pusch G, Haselkorn R, Fonstein M, Ehrlich SD, Overbeek R, Kyrpides N (2003) Genome sequence of Bacillus cereus and comparative analysis with Bacillus anthracis. Nature 423(6935):87–91. doi:10.1038/nature01582
Article Google Scholar
Jeffery IB, Claesson MJ, O’Toole PW, Shanahan F (2012) Categorization of the gut microbiota: enterotypes or gradients? Nat Rev Microbiol 10(9):591–592
Article Google Scholar
Kallus SJ, Brandt LJ (2012) The intestinal microbiota and obesity. J Clin Gastroenterol 46(1):16–24. doi:10.1097/MCG.0b013e31823711fd
Article Google Scholar
Kazor CE, Mitchell PM, Lee AM, Stokes LN, Loesche WJ, Dewhirst FE, Paster BJ (2003) Diversity of bacterial populations on the tongue dorsa of patients with halitosis and healthy patients. J Clin Microbiol 41(2):558–563
Article Google Scholar
Knight R, Jansson J, Field D, Fierer N, Desai N, Fuhrman JA, Hugenholtz P, van der Lelie D, Meyer F, Stevens R, Bailey MJ, Gordon JI, Kowalchuk GA, Gilbert JA (2012) Unlocking the potential of metagenomics through replicated experimental design. Nat Biotechnol 30(6):513–520. doi:10.1038/nbt.2235
Article Google Scholar
Knights D, Parfrey LW, Zaneveld J, Lozupone C, Knight R (2011) Human-associated microbial signatures: examining their predictive value. Cell Host Microbe 10(4):292–296. doi:10.1016/j.chom.2011.09.003
Article Google Scholar
Koenig JE, Spor A, Scalfone N, Fricker AD, Stombaugh J, Knight R, Angenent LT, Ley RE (2011) Succession of microbial consortia in the developing infant gut microbiome. Proc Natl Acad Sci USA 108(Suppl 1):4578–4585. doi:10.1073/pnas.1000081107
Article Google Scholar
Konstantinidis KT, Tiedje JM (2005) Genomic insights that advance the species definition for prokaryotes. Proc Natl Acad Sci USA 102(7):2567–2572. doi:10.1073/pnas.0409727102
Article Google Scholar
Kunin V, Engelbrektson A, Ochman H, Hugenholtz P (2010) Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates. Environ Microbiol 12(1):118–123. doi:10.1111/j.1462-2920.2009.02051.x
Article Google Scholar
Landy J, Al-Hassi HO, McLaughlin SD, Walker AW, Ciclitira PJ, Nicholls RJ, Clark SK, Hart AL (2011) Review article: faecal transplantation therapy for gastrointestinal disease. Aliment Pharmacol Ther 34(4):409–415. doi:10.1111/j.1365-2036.2011.04737.x
Article Google Scholar
Lane DJ, Pace B, Olsen GJ, Stahl DA, Sogin ML, Pace NR (1985) Rapid determination of 16S ribosomal RNA sequences for phylogenetic analyses. Proc Natl Acad Sci USA 82(20):6955–6959
Article Google Scholar
Lauber CL, Hamady M, Knight R, Fierer N (2009) Pyrosequencing-based assessment of soil pH as a predictor of soil bacterial community structure at the continental scale. Appl Environ Microbiol 75(15):5111–5120. doi:10.1128/AEM.00335-09
Article Google Scholar
Lawrence JG (1999) Gene transfer, speciation, and the evolution of bacterial genomes. Curr Opin Microbiol 2(5):519–523
Article Google Scholar
Ley RE, Backhed F, Turnbaugh P, Lozupone CA, Knight RD, Gordon JI (2005) Obesity alters gut microbial ecology. Proc Natl Acad Sci USA 102(31):11070–11075. doi:10.1073/pnas.0504978102
Article Google Scholar
Ley RE, Harris JK, Wilcox J, Spear JR, Miller SR, Bebout BM, Maresca JA, Bryant DA, Sogin ML, Pace NR (2006) Unexpected diversity and complexity of the Guerrero Negro hypersaline microbial mat. Appl Environ Microbiol 72(5):3685–3695. doi:10.1128/AEM.72.5.3685-3695.2006
Article Google Scholar
Liu Z, DeSantis TZ, Andersen GL, Knight R (2008) Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers. Nucleic Acids Res 36(18):e120. doi:10.1093/nar/gkn491
Article Google Scholar
Lozupone CA, Knight R (2007) Global patterns in bacterial diversity. Proc Natl Acad Sci USA 104(27):11436–11440. doi:10.1073/pnas.0611525104
Article Google Scholar
Lozupone CA, Stombaugh JI, Gordon JI, Jansson JK, Knight R (2012) Diversity, stability and resilience of the human gut microbiota. Nature 489(7415):220–230. doi:10.1038/nature11550
Article Google Scholar
MacDonald NJ, Parks DH, Beiko RG (2012) Rapid identification of high-confidence taxonomic assignments for metagenomic data. Nucleic Acids Res 40(14):e111. doi:10.1093/nar/gks335
Article Google Scholar
Mallat SG (1989) Multifrequency channel decomposiitons of images and wavelet models. IEEE Trans Acoust Speech Signal Process 37(12):2091–2110
Article Google Scholar
Mallat S, Zhong S (1992) Characterization of signals from multiscale edges. IEEE Trans Pattern Anal Mach Intell 14(7):710–732
Article Google Scholar
Mason JS (1978) A new computation procedure for the discrete Fourier transform. IEE Proc G 2(1):16–20
Google Scholar
Michail S, Durbin M, Turner D, Griffiths AM, Mack DR, Hyams J, Leleiko N, Kenche H, Stolfi A, Wine E (2012) Alterations in the gut microbiome of children with severe ulcerative colitis. Inflamm Bowel Dis 18(10):1799–1808. doi:10.1002/ibd.22860
Article Google Scholar
Moller-Levet CS, Cho KH, Wolkenhauer O (2003) Microarray data clustering based on temporal variation: FCV with TSD preclustering. Appl Bioinform 2(1):35–45
Google Scholar
NIH (2012) Human microbiome project. National Institutes of Health. http://commonfund.nih.gov/hmp. Accessed 16 Jan 2013
Pace NR (1997) A molecular view of microbial diversity and the biosphere. Science 276(5313):734–740
Article Google Scholar
Pace NR, Stahl DA, Lane DJ, Olsen GJ (1986) The analysis of natural microbial populations by ribosomal RNA sequences. Adv Microb Ecol 9:1–55
Google Scholar
Personal-Genome-Project (2012) Personal genome project. Personal Genome Project. http://www.personalgenomes.org. Accessed 16 Jan 2013
Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, Nielsen T, Pons N, Levenez F, Yamada T, Mende DR, Li J, Xu J, Li S, Li D, Cao J, Wang B, Liang H, Zheng H, Xie Y, Tap J, Lepage P, Bertalan M, Batto JM, Hansen T, Le Paslier D, Linneberg A, Nielsen HB, Pelletier E, Renault P, Sicheritz-Ponten T, Turner K, Zhu H, Yu C, Li S, Jian M, Zhou Y, Li Y, Zhang X, Li S, Qin N, Yang H, Wang J, Brunak S, Dore J, Guarner F, Kristiansen K, Pedersen O, Parkhill J, Weissenbach J, Bork P, Ehrlich SD, Wang J (2010) A human gut microbial gene catalogue established by metagenomic sequencing. Nature 464(7285):59–65. doi:10.1038/nature08821
Article Google Scholar
Quince C, Lanzen A, Curtis TP, Davenport RJ, Hall N, Head IM, Read LF, Sloan WT (2009) Accurate determination of microbial diversity from 454 pyrosequencing data. Nat Methods 6(9):639–641. doi:10.1038/nmeth.1361
Article Google Scholar
Rappé MS, Giovannoni SJ (2003) The uncultured microbial majority. Annu Rev Microbiol 57:369–394. doi:10.1146/annurev.micro.57.030502.090759
Article Google Scholar
Robertson C, Nelson TA (2010) Review of software for space-time disease surveillance. Int J Health Geogr 9:16. doi:10.1186/1476-072X-9-16
Article Google Scholar
Rousk J, Baath E, Brookes PC, Lauber CL, Lozupone C, Caporaso JG, Knight R, Fierer N (2010) Soil bacterial and fungal communities across a pH gradient in an arable soil. ISME J 4(10):1340–1351. doi:10.1038/ismej.2010.58
Article Google Scholar
Sandholt CH, Sparso T, Grarup N, Albrechtsen A, Almind K, Hansen L, Toft U, Jorgensen T, Hansen T, Pedersen O (2010) Combined analyses of 20 common obesity susceptibility variants. Diabetes 59(7):1667–1673. doi:10.2337/db09-1042
Article Google Scholar
Schloss PD, Handelsman J (2004) Status of the microbial census. Microbiol Mol Biol Rev 68(4):686–691. doi:10.1128/MMBR.68.4.686-691.2004
Article Google Scholar
Schloss PD, Handelsman J (2005) Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness. Appl Environ Microbiol 71(3):1501–1506. doi:10.1128/AEM.71.3.1501-1506.2005
Article Google Scholar
Schloss PD, Westcott SL (2011) Assessing and improving methods used in operational taxonomic unit-based approaches for 16S rRNA gene sequence analysis. Appl Environ Microbiol 77(10):3219–3226. doi:10.1128/AEM.02810-10
Article Google Scholar
Schwiertz A, Taras D, Schafer K, Beijer S, Bos NA, Donus C, Hardt PD (2010) Microbiota and SCFA in lean and overweight healthy subjects. Obesity 18(1):190–195. doi:10.1038/oby.2009.167
Article Google Scholar
Shendure J, Ji H (2008) Next-generation DNA sequencing. Nat Biotechnol 26(10):1135–1145. doi:10.1038/nbt1486
Article Google Scholar
Staley JT, Konopka A (1985) Measurement of in situ activities of nonphotosynthetic microorganisms in aquatic and terrestrial habitats. Annu Rev Microbiol 39:321–346. doi:10.1146/annurev.mi.39.100185.001541
Article Google Scholar
Steele JA, Countway PD, Xia L, Vigil PD, Beman JM, Kim DY, Chow CE, Sachdeva R, Jones AC, Schwalbach MS, Rose JM, Hewson I, Patel A, Sun F, Caron DA, Fuhrman JA (2011) Marine bacterial, archaeal and protistan association networks reveal ecological linkages. ISME J 5(9):1414–1425. doi:10.1038/ismej.2011.24
Article Google Scholar
Tamames J, Abellan JJ, Pignatelli M, Camacho A, Moya A (2010) Environmental distribution of prokaryotic taxa. BMC Microbiol 10:85. doi:10.1186/1471-2180-10-85
Article Google Scholar
Turnbaugh PJ, Ley RE, Mahowald MA, Magrini V, Mardis ER, Gordon JI (2006) An obesity-associated gut microbiome with increased capacity for energy harvest. Nature 444(7122):1027–1031. doi:10.1038/nature05414
Article Google Scholar
Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R, Gordon JI (2007) The human microbiome project. Nature 449(7164):804–810. doi:10.1038/nature06244
Article Google Scholar
Turnbaugh PJ, Backhed F, Fulton L, Gordon JI (2008) Diet-induced obesity is linked to marked but reversible alterations in the mouse distal gut microbiome. Cell Host Microbe 3(4):213–223. doi:10.1016/j.chom.2008.02.015
Article Google Scholar
Turnbaugh PJ, Hamady M, Yatsunenko T, Cantarel BL, Duncan A, Ley RE, Sogin ML, Jones WJ, Roe BA, Affourtit JP, Egholm M, Henrissat B, Heath AC, Knight R, Gordon JI (2009a) A core gut microbiome in obese and lean twins. Nature 457(7228):480–484. doi:10.1038/nature07540
Article Google Scholar
Turnbaugh PJ, Ridaura VK, Faith JJ, Rey FE, Knight R, Gordon JI (2009b) The effect of diet on the human gut microbiome: a metagenomic analysis in humanized gnotobiotic mice. Sci Transl Med 1(6):6ra14. doi:10.1126/scitranslmed.3000322
Article Google Scholar
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD, Zheng XH, Chen L, Skupski M, Subramanian G, Thomas PD, Zhang J, Gabor Miklos GL, Nelson C, Broder S, Clark AG, Nadeau J, McKusick VA, Zinder N, Levine AJ, Roberts RJ, Simon M, Slayman C, Hunkapiller M, Bolanos R, Delcher A, Dew I, Fasulo D, Flanigan M, Florea L, Halpern A, Hannenhalli S, Kravitz S, Levy S, Mobarry C, Reinert K, Remington K, Abu-Threideh J, Beasley E, Biddick K, Bonazzi V, Brandon R, Cargill M, Chandramouliswaran I, Charlab R, Chaturvedi K, Deng Z, Di Francesco V, Dunn P, Eilbeck K, Evangelista C, Gabrielian AE, Gan W, Ge W, Gong F, Gu Z, Guan P, Heiman TJ, Higgins ME, Ji RR, Ke Z, Ketchum KA, Lai Z, Lei Y, Li Z, Li J, Liang Y, Lin X, Lu F, Merkulov GV, Milshina N, Moore HM, Naik AK, Narayan VA, Neelam B, Nusskern D, Rusch DB, Salzberg S, Shao W, Shue B, Sun J, Wang Z, Wang A, Wang X, Wang J, Wei M, Wides R, Xiao C, Yan C, Yao A, Ye J, Zhan M, Zhang W, Zhang H, Zhao Q, Zheng L, Zhong F, Zhong W, Zhu S, Zhao S, Gilbert D, Baumhueter S, Spier G, Carter C, Cravchik A, Woodage T, Ali F, An H, Awe A, Baldwin D, Baden H, Barnstead M, Barrow I, Beeson K, Busam D, Carver A, Center A, Cheng ML, Curry L, Danaher S, Davenport L, Desilets R, Dietz S, Dodson K, Doup L, Ferriera S, Garg N, Gluecksmann A, Hart B, Haynes J, Haynes C, Heiner C, Hladun S, Hostin D, Houck J, Howland T, Ibegwam C, Johnson J, Kalush F, Kline L, Koduru S, Love A, Mann F, May D, McCawley S, McIntosh T, McMullen I, Moy M, Moy L, Murphy B, Nelson K, Pfannkoch C, Pratts E, Puri V, Qureshi H, Reardon M, Rodriguez R, Rogers YH, Romblad D, Ruhfel B, Scott R, Sitter C, Smallwood M, Stewart E, Strong R, Suh E, Thomas R, Tint NN, Tse S, Vech C, Wang G, Wetter J, Williams S, Williams M, Windsor S, Winn-Deen E, Wolfe K, Zaveri J, Zaveri K, Abril JF, Guigo R, Campbell MJ, Sjolander KV, Karlak B, Kejariwal A, Mi H, Lazareva B, Hatton T, Narechania A, Diemer K, Muruganujan A, Guo N, Sato S, Bafna V, Istrail S, Lippert R, Schwartz R, Walenz B, Yooseph S, Allen D, Basu A, Baxendale J, Blick L, Caminha M, Carnes-Stine J, Caulk P, Chiang YH, Coyne M, Dahlke C, Mays A, Dombroski M, Donnelly M, Ely D, Esparham S, Fosler C, Gire H, Glanowski S, Glasser K, Glodek A, Gorokhov M, Graham K, Gropman B, Harris M, Heil J, Henderson S, Hoover J, Jennings D, Jordan C, Jordan J, Kasha J, Kagan L, Kraft C, Levitsky A, Lewis M, Liu X, Lopez J, Ma D, Majoros W, McDaniel J, Murphy S, Newman M, Nguyen T, Nguyen N, Nodell M, Pan S, Peck J, Peterson M, Rowe W, Sanders R, Scott J, Simpson M, Smith T, Sprague A, Stockwell T, Turner R, Venter E, Wang M, Wen M, Wu D, Wu M, Xia A, Zandieh A, Zhu X (2001) The sequence of the human genome. Science 291(5507):1304–1351. doi:10.1126/science.1058040
Article Google Scholar
Vrieze A, Van Nood E, Holleman F, Salojarvi J, Kootte RS, Bartelsman JF, Dallinga-Thie GM, Ackermans MT, Serlie MJ, Oozeer R, Derrien M, Druesne A, Van Hylckama Vlieg JE, Bloks VW, Groen AK, Heilig HG, Zoetendal EG, Stroes ES, de Vos WM, Hoekstra JB, Nieuwdorp M (2012) Transfer of intestinal microbiota from lean donors increases insulin sensitivity in individuals with metabolic syndrome. Gastroenterology 143(4):913–916.e7. doi:10.1053/j.gastro.2012.06.031
Google Scholar
Wang Q, Garrity GM, Tiedje JM, Cole JR (2007) Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol 73(16):5261–5267. doi:10.1128/AEM.00062-07
Article Google Scholar
Winker S, Woese CR (1991) A definition of the domains Archaea, Bacteria and Eucarya in terms of small subunit ribosomal RNA characteristics. Syst Appl Microbiol 14(4):305–310
Article Google Scholar
Woese CR, Fox GE (1977) Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proc Natl Acad Sci USA 74(11):5088–5090
Article Google Scholar
Wu D, Hugenholtz P, Mavromatis K, Pukall R, Dalin E, Ivanova NN, Kunin V, Goodwin L, Wu M, Tindall BJ, Hooper SD, Pati A, Lykidis A, Spring S, Anderson IJ, D’Haeseleer P, Zemla A, Singer M, Lapidus A, Nolan M, Copeland A, Han C, Chen F, Cheng JF, Lucas S, Kerfeld C, Lang E, Gronow S, Chain P, Bruce D, Rubin EM, Kyrpides NC, Klenk HP, Eisen JA (2009) A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea. Nature 462(7276):1056–1060. doi:10.1038/nature08656
Article Google Scholar
Wu GD, Chen J, Hoffmann C, Bittinger K, Chen YY, Keilbaugh SA, Bewtra M, Knights D, Walters WA, Knight R, Sinha R, Gilroy E, Gupta K, Baldassano R, Nessel L, Li H, Bushman FD, Lewis JD (2011) Linking long-term dietary patterns with gut microbial enterotypes. Science 334(6052):105–108. doi:10.1126/science.1208344
Article Google Scholar
Yang F, Zeng X, Ning K, Liu KL, Lo CC, Wang W, Chen J, Wang D, Huang R, Chang X, Chain PS, Xie G, Ling J, Xu J (2012) Saliva microbiomes distinguish caries-active from healthy human populations. ISME J 6(1):1–10. doi:10.1038/ismej.2011.71
Article Google Scholar
Yatsunenko T, Rey FE, Manary MJ, Trehan I, Dominguez-Bello MG, Contreras M, Magris M, Hidalgo G, Baldassano RN, Anokhin AP, Heath AC, Warner B, Reeder J, Kuczynski J, Caporaso JG, Lozupone CA, Lauber C, Clemente JC, Knights D, Knight R, Gordon JI (2012) Human gut microbiome viewed across age and geography. Nature 486(7402):222–227. doi:10.1038/nature11053
Google Scholar
Yong E (2012) Gut microbial ‘enterotypes’ become less clear-cut. Nature. http://www.nature.com/news/gut-microbial-enterotypes-become-less-clear-cut-1.10276. Accessed 16 Jan 2013
Zaneveld JR, Lozupone C, Gordon JI, Knight R (2010) Ribosomal RNA diversity predicts genome diversity in gut bacteria and their relatives. Nucleic Acids Res 38(12):3869–3879. doi:10.1093/nar/gkq066
Article Google Scholar
Zimmer C (2011) Bacterial ecosystems divide people into 3 groups, scientists say. New York Times. http://www.nytimes.com/2011/04/21/science/21gut.html. Accessed 16 Jan 2013

Download references

Acknowledgments

We would like to thank Maureen O’Malley and our referees for their in-depth and insightful commentary and suggestions. This work was supported in part by NSF IGERT award 1144807 and the Howard Hughes Medical Institute.

Author information

Authors and Affiliations

Department of Computer Science, University of Colorado at Boulder, Boulder, CO, USA
Daniel McDonald & Rob Knight
BioFrontiers Institute, University of Colorado at Boulder, Boulder, CO, USA
Daniel McDonald & Rob Knight
Department of Chemistry and Biochemistry, University of Colorado at Boulder, Boulder, CO, USA
Yoshiki Vázquez-Baeza & Rob Knight
Department of Molecular, Cellular and Developmental Biology, University of Colorado at Boulder, Boulder, CO, USA
William A. Walters
Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
J. Gregory Caporaso
Argonne National Laboratory, Institute for Genomics and Systems Biology, Argonne, IL, USA
J. Gregory Caporaso
Howard Hughes Medical Institute, Boulder, CO, USA
Rob Knight

Authors

Daniel McDonald
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiki Vázquez-Baeza
View author publications
You can also search for this author in PubMed Google Scholar
William A. Walters
View author publications
You can also search for this author in PubMed Google Scholar
J. Gregory Caporaso
View author publications
You can also search for this author in PubMed Google Scholar
Rob Knight
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rob Knight.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

McDonald, D., Vázquez-Baeza, Y., Walters, W.A. et al. From molecules to dynamic biological communities. Biol Philos 28, 241–259 (2013). https://doi.org/10.1007/s10539-013-9364-4

Download citation

Received: 01 November 2012
Accepted: 23 January 2013
Published: 05 February 2013
Issue Date: March 2013
DOI: https://doi.org/10.1007/s10539-013-9364-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

From molecules to dynamic biological communities

Abstract

Similar content being viewed by others

Introduction to Genetic, Genomic, and System Analyses for Communities