Only Simpson Diversity can be Estimated Accurately from Microbial Community Fingerprints

Haegeman, Bart; Sen, Biswarup; Godon, Jean-Jacques; Hamelin, Jérôme

doi:10.1007/s00248-014-0394-5

Only Simpson Diversity can be Estimated Accurately from Microbial Community Fingerprints

Short Commentary
Published: 29 March 2014

Volume 68, pages 169–172, (2014)
Cite this article

Microbial Ecology Aims and scope Submit manuscript

Bart Haegeman¹,
Biswarup Sen²,
Jean-Jacques Godon³ &
…
Jérôme Hamelin³

1053 Accesses
17 Citations
1 Altmetric
Explore all metrics

Abstract

Lalande et al. (Microb. Ecol. 66(3):647–658, 2013) introduced a promising approach to quantify microbial diversity from fingerprinting profiles. Their analysis is based on extrapolating the abundance of the phylotypes detectable in a fingerprint towards the rare phylotypes of the community. By considering a set of reconstructed communities, Lalande et al. obtained a range of estimates for phylotype richness, Shannon diversity and Simpson diversity. They reported narrow ranges indicating accurate estimation, especially for Shannon and Simpson diversities. Here, we show that a much larger set of reconstructed communities than the one considered by Lalande et al. is consistent with the fingerprint. We find that the estimates for phylotype richness and Shannon diversity vary over orders of magnitude, but that the estimates for Simpson diversity are restricted to a narrow range (around 10 %). We conclude that only Simpson diversity can be estimated accurately from fingerprints.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Lalande J, Villemur R, Deschênes L (2013) A new framework to accurately quantify soil bacterial community diversity from DGGE. Microb Ecol 66(3):647–658
Article PubMed Google Scholar
Loisel P, Harmand J, Zemb O, Latrille E, Lobry C, Delgenès JP, Godon J J (2006) Denaturing gradient electrophoresis (DGE) and single-strand conformation polymorphism (SSCP) molecular fingerprintings revisited by simulation and used as a tool to measure microbial diversity. Environ Microbiol 8(4):720–731
Article CAS PubMed Google Scholar
Blackwood CB, Hudleston D, Zak DR, Buyer JS (2007) Interpreting ecological diversity indices applied to terminal restriction fragment length polymorphism data: insights from simulated microbial communities. Appl Environ Microbiol 73(16):5276–5283
Article PubMed Central CAS PubMed Google Scholar
Haegeman B, Hamelin J, Moriarty J, Neal P, Dushoff J, Weitz JS (2013) Robust estimation of microbial diversity in theory and in practice. ISME J 7(6):1092–1101
Article PubMed Central PubMed Google Scholar

Download references

Acknowledgments

This work was supported by the SYSCOMM project DISCO (ANR-09-SYSC-003) and by the TULIP Laboratory of Excellence (ANR-10-LABX-41).

Author information

Authors and Affiliations

Centre for Biodiversity Theory and Modelling, Centre National de la Recherche Scientifique, Moulis, France
Bart Haegeman
Department of Environmental Engineering and Science, Feng Chia University, Taichung, Taiwan
Biswarup Sen
INRA, UR50, Laboratoire de Biotechnologie de l’Environnement, Narbonne, France
Jean-Jacques Godon & Jérôme Hamelin

Authors

Bart Haegeman
View author publications
You can also search for this author in PubMed Google Scholar
Biswarup Sen
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Jacques Godon
View author publications
You can also search for this author in PubMed Google Scholar
Jérôme Hamelin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bart Haegeman.

Appendix

Here, we describe the reconstructed communities of Fig. 1 and the diversity estimates shown in Fig. 2.

First, we extracted the fingerprint peak areas from Fig. 1 of Ref. [1]. The total area of the 34 extracted peak equals 20 % of the total area under the fingerprinting profile (hence, the peak-to-signal ratio PSR =0.20 in the terminology of Ref. [1]). The remaining 80 % of the area under the profile corresponds to the background (that is, the subpeak background percentage SBP =0.80 in the terminology of Ref. [2]).

Second, we constructed four communities consistent with the fingerprint data. The 34 most abundant phylotypes correspond to the fingerprint peaks. The relative abundance of these phylotypes is equal to the peak areas divided by the total area under the profile. Hence, the total relative abudance of the most abundant phylotypes is equal to 0.20. We chose the abundance distribution of the rare phylotypes such that the following conditions are satisfied: (1) the total relative abundance of the rare phylotypes is equal to 0.80 and (2) the abundance of a rare phylotype is smaller than the abundance of each of the most abundant phylotypes.

We report the abundance distribution of the rare phylotypes as rank-abundance curves, that is, we give the relationship between relative abundance p _i and rank i for the rare phylotypes (with rank i < 34):

The red community has 10³ phylotypes. Its rank-abundance curve is quadratic on a log-log plot, ln p _i = −3.391 − 0.8554 ln i + 0.03750 (lni)² for 34 < i ≤ 10³.
The yellow community has 10⁴ phylotypes. Its rank-abundance curve is linear on a log-log plot, ln p _i = −2.924 − 0.8535 ln i for 34 < i ≤ 10⁴.
The green community has 10⁵ phylotypes. Its rank-abundance curve is linear on a log-log plot, ln p _i = −2.492 − 0.9750 ln i for 34 < i ≤ 10⁵.
The blue community has 10⁶ phylotypes. Its rank-abundance curve is linear on a log-log plot, ln p _i = −2.294 − 1.0306 ln i for 34 < i ≤ 10⁶.

For the yellow, green and blue communities, the abundance distribution of the rare phylotypes is power law. For the red community this distribution is approximately power law (the rank-abundance curve is slightly convex, see Fig. 1, right-hand panel). For a community with 10³ phylotypes, a power law distribution for the rare phylotypes does not match smoothly the abundance of the dominant phylotypes.

Third, we computed three diversity metrics for the four reconstructed communities: phylotype richness D ₀, Shannon diversity D ₁,

$$ D_{1} = \mathrm{e}^{H} \qquad \text{with} \quad H = - \sum\limits_{I} p_{i}\ln p_{i}, $$

(1)

and Simpson diversity D ₂,

$$ D_{2} = \frac{1}{C} \qquad \text{with} \quad C = \sum\limits_{i} p_{i}^{2}. $$

(2)

The notation D ₀, D ₁ and D ₂ refers to Hill diversities of order 0, 1 and 2 (see Ref. [4] for details). Because Hill diversities can be interpreted as effective numbers of phylotypes, they are intercomparable. Therefore, we prefer to use the transformed diversity metrics D ₁ and D ₂ rather than Shannon diversity index H and Simpson concentration index C. We find:

For red community: D ₀=10³, D ₁=7.4 10² and D ₂=4.1 10².
For yellow community: D ₀=10⁴, D ₁=2.8 10³ and D ₂=5.0 10².
For green community: D ₀=10⁵, D ₁=7.7 10³ and D ₂=5.2 10².
For blue community: D ₀=10⁶, D ₁=1.7 10⁴ and D ₂=5.3 10².

Finally, we generalized the analysis to a much large set of reconstructed communities. More precisely, we considered all reconstructed communities satisfying conditions (1) and (2) above. This set, although it contains unrealistic communities (for example, communities with an abrupt transition from dominant to rare phylotypes), is useful to obtain lower and upper bounds for the estimation range of the diversity metrics. Indeed, it is possible to determine the community in this set yielding the lowest and highest diversity estimates. The lowest diversity estimate is obtained for a community in which all the rare phylotypes have the same abundance as the smallest abundance of the most abundant phylotypes. The highest diversity estimate is obtained for a community in which there are a large number R of rare phylotypes which all have the same relative abundance 0.20/R.

The results of this further analysis are shown as the grey-shaded regions in Fig. 2. The lower end of these regions are equal to the lowest diversity estimate. At the upper end, the shade of grey becomes gradually lighter, corresponding to the higest diversity estimate with R ranging from 10⁴ to 10⁷. It is interesting to note the dependence of the highest diversity estimate on the number of rare phylotypes R for the three diversity metrics: when R is large, the estimate for phylotype richness increases proportional to R, the estimate for Shannon diversity increases proportional to ln R and the estimate for Simpson diversity tends to a fixed value. This establishes another argument of why Simpson diversity can be estimated more accurately than Shannon diversity and phylotype richness.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Haegeman, B., Sen, B., Godon, JJ. et al. Only Simpson Diversity can be Estimated Accurately from Microbial Community Fingerprints. Microb Ecol 68, 169–172 (2014). https://doi.org/10.1007/s00248-014-0394-5

Download citation

Received: 24 July 2013
Accepted: 10 February 2014
Published: 29 March 2014
Issue Date: August 2014
DOI: https://doi.org/10.1007/s00248-014-0394-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Only Simpson Diversity can be Estimated Accurately from Microbial Community Fingerprints

Abstract

Access this article

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation