The theory of games and microbe ecology

Using game theory we provide mathematical proof that if a species of asexually reproducing microbes does not possess maximum variability in competitive abilities amongst its individual organisms, then that species is vulnerable to replacement by competitors. Furthermore, we prove that such maximally variable species are neutral towards each other in competition for limited resources; they coexist. Our proof is constructive: given one species which does not possess maximum variability, we construct a species with the same (or lower) mean competitive ability which can invade, in the sense that its expected value in competition is positive whereas the expected value of the non-maximally variable species is negative. Our results point towards the mechanistic underpinnings for the frequent observations that (1) microbes are characterized by large intra-specific variability and that (2) the number of extant microbe species is very large.

intra-specific variability has even been documented in marine metazoa [12]. Moreover, demonstrable variation among clonal individuals has shown intra-specific variability enhances species dispersal rates, cell encounters and population distributions [8]. Recent empirical data confirm that variations in motility at the strain level manifest themselves in different population distributions [6]. Such intra-specific variability may enhance species survival and co-existence and has been suggested to be adaptive based on model simulations [10]. Observations of intra-specific variability in marine microbes are not restricted to artificial laboratory settings. Satellite observations show the distribution of a species of photosynthetic plankton persisting for months, over hundreds of kilometers along the North American west coast [2,17]. Laboratory experimentation on this plankton species documents intra-specific variability in physiology and movement behaviors [9] that provide explanatory power for the species persistence documented with satellite observations. Thus, in the laboratory and field tremendous intra-specific variability is demonstrable but difficult to explain theoretically.
Beyond the tremendous intra-specific physiological and behavioral variability discussed above, there is also an atonishing observable taxonomic diversity within marine microbes. This vast diversity of species occupies all branches on the tree of life [13]. Recent investigations have shown that although thousands of species have already been described and discovered, many if not most marine microbes are yet to be identified [15,18]. Although the number of planktonic species is unknown, the likely number of species exceeds most if not all other groupings of organisms. Using data collected globally, researchers estimated that there may be around 150,000 eukaryotic plankton species [15]. Moreover, if prokaryotic species are to be included, there are likely millions of species of marine microbes and their total number may not be knowable [16].
Here, we introduce a game theoretic model for microbes which aims to understand the mechanistic underpinnings that yield the observable diversity in species and behaviors. The model interpolates between microscopic, individual level competition and macroscopic, species level survival. Justified by observations, species are characterized by intra-specific variability, manifested in different distributions of their competitive abilities. To avoid trivial outcomes, the mean competitive ability is restricted to have a maximum value of 1 2 . This can also be interpreted as assuming species are equally limited by energy and resources, when the species is assessed cumulatively. Species do vary in their trait distributions, which imparts different degrees of intra-specific variability from degenerate (= invariant) distributions to the uniform distribution with maximal variability. Species are represented by specific distributions that do not change over time but individuals' competitive abilities are chosen at random within each fixed cumulative distribution. Randomization of individual competitive abilities over time within a species represents competition in a dynamic system, where species are characterized by multiple physiological, behavioral and morphological traits, that can impart changing degrees of fitness as a function of the particular abiotic and biotic conditions.
We have rigorous mathematical proof (see Appendices A & B) that according to the rules governed by this game theoretic model, only those species that possess maximal variability amongst individual organisms have a unique quality: there is no other species with the same (or lower) mean competitive ability which has positive expected value in competition. In other words, there is no species that can outcompete the maximally variable species. On the other hand, for any species which does not possess maximal variability amongst individual organisms, in the proof of Theorem 1, we explicitly construct a species with the same (or lower) mean competitive ability which has positive expected value in competition, whereas the original species has negative expected value. The maximally variable species is the only species which does not have such vulnerability. Moreover, in Proposition 1, we prove that such maximally variable species are neutral towards each other in competition for limited resources; they coexist. Fig. 1 When two individuals compete, there are three possible outcomes for each individual: win=self-replicate; lose=die; draw=maintain status quo for each competitor. The table shows the pay-off function of individual level competition. A payoff +1 corresponds to win, a payoff -1 corresponds to lose, and a payoff 0 corresponds to a draw.
Over time, if species tend toward maximum variability, then they simultaneously tend towards neutral coexistence. In this sense, we provide a plausible mathematical explanation for two widely documented but poorly understood characteristics of microbes: (1) enormous intra-specific variability and (2) tremendous biodiversity.
This work is organized as follows. In section 2 we introduce our game theoretic model for microbes. This sets the stage for our mathematical results which are presented in section 3. Concluding remarks are offered in section 4. The complete, rigorous mathematical proofs of the results presented in section 3 are contained in appendix A and appendix B.

Game theory for microbes
The mathematics behind our approach is non-cooperative game theory. This is justified by the lack of evidence that individual microbes communicate in such a way as to create coalitions or to cooperate. Biofilms are, however, an exception [4]. Competition occurs between individuals. The outcome of competition between individuals, no matter how biologically complex, can be reduced to three possibilities: win, lose, or draw (see Figure 1). Due to the asexual reproduction of microbes, cumulative success of individuals in competition can be interpreted as population increase, cumulative losses lead to population decrease, and cumulative draws to neither gains nor losses.
In the game theoretic approach to examine competition on the level of species, we assume that each species is comprised of several individual organisms. Each individual has a competitive ability, which is not necessarily constant, but may change from one round of competition to the next. The competitive ability (CA) is represented by a number between 0 and 1. We assume that the species as a whole has a mean competitive ability (MCA) which does not exceed 1/2. The range of both the competitive abilities as well as the mean competitive abilities was chosen arbitrarily and our results are independent of the choice of specific values. At each round of competition, individual organisms are randomly put in pairs. The organism with the higher CA doubles whilst the organism with the lower CA perishes; in case of equal CAs both organisms simply persist. The magnitude of difference in CA between competitors is irrelevant.
A key, and we think novel, feature of our model is that the individual CAs are variable, depending on the distribution of CAs for a given species. Thus, the assembly of the individuals in a species represent the distribution of CAs, and their collective mean CA is equal to the species's MCA. This situation is representative of genetically identical individuals that temporarily differ in competitive ability, due to for example their recent exposure to resources or metabolic demands, the particular state in their life cycle and such. Finally, competition in a dynamic system requires the nature of the competition to vary, for example, competition may be with respect to resource acquisition in one round, and predator defence in another. Thus, individuals that were superior relative to the mean in one round may be inferior within the next round.
This model applies to any arbitrary number of simultaneously competing species, with arbitrary (not necessarily equal) population sizes. Each species does not compete in a series of tournaments against all its competitors, but rather, it is competing against all other species simultaneously. So, our model shows how each species will fare in competition with any arbitrary number of other species.

Same-same but different
For each MCA value, there are numerous ways to assign the individual CAs which have this same MCA value. We term this "same-same but different." We recall the definition of strategic behavior distribution (SBD) from [10] specified to this setting. Here, an SBD is defined to be a particular way to assign CAs to individuals such that the MCA is equal to a fixed value. Interestingly, SBDs having the same MCA nonetheless have dramatically different ecological repercussions depending on the particular characteristics of the SBD.
Largely in order to reduce complexity, it is common to assume that variation among individuals and even species can adequately be represented by a mean or maximal response (e.g. [3]). In this case, that would correspond to all individuals being equal and having CA always equal to the MCA. We refer to this degenerate SBD as the invariant SBD. We shall see in the proof of our main result that it is easily susceptible to invasion by a competing SBD which also has MCA less than or equal to 1/2.
In general, we use here SBDs which are neither purely deterministic nor purely probabilistic. The SBDs are not purely deterministic because the CA of each individual is in general non-constant. For example, we define the bimodal SBD to select half of the organisms of the species to have CA=1, and the remaining half to have CA=0. Any odd-man-out gets 0 or 1 with probability 1/2. However, this selection occurs at each round of competition, so an organism which had CA=0 in the first round of competition could have CA=1 in the second round of competition. Moreover, for an organism which had CA=1 and succeeded in a round of competition and thereby replicates, both it and its clone could have either CA=1 or CA=0 in the next round of competition. Thus, the model is not deterministic. On the other hand, the model is also not purely probabilistic in the sense that each individual is not assigned its CA=1 or 0 each with probability 1/2 , with the probabilities of each individual being independent. If that were the case, then it would be possible, but unlikely, that at a specific round of competition, all individuals had CA=0. In our model, as long as the number of individuals is at least two, this is impossible. Thus, the SBDs we use are neither purely deterministic nor purely probabilistic. This fluidity of differences among individuals requires the species under consideration to be an asexually reproducing microbial species. This restricts our results to microbes, because asexual reproduction will quickly regenerate individuals with identical genetic capacity within the SBD of the species, and thus offsetting losses due to an inferior CA.
Below, to determine the success or lack thereof of defined SBDs, we use this game theoretic model to compute the expected value of SBDs in competition. A positive expected value implies population growth. A negative expected value implies population decrease. An expected value of 0 implies stable population, neither increase or decrease.

Theoretical model and main results
The competitive abilities are selected from the values: where the number M above is an integer greater than or equal to 6. Here it is convenient to introduce the notation N A is the probability that an individual organism has competitive ability (CA) equal to x j . We are interested in the expected value of A in competition with species B. In fact, our model works equally well if we consider B to be all other species which compete with A for limited resources, because we do not require the total number of organisms, N A and N B , respectively, to be equal. We do assume that N A and N B are both positive.

Expected value in competition
We assume that organisms are randomly paired to compete. Then, we compute the probability that the organisms in species A which have competitive ability equal to x i compete with an inferior organism by first counting all of the organisms of species B which have lower CA: The probability of competing with such an organism is given by dividing by the total number of organisms of species B, In a similar way, we compute the probability of competing with a superior organism is Since we do not assume N A and N B are equal, some organisms might not compete at all. If N A < N B then only N A organisms out of the total N B organisms in species B actually compete, so the probability that an organism from species B competes is N A N B . In this case, all organisms from species A compete. If N B < N A , then only N B organisms out of the total N A organisms in species A compete, so the probability that an organism from species A competes is N B N A . In this case, all organisms from species B compete. So, in general, we define Then, the probability that an organism from species A actually competes is and the probability that an organism from species B actually competes is Hence, the expected value for the competition of the organisms in species A with competitive ability The expected value for species A in competition with species B is then calculated by summing the expected value for all competitive abilities: The competition has a zero-sum dynamic, To see why this is true, we use the above considerations to compute: It is then straightforward to compute that This zero-sum dynamic may be interpreted for competition for limited resources. Furthermore we impose the energy and resource constraint corresponding to the assumption that the MCA does not exceed 1/2. The MCA is computed by taking the average of all competitive abilities, so we define Then, the constraint is that for both species

The uniform SBD
Our first result shows that a certain SBD, known as the uniform SBD, is neutral in competition to all other SBDs which have the same MCA as that of the uniform SBD. At the same time, we also show that the uniform SBD has positive expected value in competition with any SBD whose MCA is less than the MCA of the uniform SBD.
Proposition 1 Let A be a species with MCA at most 1 2 . Let U be the uniform SBD, which is the unique SBD having n A (x i ) equal and positive for all i = 0, . . . , M . Then MCA(U ) = 1 2 , and with equality if and only if the MCA of species A is equal to 1 2 . We note that this is independent of the number of organisms in species A and species U .

Logical argumentation
To see why this is true, we use the definition of E[A, U ] to write out this expected value; see (3) in Appendix A. At first, the expression may look complicated, but using the definition of the uniform SBD, as done in (4) of Appendix A, the expression simplifies. With these mathematical manipulations and resulting simplifications, the expression for the expected value of species A in competition with the uniform species U becomes Above, N c is the number of competing organisms, which we recall is the smaller of N A and N U . The number M is how we have defined the competitive values, as the numbers 0, 1 M , 2 M , and up to 1 = M M . In the parentheses we see 2 MCA(A). By the constraint on the MCA, we know that This shows that E This holds for all strategies, B. We use the zero-sum dynamic once more to see that We therefore see that the definition for A to be an NES holds true. Hence, we see that every ESS is also an NES, but we shall see below that not every NES is an ESS.

The unique NES
Our main theorem proves that (1) there is no ESS in our setting, but (2) the uniform SBD is the unique NES.
Theorem 1 Let A be a species comprised of N ≥ 4 individual organisms and which has MCA less than or equal to 1/2. Assume that M ≥ 6. Then, there are two mutually exclusive possibilities: There is an SBD for a species B comprised of at most N individuals which has MCA less than or equal to 1/2 and which has positive expected value in competition with species A.
Therefore, we conclude: 1. There is no SBD for species with at least N ≥ 4 individual organisms and which has MCA less than or equal to 1/2 which is an ESS. 2. The unique NES is the uniform SBD.

Strategy of the proof
We have already demonstrated for the uniform SBD, U , that E[B, U ] ≤ 0 for any SBD, B, which has MCA less than or equal to 1 2 . So, the task at hand is to understand species, A, which are not uniform. We therefore begin by assuming that A is not the uniform SBD. We then proceed with several straightforward steps.

First simplification
We begin by making a simplifying observation. If MCA(A) < 1 2 , then we have shown that the uniform species has positive expected value in competition with species A. So, in that case, the theorem is proven. To complete the proof, we can henceforth assume that MCA(A) = 1 2 .

Pairs of CAs
The next idea is to look at the number of organisms at pairs of competitive abilities, starting with the lowest CA = 0 together with the highest CA = 1. We look at n A (x 0 ) and n A (x M ). There are three possibilities: either n A (x 0 ) > n A (x M ) or n A (x 0 ) < n A (x M ), or n A (x 0 ) = n A (x 1 ).

In the first case, if n
So, in this case we have found a species B which has MCA at most 1 2 (in fact equal to 1 2 ), and the same number of organisms as species A, but which has positive expected value in competition versus species A.

In the second case, n
We compute in §B.2 that In the third case, n A (x 0 ) = n A (x M ). In this case we don't do anything, but instead proceed to look at n A (x 1 ) and n A (x M −1 ).

Pair-by-pair
We look next at n A (x 1 ) and n A (x M −1 ). There are the same three cases as above. In the first and second cases, we can define a species B which has MCA(B) ≤ 1 2 , and the total number of organisms of species B does not exceed N . If n A On the other hand, if n A (x 1 ) < n A (x M −1 ), then we define In §B.3 we show that the number of organisms in species B is N , and the MCA of species B is 1 2 . We compute there that In the last case, n A (x 1 ) = n A (x M −1 ), we do not do anything, just look at the next pair, looking at n A (x 2 ) and n A (x M −2 ). In this way, we continue pair-by-pair.

Second simplification
There are now two mutually exclusive possibilities: either at some step we found a pair which is lopsided, that is n A (x k ) = n A (x M −k ) for some k = 0, 1, . . .. Once we reach such a pair, we show in §B.4 how to construct the species B which has E[B, A] > 0. So, in all such cases, the theorem has been proven. It therefore only remains to consider the case when for all k.

Exploiting non-uniformity
We are now reduced to the case where n A (x k ) = n A (x M −k ) holds for all k. That means that the SBD is symmetric around the competitive ability 1 2 . However, we assumed that A is not-uniform. This means that the numbers n A (x k ) cannot all be identical. So, some of the n A (x k ) are larger, and some are smaller. It is precisely this lack of uniformity which allows us to find a weakness and exploit it with species B. The details of this process are contained in §B.5.

Concluding remarks
Evolution, or survival of the fittest, in the case of microbes must be approached from a different perspective in comparison to macro-organisms. Rather than assessing the fitness of individual microbes and applying evolution to individuals interpreted as representing a species, we propose that it is more fitting to consider cohorts of individuals that collectively but not individually represent the entire species. Individual microbes do not always behave in a way that maximizes fitness, nor does loss of an individual necessarily result in the loss of the genotype, because asexual reproduction yields multiple clones and can rapidly increase the abundance of specific clones or replace lost individuals representing a particular clone. Thus, loss of individuals can have no consequence for the species as a whole, suggesting a very different risk distribution, where the species as a whole may nonetheless survive and remain variable with respect to their trait distribution. Thus, we analyze evolutionary fitness of species by interpolating between the individual-level competitions and the species-level growth or attrition.
As we are assuming competition for limited resources, if one species has a positive expected value, implying population growth, the other species has a negative expected value, implying population decrease. Presuming that species evolve so as to minimize vulnerability to invasion and attrition by other species, rather than to maximize fitness, then all species of microbes which survive would evolve to the uniform SBD. Obviously, the rate of change in the SBD will be influenced by the environmental conditions, the presence of co-occurring species and to what degree SBDs are similar or overlap with co-occurring species (e.g. character displacement). Irrespective of any external factors, the uniform SBD is the unique SBD which contains the maximal variability amongst its individuals. Moreover, it is the unique SBD which is not vulnerable to replacement by species with other SBDs with the same (or lower) MCA. There are two mutually exclusive options: evolve towards the uniform SBD or risk vulnerability to invasion.
Interestingly, the uniform SBD is also neutral in competition with all other SBDs which have the same mean competitive ability. Thus, our second result shows that the uniform SBD coexists with all other SBDs which have the same MCA. Hence, it predicts that microbes which evolve to the uniform SBD simultaneously evolve to co-exist. We therefore have discovered a plausible mathematical explanation for two prevalent characteristics of microbes which until now have been inexplicable in the context of competition theory: (1) maximal variability of traits amongst individuals of a single species and (2) co-existence of virtually limitless different species of microbes. For multi-cellular organisms, intra-specific variability has been suggested to be an important characteristic in the management of biodiversity [19]. Our results imply that a quantification of the trait distributions of microbial organisms is essential to understand processes that drive ecosystem functioning, including ocean ecosystems and the global carbon cycle.
Acknowledgements Susanne Menden-Deuer was supported through the National Aeronautics and Space Administration (award #NNX15AL2G). Julie Rowlett is grateful to Ksenia Fedosova, Philip Gerlee, and Jeffrey Steif for stimulating discussions and insightful comments. Many thanks to Medet Nursultanov for carefully checking the proofs and for suggesting the SBDs defined in (18) and (20). We are also grateful to Edvin Wedlin for suggesting the term "non-exploitable strategy," NES. Constructive feedback by three anonymous reviewers has improved a prior version of this manuscript.

A Proof of Proposition 1
We compute the expected value, where above Nu denotes the total number of uniform organisms, and nu(x j ) denotes those with competitive ability equal to x j . Similarly, we use N A to denote the total number of organisms of species A, and is the number of competing organisms. By definition of the uniform SBD, Similarly, Hence, The last inequality follows because MCA(A) ≤ 1 2 . So, we see that equality holds if and only if MCA(A) = 1 2 .

B Proof of Theorem 1
Let A be a species comprised of N individual organisms. Let x k = k M for 0 ≤ k ≤ M , and n A (x k ) be the number of organisms of species A with competitive ability x k .

B.1 First simplification
If the MCA of A is less than 1 2 , then the uniform SBD has MCA equal to 1/2, and by Proposition 1, So, we may henceforth assume that the MCA of species A is in fact equal to 1/2. We shall prove the result by an induction argument. In this argument, we will be considering pairs of competitive abilities, (x k , x M −k ). To begin, we consider the SBD in which species A has half of its organisms with competitive ability equal to 0, and the other half with competitive ability equal to 1. In case N is odd, any "odd man out" has competitive ability 0 or 1 each with probability 1/2. We call this the bimodal SBD.
In this case, we define species B to have Then, we compute that the MCA of this species is We have used above the assumption that M and N are both greater than or equal to 4. We compute the expected value Henceforth, we may assume that the SBD of species A is not the bimodal SBD. By this assumption n A (0) and n A (1) cannot both be equal to N/2. It will now be convenient to introduce the notation Moreover, we will henceforth assume that M is even. Were M not even, we would simply make the partition twice as fine. Then, we use the constraint that the MCA is equal to 1 2 to compute: Re-arranging, Hence, we obtain the rather useful identity that

B.2 Pairs of CAs
We proceed to look at n A (0) and n A (1). If n A (0) > n A (1), let B be the bimodal SBD.
If this is the case, we compute the expected value of B in competition against A is Hence, the bimodal species satisfies the statement of the theorem in competition against species A.
If on the other hand we have We We now compute the expected value from the other individuals. For this we recall the notation (5). For an individual of species A with CA equal to 1 2 + k M , we count that there are individuals of species B which have strictly higher competitive abilities. We similarly count that there are individuals of species B which have strictly lower competitive abilities. Hence, the expected value from the individuals of species A with CA equal to 1 2 + k M is By the symmetry in the definition of species B, for an individual of species A with CA equal to 1 2 − k M , these numbers are switched, so we compute the expected value is We therefore compute the total expected value, Now, we see that (6) is quite useful: Recall that by definition (5) Hence, So, in case n A (0) and n A (1) are unequal, or in case A is the bimodal SBD, we have proven the theorem.

B.3 Pair-by-pair
We now proceed inductively. Assume that n A (0) and n A (1) are identical. We now consider If these are unequal, in the same way as above, we can construct a species, B, which has higher expected value in competition. In case n A 1 M > n A 1 − 1 M we define the species B to have its organisms equally distributed at 1 M and 1 − 1 M . By the same calculation as above, we compute the expected value On the other hand, if we define Then we compute that the number of organisms in species B is Moreover, so defined the MCA of species B is 1 2 . In species, A, the n A (1) organisms with CA equal to one always win, whereas the n A (0) organisms with CA equal to 0 always lose. Hence their cumulative contribution to E[A, B] is zero. We therefore compute By (6), this is Hence, we have computed that where the final inequality follows from (10) together with the observation that M −2 M −3 > 1. We continue by induction. We assume that n A (x j ) = n A (x M −j ) for all j = 0, . . . , k − 1.
This will be important in our calculation of the expected value for species, B, defined below. We set We first verify that the species, B, has at most N organisms: Next, we verify that the MCA of species, B, is at most 1 2 , So, we define Then, species B has MCA precisely equal to 1 2 . We must show that this is consistent with the necessary condition that ε 0 ≤ ε 1 : .
So, we will choose ε 0 > 0 sufficiently small so that ε 0 ≤ 1 2(M −1) . Since M is fixed, this is certainly possible. Since the total number of organisms of species A is N = N A , whereas the total number of organisms of species B is we see that We therefore calculate the expected value By symmetry, n A (0) = n A (1) = n A (x M ), so we have We recall the definition of ε 1 in (13) to write Since n A (0) is fixed and positive, we can achieve this inequality by simply taking ε 0 > 0 sufficiently small. Specifically, we demand that 0 < ε 0 < n A (0) This argument is for the case that the SBD of A is symmetric about 1 2 , is not the uniform SBD, and we have (11). For the general case in which the SBD of A is symmetric about 1 2 and is not the uniform SBD, let First, we assume that j < M 2 − 1. Then, we define the SBD of B quite similarly as in the case of (11). Specifically, we let We similarly require ε 0 ≤ ε 1

Hence,
It is convenient to group the terms with ε 0 all together, so we must show that n A (x k )   + (2 + 2j) (n A (x j ) + n A (x j+1 ))   < (M − j − 1) (n A (x j ) + n A (x j+1 )) .
By (16), and since j > 0, We therefore see that Hence, it is enough to prove that we can choose ε 0 > 0 sufficiently small so that n A (x k ) + (2 + 2j) (n A (x j ) + n A (x j+1 ))   < n A (x j ).
Finally, we consider the last remaining cases. We still have the assumptions that A is not the uniform SBD, and that its SBD is symmetric about 1 2 . In these cases we have j defined as in (14) satisfying We first handle the case j = M 2 . When this is the case, by the definition of (14), using (5), we note that we have the inequality n A 1 2 = n 0 > n 2 = n −2 = n A 1 2 Then, we define the species B to have We calculate the MCA of species B: Since the distribution of A is symmetric about the point 1 2 , we compute the expected value (using the notation (5)) E[B, A] = 1 3 (−n −1 − n 0 − n 1 − n 2 ) + 2 3 (n 0 + n −1 + n −2 − n 2 ) = n 0 − n 2 3 > 0.
In the very last case, we have j defined in (14) with Then, by the definition of j, the symmetry of the distribution of A, and using the notation (5), In this case, we shall exploit this by defining We compute the MCA: We compute the expected value However, we have proven in the proposition that for all SBDs with MCA less than or equal to 1 2 , Hence, U is the unique NES.