The minimum number \(N_C^{\circ }\) of communities required to obtain the observed joint diversity is a parameter that provides no direct information on structural characteristics relating, for example, to the distribution of diversity over communities. Yet, \(N_C^{\circ }\) can be transformed into an indicator of structural characteristics under the community-centered perspective when considering the deviation of \(N_C^{\circ }\) from \(N_C\). To see this, consider that \(N_C^{\circ }=N_C\) is equivalent to \(v({{{\varvec{\mathsf{{ q}}}}}}_C^{N_C})=v_{TC}\), which in turn is equivalent to \({{{\varvec{\mathsf{{ q}}}}}}_C^{N_C}={{{\varvec{\mathsf{{ q}}}}}}\). The latter follows directly from the specification of \(N_C^{\circ }\) via equalization of the community representations within the types (see also third paragraph in the second section). Therefore, \(N_C^{\circ }=N_C\) only if the same communities are represented at equal frequencies within each type. From a more familiar perspective, this is equivalent to “\(N_C^{\circ }=N_C\) only if communities are not differentiated for their type compositions and are equally sized.” Recall that at the other extreme, \({N_C^{\circ }=1}\) only if communities are completely differentiated for their type compositions irrespective of the community sizes.
The latter suggests consideration of \(N_C^{\circ }\) as indicating community differentiation provided the state of the absence of community differentiation includes equal community sizes to make the absence of differentiation “complete.” This inclusion makes sense when variable community sizes can be argued to have an ecological impact. In fact, it is hard to conceive a community ecological scenario in which community sizes and their variability are irrelevant. Maintaining this extended concept of differentiation, it follows that
\(N_C^{\circ }\)varies from 1 to \(N_C,\)while structural characteristics vary from complete differentiation to complete absence of differentiation among communities.
When compared with the observed number of communities, \(N_C^{\circ }\) thus displays its twofold nature as a minimum number of communities and as an indicator of the amount of differentiation among communities.
Characterization of the structural aspects encoded in \(N_{T<C}^{e}\) must be traced back to \(N_C^{\circ }\) because of the “leading” function of the latter quantity. For example, \(N_{T<C}^{e}=1\) is equivalent to \(N_C^{\circ }=v_{TC}^e\), and this excludes any variation in types within and between communities. The metacommunity therefore consists of a single type (is monomorphic), which is in accordance with an effective number of types equal to 1. To assess the other extreme of \(N_{T<C}^{e}\), i.e., \(N_{T<C}^{e}=N_T\), let N again be the largest natural number with \(v({{{\varvec{\mathsf{{ q}}}}}}^N_C)\le v_{TC}\) so that \(v^e({{{\varvec{\mathsf{{ q}}}}}}_C^N)\le N\cdot N_T\le N_C^{\circ }\cdot N_T=v_{TC}^e\). In the case of \(N_C^{\circ }=N\), the chain of inequalities implies that \(v_{TC}^e=v^e({{{\varvec{\mathsf{{ q}}}}}}_C^N)\le N\cdot N_T=v_{TC}^e\) and \(v^e({{{\varvec{\mathsf{{ q}}}}}}_C^N)=N\cdot N_T\), with the consequence that the marginal type distribution must be uniform. Since for given \(v_{TC}\) and marginal type frequencies, any change in \(N_C^{\circ }\) goes along with a change in \(N_{T<C}^{e}\) but not in N, one infers that generally \(N_{T<C}^{e}=N_T\) only if the marginal type distribution is uniform. Therefore,
while \(N_{T<C}^{e}\)varies from 1 to \(N_T,\)the marginal type frequencies start with monomorphism and end with maximum polymorphism (where all types are equally frequent).
This result is obvious for Rényi-diversities, since there always \(N_{T<C}^{e}=v^e_T\).
The structural characteristics inherent in the factorization of joint diversity illustrate more vividly the conceptual advantages of the present approach, which focuses on the numbers of communities and types that essentially determine metacommunity diversity. By “essential,” two issues are addressed, factors and their interaction. The factors are specified by the minimally required number of communities (which depends on their differences in type composition) and the associated number of types (effectively represented in the minimally required number of communities). Their interaction is “orthogonal” in the sense of an independent operation in producing the joint diversity.
The factorization reminds of the common approaches to partitioning diversity in such a way that the total (metacommunity) type diversity \(\gamma\) results as a product of the type diversity within communities (\(\alpha\)) with a quantity referred to as (type) “diversity” between communities (\(\beta\)), i.e., \(\gamma =\alpha \cdot \beta\) (see, e.g., Jost 2007). This parallelism asks for pointing out potential relationships between the two approaches with special reference to the diversity within (\(\alpha\)) and between (\(\beta\)) communities as these do not explicitly appear in the factorization.
\(\alpha\)- and \(\beta\)-diversity
Chiu et al. (2014, eq. (6)) introduced an alternative concept of \(\alpha\)-diversity for Rényi-diversities (Hill numbers) that relies on joint diversities and is termed “the effective number of species per assemblage” (types per community). It is treated within the frame of diversity partitioning. To show how their concept relates to the present factorization of joint diversity, note that for Rényi-diversities \(v^e({{{\varvec{\mathsf{{ q}}}}}}^N_C )=v_T^e\cdot N\), and this is set equal to \(v_{TC}^e\) to obtain \(N_C^{\circ }=v_{TC}^e/v_T^e\). Since \(N_C^{\circ }\le N_C\), one arrives at \(v_{TC}^e/N_C\le v_T^e\), where \(v_{TC}^e/N_C\) is argued by Chiu et al. (2014) to replace the classical versions of \(\alpha\)-diversity, i.e.,
$$\begin{aligned} \alpha =\alpha '=v_{TC}^e/N_C \end{aligned}$$
Since \(v_{TC}^e\le N_T\cdot N_C\), one has \(\alpha '\le N_T\), which is meaningful. However, if all communities are monomorphic, \(v_{TC}^e=v_C^e\), and this allows for \(\alpha '=v_C^e/N_C<1\). The interpretation of \(\alpha '\) as “number of species per assemblage” should therefore not be misunderstood to indicate some kind of average of the diversities within communities (as applies to classical \(\alpha\)), since such averages are always greater or equal to 1 for effective numbers. Yet, as is demonstrated in Chao and Chiu (2016), \(\alpha '\) has a broader application including variance decomposition approaches.
In particular, \(v_T^e\) equals \(\gamma\)-diversity, so that \(\alpha '\le \gamma\) and \(\beta =\beta '=\gamma /\alpha '=N_C\cdot v_T^e/v_{TC}^e\). For Rényi-diversity \(N_C^{\circ }=v_{TC}^e/v_T^e\), which yields
$$\begin{aligned} \beta '=N_C/N_C^{\circ }, \end{aligned}$$
and thus reveals the direct connection to the present factorization approach. It follows that \(\beta '\le N_C\) with equality only for complete community differentiation, as it comes close to conventional views of partitioning total diversity (\(\gamma\)) into its component within (\(\alpha\)) and between (\(\beta\)) communities. The authors refer to their \(\beta '\) as an “effective number of...completely distinct assemblages.”
Transferring the general properties of \(N_C^{\circ }\) to \(\beta '=N_C/N_C^{\circ }\), it becomes apparent that only in the complete absence of community differentiation (in the extended sense of equality of relative type frequencies among communities and equal community sizes) is \(\beta '=1\). Variability in community sizes always implies \(\beta '>1\), and \(\beta '\) assumes its maximum of \(N_C\) for complete community differentiation only. Chiu et al. (2014) arrived at a similar result for the special case of Rényi-diversities, where complete absence of community differentiation appears as “all assemblages are identical in species absolute abundances” (p.26).
Considering these relationships, it is tempting to accept the definition of \(\alpha '=v_{TC}^e/N_C\) by Chiu et al. (2014) as a general approach (applying to all diversity measures), and see how this would fit into the common \(\alpha\)-\(\beta\)-\(\gamma\) frame. Indeed, replacing \(\gamma\) (the marginal diversity effective number of types) by \(N_{T<C}^{e}\) (the associated effective number of types) one obtains \(\beta '=\gamma /\alpha '=N_{T<C}^{e}/(v_{TC}^e/N_C)=N_{T<C}^{e}/(N_{T<C}^{e}\cdot N_C^{\circ }/N_C)=N_C/N_C^{\circ }\). Again \(\gamma \ge \alpha '\) with equality, however, only in the complete absence of community differentiation.
Replacement of \(\gamma\) by \(N_{T<C}^{e}\) is justified in the first place by the observation that in the classical approach to partitioning diversity (relying on averaged diversities within communities and where \(\gamma =v_T^e\)), effects of type-community associations as taken account of in the joint diversity are not explicitly considered. On the other hand, the example of Rényi-diversity (where \(N_{T<C}^{e}=v_T^e\)) confirms the existence of families of diversity measures, for which effects of association can be captured in one component (\(N_C^{\circ }\)), while the other component (\(N_{T<C}^{e}\)) maintains its appearance as a marginal diversity. This characterizes the relationships between the present approach of factorizing joint diversity and the common \(\alpha\)-\(\beta\)-\(\gamma\) approach to the partitioning of (type-)diversity.
The fact that the thus generalized \(\beta '\) varies between 1 and \(N_C\) and, because it equals \(N_C\) only for completely differentiated communities, again reminds of the habit to interpret \(\beta\) as an effective number of distinct communities. Yet, the above general reasoning brought forth against quantifying differentiation in terms of numbers of distinct communities is not remedied by the present \(\beta '\). The numerical example in Table 2 demonstrates that each of the individual communities may be clearly differentiated from the others without any of them being completely distinct. In the common interpretation, the values of \(\beta '\) in Table 2 would suggest that more than half of the three communities (\(\beta ' >1.7\)) are “completely distinct.” Distinctness of two among three communities, however, implies that all three are distinct, which contradicts \({\beta '<3}\).
There is actually no need to invoke numbers of distinct communities. The present \(\beta '\) simply reverts the minimum characteristic of \(N_C^{\circ }\) so that \(\beta '\)specifies the maximally possible number of communities that realize the observed joint diversity given the observed trait distribution. The maximum becomes 1 (\(\beta '=1\)) in the complete absence of differentiation (including equal community sizes) and reaches its largest value (\(\beta '=N_C\)) for complete differentiation. Even in the case of complete differentiation, it would be questionable to address \(\beta '=N_C\) as an effective number of “distinct communities,” since this interpretation ignores unequal community sizes so that the same number of communities may apply to the case where one of 10 communities covers 99% of the metacommunity or where all 10 communities have equal share.
To allow for comparison across data sets, normalizations of \(\beta '\) such as \((\beta '-1)/(N_C-1)\) or \((N_C-N_C^{\circ })/(N_C-1)\) may be required. Their interpretation is analogous, with the difference that the latter normalization is \(N_C^{\circ }\) times the former. The two indices are of the general type where the realized value of a variable is relativized with respect to its minimum and maximum value. In the first case, the variable is \(\beta '\) in the second \(N_C^{\circ }\). Both indices can be interpreted as indices of dissimilarity among the communities (for further such indices of the Sørensen, Jaccard, or Morisity type, see, e.g., Chao et al. 2019, Table 1). The absence of type diversities or effective numbers of types in all of these indices is conspicuous. It makes apparent that the information on type distributions that is relevant in the assessment of structural aspects of community differentiation inherent in joint diversities can be summarized in a single quantity, namely \(N_C^{\circ }\). However, remember that \(N_C^{\circ }\) is conditional on the observed type frequencies.
The type-centered perspective
Transition to the dual analogue shifts the focus to the type variable as leading factor with associated effective number of communities. The implied change from the community-centered to the type-centered perspective concentrates on the minimum number \(N_T^{\circ }\) of types that is required to produce the observed metacommunity diversity measured by its joint diversity and given the observed community sizes. To appreciate the significance of this analogue, it is again helpful to take a look at the structural information that can be extracted from \(N_T^{\circ }\) by considering the difference between \(N_T\) and \(N_T^{\circ }\).
Now, \(N_T^{\circ }=1\) only if all types are completely differentiated for their community affiliations, which is tantamount to monomorphism of all communities. Herewith, different communities may be monomorphic for the same type or for different types. Conversely, \(N_T^{\circ }\) reaches its maximum of \(N_T\) only in the absence of differentiation of types for their community affiliations (type differentiation) together with equal frequencies of all types, i.e., for “complete” absence of type differentiation. Thus, as \(N_T^{\circ }\) moves from 1 to \(N_T\), structural characteristics start with complete type differentiation and thus monomorphism of communities and tend toward complete absence of type differentiation.
In an analogous manner, one concludes that the associated number \(N_{C<T}^{e}\) of communities equals 1 if only one community exists, and it equals \(N_C\) only if all communities have equal size.
These demonstrations suggest application of the type-centered perspective to the above-treated method of partitioning diversity proposed by Chiu et al. (2014). \(\gamma\) then becomes \(\gamma _d=N_{C<T}^{e}\), \(\alpha '\) becomes \(\alpha _d=v_{TC}^e/N_T\), and \(\beta '\) becomes \(\beta _d=N_T/N_T^{\circ }\). For Rényi-diversity, the corresponding quantities are \(\gamma _d=v_C^e\), \(\alpha _d=v_{TC}^e/N_T\), and \(\beta _d=N_T\cdot v_C^e/v_{TC}^e\). Now, \(\beta _d\) equals the maximally possible number of types that realize the observed joint diversity given the observed community sizes.
The unfamiliar type-centered perspective can be transferred into more familiar perceptions by considering that for given community sizes, the joint diversity decreases, and thus, \(\beta _d\) increases, as the communities become less polymorphic. The evolutionary and ecological connotations then become apparent when realizing the consequences, for example, of competitive exclusion, drift under isolation, endemism or specialization. All of these are characterized by tendencies toward monomorphism. The opposite, i.e., the complete absence of monomorphism and thus complete polymorphism, implies equal type frequencies in addition to the absence of community differentiation. “Complete” absence of community differentiation is not relevant here, since community sizes are arbitrarily fixed observations.
In a more general context, systems of social or biochemical interaction that imply self- or cross-incompatibility within, between or across communities may directly influence type polymorphisms and their associations with communities. Herewith, self-incompatibility as coded by S-allele systems or realized in heterotypic mating or other heterotypic preferences, for example, tend to stabilize, balance and increase polymorphism, while systems of cross-incompatibility (such as heterozygote disadvantage, homotypic mating or other homotypic preferences, including inbreeding) destabilize polymorphisms and by this imply tendencies toward monomorphism.
Since in self-incompatibility systems, heterotypic combinations are preferred irrespective of community affiliations, potential differences in community affiliation between types tend to be equalized. This situation is characterized by small \(\beta _d\)-values (\(N_T^{\circ }\) close to \(N_T\)) implying high polymorphism within communities. In the same manner, cross-incompatibility tends to enhance differences in community affiliation between types so as to prevent incompatible contacts within the same community. \(\beta _d\)-values are large in this case and imply low polymorphism within communities. From a generalized incompatibility point of view, \(\beta _d\) can therefore be conceived of as reflecting effects on type diversity as are exerted by the continuum of compatibility reactions ranging between complete self- and cross-incompatibility.
Comparison between the two perspectives
It remains to demonstrate how the two perspectives are connected. Since both perspectives rest on the same joint diversity \(v_{TC}\), one obtains \(N_{C<T}^{e}/N_{T<C}^{e}=N_C^{\circ }/N_T^{\circ }\), so that the ratio between numbers of communities and numbers of types is the same for associated effective numbers and for minimum numbers. Among other relationships between numbers of types and numbers of communities, such as \(\gamma /\gamma _d=N_T^{\circ }/N_C^{\circ }\) and \(\alpha '/\alpha _d=N_T/N_C\), it is interesting to see how the same numbers differ between perspectives. In this case, comparisons are to be made between minimum and associated numbers. Taking \(N_C^{\circ }\) and \(N_{C<T}^{e}\), a brief glance at the values in Table 2 informs us that the minimum numbers are all smaller than the associated numbers. The same holds for the number of types \(N_T^{\circ }\) and \(N_{T<C}^{e}\).
The latter observation becomes evident when considering the fact that \(N_C^{\circ }/N_{C<T}^{e}=N_T^{\circ }/N_{T<C}^{e}=N_C^{\circ }\cdot N_T^{\circ }/v_{TC}^e\). Since \(N_C^{\circ }\) and \(N_T^{\circ }\) are minimum numbers based on the respective marginal distributions, one expects that \(N_C^{\circ }\cdot N_T^{\circ }\le v_{TC}^e\) holds in general. For Rényi-diversities, where \(N_C^{\circ }\cdot N_T^{\circ }/v_{TC}^e=v_{TC}^e/(v_T^e\cdot v_C^e)\), it is known that the inequality indeed applies for order \({a=1}\). Yet, the inequality does not extend to orders \({a\ne 1}\) (Gregorius 2010). Hence, in general the quotient is inappropriate for quantifying structural aspects of joint distributions. This confirms that structural characteristics of metacommunities should be assessed separately and independently between the community- and the type-centered perspective.
Indeterminate (latent) community subdivisions
Often, the spatial distribution of communities does not show a sufficient degree of fragmentation to allow for determination of a unique subdivision into subcommunities. As a consequence, more than one subdivision may become relevant, each of which may show a different partitioning of the same underlying overall type diversity. Similarly, different criteria for subdivision, such as the spatial distribution of environmental variables, may be relevant in comparative causal analyses of observed genetic or species variation. In this case, the states of each variable give rise to a subdivision into “communities,” which in turn enables comparison of the effects of the various variables on the observed type variation.
The relevant studies are typically of the kind known from the analysis of latent variables (or factors), which is established especially in population genetics and implemented in methods such as structure (Pritchard et al. 2000), baps (Corander et al. 2003), or geneland (Guillot et al. 2005). When adapted to the present situation, these methods proceed from assignments of the observed individuals to hypothetical communities, where the assignments can be generated by one model, and a second model associates the community affiliations of the individuals with their types. The result of each such assignment is evaluated for fulfillment of special qualification criteria. The aim is to find assignments or functions thereof that optimize the qualification (for a demonstration of the general concept of the model-based analysis of latent causal factors see Gregorius 2018).
An assignment of the above kind specifies a joint distribution of types and community affiliations and therefore allows computation of the joint diversity together with its factorization into \(N_C^{\circ }\) and \(N_{T<C}^{e}\). Here, the associated effective number of types \(N_{T<C}^{e}\) is of only secondary concern, since the marginal type distribution does not vary with the assignments. Particularly for Rényi-diversities, \(N_{T<C}^{e}\) is unaffected by the assignments, since it equals the marginal type diversity. Thus, \(N_C^{\circ }\), \(N_C\) and \(v_{TC}\) retain their significance as indicators of structural characteristics when applied to the assignments. In particular, \(N_C^{\circ }\) increases strictly with \(v_{TC}\) because of the invariance of the marginal type distribution.
In most cases, the models underlying an analysis of latent factors are based on probability laws that specify likelihoods or posterior probabilities of the assignments. These probabilities serve as the primary qualification criterion that is to be maximized (especially maximizing likelihood). This allows consideration of the most likely structural characteristics indicated by \(N_C^{\circ }\) and \(N_C\) and their relations (in terms of \(\beta '\), for example). In many applications, however, such as the methods cited above, the probability laws are used in MCMC-algorithms to estimate the expected values of the indicators, instead of the posterior probabilities of assignments as subdivisions of all individuals into communities.