Be different to be better: the effect of personality on optimal foraging with incomplete knowledge

Many animal populations include a diversity of personalities, and these personalities are often linked to foraging strategy. However, it is not always clear why populations should evolve to have this diversity. Indeed, optimal foraging theory typically seeks out a single optimal strategy for individuals in a population. So why do we, in fact, see a variety of strategies existing in a single population? Here, we aim to provide insight into this conundrum by modelling the particular case of foraging seabirds, that forage on patchy prey. These seabirds have only partial knowledge of their environment: they do not know exactly where the next patch will emerge, but they may have some understanding of which locations are more likely to lead to patch emergence than others. Many existing optimal foraging studies assume either complete knowledge (e.g. Marginal Value Theorem) or no knowledge (e.g. Lévy Flight Hypothesis), but here we construct a new modelling approach which incorporates partial knowledge. In our model, different foraging strategies are favoured by different birds along the bold-shy personality continuum, so we can assess the optimality of a personality type. We show that it is optimal to be shy (resp. bold) when living in a population of bold (resp. shy) birds. This observation gives a plausible mechanism behind the emergence of diverse personalities. We also show that environmental degradation is likely to favour shyer birds and cause a decrease in diversity of personality over time.


Introduction
When should an animal leave its foraging patch in search of another? This is a core concern for understanding behavioural choices in animal populations, with a long history of theoretical and empirical study. In an environment where predators have complete knowledge of their surroundings, Charnov (1976) proposed that the optimal foraging strategy will depend on the quality and distribution of patches in the environment. Charnov's Marginal Value Theorem states that an animal should leave the patch it is foraging in when their rate of energy gain drops below the average energy gain for the environment, which includes the cost of travelling between patches. Optimal foraging theory has been extended to more detailed stochastic models (Oaten 1977), which have been used to understand how model attributes affect foraging, such as prey distribution (Higginson and Ruxton 2015), patch distribution (Nonaka and Holme 2007;Rodríguez-Gironés and Santamaría 2006) and group competition (Laguë et al. 2012).
In the case of seabirds, however, individuals only have partial knowledge of their ocean environment, and are constantly gathering new information whilst foraging. Whilst predictability of foraging patches is low in oceanic waters, in temperate and polar regions, features such as continental shelf edges, frontal zones and upwellings create a slightly more predictable foraging habitat, which seabird populations have been observed to exploit (Weimerskirch 2007). The uncertainty in the environment and competition for more predictable resources creates an explorationexploitation trade-off foraging behaviour; the individual must chose to either explore the surrounding area to find higher quality resource patches and gain knowledge of their surroundings for future foraging trips, or exploit already-known habitats for food (Eliassen et al. 2007).
Recent studies have shown that an individual's choice of foraging strategy, i.e., where it lies on the exploration-exploitation trade-off, is highly correlated to its personality on the bold-shy continuum (Patrick et al. 2017). Bold birds tend to show higher levels of aggressive behaviour in response to novel objects, compared to shyer birds (Patrick et al. 2013). They are also more likely to favour exploration than their shyer counterparts; they will visit more patches, spending less time foraging in each and will spend less time searching between each patch. Shyer birds are more likely to favour exploitation; they will visit fewer patches, but they will fully exploit each patch they visit and will spend more time searching for higher quality patches (Patrick et al. 2017).
The aim of this paper is to explore seabird foraging strategies in an environment where individuals do not have complete knowledge of their surroundings. In particular, we focus on the differences in individual strategies that arise from personality differences along the shy-bold continuum within the population. In doing so, we aim to provide insight into why a diversity of personalities may have evolved, as well as understanding which strategies (and hence personalities) will be optimal in different environments. The ultimate aim is then to shed light on which personalities are likely to benefit or suffer from environmental change, and therefore give predictions as to how seabird personality is likely to evolve as a result of such changes.
For this, we propose a simple model to describe the energetics of seabirds foraging within a patchy resource environment. We assume that the location and quality of patches is not known to birds, but that they have some idea of the overall quality of the environment. In particular, we assume they have an idea of how long they are likely to need to search before finding their next patch. We begin by examining the optimal strategy for how long a bird should stay in a patch, assuming a fixed search time. The optimality question is phrased in two different ways: maximising rate of energetic gain and minimising the risk of unsuccessful foraging. The latter is defined to be a foraging trip where the energy lost through searching outweighs the energy gained through foraging.
We then examine the effect of personality on foraging strategy, by introducing a trade-off curve along the bold-shy continuum, whereby bold (resp. shy) birds have shorter (resp. longer) search times leading to discovery of the lower (resp. higher) quality patches. We calculate the optimal foraging strategy, and hence personality, given details about the personality make-up of the population. In doing so, we show whether it is optimal to be different or to be the same as everyone else.

Single bird model
We begin by considering the foraging behaviour of an individual seabird in the ocean, modelled as an environment with patchy resources. We examine individual in-patch foraging times for maximising rate of energy gain and minimising risk of energy loss.
Our model is split into two separate phases of behaviour: foraging in a patch and searching for a new patch. A seabird will forage in a patch. Then, when it is energetically favourable to, it will leave the patch to find a new patch. We define this process as a forage-search event, and each foraging trip will consist of multiple forage-search events. We determine how successful individuals are at foraging by considering their change in energy whilst foraging (Fig. 1).
During the foraging phase of the model, we consider the temporal behaviour of two variables: patch quality p(t) and individual energy gain u(t). Patch quality decays exponentially, whilst the individual gains energy from the patch at a rate proportional to the patch quality. The governing equations are with solutions where p 0 > 0 is the initial patch quality, > 0 is the rate of decay of the patch quality during foraging, u * > 0 is the initial energy of the individual upon entering the patch, > 0 is the proportion of available forage converted to energy by an individual, and 1 is the rate of energy loss due to metabolic functioning. Equation (3) assumes patch quality decays exponentially during foraging. Equation (4) assumes that the energy gain of an individual is proportional to the quality of the patch at time t, and that the individual loses energy due to metabolic functioning at a constant rate.
During the searching phase, we assume that an individual loses energy u at an additional rate 2 to account for additional energy costs whilst flying, where u * > 0 is the initial energy of the individual upon leaving the patch. We also model search times for an individual as a random variable t s ∼ Exp( ).
(1) dp dt = − p, To analyse the behaviour of the model, we introduce dimensionless variables and dimensionless parameters into our model, as detailed in

Foraging times
When should an individual leave a patch in search of another? To forage successfully, an individual may want to attempt to maximise its rate of gain in energy. As foraging ( ) = * − (1 + ) . efficiency will decrease the longer an individual stays in a patch, the individual may wish to move to another patch soon after arriving at the first patch. However, due to uncertainty in knowledge of where patches will be and how high quality they will be, searching for a new patch can be risky. Fig. 1 An example of how energy is gained and lost by an individual during a foraging trip. During a foraging event, an individual gains energy at a rate proportional to the quality of patch. As the patch depletes, so too does the rate of energy gain, and at some point the bird decides to leave the patch in search of another Longer search times may result in an individual losing more energy whilst searching than gained whilst foraging. This presents an alternative foraging strategy where an individual will stay foraging in a patch longer to minimise its risk of unsuccessful foraging; that is, losing more energy searching than gained foraging. In this section, we derive these optimal foraging times for both scenarios: maximising energy gain and minimising risk.

Maximising rate of energy gain
Given an expected search time, what is the optimal length of time for an individual to forage in a single patch in order to maximise its rate of energy gain? To answer this, we consider the rate of energy gain for one forage-search event. We consider the case where the search time is always 1∕Λ = [ s ] , the mean of s , and assume the initial patch quality 0 is known to the individual. The rate of energy gain for one forage-search event is given, in dimensionless parameters, by where f is the (dimensionless) energy gained whilst foraging, and f and s are the (dimensionless) times spent foraging and searching respectively (Table 1). Although a foraging trip will contain multiple forage-search events, we base our analysis on one and assume that patch quality does not vary sufficiently to have a big effect on optimality. (13) The rate of energy gain r is maximised when f = * f , given by where W is the Lambert W function, the inverse of the function f (z) = ze z (Appendix 7). In Fig. 2, we see that this time increases as expected search time increases, initial patch quality decreases, and the flight energy cost increases. All of these three situations can be considered as examples of worsening environmental conditions for the animal.

Minimising the risk of unsuccessful foraging
The environment which seabirds forage in is uncertain, and changing environmental conditions will likely increase this uncertainty. Environmental uncertainty motivates an individual foraging time which aims to minimise the probability that a forage-search event is unsuccessful. Here we revert to modelling search times as a random variable, s ∼ Exp(Λ) , and ask how long should an individual forage in a patch to minimise their risk of losing energy during a forage-search event?
To answer this, we calculate the probability, P( s > f ) , that one forage-search event is unsuccessful, i.e. the (dimensionless) energy gained whilst foraging, f , is less than the (dimensionless) energy lost whilst searching, s . As s is dependent on the random variable s (Table 1), we have that This foraging time maximises the amount of energy gained during the foraging phase, f . Therefore, to minimise the probability of an unsuccessful foraging event, the individual should stay foraging in the patch until their rate of net energy gain becomes negative. As c f only depends upon 0 , this minimum risk foraging time is only affected by the patch quality, p 0 and the individual's foraging ability, quantified by , the rate at which available forage is converted to energy (Table 1). (Note that parameters and Λ only affect the probability of an unsuccessful forage-search event, and not the time an individual should stay in a patch to minimise this probability.)

Analysis of foraging times
In Fig. 2 we compare the effect of model parameters on the foraging time that maximises the rate of energy gain (Eq. 14), and the foraging time that minimises the probability of net energy loss (Eq. 16). Increasing dimensionless parameter 1∕Λ increases the expected search time 1∕ , decreasing dimensionless parameter 0 decreases the initial patch quality p 0 , and increasing dimensionless parameter increases the flight energy cost 1 . The time * f increases as the search times get longer, the patch quality decreases, and the flight energy cost increases. All of these factors can be considered as features that make the environment less favourable for foraging seabirds. Consequently, the rule of thumb is that, to maximise energetic gain, a poorer environment for foraging should be mitigated by longer foraging bouts.
The minimal risk foraging time, c f , however, shows a rather different trend. This value only depends on 0 . Converse to the effect of 0 on * f , here we see that c f decreases as 0 decreases. This means that, as conditions worsen, individuals should decrease foraging time if they are seeking to reduce risk, but increase foraging time if seeking to maximise average gains. Note that as conditions worsen and the rate of energy gain r( f ) decreases to 0, the optimal foraging times under our two different optimality scenarios, * f and c f , converge until * f = c f , when r( f ) = 0 . When r( f ) = 0 , the energy gained foraging is equal to the energy lost whilst searching. In the scenario where the optimal rate of energy gain an individual can achieve is 0, the associated optimal foraging time will be the time taken to maximise energy gain whilst in a patch, else the individual would be able to adjust their foraging time to achieve a positive rate of energy gain. This foraging time would be * f = ln 0 , which is equivalent to c f .
In Fig. 3a we compare the effect of model parameters on the optimal rate of energy gain r( * f ) (Eq. 13) when f = * f (Eq. 14). In Fig. 3b we compare the effect of model parameters on P( s > c f ) , the minimal risk of an unsuccessful forage-search event (Eq. 15) when f = c f (Eq. 16). We see the optimal rate of energy gain decreases and the probability of risk increases under less favourable conditions: longer expected search times, higher flight energy costs, and lower initial patch qualities.

Personality as a driver for a foraging strategy trade-off
We have presented two indicators of foraging success for individuals choosing the length of time they forage in a patch: optimising the rate of energy gain and minimising the risk of unsuccessful foraging. Whilst foraging time is the only parameter individuals have complete control over, individuals can prioritise patches of higher quality or shorter search times between patches. This results in a foraging strategy trade-off.
Different individual personalities tend to prefer different foraging strategies (Patrick and Weimerskirch 2014;Patrick et al. 2017). Bolder personalities favour exploration; they will move quickly around the environment (high expected search rate Λ and short expected search time 1∕Λ ), visiting more patches which are more likely to be smaller and of lower quality (low initial patch quality 0 ). Shyer personalities favour exploitation; they will spend longer searching for a patch (low Λ , short 1∕Λ ), visiting fewer patches of higher quality (high 0 ).

Trade-off
We model this foraging strategy trade-off by extending our forage-search model presented in Eqs. (3)−(5) to include a trade-off between 0 , the initial patch quality, and Λ , the expected search rate. We use the following (phenomenological) functional form for our trade-off curve where c 1 and c 2 are constants. The +1 term enforces the 0 > 1 condition required to make the rate of gain in energy whilst foraging in Eq. (8) initially positive. The shape of the trade-off curve is modified to account for competition between individuals with a similar strategy by varying b, a measure of population boldness. When b is increased, bold individuals have lower patch qualities and shy individuals have higher patch qualities to account for a change in competition, and vice-versa as b is decreased. Figure 4 shows example trade-off curves. Individual foraging strategies are represented by a point on a given trade-off curve, which dictates the expected search time and patch quality during a forage-search event. We assume that 0 is the quality of a typical foraging patch for an individual with expected search time 1∕Λ . Whilst initial patch quality will vary across multiple forage search events, and these initial patch qualities will be unknown to the individual until they reach a patch, we assume 0 is a mean and variation around this mean will not have a large effect on optimality. There is an intermediate strategy at Λ = c 2 and 0 = c 1 + 1 . Bolder individuals with shorter expected search times and lower patch qualities lie on the curve where Λ > c 2 and 0 > c 1 + 1 , whilst shy individuals with higher patch qualities and longer expected search times lie on the curve where Λ < c 2 and 0 < c 1 + 1 . As we vary population boldness b, we assume the expected search time for each individual, 1∕Λ , remains fixed. Increasing population boldness b results in bold individuals with higher Λ , having a lower initial patch quality 0 due to increased competition, and shy individuals with a lower Λ , having a higher initial patch quality 0 due to decreased competition.

Optimal rate analysis
We measure foraging success by the rate of energy gain r( f ) during a forage-search event, as defined in Eq. (13). When evaluating r at f = * f , the optimal foraging time in Eq. (14), we obtain the optimal rate of energy gain r( * f ) for an individual with expected search time 1∕Λ . How does population boldness affect this? Substituting in the foraging strategy trade-off in Eq. (17) gives where is the optimal foraging time for an individual of expected search time 1∕Λ within a population of boldness b. In Fig. 5 we see that this foraging time increases for individuals with longer expected search times (i.e. shyer individuals).
Maximising r( * f ) with respect to Λ gives the optimal position on the trade-off curve, with expected search time denoted by 1∕Λ * (derived in Appendix 8). This search time gives the most successful individual foraging strategy for an individual foraging in a population of boldness b, as presented in Fig. 6. As population boldness increases, the optimal expected search time 1∕Λ * increases, so in a population of bolder individuals, individuals with a longer expected search time (i.e. shyer individuals) are most successful, and vice-versa.
We also look at the success of the population as a whole, by considering b-values that represent a shy population, an intermediate population, and a bold population (insets of the optimal expected search time, 1∕Λ * = 1∕c 2 (derived in Appendix 8.1). In both the bold and shy populations, we see the individuals with highest optimal rates of energy gain r( * f ) are those with contrasting personalities to the rest of their population. In the intermediate population, the optimal expected search time 1∕Λ * has a lower optimal rate of energy gain r( * f ) than in the bold and shy population. Individuals with expected search times close to the intermediate strategy are more successful than individuals with outlying strategies.

Minimal risk analysis
Here, we measure the risk of unsuccessful foraging by considering the probability, P( s > f ) , that the net energy change in one forage-search event is negative. How does population boldness affect an individual's risk of unsuccessful foraging? Recall that this risk is minimised by f = c f (Eq. 16). Substituting the trade-off curve from Eq. (17) into Eq. (16), we see that the minimal risk foraging time for an individual with expected search time 1∕Λ in a population of boldness b is  In Fig. 5 we see that this foraging time increases for individuals with longer expected search times (shyer individuals). The probability of foraging unsuccessfully given this minimal risk foraging time is then By minimising Eq. (21) with respect to Λ , we find the search time required to minimise the probability of unsuccessful foraging. This is given implicitly by the following equation (see Appendix 10 for derivation) We denote by Λ c (b) the value of Λ that solves Eq. (22) for a given value of b. Figure 7 shows the relationship between population boldness b and the expected search time 1∕Λ c of (20) an individual choosing an optimal strategy for minimising risk. In a population of bolder individuals, it is optimal to be a shy individual with a longer expected search time, and in a population of shyer individuals, it is optimal to be a bold individual with a shorter expected search time.
We also look at the minimal individual risk within the whole population, for selected b-values showing a shy, intermediate, and bold population (insets of Fig. 7). Again the intermediate population is defined as the b value for which the minimal risk individual strategy is the intermediate expected search time, 1∕Λ * = 1∕c 2 (derived in Appendix 10.1). In a population of shyer individuals, the individuals with lowest individual minimal risk are relatively bold, and in a population of bolder individuals, the individuals with lowest risk are relatively shy. In the intermediate population, the minimal risk individual strategy 1∕Λ c has higher risk than in the bold and shy population. Individuals with expected search times close to the intermediate strategy have lower risk than individuals with outlying strategies.

Discussion and conclusions
We have derived analytic expressions for optimal foraging strategies in a patchy environment on which individuals only have partial knowledge. We have shown how these strategies depend upon the underlying environment, the individual's personality, and the personality composition of the population. Previous empirical work has shown that personality affects an individual's foraging strategy along the exploration-exploitation trade-off (Patrick et al. 2017) but it is not fully resolved as to why different strategies should exist in different individuals. By modelling an individual's personality as a position on this trade-off curve, we have shown that it is optimal to have a different personality to the rest of the population. Specifically, in a population of shy (resp. bold) individuals, it is optimal to be bold (resp. shy) (Figs. 6 and 7). From this, we expect that over time shy (resp. bold) populations would become more bold (resp. shy), moving towards an intermediate population. In principle, an intermediate population would remain homogeneous. However, it is likely that bolder or shyer individuals would enter the population through birth or immigration, creating a need for the population to be rebalanced by shyer or bolder individuals. This supports evidence showing that personality may be mainted in the population due to frequency dependent selection (FDS) linked to reproduction (Dingemanse et al. 2004) and life history trade-offs (Wolf et al. 2007).
Our conclusions can be viewed as an example of negative FDS. FDS is known to act throughout the animal kingdom as rarity minimises competition and conspecific encounter. Studies have shown the FDS can give rise to, and maintain, personality difference themselves (e.g. Wolf and McNamara 2012) and foraging specialisations (e.g. Bolnick et al. 2003;Rueffler et al. 2006). Our study builds on this work and shows that FDS alters the optimal exploration-exploitation strategy in a population. Assuming personality is fixed within an individual (Sih et al. 2004), selection on the strategy individuals use to acquire resources may lead to selection on personality type itself, and alter the composition of populations. To assess the foraging success of individuals, we present two theoretical optimal foraging times for individuals foraging in a patchy ocean environment: optimising the net rate of energy gain (the optimal rate strategy), and minimising the risk of losing more energy searching for a new patch than gained foraging (the minimal risk strategy). Optimal foraging theory (Charnov 1976) presents an optimal time for foraging in a known environment, via the Marginal Value Theorem (MVT). Indeed, if one sets = 0 (no energetic cost of movement) in our single-individual optimal rate strategy model, we have the same scenario as in MVT and return the same optimal foraging time, given in Eq. (14).
However, seabirds do not have such complete, deterministic knowledge so are unable to determine this optimal time. Rather, individuals will have knowledge of the patch they are currently foraging in, as well as an intrinsic strategy (along the bold-shy continuum) which determines the amount of time they are likely to spend searching for a good patch and consequently the quality of the patches they are likely to settle in. This leads to both our examination of the minimum risk strategy as well as the effect of personality on optimality. Our study can be viewed as providing a general theory of optimal foraging whenever such partial environmental knowledge and personality variations are in place.
For an individual to minimise its probability of foraging unsuccessfully, it should stay in a patch until it is fully exploited (Eq. 16); that is, when the rate of energy gain from foraging drops below the rate of metabolic energy loss. However, for an individual to maximise its rate of energy gain, it should leave earlier (Eq. 14), even though the rate of energy gain from the patch is higher than the rate of metabolic energy loss. Less favourable conditions (i.e. longer expected search times, lower initial patch qualities and increased flight energy cost) lead to an increase in optimal rate foraging time, which converges to the minimal risk foraging time as conditions become so bad that the foraging environment becomes unprofitable (Fig. 2). Less favourable conditions also lead to individuals being less successful at gaining energy whilst foraging, and incurring an increased risk of foraging unsuccessfully (Fig. 3).
As well as modelling a single individual, we also compared foraging times and success of different personalities within a population, via the inclusion of the foraging strategy trade-off (Eq. 17, Fig. 4). We showed that it is optimal for shy individuals, who prioritise higher patch qualities over shorter search times, to stay in patches longer to maximise their net energy gain whilst foraging (Fig. 5), which supports empirical observations in seabirds (Patrick et al. 2017). In less favourable environments, the optimal rate foraging time increases; therefore shy birds will be more resilient to the effects of a degrading environment. Conversely, bolder birds may have increased foraging success if climate change causes their environment to improve locally.
Our modelling framework is based on the concept of energetic costs and gains. These vary depending on both the species being studied, due to differences in physical features such as body size and wing span, and environmental conditions such as wind. A key parameter in our model is the effect of flight energy costs on individual foraging success, given by the ratio = 2 ∕ 1 (Table 1), where 1 is the metabolic energy rate, and 2 is the flight energy cost (Dunn et al. 2018). Calculating this ratio for a particular species requires careful consideration of physical and environmental factors. For example, wandering albatross use increased winds to their advantage, saving energy costs by soaring, whereas diving birds such as puffins have high energy costs fighting against the wind (Elliott et al. 2014;Weimerskirch et al. 2000, Furness andBryant 1996). Kittiwakes likewise have relatively high flight energy costs; they can soar but high winds increase flapping time (Gabrielsen et al. 1987).
The optimal rate of energy gain decreases with and the risk of unsuccessful foraging increases (Figs. 2 and 3). Consequently, birds that have increased flight energy cost (e.g. due to being less efficient at flying long distances) have a higher probability of the forage-search event being unsuccessful. In this case, it is more optimal to spend a long time depleting a patch before moving on (a conservative strategy). Conversely, birds who are able to soar for long distances without expending much energy may benefit by moving on from a patch before it has been depleted in order to search for a better patch (a less conservative strategy).
As well as seabird physiology, the overall quality of the environment is also a key aspect of our model. This comes in via the parameter 0 (Table 1) which can be thought of as a dimensionless proxy for the quality of a typical foraging patch. Both increased water temperatures and the presence of fisheries can have an effect on patch quality (Furness and Tasker 2000). Anthropogenic actions are responsible for current changes in both these factors. We have shown that these can result in changes in optimal behaviour (Fig. 3) and therefore alter the evolutionary trajectory of seabirds.
Alongside changes in overall environmental quality, changes in environmental uncertainty can also affect the optimal foraging strategy. Such uncertainty is likely to lead to longer expected search times, which can be viewed through the dimensionless parameter 1∕Λ in our model. As 1∕Λ increases, the rate energy gain decreases and the probability of an unsuccessful forage increases (Fig. 3). Consequently, similar to environmental degradation, environmental uncertainly can have a negative effect on foraging and may drive changes in the personality composition of seabird populations towards favouring a narrow and shyer range of personalities. We are currently examining an individual based model that builds on this to incorporate explicitly the effects of movement, variable patch quality, and competition for resources, in which we aim to gain more detailed insight into the effects of environmental changes on the evolution of personality, and will be the subject of future work.
In conclusion, we have given mechanistic underpinnings for the emergence of differing personalities in seabird populations, via consideration of their foraging strategies in environments on which they have partial knowledge. Furthermore, we have shown that changes in environment, such as those currently caused by anthropogenic actions (e.g. increased fishing, climate change), will affect the optimal personality types and thus the evolution of personality in seabird populations. Overall, we predict that environmental degradation, either by increased uncertainty or decreased overall foraging quality, will cause the personality pool of seabirds to become shyer and less diverse.
Varying the shape of the trade-off curve for optimal rate analysis How does the shape of the trade-off curve in Eq. (17) affect these results? By varying parameters c 1 and c 2 , we alter the shape of the curve and show the effects of changing initial patch qualities and expected search times for the whole population. Increasing c 1 increases 0 by a factor of c 1 for all individuals on the trade-off curve, which increases an individual's patch quality. Increasing c 2 increases Λ by a factor of c 2 for all individuals on the trade-off curve, which decreases an individual's expected search time.
When varying c 2 , an individual's position on the tradeoff curve is now defined by c 2 ∕Λ , as this maintains their position on the trade-off curve as the shape of the curve is altered, and so too is their expected search time.
Supplementary Figure S1 shows the effects of varying c 1 and c 2 on the optimal individual within a population of boldness b. As we increase patch quality by increasing c 1 , we see the optimal expected search time within the population increase, so the optimal individual foraging strategy becomes more shy. If we decrease expected search times by increasing c 2 , this also favours shy individuals, with the optimal individual strategy within the population favouring individuals with longer expected search times.

Derivation of minimal risk individual in a population
We use Eq. (38) to find the individual expected search time 1∕Λ which minimises the individual risk P c in a population of boldness b. We obtain the expression which gives the minimal risk expected search time 1∕Λ c for an individual in a population of boldness b.

Intermediate population boldness b
When considering the minimal risk of foraging unsuccessfully for an individual with a population of boldness b, we define the intermediate population boldness as the population where the intermediate individual, with expected search time 1∕Λ = 1∕c 2 , is least likely to be unsuccessful, i.e when

Varying the shape of the trade-off curve for minimal risk analysis
We again consider the effects of changing the shape of the trade-off curve in Eq. (17), defining individual foraging strategies within the population on the curve by c 2 ∕Λ . In Supplementary Figure S2, as we increase c 1 , we increase the patch quality for individuals, which causes the minimal risk individual strategy to become more bold. However, as we vary c 2 , varying expected search times for the population as a whole, the minimal risk individual foraging strategy is unaffected. We still see variation in the risk to each individual P( s > f ) as in Fig. 3, but the individual with the lowest risk remains unchanged.