Evolution of sustained foraging in three-dimensional environments with physics

Chaumont, Nicolas; Adami, Christoph

doi:10.1007/s10710-016-9270-z

Evolution of sustained foraging in three-dimensional environments with physics

Open access
Published: 07 June 2016

Volume 17, pages 359–390, (2016)
Cite this article

Download PDF

You have full access to this open access article

Genetic Programming and Evolvable Machines Aims and scope Submit manuscript

Evolution of sustained foraging in three-dimensional environments with physics

Download PDF

Nicolas Chaumont¹^nAff2 &
Christoph Adami³

7012 Accesses
3 Citations
Explore all metrics

Abstract

Artificially evolving foraging behavior in simulated articulated animals has proved to be a notoriously difficult task. Here, we co-evolve the morphology and controller for virtual organisms in a three-dimensional physical environment to produce goal-directed locomotion in articulated agents. We show that following and reaching multiple food sources can evolve de novo, by evaluating each organism on multiple food sources placed on a basic pattern that is gradually randomized across generations. We devised a strategy of evolutionary “staging”, where the best organism from a set of evolutionary experiments using a particular fitness function is used to seed a new set, with a fitness function that is progressively altered to better challenge organisms as evolution improves them. We find that an organism’s efficiency at reaching the first food source does not predict its ability at finding subsequent ones because foraging efficiency crucially depends on the position of the last food source reached, an effect illustrated by “foraging maps” that capture the organism’s controller state, body position, and orientation. Our best evolved foragers are able to reach multiple food sources over 90 % of the time on average, a behavior that is key to any biologically realistic simulation where a self-sustaining population has to survive by collecting food sources in three-dimensional, physical environments.

How morphological development can guide evolution

Article Open access 17 September 2018

Virtual Spatiality in Agent Controllers: Encoding Compartmentalization

Evolving and generalising morphologies for locomoting micro-scale robotic agents

Article Open access 01 December 2022

Find the latest articles, discoveries, and news in related topics.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Foraging is essential to the survival of countless animal species, and involves a variety of behaviors ranging from basic chemotaxis in bacteria [1] and C. elegans [2] to elaborate cooperative behaviors, for example in group hunting used by predators such as lions [3] or bottlenose dolphins [4, 5]. Despite field-specific nuances, foraging involves three basic skills: Sensing and locating a target, approaching it so that it is within reach, and finally handling it in some manner. In biology, foraging implies the search for a resource, usually food. Then, either it is consumed on site (such as nectar for hummingbirds) or it is brought back to the nest (e.g., in bees), in which case foraging also involves homing. The inherent link between the forager and the resource foraged stems from the necessity to develop some type of morphological, physiological or behavioral apparatus for finding, reaching and processing the resource. This forager–resource interaction is at the origin of a variety of fundamental phenomena such as mutual forager–resource co-evolution [6, 7], predator–prey co-evolutionary arms race [8, 9], or the emergence of migratory, nomadic, and sedentary behaviors [10–12]. Foraging can also give important insights into ecosystem stability and food web dynamics[13–15].

The study of foraging strategies has led to many advances, and profoundly influenced engineering and computer science. The method of stochastic optimization is a prime example of this influence, with the development of well-established search algorithms such as ant colony optimization (ACO) [16], as well as particle swarm optimization (PSO) [17]; more recent search heuristics based on foraging concepts are the bacterial foraging optimization algorithm (BFOA) [18], and artificial bee colony (ABC)[19]. Information foraging theory (IFT), which is a direct application of optimal foraging theory, has deeply altered human–computer interaction research [20, 21], and has recently been applied to software engineering [22]. Coordination displayed by foraging social insects inspire new heuristics in robotics for navigation [23], homing [24], and division of labor [25].

In biology, despite contributing to the mechanistic explanation of foraging behavior at the genetic [26–29], molecular [30], and neural levels [31, 32], the study of foraging and its evolution has not sufficiently improved our understanding of this phenomenon to the point where it is reproducible in artificial legged animals. One reason for this failure may be that goal-directed behavior in higher animals has taken a very long evolutionary route, by gradually evolving from ancient instances in organisms that can be traced back to the first prokaryotes [30]. Therefore, studying the evolution of foraging is mostly concerned with comparative genomics, physiology, and neuroscience, and provides few if any clues about the necessary evolutionary pressures, as well as behavioral milestones, that enable the evolution of dependable, sustained foraging behavior.

For this reason, the reproduction of foraging in an artificial medium—for example for the purpose of studying aspects of foraging that are impossible to study in vivo—still poses very serious challenges to engineers and researchers and remains a subject of very active research [33–39]. Indeed, the problem is a classical benchmark against which newer optimization techniques are constantly tested (see, e.g., [40, 41]). The most obvious applications of foraging might be in robotics, because both animals and robots exist and interact in the same real world, are subject to the same physical laws, and because foraging is the precursor for many real world applications in robotics (e.g., automatic vacuum cleaning, planetary exploration, land mine clearing, bomb or hazardous material handling, rescue operations, etc.). Foraging in robotics research has a slightly different meaning than in biology, and refers to the task of reaching one or more objects, collecting them, and taking them to one or several locations. Homing is almost always implied. Because of technical limitations, the idea of using the collected object as a resource to provide additional autonomy to the robot, such as a usable spare part for self repair or connecting an additional battery to provide additional energy autonomy (the equivalent of eating for animals), is essentially absent, except in very rare cases [42, 43].

An area where foraging holds an important place is the simulation of three-dimensional, physical environments. Such environments constitute ideal prototype worlds that provide sufficient realism to test various concepts in robotics [44–47]. Simulations in 3D physical environments are much easier to set up compared to their hardware counterpart, they can be executed many times faster than real time, and allow the same experiment to be reproduced perfectly (see, e.g., [48]). Simulated environments with physics are also extensively used outside the field of robotics, such as the simulation of physical phenomena in computer graphics [49–53], and gaming [54], to name but a few. In biology, environments with a full simulation of physics can be used to study the evolution and usage of physical and behavioral traits that are specifically adapted to overcome the constraints dictated by the simulated laws of physics, and can help better understand how similar traits evolved and are used by real organisms [55–58]. Although this work takes a different approach, agent based simulations have been used for decades to study species interactions, including foraging [59–61].

The path to the evolution of foraging in 3D articulated virtual organisms is lit by several beacons, such as the evolution of forward locomotion [62–66], fighting [67, 68], competing for a single food source [62], and the evolution of light following [69, 70]. Goal-directed movement to multiple arbitrary locations has been achieved independently by Pilat et al. [39], who obtained results similar to ours but using a very different method and fitness function, suggesting that sustained foraging is a robust evolutionary trait.

Evolving autonomous sustained foraging allows a whole population of foraging organisms to inhabit a 3D physical world to autonomously feed, reproduce, and evolve to produce a fully sustainable population [71] that could even radiate into distinct species and give rise to ecosystems. This kind of simulation framework would enable the study of a variety of aspects related to evolutionary population dynamics, such as predator–prey dynamics [72–74], exaptation [75], modes of speciation, mate selection, group formation, as well as communication and ecosystem stability, within a framework that spans several levels of organization ranging from the response of a single neuron all the way up to the dynamics of an entire ecosystem.

In this work, we report the evolution of 3D articulated organisms subject to the laws of physics, that are able to forage food items randomly placed many body lengths away from them reliably and multiple times. We expect that such foragers are able to survive autonomously in an open-ended environment where they can reproduce after gathering enough food sources, and found a sustainable population (see also [71]).

2 Methods

In this work we use a 3D “virtual creatures” environment that is inspired by Sims’ work [62, 69]. Virtual creatures are composed of several 3D rectangular solid body components that are connected via simple hinge joints. Body components are not allowed to penetrate other body parts unless directly connected. A creature’s phenotype is represented by a rooted tree whose nodes correspond to the body parts and whose links correspond to joints. The genotype of a virtual creature in turn is a directed graph, with nodes that represent the information of body-part size, as well as recursion. The edges of the graph correspond to joints and store contact position information. The virtual organism’s behavior is generated by neural signals converted to torque impulses applied at the joints that connect the rectangular boxes that form the body. Sensors dedicated to touch, proprioception and food source detection ensure organisms have a way to sense their environment and act upon it.

2.1 Virtual organisms

The virtual organisms used in this work are implemented in a platform we call “EVO” (Evolution of Virtual Organisms). The platform has been used before in [65] and [76], and the walkers we evolve are very similar to those obtained previously. The main differences lie in the genome encoding and the neural controller. This version of EVO provides a default grammar and a set of 13 keywords to describe the genome structure, and the rules to initialize and mutate it. The genome description is specified in a script-like fashion (see Supplementary Material for a description of the script) and stored in a file. Once read, the description is interpreted and stored in data structures that are used by plugins in charge of initializing and mutating the organism’s genome. Without this script, the user would have to code by hand, compile, and debug the functions that initialize and mutate each new genetic structure, and go back to the code each time the structure is modified, which is laborious. These modifications do not involve any coding or re-compilation in the current version of EVO, and the genetic description can be written much faster than if coded, typically in a few minutes.

The controller used in this work has more neural types compared to the 2006–2007 version, implementing all those used by Sims [62, 69]. The types are defined by the manner in which they process inputs: Sum, Product, Divide, SumThreshold, GreaterThan, SignOf, Min, Max, Abs, If, Interpolate, Sin, Cos, Atan, Log, Exp, Sigmoid, Integrate, Differentiate, Smooth, Memory, Wave, Saw, and constant. The introduction of additional neural types is expected to help out evolution via their diversity, but whether this indeed occurred was not tested.

In addition to the contact sensors, each joint has an angular proprioceptive sensor. Furthermore, the root block—from which the rest of the morphology is created—has two sensors that return the angle and the distance to the closest food source. The angle is measured between the organism’s forward vector and the vector that originates from the organism’s root block center of gravity, and points to the closest food source.

2.2 Evolutionary algorithm

The evolutionary algorithm used in this work is a steady-state genetic algorithm (SSGA) [77, 78] that starts by evaluating an initial population of 200 organisms. The evaluation phase typically consists of simulating each individual in the population with the same set of targets and calculating the geometric mean of the fitness obtained for each target. Since the individual and its environment are destroyed and recreated anew for each simulation, the organism retains no history about its past experience with previous targets.

Once every organism is evaluated, in the selection phase 20 % of the population survives, and is used to repopulate the vacant 80 %. Then selection strategies can be used to choose a survivor such as elite and tournament. In elite selection, the organisms with the highest fitness in the population are guaranteed to survive, while in roulette selection the organisms with higher fitness have a higher probability to leave offspring. In tournament selection, the best of a group of organisms (with given tournament size) is selected to populate the next generation, with as many repeats of the tournament as necessary to fill up the population.

The reproduction phase is very similar to that of Sims [62, 69]: An empty slot is filled with an organism generated from the survivors. The new individual is either a clone with 30 % probability, or a recombination between two survivors. In either case, the new organism undergoes a 1 % per-site mutation rate, which corresponds to about 1–10 mutation events per genome. Once reproduction is completed, the algorithm goes through a new evaluation phase and repeats the same sequence for typically 40–50 generations. Such a short evolutionary time is sufficient to witness the emergence of promising foraging behaviors without spending computing power on poor organisms. Unlike the emergence of forward locomotion that is systematic in EVO [76], foraging builds upon skills that are much more rare, and thus less likely to evolve. For instance, while forward locomotion emerges robustly within 20 generations, only 3 % of the simulations (out of 400 repeats) successfully evolved organisms that took at least two different trajectories when exposed to different food sources after 40 generations. This skill, namely the ability to exhibit a behavior that is conditional on food source placement, is a necessary milestone before reliable foraging can emerge. Yet, over 30 variations of the same experiment run for 40 generations with the present set-up would have to be repeated on average before it appears in only one of them, on average. For this reason, each simulation configuration is run in sets of 100–480 repeats with the same parameters save the random number seed. Each repeat produces an evolved organism that we inspect visually and whose performance is compared against the organisms from the other repeats (see Fig. 1 for an overview of the transfer strategy).

The best evolved organism from a set of repeats is called a founder organism because it exhibits key behavioral qualities that are desirable for foraging (such as locomotion in the early stages of evolution, and the flexibility to steer towards multiple directions in later generations), and is used to found the next generations of foragers. For example, the first founder organism (organism 3069) evolved after 40 generations from a population of organisms seeded with random initial genotypes, and was selected among the fittest from 400 repeats (from run 2986 to run 3385, see Fig. 2). Each repeat is numbered according to the order in which it is executed, and each evolved organism is numbered after the evolutionary experiment that produced it. The first founder organism is then used for the next set of 480 repeats, where each population in the repeats is composed of exact clones of organism 3069. This procedure is repeated five times (with different numbers of repeats, and different fitness landscapes that are particular to that stage, see below) to yield the final foragers (see Fig. 2).

2.3 Placement of food sources

A critical aspect of autonomous foraging is the organism’s ability to use the relevant sensory information to approach and reach nearby targets. To evolve such an ability, the organism has to be exposed to a large number of different target positions, which is why in this work we vary those locations within and across generations. When evaluating the fitness of a single organism, it is impractical to uniformly sample enough random target positions to recreate a representative approximation of the task of foraging. Indeed, with the current setup, we would need to test each organism on at least 20–30 different targets to cover the available space, which corresponds to several weeks of simulation for a single experiment. However, in the early stages of evolution where the organisms have a very limited ability to approach food sources, their fitness is overwhelmingly dictated by the targets’ positions: the fittest individuals are therefore those for which the greatest number of targets (usually one) happen to be at the right place. Therefore, the fittest are, at that stage, not the most efficient foragers but instead they are the luckiest ones. As the foragers become increasingly capable, their dependence on the target positions dwindles to the point where the fittest individuals are almost always the most efficient ones.

We use an approach inspired by this observation to reduce the stochastic effect of random food source positions while keeping the simulation time to a minimum (within a day to a week per experiment): We first find a target placement that yields the fastest emergence of target-following behaviors within a fixed amount of time, then we use this placement in conjunction with an increasing amount of noise to allow the organisms to reach food sources at arbitrary positions. The optimal placement uses only four food source locations compared to 20 or 30 as discussed earlier. A significant drawback to this approach is the fitness sensitivity to stochastic fluctuations on the food source placements. Those random fluctuations can be mitigated by evolving the initial population in an almost noise-free environment, and gradually increasing the noise for each subsequent evolutionary stage once the best organism’s performance is reliable given the previous amount of noise.

The algorithm for the placement of food sources has two components. First, we select for the ability to move conditionally on food source location, by positioning the source 10 m away from the organism (measured between the organism’s root block and the food source’s centers of gravity), at a specific angle. For example, by choosing two locations (angles 0$^{\circ }$ and 180$^{\circ }$), we select to move forward when the food source is located in front, but move backwards (or turn around and then move forwards) when the source is in the back (see Fig. 3b). We can also use the four cardinal directions to select for conditional movement (Fig. 3c) or more (Fig. 3d). Within each conditional movement adaptation step, we perform a noise adaptation step, where we randomize the source placement by applying noise along the horizontal plane, by an amount that is proportional to the distance used in the first step (Fig. 4). This step is important in the early transfer stages of evolution (defined in Fig. 2) because very little noise is used in those: in that case the food source positions carry a strong and consistent signal of directionality. As the foragers become more efficient, the noise component is increased up to the point where the noise adaptation step is meaningless and is skipped altogether. For the noise adaptation step, we repeat each primary direction (defined by the angle) three times, so that an organism can reach up to three food sources in sequence (with positions that differ only in the amount of noise applied). Note, however, that the internal state of the organism is not reset when the experiment is repeated; instead we are really asking the organism to reach up to three food sources in sequence. We chose these two different evaluation steps because adaptation to noise and adaptation to drastically different locations appear to be different skills: it is possible, for example, to evolve to conditionally react to sources right in front and to the back, and still be incapable to tolerate even the smallest deviations from these two positions. At the same time, it is possible to tolerate noise in one direction, but not be capable to conditionally react to arbitrary directions.

Each fitness evaluation in the noise adaption step yields a fitness value W for reaching up to three food sources in sequence [see Eq. (7) below]. This fitness is then combined with the rest of the other fitness values (obtained in the conditional movement steps) in a geometric average to produce a composite fitness $\overline{W}$ [Eq. (8)]. We can use several different procedures for the placement of food sources with respect to the organism. In a “single angle” movement step, all the food sources in the sequence are in the same location (Fig. 3a), 10 m in front of the organism (possibly with noise). Thus, after reaching the first food source, the organism would not need to turn as the next one would appear at the same (zero degree) angle. In a “two angle” setup, the first three food sources are placed at 0$^{\circ }$, but in a second evaluation phase, all food sources appear at 180$^{\circ }$ (see Fig. 3b).

Let us consider an organism evaluated across four evaluation steps (Fig. 3c), so that each angle interval is $(2 \pi ) / 4 = \pi /2$ radians. We test the individual with a maximum of 3 food sources in a row, and would create the first one in front, at a 0$^{\circ }$ angle (possibly with noise). If the forager can reach the first food source on time, then the second target position is added to the organism’s current position and appears at a 0$^{\circ }$ angle (plus noise) as well, as would the third target if the second one were reached. After the first conditional movement step terminates, the second such step begins with a food source placed at an angle of $\pi /2$ radians, i.e., on the left. Each new food source (up to three) within this conditional movement step will appear at the same angle, 10 m away from the organism. The last two steps will create food sources at angles of $\pi$ radians and $3 \pi / 4$ radians, which correspond to food sources in the back and on the right, respectively. At any point in time, an individual sees only one food source. In such a setup, an organism can reach a maximum of 12 food sources (three at each angle). It is possible to test more sophisticated placement schemes, but a scheme as that in Fig. 3d would be much more costly (24 evaluations), and is not used here.

When noise is applied to the food source location (see Fig. 4), each target position within a sequence is uniformly randomized by adding noise $\nu$ proportional to a percentage p of the distance to the food source $\delta$, so that $\nu = \delta \times U(p, -p),$ where U(x, y) is the uniform distribution over a 2D plane spanned by the x and y axes. For instance, a noise level of 5 % applied to a food source placed 10 m away from the individual corresponds to two uniformly distributed perturbations of at most ±50 cm added to the target’s x and y coordinates. This is equivalent to drawing squares of $1 \mathrm{m}^2$ centered on each food source, and choosing a position uniformly within the perimeter of those squares (Fig. 4).

2.4 Simulation

Before evaluation, each individual has to successfully pass a validity test that discards those individuals that are either prone to create numerical instabilities within the physics simulator, or that have no chance of moving or sensing. To determine the latter, an organism is removed from the simulation if it has only one block, if its neural network is disconnected from the motors, sensors, or both, or if there is any interpenetration among the body parts at the very first time step. This strategy significantly decreases the time spent on evaluating individuals that have no chance of succeeding [79]. The simulation creates a new empty environment, consisting of a flat surface (ground) and the first food source in the evaluation sequence. Subsequently, the organism is generated from its genome description and placed above the ground as in Ref. [76]. The individual’s proximity and angular sensors are initialized to target the food source. The physics in the virtual world is simulated using the Open Dynamics Engine [80] in discrete time intervals of 0.02 s (i.e., 50 iterations per second). Immediately after being created, an organism undergoes a series of activations and deactivations (described in Ref [76]) to prevent unsuitable “cheating behaviors” such as morphologies that manage to traverse some distance just by falling over because of the pull of gravity, without using its brain at all (such cheaters were previously described by Sims in Ref. [69]). Cheaters inhibit the evolution of efficient gaits because they are much more likely to emerge by chance than individuals with cyclic, coordinated motions.

As soon as the organism completes the activation sequence, a timer is initialized (at 10, 30 or 60 s depending on the evolution stage), which is the maximum amount of time given to reach the next food source before the simulation terminates. At each time step, the individual’s root block position and the distance returned by its proximity sensor are recorded. When the center of gravity of the organism’s root block is within 2 m from the food source (called the absorption radius), the individual absorbs it instantaneously and the next food source in the sequence appears. The proximity and angle sensors are then updated to target the new food source and the timer is reset, allowing the simulation to proceed for the same amount of time as for the initial food source. If the organism reaches the last food source in the sequence, the food item remains in place and the simulation stops after time runs out. When the simulation ends, the organism’s positions and distances to the sequence of targets are used to evaluate its fitness.

2.5 Fitness function

Each time step, the organism is rewarded for approaching, or penalized for moving away, from a target. For the current food source s, the fitness contribution $W_{s}$ is computed as a function of the distance $\Delta d_{t} = d_{t} - d_{t-1}$ travelled towards the food source between the two consecutive time steps t and $t-1$. The values $d_t$ and $d_{t-1}$ are the organism’s proximity sensor readings at time t and $t-1$ respectively:

$$\begin{aligned} W_s = \prod _{t=1}^{T_s-1}{(1 + \Delta d_{t})}, \end{aligned}$$

(1)

where $t=1\ldots T_s$ are time steps during which the food source s exists in the environment. Depending on how fast an organism reaches the target, the variable $T_s$ is different for each food source $s = 1 \ldots N$.

For a given number of time steps $T_s$, $W_s$ is largest when all the distances $\Delta d_t$ are equal. For example, in the case of two $\Delta d$ (i.e., for three time steps), the fitness can be pictured as the two-dimensional area $W_s = \Delta d_1\times \Delta d_2$, which is maximal when $\Delta d_1 = \Delta d_2$ form a perfect square. The same reasoning can be applied for three $\Delta d$s that span a volume; the statement that $W_s$ is largest when all the distances $\Delta d_t$ are equal can easily be proven by induction for higher dimensions. This property has the advantage of favoring smooth periodic motions, because for a given number of time steps $T_s$, those individuals that progress at a constant speed towards the food source are rewarded the most.

The total fitness accumulated in a simulation run for approaching targets $W_a$, is the product of the individual fitnesses $W_s$ for each individual food source in the sequence:

$$\begin{aligned} W_a = \prod _{s=1}^{S}{W_{s}} = \prod _{s=1}^{S}{ \left( \prod _{t=1}^{T_s-1}{(1 + \Delta d_{t})} \right) }. \end{aligned}$$

(2)

Unfortunately, no interesting behavior emerges with this fitness function: the best organisms move very little because locomotion is not rewarded enough in the early stages of evolution. To counteract this, we use an additional term that explicitly rewards locomotion using the total distance traveled by an organism between the first ($p_0$) and the last ($p_f$) position recorded during simulation:

$$\begin{aligned} W_l = C \times \min (p_0-p_f, 1.0). \end{aligned}$$

(3)

A scaling constant $C=100$ yields the desired results in early generations, but the term $W_l$ can become overwhelmingly dominant when the organisms evolve to travel greater distances, since it is apparently easier to develop a faster gait than to better steer towards a target. The result is a rapid emergence of fast movers, but poor target followers. To limit the influence of $W_l$, the maximum distance rewarded was capped at 1 m.

If the goal of selection is target-reaching behaviors, then an organism that reaches food sources should receive a higher fitness if it succeeds than any other that fails. However, the fitness $W_a\times W_I$ does not explicitly reward reaching any food source per se, only the distance travelled towards them. This implies that a forager that reaches nearby targets is penalized over those that steadily approach distant ones, without ever reaching any. To correct for this, we add a term $W_r$: a fitness bonus that is added to the fitness each time a food source is reached such that organisms that reach more targets are fitter than those that reach fewer. The bonus is designed such that it is about the same order of magnitude as the fitness accumulated for approaching the target. Assuming the organism reaches S food sources in T time steps where each food source is initially placed 10 m away, it will travel an average distance of $\Delta d = \frac{10\times S}{T}$ at each time step. Substituting in (2) and multiplying by the total number of food sources S, the total bonus $W_r$ for reaching S food sources is:

$$\begin{aligned} W_r = S \times \left( 1 + \frac{10 \times S}{T} \right) ^T. \end{aligned}$$

(4)

Since $S < 10$ and $T > 1000$, the equation can be approximated by

$$\begin{aligned} W_r \approx S \times e^{10 \times S} \end{aligned}$$

(5)

This bonus is added to the overall fitness:

$$\begin{aligned} W = W_l \times \prod _{s=1}^{S}{W_{s}} + W_r. \end{aligned}$$

(6)

With this fitness function, the combined effect of rewarding the organism for approaching food sources with the added bonus for reaching them puts a very strong evolutionary pressure for reaching the easiest food source: As soon as an organism evolves to reach the first food source, it capitalizes on this successful direction to reach the next one without developing the ability to move towards other directions. A solution is to discourage the organism to go towards the next food source if it already reached one in this direction by reducing $W_s$ in Eq. (2):

$$\begin{aligned} W = \prod _{s=1}^{S}{W_{s}} = \prod _{s=1}^{S}{ \left( \prod _{t=1}^{T_s-1}{\left(1 + \frac{\Delta d_{t}}{2^{s}}\right)} \right) } \end{aligned}$$

(7)

Each time a new food source is absorbed, the reward $\Delta d_t$ is halved. The bonus $W_r$, however, remains identical, so that the organism still has a very strong incentive to reach food sources. Only the order of priority to reach them is changed. The organism’s fitness across all repeats $\overline{W}$ is the geometric average of each individual fitness $W_i$ [calculated using Eq. (8)] across all R repeats, where R is determined by the number of primary directions used in the conditional movement step described above:

$$\begin{aligned} \overline{W} = \left( \prod _{i=1}^{R}{W_i} \right) ^{1/R}. \end{aligned}$$

(8)

2.6 Measuring the organism’s foraging ability

Foraging performance is assessed by placing food sources on a two-dimensional grid or map, and by testing the individual for each food source on that map, one at a time. Each cell on that foraging map is colored depending on the organism’s average speed of approach: Lighter shades of red correspond to faster speeds toward the food source whereas lighter shades of blue encode faster speeds away from the food source. Darker shades indicate slower speeds. If the food source is placed close enough to the center of the grid, it would be in direct contact with the individual, and be automatically absorbed even if the forager does not move at all. The foraging map would display inconsistent color patterns in the center that do not reflect the organism’s foraging ability at all. This is why there are no food sources within 4 m (i.e., two absorption radii) from the grid center. The resulting foraging map is a snapshot of the organism’s ability to forage at a given resolution.

Foraging maps in this work span a constant square surface of $20 \times 20$ m (which is the same area used to place food items in the evolutionary runs) and carry either $11 \times 11$ (low resolution) or $101 \times 101$ (high resolution) food sources, which correspond to spacings between two adjacent food sources of 2 m and 20 cm, respectively (Fig. 6). Because the area in the center of the foraging map is empty, an $11 \times 11$ foraging map has only 112 food sources, and not 121 as expected in a fully populated grid. While the typical fitness evaluation of an organism in an evolutionary run takes about 5 s, the computation of a low resolution foraging map takes about 2 min, and about 4 h for a high resolution foraging map.

As soon as an organism reaches a food source, it is possible to draw a foraging map for all the second food sources, conditional on the first one. Such a conditional foraging map provides insights about how the organism’s foraging ability changes after reaching specific food sources. However, the computation time approximately doubles, since for each pixel on the conditional foraging map the organism has to be simulated for two targets: the first one and all the second ones given the resolution of the map. For each conditional foraging map, the first target location is the same. Clearly, calculating all conditional foraging maps scales quadratically, so instead we present them only for selected first-target locations.

3 Results

3.1 Foraging in more than one direction

In order to effectively force organisms to use their sensory information, food sources have to be placed at least in opposite directions, as in Fig. 3b. We found that if the targets are placed in front of the organism but at different locations, each individual almost always evolves its own unique forward trajectory, regardless of the actual food source positions. In that way, by just going forward the organisms are able to obtain (with a single unconditional behavior) some reward from all three food sources, without using their sensors at all. When two targets are placed in opposite directions (Fig. 3b), the fittest and most effective strategies consisted in wiggling towards the front one, and wiggling backwards toward the target behind. Both gaits are very similar and apart from the obvious progression in opposite directions, are otherwise impossible to distinguish if inspected separately. Very few strategies involve turning towards the food source, and those that do turn look very laborious and can never reach a second target that is behind, because they are incapable of completing the turn in time. Moreover, this wiggling behavior appears to be particularly inflexible: when exposed to an environment with four food sources, those individuals almost never acquire the ability to turn left or right, and when they do, it is always very inefficient. The best results were obtained when we exposed individuals to four food sources (Fig. 3c). We also tried exposing the individuals to eight food sources, with four additional targets on the diagonals (Fig. 3d), but even though it yielded slightly better results for a fixed number of generations, such runs took twice as much time to complete: With the same amount of CPU time and the same number of replicates, we had much better results by running two sets of replicates with only four food sources each, where the second set is seeded with the best organisms from the first set, rather than running one set with eight food sources.

If an organism has to reach one food source and the simulation terminates only when time runs out, regardless of whether it reaches the target or not, then distance matters: If the food source is placed far enough from the organism that it has no chance of reaching it within the time allowed, then evolution capitalizes on the fitness gathered from approaching a single target by optimizing the current gait to become ever faster, without developing the ability to go toward other directions. On the other hand, if an organism can reach its first target in the early generations, then it cannot improve its fitness by approaching this food source any more, and is forced to adapt to seek additional food sources in other directions. Among the set of 400 initial runs, no organism is able to efficiently evolve a gait within 40 generations that is effective for the four directions. Reasonably high fitness values (on the order of a hundred) are good indicators of efficient foraging gaits, while mediocre gaits typically have fitness [measured by Eq. (8)] of the order of 5. An unusually fit individual (given the other member is the population), especially in the early stages of evolution, is usually a signature of a bad strategy. For example, “jumping” organisms, that use motor impulses that violently propel them in the air, can manage to randomly hop around covering greater distances than their peers in the population, and happen to land on or close to a food source purely by chance.

3.2 Improving foraging coverage

Typically, the addition of noise to the target positions is an effective means to improve the forager’s coverage. With little or no noise, organisms show a reduced ability to forage in their environment, being generally constrained to the close vicinity of the four target positions they were exposed to. The target positions that are within the organism’s reach are typically clustered together, forming one ore more patches as revealed by the foraging map. As the organism’s foraging ability improves, the patches grow and eventually coalesce into bigger connected areas. The remaining targets that are still out of reach form compact patches that the organism does not seem to be able to approach. For food sources located in such areas, the organism remains stationary, although it actively wiggles using a gait that is superficially similar to the efficient gait used to reach targets. That inefficient gait, however, seems to be completely insensitive to the targets: the organism acts as if it could not sense their position (see Fig. 5), as if the food sources were in a “blind spot”. However, as the targets are always sensed regardless of their position, the term “zone of confusion” is more accurate. The relationship between the efficient and inefficient gaits will be discussed in more details in Sect. 4.

The amount of noise used to evolve the first founder organism was set to 0.1 % (see Fig. 2), which corresponds to a maximum perturbation of only ±1 cm (as targets are positioned 10 m away from the organism before noise is applied) along both planar coordinates (Fig. 4). The amount of noise is drastically increased in each transfer to 5 % in the first transfer, and 50 % in the second, after which the positions are uniformly random. We found that if the noise is increased too fast, the overwhelming majority of the runs’ fittest organisms wander about randomly. After the first transfer, when promising organisms are selected from the first-stage runs and are used to seed a new evolutionary run, their ability to cope with noise seems to increase exponentially. After two transfers, starting from a noise level of 0.1 %, the organisms are typically able to cope with noise up to 20–50 %. Past those noise levels, they are able to reach food sources positioned uniformly around them, without missing any.

After each evolutionary stage, the elites of each run improve their foraging ability in one of two qualitatively different ways: either they were able to reach more distant food sources, or they evolved the capacity to forage in a new direction. Another clear trend is that evolving the ability to forage in new directions is a rare occurrence compared to evolving the ability to exploit existing directions by reaching farther targets: turning appears harder to evolve than running. For this reason, we selected organisms primarily for their ability to reach food sources in all directions rather than reaching distant ones ( Fig. 6).

Despite the advantage of allowing organisms to forage over greater areas, using noise carries an important drawback: it alters the organisms’ fitness and rankings between generations. Not only is the best organism’s fitness in one generation different from the fitness value in the next generation, but it might not be the fittest anymore. Since the food source positions are randomized for each new generation, the organism’s fitness fluctuates throughout an evolutionary run. The fluctuations are more drastic as the noise level increases, sometimes over ten orders of magnitude (see for example generation 7 in Fig. 7d). This upward trend in fitness fluctuations is due to the enormous amount of fitness that efficient foragers are able to accumulate. Under this regime, it is hard to design a selection scheme that consistently favors promising foragers across generations. In previous work when evolving forward locomotion or throwing [65, 76], transferring the best organism along with others chosen with roulette selection to the next generation was sufficient to allow a monotonic increase in fitness within an evolutionary run. Almost always, higher fitness meant better, that is, more functional, organisms. In the evolution of autonomous foragers, this is not true any more: even at very low noise levels, selecting only one elite does not guarantee that good genes are going to be passed on to the next generation. Instead, as discussed earlier, the fittest organism could be the result of chance alone, where a food source happens to be placed directly on the individual’s path. Chance can, however, be mitigated by increasing the number of elites. We found that five elites seem to be the threshold at which there is a marked difference in the algorithm’s ability to find efficient organisms. We used eight elites in order to further increase the probability of selecting good genes, while the rest of the survivors were chosen with roulette selection. Under these conditions, the optimization algorithm manages to significantly increase the organism’s foraging efficiency because the rankings within a population are then consistently preserved: A good forager at a given generation is also going to perform well in the next generation.

After two transfers, the best of the evolved organisms across the repeats is able to forage in all directions (see Fig. 8b, left panel), and can reach any target on the test grid within a minute of simulated time (i.e., $60\times 50=3000$ time steps). Interestingly, after reaching the first food source, the organism reaches the next target only a quarter of the time on average, and never reaches four or more (Fig. 8a). Indeed, after reaching the first target, the organism’s foraging map switches to a different one that has substantial zones of confusion (see Fig. 8b, right panel). To achieve autonomous and sustained foraging, an organism has to be able to collect food sources multiple times in sequence. As our goal is to use such a forager as a seed ancestor to self-reproduce as it gathers energy from food, and ultimately seed a sustainable population in an open environment, we have further evolved the best organism to reliably forage several food sources in sequence.

3.3 Reaching more than one food source in sequence

If organism 6809’s foraging ability were not conditional on the food sources reached, it would forage all of them, regardless of whether they are the first or the subsequent ones, and would produce a foraging profile in Fig. 8a with a line close to 100 % success rate. However, its behavior as Fig. 8a shows is drastically altered upon reaching the first food source. The conditional foraging maps are qualitatively similar independently of the location of the first source, in that the organism attains on average only 25 % of the second food sources, as opposed to 100 % for the first foraging map.

This observation is important for the rest of the discussion: organism 6809 and its descendants exhibit a clear qualitative difference between the initial foraging map (which corresponds to the organism’s ability to forage the first food source) and the conditional foraging maps that capture the organism’s ability to reach the subsequent food sources. Each foraging map corresponds to a probability that an organism reaches a target (the success rate in Fig. 8a). For one food source, this probability is equal to the success rate because the initial foraging map is unique. The success rate for reaching multiple food sources is contingent on reaching the previous ones, so it can never increase: the success rate to reach two food sources is the probability to reach the first multiplied by the probability to reach the second. For each repeat in Fig. 8a, the same organism reaches the first food source at different positions, so the conditional foraging map is different each time. The second probability (leaving the first food source to reach the second one) is an average of all the probabilities associated with all the conditional foraging maps. This second probability here is approximated by the ratio between the success rate for reaching the second food source divided by the success rate for the first one. The same logic applies to the success rate for three or more food sources. For example, in Fig. 8a, the ratios between two consecutive success rates (from the first to the fourth food source) yield low values of 0.28, 0.179, and 0, respectively. These numbers mean that the organism is able to move on to the next food source only 28 % of the time after reaching the first food source, 17.9 % of the time after reaching the second one, and never after reaching the third. These percentages are clearly in stark contrast with the ability to systematically attain the first target.

Had the experimental setup placed enough selection pressure on the ability to reach more than one food source, the conditional foraging maps would have displayed more red pixels. In this section we describe our shift from the fitness function used in the first three stages to a different one that increases the evolutionary pressure on the organism’s ability to forage more than one food source. Since our best individual after two transfers is able to reach the first food source systematically, we dropped the reward for it: only the second and subsequent targets are rewarded. This change corresponds to altering the fitness term in Eq. (2) so that the outer product skips $W_s$ for $s=1$ and factors the remaining fitness components for $s=2 \ldots S.$ The fitness function W thus becomes:

$$\begin{aligned} W = W_l \times \prod _{s=2}^{S}{W_{s}} + W_r \end{aligned}$$

(9)

This alteration corresponds to the fitness function (b) in Fig. 2. At this stage, we also change the method to select the survivors: we obtained better results for the new fitness function by switching from roulette to tournament selection with a tournament of size 5. Finally, the food sources are now placed completely at random, even though there still are four evaluations per individual with a maximum of three attainable food sources per evaluation.

Although a majority of the evolutionary runs in this stage led to organisms very similar to the ancestor (organism 6809), a few have a clearly different foraging ability (see, e.g., Fig. 9). This type of organism is able to reach the first food source only 40 % of the time, but once reached, it gets to the next one almost systematically. The conditional foraging map for the second food source for this organism is qualitatively similar to that of the earlier ancestor’s initial foraging map (see Fig. 8b). This suggests the hypothesis that different “neural modules” are used by organism 6981 to reach the first as opposed to all subsequent food sources, and that the change in fitness landscape (removing a reward for the first food source) created organisms where the modules were simply swapped ( Fig. 10).

We observed one exception among the foragers that consistently forage subsequent food sources: an individual (7046) was able to reach 80 % of the food sources initially instead of 40 %, and proceeds to the following ones even more systematically (see Fig. 11a). Organism 7046 is used as the ancestor for the last transfer where the fitness function is simply the number of food sources reached across all the simulations (see Fig. 2). The number of evaluations was also increased to six per individual for this last stage, with the same pattern of three sequential targets per evaluation (fitness function (c) in Fig. 2). The organisms evolved from this ancestor are significantly better foragers, and usually reach the first food source about 90 % of the time and miss very few afterwards. We identified two variations of very fit organisms: The first type reaches the first food source the most often (99 % of the time, see Fig. 11c), but misses the next ones about 4 % of the time on average. The second type (Fig. 11d) has the lowest miss rate for the following food sources of about 1.5 % on average, but reaches the first food source less often (91 % of the time). If evaluated for 10 food sources, the first and second types’s total rates of success are about 86 and 85 % respectively. Note that we never selected for reaching more than three food sources in sequence, but efficient foragers reach an arbitrary number.

4 Discussion

Unlike undirected or directed locomotion, ground foraging proved to be much harder to evolve and posed several problems that could not be anticipated from our previous experience with the systematic emergence of locomotive gaits [65]. The first difficulty is to evolve behaviors conditional on the food source positions, which requires placing the targets in opposite directions. One might argue that this behavior might not be so hard to acquire if the organisms were allowed to evolve for longer than only 40 generations, but in comparison, forward locomotion emerges within five generations. In addition to saving CPU power, the small number of generations mitigates the problem of over-fitting the population to a particular fitness peak. It does not completely prevent over-fitting, however: efficient foragers in a two-food-source-setup such as illustrated in Fig. 3b, when transferred to an environment with four targets (Fig. 3c) in a fashion similar to behavior chaining [37], are far less effective at going to the additional food sources than evolving organisms with the four food sources at the outset. The forward and backward wiggling behaviors, although well-suited for the two-food-source setup, appears to occupy a very narrow fitness peak: when mutated, the organism’s fitness drops so much that the mutant cannot survive for long enough to be reconfigured into a potentially more effective forager able to turn right and left. This example illustrates that pre-evolved behaviors do not necessarily facilitate further evolution, even though the skills acquired in the previous environment are readily useful and necessary for success in the new environment.

Only very little noise could be applied to the food source positions in order to evolve directed foraging in the first 40 generations. As opposed to random wandering, directed foraging is optimal if the information provided by the food source positions helps the foragers to improve their fitness significantly: the noise in the food source placement is translated as random fitness fluctuations in the population, and the organism’s attempts to approach food sources aims at reducing this fluctuation. If the noise results in perturbations that exceed the maximum distance by which an organism can approach a food source, then the organism’s foraging skill is useless since its fitness is dictated by the target positions. Selection cannot detect the organisms that exhibit intermediate foraging abilities, and this behavior is therefore not selected for. As a consequence, the chances of seeing an efficient forager with complex neural wiring that emerges from scratch becomes effectively nil, and evolution instead takes an easier path and improves the individual’s body coordination to traverse greater distances in random directions, without processing any information from the target positions.

Even when the noise is low enough that very few simulations yield randomly behaving organisms, the fraction of behaviors that visually look promising is very sensitive to the noise level. For example, in an initial set of 150 replicates evolved for 40 generations with 0.1 % of noise (not part of the set studied here), 5 organisms were clearly trying to move in at least three different directions, and were deemed to be worthy ancestors for the next set of repeats. If the noise level is raised to 5 % with otherwise the same experimental setup, no interesting organism emerges in 200 replicates, and if the noise is raised to 20 % and the number of replicates is increased to 500, no interesting behavior arises even after 100 generations. We suspect that this extreme intolerance to noise in the early generations is a property peculiar to the type of neural controller used in this work. It would be interesting to map the evolution of a successful forager’s controller resilience to noise across its line of descent, and see how mutations reconfigure the neural network up to the point where it acquires tolerance to arbitrary noise.

Once an efficient one-food-source forager is discovered, we expected it to be able to reach the next food source with the same efficiency. Instead, we found that foraging one food source, and sustained, repetitive foraging, appear to be two entirely different skills. Acquiring the former does not guarantee the emergence of the latter, as can be seen in organism 6809 (Fig. 8). From the organism’s foraging profile in Fig. 8a, it is tempting to conclude that the organism operates in two distinctive modes: One that allows it to reach any food source initially, and another (where it misses 75 % of the time) that appears to be used for all the remaining food sources (explaining the power-law decay of foraging efficiency). The foraging profiles for the descendants 6981 (Fig. 9), 7046 (Fig. 11a), 7420 (Fig. 11c), and 7513 (Fig. 11d) also support this conclusion: Regardless of the initial success rate at reaching the first food source, the organisms seem to miss the following ones at a similar rate. While looking at the foraging patterns for the first food sources for individuals 6981, 7046, 7420 and 7513, we notice a general trend in the pattern for individual 6981 (Fig. 12) that looks like a red flare or smoke morphs into the intermediate pattern of individual 7046 (Fig. 13) as if it was wrapped around the center, and slightly differentiates into two even more aggressively wrapped versions in individuals 7420 (Fig. 14) and 7513 (Fig. 15). A similar observation applies to the conditional foraging maps for the second food sources: across these four organisms, we can see that except for the map that is mostly black in Fig. 12, all the maps share the same flare-like patterns that radiate away from the center in a straight direction, as opposed to the whirling patterns described earlier. The descendants from organism 6981 conceivably evolved two qualitatively distinct behaviors, one for reaching the first food source, and another for the remaining food sources, and both apparently evolved separately.

A key behavior switch seem to have occurred between organisms 6809 and 6981: the flare-pattern in one of organism 6809’s conditional foraging maps (upper right corner in Fig. 8b) surfaces in its descendant’s initial foraging map almost identical, only rotated. Conversely, the flare-like pattern found in organism 6809’s initial foraging map seem to be expressed in two of its descendant’s conditional foraging maps (upper left and lower right corners in Fig. 12). This observation together with the, apparently independent, modification of each behavior suggest that they might be implemented by the neural network as two distinct functional modules that can be swapped and evolved individually, which appears to be precisely what evolution took advantage of.

The patterns described so far appear to be overlaid on simpler ones that seem to be independent of the target positions. For instance, the bottom right foraging map’s background in Fig. 8b shows a gradient pattern that is most likely explained by a single forward motion, regardless of the target position. Such a constant motion might be a remnant from the ancestral gait that yields a basal fitness from forward motion alone [Eq. (3)]. This ancestral behavior remains because until the second-to-last transfer, we kept the term that rewards forward motion in the fitness function. Blind forward motion has evolved first, and provided a building block upon which more complex behaviors have likely emerged. We hypothesize that when a food source position does not trigger the efficient foraging behavior, the organism instead uses the (likely modified) ancestral forward gait as a fallback behavior, causing the individual to move invariably towards the same arbitrary direction.

5 Conclusion

Despite its omnipresence in nature, the evolution in silico of efficient foraging for articulated robots that can be put on par with that of animals (such as small mammals) has remained out of reach. Inching towards the reproduction of such behavior in artificial media, we have presented here the first instance of efficient virtual, three-dimensional articulated foragers that can reach food sources in a sustained fashion in their physical environment (note that while [39] appeared before this work, they cite an earlier version of the present manuscript as their inspiration). The positioning pattern of the food sources, coupled with an adequate gradual increase of noise in their positions, can drastically reduce the number of necessary targets without compromising the emergence of efficient foragers. Those organisms have been shown to effectively use their sensory information to steer towards the food sources, and they do so at increasingly arbitrary positions as they are further refined by evolution.

The organisms’ ability to move towards a food source can be visualized on a two-dimensional plot (a foraging map) that gives an instant overview of the complex behavioral patterns displayed by foragers when tested on thousands of different food source positions. Those foraging maps help explain the foraging efficiency on successive food sources, and provide valuable insight into how the foraging behavior changes from ancestor to descendants. The conditional foraging map for the second food sources show how an organism’s ability to reach a food source at the same relative position can drastically change depending on the position of the first food source. Surprisingly, evolved foragers that are efficient at reaching the first food source do not perform consistently when attempting to reach multiple targets, and appear to switch their behavior after reaching the first food source. Those organisms, when evolved further, improve their overall efficiency and consistency, but conserve those two distinct behavioral modes.

Despite drastic fitness fluctuations induced by imposing noise on food source locations (even between two consecutive generations) the Genetic Algorithm was able to detect significantly fitter organisms across generations so that evolution was able to improve the foraging performance substantially. This gave rise, after less than 300 generations of total evolution along the line of descent, to multiple foraging strategies that can reach the first food source in over 90 % of the cases and miss the following ones at most 4 % of the time on average. In future work, it would be instructive to analyze the forager’s neural networks to study how they are modified on a per-generation basis, and compare those changes to how the foraging map is altered. Such an analysis might provide insights to the origin and nature of the behavior switch displayed by organism 6809 and its descendants. However, such an analysis would be computationally expensive: the evaluation of the foraging maps alone across approximately 300 generations would take about two CPU-months for a resolution of $101\times 101$ food sources.

It is possible that much of the dynamics that we report here are specific to the neural controller used, a controller that is composed of very high-level functions packaged into a single neuron. The evolvability of such a neural network can be questioned, as usually only a few neurons actually participate in a complex behavior (most neurons are neutral to knockout and therefore appear to be without function). The few functional neurons offer few and often very brittle degrees of freedom for evolution to act on. It would also be instructive to use exactly the same tools and the same parameters to evolve foragers with different types of controllers, such as continuous time recurrent neural networks, as in Refs. [37, 81, 82], or radically different ones such as the Markov networks in [83, 84] and see what previous evolutionary obstacles they ease, or which new ones they impose.

The most immediate usage of these robust evolved foragers is to simulate a population of clones in an open-ended environment, to see if by gathering enough food items, individuals can reproduce and give rise to a sustainable population. Such a confirmation would pave a straight path to ecological experiments where adaptive radiation can give rise to diversified ecotypes, and ultimately distinct species, among other experiments. The promise for evolutionary experimentation of such a population is tremendous. For example, the morphology or behavior of one type of organisms might provide substantial constraints that could prevent inter-species gene flow via hybrid inviability with other organism’s morphologies or behaviors. Such Dobzhansky–Muller incompatibilities (see, e.g., [85]) have never been based on evolved morphologies or behaviors. Mate selection could also be grounded in physical realism: organisms may conceivably choose their mates based on morphological clues only, that could eventually be combined into species-specific signatures that reflect a specialized physical apparatus that evolved to allow its bearer to survive in a specific niche [86]. In fact, studying the co-evolution of morphology and behavior, and its impact on the dynamics of speciation, is probably the most tantalizing prospect for experiments with the open-ended evolution of these creatures. Geographic separation, as well as various landscapes with different available resources could induce groups of organisms to geographically split, but also evolve the ability to take advantage of distinctive types of landscapes through the development of appropriate physical and behavioral features.

The emergence of stable ecosystems would allow us to study questions in ecology, such as food web dynamics with the added degree of physical and behavioral realism. Metabolic rate, behavioral, organism size, and animal range size variations to name a few, would all be naturally simulated for free. Previously, all those variables have only been modeled as mathematical abstractions. However, even if several tens of species emerge, the total number of individuals needed to keep the ecosystem stable has to be substantial, numbering in the order of a thousand at least. Even though the current version of the software is able to simulate about a hundred of organisms on a single core, the physics engine needs to be more optimally used, and EVO needs to be parallelized to scale to one or two more orders of magnitudes and simulate an ecosystem on a single multi-core computer.

References

L. Szurmant, G.W. Ordal, Diversity in chemotaxis mechanisms among the bacteria and archaea. Microbiol. Mol. Biol. Rev. 68, 301–319 (2004)
Article Google Scholar
S. Ward, Chemotaxis by the nematode Caenorhabditis elegans: identification of attractants and analysis of the response by use of mutants. Proc. Natl. Acad. Sci. USA 70, 817 (1973)
Article Google Scholar
P.E. Stander, Cooperative hunting in lions: the role of the individual. Behav. Ecol. Sociobiol. 29, 445–454 (1992)
Article Google Scholar
S.K. Gazda, R.C. Connor, R.K. Edgar, F. Cox, A division of labour with role specialization in group-hunting bottlenose dolphins (Tursiops truncatus) off Cedar Key, Florida. Proc. R. Soc. Lond. B 272, 135 (2005)
Article Google Scholar
B.L. Sargeant, J. Mann, P. Berggren, M. Krützen, Specialization and development of beach hunting, a rare foraging behavior, by wild bottlenose dolphins (Tursiops sp.). Can. J. Zool. 83, 1400–1410 (2005)
Article Google Scholar
A. Pauw, J. Stofberg, R.J. Waterman, Flies and flowers in Darwin’s race. Evolution 63, 268–279 (2009)
Article Google Scholar
N. Muchhala, J.D. Thomson, Going to great lengths: selection for long corolla tubes in an extremely specialized bat–flower mutualism. Proc. R. Soc. Lond. B 276, 2147 (2009)
Article Google Scholar
P.A. Abrams, The evolution of predator–prey interactions: theory and evidence. Annu. Rev. Ecol. Syst. 31, 79–105 (2000)
Article Google Scholar
H. Toju, T. Sota, Imbalance of predator and prey armament: geographic clines in phenotypic interface and natural selection. Am. Nat. 167, 105–117 (2006)
Article Google Scholar
H. Dingle, V.A. Drake, What is migration? Bioscience 57, 113–121 (2007)
Article Google Scholar
C.R. Maher, D.F. Lott, A review of ecological determinants of territoriality within vertebrate species. Am. Midl. Nat. 143, 1–29 (2000)
Article Google Scholar
T. Mueller, W.F. Fagan, Search and navigation in dynamic environments—from individual behaviors to population distributions. Oikos 117, 654–664 (2008)
Article Google Scholar
A.P. Beckerman, M. Uriarte, O.J. Schmitz, Experimental evidence for a behavior-mediated trophic cascade in a terrestrial food chain. Proc. Natl. Acad. Sci. USA 94, 10735 (1997)
Article Google Scholar
M. Kondoh, Foraging adaptation and the relationship between food-web complexity and stability. Science 299, 1388 (2003)
Article Google Scholar
A. Thierry, O.L. Petchey, A.P. Beckerman, P.H. Warren, R.J. Williams, The consequences of size dependent foraging for food web topology. Oikos 120, 493–502 (2011)
Article Google Scholar
M. Dorigo, G. Di Caro, Ant colony optimization: a new meta-heuristic, in Proceedings of the 1999 Congress on Evolutionary Computation (CEC99), vol. 2 (IEEE, 1999)
J. Kennedy, R. Eberhart, Particle swarm optimization, in Proceedings of IEEE International Conference on Neural Networks, vol. 4 (IEEE, 1995), pp. 1942–1948
K.M. Passino, Biomimicry of bacterial foraging for distributed optimization and control. Control Syst. Mag. IEEE 22, 52–67 (2002)
Article Google Scholar
D. Karaboga, An idea based on honey bee swarm for numerical optimization. Techn. Rep. TR06 (Erciyes University Press, Erciyes, 2005)
P. Pirolli, Information Foraging Theory: Adaptive Interaction with Information, vol. 2 (Oxford University Press, New York, 2007)
Book Google Scholar
W.T. Fu, P. Pirolli, SNIF-ACT: a cognitive model of user navigation on the world wide web. Hum. Comput. Interact. 22, 355–412 (2007)
Google Scholar
S.D. Fleming, C. Scaffidi, D. Piorkowski, M. Burnett, R. Bellamy, J. Lawrance, I. Kwan, An information foraging theory perspective on tools for debugging, refactoring, and reuse tasks. ACM Trans. Softw. Eng. Methodol. 22, 14 (2013)
Article Google Scholar
A. Walker, J. Hallam, D. Willshaw, Bee-havior in a mobile robot: the construction of a self-organized cognitive map and its use in robot navigation within a complex, natural environment, in IEEE International Conference on Neural Networks (IEEE, 1993), pp. 1451–1456
P. Gaussiera, C. Joulaina, J.P. Banquetb, S. Leprêtrea, A. Revela, The visual homing problem: an example of robotics/biology cross fertilization. Robot. Auton. Syst. 30, 155–180 (2000)
Article Google Scholar
T.H. Labella, M. Dorigo, J.-L. Deneubourg, Division of labor in a group of robots inspired by ants’ foraging behavior. ACM Trans. Auton. Adapt. Syst. 1, 4–25 (2006)
Article Google Scholar
H.S. Pereira, M.B. Sokolowski, Mutations in the larval foraging gene affect adult locomotory behavior after feeding in Drosophila melanogaster. Proc. Natl. Acad. Sci. USA 90, 5044 (1993)
Article Google Scholar
K.K. Ingram, P. Oefner, D.M. Gordon, Task-specific expression of the foraging gene in harvester ants. Mol. Ecol. 14, 813–818 (2005)
Article Google Scholar
R.E. Page, M.K. Fondrk, G.J. Hunt, E. Guzman-Novoa, M.A. Humphries, K. Nguyen, A.S. Greene, Genetic dissection of honeybee (Apis mellifera l.) foraging behavior. J. Hered. 91, 474 (2000)
Article Google Scholar
M.B. Sokolowski, H.S. Pereira, K. Hughes, Evolution of foraging behavior in Drosophila by density-dependent selection. Proc. Natl. Acad. Sci. USA 94, 7373 (1997)
Article Google Scholar
T.T. Hills, Animal foraging and the evolution of goal-directed cognition. Cogn. Sci. 30, 3–41 (2006)
Article Google Scholar
J.M. Gray, J.J. Hill, C.I. Bargmann, A circuit for navigation in Caenorhabditis elegans. Proc. Natl. Acad. Sci. USA 102, 3184 (2005)
Article Google Scholar
S. Grillner, P. Wallén, K. Saitoh, A. Kozlov, B. Robertson, Neural bases of goal-directed locomotion in vertebrates—an overview. Brain Res. Rev. 57, 2–12 (2008)
Article Google Scholar
J. Ventrella, Designing emergence in animated artificial life worlds, in Virtual Worlds, ed. by J.-C. Heudin (Springer, Berlin, 1998), pp. 143–155
Chapter Google Scholar
L. Yaeger, Computational genetics, physiology, metabolism, neural systems, learning, vision, and behavior or PolyWorld: life in a new context, in Proceedings of the Artificial Life III Conference, ed. by C.G. Langton (Addison-Wesley, Boston, 1994), pp. 263–298
Google Scholar
V. Braitenberg, Vehicles: Experiments in Synthetic Psychology (MIT Press, Cambridge, 1986)
Google Scholar
R. Brooks, A robust layered control system for a mobile robot. IEEE J. Robot. Autom. 2, 14–23 (1986)
Article MathSciNet Google Scholar
J. Bongard, Behavior chaining: incremental behavior integration for evolutionary robotics. Artif. Life 11, 64 (2008)
Google Scholar
P. Dasgupta, Multi-robot task allocation for performing cooperative foraging tasks in an initially unknown environment, in Innovations in Defence Support Systems-2. Studies in Computational Intelligence, vol. 338, ed. by L. Jain, E. Aidman, C. Abeynayake (Springer, Berlin, 2011), pp. 5–20
Chapter Google Scholar
L.P. Marcin, T. Ito, R. Suzuki, T. Arita, Evolution of virtual creature foraging in a physical environment, in Artificial Life 13, ed. by C. Adami, D.M. Bryson, C. Ofria, R.T. Pennock (MIT Press, Cambridge, 2012), pp. 430–432
Google Scholar
H. Guo, Y. Meng, Distributed reinforcement learning for coordinate multi-robot foraging. J. Intell. Robot. Syst. 60, 531–551 (2010)
Article MATH Google Scholar
D. Xu, B.L. Liu, R.B. Zhang, Modeling and modifying the foraging strategy in swarm robots. Key Eng. Mater. 467, 269–274 (2011)
Article Google Scholar
W.M. Shen, R. Kovac, M. Rubenstein, SINGO: a single-end-operative and genderless connector for self-reconfiguration, self-assembly and self-healing, in Proceedings of the 2009 IEEE International Conference on Robotics and Automation (IEEE Press, Piscataway, 2009), pp. 4253–4258
I. Ieropoulos, J. Greenman, C. Melhuish, I. Horsfield, EcoBot-III: a robot with guts, in Proc. Artif. Life XII, ed. by H. Fellermann, M. Dörr, M.M. Hanczyc, L.L. Laursen, S. Maurer, D. Merkle, P.-A. Monnard, K. Stoy, S. Rasmussen (MIT Press, Cambridge, 2010), pp. 733–740
Google Scholar
J. Ventrella, Explorations in the emergence of morphology and locomotion behavior in animated characters, in Artificial Life IV: Proceedings of the Fourth International Workshop on the Synthesis and Simulation of Living Systems, ed. by R.A. Brooks, P. Maes (MIT Press, Cambridge, 1994), pp. 436–441
H. Lipson, J.B. Pollack, Automatic design and manufacture of artificial lifeforms. Nature 406, 974–978 (2000)
Article Google Scholar
J. Bongard, V. Zykov, H. Lipson, Resilient machines through continuous self-modeling. Science 314, 1118 (2006)
Article Google Scholar
H.C.H. Chiu, M. Rubenstein, W.-M. Shen. ’Deformable Wheel’—a self-recovering modular rolling track, in Proceedings of the 2008 International Symposium on Distributed Robotic Systems (Tsukuba, Japan, 2008), p. 429
J. Klein, L. Spector, 3d multi-agent simulations in the breve simulation environment, in Artificial Life Models in Software, 2nd edn., ed. by M. Komosinski, A. Adamatzky (Springer, Dordrecht, 2009), pp. 79–106
Chapter Google Scholar
N. Chentanez, M. Müller, Real-time Eulerian water simulation using a restricted tall cell grid. ACM Trans. Graph. 30, 82 (2011)
Article Google Scholar
D. Baraff A. Witkin, Large steps in cloth simulation, in Proceedings of the 25th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), ed. by S. Cunningham, W. Bransford, M. F. Cohen (ACM Press, New York, 1998), pp. 43–54
P. Faloutsos, M. van de Panne, D. Terzopoulos, The virtual stuntman: dynamic characters with a repertoire of autonomous motor skills. Comput. Graph. 25, 933–953 (2001)
Article Google Scholar
C. Zheng, D.L. James, Rigid-body fracture sound with precomputed soundbanks. ACM Trans. Graph. 29, 69 (2010)
Google Scholar
J.N. Chadwick, D.L. James, Animating fire with sound. ACM Trans. Graph. 30, 84 (2011)
Article Google Scholar
J. Schrum R. Miikkulainen, Evolving multimodal networks for multitask games, in Proceedings of the IEEE Conference on Computational Intelligence and Games (CIG 2011) (IEEE Press, Piscataway, 2011), pp. 102–109
G.S.P. Miller, The motion dynamics of snakes and worms, in SIGGRAPH ’88: Proceedings of the 15th Annual Conference on Computer Graphics and Interactive Techniques, ed. by Richard J. Beach (ACM Press, New York, 1988), pp. 169–173
Y.S. Shim, C.H. Kim, Generating flying creatures using body–brain co-evolution, in Proceedings of the 2003 ACM SIGGRAPH/Eurographics symposium on Computer animation (Eurographics Association 2003), pp. 276–285
K. Iwadate, I. Suzuki, M. Yamamoto, M. Furukawa, Evolving amphibian behavior in complex environment, in ECAL’09 Proceedings of the 10th European conference on Advances in Artificial Life: Darwin meets von Neumann, ed. by G. Kampis, I. Karsai, E. Szathmáry (Springer, Heidelberg, 2011), pp. 107–114
K. Nakamura, I. Suzuki, M. Yamamoto, M. Furukawa, Acquisition of swimming behavior on artificial creature in virtual water environment, in ECAL’09 Proceedings of the 10th European conference on Advances in Artificial Life: Darwin meets von Neumann, ed. by G. Kampis, I. Karsai, E. Szathmáry (Springer, Heidelberg, 2011), pp. 99–106
V. Grimm, U. Berger, F. Bastiansen, S. Eliassen, V. Ginot, J. Giske, J. Goss-Custard, T. Grand, S.K. Heinz, G. Huse, A. Huth, J.U. Jepsen, C. Jørgensen, W.M. Mooij Birgit Møller, G. Pe’er, C. Piou, S.F. Railsback, A.M. Robbins, M.M. Robbins, E. Rossmanith, N. Røger, E. Strand, S. Souissi, R.A. Stillman, R. Vabø, U. Visser, D.L. DeAngelis, A standard protocol for describing individual-based and agent-based models. Ecol. Model. 198, 115–126 (2006)
Article Google Scholar
M.A. Senior, M.A. Charleston, M. Lihoreau, J. Buhl, D. Raubenheimer, Evolving nutritional strategies in the presence of competition: a geometric agent-based model. PLoS Comput. Biol. 11, e1004111 (2015)
Article Google Scholar
E.S. Huff, J.E. Leahy, D. Hiebeler, A.R. Weiskittel, C.L. Noblet, An agent-based model of private woodland owner management behavior using social interactions, information flow, and peer-to-peer networks. PloS One 10, e0142453 (2015)
Article Google Scholar
K. Sims, Evolving virtual creatures, in Proceedings of the 21st Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH ’94 (ACM, New York, 1994), pp. 15–22
M. Komosinski, S. Ulatowski, Framsticks: towards a simulation of a nature-like world, creatures and evolution, in Advances in Artificial Life. Lecture Notes in Computer Science, vol. 1674, ed. by D. Floreano, J.-D. Nicoud, F. Mondada (Springer, Berlin, 1999), pp. 261–265
N. Lassabe, H. Luga, Y. Duthen, Evolving creatures in virtual ecosystems, in Advances in Artificial Reality and Tele-existence. Lecture Notes in Computer Science, vol. 4282, ed. by Z. Pan, A. Cheok, M. Haller, R.W.H. Lau, H. Saito, R. Liang (Springer, Berlin, 2006), pp. 11–20
Chapter Google Scholar
N. Chaumont, R. Egli, C. Adami, Evolution of virtual catapults, in Proceedings of Artificial Life X, ed. by L.M. Rocha, L.S. Yaeger, M.A. Bedau, D. Floreano, R.L. Goldstone, A. Vespignani (MIT Press, Cambridge, MA, 2006), pp. 262–268
Google Scholar
J.E. Auerbach, J.C. Bongard, Dynamic resolution in the co-evolution of morphology and control, in Proceedings of Artificial Life XII, ed. by H. Fellermann et al. (2010), pp. 451–458
T. Miconi, In silicon no one can hear you scream: evolving fighting creatures, in Proceedings of the 11th European Conference on Genetic Programming, ed. by M. O’Neill, L. Vanneschi, S. Gustafson, A. Isabel, I. De Falco, A. Della Cioppa, E. Tarantino (Springer, Berlin, 2008), pp. 25–36
T. Miconi, Evosphere: evolutionary dynamics in a population of fighting virtual creatures, in IEEE Congress on Evolutionary Computation (CEC 2008) (IEEE Press, 2008), pp. 3066–3073
K. Sims, Evolving 3D morphology and behavior by competition. Artif. Life 1, 353 (1994)
Article Google Scholar
M.L. Pilat, C. Jacob, Evolution of vision capabilities in embodied virtual creatures, in Proceedings 12th Annual conference on Genetic and Evolutionary Computation, GECCO ’10 (ACM Press, New York, 2010), pp. 95–102
N. Chaumont, C. Adami, Sustainable population of autonomous foragers in a 3d environment with physics, in Proceedings of Artificial Life XIII, ed. by C. Adami, D.M. Bryson, C. Ofria, R.T. Pennock (MIT Press, Cambridge, 2012), pp. 581–583
Google Scholar
T. Ito, M.L. Pilat, R. Suzuki, T. Arita, Population and evolutionary dynamics based on predator–prey relationship in 3d physical simulation, in Proceedings of Artificial Life 14, vol. 14, ed. by H. Sayama, J. Rieffel, S. Risi, R. Doursat, H. Lipson (MIT Press, Cambridge, 2014), pp. 105–112
Google Scholar
N. Ouannes, N. Djedi, Y. Duthen, H. Luga, Predator–prey coevolution in a physically simulated environment, in 3rd International Conference on Control, Engineering & Information Technology (IEEE, 2015), pp. 1–6
T. Arita, M. Joachimczak, T. Ito, A. Asakura, R. Suzuki, Alife approach to eco-evo-devo using evolution of virtual creatures. Artif. Life Robot. 21, 141–148 (2016)
Article Google Scholar
A. Asakura, R. Suzuki, T. Arita, Evolving 3D virtual creatures through exaptation triggered by environmental change. Artif. Life Robot. 20, 244–250 (2015)
Article Google Scholar
N. Chaumont, R. Egli, C. Adami, Evolving virtual creatures and catapults. Artif. Life 13, 139–157 (2007)
Article Google Scholar
D. Whitley, J. Kauth, Genitor: a different genetic algorithm, in Rocky Mountain Conference on Artificial Intelligence (Colorado State University, Denver, 1988), pp. 118–130
G. Syswerda, Uniform crossover in genetic algorithms, in Proceedings of the Third International Conference on Genetic Algorithms, ed. by J. Schaffer (Morgan Kaufmann, San Mateo 1989), pp. 2–9
J.C. Bongard, Innocent until proven guilty: reducing robot shaping from polynomial to linear time. IEEE Trans. Evol. Comput. 15, 571–585 (2011)
Article Google Scholar
R. Smith, ODE: open dynamics engine. http://opende.sourceforge.net. Last accessed May 6 2016
R. Beer, Toward the evolution of dynamical neural networks for minimally cognitive behavior, in From Animals to Animats: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, ed. by P. Maes, M. Mataric, J. Meyer, J. Pollack, S. Wilson (MIT Press, Cambridge, 1996), pp. 421–429
R. Beer, The dynamics of active categorical perception in an evolved model agent. Adapt. Behav. 11, 209–243 (2003)
Article Google Scholar
J.A. Edlund, N. Chaumont, A. Hintze, C. Koch, G. Tononi, C. Adami, Integrated information increases with fitness in the simulated evolution of autonomous agents. PLoS Comput. Biol. 7, e1002236 (2011)
Article MathSciNet Google Scholar
L. Marstaller, A. Hintze, C. Adami, Cognitive systems evolve complex representations for adaptive behavior. Neural Comput. 25, 2079–2107 (2013)
Article MathSciNet Google Scholar
H.A. Orr, M. Turelli, The evolution of postzygotic isolation: accumulating Dobzhansky–Muller incompatibilities. Evolution 55, 1085–1094 (2001)
Article Google Scholar
D.I. Bolnick, B.M. Fitzpatrick, Sympatric speciation: models and empirical evidence. Annu. Rev. Ecol. Evol. Syst. 38, 459–487 (2007)
Article Google Scholar

Download references

Acknowledgments

This research was funded in part by the National Science Foundation’s BEACON Center for the Study of Evolution in Action under Contract No. DBI-0939454. The authors would like to thank Charles Ofria and Robert Pennock for support and discussions. We thank the Michigan State University High Performance Computing Center and the Institute for Cyber-Enabled Research (iCER).

Author information

Nicolas Chaumont
Present address: Atreca, Inc., 500 Saginaw Dr., Redwood City, CA, 94063, USA

Authors and Affiliations

The Natural Capital Project, Stanford, CA, 94305, USA
Nicolas Chaumont
Microbiology and Molecular Genetics, BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI, 48824, USA
Christoph Adami

Authors

Nicolas Chaumont
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Adami
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Christoph Adami.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 165 kb)

Supplementary material 2 (MP4 5936 kb)

Appendix: Foraging maps for the first and second food sources for several individuals

See Figs. 12, 13, 14 and 15.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Chaumont, N., Adami, C. Evolution of sustained foraging in three-dimensional environments with physics. Genet Program Evolvable Mach 17, 359–390 (2016). https://doi.org/10.1007/s10710-016-9270-z

Download citation

Received: 05 January 2016
Revised: 10 May 2016
Published: 07 June 2016
Issue Date: December 2016
DOI: https://doi.org/10.1007/s10710-016-9270-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Evolution of sustained foraging in three-dimensional environments with physics

Abstract

Similar content being viewed by others

How morphological development can guide evolution

Virtual Spatiality in Agent Controllers: Encoding Compartmentalization

Evolving and generalising morphologies for locomoting micro-scale robotic agents

1 Introduction