The results are firstly a methodological contribution, an analytical framework that will enable characterization of experimental situations elsewhere, and, secondly, the analysis of the diversity of experimental situations in our case study.
An analytical framework to study experimental situations
The framework is composed of 13 variables (placed between quotes) organized in the three previously identified stages: Design, Management and Evaluation (Table 1).
Table 1 The framework for analyzing an experimental situation Design stage
During this stage, farmers designed their experimental situations and chose what to test and how to organize the experiment? Two variables were related to the farmers’ goals from the conceptual model: the “objective” of the experimental situation and the “link to previous experimental situations.” Three variables were related to the planned experimental design: the degree of “novelty,” the “spatial scale,” and the existence of “simultaneous comparisons.” The variable “objective” represented what the farmers wished to modify in their farming production process and was composed of three modalities. In the first two, the farmers were aiming at direct improvement of their commercial crops or of the agroecosystem (biological regulations and ecological processes). The third modality was the case in which experimental situations were designed to solve a specific agronomic problem. The variable “link to previous experimental situations” described the relationship between the practice being tested and the practices in the previous experimental situations, with six modalities (the last being absence of linkage): the practice was set up to improve the agroecological processes targeted, economic or work performance, or feasibility compared to the previous experimental situation. One modality consisted in transposing a practice to other crops or to other parts of the farm, targeting similar effects. Another consisted in strictly repeating previous experiments. The variable “novelty” informed on the degree to which the practice being tested was new compared to the practices already implemented by the farmer and comprised three modalities. In the first one, the farmers tested a similar practice, meaning they already knew how to set up the practice and what effect was targeted. In the second one, the farmers tested a new practice based on a previously known logic (e.g., a regulation process through the release of previously tested biocontrol agents). In the third one, the farmers tested a new logic, thereby leading to more uncertainties or more elements to be taken into account, because they were not familiar with either the effects targeted or with the practices that would enable them to reach their target. The variable “spatial scale” informed on the number of plots in which the practices were being tested. This variable had four modalities. In the lowest one, the farmers tested practices in less than one plot (e.g., in strips within a field), thus minimizing risks in the case of failure. In the highest one, the farmers experimented on the whole acreage and the experimental situation was thus close to real farming conditions, including in terms of work organization and economic results. Finally, the last variable called “simultaneous comparison” informed on whether or not the farmers simultaneously compared several technical modalities. Some compared several modalities the same year, in the same plot or in several plots. Others compared the modalities with a reference situation, either planned or identified by opportunity because it made sense.
Management stage
During this stage, the farmers implemented the practices they planned to test, modified them if necessary, and collected information. The stage comprised five variables. The variable “progress” reflected the actions farmers added or transformed during the experimental situation to deal with unpredictable events. Three modalities were possible. Farmers adjusted the experiment while it was underway. Or they stopped it because of bad results. Or they did not change anything. Four variables described the information collected by the farmers and used to make their judgments and possibly to discuss their results. These variables were binary depending on whether the information was collected or not: “information on the setup” referred to the farmers’ technical capacity to set up their experimental situation. “Information on crops” referred to either the development or the yield of commercial crops. “Information on the agroecosystem” gathered all information on the biophysical and ecological aspects of the agroecosystem apart from commercial crops. “Information on work” referred to the workload or to the ergonomics of the practices.
Evaluation stage
This stage described the way farmers evaluate the results of their experimental situations and had three variables. Two represented how farmers judge their results. We distinguished achievement of the goal, i.e., results that were in line with the farmers’ expectations (defined with the variables “objective” and “link to previous experimental situations” in the Design stage) and “discoveries,” which concerned new and unexpected observations made while the experiment was underway. Each variable had three modalities. Splitting judgments between the two variables helped distinguish what farmers could validate/invalidate from what they had recently had to deal with. The variable “discussion of the result” described if and how farmers discussed their results. They could put forward an ex-post hypothesis to explain the success or failure of the experimental situation, or find an explanation for the technical failure. They could identify confounding effects when they considered that results might be due to other factors than those being tested. They could also relativize the results of the experimental situation because particular cropping conditions interacted with the practice being tested. Alternatively, the farmers might not discuss results at all.
The analytical framework we built proved to be efficient in coding the farmers’ discourses in the case study (Table 1, two last columns). Concerning the Design stage, most experimental situations aimed at improving the functioning of the agroecosystem in relation with previous experimental situations, with a limited degree of novelty, on one or two plots, and without comparing several modalities. Concerning the Management stage, the experimental situations were rarely adjusted while they were underway, and most information collected concerned commercial crops and the functioning of the agroecosystem. During the Evaluation stage, most results of the experimental situations achieved the farmers’ goals; few discoveries were made by the farmers; results were mainly not discussed.
These results show that the proposed framework makes it possible to describe experimental situations in detail based on textual analysis of farmers’ discourses. The three stages and 13 variables are sufficiently generic because it was possible to describe the wide range of the 181 experimental situations in the case study concerning 17 farmers and involving 10 types of agroecological practices and two production systems (market gardening and arable crops). The framework is an original contribution to the issue of farmers’ experiments because it focuses on the whole process from motives to evaluation of practices. The framework is focused on farmers’ knowledge production, and its use will help foster exchanges between farmers and agronomists on agroecology. Very few frameworks on experiments are available in the literature. In most cases, each scientist identifies only few key points. For instance, Quiroz (1999) studied farmers’ experiments in terms of the number and size of trial repetitions, location (and size) of trials, religion and other beliefs, and evaluation of experiments (agronomic aspects, yield, and economic benefits). In her review, Saad (2002) focused on steps, use of controls and replications, location, scale, the number of factors, and observation. Elements are described individually but they lack consistency between them. On the other hand, Kummer (2011) built a framework to identify several types of farmers’ experiments. This framework is composed of five variables: motives (personal, economic, problem solving or none), methodological approach, information sources, personal identification, and duration. The first difference between Kummer’s framework and ours is that the author considered information sources and personal identification as part of the experimental activity, which we did not. Conversely, the author did not include evaluation variables that we considered to be part of an experimental activity. Moreover, variables are qualified by five levels of intensity but detailed criteria used to assign a particular level are missing (for instance, how are qualified farmers’ methodological approaches qualified as highly elaborate or less elaborate?). Our framework focuses more on the consistency between the three stages of the experimental situations: How is the experimental situation set up depending on the farmer’s goals? Are the farmer’s conclusions consistent with what was observed during the experiment? How are results evaluated with respect to the stated goals? Therefore, we think that our framework is more useful to help farmers reflect on and design their experimental situations.
Another originality of our framework is that it makes it possible to deepen the analysis of the short-term dimension of the experimentation. Very few studies focus on this time scale, and we found no similar framework in the literature. Even if our framework is focused on short-term experimentation, it remains compatible with the long-term process thanks to the variable “link to previous experimental situations.” Yet a long-term experimental process was observed in our sample (134 out of the 181 experimental situations studied were linked with previous ones) and as was the case in Mak (2001), Quiroz (1999), Saad (2002), and Kummer (2011). The present study shows that the past is important to understand farmers’ experimentation. It makes it possible to position an experimental situation in terms of motives (what is the main objective of this practice? what is being changed compared to previous practices?) and novelty. Futures studies will be able to take changes in the experimental situations created by the farmers over the years into account using our framework.
While it is widely accepted that agroecology requires being adaptable, it is surprising that in our case study, practices were so rarely adjusted while the experiments were underway. One possible reason is that our framework does not take into account the time farmers needed to conceive their experimental situations: Sometimes, the experiment was designed a relatively short time before the first action was taken. Adaptation could also occur on a multi-year time scale, between experimental situations. For these different reasons, future multi-year analyses of farmers’ experimentation are needed to better understand the dynamics of changes and learning in agroecology.
One limitation of our work is that the method relies on farmers’ recollections, as most experiments were conducted in the past. In experiments that were conducted in the distant past, the farmers surveyed had difficulty remembering some details, particularly because they very rarely keep written records of experiments. This is why it could be difficult to inform variables or modalities as the farmers’ starting goals, or on-going adaptations of the experimental situation. What is more, we only focused on the broad types of information collected during the experiment (4 variables), not on its precise nature and the way farmers collected it, although this would be interesting information to collect. Indeed, Toffolini et al. (2016) pointed to the need to develop new indicators to help farmers engage in the agroecological redesign of their farming systems, including through experimentation. As the use of such indicators is difficult to identify in past experimental situations, they could be discovered more easily with complementary methods, for example, regular monitoring of on-going experimental situations (Baars 2011).
The diversity of experimental situations in the case study
This section describes the diversity of the 181 experimental situations in 10 groups (Table 2). All the descriptive variables of the framework contributed significantly to the grouping. The five most significant variables were “link to previous experimental situations,” “simultaneous comparison” and “novelty” in the Design stage, “progress” in the Management stage, and “discussion of the results” in the Evaluation stage. “Information on work” and “information on the setup” were the least significant descriptive variables. Most groups were described by several variables belonging to each of the three stages of the framework, except for groups 3, 6, 8, and 10, which only contained variables from two stages. The main result of the statistical analysis was the identification of four clusters of experiments that differed in their goals, management, and ways of interpreting the results: experiments to improve previously experimented practices (groups 1 to 5), failed experiments that did not reach the farmers’ goals and were often stopped before the end (groups 6–7), breakthrough experiments where the farmer tried a practice that revealed a new logic for the first time (group 8), and finally experiments based on simultaneous comparisons of several modalities (groups 9–10). After describing the four clusters and the corresponding groups based on the significant descriptive variables, in the last section, we analyze whether some groups are specific to a particular practice, farmer, or type of production.
Table 2 Groups of experimental situations identified in the case study
Improvement experiments (groups 1–5)
The first cluster accounts for half the groups and for 113 out of the 181 experimental situations identified. The five groups correspond to a low or medium level of novelty of the practices being tested. In group 1—low novelty practices to improve yield—farmers had already used the practice being tested and their aim was to improve the yield of the commercial crop; they were thus sufficiently confident to implement it at a large scale. Information on commercial crops was collected but not on the rest of the agroecosystem; no change was made during the experiment, and the results were more or less what the farmers expected. The experimental situations did not lead to discoveries. Groups 2 and 3 are characterized by a medium level of novelty. In group 2—modifying a practice to improve work or economic performance—farmers had already tested the agroecological processes they were targeting. This time, they were adjusting the practices to make them easier or cheaper to implement. As a consequence, information was mainly collected on work. These experimental situations were often applied at a large scale. In group 3—extension of a logic to other farm elements—the experimental situations consisted in transposing an agroecological logic that had already been assessed in the previous experimental situations. When doing so, they started to experiment at a small scale because the practices had to be adapted to another part of the cropping system. Group 4—low novelty experiments with good results—and group 5—experimental situations to improve the mastery and understanding of an agroecological practice—consisted in testing a practice they had already tested, this time with a focus on the agroecosystem and not on the commercial crops. They consisted in two levels of mastery: in group 4, the practice was sufficiently assessed so that farmers mainly repeated the previous experiment without adjusting it during the experiment; they often reached the goals targeted and did not discuss the results. On the contrary, in group 5, the aim was to improve the feasibility of a practice that had already been tested but was not completely satisfactory. Farmers still had to adapt practices during the experiment and discussed the results in order to better reach their goals. These different groups of experimental situations could be part of larger processes that Lyon (1996) called “learning during action” and Millar (1994) “adaptive experiments,” based on small changes without taking large risks. Group 3—extension of a logic to other farm elements—is quite original. It reveals that farmers may experiment in order to adapt and transpose some agroecological practices to other parts of their cropping system.
Failed experiments (groups 6–7)
The second cluster, although a minority (19 experimental situations out of 181) gathers two groups, group 6—discovering technical limits—and group 7—interruption by discovering agroecological limits—, where the practices being tested were judged negatively and the experiments were frequently stopped in the middle. The experimental situations of these groups led to far more discoveries and discussion of the results, either technical and based on how to set up practices (group 6) or for agroecological and agronomical reasons (group 7), where ex-post hypotheses were formulated. These two groups are close to what Chantre (2011) called “trial and error” learning processes. Farmers experiment in a relatively short time on their own, and most of these experiments did not achieve the goals set by the farmers.
Breakthrough experiment
The third cluster corresponds to group 8 alone (29 experimental situations out of 181). This group is made up of experimental situations in which new agroecological logics are being tested for the first time. These experimental situations represent the moment a farmer decides to try a new practice (that involves a new logic) he/she discovered off the farm or imagined and receives evidence of its interest. It is surprising that most of the results of these experiments were not discussed. Two hypotheses can be proposed to explain this phenomenon. Either the expectations from a very new practice are fuzzy, as noted by Toffolini (2016), who showed that farmers trying out practices based on a new agronomic principle first wanted to know if they were consistent rather than aiming at precisely quantifying the processes. Or, a breakthrough experiment involves a practice admittedly unknown to the farmer concerned, but already proven by numerous other farmers. This group is original and could not be related to other types of farmers’ experiments in the literature.
Comparison experiments (groups 9–10)
The last cluster is also a minority (20 out of 181 experimental situations), but group 9 and group 10 are the only groups in which several modalities were compared or compared with a reference in the same plot. In group 9—comparison of several modalities to improve agroecological processes that affect the development of commercial crops—several modalities were implemented in the same plot to simultaneously compare agroecological process of practices targeting a very precise goal: to identify the best modality for developing commercial crops. The practices experimented had not been the subject of previous experimentation but used a logic that resembled one the farmer had used previously. For example, farmer 3 tested no-till maize for the second time. This time, he controlled ray grass weeds before sowing and also sowed faba as a previous cover crop. He compared two modalities: strip till in one plot and direct sowing in the other. Compared to group 9, group 10—small scale experiments with a reference—occurs in a broader range of cases. For example, farmer 1 tested no till on a small strip in a vegetable bed and tried mulching with ramial chipped wood for the first time. He compared changes in the soil structure and in biological life with that in other parts of the vegetable beds that he tilled normally. Most often, the reference is not specifically designed as such but is the farmer’s current practice. These two groups of experimental situations are close to the “accurate experiments” of Kummer (2011), conventional approaches of Lyon (1996), because they both involve simultaneous comparison and small scale.
As a first discussion of the 10 groups, it should be noted that all the variables of the framework were significant for one group or another. Among the five variables that contributed the most to grouping, three belonged to the Design stage, one to Management, and one to Evaluation. In particular, the variables “link to previous experimental situations” and “novelty” were useful to interpret how farmers designed the experiments in several groups, and the variables “progress” and “discussion of the results” were useful to interpret how farmers dealt with surprises or problems in terms of actions and analysis. These variables should therefore be the priority in future attempts to investigate experimental situations in other production systems. Conversely, the variable “objective” frequently did not discriminate among groups. This may be due to the fact the three modalities (improving the yield of a commercial crop, improving agroecosystem functioning, or solving an agronomic problem) are too broad. For the statistical analysis, we chose to distinguish farmers’ positive and negative judgments of the results, which enabled us to identify failed experiments (groups 6 and 7) although their number was low. A bigger sample would perhaps have revealed a wider range of experimental situations that can lead to negative results. Finally, it will now be necessary to assess the robustness of the 10 groups in comparison with other data sets and, in particular, to check if the two original groups in this study (groups 3 and 8) are identified again.
Are some groups linked to the supplementary variables? The “type of practices” and “type of production” were the most significant, and 7 groups out of 10 were significantly correlated with at least one of their modalities. “Farmer’s name” was also significant; six groups were respectively characterized by six different farmers. However, no group was 100% characterized by one farmer and no farmer restricted his/her experimental situations to one particular group. Overall, none of the 10 groups was 100% characterized by one particular modality of the supplementary variables. Our framework is therefore sufficiently generic to cover a large diversity of contexts but specific contexts may lead to a greater proportion of particular groups of experimental situations. For example fertilization was frequently tested in group 1—low novelty practices to improve yield—which can be explained by the fact that it is a classical question for farmers that are closely linked to yields of commercial crops. Reduced tillage practices were common in group 10—small-scale experimental situations with a control—probably because implementing two or more modalities of soil tillage in one plot is feasible for farmers as the modalities will not spill over and interact, contrary to releasing biocontrol agents, for example. Ecological infrastructure practices were the main aspects tested in group 2—practice modification to improve work or economic performance. Concerning the type of production, arable crops were significant in three groups: group 1 (low novelty practices to improve yield) as fertilization is a priority issue in arable crops, group 7 (interruption due to discovering agroecological limits) as testing a cover crop or reduced tillage practices provide clear information on whether it is worth continuing or not, and group 9 (comparison of modalities to improve agroecological process impacting commercial crop development). Market gardening was significant in only one group (group 2—modifying a practice to improve work or economic performance). Because vegetable production is complex and time-consuming, market gardeners frequently experiment how to improve work or save money. Farmer 17 was significant in group 2—practice adjustment to improve work or economic performance—because he frequently tested permanent garden beds and focused on ways to improve work. Farmer 8 was also a market gardener and was significant for group 3—extension of a logic to other farm element. Based on good results with ladybugs and parasitoid wasps, he tested the transfer of different endemic auxiliaries to various crops and pests. Therefore, these elements of the context need to be taken into account in future studies, either for different types of production, for other agroecological practices, or farmers’ contexts. On the contrary, the other supplementary variables “production label” (separating organic and conventional farming) was not significant.
We now discuss the results according to the scale of experiments and the nature of comparisons. First of all, more than 75% of the experimental situations were conducted at a small scale (one or two plots or greenhouses) or less. Our results suggest that the size of the experimental plots is correlated with the risk of failure as perceived by the farmer. Large-scale experimental situations were mostly used in groups 1 and 2, where the practices being tested have a low or medium level of novelty and, probably, a limited risk of failure. On the contrary, failed experimental situations mostly occurred at small scale, probably because farmers were fully aware that the practice they were testing was risky. One questioning point is that breakthrough experiments (group 8) were not necessarily carried out on small- or very small-scale plots. One possible explanation, based on Kummer’s results (2011), is that the farmers implemented such very innovative practices on a large scale when there was no alternative technique to solve an agronomic problem or because they already had sufficient confidence in the practice. The second key dimension deals with comparisons. Very few experimental situations (13%) involved a simultaneous comparison, even with a reference, whereas for scientists, it is embedded in the experimental activity. Although this figure is consistent with what Kummer (2011) and Sumberg and Okali (1997) found (15 and 39%, respectively), it does not mean that farmers do not use a reference, but more likely that their references are based on memories (for instance, what does a field without a cover crop look like in late summer). Yet, farmers who compared different modalities were rare. We explain this by the fact that farmers are usually looking for “best bet” practices or combination of practices. And if one option seems promising, to save time and reduce risks, they will only try that one. For instance, in a participatory experiment, for ease of implementation and evaluation, most farmers chose only one option (associating maize and velvet bean) and tested it on single plots with no replication (Buckles and Perales 1999). To sum up, it seems that farmers, at least those with experience, usually know how to limit risks when they are testing a new practice. They balance the risks as a function of the novelty of a practice, its spatial scale, and, possibly, information seeking. They also limit the time dedicated to reflection and organization. In most cases, they test one “best bet” practice on one plot. Risky situations may still arise when farmers experiment very different practices at a large scale. To avoid such risks, they should either first conduct some experiment at a small scale or make sure they are well informed before starting. One major aid for farmers who wish to experiment would be to look for more simultaneous comparisons. In particular, we believe that farmers who experiment should receive more support in including two or more modalities, which would ensure the outcomes are more efficient. However, one condition first needs to be fulfilled. Several equally worthwhile modalities need to be identified; otherwise, if one modality appears to be much better than the others, the farmers will very likely test only that one. In this context, one way to help the farmers would be to evaluate with them during the Design stage, which modalities could be considered, and, if need be, to encourage them to set them up (choice of the plot, regular observations with the farmer, and visits to other farmers).
Outlook for development and research
First, individual experimental situations should be encouraged because they have the advantage of suiting a particular farmer’s motives and need for change. This kind of approach is close to experiential learning as defined by Baars (2011), which enables the production of situated knowledge in collaboration with farmers. Our work enabled us to access farmers’ ongoing and past experimental situations, but it was sometimes difficult for farmers to remember all of them well, and they were rather tacit. The proposed framework appeared to be efficient in helping them be more explicit. A simple action to help farmers would be to encourage them to record their experimental situations: taking notes on their experimental situations would favor their reflexivity as well as help avoid important information being forgotten. In return, farmers should be helped to conceive new experimental situations. The task would be to reflect on previous experimental situations to clarify the new design stage: explain their motives, discuss possible risks, and think about appropriate modalities to try.
Second, collective experiments on agroecological systems are expanding (Navarrete et al. 2018). By studying farmers’ learning processes, Chantre et al. (2014) underlined the importance of interaction with others (farmer development groups, peer farmers, an advisor) in acquiring new knowledge, including through experimentation. Experimental situations may also help other farmers. As every farmer will explore practices differently, discussing experimental situations can help identify complementarities or summarize the knowledge produced. When the interest of a topic grows in a group of farmers, a collective experiment could be set up. Different modalities should be chosen among members and tested, as proposed by Buckles and Perales (1999). For that purpose, using the framework with a group of farmers could help collect and organize information on experiments, which could then be used to warn new farmers about possible “unpleasant surprises” when certain practices are used or to enable them to benefit from what others already discovered to save time and for cognitive economy. Collecting and capitalizing on experimental situations is an opportunity to empower farmers in their knowledge production. The interest of farmers’ experimental situations is that the emphasis is not on the knowledge content itself but rather on the way it is produced (Briggs 2013). Experimental situations ensure that the context of situated knowledge production is there to be discussed, for instance in meetings between farmers. This helps question the context (the main objective of the experimental situation, links with previously tested practices, and the degree of novelty) and the results (farmers’ judgments and discussion). Our framework could also be used as a support tool for discussion on a field trip during which the farmers present their own experiments. What is he/she referring to? To a breakthrough experimental situation on a single plot or to a practice being tested for the 5th time, aimed at improving agroecological processes?
Finally, are the farmers’ and the scientists’ experimental processes complementary? Toffolini et al. (2017) refer to farmers gaining insight into their actions when they acquire fundamental knowledge, the farmers’ experimental situations could be discussed with agronomists to explain some of the results achieved on the farm in the light of agronomical theory or concepts, thereby sharing scientific and fundamental knowledge with farmers. Following Baars (2011), Hoffman et al (2007), and Sumberg and Okali (1997), farmers’ experimental situations can be considered as a complementary activity to scientific experiments to produce knowledge. Capitalizing on farmers’ experimental situations in agroecology could be a way to integrate farmers’ empirical knowledge in scientific innovation and knowledge, one of the five ways proposed by Doré et al. (2011) to deepen research on ecological intensification.