1 Introduction

In many river basins, increasing resource demands lead to increased interconnectedness of human behavior and the water system [33, 47, 55]. The conservation of natural resources partly depends on the collective effort by individual actors. To effectively build on this collective action by individuals, management organizations need to take into account the way in which individuals make decisions in the context of their natural and social environments. The process of the diffusion of technologies or innovations over a social system is well-studied [40]. However, the roles of its drivers are not well-known, particularly in relation to water and soil conservation. The theory on the diffusion of innovations [42] describes how innovations diffuse over a network of socially-connected users. It has been applied in many fields including economics and marketing ([5], see also [22]). For capturing the important role of human behavior in socio-ecological systems, while accounting for the relevant heterogeneity of the physical and social environment, Agent-Based Modeling (ABM) or geo-simulation (when geographically-explicit) continues to gain popularity [2, 30, 41, 53]. Successful examples of ABM models for representation diffusion of innovation include Schwarz and Ernst [46] and Kuandykov and Sokolov [27] and articles in a special issue on the topic [7]. Relevant literature on applications to conservation issues does exist [4, 51], in particular in relation to agriculture ([3, 45], Van Duinen 2015). Methodological challenges include those related to (1) the design and parameterizing of agent decision models [49]; (2) verification, validation, and sensitivity analysis [6, 19]; and (3) the role of social networks and subjective norms. A subjective norm is the perceived social pressure to perform or not to perform the behavior [1]. In literature on social networks, it is argued that participation in social networks and distribution of motivations among populations are a function of network size, weak and strong ties, and elite influence [48]. Applied to the adoption of innovations, the probability of adoption is influenced by the characteristics of the innovation, the characteristics of the agents (including the position in the social network), and the characteristics of the environmental context (e.g., geographical settings).

The main purpose of the paper is to demonstrate that empirical data on subjective norms and participation in collective action can be used to parameterize a geographically-explicit ABM, simulating the diffusion of soil conservation effort. Next to demonstrating how our modeling approach allows for making use of these empirical data, we also evaluate the effect of selecting different approaches for model parameterization to encapsulate particular heterogeneous features of the local case-study environment. For these purposes our newly developed CONSERVAT model is introduced. The name CONSERVAT is inspired by earlier model applications labeled ANIMAT (simple robots, [61]) and CONSUMAT (consumer behavior, [13, 14], Kangur et al. in press). Using CONSERVAT as a modified version of the original CONSUMAT approach allows us to study the effects of social influence and collective action in a real-world physical environment.

1.1 The Lake Naivasha Basin

Lakes within endorheic basins often support unique ecosystems and are sensitive to variations in the inflow of nutrients and sediments, affected by human activity in upstream areas. For such lakes, excessive influxes of nutrients and sediments could lead to undesired rises in turbidity levels and sedimentation rates. Sedimentation in lakes and eutrophication may occur simultaneously as sediment loading often relates to phosphorus loading (e.g., [20]). These sedimentation rate increases, due to soil erosion, often result from unsustainable agricultural practices. However, soil erosion (soil loss) from agricultural areas can be influenced through the adoption of soil conservation practices by farmers. Lake Naivasha (Fig. 1) is a fragile (and degraded) endorheic lake ecosystem supporting a rich variety of flora and fauna [11, 57]. Economic developments in the area are strongly connected to the lake ecosystem and the hydrological processes in the Lake Naivasha basin (3400 km2; [34, 37]). The upstream part of the Lake Naivasha basin comprises the Gilgil, Malewa, and Turasha sub-basins (Fig. 1). The lake ecosystem is vulnerable to physical-chemical degradation by nutrients and sediments and over-abstraction of water resources (e.g. [8]). Many human activities depend on clean freshwater from the lake whereas both the availability and quality of freshwater vary considerably over space and time [36]. In addition to natural climatic variations, water availability is affected by recent investments in irrigated agriculture and by population growth [28, 39]. These developments have substantially increased water demand and unsustainable agricultural land use practices. In particular directly around Lake Naivasha, water demand (both groundwater and surface water) has increased, fueled by the large-scale production of cut flowers for export markets [56]. Challenges faced by smallholder farmers throughout the upstream parts of the basin are associated with unsustainable agricultural land use practices [60] and high rates of soil loss [38] and soil fertility loss partly counterbalanced by soil conservation efforts [60]. Drivers of erosion include seasonal cropping patterns and land fragmentation. Farming occurs on steep slopes that require land management practices to ensure sustainable agricultural use. The study by Willy and Holm-Müller [60] explored the effect of subjective norms and participation in collective action initiatives on the SCE in the Lake Naivasha basin. A subjective norm is the social pressure that makes people feel obliged to undertake a behavior (in our case, adoption of a technology) just because important people within their social networks feel they should do so or have already done it themselves. In Willy and Holm-Mueller (2013), subjective norms were found to have played a significant role in driving the diffusion of soil conservation technologies (Willy and Holm-Mueller 2013). Their study also showed that participation in collective action and subjective norms differ between local populations and that this affects SCE levels. In this study, we take these spatial differences into account and evaluate the effects of this heterogeneity.

Fig. 1
figure 1

The three upstream sub-basins of the Lake Niavasha basin: Gilgil (G), Malewa (M), and Turasha (T). The locations of 307 survey respondents [60] are also shown

2 Method

To assess the processes leading to soil erosion and subsequent sedimentation downstream, the relevant physical processes and social mechanisms need to be analyzed in an integrated way. In the CONSERVAT model, basin level processes (i.e., sediment fluxes associated with erosion and sedimentation) emerge from micro-scale local processes that are influenced by the activities of local farm households (soil conservation practices in this case). Relevant processes (related to hydrology, soil loss, and human behavior) are represented in a geographically-explicit agent-based model. In this section, first the CONSERVAT model is introduced (Section 2.1), then the method for model parameterization is described (Section 2.2), followed by a description of model simulations and evaluation of results (Section 2.3).

2.1 The CONSERVAT Model

Our CONSERVAT model is described using standard elements of the ODD protocol [9]. Here we briefly introduce the model, while the full ODD description is included in the Appendix.

The CONSERVAT model is used to evaluate subjective norms on (i) the spatiotemporal diffusion pattern of Soil Conservation Effort (SCE) levels; and (ii) the resulting reduction in lake sedimentation rates for Lake Naivasha. In the model, smallholder farm-households are represented by agents at the local level (local catchments, see Fig. 2). Groups of neighboring agents (within a maximum distance of five kilometers) represent peers in the social environment of individual agents. The modeled natural environment consists of a spatial river basin structure with accompanying hydrological and soil loss processes. The spatial river basin structure includes local catchments for which soil loss is calculated annually. Collections of adjacent local catchments form sub-basins for which rainfall-runoff is determined monthly. At the downstream end of the Lake Naivasha basin, a lake and a connected aquifer are modeled using a simple monthly water balance model [56]. The overall spatial structure of the CONSERVAT model is presented in Fig. 2. State variables include Farm-household strategy (repetition, optimizing, inquiring, and imitation) and Farmer choice (regarding the adoption of an elevated SCE level: from level 0 to level 1 or 2). State variables that are related to hydrological and soil loss processes include runoff in sub-basin streams (m3 month−1), water in the lake (m3) and in the connected aquifer (m3) and soil loss from farms and local catchments (tons year−1), sediment inflow into the lake, (tons year−1) and sedimentation on the lake bottom (mm year−1).

Fig. 2
figure 2

Top right: The lake Naivasha basin with three sub-basins (left frame) and local catchments in which agents are located. Bottom right: The variables in the farmer box are fully explained in Table 1. The Unified Soil Loss Equation (USLE), accounts for Erosivity (R), Erodability (K), Topography (LS), Land Cover (C) and Conservation Practices (CP) as defined by Wischmeier and Smith [62]. The preference (pi(t)) depends on recently experienced soil loss (personal preferences for soil conservation). The rule that is applied for updating agent preferences for soil loss reduction is shown in Table 2. Besides personal preferences, the adopted strategy (optimizing, repetition, inquiring, or imitation) also depends on characteristics of social networks. Left: Parts of the graphical user interface (GUI) of the CONSERVAT model including a dynamic geographical map, water levels in Lake Naivasha, soil loss in the basin, sedimentation in Lake Naivasha and an adoption-diffusion pattern. For a more detailed representation of the GUI, see the Appendix. The model is also available at OpenABM: https://www.openabm.org/model/4597/version/1/view.

Modeling the process of adoption of a SCE level using ABM is based on the CONSUMAT approach [13, 16]. It is well-known that in modeling, in the interaction between natural phenomena and social processes, there is a confusing amount of theories that can be chosen to describe social and individual behaviors. Not to say that there are conflicting theories [44]. Focusing on economics, mainstream economics is often narrowed down to rational behavior in a neo-classical paradigm. In Janssen and Jager [18], Jager & Janssen [12], and Kangur et al. [21], it is discussed how individuals are engaged in broader cognitive processes than only rationality, in deciding how to behave. On the one hand, individuals want to satisfy their needs, but on the other hand they are uncertain, with respect to the difference between expected and actual outcomes of their behavior. These two aspects lead to four possible behaviors: repetition (i.e., repeat past habits, imitation (i.e., copy others), inquiring (compare to others to decide), and optimizing (i.e., deliberation). Our approach builds on a version of the CONSUMAT model as described in Janssen [17]. Levels of uncertainty (Unc) and need satisfaction (LNS) provoke an agent to decide on the soil conservation effort to perform, according to one out of the four possible strategic behaviors (Fig. 2).

In our CONSERVAT model, both personal preferences for soil conservation and characteristics of social networks affect decisions of agents. Consequently, subjective-norm effects are incorporated into agent decision making. The strategy applied by an individual agent depends on an expected Level of Need Satisfaction (LNS) and the Uncertainty (Unc) a person faces with regard to adopting an elevated SCE level. This uncertainty is influenced by the adopted SCE levels of peers. Agents compare their expected LNS and uncertainty, with a minimum required LNS (LNSmin) and maximum tolerated uncertainty level (Uncmax) and consequently choose one out of four strategies to apply: repetition, optimizing, inquiring and imitation.

In line with Wejnert [58], as discussed above, the need to conserve soil (LNS of a SCE level) is influenced by characteristics of the adopter, characteristics of the product (the SCE level, resulting in soil conservation), and the environmental context:

  • a personal preference for soil conservation of a farmer (pi) in the context of the actual local soil loss situation, see Table Appendix 3 (Appendix)

  • a product characteristic (δj), (i.e., the erosion reduction factor associated with a SCE level at a specific location)

  • the share of peers employing a particular SCE level (xj)

The level of uncertainty regarding a certain SCE level (Uncij) is also influenced by the share of peers employing that particular level of soil conservation effort (xj). The following equations are used for determining the expected level of need satisfaction (LNS) and uncertainty (Unc) of farmer i and SCE level j:

$$ LN{S}_{ij}={\beta}_i\left(1-/{\delta}_j-{p}_i/\right){x}_j $$
(1)
$$ Un{c}_{ij}=\left(1-{\beta}_i\right)\left(1-{x}_j\right) $$
(2)

where βi reflects the insensitivity of a farmer to its social environment. Equation 1 reflects how personal preferences and those of peers affect personal satisfaction levels. Equation 2 describes how uncertainty levels of individuals are affected by the observed share of peers employing a particular SCE level (xj). Agents with a high β are less sensitive to the choices of others than those with a low β. In our CONSERVAT model, agents can choose between four consecutive and increasing SCE levels:

  • SCE level 0: No conservation measures applied

  • SCE level 1 g: Grass measures applied (planting grass strips, mostly using Napier grass). The conservation practice factor (CP) is between 0 and 0.9 depending on the location.

  • SCE level 1 t: Terrain measures applied (bench terraces or contour farming). The conservation practice factor (CP) is between 0 and 0.9 depending on the location.

  • SCE level 2: Both measures are applied. The conservation practice factor (CP) is the combined effect of the two erosion reduction factors δ.

The CONSERVAT model is developed to run for the period 1965–2010 using a monthly time step. Over the simulation period 1965–2010, the agent population is kept constant at a level of 10,000 agents (households) for the entire Lake Naivasha basin. The majority of these agents is positioned in one of three sub-basins: Gilgil (G), Malewa (M), and Turasha (T), see Fig. 1. The chosen agent-population is based on the estimated average household size of around eight in 1965 [63], corresponding to a population of ~ 80,000 with their livelihoods being dependent on small-scale farming practices. No agents disappear and no new agents are created. Also, the social links between the agents and their peers remain unchanged. To still account for an increase in actual population over the period 1965–2010, the intensity of farmland use (and related soil loss) between 1965 and 2010 is assumed to gradually increase by a factor 3.5, along with actual population growth in the area: ~ 80,000 around 1965 and ~ 284,000 in 2009 [23,24,25,26].

2.2 Parameterization of the CONSERVAT Model

For the description of the parameterization method for human behavior, we use the framework proposed by Smajgl et al. [49], see Fig. 3 and Table 1. In Table 1, the methods applied are described along with a summary of the data used. The initialization of agent characteristics from our survey data is done in two alternative ways. Agent attribute values are either assumed to be homogeneous for all three sub-basins together (GMT) or specified heterogeneously by sub-basin (G, M, and T) based on corresponding sub-samples of the social survey data (Fig. 1). As stated in the “Introduction” section and as is shown in Table 1, our survey data show interesting differences between the three sub basins for the variables “Participation in collective action: CAPART”, “Perception that soil erosion is a problem: PECEROYES”, and “Sensitivity to subjective norms: SUBNORM”. The differences for these variables are expected to affect the diffusion of SCE levels over the basin. Therefore, we compare a situation in which the entire agent population is regarded as homogeneous over the entire study area and a situation in which sub-sets of the household survey data-set are used to initialize populations of sub-basins.

Fig. 3
figure 3

Framework for parameterization of the CONSERVAT model, adapted from Smajgl et al. [49]. The implementation of these methods (M1–M5) is included on Table 1

Table 1 Overview of methods used for parameterization of agent behavior in the CONSERVAT model. The distribution of the values for parameterization of agent attributes is either assumed homogeneous for all three sub-basins together (GMT) or heterogeneous, i.e., specified by sub-basin (G,M and T)

For calibration of our CONSERVAT model the following two additional parameters have been introduced:

  • Probability of adoption (prob_adoption): This fraction determines the number of agents that annually considers to alter (elevate) their soil conservation effort (SCE). Only those agents engage in decision-making according to the scheme shown in Fig. 2

  • Soil loss calibration (RKLS_calib): In order to achieve realistic values of overall soil loss [62] the USLE parameters (RKLS) are multiplied by a factor. This is done to account for the effects of erosion-inducing activities that add to those accounted for in the USLE method. This also includes erosion from bare lands and roads surrounding farmland. A factor of 2.5 was found to be realistic and results in sedimentation rates that are in the same range of Stoof-Leichenring et al. (2011).

2.3 Model Simulations and Evaluation of Results

We test the effect of different ways for initializing the model using data from a household survey on spatiotemporal diffusion pattern of adopted SCE and on the sedimentation rate of Lake Naivasha. Note that in the previous section, spatial scale resolution (either lumped, i.e., covering the entire basin, or distributed to sub-basins) is an important issue for parameterization. Therefore, the following simulation specifications and initializations apply:

  • Homogenous initialization: Differences among individuals in agent population (10,000 agents) are distributed homogeneously over the entire study area (all three sub-basins from Fig. 1, lumped: GMT). Initialization values used are those for GMT as specified in Table 1.

  • Heterogeneous initialization: The agent population is initialized according to sub-basin specific information (G, M and T) as specified in Table 1. The survey-respondent population was categorized into three sub-basins (Fig. 1).

These model runs for homogenous initialization and heterogeneous initialization are performed to explore the effects of sub-basin specific information on diffusion of SCE levels. Specific details on the initialization of the CONSERVAT model can be found in the appendix. In addition, model runs for the two reference situations are carried out to determine the benchmarks of the model output on lake sedimentation:

Reference-no SCE: The entire agent population (10,000 agents) remains at SCE level 0 throughout the simulation period.

Reference-max SCE: The entire agent population (10,000 agents) remains at SCE level 2 throughout the simulation period.

An important variable to judge the performance of our model in the next section is to evaluate how our model simulation results compare to the reconstructed spatiotemporal diffusion pattern of adopted soil conservation effort [60] for the period 1966–2010. The reconstructed spatiotemporal diffusion pattern was categorized into three sub-basins (G, M, and T) for five consecutive periods of 10 years: 1960–1969; 1970–1979; 1980–1989; 1990–1999; and 2000–2009.

Next to comparing observed and simulated spatiotemporal diffusion patterns of SCE, we also compare the simulated sedimentation rate in Lake Naivasha with an independent data set on the observed sedimentation rate in Lake Naivasha [52].

Finally, to test the sensitivity of our model outcomes, a local sensitivity analysis has been applied in which the main model parameters are varied separately. To evaluate the influence of randomness, the model was tested for three arbitrarily chosen random-seeds. For all final simulation runs, the random seed has been fixed to a single value to avoid differences in initialization of model runs.

3 Results

3.1 Model Simulation Results on Diffusion Patterns

Results of our simulations are presented in Figs. 4 and 5. The outcomes presented in Fig. 4 show how the diffusion of SCE levels evolves over time. Figure 4 reveals a considerable difference between the results of homogeneous and heterogeneous initializations. This is the effect of using sample subsets of sub-basin populations for agent initialization (heterogeneous initialization) rather than using the entire household survey respondent set for initialization (homogeneous initialization). The difference is partly explained by the different ratios between the number of survey samples and the actual agent population that is used to represent the farmer population. For the homogeneous initialization, the respondents from the Malewa (M) sub-basin are relatively over-represented and respondents from the other sub-basins are relatively under-represented (Table 1, under M5). This plays a role in all stages of parameterization of agent behavior in the CONSERVAT model (Table 1, Fig. 3). Another part is explained by the diffusion process itself, since SCE levels do not easily rise in places with low initial SCE levels.

Fig. 4
figure 4

Model output for CONSERVAT simulations of the diffusion of SCE levels 0, 1, and 2 for homogeneous and heterogeneous initializations

Fig. 5
figure 5

Model results for the diffusion of soil conservation effort for different initial levels of effort: SCE 0, left panel, SCE 1, middle panel and SCE 2, right panel. Homogeneous initialization (solid line); heterogeneous initialization (dotted line); empirical data (circles; observed by [60]). First row, entire upper basin (GMT), followed by the sub-basins of Gilgil (G), Malewa (M), and Turasha (T)

In Fig. 5, model simulation results are presented for the entire upper basin (homogeneous initialization: aggregated basin GMT) and by sub-basin (heterogeneous initialization: sub-basins G, M, and T). A visual comparison between model simulation results and empirical data on the reconstructed adoption pattern of soil conservation technology shows that the simulation results are in the same range as the empirical data. There are however considerable differences between the three selected sub-basins, particularly in relation to the type of initialization applied.

The difference between simulation outcomes for homogeneous and heterogeneous initializations is particularly large for the Gilgil (G) sub-basin. This can be explained by the relatively large difference between initially adopted SCE levels of both initializations for the agent population. For the Gilgil (G) sub-basin, the diffusion of adoption is relatively slow and limited presumably due to below-average values for CAPART (implying less social links per agent), and a high sensitivity to the local subjective norm (SUBNORM) which in this case hampers the diffusion process since the initial adoption rate was 0% (Table 1).

For the Malewa (M) sub-basin, the diffusion of adoption is slower in case of heterogeneous initialization in comparison to homogeneous initialization. This is explained by the initial adoption rates of high SCE levels which is lower in case heterogeneous initialization. Also, the Malewa (M) sub-basin specific values for SUBNORM and CAPART (see Table 1) are less favorable for diffusion than is the case for Homogeneous initialization.

For the Turasha (T) sub-basin, the opposite is true from the Malewa (M) sub-basin. The initial adoption rate of high SCE levels is much higher in case of Homogeneous initialization. The initial adoption rates of high SCE levels in this sub-basin is also high in comparison to the other two sub-basins. In the case of the Turasha (T) sub-basin, the high values for sensitivity to the subjective norm (SUBNORM) and participation in collective action activities (CAPART) speed up the diffusion of a higher SCE level rather than to hamper it.

The physiological sensitivity of the area to erosion is highest in the Gilgil (G) sub-basin, as determined using the USLE approach [62]. The highest value for the people’s perception of erosion to be a problem is also highest in that area (Willy and Holm-Muller, 2013). However, the diffusion of adoption of higher SCE levels is very slow in this part of the basin (Fig. 5). This applies both to our model output and the empirical data. This strongly suggests that farmers are more sensitive to subjective norms than to actual soil losses. We judge this to be an important finding.

3.2 Model Simulation Results on the Sedimentation Rate

In the data poor environment of Lake Naivasha Kenya, we have been able to compare our model simulation results to independent sample data on lake sedimentation (Stoof-Leichensring et al. 2011). In the CONSERVAT model, lake sedimentation results from soil loss due to land use practices in the upstream area. This emergent variable (sedimentation, or rather “reduced sedimentation”) results from numerous individual soil conservation efforts.

If we define a situation of all agents adopting a SCE level 2 over the entire period of 1965–2010 as the maximum achievable reduction, then the actual adoption rate [60], as reproduced by our model, leads to 59% of this maximum achievable reduction. In Fig. 6, the maximum reduction is the solid line (Reference-Max-SCE). Our model simulation results for lake sedimentation (for both homogeneous and heterogeneous initializations) closely resemble the actual lake sedimentation rate [52], thus suggesting that the application of soil conservation practices has substantially reduced lake sedimentation. Importantly, as a second result, comparing our model simulation results with the empirical sedimentation pattern [52] we show that our results catch the upward trend and change in observed lake sedimentation.

Fig. 6
figure 6

Sedimentation rate of Lake Naivasha for the period 1965–2010. Our model results (for both homogeneous and heterogeneous initializations) resemble the sedimentation pattern that was found by a core sample inside Lake Naivasha [52] to a large extent

3.3 Sensitivity Analysis

The results of the local sensitivity analysis [10] are presented in Table 2. Varying the random seed has only very limited influence on simulation results for both SCE levels and sedimentation in Lake Naivasha at the end of the simulation period (2010). Both output variables are quite sensitive to variations in the calibration parameter Prob_adoption, whereas variations in RKLS_calib (the other calibration parameter) only affect sedimentation values.

Table 2 Results of the local sensitivity analyses. All parameters relate to homogeneous initialization, except for the last line

In the CONSERVAT model decision-making is dominated by imitation, repetition or inquiring, depending on the settings chosen for LNSmin (level of need satisfaction) and Uncmax (individual uncertainty). For the selected settings (LNSmin = 0.05 and Uncmax = 0.05) imitation appears to be the dominant strategy. Substantially higher values for Uncmax result in lower aggregate levels of SCE and consequent higher values of sedimentation. This is because the dominant agent strategy becomes repetition. For substantially higher levels of LNSmin the dominant agent strategy becomes inquiring which also leads to somewhat lower aggregate levels of SCE and consequent higher values of sedimentation.

The effect of initialization (homogeneous or heterogeneous) is also considerable as already shown (Figs. 46).

4 Discussion

The approach that was used for parameterization of the CONSERVAT model has been described using the framework of Smajgl et al. [49] including Methods M1, M2, M3, M4, and M5 as referred to in Fig. 3 and Table 1. The five methods applied are discussed below. For the identification of agent classes (M1), a very simple distinction was made between survey respondents [60] that indicated to participate in collective action activities and those that did not. Using this information, it was decided to allot the former with a maximum of six social links and the others only three. This rather pragmatic procedure is admittedly quite arbitrary. Further exploration of the number of social links could yield insights in the propagation of the actual diffusion. In this respect, it would be quite interesting to explore the influence of alternative designs of social networks such as opinion-leader, village, or hierarchical networks [48].

In the CONSERVAT model, agents and their social network do not change during the period 1965–2010. However, in reality agent-specific attributes and behavior do change and also the social networks change. In order to maintain an acceptable level of simplicity and transparency of the model, the model was kept as simple as possible while still allowing to show how subjective norms affect diffusion of soil conservation efforts in the study area. This simplification of reality implies that an agent in the CONSERVAT model represents a number of actual farmers in a certain spatial unit.

A fundamental problem in social sciences and ABM is how to derive observations of a social system over time [19]. Fortunately, the survey by Willy and Holm-Müller [60] allowed us to reconstruct the diffusion of SCE levels for the Lake Naivasha basin for the period 1960–2010. However, respondent data on changes (over time) in sensitivity to subjective norms (SUBNORM), participation in collective activities (CAPART) and perceptions that soil erosion is a problem (PECEROYES) were not available for this study. For the specification of the value for the agent attribute for “the perception that erosion is a problem” (M2), some methodical considerations deserve attention. As we have assumed in our model it would be quite well conceivable that when erosion occurs it will increasingly be perceived as a problem. However, from the survey parameter PERCEROYES we cannot derive changes in perceptions by survey respondents over time. Moreover, it is questionable whether the local erosion experienced (compared to the previous 3 years) is the only important factor for farm-household preference in how much erosion has occurred. Farm-household preference could also be influenced by other factors such as repetitive occurrence of erosion events (consecutive wet years can trigger somebody to act eventually), the introduction of mechanical tools, access to funding, support by conservation organizations, and the introduction of market-based conservation practices in the basin.

Respondents from the Malewa sub-basin are over-represented in the farm-household survey. This skewed representation of sub-set respondents relative to the entire survey affects model results as seen from the difference between the outcomes (Table 1). The influence of different spatial scales for initialization (homogeneous and heterogeneous initializations; methods M4, M5) has been explored in this study. As seen in Table 2 it makes a lot of difference when other spatial units are used to aggregate survey data for model initialization. However, aiming at water management, soil conservation and the reduction of sedimentation, partitioning of a social survey into three sub-basins seems the logical choice. But, other spatial units could have been used as well.

Despite the limitations of the approach used, the results of this study suggest that using household survey data for parameterizing a basin-level ABM is an important element to practitioners in land and water management. Identifying spatial differences allows decision-makers to evaluate where interventions could be most effective. The added value of using an ABM in addition to the already informative survey data is that the innovation diffusion process (affected by subjective norms) is included in the modeling process. In order to increase added value of using ABM for land and water management, surveys should be carefully designed, with this purpose of the ABM in mind.

Two parameters for calibration were used: prob_adoption and RKLS_calib. The former parameter (prob_adoption) affects the speed of the diffusion process. This parameter does not affect the decision model of individual agents. It merely affects the number of agents that engage in decision making for a particular time step. Other factors that affect the speed of the diffusion process are the time-step used for decision-making (1 year) and the number of links to the peers. With regard to the other calibration parameter (RKLS_calib) erosion other than farmland areas (roads, wind) may also be very relevant, however uncertain. Moreover, the position of soil loss has not been explicitly considered when translated into sedimentation of Lake Naivasha. This means that erosion and sedimentation inside streams and rivers due to runoff variations (for example: flash floods) have not been considered.

Using a reconstructed time-series is not ideal. It is based on respondents that look back on their own agricultural history (sometimes all the way back to 1960). The accuracy of individual memories is debatable. Also, farmers that stopped farming before 2011 could have never been part of the survey. The people now looking back to 1960s were very young back then. It is arguably not representative for the farmer population around the 1960s. For more robust data, an additional survey would be needed and considerable additional effort would be necessary. This is however beyond the scope of the current study.

Also with regard to the observations for sedimentation in Lake Naivasha some critical remarks are justified. Whether the sediment core used [52] is representative for the sedimentation in the entire lake is questionable. Using only one particular position in the lake could easily lead to an under- or overestimation. Also, the process of sedimentation distribution over the lake bottom is likely to be influenced by lake level variations (hydraulic effects at mouth of the river) which have not been accounted for.

Although a sensitivity analysis has been performed for this study (Table 2) there are several aspects that could be further explored. Adequate validation of the CONSERVAT approach could be done by applying quantitative methods and systematic sensitivity analysis (Lorscheid et al. 2011). In particular, alternative settings of Uncmax and LNSmin and their influence on strategic behavior (repetition, imitation, inquiring, and optimizing) of agents could be further explored. Similarly, initializing and running our model using three random-seed settings only, does not guarantee reliable model results. For this study a visual comparison (qualitative validation) served the purpose of demonstrating the usefulness of using farm-household survey data for parameterization and qualitative validation of agent-based models. However, follow-up studies are advised to put considerable effort in conducting thorough and systematic validation.

5 Conclusion

This study shows that household survey data on subjective norms and collective action can effectively be used to parameterize a geographically-explicit agent-based model for natural resource management. Particularly in data-poor environments, the use of social survey data for model-parameterization could advance the development of tools for evaluating the basin-level effects of management alternatives for soil conservation in both the natural and social domains.

The CONSERVAT approach that has been introduced in this paper allows for including both natural influence and social influences (i.e., subjective norms) on the diffusion of soil conservation efforts. It has been demonstrated that individual decisions on adopting soil conservation technologies are affected by subjective norms and that it is possible to reproduce this adoption process. Our demonstration also shows that different approaches for model parameterization result in relevant differences in model outcomes. This study is innovative in the sense that we incorporate empirical data on subjective norms in a geographically-explicit ABM that integrates the physical and social system for an environmental issue. This is an important addition to recent studies incorporating social networks [29, 54]. The model is developed as a first step to effectively use ABM for supporting decision-making on natural resource management (soil conservation).