The following scenarios address the primary goal of the paper that demonstrates the model’s capabilities. For scenarios, a 26-node cluster is utilized in the outputs discussed, with Repast Simphony 2.1 (2014) used for runs and R (2014) and Java Apache Commons Mathematics Library (2014) applied for statistical analysis.
Scenario 1: Size Hierarchy Matching
The first modeling case investigates how well the ABM method can match known settlement size hierarchies from the MBA and IA, providing a general validation of the model. Ranges of input values are given in Table 2; these inputs are utilized in a parameter sweep (see North and Macal 2007) that represent qualitatively greater and lesser influences on the model, allowing one to evaluate the importance of given variables. Population (u) for all sites is initially set to 200; this means there are a total of 8,600 and 15,600 households for the MBA and IA subcenarios, respectively. No assumption is being made about the actual population or household sizes in the past, as the value 200 simply reflects an internal way for the model to measure relatively which sites become larger than others once people begin to migrate in the simulations. Other population values, in fact, could have been chosen, with 200 being useful to calculate population size ratios for all settlements used in outputs. What this means is that the population is used as a proxy rather than an absolute number that is then compared with settlement size (ha) as estimated from survey. In other words, the portion of the total population on a site can be directly compared with the portion of hectares out of the survey total, making the simulation and survey results comparable. All simulations are executed for 100 time ticks and up to 10 parameter runs for parameter settings, which allow results to stabilize and utilize different random seeds to account for stochasticity. Results are averaged with nearly 300,000 parameter combinations used in the subscenarios. To measure how well simulated results match empirical data, regression analysis is applied for each parameter combination.
Table 2 Parameters and their value ranges tested in scenario 1
Figure 4 shows regression (r
2) results, ranging from 0 to 1.0 (shaded areas), using an ordinary least squares regression on the ratios of simulated site populations, used as a proxy for simulated size, and surveyed site sizes for the MBA (scenario 1a) and IA (scenario 1b) cases, indicating how well simulated results fit survey results. In essence, Fig. 4 shows which variable settings lead to the simulated population data to match more closely to empirical site sizes, with darker colors indicating greater fit between the simulated and empirical data. Results that show r
2 > 0.98, that is a relatively high goodness-of-fit between empirical and simulated results for the MBA and IA, are shown in Figs. 5 and 6, respectively. These figures provide the frequency of simulations that have these high fit values, rather than just simply if a setting has a close fit with the survey as shown in Fig. 4. This gives an idea which parameter settings and combinations generally have more close-fitting results. The r
2 > 0.98 range is found to indicate both a very close visual (i.e., qualitative) and statistical fit, which is why it is used. Each frequency count in Figs. 5 and 6 represents an averaged parameter variation result in which r
2 > 0.98; there are 702 parameter variations that fall within this threshold in the MBA, while there are only 42 in the IA case. For scenario 1a, the best-fit parameter setting has r
2 = 0.998, where α = 0.2, β = 6.6, c = 0.8, m = 0.5, and b = 7. For the IA, the best-fit results (r
2 = 0.994) are α = 1.7, β = 4.2, c = 0.26, m = 0.34, and b = 1. Figure 7 shows two examples, one MBA and the other IA, where results had a strong fit with empirical data.
Where there is a close correspondence between simulated and empirical MBA data, Fig. 5 shows that α ranges between 0 and 5 closely fit the survey data’s settlement size hierarchy, while the results for β are largely between 4 and 10.2. This indicates a relatively moderate to a low emphasis on α and greater impediment to movement (i.e., higher β) leads to the settlement size hierarchy observed. As for c, the range is mostly between 0.4 and 0.8, with some results having a close fit near 0.3 and 1.0. Movement probability (m), on the other hand, is almost always near 0.5, showing that a very narrow m range allows close-fit results, while well-fitting b, or benefits an agent brings to a site, values mostly cluster around 4–8. These results can be interpreted to mean there is relatively moderate to high cost in flow, relative to return on benefits provided by individuals, while very high or low benefits by agents do not often lead to well-fitting results. The benefit factor begins to become relevant when other agents from the same social group, in this case from the same initial site, are found in other sites. Variable m shows much greater restriction, around 0.5, and there is a reasonable chance an agent could move if benefits from their current settlement are negative or lower than other sites around them. The movement value is not to the extent where people immediately leave their site, but it shows that movement should occur frequently, even if interactions are mostly across short distances (i.e., moderate to high β values).
For the IA, cases that meet the r
2 > 0.98 values have α ranging between 0 and 2 and β mostly lower than 1 but also ranging between 3 and 6 to a lesser extent. There is less return on settlement attractiveness than the MBA case, showing less importance on settlement advantages in reinforcing site size, but far less restriction to movement, allowing flow to be more dispersed. For m, values range between 0 and 0.1 and 0.3 and 0.5. In essence, very low probability of movement or a moderate probability lead to observed results. This indicates two possible movement range frequencies, rather than just one as in the MBA case, are possible for the IA, where very few people move or more frequent movement is found. This will be further discussed in scenario 2. The other variables appear to be more random or have less of a clear pattern; c ranges between 0 and 0.6 and 0.7 and 1.0 seem to lead to the observed simulated results. For b, most of the close-fit results range between 0 and 9, with some between 10 and 11. In essence, α, β, and m appear to have narrower ranges in leading to a close fit between simulated and empirical results for the IA case, while the values of the other variables have very wide ranges.
Figure 8 shows settlement sizes, for two example results that are typical for well-fit results in this scenario, using standard deviation on simulated population to indicate where larger sites are located. The figure also applies Nystuen and Dacey (1961) graphs, as similarly used in Davies et al. (2014), of settlement connections based on movement of people to sties. While Fig. 8a shows site 1, Tell al-Hawa, is not the largest site in simulations, as observed in the MBA of the NJS survey, Fig. 8b does show the IA scenario does sometimes lead to site 1 being the largest simulated site, matching the NJS survey. In Fig. 8a, what is evident is that the MBA case has a large portion of sites with multiple links, showing a high portion of local interactions or movements between neighboring sites, with only two sites not having multiple links, where movement of people is σ > 0. In the IA case, the portion of σ > 0 links for sites is fewer (63/78 sites). Overall, a greater number of links in the MBA case indicates more overall movement, although much of it is concentrating toward neighboring sites that then connects to larger sites. In the IA, movement is more diffuse and there are fewer hubs attracting a large number of movements. Furthermore, for all in-degree links, the highest number is 24 in the MBA case, while it is 14 in the IA, showing the higher level of local interaction and migration in the MBA case. There are also 12/43 sites with 10 or more in-degree links, while it is 6/72 in the IA.
As for degree centrality, based on total number of movements going to or through a site, site 37 is the most central in the MBA case, while it is site 144 in the IA. If the average and standard deviation for number of movements in links is observed, the results are 742 and 1,213 for the MBA respectively and 877 and 1,682 for the IA, respectively. While this represents the fact there are more people in the IA case, it is evident that there is also more variability in IA movements. On the other hand, the results indicate greater movement of people through different sites in the MBA case (6,730 movements on average versus 4,485 in the IA), as people made their way to the larger sites. For the MBA, people do not simply move to neighboring sites, but movements continue until people reach the larger sites such as site 127, which are more attractive than others, leading to population concentration at attractive sites and greater differences in population between sites. This shows that people did not immediately find the most attractive site, rather the limitations on movement, as represented by β, dampened long-distance interactions. In the IA, diffuse movements and lack of attractive sites create more of an even population in the IA scenario, with a higher portion of sites having 0–2σ for population. The IA example applies a m of 0.36, while Fig. 6 demonstrates it is possible to get well-fit results with a much lower m (i.e., less than 0.1). This will be discussed further in scenario 2. Overall, this scenario has demonstrated that the simulation model does create overall site hierarchies that match different periods’ survey results.
Scenario 2: Size and Rank Matching
Although the first scenario indicates simulations do closely match site size hierarchy between empirically surveyed and simulated sites, matching not only the hierarchy but the ordinal rank in size of observed and simulated sites proves to be more difficult. In other words, the model in scenario 1 shows that model output often does not have a close match between the ordinal rank-size for specific sites. In fact, when the same regression in scenario 1 is applied so that site rank and size are compared with the matching simulated output, the best results are r
2 values of 0.87 and 0.5 for the MBA and IA, respectively. Such results are of no surprise since geography is what mostly gives sites initial advantages over other sites in scenario 1. This indicates a need to apply additional factors that allow some sites to have initial advantages to enable them to reach greater size than other sites, while allowing a closer correspondence of simulated rank and size for each site. To enable sites to have advantages relative to other sites, t, which controls this aspect, is utilized. This variable also has the benefit of accounting for edge effects, as areas outside of the simulation could be providing benefits or disadvantages that sites receive and affecting site interactions.
What is likely evident in the periods studied is that sites did have advantages or benefits that allowed them to become more populated than other sites. A method comparable to Davies et al. (2014) is employed by looking at categories, or ranges, of site’s empirical size estimates in order to create values for t. In this case, rather than predetermining the number of categories of size used for t, variations of t are simulated by testing this parameter to see what the minimal number of t value categories, or differences, are needed so that more than half of the largest ten sites (Table 1) are forecasted by the simulation. The purpose of this approach is that it would demonstrate the model’s capability in forecasting larger sites without overly fitting the model (i.e., many different t value categories) and indicates that the model has a far better chance at determining likely larger sites than random chance. For the MBA, four t (i.e., endogenous/exogenous benefits given to a site) categories are found to be needed in order to correctly forecast more than half the ten largest sites. In this case, seven of the ten largest sites are forecasted when t values are 3, 2, 1, and 0.5 for sites that are >10, 10–5, 5–1, and 1> ha respectively in empirical survey size (Fig. 9a). The result of this in a Spearman’s rank correlation coefficient is 0.61, while a Pearson correlation coefficient test between the simulated and observed site sizes produced 0.94 for the MBA. Both these statistical measures are used because high coefficient values in both tests demonstrate the best rank, which Spearman’s test captures, and size fit, which Pearson’s correlation coefficient indicates. Overall, the results demonstrate that the t categories do produce rank-size values that match reasonably the survey record. The best matching parameters (Fig. 9) for the four t value categories in the MBA are α = 0.4, β = 9.7, c = 0.4, m = 0.5, and b = 6.5.
Additionally, looking at the average distance between the observed rank categories, that is the sizes used from the NJS survey to create the t values, and simulated rank values, which is what the simulation produces in the rank category of a site, the result is about 1.31 km (Fig. 9b). In this case, this value is called the distance rank error. Therefore, even in cases where the rank of simulated sites did not closely match the observed results, the distance rank error indicates the simulated site is not far from the correct size category. The interaction links for sites, in relation to connectivity of sites, show very similar results and structure to Fig. 8a, with site 14 being the most central based on total number of movements, as the population migrates to the large sites (e.g., site 1). However, the overall distribution of movements per link is nearly identical to Fig. 8a.
For the IA case (Fig. 10a), five categories for t are needed to enable a greater than 50 % matching of the ten largest sites. If there are four t value categories, 50 % accuracy for forecasting the largest ten sites is achieved, but not greater. Sites ranging in empirical survey sizes of >10, 10–4, 4–2, 2–1, 1> ha with simulated t values of 5, 4, 3, 2, and 1, respectively are used here. The best Pearson correlation result, where more than half of the ten largest sites are forecasted, is about 0.84, while the Spearman result is 0.79. This indicates, while the overall correlation is not as good as the MBA case, as densely located sites create more nearby areas where migration maybe drawn to, the Spearman result indicates this case does a better job in reproducing the ranks in the empirical results. In this case, seven of the ten largest sites are forecasted, where α = 1.5, β = 1.8, c = 0.2, m = 0.001, and b = 2.5. The distance rank error for the IA case is 1.04 km on average (Fig. 10b). Unlike the MBA case and Fig. 8b in scenario 1, the results show a very different interaction link structure, with site 1 having the most interactions by a wide margin, and sites 138 and 48 at a distant second and third respectively in migrations. Site 1 has 77 links, indicating every site interacted with it. What the results suggest is that while m is very low, because β is relatively low (i.e., it is relatively easy to move) people from throughout the survey area migrate to site 1 directly, rather than through intermediate sites, because of the site’s advantages. Such a structure is similar to what is shown in Fig. 6 (movement probability graph) in scenario 1, which shows that very low m probabilities could lead to settlement structures observed for the IA. Mostly, however, β values are lower than what is evident in the MBA case. In essence, Fig. 10b highlights that a second model, one where there is low m, can lead to structures observed in the IA, in addition to what is shown in Fig. 8b. This case indicates that when movement does happen it is focused on a site with advantages with distance not being a major factor.
Scenario 3: Survey Sampling and Robustness
At any given time, only a subset of the surveyed sites may have existed within the periods studied, as survey results may not be able to clearly identify subperiods within the MBA and IA. To ameliorate a situation where sites may have not been contemporary, and to assess the robustness of the results achieved earlier through random sampling, a repeated sampling approach is applied where only a portion of sites is executed in a given simulation run (i.e., a bootstrapping method). This portion of sites is sampled using a range of probabilities, where a given site will not be in a simulation run, that are 1/5, 1/3, 2/5, and 1/2, with each of these variations run for 500 different simulation runs for the MBA and IA cases using the parameter settings from the results in scenario 2. The results are then averaged for all sites so that an overall rank-size hierarchy is achieved, even though not all sites are simulated and the combination of sites differs in each simulation run. This approach allows us to see how sensitive results are when sites are removed from simulations and to see if the overall patterns observed in the last scenario are relatively meaningful and reproducible by seeing if similar patterns are achieved in this scenario.
The results for these probability scenarios, for both cases, are given in Table 3; as before, both Spearman’s and Pearson’s correlation coefficients are given, as this provides stronger rank and size correlations. For the MBA, the 1/5 and 1/3 probabilities show a relatively strong Pearson’s r value, while the 1/2 probability indicates a large decrease in this value. Nevertheless, the Spearman’s correlation coefficient value is relatively consistent, indicating that the rank order stays relatively stable between scenarios. In all cases, more than five of the ten largest sites are forecasted; in fact, the weakest Pearson’s r correlation did very well in forecasting the largest sites, even if the site size hierarchy results are weaker than other cases. Overall, the results show that the rank and size hierarchy of sites is maintained fairly well and relatively comparable to the empirical data until the simulation has more than 40 % of the sites missing at any given time. The Spearman’s rank correlation coefficient and number of top 10 sites forecasted gives some confidence that the results achieved in scenario 2 are meaningful even if part of the dataset is used. Figure 11 indicates MBA output, which is the 1/3 probability case, which has the best correlation coefficients for scenario 3. Results here show that sites 1, 43, 93, and 127 are forecasted to be in the top 10 largest in both scenarios 2 and 3. One possible interpretation is that the results suggest most of these sites would have been long-lived and contemporary, as the overall rank and size hierarchy are more closely maintained if many or all sites are present in a given scenario. Results in Fig. 11 indicate interactions that are somewhat similar to what is observed in Fig. 9; however, the main difference is there are more varied links with greater than 0σ movements, which represents the variability of movements from case to case due to some sites being removed or added based on the probability. For the overall average, the most central node is site 30, followed closely by sites 19 and 18, respectively. While these results are different from what is seen in scenario 2, structurally they are similar as sites near site 1 play an important conduit role in moving people closer to the high population sites. Movements are also seen to be mostly between nearby sites, with movements averaging 4.75 km distance.
Table 3 Results from scenario 3 testing for sampling and robustness of the modeled survey region’s cases
For the IA (Table 3), the Pearson’s r value is 0.88 when 1/2 of the sites are not simulated in a given run, with improving Pearson’s r values greater than 1/5 probability for sites not being in simulation runs. In addition, the Spearman’s rank correlation coefficient value improves for probability values between 1/5 and 2/5 of sites not simulated in runs. However, in the two cases, it is evident that forecasting the top 10 sites is not always greater than five. Figure 12, which has 1/2 probability, indicates the scenario with the best correlation coefficients and most forecasted top 10 sites. This output is a reflection of the greater variability found between runs in the scenario from case to case. Despite the fact that Fig. 12 appears to show more noisy interactions, for both scenarios 2 and 3 in the IA, sites 1, 2, 10, 48, 111, 130, and 138 are forecasted to be among the largest ten sites. This case shows many interactions where movement is greater than 0σ for links, which is once again a reflection of the variability found in given runs. However, looking at the overall average, and very similar to what was seen in Fig. 10 in scenario 2, site 1 is the most central as people are able to travel relatively farther distances to an attractive site. Site 138 is the second most central, as it is in scenario 2, where it forms a smaller regional center to the southwest of site 1. As with scenario 2’s IA case, many movements are long-distance and not just between sites next to or very close to each other. Interactions, or movements of people between sites, are on average covering 9.36 km in the IA in Fig. 11, indicating much more distant interactions than the MBA case. Although the IA case seems to forecast fewer of the top 10 largest sites, Pearson’s r and Spearman’s ρ values suggest there is a good degree of confidence in the results achieved in scenario 2. In fact, the results could suggest that many of these sites were not contemporary and existed for shorter periods within the IA, as the Pearson’s r value improves in cases where the probability of a site not being in a simulation increases, while Spearman's ρ is best when 2/5 of the sites are removed. Admittedly this is speculative; however, the results do suggest that the rank-size hierarchy demonstrated in scenario 2 appears to be a meaningful pattern as comparable or even better results are achieved via subsampling.