Introduction

The extraordinary capacity of human societies to domesticate landscapes has promoted global alterations in natural ecological processes, ecosystems and species distributions (Boivin et al. 2016). Landscape domestication is defined as a continuum of land transformations by humans extending from semi-natural landscapes to cultivated lands and densely settled areas (Clement 1999; Ellis 2015; Clement and Cassino 2018) in which human manipulation of species populations and soil properties resulted in more secure and productive areas (Clement 1999; Erickson 2008). Evidence of landscape domestication has been found in extensive areas that - to the untrained eye - may seem natural (van Gemerden et al. 2003; Dambrine et al. 2007; Ross 2011; Levis et al. 2017). For instance, previous studies have shown that species richness and soil nutrients increase near ancient Roman settlements abandoned for millennia (Dambrine et al. 2007). Similarly, soil phosphorus and calcium are significantly higher in old habitation sites of the First Nations along the British Columbian Coast (Trant et al. 2016). In Mesoamerican forests, a higher abundance of plant species used by Maya people for daily needs persists in densely-settled forest areas even after centuries of human abandonment (Ross 2011). In southern Amazonia, a mosaic of domesticated landscapes was detected in an area of approximately 50,000 km2 in the Upper Xingu River basin (Heckenberger et al. 2008) and was recently estimated to occur in an area of approximately 400,000 km2 across the entire southern region (de Souza et al. 2018).

Since domestication involves the manipulation of species populations and ecosystem functioning for human welfare (Clement 1999), this process depends on both the energy invested in ecosystem transformation and cumulative cultural and material inheritances of a given society (Ellis 2015). Thus, it is not surprising that wide-ranging evidence of this process has been found across Amazonia (Clement et al. 2015). The most common signatures left by pre-Columbian peoples in densely settled areas are patches of Amazonian Dark Earth (ADE or Terra Preta de Índio - TPI) (Heckenberger and Neves 2009). These anthropogenic soils are rich in nutrients, ceramics, plant and animal remains, and charcoal. They were created by the accumulation and burning of domestic waste in habitation zones (Neves et al. 2003; Woods et al. 2013). Thus, ADE sites are considered pre-Columbian refuse heaps with high fertility. Anthropogenic dark-brown to brown soils are slightly richer in nutrients compared with adjacent soils, but they are generally less fertile than ADE. These brown soils often contain many pieces of charcoal, but few or no ceramics and occur in the surroundings of ADE, which suggests that they probably originated from past cultivation practices, such as mulching and burning of infield waste (Woods et al. 2013).

Charcoal records have also been used to identify ancient cultivated landscapes across Amazonia (Mayle and Power 2008; Mayle and Iriarte 2014), as fire was the main tool used by indigenous people to transform forest landscapes around their villages (Erickson 2008). Although fire regimes are often linked with past human disturbance (Mayle and Power 2008), caution is needed given that the length and severity of dry seasons was most likely associated with changes in precipitation regimes during the Holocene as well as with human activities (Maezumi et al. 2018b).

Signs of landscape domestication can also be detected by assessing the distribution and abundance of useful, managed and/or domesticated species (Clement and Cassino 2018), as observed in Central Amazonia (Levis et al. 2012; Ferreira et al. 2019), Western Amazonia (Franco-Moraes et al. 2019) and across the Amazon basin (Levis et al. 2017). At least 85 arboreal species, mostly used for food, have populations domesticated to some degree by pre-Columbian peoples in Amazonia (Levis et al. 2017), and they are concentrated in and around settlements due to numerous historical management practices (Levis et al. 2018). Thus, landscapes around human settlements can be analysed as a durable record of the activities practiced by human societies (Balée 2006).

The persistence of domestication processes beyond densely settled areas is, however, still controversial for several reasons. First, ecological and archaeological studies have mostly investigated human impacts only along the margins of major Amazonian rivers in areas expected to have been heavily modified (McMichael et al. 2017b). Second, scholars are using different methods, proxies and datasets in different localities, but each method detects specific legacies of human activities and their results may seem contradictory (Piperno et al. 2015, Stahl 2015). Third, the effects of pre-Columbian activities on forests could have disappeared after their abandonment due to natural processes (Bush et al. 2015; McMichael et al. 2017a), since many old settlements were abandoned and hidden by forest re-growth for centuries (Denevan 1992). Fourth, and equally important, modern indigenous societies often decide to settle in archaeological sites or places they had occupied before (Rival 2007; Politis 2009), making the distinction between pre-Columbian and post-conquest management practices particularly challenging (McMichael et al. 2017a, but also see Junqueira et al. 2017).

One question that remains open is whether pre-Columbian landscape domestication persisted in old-growth forests over time or whether legacies of past domestication processes are a result of recent management practices. Here we compared pre-Columbian vs. recent human influences on forest landscapes surrounding past and current villages. To disentangle their effects, we first mapped all pre-Columbian villages (ADE sites) and current villages known by local people in the study areas, and then we randomly sampled 27 forest plots across distance gradients from these villages. Because past and current Amazonian people often use forests for hunting and gathering more intensively near their villages, human intervention in the landscape is expected to decrease with increasing distance from villages (Heckenberger et al. 2008). By sampling forests at different distances to pre-Columbian and current villages, we assessed their relative influences in forest soils and plant resources. Because anthropogenic soils (ADE and dark brown to brown soils) were created extensively in pre-Columbian times, we expected to find higher levels of soil fertility in old-growth forests closer to pre-Columbian villages than to current villages. However, since ecological knowledge is continuously transmitted between human generations inside and outside their territories over time (Balée 2000; Cámara-Leret et al. 2019), we expected a cumulative effect of past and current management on the availability of useful and domesticated plants for local people within these forests.

Materials and methods

Ethical procedures and authorizations

Following Brazilian law and local rules, all authorizations to conduct the study were obtained before field work. The study was approved by the Brazilian Ethics Committee for Research with Human Beings (Process n° 10,926,212.6.3001.5020 and n° 1.396.762/2016), the Regional Coordinator of the Brazilian National Indigenous Foundation (FUNAI – CR Madeira, n° 001/APIJ/2016), the Brazilian System of Protected Areas (SISBIO, process n° 47,373–1, 45,094–1 and 53,041–2), and National Institute of Historic and Artistic Heritage (01494.000171/2011–78). In each village, the local or indigenous leadership approved the study and all informants signed free informed consent terms when they participated in our activities.

Study sites and their long-term human history

To expand the comparison between pre-Columbian and modern societies, we studied the landscape around six villages inside three distinct protected areas - one Indigenous Land and two National Forests inhabited by traditional societies - located in the upper-middle Madeira River (Jiahui Indigenous Land and Humaitá National Forest - FLONA Humaitá) and the lower Tapajós River (Tapajós National Forest - FLONA Tapajós; Fig. 1). The villages were chosen because their residents agreed to participate in our study and they knew about the location of ADE sites around the villages. Environmental conditions vary between these two river basins. A minimum of three dry months occur along the middle-upper Madeira and five dry months along the lower Tapajós (Kummerow et al. 1998). Altitude also varies between forest plots in the Madeira River basin (80–110 m) and in the Tapajós River basin (150–200 m) (Fig. 1a).

Fig. 1
figure 1

a) The location of the protected areas studied in the Brazilian Amazon, b) the Tapajós National Forest in the lower Tapajós River basin, and c) the Humaitá National Forest and Jiahui Indigenous Land in the upper-middle Madeira River basin. Pre-Columbian villages were sedentary settlements with anthropogenic soils (ADE sites). The approximate location of some ADE sites were obtained with participatory mapping and complemented with the information collected during guided tours. With guided tours we validated the location of most ADE sites. Some sites near the Tapajós were mapped by Schaan et al. (2015). Altitude varies among forest plots in the Madeira River basin (80–110 m) and in the Tapajós River basin (150–200 m), and this variation was detected using SRTM (Shuttle Radar Topography Mission) images available at http://www.dsr.inpe.br/topodata/

Four of six villages - two in the FLONA Tapajós and two in the FLONA Humaitá - are inhabited by traditional societies, and two villages are in the Jiahui Indigenous Land. Traditional societies have existed for at least 100 years; most of them are descendants of migrants who intermarried with local indigenous peoples, but they are not members of an indigenous group. Inhabitants of Jiahui Indigenous Land are members of the Jiahui indigenous group that speak a Tupi-Guarani language (Peggion 2006). Traditional societies and the Jiahui indigenous group regularly practice farming, fishing, hunting, timber and non-timber forest product extraction, and, in the case of the FLONA Tapajós, the villages are also involved in community-based tourism. In this study, we defined current villages as the areas currently inhabited (most were established approximately 120 years ago) and pre-Columbian villages as the sites with ADE, which are likely older than 350 years (de Souza et al. 2019).

Although all archaeological sites mapped in both regions contain ADE sites with ceramics, which indicate sedentary pre-Columbian occupation (Neves et al. 2003), the ancient histories of the regions differ. Archaeological sites of the lower Tapajós River basin are mainly associated with the Tapajó or Santarém pottery, which is affiliated with the Incised and Punctate tradition (Stenborg 2016; Figueiredo 2019), while many sites in the middle-upper Madeira River basin are multi-componential, with the most recent layer containing the Polychrome pottery tradition (Moraes and Neves 2012; Barreto et al. 2016). Before this study, 148 archaeological sites had been recorded in the lower Tapajós River basin, in an area of 10,000 km2, including 13 sites inside the FLONA Tapajós (e.g., Schaan et al. 2015; Stenborg, 2016; Stenborg et al. 2018; Figueiredo 2019). Almost 70% of these archaeological sites are located on the interfluve (away from major Amazonian rivers), 6% are located on hillsides, and only 24% are located along the Tapajós River, secondary rivers and lakes (Stenborg et al. 2018). Despite the abundance of archaeological sites in interfluvial areas, riverside settings have a longer occupation history, likely starting about 4300 cal BP (calibrated years Before Present) (Maezumi et al. 2018a). Pre-Columbian activities intensified around 1250 cal BP and remained until European conquest and subsequent colonization (500 cal BP, Maezumi et al. 2018a). In contrast to the lower Tapajós River region, archaeological sites of the upper-middle Madeira River near Humaitá were poorly described and few sites were mapped before this study. However, the occupation of the upper Madeira River basin is much older than of the lower Tapajós, starting around 12,000 BP (Miller 1992) and intensified around 1000 BP when the Polychrome pottery tradition expanded along the Madeira (Moraes and Neves 2012). The expansion of this tradition has been likely associated with the expansion of Tupi speaking groups (Barreto et al. 2016).

Management practices probably intensified when human populations expanded around a thousand years ago in both river basins (Levis et al. 2018). After European conquest, indigenous populations collapsed and a new wave of human expansion occurred during the rubber boom (in the late nineteenth century), when both the Tapajós and Madeira basins were occupied by their current inhabitants. Management activities that occurred during this period may also have influenced forest structure and composition, since local people mentioned that at that time they used forests more extensively than they do today. In the last century, people have favoured the development of rubber agroforests by actively planting rubber trees (Hevea brasiliensis) in their manioc fields and managing them in fallows that became agroforests (Schroth et al. 2003).

Ethnoecological and archaeological methods

In each village we conducted the following activities: (1) free listing interviews with 33 key informants in the FLONA Tapajós, 24 in the FLONA Humaitá and 12 in the Jiahui Indigenous Land about which trees and palms are useful for them, their types of uses, and the forest management activities related to these plants; (2) participatory mapping and guided tours along 80 km of trails in the Tapajós with the seven most experienced informants and approximately 115 km of trails in the Madeira with the 15 most experienced informants to describe the extension of their activities in the forest and the location of currently unoccupied ADE sites, and to identify the plants they currently manage in the forest; and (3) 27 forest inventory plots randomly placed at different distances from current and pre-Columbian villages. We used snowball sampling techniques to find informants who know and use forest species (Albuquerque et al. 2014), and collected the GPS points of all ADE sites and of useful plants cited in the interviews. In total we interviewed 24 men and 9 women in the Flona Tapajós, whose ages varied from 25 to 83 years; 20 men and 4 women in the Flona Humaitá, whose ages varied from 19 to 68 years; and 10 men and 2 women in the Jiahui Indigenous Land, whose ages varied from 18 to 85 years. For more details about these ethnoecological methods see Cassino et al. (2019).

Based on local knowledge and previous study (Levis et al. 2017), we created four categories of useful plants. Useful plants were defined as species that have been used for any reason in the past or are currently used by local people. First, we classified all useful plants into two categories: 1) useful plants that are not managed; and 2) useful plants that are managed today (see Levis et al. 2018 for more details about the management categories). Within the currently managed plants, we categorized a third group of plants that are more intensively managed, because people occasionally plant them in cultivated landscapes, here called cultivated plants. Because plant domestication is long-term process that is difficult to detect using only local interviews, we used the literature to determine which species have populations with some degree of domestication reported somewhere in Amazonia (Levis et al. 2017) and created a fourth category of useful plants, namely domesticated plants. As we wanted to compare pre-Columbian and post-Columbian impacts on forest resources, our analyses focused on useful and domesticated plants.

Inside the FLONA Tapajós, the extension of the pre-Columbian villages (ADE) in most of the sites was measured with the assistance of a member of the community and handheld GPS. In addition, in two ADE sites, shovel test pits supervised by an archaeologist (coauthor CGF) were excavated in two transects to delimit their extension. Handheld GPS and Google Earth images were also used to map the area occupied by current houses and homegardens in the villages we worked with (Maguarí and Jamaraquá). In the Madeira basin, we could not measure the size of the ADE sites because of logistical limitations.

We also calculated the density of pre-Columbian and current villages that occur in the study area in both basins and the distance from these villages to major rivers, perennial rivers and streams. We used all rivers of the HydroSHEDS dataset to define streams and “upcell” values greater than 15,000 to define perennial rivers of approximately 10 m width, following the study of McMichael et al. (2014). We estimated study area in both basins based on the extension of the local people’s activities in the forest described during participatory mapping and guided-tours: 12,400 ha in the Flona Tapajós; 15,800 ha in the Flona Humaitá; 25,600 ha in the Jiahui Land.

Forest inventories

We randomly sampled forest plots of 0.5 ha (50 × 100 m) at different distances from pre-Columbian (0–4.1 km) and current villages (0.9–15.4 km). Plots were restricted to old-growth forest on terra-firme terrain located on plateaus in both basins (Fig. 1b-c). We did not sample forests located more than 4 km from pre-Columbian sites, because the mean distance we found between pairs of sites in the Madeira study area is 5 km and in the Tapajós study area is 2 km. To sample the plots at different distances from pre-Columbian villages, we created buffers around the ADE sites, differing in size (0–1, 1–2, 2–4 km), based on land use zones described in a previous study (Heckenberger et al. 2008). We used these buffer classes to randomly sample one location for our inventory plot per pre-Columbian village per buffer zone. We selected a location for the plot where the buffers did not overlap more than 50% with a neighbouring buffer class. Distances and buffers were established based on the information of ADE sites gathered from participatory mapping and with GPS during guided tours, and were mapped with QGIS 2.18.25 software.

During the plot inventories, trees and palms with diameters at breast height (dbh) ≥ 1 cm were sampled in sub-plots of 0.01 ha, trees and palms with dbh ≥ 10 cm were sampled in sub-plots of 0.25 ha, and trees and palms with dbh ≥ 30 cm were sampled in the full 0.5 ha plot. We counted and measured the diameter of all living trees and palms present in the plot, but we only identified and collected botanical vouchers for useful species with vernacular names listed in the interviews. Botanical identification was based on the Species List of the Flora do Brazil available at http://www.floradobrasil.jbrj.gov.br. The vouchers were deposited at the INPA Herbarium, the UFOPA Herbarium, and the EAFM Herbarium. For the analysis, we used the vernacular names given by informants because these are the units that people actually use and manage. Some vernacular names refer to single botanical species, but others refer to a group of botanical species that share similar traits, and which often belong to the same botanical genus or family (Berlin 1992).

Soil data

To detect human effects on forest soils and to identify fire events, we used a post-hole digger to collect soils and pieces of charcoal in three locations along the central plot line from 0 to 40 cm depth. Sample points cover a gradient from dark soils and dark-brown soils to yellow-brown soils. Pieces of charcoal visible to the naked eye were quantified every 10 cm of the 0 to 40 cm samples done in each of the three locations per plot; frequency of charcoal was calculated by the presence/absence of these pieces in each point at each depth, so 100% frequency is charcoal presence at all 12 points/depths per plot. We also measured the frequency of soil charcoal below 20 cm and above 20 cm and found that the frequencies of charcoal in these two soil layers were highly correlated (Spearman’s rank correlation = 0.71, p < 0.001). Due to lack of funding we could not date the charcoal found in our samples, but as suggested in the literature we assumed that charcoal above 20 cm is more likely to be associated with modern fires and below this layer is more likely to be associated with pre-Columbian fires (McMichael et al. 2012). ADE sites in the Flona Tapajós were dated between 3362 and 3160 cal BP to 666–622 and 612–559 cal BP (Figueiredo 2019), and 530–450 cal BP (Maezumi et al. 2018a). Also, ADE sites across Amazonia are likely older than 350 years BP according to the literature review of de Souza et al. (2019).

Only soil samples from 0 to 30 cm depth were dried and analysed in the Plant and Soil Thematic Laboratory at INPA for chemical and physical properties. Total phosphorus was determined by acid digestion using concentrated sulphuric acid (Quesada et al. 2010) and available phosphorus was determined by Mehlich I Teixeira et al. (2017). Exchangeable Ca, Mg, K, Na and Al were determined by the silver thiourea method (Pleysier and Juo 1980). Organic carbon was determined by Walkley and Black (1934), and pH was analysed in water at 1: 2.5 ml. The percentage of sand, silt and clay was also calculated following (Teixeira et al. 2017).

Data analysis

All statistical analyses were performed in the R platform (3.5.1, R Development Core Team 2018). To evaluate if pre-Columbian and current villages were equally distributed along major rivers, perennial rivers and streams, we measured the minimum linear distances between villages and from villages to all river types using the “NNJoin” tool of QGIS version 2.18.25. As proxies for the intensity of human activities in forest plots, we measured the walking distance from forest plots to the current villages (in kilometers - km) and the linear distance from forest plots to pre-Columbian villages (km), because we don’t know where pre-Columbian pathways were located.

We explored the variability of soil chemical and physical variables across forest plots using Principal Component Analysis (PCA) based on a correlation matrix. Soil variables with skewed distributions (total P, available P, Ca, Mg) were log-transformed (log10 + 1). We included only the percentage of clay and sand in the PCA, because texture variables sum to 100%, which means they are complementary. The PCA was used to evaluate the spatial distribution of soil properties and help identify the soil variables that are most associated with the variation in soil fertility.

To understand how pre-Columbian and current human activities have influenced forest soils, forest structure and forest resources we used structural equation modelling (SEM). SEM is based on multiple regression equations, and can be used to test indirect and direct effects and investigate causality between variables (Grace et al. 2010). To do so, we defined a priori SEM models including the direct and indirect effects of human activities and edaphic factors on forest structure and forest resource variables (Fig. S1), and the models were adjusted if necessary (Grace et al. 2010). Correlations among factors are considered and reported when significant correlations are detected. Because we defined models a priori in SEM, the models are considered to be statistically accepted when they cannot be rejected (p > 0.05), meaning that the proposed relationships accurately describe the data. We ran separate models to understand how forest structure variables (stand basal area, density of canopy stems and density of sub-canopy stems) and forest resource variables (relative basal area and relative density of useful and domesticated plants used for food and construction) are related to the four fixed predictor variables: distance from plots to pre-Columbian villages, distance from plots to current villages, soil phosphorus, and soil charcoal (0–40 cm) (Fig. S1). The sum of all stems with dbh ≥ 1 cm was calculated to define the density of sub-canopy stems in 1 ha and the sum of all stems with dbh ≥ 10 cm was calculated to define the density of canopy stems in 1 ha. The stand basal area (BA) is the sum of the basal area of all stems with dbh ≥ 10 cm in 1 ha because sub-canopy stems make a small contribution to BA. Basin (Tapajós and Madeira) was incorporated as a random factor in all models; because samples within the same region share similar characteristics as they share similar environments and we wanted to account for the spatial aggregation of our data within basins.

Structural equation models were created using mixed-linear models, which were evaluated using the “lme” function of the “nlme” package (Pinheiro et al. 2018) and performed using the “psem” function of the piecewise SEM package (Lefcheck 2016). The PCA analysis was run using the “prcomp” function and visualized using the “autoplot” function of the “ggfortify” package (Tang et al. 2016). Conditional plots were used to visualize the mixed-effect models using the “visreg” package (Breheny and Burchett 2017).

Results

Spatial patterns of pre-Columbian and modern human occupations

Pre-Columbian villages were smaller in estimated size, but more densely distributed across the landscapes (Table 1). The total area occupied by 12 pre-Columbian villages in the FLONA Tapajós is approximately 93 ha (mean size = 8 ha) out of our study area of 12,400 ha, while the estimated area occupied by houses and homegardens of the two current riverside villages is approximately 46 ha of 12,400 ha. In the FLONA Tapajós, the two current villages are located along the Tapajós River banks with a density of 0.02 villages/km2 in the study area, whereas 19 pre-Columbian villages were spread across the study area with a density of 0.15 villages/km2, with some pre-Columbian villages located up to 15 km from this river’s margin (Fig. 1; Table 1). In the Madeira River basin, pre-Columbian and current villages were found along this river’s banks in the FLONA Humaitá and approximately 70 km from this river’s margin in the Jiahui Indigenous Land. The total amount of the study area in the Flona Humaitá was 15,800 ha and the Jiahui Indigenous Land was 25,600 ha. In the FLONA Humaitá, the density of the current villages was 0.01/km2, whereas the density of pre-Columbian villages was 0.03 villages/km2. In the Jiahui Indigenous Land, the density of current villages was 0.01 villages/km2, whereas the density of pre-Columbian was 0.02 villages/km2. Although the density of pre-Columbian villages in the Tapajós is much higher than the density of villages in Madeira, we detected similar densities along these rivers´ banks and in areas distant from major and perennial rivers (70 and 20 km, respectively), but near small streams.

Table 1 Distribution of current and pre-Columbian villages, rivers and streams in the two study areas. Major rivers are the Tapajós and Madeira rivers.

Forest resources around pre-Columbian and modern human occupations

Based only on local knowledge, in the Tapajós villages we identified 193 useful plants, of which 140 were cited by the informants as managed, and of these 52 are occasionally cultivated. In the Madeira villages, we identified 167 useful plants, of which 126 are managed, and of these 49 are occasionally cultivated. Based on the list of domesticated species of Levis et al. (2017), we found that 13% of the 193 useful plants in the Tapajós and 14% of the 167 useful plants in the Madeira were domesticated by pre-Columbian people somewhere in Amazonia (i.e., not necessarily in the lower Tapajós or upper-middle Madeira).

Twenty useful plant species dominate half of the basal area across forest plots in both regions and at least a quarter of the total density of individuals (for more details of dominant species see Table S1). In both regions, current people have managed at least 15 of the 20 dominant useful species and pre-Columbian peoples likely domesticated four of them. Caryocar villosum, Dipteryx odorata, Hymenea courbaril, Bertholletia excelsa were the four dominant domesticated species found in the Tapajós plots, and B. excelsa, C. villosum, Theobroma subincanum, Caryocar glabrum were the dominants found in the Madeira plots. Thus, the structure of these forests around human occupations is mostly driven by these 20 dominant species, managed for food and construction materials, and sometimes domesticated for food.

Soil patterns within and across basins

Soil texture differs between our study sites in the Tapajós and Madeira basins (Table 2). The first axis of the PCA (PC1) explained 32.9% of the variation in soil data and separated plots with high concentrations of clay and organic matter in the Tapajós basin from plots with higher concentrations of sand in the Madeira basin (Fig. S2 and S3). The PC1 was highly correlated with clay and sand, and represents a gradient of soil texture. The second axis of the PCA (PC2) explained 31.1% of the variation in soil data and represents a gradient of soil fertility, separating plots with higher pH and nutrient contents (Ca, P, Mg) from plots with lower values of these variables. The forest plot sampled over an ADE site had a considerably higher value of soil nutrients, such as P, Ca, Mg, and pH, than all other plots. Three forest plots in the Madeira basin are located on Plinthosols, while the other plots in the Madeira basin and all plots in the Tapajós basin are located on Ferrasols; for more details of differences in soil variables in each basin, see Table 2).

Table 2 Mean, median, minimum and maximum values for each explanatory variable are presented per basin (Tapajós and Madeira).

We found that pre-Columbian human activities are associated with an increase in soil fertility (Fig. 2). Ca, total P, and the scores of the soil fertility axis (PC2) were higher in forest soils closer to pre-Columbian villages and significantly decreased with distance from forest plots to these villages (Fig. 2a, c and g, p = 0.002, p < 0.001 and p = 0.015, respectively). However, most soil variables were not related to the distance to current villages (Fig. 2d, f and h), except for Ca that increased slightly with distance from plots to current villages (Fig. 2b, p = 0.048).

Fig. 2
figure 2

Effects of the distance from plots to pre-Columbian villages and the distance from plots to current villages on soil calcium (a,b), total phosphorus (c, d), magnesium (e, f) and the second axis of the PCA that represents the soil fertility gradient (g, h). Triangles are forest plots in the Tapajós River basin (18 plots) and dots are forest plots in the Madeira River basin (9 plots). Regression plots were obtained from mixed-models and the fitted line indicates significant fixed-effects detailed in Table S2: lme (soil variable ~ distance to pre-Columbian villages + distance to current villages + (1 | Basin)). Significance probabilities (p values) are shown and R2’s are given to indicate the total variation in a dependent variable that is explained by the combined fixed variables (R2 marginal) and together with the random factor (R2 conditional). Calcium, total phosphorus and magnesium were log-transformed (log10 + 1)

Given the strong association of total phosphorus to the soil fertility axis (Table 2) and given that total soil phosphorus was previously recognized as the best indicator of anthropogenic enrichment of ADE sites (Alho et al. 2019), total P was chosen to represent soil fertility in our structural equation models. Additionally, together with nitrogen, phosphorus is the most limiting macronutrient in tropical forest soils (Elser et al. 2007), it determines crop production in Amazonian agroforestry systems (McGrath et al. 2000), and is considered a strong predictor of forest structure and dynamics across Amazonia (Quesada et al. 2012).

Effects of pre-Columbian and current landscape domestication on soils, fire events and vegetation

All SEM models were supported (Fisher C = 2.696, p = 0.26), suggesting that they described the data with sufficient accuracy. We found a significant effect of the distance to pre-Columbian villages on soil phosphorus (Fig. 2c, and Fig. 3). Total phosphorus content was higher in forest soils closer to pre-Columbian villages and decreased with distance from forest plots to these villages (Fig. 2c, β = − 0.75, p < 0.001). However, total phosphorus was not related to the distance to current villages (Fig. 2d, p = 0.596). Even after removing the forest plot on ADE site from our analysis, the total amount of phosphorus in forest soils decreased significantly with the distance from plots to pre-Columbian villages (Table S3, β = − 0.48, p = 0.014). We did not find a significant effect of the distance to pre-Columbian villages nor of the distance to current villages on the frequency of charcoal in forest soils (Fig. 3 and Table S2), although charcoal frequency decreased slightly with the distance to current villages (Table S2, β = − 0.31, p = 0.056). Charcoal was common in all forest plots of the two river basins, suggesting that wild and/or anthropogenic fire events were widespread through time (Fig. S4). Charcoal pieces were found from 0 to 40 cm of soil depth, which suggests that more than one fire event occurred in the past.

Fig. 3
figure 3

The structural equation models that show the direct and indirect effects of distance to pre-Columbian and current villages on a) stand basal area, b) density of sub-canopy stems, and c) density of canopy stems. Black continuous arrows show significant pathways, grey dotted arrows show non-significant pathways, and double-sided arrows show correlations. The standardized regression coefficient (β) and significance probabilities (p values: ≤ 0.001 ‘***’; ≤ 0.01 ‘**’; ≤ 0.05 ‘*’) are shown. R2’s are given to indicate the total variation in a dependent variable that is explained by the combined fixed variables (R2 marginal) and together with the random factor (R2 conditional). Detailed information of all structural equation models is presented in Table S2. Total phosphorus was log-transformed (log10 + 1)

We found a direct and an indirect effect of pre-Columbian activities on the metrics of forest structure, total density of sub-canopy stems and total tree basal area, respectively (Fig. 3, Table S2). The distance from plots to pre-Columbian villages had a negative indirect effect (− 0.34), through total soil phosphorus, on the total basal area of the forests, whereas the density of sub-canopy stems increased with distance from plots to pre-Columbian villages (β = 0.46, p = 0.048).

Moreover, the relative densities and basal areas of domesticated species used for food and construction were significantly affected by total P in soils, and by pre-Columbian and current human activities (Fig. 4). The distance to current villages had a positive direct effect on the relative basal area and density of domesticated species currently used for construction (β = 0.63 and 0.59, p < 0.001, see Fig. 5c and d, respectively), whereas the relative basal area and density of domesticated species used for food were not affected (Fig. 5a-b). Total P and the distance from forests to pre-Columbian villages had a positive and direct effect on the relative density of domesticated species used for food (Fig. 4b, β = 0.94 and 0.35, p < 0.001 and 0.004, respectively). In contrast, the distance from forest plots to pre-Columbian villages had a negative indirect effect, through total P on the relative density of domesticated species used for food (Fig. 4b, indirect effect = − 0.70) but had no effect on species used for construction. Interestingly, the basal area of all food species is negatively correlated with the basal area of all construction species (Fig. S5), showing that they do not dominate the same forest plots.

Fig. 4
figure 4

The structural equation models that show the direct and indirect effects of distance to pre-Columbian and current villages on a) relative basal area of domesticated species used for food, b) relative density of domesticated species used for food, c) relative basal area of domesticated species used for construction, and d) relative density of domesticated species used for construction. Black continuous arrows show significant pathways, grey dotted arrows show non-significant pathways, and double-sided arrows show correlations. The standardized regression coefficient (β) and significance probabilities (p values: ≤ 0.001 ‘***’; ≤ 0.01 ‘**’) are shown. R2’s are given to indicate the total variation in a dependent variable that is explained by the combined fixed variables (R2 marginal) and together with the random factor (R2 conditional). Detailed information of all structural equation models is presented in Table S2. Note that all species were domesticated for food, but some are currently used for construction. Total phosphorus was log-transformed (log10 + 1)

Fig. 5
figure 5

Effects of the distance from plots to current villages on a) relative basal area of domesticated species currently used for food, b) relative density of domesticated species currently used for food, c) relative basal area of domesticated species currently used for construction, and d) relative density of domesticated species currently used for construction. Triangles are forest plots in the Tapajós River basin (18 plots) and dots are forest plots located in the Madeira River basin (9 plots). Regression plots were obtained from mixed-models and the fitted line indicates significant fixed-effects detailed in Table S2: lme (soil variable ~ distance to pre-Columbian villages + distance to current villages + total P + charcoal + (1 | Basin)). Significance probabilities (p values) are shown and R2’s are given to indicate the total variation in a dependent variable that is explained by the combined fixed variables (R2 marginal) and together with the random factor (R2 conditional). Total phosphorus was log-transformed (log10 + 1). Note that all species were domesticated for food, but some are currently used for construction

Discussion

We detected effects of long-term domestication processes on forests and soils up to 4 km from pre-Columbian villages and as far as 15 km from current villages. In comparison with current traditional and indigenous villages located in protected areas, pre-Columbian villages were apparently smaller, but more densely distributed across landscapes in both regions. Our results suggest that pre-Columbian societies increased soil nutrients - mainly calcium and phosphorus - in forest soils, resulting in the formation of ADE and extensive anthropogenic brown soils across the landscape, as predicted by Denevan (2004). In contrast, we did not find a strong and significant effect of current management activities on forest soils. Other studies also found that extensive land use systems were developed in many parts of Amazonia (de Souza et al. 2019) and areas of visible human influence may extend up to 5 km from pre-Columbian settlements (Heckenberger et al. 2008). By enriching soil nutrients around their villages, pre-Columbian societies altered environmental conditions, which favored mainly species used for food that may be nutrient-demanding (Milla et al. 2015). Nowadays, indigenous and traditional societies maintain food resources in old-growth forests near their villages, while construction materials are likely being overharvested. Overall, human legacies seem to persist in old-growth forests surrounding pre-Columbian and contemporary villages due to past soil fertilization and past and current management.

Pre-Columbian and modern spatial occupation patterns

Interfluvial areas in terra-firme forests are usually considered to have sustained low population densities due to environmental limitations, and human disturbance intensity is hypothesized to decrease exponentially up to 15 km from major Amazonian rivers (Bush et al. 2015). However, we found the same density of ADE sites in riverside and in interfluvial areas more than 16 km from perennial rivers and we did not find a decay of past human occupation as we moved away from major rivers. ADE sites were found in both riverside and interfluvial areas near small streams or depressions, up to 70 km from the Madeira River and 15 km from the Tapajós River, suggesting that the probability of finding sedentary occupation sites in interfluvial areas is higher than previously predicted, as hypothesized by (Levis et al. 2014). Other studies have shown a widespread network of pre-Columbian villages from 20 to 50 ha in the upper Xingu River basin (Heckenberger et al. 2008) and historical documents reported large indigenous villages with populations of 1000 or more across other interfluvial areas (e.g., Denevan 2014) Forest landscapes on nutrient-poor soils of the upper Negro River also hold legacies of historical landscape domestication up to 13 km from perennial rivers and up to 750 m from streams (Franco-Moraes et al. 2019). Thus, it may be too simplistic to expect that the influence of pre-Columbian people in Amazonian landscapes is mostly predicted by the distance to major rivers, since perennial and temporary rivers are abundant across the basin (Junk et al. 2011) and these small rivers or streams were likely densely occupied or intensively used by pre-Columbian peoples (see maps of Levis et al. 2014; McMichael et al. 2014; Palace et al. 2017). In places where small rivers were scarce, e.g., east of the lower Tapajós River, people used natural depressions and built artificial ponds (Stenborg 2016), expanding even more their territories.

Although pre-Columbian villages were found to be spatially correlated with current villages along major Amazonian rivers (McMichael et al. 2017a), they are not correlated in interfluvial areas. Current traditional societies do not occupy and manage their lands and soil properties to the same extent as past societies did (Heckenberger et al. 2008) for different reasons, such as migration to Amazonian towns (Parry et al. 2010), agricultural intensification (Jakovac et al. 2017) and land use restrictions imposed by protected areas (Amaral et al. 2013). Consequently, the domestication of forest landscapes by modern societies is more restricted to perennial river margins, especially in protected areas, while past domestication was widespread.

Effects of pre-Columbian and current landscape domestication on old-growth forests

We found strong effects of past activities on nutrients of forest soils, especially on calcium and total phosphorus, towards the core of pre-Columbian villages (ADE sites), but a weak and negative effect near current villages. Even though most studies have defined ADE and non-ADE sites as discrete categories due to sharp contrasts in soil nutrients between these soils (e.g., Alho et al. 2019). Fraser et al. (2011) proposed a continuum of soil fertility from the core of ADE sites towards the adjacent soils in Central Amazonia. Schmidt et al. (2014) confirmed these observations, showing a widespread pattern of anthropogenic soil formation surrounding domestic areas in several regions of the Amazon basin. Brown soils around domestic zones have been observed in much more extensive areas, not only around ADE core areas in Central and Eastern Amazonia (Denevan 2004; Fraser et al. 2011), but also in naturally rich soils of Western Amazonia (Quintero-Vallejo et al. 2015). Overall, our results suggest that pre-Columbian peoples increased essential nutrient concentrations in Amazonian soils not only in their kitchen refuse piles and homegardens, but across a larger area than measured by previous studies (Fraser et al. 2011; Schmidt et al. 2014, Fig. 2). The extensive effect of past societies on modern forest soils is probably a result of pre-Columbian land use systems that combined agroforestry and forest management with efficient practices of soil improvement around villages (Denevan 2004; Stahl 2015; de Souza et al. 2019).

Across the basin, the fertility of terra-firme forest soils are correlated with a natural gradient of pedogenic development that influences physical, chemical and biological processes (Quesada et al. 2010). Total phosphorus has been used to infer soil fertility, due to its correlation with pedogenic development, and to predict forest biomass and composition (Quesada et al. 2010, 2012). Soils rich in nutrients occur mainly on young terrains under western Amazonian forests, whereas old soils poor in nutrients are commonly found under Eastern and Central Amazonian forests. Soil fertility is generally higher in the Tapajós forests than in the Madeira forest, possibly due to their natural variation in soil texture, since Tapajós soils contain a much higher percentage of clay and Madeira soils a higher percentage of sand (see Table 2). Another factor that may explain these differences is the intensity of pre-Columbian activities between these regions. Because lower Tapajós forests were more densely occupied by pre-Columbian societies than middle-upper Madeira forests (see Table 1), forest soils within the lower Tapajós region may have been strongly altered by these societies. By increasing the amount of soil nutrients and pH in their villages and likely in the areas surrounding them (Fraser et al. 2011; Quintero-Vallejo et al. 2015), pre-Columbian societies expanded the heterogeneity of forest soils and reproduced basin-wide natural soil variation in their domesticated landscapes, creating a wide range of soil conditions to cultivate domesticated plants adapted to different levels of soil nutrients (Junqueira et al. 2016).

The basal area of old-growth forests was affected by pre-Columbian soil fertilization (Fig. 3a), confirming our initial expectation that the higher nutrient availability common in anthropogenic soils would influence forest structure. Forests on and around pre-Columbian villages had a lower sub-canopy stem density and a higher basal area than forests away from these sites. However, using indices from satellite imagery, (Palace et al. 2017) found that biomass, tree height and tree cover are lower in forests located on ADE than random forest sites within 50 km of ADE. Since we worked no more than 4 km from ADE (see also (Ferreira et al. 2019), we found small-scale differences in forest structure that may be difficult to detect using only satellite data. The forest on top of ADE had an extremely high value of stand basal area (48 m2/ha) compared to our other study plots (Table S4) and other plots elsewhere across Amazonia (Baker et al. 2004), driven mainly by two individuals of sumaúma (Ceiba pentandra), which is an indicator of ADE (Clement et al. 2003). In general, forest soils with high nutrient availability promote species with high diameter increments and low wood density (Quesada et al. 2012), such as sumaúma. This broad-scale Amazonian pattern suggests that some good colonizers may persist in old-growth forests after human abandonment due to changes in soil conditions rather than due to extensive cultivation during past occupations.

Amazonian societies transformed forest landscapes around their villages by practices that not only involved direct cultivation of new seeds and seedlings, but they also selectively removed some plants (Levis et al. 2018) and certainly harvested timber products (Peres 1994; Odonne et al. 2019) The historical overharvesting of timber species could explain why the relative density and basal area of plants used for construction decreased near villages, even though the relative density of species used for food does not show the same pattern (Fig. 5). Although people use timber resources, they need to cut the trees to get timber and they may not actively replace them. In contrast, people collect food resources without the need to cut the tree and these food species have traits for fast resource acquisition that may cope better with more soil nutrients (Milla et al. 2015), and therefore persist in soils improved by pre-Columbian activities (Maezumi et al. 2018a). Ethnographic and paleoecological studies focusing on different categories of useful species may help to understand how long people have been benefiting or overharvesting plants used for food and construction surrounding their villages. For instance, pollen of useful species in lake sediments in the vicinity of archaeological sites can be used to infer human activities and enrichment of edible species during the Holocene (Maezumi et al. 2018a).

Domesticated plants were most affected by pre-Columbian and current landscape domestication inside habitation zones. Domesticated plants were extremely abundant in archaeological sites across Amazonia (Levis et al. 2017) and were strongly associated with anthropogenic soils in the Madeira River basin, as observed by Junqueira et al. (2010, 2017). Many domesticated species used today (e.g., Spondias mombin) were shown to be abundant in very fertile anthropogenic soils (Junqueira et al. 2016; Maezumi et al. 2018a), suggesting that only with very high nutrient levels found in ADE sites or in naturally fertile soils will we find higher densities of domesticated species across the basin. Indeed, the density of domesticated species is higher on archaeological sites and in geological regions where fertile soils predominate, e.g., southwestern Amazonia (Levis et al. 2017). Studies in Amazonian landscapes with naturally and anthropogenically fertile soils demonstrated that domesticated species can occur over large forest areas even centuries after they were abandoned by native populations (Paz-Rivera and Putz 2009; Quintero-Vallejo et al. 2015). However, in Amazonian landscapes with naturally poor soils (Quesada et al. 2010), fully domesticated populations may only persist on ADE sites and even there will require continuing management (Clement 1999), but incipiently domesticated populations may flourish in anthropogenic brown soils around these sites (e.g., Caryocar villosum, see Table S1 and Alves et al. 2016).

Conclusions

We found that pre-Columbian villages were densely distributed in Amazonian forest landscapes, and that the influence of past societies has persisted in forest soils that are no longer enriched by indigenous and traditional societies. Pre-Columbian soil fertilization contributed to increase the total basal and the abundance of domesticated species used for food in old-growth forests but not for species used for construction. Currently, modern societies maintain food resources in forests near their villages, while timber resources may be overharvested.

While recent land use histories are considered an important driver of secondary forest succession (Arroyo-Rodríguez et al. 2017), long-term histories are often overlooked in old-growth forests (Dambrine et al. 2007). As our findings reveal, the analysis of land use histories across a much longer time scale is crucial to understand the heterogeneity of soils and forest composition at a landscape scale in Amazonia. Particularly, the distribution of food resources of great importance to modern societies are affected by pre-Columbian soil fertilization and may depend on current management to persist. Therefore, to guide effective conservation and management plans that aim to maintain forest resources in Amazonian landscapes, we must understand how they are shaped by past and current human activities.