Introduction

Primate multilevel societies (MLS) are a form of social organization where the joint activities of a set of core social units (foraging, resting, and traveling) scale-up to generate higher societal levels, such as bands (Grueter et al. 2020). Units do not have territorial boundaries, but ranges, which completely overlap with those of other units. Interactions with other units are characterized by a lack of persistent hostility, with units generally tolerating each other’s presence. Although the units are part of a larger collective (e.g., a band), they can temporarily bud off from the band (e.g., for foraging). This form of fission–fusion where the integrity of the core unit is maintained, is distinct from the more atomistic pattern seen in single-tiered societies, such as those of chimpanzees (Pan troglodytes) and spider monkeys (Ateles spp.), where a group (community) divides into smaller sub-groups (parties) of variable size or composition and each community has its own territory or home range (Rodseth et al. 1991; Grueter et al. 2012a).

A prime example of a species where MLS are almost universally present is humans (Rodseth et al. 1991; Grueter et al. 2012a; Dyble et al. 2016). Human MLS are based on family groups that are integrated into a network of increasingly larger social tiers, such as bands and tribes (Layton and O'Hara 2010). The MLS of nonhuman primates are not as complex as human MLS. However, several tiers of social stratification can be found in a limited number of species (Grueter et al. 2012b). MLS have evolved in African papionin species, such as geladas (Theropithecus gelada) and hamadryas baboons (Papio hamadryas) ( Kummer 1968; Dunbar and Dunbar 1975; Swedell 2011; Swedell and Plummer 2012), and in Asian colobine species, in particular, the odd-nosed group, including snub-nosed (Rhinopithecus spp.) and proboscis monkeys (Nasalis larvatus) (Grueter et al. 2022). The minimum reproductive unit in the MLS of most nonhuman primates is the one-male–multifemale group, but MLS that are not based on such groups, i.e., multimale–multifemale groups, have recently been reported in Guinea baboons (Papio papio) (Patzelt et al. 2014, but also see Goffe et al. 2016) and Rwenzori colobus (Colobus angolensis ruwenzorii) (Stead and Teichroeb 2019; Miller et al. 2020).

This nested arrangement of grouping tiers allows individuals to be concurrently associated with multiple grouping levels and to harness the adaptive benefits that each level entails while minimizing costs such as feeding competition (Grueter et al. 2017). For example, the band level allows core units to ‘recruit’ extra-unit males for collective defense when needed (Xiang et al. 2013) and the core unit offers a safe haven for individuals to exchange social services, such as grooming or allocare (e.g. Dunbar 1979; Yu et al. 2013). When fission–fusion of core units is superimposed on a multilevel organization, the system becomes even more flexible, as core units also have the option to separate from the higher-level group when the availability and distribution of food resources demand it (Schreier and Swedell 2012).

By studying the characteristics of MLS in distantly related species, we can understand what environmental conditions and/or potential selection pressures are associated with such traits in different lineages. MLS are generally absent in the primates most closely related to humans (i.e., the great apes), although evidence is inconclusive in some cases (Grueter and Wilson 2021; but see Morrison et al. 2019). Consequently, studies on primates that are phylogenetically distantly related but exhibit multilevel structures are important for elucidating the evolutionary paths leading to MLS in humans (Grueter et al. 2012a).

Proboscis monkeys are endemic to the island of Borneo. This species is a large, sexually dimorphic arboreal colobine that typically inhabits mangrove forests, peat swamps, and riverine forests (Grueter et al. 2022). The basic components of their society, i.e., the minimum reproductive units (core units), are one-male–multifemale units (OMUs) which assemble with each other in riverside trees (Bennett and Sebastian 1988; Yeager 1991; Boonratana 2002), with varying degrees of aggregation depending on seasonal fluctuations in food abundance or predation pressure (Matsuda et al. 2010a). Several studies suggest that there is a multilayered social organization to this association of OMUs along the river (Bennett and Sebastian 1988; Yeager 1991; Boonratana 2002), although one study suggests that ecological factors, such as food availability and predation pressure, would be sufficient to explain the cohesion of OMUs without assuming a higher order structure of this type (Matsuda et al. 2010a). In addition to OMUs, all-male or bachelor groups (AMGs) are also found, consisting mostly of immature males who are often accompanied by adult males (Murai 2004; Matsuda et al. 2020a). Although males of these AMGs potentially start OMUs in the future (Murai 2004), little is known about their social structuring and no study yet has examined the social structure of proboscis monkeys from a genetic basis.

The aim of this study was to determine the social structure of proboscis monkeys both by direct observation of multiple identified groups and genetic analysis, which was not achieved in previous studies (Boonratana 2002; Matsuda et al. 2010a). The 6-month study of Yeager was based on the identification of multiple groups (OMUs and AMGs) and examining the interactions between them (Yeager 1991). However, we observed long-term interactions between the groups for 35 months. Given that proboscis monkeys are thought to form MLS, certain OMUs are expected to frequently maintain close proximity on their sleeping trees along the river. To gain a comprehensive picture of their complex society, we analyzed the degree of kinship between individuals within a population of proboscis monkeys through fecal DNA, albeit at a different time of the year from the time of direct observation. In proboscis monkeys, both males and females disperse from their natal OMUs to other OMUs or AMGs before they reach maturity (Matsuda et al. 2012a). However, their social networks, based on grooming interactions, are decidedly female-centered (Yeager 1990b; Matsuda et al. 2012b). We assume that the emergence of societies in which females are central to social interactions would be based on a high degree of kinship between females within OMUs (Tinsley Johnson et al. 2013). Therefore, we hypothesize that although both sexes disperse between groups, females would tend to disperse to more proximal OMUs than males (Guo et al. 2015; but see Fang et al. 2022) and/or to be motivated to maintain post-dispersal contact with maternal female kin (Städele et al. 2016). We also hypothesize that males disperse over longer distances than females, whose dispersal distances are shorter. Lastly, we discuss the similarities and differences between the findings from this study and those from studies of closely related snub-nosed monkeys, with better studied MLS (Grueter 2022).

Methods

It was not possible to record data blind because our study involved focal monkeys in the field.

Study site and animals

We conducted the study in a riverine forest along the Menanggul River, a tributary of the Kinabatangan River, Sabah, Malaysia (118°30′ E, 5°30′ N). The mean minimum and maximum daily temperatures were approximately 24 °C and 30 °C, respectively, and the mean annual precipitation at the site was 2474 mm (Matsuda et al. 2019). The river level varied by approximately 1 m daily. The average river level rose > 3 m during seasonal flooding. The riverine forest was inhabited by eight diurnal primate species, including our study subject, the proboscis monkey. Proboscis monkeys under observation were well habituated to observers in boats, because this area is one of the main tourist attractions in the region, with many boats and tourists visiting the Menanggul River almost on a daily basis during the last decade.

Behavioural observation in 1999–2002

Along the Menanggul River, using a GPS unit, location points were collected at intervals of 50 m over a transect stretching from the river mouth of the Menanggul River up to a point 6 km upstream. Proboscis monkeys typically return at night to the riverbank to sleep in our study site (Matsuda et al. 2010b). In this study, the core observation area was set up from the river mouth up to 4 km upstream, where boat-based observations were performed in the early morning (06:00–09:00) and/or late afternoon (15:00–18:00). However, if we could not find groups of proboscis monkeys in the area for up to 4 km, we extended the survey to the upstream area of up to 6 km.

We conducted the behavioural observations from February 1999 to October 2001, and in May and June 2002 (560 days). By late June 1999, eight OMUs and one AMG were identified by distinguishing all adult males and a few adult females based on their physical features, such as scars and nose shapes. The number of OMUs observed in the study area was consistently eight (Koda et al. 2018); the mean number of observed days for each group (OMUs and AMGs) was 187.3 (standard deviation = 117.5) with a range from 12 to 366 days (see Supplementary Table 1 in details). The replacement of the dominant males, named Ba, Wa, and Bu, was observed on three occasions during the study period in August 2000, November 2000, and October 2001, respectively (Murai et al. 2007). In each case, the replacing males came from the AMGs. We refer to OMUs before the male replacement occurred as “senior” OMUs (Ts, Be, Po, Bo, Yo, De, Pu, and Ki) and OMUs after the male replacement as “junior” OMUs (BoWa, PoBu, and KiBa) for the sake of convenience. Throughout the study period, the mean number of individuals comprising OMUs was 18, ranging from 8 to 34, and AMGs consisted of approximately 30 males (Supplementary Table 2). The names of the OMUs and AMGs and the locations of their sleeping sites were recorded whenever proboscis monkeys were found along the river in the late afternoon.

Genetic analysis

DNA sampling in 2015–2016

From July 2015 to April 2016, we collected fecal samples from proboscis monkeys in the study area for genetic analyses. A boat survey was conducted in the late afternoon to detect the group locations and record the GPS coordinates of their sleeping sites: early morning the next day, when the monkeys were still asleep, the sites were revisited while we waited in the boat until the monkeys moved inside the forest. As proboscis monkeys often defecated shortly before moving into the forest, fresh feces were collected by carefully exploring the ground near their sleeping trees after they had left them. However, several groups often stayed in the trees along the river in close proximity and it was difficult to reliably distinguish the group to which the respective feces on the forest floor corresponded. We rubbed the surface of the fecal pellets with cotton swabs, and dipped the swabs in 2 mL tubes containing 1 mL lysis buffer, consisting of 0.5% SDS, 100 mM EDTA, 100 mM Tris–HCl, and 10 mM NaCl (Longmire et al. 1997). The swabs were discarded, and the tubes were stored at room temperature for ~ 400 days after collection until they were delivered to the laboratory, where they were kept at − 80 °C until DNA extraction. The DNA of 307 samples was extracted with QIAamp DNA Stool Mini Kit (Qiagen) using TE (1 M Tris–Cl, 0.05 M EDTA; pH 8.0) as elution buffer.

Microsatellite screening and sex identification

Twenty-four markers (23 polymorphic microsatellite loci and 1 fragment of the DEAD-box gene) were amplified for genetic analyses. The DEAD-box gene was used to identify sex (Inoue et al. 2016). The microsatellite loci were selected based on the method of Inoue et al. (2016) and Salgado-Lynn et al. (2010), with four PCR multiplex reactions optimized for this study (Supplementary Table 2).

PCR was performed using the Applied Biosystems Veriti 96 Well Thermal Cycler PCR machine. All PCR amplifications were performed in 15 µL reactions, containing 1 × Master Mix (Qiagen Multiplex Kit), 0.4 µg/µL of BSA, 0.2 µM of each primer used for the multiplex combinations, and 2 µL of 10–100 ng of template DNA. After an initial incubation at 95 °C for 15 min, PCR amplification was performed for 40 cycles, consisting of denaturation at 94 °C for 45 s, annealing at 60 °C or 62 °C for 90 s, extension at 72 °C for 90 s, and a final extension at 72 °C for 10 min. Then, 2% agarose gels were used to visualize band quality and verify target band size. The PCR products were sent to First BASE Laboratories (Kuala Lumpur, Malaysia) for fragment analysis. The results of fragment analysis were scored using the software GeneMarker version 2.6.3 and corrected by eye.

We used 20 samples for a pilot study to determine the number of positive PCR repetitions needed to obtain a reliable genotype. A consensus threshold (100 simulations; the range of repetitions was from two to several) was produced in GEMINI v.1.4.1 (Valière et al. 2002) in the “Consensus Threshold Test” module. The consensus threshold was used in a “PCR Repetition Test,” also in GEMINI (1000 simulations; the range of repetition was two to 10), which showed that three positive repetitions for each multiplex were sufficient to achieve a reliable genotype, with a maximum of five positive repetitions to clarify ambiguities. However, if there were no PCR products after three PCR rounds in two or more multiplexes, the sample was not included in the analysis.

Of the 307 samples from which DNA extraction was performed, 267 fecal samples were genotyped, and only 197 were included in relatedness analysis (< 10% missing data). Noted that feces from the same individuals were excluded to prevent duplicate analyses after the genetic profiling; a sample of the same individual with the least missing data was retained for analysis. GenAlEx 6.51b2 (Peakall and Smouse 2006, 2012) was used to calculate heterozygosity, frequency-based statistics, and polymorphism (see Supplementary Table 3). A Hardy–Weinberg Equilibrium (HWE) exact test was performed separately for males (N = 64) and females (N = 133) in Genepop 4.7.5 (Raymond and Rousset 1995; Rousset 2008) using default Markov chain parameter values. Neither group was in HWE (p < 0.001).

Data analysis

Degree of association between groups

The distance between the edges of the sleeping site location of each OMU or AMG along the river was used as a criterion to determine if they were alone or in association. In association was defined as 100 m or less between the edges of the sleeping site of the relevant OMU or AMG. The rationale for this approach follows previous studies where OMUs of few individuals or OMUs with identified or unidentified individuals were considered members of an OMU if they were observed to be within a distance of 100 m of an OMU (Kern 1964; Kawabe and Mano 1972; Macdonald 1982; Salter et al. 1985). Indeed, long-term focal tracking of a specific OMU also indicates that group members are rarely more than 50 m from each other (Matsuda et al. 2009), hence it would be appropriate to define the close proximity of two or more groups as within 100 m of each other, which was also recommended as being operationally suitable by Yeager (1992). As a result, OMUs were in close proximity within 100 m in 23.7% of the total number of observation days in this study (Supplementary Fig. 1). However, the frequency with which the relevant groups had sleeping sites within 100 m from each other is not an appropriate index of the degree of their association as the frequency of appearance of each group along the river varied. Thus, to assess intergroup associations, we calculated the percentage of days when the relevant groups had sleeping sites within 100 m from each other, based on the number of days when the sleeping sites of relevant groups were detected along the river. As all groups were identifiable and boat surveys along the riverside were carried out one way, no duplicate groups were observed during the survey time. In addition, although groups almost always return to the riverside trees in the late afternoon to sleep (Matsuda et al. 2010b), each group appeared along the riverside at a different time of day. Therefore, it was not always spotted them all during the survey time.

We generated weighted social networks between the groups based on the degree of association calculated using the R package ‘igraph’ (Csardi and Nepusz 2006). In this study, two types of social network diagrams were drawn including all study groups to visualise how their social network changed, before and after the replacement of the dominant male in the OMUs (see detailed observation periods of each group in Supplementary Table 1), i.e., the network including the data from 8 senior OMUs and 1 AMG (before replacement) and the one including the data from 5 senior OMUs, 3 junior OMUs and 1 AMG (after replacement). Additionally, we used cluster analysis (Quinn and Keough 2002) to assess similarity relationships for the associations among study groups based on the same dataset using the social networks. The degree of association among groups was clustered using Euclidean distance and Ward’s linkage method (Quinn and Keough 2002). The cluster analysis did not include the OMU “PoBu” due to limited observation cases, i.e. < 20 cases (see Supplementary Table 1 for details). The quality of a clustering was validated by an analysis of silhouette values (Rousseeuw 1987), which vary from − 1 to 1 and represent the tightness of the data points within a cluster and the separation between different clusters in a given model. The silhouette value \(S({x}_{i})\) for a single data point \({x}_{i}\) is computed as

$$S ({x}_{i}) = \frac{b \left({x}_{i}\right)- a ({x}_{i})}{max(a({x}_{i}), b ({x}_{i}))}$$

where \(a ({x}_{i})\) is the average Euclidean distance of \({x}_{i}\) to all points in the same cluster, and \(b ({x}_{i})\) is the minimum average Euclidean distance of \({x}_{i}\) to all other clusters in which \({x}_{i}\) is not a member. Thus, low values of \(a ({x}_{i})\) indicate that \({x}_{i}\) is representative of the cluster, whereas high values of \(b ({x}_{i})\) indicate that \({x}_{i}\) is much different from the other clusters. Higher values denote higher clustering quality, whereas negative values denote that the data point should otherwise belong to a different cluster. By computing,

$${S}_{k} = \frac{1}{{n}_{k}}{\sum }_{{x}_{i}\in {C}_{k}}S ({x}_{i})$$

as the average of \(S({x}_{i})\) over all \({n}_{k}\) points in cluster \({C}_{k}\), one obtains an indicator of how well separated the cluster is from all others. An indicator for the complete clustering is obtained by averaging \({S}_{k}\) over all \(K\) deduces both \(a({x}_{i})\) and \(b({x}_{i})\) simultaneously, and thus, a larger number of clusters does not necessarily improve the silhouette score. This property makes the silhouette a better indicator for cluster quality than, for example, the log-likelihood of data, because the latter merely increase with a larger \(K\).

Relatedness and Fst

We used the Moment Estimate of Relatedness method of Wang (2002) as implemented in Mer2 to estimate relatedness between all pairs of individuals and to calculate mean total relatedness (R) and its standard deviation using 10,000 bootstrap replicates, which simulate allele frequencies. Using a Kolmogorov–Smirnov test (KS-test), we tested for differences in relatedness between male-male and female-female dyads in two data sets. The first data set included all dyads, and the values for dyads with negative estimated relatedness estimates were transformed to zero as negative relatedness values indicate no shared alleles due to common descent.

The effect of spatial autocorrelation was measured to determine the effect of geographical distance between individuals and the relatedness they showed for females and males separately. For this, we used the geographic distance between individuals measured with respect to the mouth of the river and R values between them with negative values converted to zero (i.e., no kinship). Based on these data, we tested whether there was a correlation between geographical distance and genetic distance (isolation by distance) using the Kendall rank correlation coefficient as the data was not normally distributed (one sample KS-test, p < 0.001).

Given that our analysis counted each sample as a single point, the statistical power to detect a relationship between degree of relatedness and geographical distance was potentially weak. To increase the statistical power of the data, we subdivided the entire set of samples into zones (i.e., with multiple pairwise comparisons per zone to more accurately estimate means and variances) and tested for a relationship between the mean relatedness between zones and the corresponding geographic distance. In other words, the rationale for carrying out this analysis using bins is that by grouping samples in bins, we increased the statistical power to detect differences between bins. Based on the location points along the river where the fecal samples were collected, we divided the locations into distance bins as measured from the river mouth: (i) 0–500, (ii) 501–1000, (iii) 1501–2000, (iv) 2001–2500, (v) 2501–3000, (vi) 3001–3500, and (vii) 3501–4000 m. Notably, we excluded zone 1001–1500 m, because we collected only one fecal sample in that section. For each distance bin, we calculated the average relatedness of the dyads in the bin and used that for the correlation against geographical bins and the upper boundary of the distance bin. Based on these data, we tested whether there was a correlation between geographical zone and genetic distance using the Pearson’s r was used as the data was normally distributed (Shapiro Wilk test, p > 0.05). In addition, within those sampling zones, we estimated pairwise divergence FST of Wright (1951) with the method of Weir and Cockerham (1984) between all pairs of zones and we used Mantel tests to determine the relationship of the matrix of geographical distance between pairs of zones, e.g., two adjacent zones were separated by 500 m, whereas the most distant zones were separated by 4000 m. The p threshold for the Mantel test was corrected with the FDR method of Benjamini and Yekutieli (2001). The FST range is from zero (no differences between allele frequencies in the two groups compared) to one (completely different allele frequencies in the two groups compared). We calculated them with the package hierfstat (Goudet 2005) in R (R Core Development Team 2023).

Limitations of analysing data collected at different time periods

In this study, although behavioural data were collected by direct observation based on individually identified groups, genetic samples were collected from the same population more than 10 years after those observations were carried out, and the data collected at these different times were combined to discuss the social structure of the proboscis monkeys. When the genetic samples were collected, the previous identification sheets could not be applied, so the samples had to be collected without identifying individuals. Noted, however, even if a group could be identified, as noted above, several groups often stayed in close proximity to trees along the river, making it difficult to reliably distinguish each faeces on the forest floor corresponding to which group. Anyhow, although the data were collected from the same population, it should be noted in interpreting the results of this study that the behavioural observation data and the genetic data were not linked in a real-time context. However, we believe that even if generational changes in individuals would occur over a period of 10 years, the overall basic social structure would not have changed significantly and the genetic structure within the population would not have changed significantly either. This is because the population dynamics of the proboscis monkeys at the study site remained stable within the 10-year period (Matsuda et al. 2020a).

Results

Direct observation of social structure in 1999–2002

The total number of observations per group is provided in Supplementary Table 1. Groups differed in the number of observations, ranging from 12 (PoBu) to 366 (Po). The weighted graph, based on the degree of association of each group pair, is shown in Fig. 1a. Based on the evaluation using silhouette values, hierarchical cluster analysis indicated two major clusters, i.e., band A, including the OMUs, Ts, Be, Po, and Bo (\({S}_{k} =\) 0.236); and band B, including the OMUs, Yo, De, Pu, and Ki, and the AMG (\({S}_{k} =\) 0.273; Fig. 1b). A junior OMU (BoWa) kept belonging to band A (\({S}_{k} =\) 0.186) even after the group was replaced by young males (Wa) from the AMG (Fig. 1c). Replaced PoBu was not included to the clustering analysis due to the limited number of observed cases, i.e. < 20. However, another junior OMU (KiBa) moved to band A from band B (\({S}_{k}\) = 0.308) after the group of Ki was replaced by Ba. The AMG flexibly changed the band while replacements occurred (Fig. 1d).

Fig. 1
figure 1

Network graph of the one-male–multifemale units (OMUs) and all-male group (AMG) identified with cluster analyses, illustrated based on the association index. The thickness of the edge reflects the degree of association index and the height between two clusters in a dendrogram indicates the dissimilarity/distance between them, from February 1999 to August 2000, when the first replacement of males in OMUs was observed (a, b), and thereafter, to June 2002, including two further replacements of males in OMUs (c, d). The cluster analysis did not include PoBu due to limited observation cases, i.e. < 20, and the two statistically estimated communities (bands) are colored

The two bands in the study site were likely to segregate to areas in which they slept along the Menanggul River, although these widely overlapped. The downstream area of the Menanggul River was predominantly occupied by band B, as shown by a mean distance of sleeping sites from the river mouth of 923 m, ranging from 0 to 2000 m (Fig. 2). By contrast, band A mostly occupied the upstream area, with a mean sleeping site distance from the river mouth of 2042 m, ranging between 400 and 5800 m. The home range of the AMG spread over the Menanggul River, with a mean sleeping site distance from the mouth of the Menanggul of 2837 m and a wider range from 0 to 6000 m.

Fig. 2
figure 2

Distribution of sleeping sites of each band and all-male group (AMG) along the Menanggul River. The dotted lines in the graph represent the mean location of the sleeping location based on the distance from the river mouth of each band and AMG. Different bands and the AMG are colored

Genetic analysis of social structure in 2015–2016

Overall relatedness pattern

The total number of relatedness pairwise comparisons measured among the 197 samples was 35,511, of which 27,777 (78%) were negative numbers, thus indicating that those pairs of samples are not related to each other. Of the remaining dyads with positive relatedness values, there were no identical twins, 187 parent–offspring or full sibling dyads, and ~ 1207 relationships that could be considered cousins or half-siblings (Supplementary Fig. 2).

Female vs. male relatedness

There was a significant difference in relatedness values between male-male and female-female dyads when including all same-sex dyads (KS-test p < 0.001), with the males showing on average a higher relatedness (0.0481 ± standard deviation: 0.109) than the females (0.0369 ± 0.094).

Relatedness based spatial autocorrelation

Based on relatedness values, a significant negative correlation between geographic distance and genetic distance was found for both sexes, with females (N = 9045 dyads) showing a negative Kendall’s τ =  − 0.066 (p < 0.001) and males (N = 2016 dyads) showing a slightly less negative Kendall’s τ =  − 0.053 (p = 0.0015; Fig. 3). Overall, as geographic distance increased, the degree of relatedness decreased, indicating that the more distant the two individuals were, the lower their relatedness.

Fig. 3
figure 3

Relationship between geographic distance and genetic distance among the 197 samples measured. The solid lines are the mean relatedness across distance classes for the males (red) and females (black) and the dashed lines are the upper and lower boundaries of the mean plus and minus standard deviation of the relatedness for males and females across distances, respectively

When investigating the correlation between relatedness values and distance zone, for each sex, a significant reduction was observed in relatedness as geographical distance increased (female Pearson’s r =  − 0.906, p = 0.0048; male Pearson’s r =  − 0.947, p = 0.0012), with approximately 82% of the difference in relatedness explained by distance for females and 89% for males.

Pairwise differentiation along the river and isolation by distance

The relationship between geographical distance and degree of differentiation (FST) was examined after dividing the samples into zones based on the location from which their fecal samples were collected. Significant differences were found in the degree of FST in the pairwise comparisons of 4 out of 21 zones-by-zone comparisons for males (Table 1). Males tended to exhibit significant differentiation with individuals living in the uppermost zone of the river (3501–4000 m) from those living near the river mouth, which is indicated by an increase in genetic differentiation with an increase in geographical space between them (Mantel test squared correlation coefficient r2 = 0.603, p = 0.0055), and thus, approximately 60% of the difference in allele frequency between males can be explained by the geographical distance between the sampling points. Although females in 5 out of 21 zones-by-zone comparisons showed significant genetic differentiation based on their FST values (Table 1), FST values did not increase with geographic distance (Mantel test squared correlation coefficient r2 = 0.022, p = 0.6730).

Table 1 Pairwise differentiation (FST) along the river, divided the locations into distance bins that reflected samples separated from each other by (i) 0–500, (ii) 501–1000, (iii) 1501–2000, (iv) 2001–2500, (v) 2501–3000, (vi) 3001–3500, and (vii) 3501–4000 m. Notably, the zone 1001–1500 m was excluded as only one fecal sample was collected in that section. Bold numbers denote the pairs that were statistically significant

Discussion

The findings strongly suggest that the proboscis monkey exhibits a form of MLS, consisting of several core reproductive units and bachelor groups interwoven with higher-order bands. Our study also highlights sex-specific relatedness differences, a negative correlation between relatedness and geographic distance, and distinct genetic differentiation patterns along the river for males and females. Notably, males showed slightly higher relatedness than females, though a negative correlation between relatedness and geographic distance was consistent for both sexes, indicating decreasing relatedness with increased distance. In addition, pairwise differentiation along the river was significant for males, with 60% of allele frequency differences explained by geographic distance, though females showed genetic differentiation in specific zones without an increase in differentiation with distance. These results suggest that females migrated randomly across both short and long distances, whereas males tended to migrate relatively shorter distances than females, possibly leading to patrilineal characteristics within the population of the studied species. Thus, it is more likely that our hypothesis that proboscis monkeys form a matrilineal, multileveled society is ruled out and that it would be rather patrilineal. The form of MLS inferred from either the results of direct observation or genetic analysis, and that inferred from the integration of these two results, will be discussed in more detail below.

Inferring MLS based on observational data

The findings of this study revealed that the social system of the proboscis monkey takes the form of a MLS, with OMUs (core reproductive units) and the AMG (bachelor group) woven together into a higher-level band. Several studies have reported that OMUs of proboscis monkeys are not territorial and sleep in close proximity in the trees along the riverside where they spend the night (Bennett and Sebastian 1988; Boonratana 2002). In particular, the study by Yeager (1991), in which multiple groups were identified and their relationships examined over 6 months, provide the most detailed analysis of the social organization of proboscis monkeys. The results of Yeager are generally in accordance with our results: two bands consisting of multiple OMUs and AMGs were detected, and the two bands were loosely segregated upstream and downstream of the river, with a certain degree of overlap in their distribution ranges.

Although this study confirmed that the proboscis monkey forms MLS, a previous study conducted by us at the same site (Matsuda et al. 2010a) was ambiguous in this regard. That study showed that changes in local density of OMUs along the river (defined as the number of observed sleeping sites of OMUs other than the focal OMU on a 1000 m long segment along the river using the sleeping site of the focal OMU as the midpoint) can be predicted by spatial heterogeneity along the river in relation to predation pressure and food availability. Namely, locations with a narrower river width are advantageous for predator avoidance, because animals can escape by leaping to the other side of the river when attacked by a predator, and better foraging locations where clumped food patches are abundant are also advantageous. In other words, the local density of OMUs may increase where better sleeping conditions are available, which suggests that associations among units are induced by habitat features and are not necessarily the outcome of social attraction. However, this previous study focused on only one identified focal OMU, whereas other OMUs that stayed around that focal OMU were not identified and treated as local density. Hence, the aspect of whether a particular affinity among specific OMUs existed during the period of increased local density was ignored. Integrating the results of our previous study and this study, it appears that the proboscis monkeys form MLS with specific OMUs at their core, and that the degree of association among units varies seasonally, depending on food abundance and predation pressure. Indeed, the association between identified OMUs of proboscis monkeys at other study sites have been reported to be seasonal, although no ecological explanations for such seasonality were provided (Yeager 1991). A detailed review has shown that ecological conditions, such as food abundance, are merely permissive and do not seem to drive the nested nature of the MLS of Asian colobines (Grueter and van Schaik 2009). However, it is premature to discard the impact of ecological factors, such as resource abundance and predation pressure, on the evolution of MLS in proboscis monkeys (see also Grueter 2022).

In our long-term study of 35 months, we observed three occasions of adult male replacement in the OMUs, which offered new insights into the social system of the proboscis monkey. We found that, after the replacement of the dominant male, an OMU would occasionally switch band affiliation. Changes in band affiliation have also been documented for the AMG. Our observational protocol did not allow us to explore the reasons behind switches in band affiliation. These could include avoidance of social disadvantages, e.g., a low unit dominance rank. Understanding the motivations for changes in band affiliation requires further detailed individual-level observations of both OMUs and AMG.

Inferring MLS based on genetic data

Although several genetic studies have been performed on proboscis monkeys, e.g., population genetics focusing on mitochondrial control region (Munshi-South and Bernard 2011), development of polymorphic microsatellite loci (Salgado-Lynn et al. 2010; Inoue et al. 2016), and their application to zoo studies (Ogata and Seino 2014), this study was the first to examine their social structure genetically by estimating relatedness in a large number of individuals (N = 197) in the wild. Nonetheless, the results still need to be interpreted with caution due to the potential error-prone of estimating dyadic relatedness from a limited number of microsatellites. Using 24 markers still carries a chance of such a misclassification of kinship (Csillery et al. 2006; Van Horn et al. 2008). In fact, the standard deviation of the estimated relatedness values was large.

The negative correlation between genetic and geographic distances is comparable to that reported for many other primates (e.g. Hagell et al. 2013; Mbora and McPeek 2014; Oklander et al. 2017). By contrast, the significantly higher degree of relatedness between males than females in the population at the study site was rather unexpected in the proboscis monkey, for which a more female-centered social interaction network has been observed (Yeager 1990b; Matsuda et al. 2012a, b). Thus, social bonds between females may be driven by factors other than relatedness. A notable feature of social interactions in colobine monkeys, including proboscis monkeys, is non-mothers handling infants, i.e., allomothering behaviour (Davies and Oates 1994; Matsuda et al. 2012b, 2022), although sometimes handlers hurt or abuse the infant and do not provide genuine care (McKenna 1979). Social bonds between females may be enhanced through such allomothering behaviours (Zhang et al. 2012; Matsuda et al. 2015), which can have various functional benefits, e.g., increased foraging efficiency for mothers and parental training of nulliparous females (Maestripieri 1994). Therefore, strong ties between females within groups may develop even in the absence of close relatedness (McKenna 1979).

FST-based analyses indicated that although females randomly migrated short or long distances, males were more likely to migrate relatively shorter distances than females and rarely migrated long distances, such as from the mouth of the river to > 4000 m upstream (Table 1). This may simply indicate that, in the two different bands, males disperse more proximally within the band to which they belong, whereas females can disperse proximally or distally within and out of the band.

Inferring MLS based on matching observational and genetic data

When combining the results of the direct observations conducted in different periods from genetic research, it is more likely that females disperse more distally than males based on the results of FST analysis, which is seemingly contradictory to the results through the direct observation, i.e. the AMG utilised the riverside more extensively than OMUs, which included females (Fig. 2). If a larger community structure encompassing bands A and B, which might be termed a “herd”, exists in the proboscis monkey society (Fig. 4), this may explain why Fst was significantly different between males in the upper and lower streams. Males may disperse within a band as well as between bands, whereas females may be able to disperse beyond the herd. Because the MLS of golden snub-nosed monkeys (Rhinopithecus roxellana), a phylogenetically close relative of proboscis monkey, has been proposed to form grouping levels above the two-tier structure of OMUs and bands (Fig. 4; Qi et al. 2014, 2017), the possibility of a three-tiered social organization in proboscis monkey cannot be ruled out.

Fig. 4
figure 4

A schematic illustration of (a) a multilevel societies (MLS) in golden snub-nosed monkey based on Qi et al. (2014) where the herd level is based on spatial proximity between a band and one or more all-male units (AMUs) and the troop level is based on the spatial proximity between two herds; (b-1) proboscis monkey MLS based on genetic data only; and (b-2) proboscis monkey MLS inferred from observational and genetic data. Symbols and colors indicate males vs. females and lines show dispersal patterns

The MLS of proboscis monkeys is comparable to that of snub-nosed monkeys, especially in terms of the flexible dispersal patterns seen in both sexes (Qi et al. 2009; Zhang et al. 2012). However, there is a subtle difference in that in golden snub-nosed monkeys, each breeding band composed of OMUs is accompanied by all-male units (AMUs) which jointly form a herd (Qi et al. 2014, 2020), whereas in proboscis monkeys, the AMG covers multiple bands composed of OMUs (Fig. 2), although the closeness of the AMG to either band would appear to be flexibly changing over time. This inference is based on both genetic data and direct observations (Fig. 4). As discussed above, results from the genetic data alone suggest that proboscis monkeys exhibit MLS similar to that of snub-nosed monkeys. Although the lack of analysis of individual-level behaviour is a limitation of this study, the more constrained dispersal patterns in males than those in females (which did not seem to migrate beyond the herd) may be a difference between the social systems of proboscis and snub-nosed monkeys. Accordingly, the social system of proboscis monkeys would be unique in that it has a patrilineal genetic basis. Assuming the MLS in proboscis monkeys is patrilineally structured, the following two observations would be well explained: (1) male–male fighting causing serious injury is rarely observed in proboscis monkeys, although sexual competition between males is intense, as evidenced by their large noses and body sizes (Koda et al. 2018; Matsuda et al. 2020b), and (2) infanticide has not been observed in natural populations of proboscis monkeys. Invoking inclusive fitness theory, if males are related to each other within a band or herd, the adult males of the OMUs are expected to tolerate each other and avoid excessive fighting over females, and likewise, the males of AMGs are expected to avoid taking over the group through serious physical challenges to the adult males of the OMUs. This aspect of the social system of proboscis monkey bears resemblance to the society of hamadryas baboons where males of the same clan (the tier above the OMU in their MLS) are philopatric, patrilineally related, spatially associated, and frequently engaged in social interactions and show mutual respect of female ‘ownership’ ( Kummer et al. 1974; Abegglen 1984; Schreier and Swedell 2009; Städele et al. 2014). However, this scenario differs from that of the non-patrilineally based MLS of R. roxellana, where there is even collective action involving OMU males (Xiang et al. 2013).

Although infanticide has been reported in proboscis monkeys under provisioned conditions with a high population density (Agoramoorthy and Hsu 2005), infanticide has not been reported under natural conditions. Females with infants have also been reported to transfer to other OMUs, but no infanticide or attacks by dominant males have been observed in these cases (Bennett and Sebastian 1988; Matsuda et al. 2012a). After the takeover of an OMU by a male from an AMG, the risk of infanticide will be reduced if the usurping male is related to the ousted male. Hence, the mechanism underlying the lack of infanticide in proboscis monkeys may be different from that of the snub-nosed monkey, where infanticide is thought to be constrained by paternity uncertainty resulting from extra-unit mating (Qi et al. 2020). Because extra-unit mating has not been reported in proboscis monkeys (Yeager 1990a; Murai 2006; Boonratana 2011), it is unlikely that a similar scenario would act as an additional or alternative deterrent to infanticide.

Understanding why a MLS with a patrilineal basis, a key step in the evolutionary history of human society (Chapais 2008), emerged in a phylogenetically distantly related primate species, such as the proboscis monkey, may provide valuable clues about human social evolution. However, the evolutionary pathways leading to MLS in humans and colobines appear to be different; in humans, MLS are likely the product of internal fractionation of ancestrally multimale-multifemale groups whereas colobine MLS seem to have emerged as a result of the amalgamation of originally autonomous OMUs (Grueter et al. 2012a). A comprehensive understanding the internal structure of multileveled societies in various animal species and inferring the process of their social evolution would further shed light on the uniqueness of human societies and contribute to developing insights into the evolutionary process of our societies.