Introduction

Reconstructing activity in the past is a major objective of biological anthropology. Researchers mostly follow the principle of “functional adaptation of the bone” (Lieberman and Friedlaender 2005; Ruff et al. 2006), according to which bone morphology can alter during life to ensure its endurance against changes in activity-induced strain (Ruff et al. 2006). On this basis, various skeletal proxies of physical activity have been proposed, such as cross-sectional geometric properties, trabecular architecture, or the bone surface areas where ligaments and/or muscles attach (often defined as “entheses”; e.g., Bennell et al. 1997; Benjamin et al. 2002; Benjamin et al. 2006; Vickerton et al. 2014; Kivell 2015; Dunmore et al. 2020). Similarly, other approaches have focused on the degenerative aspects of bone and studied osteoarthritis and/or enthesopathy (e.g., Cope et al. 2005; Villotte and Knusel 2014; Molnar et al. 2011; Milella 2018; Laffranchi et al. 2020).

The most usual approaches for the study of entheseal changes are different scoring systems relying on visual observation of the entheseal expression and/or enthesopathy (Hawkey and Merbs 1995; Robb 1998; Stirland 1998; Mariotti et al. 2004, 2007; Villotte 2006; Villotte et al. 2010; Niinimaki 2011; Cashmore and Zakrzewski 2013; Weiss 2015; Kubicka and Myszka 2020), for the ultimate purpose of reconstructing physical activity in the past (e.g., Molnar 2010; Havelkova et al. 2011, 2013; Niinimaki 2012; Weiss et al. 2012). Nevertheless, several studies focusing explicitly on scoring entheseal changes have reported that these methods present issues of reproducibility (Davis et al. 2013; Henderson et al. 2016; Wilczak et al. 2017; Jorgensen et al. 2020). Other authors, such as Wilczak (1998) and Henderson (2013), used 2D measurements of entheses. With the recent advent of high-definition three-dimensional (3D) scanning, new methods of entheseal measurement were developed (e.g., Noldner and Edgar 2013). These were used to quantify entheseal 3D form and analyze it using complex and rigorous statistical analyses (Karakostis and Lorenzo 2016). Importantly, the latest developments in 3D scanning technologies using structured-light handheld models have greatly decreased the time required to generate high-resolution scans. For instance, the equipment used in the present study (see the section below) can provide a 3D model of a human hand bone within less than 5 minutes (including the process of texture mapping).

Most previous 3D studies rely on isolated entheses located in the upper limb (e.g., Noldner and Edgar 2013; Nolte and Wilczak 2013; Karakostis and Lorenzo 2016). Indeed, the upper limbs are the part of the skeleton that is expected to be the most impacted by repeated activity during life compared with the lower limbs or the axial skeleton (Cashmore and Zakrzewski 2013; 335). However, as some authors previously suggested, muscles act in coordination, and studying only isolated entheses leads to a substantial loss of functional information (Robb 1998; Stirland 1998; Henderson and Alves Cardoso 2012).

Recently, a new virtual approach for analyzing entheses was introduced, which involves the use of a detailed protocol for precise delineation and measurement of 3D entheseal surfaces (e.g., Karakostis and Lorenzo 2016; Karakostis et al. 2017, 2021). This is followed by multivariate analyses aiming to reconstruct patterns of habitual coordination among different muscles (i.e., synergies). This method, which was first developed by one of us in 2016 (author FAK; Karakostis and Lorenzo 2016), has recently been named the “Validated Entheses-based Reconstruction of Activity” (V.E.R.A.) method in a dedicated literature review (Karakostis and Harvati 2021). Its reliability has been validated both on human skeletons with long-term occupational documentation (Karakostis et al. 2017) as well as in three experimental studies involving diverse laboratory animal species (Karakostis et al. 2019a, b; Castro et al. 2021). In particular, Karakostis and colleagues (2017) reported consistent differences between lifelong heavy manual laborers (presenting an entheseal pattern associated with a power-grasping muscle group) and long-term workers of lower intensity and/or more mechanized occupations (showing an entheseal pattern corresponding to a thumb-index precision-grasping muscle group). Importantly, these multivariate entheseal patterns were not associated with variation in biological age (all individuals were less than 50 years old, given the results of previous research showing the increase of pathological lesions at entheses after around this age and the decrease of a functional signal; e.g., see Mariotti et al. 2007; Villotte et al. 2010), body-size, or sex. Despite these promising findings, as highlighted by these authors themselves, further research is necessary for elucidating the exact effects of ancestry and sex on the multivariate patterns among entheses (Karakostis et al. 2020, 2021; Karakostis and Harvati 2021). Such studies are needed to address the applicability of the method V.E.R.A. for interpopulation comparisons, evaluating the method’s capacity to identify potential differences in habitual activity among individuals of diverse population origins.

In this framework, the present pilot study applied this novel methodology to the well-preserved hand bones of a small sample originating from the Pretoria Bone Collection (South Africa), which is reliably documented for the individuals’ sex, age at death, and ancestry (European or African).

Material and methods

Material

A total of 39 individuals from the Pretoria Bone Collection (PBC), South Africa, were analyzed for this research (L’Abbé et al. 2021). The PBC comprises more than 800 individual skeletons (body donors), who lived and died in the broader area of South Africa from 1942 onwards. Although the sample is arguably small, it represents individuals of known characteristics with excellently preserved left- and right-hand bones (a condition that is extremely rare among anthropological collections). The documentation of the PBC includes sex, body size, age at death, and ancestry. In this study, individuals of varying origins are referred to as either “South Africans with African ancestry” or “South Africans with European ancestry”. Reflecting previous applications of V.E.R.A. on hand entheses, the six following hand bones were analyzed for both anatomical sides (left and right): the first and fifth metacarpals (MTC1 and MTC5), the first, second, and fifth proximal phalanges (PP1; PP2 and PP5), and the first distal phalanx (DP1). Considering the known impact of degenerative changes on entheseal morphology (Bloebaum and Kopp 2004; Milella et al. 2012; Nolte and Wilczak 2013; Villotte and Knusel 2013), individuals between 20 and 49 years at death were initially chosen from both sexes and ancestries. However, given the lack of individuals below 50 years of age in the PBC, individuals between 50 and 59 years of age at death were also included. Following the VERA protocols (Karakostis and Lorenzo 2016; see review by Karakostis and Harvati 2021), individuals with entheses showing potentially pathological lesions with more than 1 mm in diameter (either osteophytic or osteolytic; see Mariotti et al. 2004, 2007) were excluded from the sample. This decision was based on previous histological research showing that small enthesophytes (spurs) do not seem to compromise the microscopic bone-tendon interface in humans (Benjamin et al. 2001). These were extensive lesions of several mm in diameter, which appear to have resulted from post-traumatic bone remodeling. Depending on the analysis (see below) individuals were grouped by sex, ancestry, and the interaction between the two grouping factors (i.e., African ancestry females, African ancestry males, European ancestry females, and European ancestry males). Table 1 outlines the distribution of the sample, dividing individuals into arbitrary age groups for descriptive purposes. It should be clarified here that the statistical assessment of the effects of age on our results (see below) did not rely on any categorization into age groups, but instead used the exact documented age of each individual (see subsection “Statistical analysis”).

Table 1 Distribution of the 39 individuals of the sample

Delimitation and measurement of entheses

This paper employs the V.E.R.A. protocols for delineating entheseal areas on the bones, which were introduced in previous research (e.g., Karakostis and Lorenzo 2016; Karakostis et al. 2017, 2019a, b). The steps of this method are described in detail in a recent literature review by Karakostis and Harvati (2021). The first step involves the development of a 3D surface scan of each bone to be analyzed. An Artec 3D Space Spider scanner (Artec Inc., Luxembourg) was used together with the associated Artec Studio Professional 14 software. Then, each of the nine entheses located on the six bones was virtually isolated and their surfaces were measured in square millimeters with the Meshlab software (CNC Inc., Rome). Figure 1 shows the location of each enthesis and Table 2 explains the properties of each related muscle. Following V.E.R.A, to delineate and isolate the entheses, three criteria were used: the difference of elevation between the entheseal region and the rest of the surrounding bone, the difference in color shade, and the variation in surface complexity. This process is facilitated through the precise use of specific virtual tools (e.g., imaging filters). First the (optional) “equalize vertex color” filter permits to emphasize color differentiation on the bone surface. Then the “discrete curvatures” tool is used to color-map the elevation and complexity of the surface, allowing for the refinement of the delineation. Finally, after virtually selecting the entheseal area plus a narrow band of the surrounding flatter bone surface, the “compute curvatures principal directions” tool is used to precisely delineate the exact borders of the entheseal area and remove the surrounding flatter bone surface (see Karakostis and Lorenzo 2016; Karakostis and Harvati 2021). Finally, the remaining entheseal 3D area was measured in mm2, using Meshlab’s measuring tools. For an observer experienced in the use of VERA, the above delineation process can be performed within 5 minutes for each enthesis (Karakostis and Harvati 2021). Table 3 presents the descriptive statistics for each enthesis, based on all 39 individuals. The intra-observer repeatability of this method of 3D delineation (VERA protocol) for this sample was tested for each enthesis using intra-observer tests. The first author (LB) executed twice the delineation of the left entheses of six individuals, which were randomly chosen among the sample. Then, a paired t-test between the first and the second 3D area measurements was run. In addition, the Lin’s concordance coefficient was also calculated to compare the measurements of the first session and the second one (following Lin 1989; Altman 1991; McBride 2005; Quinn et al. 2009).

Fig. 1
figure 1

The areas of the nine analyzed entheses located on the six hand bones, left side. Opponens pollicis (OP) on MTC1 (radial view); Abductor pollicis (AbP)*, Flexor pollicis brevis (FPB)*, Adductor pollicis (AdP) et Extensor pollicis brevis (EPB) on PP1 (from the top: radial, ulnar, and dorsal view); Flexor pollicis longus (FPL) on DP1 (palmar view); First dorsal interroseus (FDI) and First palmar interroseus (FPI) on PP2 (from the top: radial and ulnar view); Extensor carpi ulnaris (ECU) on MTC5 (ulnar view); and Abductor digiti minimi (AbDM)* and Flexor digiti minimi (FDM)* on PP5 (ulnar view). The exact scale is not respected. *Those entheses were combined as the muscles insert in the same region of the bone, and thus individual delimitation per muscle attachment was not possible (for more details, see Karakostis and Lorenzo 2016; Karakostis et al. 2017)

Table 2 Properties of the analyzed entheses’ muscles (Moore et al. 2014; Platzer 2014; Karakostis et al. 2017, 2019a)
Table 3 Descriptive analysis of raw entheseal measurements (in mm2) based on all 39 individuals for each enthesis, right and left hand sides separated

Statistical analysis

Initially, to evaluate the potential effects of overall size in our analyses, our entheseal 3D area measurements (in mm2) were adjusted based on the geometric mean approach (e.g., Richmond et al. 2016) and following V.E.R.A.’s standard protocols (e.g., Karakostis et al. 2017, 2018, 2021). Subsequently, all statistical tests (see below) were applied both on raw as well as on size-adjusted entheseal 3D area measurements.

Assessing the effects of age and estimated body mass

The influence of estimated body mass on the entheseal patterns of individuals was analyzed with Spearman’s correlation tests (rs) involving the scores of the primary component of the principal component analysis (PCA; see subsection below). The process was repeated both for raw (unadjusted) and size-adjusted values (see below). We focused on the first principal component (PC1) because this axis showed different tendencies by ancestry (see plots in the “Results” section). In these comparisons, the femoral head diameter (measured in mm, using a Mitutoyo digital sliding caliper) was used as a proxy of body mass (see Ruff et al. 1991). Nevertheless, we have confirmed that either using femoral head diameters or predicted body mass values (calculated based entirely on the femoral head diameter as a single predictor variable) provided perfectly identical results in our correlation tests. Moreover, based on additional Spearman’s correlation tests, we estimated the influence of biological age on the observed entheseal patterns, by comparing between documented age and the PC1 of each PCA (before and after size-adjustment using the geometric mean approach).

Multivariate analysis of variance (MANOVA)

For investigating whether the observed entheseal differences between sexes and ancestries in our sample are statistically significant, a series of MANOVAs were applied on the raw 3D area measurements as well as on the data adjusted for overall size using the geometric mean approach. Initially, a comparison between males and females revealed that the size-adjusted dataset does not present significant sexual dimorphism in our sample’s left- and right-hand bones (see below, in the “Results” section). Based on this finding, and considering the admittedly limited size of our sample, the analyses focusing on ancestry relied on individuals from both sexes. All assumptions required for MANOVA were met, including the absence of outliers, approximate normality, and homogeneity of variance–covariance matrices (Field 2017). Outliers were identified and eliminated using the standard “z-score” approach (Field 2017). The approximate normality of the entheseal measurements was confirmed using histograms and normal probability plots (Field 2017). Importantly, for each performed MANOVA, a Box’s M test confirmed the equality of variance–covariance matrices. To account for the possibility that multiple tests have increased the probability of a type I error, a Holm-Bonferroni sequential correction procedure was applied. Regarding sample size, the minimum recommended requirement was met (i.e., the number of individuals was greater than the number of independent factors; see Field 2017). In addition, we ensured that the a priori statistical power of all performed MANOVAs was adequate using the open-access software G*Power version 3.9.1.4 (Faul et al. 2007), which showed values well above 0.90 for all analyses (see the “Results” section) (Suresh and Chandrashekara 2012). The same software package was used to calculate effect sizes for all significant differences observed (Faul et al. 2007).

Principal component analyses (PCAs)

Considering the significant differences observed in the size-adjusted measurements of the right anatomical side (see the “Results” section), we used PCAs to investigate the multivariate patterns among entheses that lead to this ancestry-related variation in our sample. This analysis was performed in the software PAST (Hammer et al. 2001), relying on a correlation matrix.

A total of four PCAs were conducted (two for each anatomical side), involving raw and size-adjusted measurements (based on the geometric mean procedure; see above). The left and the right side were analyzed separately, to study the possible influence of activity impacting one side more than the other one. In all PCAs, the number of the PCs that were taken into account for each PCA was determined using standard scree plots in the software PAST (Field 2017). The entire sample was included in the analysis and individuals were color-labeled by ancestry. Therefore, all four PCAs conducted in this study did not assume groups a priori.

In the PCA on the size-adjusted dataset, all individuals showing extreme PC scores (i.e., above 3 or below − 3) were excluded from the analyses (e.g., see Field 2017). After excluding them and re-running the PCAs, one more case plotted slightly above 3 on PC2 (see Fig. 2). To fully ensure that this borderline extreme value did not considerably affect the distribution of individuals in the PCA plot and the associated PC loadings, we re-run the PCAs again without them. As the PCA patterns did not alter in any way, we maintained this very slightly extreme value in our plots (Fig. 2), to avoid further decreasing the small sample size of our study. Similarly, concerning the PCA on raw entheseal 3D values (i.e., before size adjustment), the analysis was re-run after removing 10 outliers. Nevertheless, some extreme values on PC2 re-emerged in the second analysis (see Fig. S1). As above, we confirmed that these extreme values did not considerably affect the observed PCA patterns (and statistical results) by re-running the PCAs without these extreme cases and repeating all correlation and comparison tests.

Results

Repeatability and the effects of age and estimated body mass

Table 4 presents the results of repeatability tests between the first and second series of measurements. All paired tests showed no significant differences between repetitions (p-values above 0.05), while the calculated Lin’s concordance coefficient was 0.88, indicating adequate repeatability (see Lin 1989; Altman 1991).

Table 4 Results of the intra-observer error tests, showing no significant differences between repetitions (p > 0.05)

Table 5 presents the results of the Spearman’s correlation test between PC1 scores (from the PCA involving all individuals except outliers) and femoral head diameter (as a proxy of body mass). Before adjusting for size, the correlation between PC1 values and the femoral head diameter appears to be significant and relatively high. However, after size-adjusting the data based on the geometric mean approach, the femoral head diameter is no longer correlated with PC1 scores (p-values of 0.966 and 0.416). Table 6 presents the results of the same test, but this time comparing between PC1 and age-at-death. Once again, the PC1 relying on raw entheseal measurements shows a correlation with biological age in the left anatomical side. Nevertheless, size-adjusting the measurements based on the geometric mean appears to diminish the association between age-at-death and PC1 scores (p-values of 0.471 and 0.993).

Table 5 Spearman correlation tests (rs) between PC1 values and femoral head diameters used as a proxy of body mass
Table 6 Spearman correlation test (rs) between PC1 values and age at death

MANOVA tests

Table 7 presents the results of the MANOVA tests comparing between sexes, for each anatomical side separately. Before size adjustment, the raw 3D entheseal values show a significant difference between sexes in the left anatomical side. The effect size for this difference (Cohen’s f-squared) was calculated to be 1.52, indicating substantial mean variation between males and females (Faul et al. 2007). Nevertheless, once entheseal measurements were adjusted using the geometric mean, there were no statistically significant differences between sexes for either anatomical side.

Table 7 MANOVA test results comparing females and males of the sample

Given this result, and to maximize our sample sizes for subsequent ancestry comparisons, we then pooled males and females together within each ancestry group. Table 8 displays the results of the tests comparing African and European ancestry individuals. In one of the comparisons, which focuses on the right-hand entheses, the difference is statistically significant (p-value: 0.01; see Field 2017). The effect size (Cohen’s f-squared) for that test was 1.05, indicating considerable deviation between the group means (Faul et al. 2007). Furthermore, the a priori statistical power of all comparisons is well above 0.90 (Suresh and Chandrashekara 2012). It must be highlighted that this variation by ancestry in our sample only emerged when using the size-adjusted 3D surface areas. In contrast, these ancestry differences in our South African sample were not apparent when using the raw 3D surface measurements (i.e., prior to adjusting using the geometric mean).

Table 8 MANOVA test results comparing between African ancestry and European ancestry individuals of the sample

Principal component analysis (PCA)

The PCAs conducted on the raw variables, whose components showed significant associations with biological age and femoral head diameter (Tables 5 and 6), are shown in the supplements (Fig. S1). Regarding the PCAs on size-adjusted measurements, Fig. 2 illustrates the PC scores of individuals when all nine entheseal measurements are included as variables. Following the scree-plot approach, the first three PCs were plotted (see the “Materials and Methods” section). The plot shows the distribution of individual scores from both ancestries of our sample (based on right side, size-adjusted data). As mentioned before, no groups were assumed a priori for these analyses, but the individuals of each group were color-labeled differently for visualization purposes.

Fig. 2
figure 2

Plot of the first three PCs from the PCA based on 9 variables, right hand side (size adjusted entheseal measurements), and all individuals except for two extreme outliers (see the “Materials and Methods” section). African ancestry individuals labeled in dark blue and European ancestry individuals labeled in light blue. No groups were assumed a priori for this analysis

As shown in these plots (Fig. 2 left and right), most individuals of African ancestry tend to be mostly located on the negative side of the PC1. On the contrary, the PC1 scores of European ancestry individuals are mostly positive. Table 9, which presents the factor loadings for each principal component of the PCAs, shows that the positive values of PC1 represent individuals with proportionally larger AbP/FPB, AdP, and FPI (entheseal variables with a factor loading above 0.3), while negative values reflect proportionally larger EPB and FPL entheses (variables with loadings under − 0.3).

Table 9 Eigenvalues and factors loadings of the three PCAs

Discussion

In line with previous works employing the V.E.R.A. approach (e.g., Karakostis and Lorenzo 2016; Karakostis and Harvati 2021), this study’s analyses suggest that size-adjustment combined with multivariate analysis of entheseal 3D areas can be used to adequately control for the effects of estimated body mass (Table 5), age-at-death (Table 6), and sexual dimorphism (Table 7) on muscle attachment surface areas. Conversely, before adjusting for size, these factors showed a significant effect on entheseal variation, confirming that the overall raw size of entheses (i.e., before adjustment) is naturally associated with the individual’s biological characteristics (e.g., Karakostis et al. 2017). On this basis, we encourage future 3D entheseal studies focusing on physical activity to consider the benefits of size adjustment and multivariate analysis.

Importantly, in our size-adjusted dataset, differences between ancestries (African and European) were statistically significant in our South African sample, explicitly for the entheseal multivariate patterns of the right-hand side. This difference is distinctive both in the statistical output of the MANOVA tests (Table 8) as well as in the PCA plots assuming no prior group differences (Fig. 2). Overall, these findings suggest that ancestry has a significant effect on the multivariate correlations among entheses in our small South African sample. Moreover, an observation of the PCA loadings (Table 9) suggests that this variation between individuals of different ancestries might be likely more driven by ancestry-based differences in habitual manual activities rather than genetic variation. In particular, Table 9 shows that the PC1 entheseal pattern of African ancestry individuals is mainly characterized by proportionally larger entheses of muscles EPB and FPL. These muscles are known to be closely synergistic (e.g., Marzke et al. 1998; Karakostis and Lorenzo 2016), and their combined recruitment allows the thumb’s proximal phalanx to extend back, while its distal phalanx flexes. Previous work has highlighted the importance of this interaction between muscles EPB and FPL for any power-grasping human gesture (Marzke et al. 1998; Karakostis et al. 2017). Similarly, the PC1 pattern of European ancestry individuals mainly involves entheses of a muscle group that tends to be typically vital for precision grasping (Moore et al. 2014; Kivell 2015; Karakostis et al. 2017). Particularly, the highest factor loadings were presented by the entheses of muscles AbP/FPB and AdP (Table 9), which are responsible for thumb abduction, adduction, and flexion. At the same time, entheses FPI and AbDM/FDM (which show lower factor loadings) correspond to muscles adducting the index finger and flexing and/or abducting the fifth ray, respectively. It is worth emphasizing that these entheseal differences by ancestry were also very similar within each sex separately and did not significantly correlate with biological age or femoral head diameter (a standard proxy of body mass).

Importantly, the possibility that variation by ancestry in our sample is driven by differences in habitual physical activity is also supported by the fact that only the right-hand side showed a significant difference between groups. This is even though both anatomical sides were perfectly preserved in all individuals of our sample. Essentially, if the observed differences were exclusively related to systemic factors of interindividual variability (such as genetic background), the results would not be expected to differ between the anatomical sides of the same individuals. Furthermore, the right side is typically more solicited during movement in most humans, given that most living humans are reported to be right-handed. Therefore, we could reasonably expect that, on a statistical basis, the right side might possibly be more influenced by the effects of activity compared to the left anatomical side (Steele and Mays 1995). In this regard, it is interesting to consider the historical context of this specific collection. The collection began accumulating in 1942, during the Apartheid period (L’Abbé et al. 2021). Many of the African ancestry individuals included in this collection were likely involved in strenuous manual and physical labor that was not as common among individuals of European ancestry, who also had access to a better health care system (e.g., Christopher 1990; Huschka and Mau 2006; Mariotti 2012; Nattrass 2019). Our results on potential activity-induced skeletal differences between ancestries in our small South African sample may have likely identified a reflection of these social inequalities.

Despite the promising results of this pilot study, further research on increased sample sizes is required to validate our observations. Indeed, 39 individuals are evidently not representative of the entire South African population, nor of the different ancestries included in it (L’Abbé et al. 2021; Alves Cardoso and Henderson 2013; Lopreno et al. 2013). Therefore, we would like to highlight as much as possible that our study’s results and interpretations should never be generalized to all individuals of African and European ancestry, as they strictly refer to the sample of this study and its directly associated geo-chronological and historical context.

In the future, to further investigate the exact influence of other major factors of ancestry-based variability (e.g., genes), an adequate number of well-preserved and documented individuals from various ancestries and environmental contexts should be studied. It would also be important to include individuals with documented long-term occupational activities (e.g., see Karakostis et al. 2017). Overall, the findings of this study suggest that such future research endeavors would greatly benefit from employing the recently developed V.E.R.A method (Karakostis and Lorenzo 2016; see review by Karakostis and Harvati 2021).

Conclusion

The present pilot study applied a recently developed and experimentally validated method of entheseal analysis (V.E.R.A. protocols) on a small but well-documented historical collection from South Africa, including individuals of African and European ancestry. After controlling for the effects of overall entheseal size, which was correlating with systemic factors (estimated body size, age-at-death, and biological sex), we found significant differences between ancestries in the right anatomical side. The observed entheseal differences appear to directly reflect the habitual performance of more strenuous manual activities in individuals of our sample with African ancestry. Future research on increased sample sizes is required to confirm these results and further elucidate the factors driving entheseal variation by ancestry.