Introduction

Sexing paleofaunal remains can provide important data on a broad spectrum of issues. These include the animal’s life history, extent of dimorphism, socio-ecological structure and behavior, as well as predator-prey relations and herd management strategies1,2,3,4,5,6,7. However, sex determination of fossil fauna is severely hampered by the often fragmentary nature of specimens, as it relies on the presence of sex-specific morphological features (e.g. on the pelvis, bacula, horns etc.) that are often missing or broken8,9. Additionally, to determine an animal’s sex, researchers have commonly relied on bone and tooth measurements (osteometry), and more recently, on geometric morphometrics10,11,12,13,14,15. However, these methods also rely on the presence of well-preserved remains, while there is often ambivalence in interpreting the mechanism/s responsible for size patterning, since this may be confounded by factors such as climate change, nutrition and domestication1,16. While, the advent of aDNA analyses has offered revolutionary possibilities for sexing ancient fauna remains7,11,13,17, in many situations its application is limited by DNA degradation and contamination18. Clearly, additional tools that will enable sexing of fossil (and fragmentary modern) animal remains will be invaluable to researchers in zoology, archaeology, archaeozoology and paleontology.

To explore new methods of sexing mammals, researchers have turned to the analysis of amelogenin, an essential protein for tooth enamel development in mammals19,20,21,22. The enamel proteome is composed of around 90% of amelogenin (Amels) dimorphic proteins, comprising the X-linked AmelX that is present in both males and females and the Y-linked AmelY that is only present in males23,24,25. Compared to AmelX, AmelY carries multiple single nucleotide polymorphisms (SNPs) or DNA mutations (e.g. Intron 3, 6 and Exon 5 regions), which translate into single amino acid variations (SAAVs). Thus, the identification of the unique AmelY peptides distinguishes males from females based on relative peptide abundance, and enables the unambiguous identification of males. Numerous studies have used this concept to determine the sex of recent animals by sampling their soft tissues and blood26,27,28,29,30,31,32,33,34,35. This approach also served as the foundation of groundbreaking sex determination in humans first undertaken by Stewart et al.36,37 and Parker et al.38 through the analysis of native peptides in the tooth enamel proteome using mass spectrometry techniques. Native peptides in enamel are the result of in-vivo enamel protease digestion, such as matrix-metalloproteinase 20 (Mmp20) and kallikrein-related peptidase 4 (Klk4) enzymes22,39,40. The extraction of the native amelogenin peptides facilitates rapid analysis, without the need for enzymatic digestion that accompanies bottom-up proteomic pipelines that may contribute to sample loss in low abundance samples41.

This approach has opened the door to sex identification of ancient remains by focusing on tooth enamel, the most resistant material in the skeleton, thereby overcoming the limitations imposed by poor bone preservation commonly encountered in aDNA and other standard sex determination methods outlined above. Not surprisingly, proteomic analyses of enamel has been extensively applied to sex ancient human populations including infants and ancient primates42,43,44,44,45,46,47,48,49,50,51,52,53,54.

Far fewer studies have been undertaken on tooth enamel of fossil fauna. An exception is the study by Cappellini et al.55 who successfully identified both the sex and species of fossil rhinoceros remains from the site of Dmanisi, dating to 1.77–1.9 Mya. As a control, they used a male Medieval caprine’s raw spectral data and the AmelY sequence from modern sheep as a matching database. The authors exploited single amino acid variances (SAAVs) in the AmelY sequence at position 171-where valine (V) replaces methionine (M), found in the corresponding position of the AmelX sequence. This key distinction enabled the sex identification of ancient faunal remains from this site.

In the current study, we demonstrate that Label Free Quantification (LFQ) of native peptides extracted from teeth of modern bovine samples permits confident identification of sex. This method was then successfully applied to twelve archaeological samples from a Neolithic site dated to the 8th–7th millennia BC, using the same selected peptides. To our knowledge, this is the first application of a method to determine an animal’s sex using bovine tooth enamel.

Results

Sex determination in modern and ancient bovine enamel

Sex determination can be assessed through the identification and quantification of the unique peptides of AmelY (for  our workflow, refer to Fig. 1). We selected peptides carrying SAAVs: Leu46, Tyr48, namely ‘LRYPYP’ (AmelY;[M+2]\(^{2+}\) 404.7212 m/z) and ‘LRYPYPSY’ (AmelY;[M+2]\(^{2+}\) 529.7689 m/z). The sequence alignment of these unique peptides is illustrated in Fig. 2. In addition, to ensure the correct sex determination and quality of our acid enamel extraction, AmelX dimorphic unique peptides were also quantified. We selected peptides carrying SAAVs, Ser 44, Ile46 and His48. AmelX selected peptides were: ‘SM(ox)IRHPYP’(AmelX;[M+2]\(^{2+}\) 508.7527 m/z) and ‘IRHPYPSY’ (AmelX;[M+2]\(^{2+}\) 516.7667 m/z). A BLAST search for these sequences revealed that they exclusively exhibit homology with amelogenin proteins (reviewed sequence by Uniprot). The identifier (ID) for each of these peptides was validated manually (example in Fig. 3b) and the isotopic envelope for the associated peak was corrected (example in Fig. 3a). Sex determination was performed by label-free quantification (LFQ) of native dimorphic peptides unique to AmelY. We found modern male samples had an intensity of AmelY unique peptides between XIC 6.47 \(10^7\) and 1.34 \(10^8\) for the short peptide version and between 2.76 \(10^7\) and 8.97 \(10^8\) for the long peptide version. For an example of the disparity between the sexes, in Fig. 4 we plotted sample WIS101 (a modern male) and sample WIS202 (a modern female). Figure 5 shows the intensities of modern male (blue dots) and female (red dots) samples. In addition, we calculated the average AmelX deamidation that was between 21% and 16.42% for Asp and Gln respectively.

Figure 1
figure 1

Proteomic workflow for sex determination in modern and archaeological tooth samples. Teeth were cleaned and etched in acid. Acid-dissolved peptides were desalted and run on LC-MS instruments using the DDA method. The data search was undertaken using a Byonic search engine and quantified with Skyline. For greater details see Methods.

Figure 2
figure 2

To illustrate the unique sequence of the selected peptides, we aligned AmelY and AmelX sequences of cattle with our selected AmelY and AmelX peptide sequences (red, green lines respectively) as follows: ‘LRYPYP’ (AmelY;[M+2]\(^{2+}\) 404.7212 m/z), ‘LRYPYPSY’ (AmelY;[M+2]\(^{2+}\) 529.7689 m/z); AmelX peptide sequences (green lines): ‘SM(ox)IRHPYP’ (AmelX;[M+2]\(^{2+}\) 508.7527 m/z) and ‘IRHPYPSY’ (AmelX;[M+2]\(^{2+}\) 516.7667 m/z). SAAVs between the dimorphic proteins are show in light blue.

Figure 3
figure 3

XIC of the isotopic envelope precursor obtained for modern male WIS101 for two unique AmelY and two unique AmelX peptides. (a.1) ‘LRYPYP’ AmelY;[M+2]\(^{2+}\) 404.7212 m/z (a.2) ‘LRYPYPSY’ AmelY;[M+2]\(^{2+}\) 529.7689 m/z; AmelX unique peptides: (a.3) ‘SM(ox)IRHPYP’ (AmelX;[M+2]\(^{2+}\) 508.7527 m/z and (a.4) ‘IRHPYPSY’ AmelX;[M+2]\(^{2+}\) 516.7667 m/z. Retention time was as expected based on a previous Byonic search ID. Figures from (b.1) to (b.4) illustrate MS2 spectra of the peptide ID, below is the isotopic envelope of the precursor.

Figure 4
figure 4

XIC of the isotopic envelope precursor obtained using Skyline software for (a) a modern male WIS101 (b) a modern female WIS202, for two unique AmelY and two unique AmelX peptides. First, ‘LRYPYP’ AmelY;[M+2]\(^{+2}\) 404.7212 m/z [(a.1) male and (b.1) female)], and ‘LRYPYPSY’ AmelY;[M+2]\(^{+2}\) 529.7689 m/z [(a.2) male and (b.2) female)]. Second, AmelX unique peptides: ‘SM(ox)IRHPYP’ (AmelX;[M+2]\(^{+2}\) 508.7527 m/z [(a.3) male and (b.3) female)] and ‘IRHPYPSY’ AmelX;[M+2]\(^{+2}\) 516.7667 m/z [(a.4) male and (b.4) female)]. The retention time was as expected based on a previous Byonic search ID. AmelY peptides were exclusively found in male samples, while female samples lacked the unique AmelY peptides. However, both unique peptides of AmelX were found in both sexes.

Figure 5
figure 5

Ion intensities of unique AmelY peptides vs AmelX unique peptide intensities, (a.1) ‘LRYPYP’ (AmelY;[M+2]2+ 404.7212 m/z) vs ‘SMoxIRHPYP’ (AmelX;[M+2]\(^{2+}\) 508.7527 m/z); (a.2) ‘LRYPYPSY’ (AmelY;[M+2]\(^{2+}\) m/z 529.7689) vs ‘SMoxIRHPYP’ (AmelX;[M+2]\(^{2+}\) 508.7527 m/z); (a.3) SM(ox)IRHPYP (AmelY;[M+2]\(^{2+}\) 404.7212 m/z) vs ’IRHPYPSY’ (AmelX;[M+2]\(^{2+}\) 516.7667 m/z); (a.4) ‘LRYPYPSY’ (AmelY;[M+2]\(^{2+}\) m/z 529.7689) vs ‘IRHPYPSY’ (AmelX;[M+2]\(^{2+}\) 516.7667 m/z). Modern male: dark blue dots, modern female: bordeaux dots; archaeological samples (Beisamoun) males (identified by this method): violet dots, and archaeological samples (Beisamoun) females: orange dots. Results are for eight modern samples and twelve archaeological samples.

As proof of concept, we applied the above method to the analysis of archaeological enamel obtained from twelve bovine teeth from the Neolithic (8th–7-th millennia BC) site of Beisamoun (for details on the sites see Materials section below). We were able to determine sex in all archaeological samples. AmelY unique peptides identified six males out of twelve archaeological samples from Beisamoun at XIC \(2.76 \times 10^7\) to \(5.33 \times 10^9\) (see Fig. 5violet dots). In addition, AmelX unique peptides were found across all archaeological samples at XIC between \(1.10 \times 10^8\) and \(6.87 \times 10^9\) (see Fig. 5, red dots). Sex determination was possible due to the presence of AmelX unique peptides, and the presence or lack of AmelY. See also SI 1 and SI 2.

Global deamidation was used as a tool to validate antiquity and study diagenesis in the archaeological samples56,57,58. We calculated Asp and Gln bulk deamidation of AmelX and compared the modern and archaeological samples. Archaeological samples showed 3 and 5 fold higher occupancy of deamidation (Asp and Gln, respectively) compared to modern samples, validating the ancient origin of our fossil samples.

Discussion

We have presented here, a method for sex determination in bovine enamel based on LFQ of peptides unique to AmelY. We selected two peptides unique to AmelY (’LRYPYP’ and the ’LRYPYPSY’) that are found exclusively in male bovines. We also found two peptides unique to AmelX (‘SMoxIRHPYP’ and ‘IRHPYPSY’). All four peptides re found in both modern and archaeological samples in abundance, allowing reliable quantification and thereby robust sex determination. The use of peptidomics is advantageous, as it simplifies sample preparation and reduces the introduction of sample preparation artefacts, while still providing the desired methodological fidelity.

Conclusions

Our method provides a simple and reliable method for sex determination in bovines using dental enamel of modern and archaeological samples, through the application of LFQ peptidomics. It will be of particular use for sexing poorly preserved osteological samples, those lacking diagnostic morphology, or assemblages where DNA methods cannot be applied due to poor preservation.

Ethics statement

The modern cattle samples used in this study derive from animals that were sold for slaughter to an authorized slaughterhouse (Beit Mitbahaim Tira Ltd.) and were not intentionally killed for this study. Lower jaws used in this study are routinely discarded by the slaughterhouse as they have no commercial value. The Neolithic site of Beisamoun was excavated under permit from the Israel Antiquities Authority (granted to co-authors FB & HK). For details on the site, its location and excavations, see section on Archaeological samples below. Permission to sample bovine teeth was obtained from the excavators as well as from the archaeozoologist (LKH) working on this assemblage (all of whom are co-authors on this paper). The collection is currently held in the laboratory of the latter researcher, in the National Natural History Collections of the Hebrew University of Jerusalem.

Methods

Modern samples

A total of eight modern cattle lower jaws were donated by an Israeli meat manufacturer, Beit Mitbahaim Tira Ltd. (see Table 1). For photographs of samples see SI 3.

Table 1 The table summarizes modern sample information including: Sample number, Proteomic Batch, Animal ID, Animal Species, Sex, Tooth type (M=molar; PM=premolar; I=incisor).

Archaeological samples

The 12 archaeological teeth samples analyzed derive from the Neolithic site of Beisamoun (see Table 2, map ref NIG 25403-8/77682): The Neolithic site of Beisamoun is located in the Upper Jordan Valley, within the flood-plain of the now extinct Hula Lake, a region that is characterised by a Mediterranean climate59. The locality was first excavated in the 1960s–1971 by Monique Lechevallier who reported finding remnants of a mid-Pre-Pottery Neolithic B village (PPNB, first half of the 8th millennium BC; Lechevallier 1978)60. Renewed excavations in 2007 through 2016, in an adjacent area were undertaken by a Franco-Israeli mission. They exposed in situ layers dating to the late PPNB (second half of the 8th millennium BC) and Pre-Pottery Neolithic C (PPNC, first half of the 7th millennium BC). The finds included architectural remains and built installations, with clearly identifiable successive phases of building and their associated lithic artefacts (e.g. arrowheads, sickle blades, axes, adzes, burins, scrapers, retouched blades etc.), groundstone artefacts (grinding slabs, mortars, hand stones, vessels etc.) predominantly made on basalt and limestone, bone tools, shell, stone and bone ornaments, as well as human burials and faunal remains61,62. The PPNC sediment is composed of a sandy-clay sediment with fine-grained inclusions of charcoal, clay, and ochre61. For photographs of samples, see SI 3.

Table 2 The table summarizes archaeological sample information including: Sample Number, Sample Locus and Square in the archaeological site of Beisamoun, Animal Species, Sex by Amelogenin (this research study), Tooth type  (M=molar; PM=premolar; I=incisor; U=upper; L=lower), Period  (PPNC=Pre-Pottery Neolithic C; PPNB=Pre-Pottery Neolithic B).

Samples preparation and cleaning

Teeth (modern and archaeological) were first cleaned mechanically using soft brushes to exclude any external material (e.g. sediments). Using bistoury and tweezers under a microscope binocular (Nikon SMZ800N, see picture SI 6),  a clean piece of enamel (10 mm \(\times\) 15 mm) was extracted. A final examination was carried out using a Nikon SMZ800N binocular microscope. Finally, etching was performed under a chemical hood (see subsection, Sample Preparation for MS).

Sample preparation for MS

The extracted and clean piece of bovine enamel was immersed for 30 s in 3% \(\text {H}_{2}\text {O}_{2}\), rinsed with DDW and the solution discarded. Then, the enamel piece wasmanually ground and etched for 2 min in freshly prepared 5% HCl. This solution was discarded. A second etching step in 1000 \(\upmu\)l 5% HCl for 60 min was performed at room temperature, and the solution was retained in a new Eppendorf on ice. The piece of enamel was then allowed to completely dissolve in 1000 \(\upmu\)l 5% HCl (aprox. 90 min). Then, both solutions were combined to a final volume of 2000 \(\upmu\)l (about 100 mg dry weight). The samples were then frozen and stored in \(-80\;{^circ}\)C for desalting.

Mass spectrometry

Dissolved samples were desalted using Oasis HLB 96 well plate (Waters), using the manufacturer’s instructions; samples were loaded into the wells using vacuum pull, followed by three washes of 300ul 0.1% TFA. The peptides were then eluted by passing 50μl 50% Acetonitrile, 0.1% FA. The resulting peptides were loaded into a reversed-phase Symmetry C18 trapping column (20*0.18 mm, 5  \(\upmu\)m particle, Waters) and resolved on an analytical HSS T3 column (250*0.07 5mm, 1.8  \(\upmu\)m particles, Waters), mounted on a nanoAquity (Waters) nanoLC instrument. Peptides were eluted from the column using a 50-minute gradient from 4 to 30%B (99.9% acetonitrile, 0.1% formic acid) at 350 nl/min flow. The peptides were eluted into a Q-Exactive HF mass spectrometer (Thermo Fisher) using a FlexIon nano-ESI source, through a 20  \(\upmu\)m ID emitter (Fossil IonTech, Madrid), at 2.8 kV. Blank injections interspaced samples to allow washing off peptide carryover. Data were acquired in data-dependent acquisition (DDA) mode using Top10 method. MS1 resolution was set to 120,000@ 400 m/z, with a mass range of 350–1650 m/z, using a maximum injection time of 60 ms. Precursors selected for MS2 were limited to charge states of 2–8, intensity threshold of \(3.3^4\) and dynamic exclusion of 20s. Precursors were isolated using a window of 1.7 m/z, AGC target was set to \(10^5\) or a maximum of 60ms injection time and fragmented using HCD at 27 normalized collision energy (NCE). Scans were performed using a first mass setting of 100 m/z at 15,000 resolution (@200 m/z).

Data analysis

Data was searched using a Byonic search engine (Protein Metrics Inc) with a database that was tailored for modern and ancient samples against the relevant sequences (P02817 AMELX BOVIN Amelogenin, X isoform, Bos taurus and Q99004 AMELY BOVIN Amelogenin, Y isoform Bos taurus, Uniprot). Fixed and variable modifications were adjusted as follows: common2: Oxidation (F, H, K, M, P, W, Y), Deamidated (N, Q, R); rare1: Acetyl (NTerm), Gln to pyro-Glu (NTerm Q), Glu to pyro-Glu (NTerm E) and Phosphorilation (S); rare2: Di-oxidation (F, M, P, W, Y). maximum of 2 common and 2 rare modifications were allowed. MS1 tolerance was set to 10ppm while MS2 was set to 20ppm. Data was filtered using Byonic’s target-decoy method set to 1 % percent FDR. We used a post-search cut-off of LogProb \(>3\) (FDR \(< 0.001\)) and score \(>200\). Identified unique AmelY peptides were examined manually to avoid false positive identifications. Quantification was performed using Skyline software (version 23.1.0.455). Byonic mzID files were imported into Skyline along with the RAW files and used to identify the correct peak for quantification positively. Quantification was performed by XIC extraction of the MS1 signal. Quantification of AmelY base on peptide1: ‘LRYPYP’ (AmelY;[M+2]\(^{2+}\) 404.7212 m/z); peptode2:‘LRYPYPSY’ (AmelY;[M+2]\(^{2+}\) 529.7689 m/z). AmelX: peptide3:‘SMoxIRHPYP’ (AmelX;[M+2]\(^{2+}\) 508.7527 m/z); peptide4:‘IRHPYPSY’ (AmelX;[M+2]\(^{2+}\) 516.7667 m/z). Ion fragment validation was performed manually. We performed sequence similarity searches using protein-protein BLAST program (www.uniprot.org/blast). Bulk modification calculations were performed by in-house RStudio scrips.