Meta-proteomic analysis of two mammoth’s trunks by EVA technology and high-resolution mass spectrometry for an indirect picture of their habitat and the characterization of the collagen type I, alpha-1 and alpha-2 sequence

Cucina, Annamaria; Di Francesco, Antonella; Saletti, Rosaria; Pittalà, Maria Gaetana Giovanna; Zilberstein, Gleb; Zilberstein, Svetlana; Tikhonov, Alexei; Bublichenko, Andrey G.; Righetti, Pier Giorgio; Foti, Salvatore; Cunsolo, Vincenzo

doi:10.1007/s00726-022-03160-6

Meta-proteomic analysis of two mammoth’s trunks by EVA technology and high-resolution mass spectrometry for an indirect picture of their habitat and the characterization of the collagen type I, alpha-1 and alpha-2 sequence

Original Article
Open access
Published: 17 April 2022

Volume 54, pages 935–954, (2022)
Cite this article

Download PDF

You have full access to this open access article

Amino Acids Aims and scope Submit manuscript

Meta-proteomic analysis of two mammoth’s trunks by EVA technology and high-resolution mass spectrometry for an indirect picture of their habitat and the characterization of the collagen type I, alpha-1 and alpha-2 sequence

Download PDF

Annamaria Cucina¹,
Antonella Di Francesco¹,
Rosaria Saletti¹,
Maria Gaetana Giovanna Pittalà¹,
Gleb Zilberstein²,
Svetlana Zilberstein²,
Alexei Tikhonov³,
Andrey G. Bublichenko³,
Pier Giorgio Righetti⁴,
Salvatore Foti¹ &
…
Vincenzo Cunsolo ORCID: orcid.org/0000-0002-2349-5494¹

3112 Accesses
7 Citations
20 Altmetric
2 Mentions
Explore all metrics

Abstract

The recent paleoproteomic studies, including paleo-metaproteomic analyses, improved our understanding of the dietary of ancient populations, the characterization of past human diseases, the reconstruction of the habitat of ancient species, but also provided new insights into the phylogenetic relationships between extant and extinct species. In this respect, the present work reports the results of the metaproteomic analysis performed on the middle part of a trunk, and on the portion of a trunk tip tissue of two different woolly mammoths some 30,000 years old. In particular, proteins were extracted by applying EVA (Ethylene–Vinyl Acetate studded with hydrophilic and hydrophobic resins) films to the surface of these tissues belonging to two Mammuthus primigenus specimens, discovered in two regions located in the Russian Far East, and then investigated via a shotgun MS-based approach. This approach allowed to obtain two interesting results: (i) an indirect description of the habitat of these two mammoths, and (ii) an improved characterization of the collagen type I, alpha-1 and alpha-2 chains (col1a1 and col1a2). Sequence characterization of the col1a1 and col1a2 highlighted some differences between M. primigenius and other Proboscidea together with the identification of three (two for col1a1, and one for col1a2) potentially diagnostic amino acidic mutations that could be used to reliably distinguish the Mammuthus primigenius with respect to the other two genera of elephantids (i.e., Elephas and Loxodonta), and the extinct American mastodon (i.e., Mammut americanum). The results were validated through the level of deamidation and other diagenetic chemical modifications of the sample peptides, which were used to discriminate the “original” endogenous peptides from contaminant ones. The data have been deposited to the ProteomeXchange with identifier < PXD029558 > .

Meta-proteomic analysis of the Shandrin mammoth by EVA technology and high-resolution mass spectrometry: what is its gut microbiota telling us?

Article Open access 28 August 2021

Zooarchaeology by Mass Spectrometry (ZooMS) Collagen Fingerprinting for the Species Identification of Archaeological Bone Fragments

Comparing extraction method efficiency for high-throughput palaeoproteomic bone species identification

Article Open access 26 October 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Paleoproteomics represents one of the recent fields which allows the characterization of past human diseases (Hendy et al. 2016; D’Amato et al. 2018), the reconstruction of the human diet (Warinner et al. 2014; Shevchenko et al. 2018; Greco et al. 2018; Tanasi et al. 2021), and helps us to improve our understanding of the phylogenetic relationships of extant and extinct species (Cappellini et al. 2014, 2018; Welker et al. 2015). In this respect, collagen has extraordinary longevity compared to other proteins (Buckley and Collins, 2011; Rybczynski et al. 2013) and, therefore, represents an ideal system for sequencing and investigating phylogeny (Buckley et al. 2011, 2019, Cappellini et al. 2011). In fact, collagen sequence information can give a valuable support to assess species relationships, especially when analysing samples from places not advantageous for genome preservation. This was the case, for example, of the assessment of extant and extinct sloths’ relationship (Presslee et al. 2019), the resolution of the evolutionary history of South America ungulates (Welker et al. 2011), and the reconstruction of extinct and extant camels’ correlation (Buckley et al. 2019). However, it is important to consider the limitations in phylogenetic constructions based on partial sequence data (Buckley et al. 2019). Proboscidea order (see Fig. 1) is composed of various extinct families, most of which (i.e., Moeritherium) are grouped as early Proboscideans. Other extinct families are Deinotheriidae, Mammutidae (i.e., Mammut americanum), Gomphotheriidae, and Stegodontidae (Shoshani 1998). The Elephantidae is the only extant family. Among the Elephantidae, Elephas and the extinct Mammuthus are recognized as belonging to the same monophyletic clade at morphological level. Loxodonta instead represents a sister group (Shoshani and Tassy 2005). Molecular analyses, and in particular DNA sequencing, support this reconstruction (Rohland et al. 2007; Miller et al. 2008). On the other hand, proteomic studies may help to better understand these relationships but also allow to obtain additional information. Several works about the sequencing of ancient mammoth and mastodon collagen peptides extracted from bones have already been published (Schweitzer et al. 2002; Asara et al. 2007; Buckley et al. 2011). On the contrary, although collagen preservation in soft tissues has been known for a long time (Goodman et al. 1980), a comprehensive collagen sequence characterization in this kind of material has not yet been reported.

As well known, paleoproteomic studies can be significantly challenging due to contaminations, which may mislead the characterization of protein composition or affect the detection of the ancient proteins mainly consisting of short and altered peptide fragments (Cleland et al. 2020). In fact, even if proteins are more resistant than DNA to time, it is important to highlight that diagenesis deeply affects the original protein sequences. Various precautions should be adopted to minimize the effects of contamination from modern proteins; in addition, authentication and validation criteria are needed to discriminate ancient proteins from contaminants (Hendy et al. 2018). In the last decade, ancient proteins have been studied for their patterns of degradation and diagenetic chemical modifications, and how these patterns can be used as markers for endogenous, ancient proteins as opposed to potential modern contaminants (Hill et al. 2012; Cleland et al. 2015; Cappellini et al. 2012). On this respect, it should be highlighted that recently Differential Ion Mobility Spectrometry (DMS) has proven to be a valuable tool in proteomics (Campbell et al. 2015; Winter et al. 2019), because provides separations that are orthogonal to both the MS and the LC (Campbell et al. 2012; Kafle et al. 2014), and in principle may permit a more complete understanding of the proteoforms diversity of biological systems. Although the implementation of DMS in MS workflows poses a number of challenges that must be considered, DMS represents an emerging technology that is capable of resolving multiply modified peptide variants, and should ultimately find utility in the analysis of biomolecules formed from complex biological samples, also including ancient ones. Finally, taking into account the inestimable value of archaeological samples, minimally destructive or non-destructive sampling techniques are needed. One of the more promising non-invasive techniques, known under the acronym EVA (ethylene–vinyl acetate impregnated with hydrophilic and hydrophobic resins) was introduced by Manfredi et al. 2017 and recently applied in our lab to a gut tissue sample of a woolly mammoth (i.e., the Shandrin’s mammoth) (Cucina et al. 2021). In particular, by coupling the EVA technology, a shotgun approach, and the use of high-resolution mass spectrometry it was possible to perform a meta-proteomic analysis which allowed us to get insight into the gut microbiota composition and turned out to be related to the diet of the Shandrin mammoth and its habitat.

In the present paper, we extended this approach to two other woolly mammoth’s tissues some 30,000 years old: the middle part of a trunk tissue sample, discovered in Sanga-Yuryakhsky, Yakutia, Russia (Petrova et al. 2017), and the portion of a trunk tip tissue of a mammoth discovered in the permafrost on the banks of the Bolshaya Baranikha River in the Kolyma district (Flerov 1931).

Particularly, using the same approach carried out on the Shandrin mammoth, we obtained two interesting results: (i) an indirect description of the habitat of these two mammoths; (ii) an improved characterization of the collagen type I, alpha-1 and alpha-2 chains.

Materials and methods

Chemicals

The chemicals employed during the analysis were of the highest purity commercially available and used without further purification. Formic Acid (FA), Ammonium bicarbonate (AMBIC), dithiothreitol (DTT), iodoacetamide (IAA) were purchased from Aldrich (St. Louis, Missouri, USA), ammonia from Carlo Erba (Milan, Italy); sequencing Grade Modified Porcine Trypsin from Promega (Madison, WI, USA); water and acetonitrile (ACN) (OPTIMA® LC/MS grade) for LC/MS analyses from Fisher Scientific (Milan, Italy). All the chemicals listed above were exclusively employed for the present study.

The Kolyma and Sanga-Yuryahskii mammoths

The collection access number of the samples of Kolyma mammoth is 71922. The tip of a mammoth trunk was found in the middle course of the Kolyma River, on the waterside of Bolshaya Baranikha river, in 1924. Later, in 1929 a local resident of Srednekolymsk town Kondratyeva hand over trunk to geologist K. Ya. Pyatkovsky, who donated it to the Leningrad Zoological Museum. The exhibit was described by KK Flerov in 1931. The absolute age of the specimen, as well as the conditions of the finding, are not known.

The collection access number of the samples of Sanga-Yuryah mammoth is 31738/2. A large part of the skeleton, two legs, the middle part of the trunk and separate bones with soft tissues of a small female mammoth, were excavated by K. K. Vollosovich expedition in 1908 on the Sanga Yuryakh river, in the northern part of the Yano-Indigirskaya lowland. The absolute age of the specimen is 29,500 years.

Protein sampling by EVA diskette

A special plastic-like film based on ethylene–vinyl acetate (EVA) as the binder of ground AG 501 Bio-Rad mix-bed cation/anion exchange resins was prepared as reported by Righetti et al. (2019). Protein sampling by EVA diskettes was carried out at the Zoological Museum of the Russian Academy of Sciences in St. Petersburg. For sampling large sample surfaces, it is necessary to find regions with a higher concentration of proteins. For this search, the fluorescence of phenylalanine, tyrosine, and tryptophan under UV illumination was studied.

UV LED for illumination and a digital camera with a special optical filter for fluorescence detection were used. The fluorescence level at each point was displayed in pseudo colors on the instrument interface (green, yellow, and red—in order of increasing fluorescence intensity) (see Fig. 2a). This made it possible to quickly identify regions for sampling on paleontological samples. This portable system was made in SpringStyle Tech Design Ltd for quick examination of protein traces’ presence on paleontological and archaeological samples.

A SpringStyle sensor for formaldehyde (FA) residuals detection, to check the gut before sampling was applied. The selection criterion for the Museum's exhibits was the absence of formaldehyde processing of paleontological samples. Most of the samples from the paleontological collection in the 70 s of the twentieth century were treated with formaldehyde. The specimen of the mammoth trunk is one of the few exhibits that did not undergo this formaldehyde treatment.

The EVA diskette was gently humidified with ultrapure water and then placed on sample gut cavities in three regions for 60 min. To prevent drying of the EVA films, they were covered with parafilm® from the outside (Fig. b).

Protein extraction protocol

With the aim to minimize cross-contamination with other biological samples, protein extraction and sample handling were performed in a laboratory “clean room” dedicated to ancient protein analysis and using dedicated chemicals, lab glassware, and equipment. Surfaces and equipment were washed with 50% 2-propanol before use. Non-latex gloves were used. A section of the EVA diskette (5 mm × 5 mm) was cut with a scalpel and the proteins trapped in its film were eluted sequentially with 200 μL of volatile buffers (formate at pH 3, followed by ammonia at pH 10) and finally with volatile solvents (acetonitrile) to collect positively and negatively charged as well as hydrophobic proteins. The dried eluate was suspended in 50 mM AMBIC and the proteins were quantified by a fluorimetric assay using the Qubit Protein Assay kit with the Qubit 1.0 Fluorometer (ThermoFisher Scientific, Milan, Italy) (Saletti et al. 2018).

Then, about 7 μg of protein eluate was reduced, for 3 h at room temperature, by 5.4 μg of DTT (concentration of the stock solution: 10 mM) and alkylated, 1 h in the dark at 20 °C, by 11 μg of IAA (concentration of the stock solution: 45 mM). The sample was finally digested by 0.14 μg of porcine trypsin at an enzyme–substrate ratio of 1:50 (overnight, 37 °C). The resulting peptide mixture solutions were dried under vacuum (Concentrator Plus, Eppendorf), re-dissolved in 50 µL of 5% aqueous FA, filtered by ultracentrifugation (750 µL, 0.2 µm Nonsterile Micro-Centrifugal Filters, Sepachrom, Rho, Milan), and analyzed by UHPLC/high-resolution nanoESI–MS/MS. An empty diskette of EVA film was used as a control sample. It was processed and analyzed by proteomics in the same way as the EVA diskettes placed in contact with the surface of the mammoth samples. Only a meagre number of background peptides was identified by database search in the EVA control sample (see Supplementary Material, Table S10).

Mass spectrometry analysis

Mass spectrometry data were acquired via a Thermo Fisher Scientific Orbitrap Fusion Tribrid® (Q-OT-qIT) mass spectrometer (Thermo Fisher Scientific, Bremen, Germany). Liquid chromatography was carried out using a Thermo Scientific Dionex UltiMate 3000 RSLCnano system (Sunnyvale, CA). One microliter of peptide mixture was loaded onto an Acclaim ®Nano Trap C18 Column (100 µm i.d. × 2 cm, 5 µm particle size, 100 Å). After washing the trapping column with solvent A (H₂O + 0.1% FA) for 3 min at a flow rate of 7 μL/min, the peptides were eluted from the trapping column onto a PepMap® RSLC C18 EASY-Spray column (75 µm i.d. × 50 cm, 2 µm particle size, 100 Å). Peptides were separated by elution at a flow rate of 0.25 µL/min at 40 °C by a linear gradient of solvent B (ACN + 0.1% FA) in A, 5% for 3 min, followed by 5% to 65% in 85 min, 65%–95% in 5 min, finishing by holding 95% B 5 min, 95%–5% in 10 min and re-equilibrating at 5% B for 15 min. The eluting peptide cations were converted to gas-phase ions by electrospray ionization using a source voltage of 1.75 kV and introduced into the mass spectrometer through a heated ion transfer tube (275 °C). Survey scans of peptide precursors from 200 to 1600 m/z were performed at 120 K resolution (@ 200 m/z). Tandem MS was performed by isolation at 1.6 Th with the quadrupole, HCD fragmentation with a normalized collision energy of 35, and rapid scan MS analysis in the ion trap (low-resolution MS/MS analysis). Only those precursors with charge state 2 ÷ 4 and intensity above the threshold of 1∙10³ were sampled for MS². The dynamic exclusion duration was set to 60 s with a 10 ppm tolerance around the selected precursor and its isotopes. Monoisotopic precursor selection was turned on. The instrument was run in full speed mode with 3 s cycles, meaning it would continuously perform MS² events until the list of non-excluded precursors diminished to zero or 3 s, whichever is shorter. MS/MS spectral quality was enhanced by enabling the parallelizable time option (i.e., using all parallelizable time during full scan detection for MS/MS precursor injection and detection). Mass spectrometer calibration was performed using the Pierce® LTQ Velos ESI Positive Ion Calibration Solution (Thermo Fisher Scientific). MS data acquisition was carried out by utilizing the Xcalibur v. 3.0.63 software (Thermo Fisher Scientific). To avoid cross-contamination with other biological samples, all solvents were prepared freshly, and ancient samples were not processed or analyzed in one batch with modern references. Moreover, to minimize carryover during nLC-MS/MS runs, from three to five blank runs were performed before each analysis using the same gradient program. Spectra acquired in the last blank run were searched by PEAKS and MaxQuant softwares against the SwissProt database without taxonomy restrictions and using the same parameters of the archaeological samples.

Database search for metaproteomic analysis

Mass spectrometry data were processed by MaxQuant (MQ) software 1.6.17.0 (https://www.maxquant.org/). The raw data were analyzed and searched against three different databases separately: (i) a database, including Swiss-Prot and TrEMBL sequences of Loxodonta africana, Elephas maximus, Mammut americanum, and Mammuthus primigenius (30,304 entries, February 2021) that in the text is indicated as Proboscidea database; (ii) the Viridiplantae database (Swiss-Prot; 40,656 entries, February 2021); and (iii) a Bacteria and Nematoda database including 334,868 and 5099 Swiss-Prot entries from bacteria and nematode, respectively (February 2021). Moreover, the common Repository of Adventitious Proteins (c-RAP; https://www.thegpm.org/crap/) contaminant database was enabled in all database searches.

The first step of database search was carried out using the following parameters: (a) tryptic peptides with a maximum of 3 missed cleavage sites; (b) cysteine carbamidomethylation as a fixed modification; (c) oxidation of methionine, the transformation of N-terminal glutamine and N-terminal glutamic acid residue to pyroglutamic acid form, oxidation of proline and the deamidation of asparagine and glutamine as variable modifications. The match type was “match from and to”. The decoy mode was “revert”. PSM, Protein, and Site decoy fraction FDR were set at 0.01 as the threshold for peptide and protein identifications. The minimum score for modified and unmodified peptides was set at 40. All the other parameters were set as default. In the data analysis, only peptides with intensity over the Max Quant threshold were considered.

Each database was investigated to identify additional chemical modifications and improve peptides identifications. All the parameters were the same as the previous step; in particular, the following chemical modifications were investigated, as variable modifications: (i) oxidation, di-oxidation, formation of kynurenine, and formation of oxo-lactone, for tryptophan residues; (ii) oxidation, di-oxidation, iodination and di-iodination, and formation of dopaquinone, for tyrosine residues; (iii) acetylation of lysine; (iv) di-oxidation of methionine; (v) tri-oxidation of cysteine. A protein was considered identified if a minimum of two peptides were matched. Finally, to be sure of the species assigned by the software to each protein identified, all the identified peptides underwent BLASTp (Basic Local Alignment Search Tool for protein) searches through the NCBI database (http://blast.ncbi.nlm.nih.gov/Blast.cgi) to validate species identifications and to rule out conserved peptides between species. The peptides in common between the three groups were not considered as valid identification either for further calculations.

Calculation of the level of deamidation and other chemical modifications

An estimation of the percentage of deamidation for each sample was calculated with the aid of a freely available command-line script for Python 2.x (https://github.com/dblyon/deamidation), which uses the MaxQuant “evidence.txt” file (Mackie et al. 2018). The calculations were done separately for potentially original peptides and potential contaminants peptides as previously reported (Mackie et al. 2018; Tanasi et al. 2021). Analogously, estimation of the percentage of the other chemical modifications investigated was obtained applying the same model of the deamidation script, separately for potentially original and potentially contaminant peptides (See Supplementary Material).

Metaproteomics analysis at peptide level

The metaproteomics analyses were performed by consulting the open-source web application Unipept (Unipept 4.3; http://unipept.ugent.be) (Mesuere et al. 2015), using the peptide matches with an ion score greater than 40, assigned by Max Quant to all the peptides matched, and with intensity over the Max Quant threshold.

The tool for metaproteomics analysis is realized for tryptic peptides, obtained with a shotgun approach, from environmental samples. It can calculate the Lowest Common Ancestors (LCA) of a group of peptides, giving an insight into the biodiversity of the sample, and integrating complementary functional analysis (Mesuere et al. 2018). Thanks to its algorithm, it shows the most specific taxonomic level for each peptide. Ubiquitarian peptides are generically assigned to “organism”.

Database search for sequence characterization of collagen

With the aim to characterize collagen amino acid sequence, nLC-nESI MS/MS data were analyzed and searched against the all UniProt publicly available entries of: (i) collagen type I, alpha 1 chain (col1a1; total of 772 sequences), and (ii) alpha 2 chain (col1a2; total of 694 sequences) using integrated PEAKS X De Novo sequencing (v. 10.0, Bioinformatics Solutions Inc., Waterloo, ON Canada) and the MaxQuant software.

The amino acid sequences generated by PEAKS de novo sequencing software from each spectrum were searched using the SPIDER algorithm, a dedicated search tool of PEAKS that is specially designed to detect peptide mutations and perform cross-species homology search (Han et al. 2005). Database search was carried out using the following parameters: (i) tryptic peptides with a maximum of 3 missed cleavage sites; (ii) cysteine carbamidomethylation as a fixed modification; (iii) oxidation of methionine, oxidation of proline, the transformation of N-terminal glutamine and N-terminal glutamic acid residue to pyroglutamic acid form, and the deamidation of asparagine and glutamine as variable modifications. Database search by PEAKS was carried out using the following parameters: the precursor mass tolerance threshold was set to 15 ppm and the maximum fragment mass error was set to 0.6 Da; Peptide spectral matches (PSM) were validated using a Target Decoy PSM Validator node based on q values at a 1% False Discovery Rate (FDR); score thresholds for Peptide spectral matches (PSMs) were set to obtain for each database search FDR values, for PSMs, Peptide sequences, and Proteins identified, below the 1.0% value.

Database search by Max Quant software was carried out using the parameters previously described for metaproteomics analysis and an ion score greater than 50.

Results

Metaproteomic analysis

Metaproteomic analysis was carried out, by MQ software, investigating three different databases: Proboscidea, Viridiplantae, and Bacteria/Nematoda. The results were analyzed at both protein and peptide levels (i.e., not considering also the proteins from which these peptides come). In particular, all the “original peptides” identified by Max Quant were used to perform an additional analysis by Unipept search engine, which uses the UniProt database, a version of the NCBI taxonomy, and an LCA algorithm (see Material and Methods section), to achieve a global vision about the taxonomic distribution of all peptides.

Protein identification

Proteins related to Mammoth

By searching the Proboscidea database, 35 and 14 proteins with at least 2 peptides were identified in “trunk” and “trunk tip” samples, respectively. Four proteins were identified in both the samples investigated. To validate species identifications, all the peptides which allowed the designation of these proteins were subjected to a sequence search by BLASTp. By this search, the identified proteins were classified into three groups: (i) 27 proteins that may be specifically related to mammoth; these proteins were authenticated by at least a peptide (peptide marker) related only with the Elephantidae species (Loxodonta africana, Mammut americanum or more in general Elephantidae); (ii) 14 proteins presenting peptides related to more species (Mammalia, Eutheria, Afrotheria, and not specific), but not coming from Homo sapiens; and finally (iii) 8 proteins related to Mammalia or not specific, but also including human. The list of proteins is reported in Table 1 (the complete list of proteins and peptides is reported in Supplementary Tables S1 and S2). As expected, most of these proteins, such as collagens, plakophilin, desmoglein, junction plakoglobin, are skin-related proteins. We also identified three keratins (i.e., keratin 32, keratin 35, and the IF rod domain-containing protein, see Supplementary Table S7) that were characterized by peptide trait sequences shared between the Loxodonta africana and human keratins. Therefore, also these proteins might be related to the mammoth, and considered as potentially endogenous components of the sample. Nevertheless, no L. africana specific peptide belonging to these keratins was recognized. Moreover, the deamidation level of the keratin-related peptides showed a similar profile to the other peptides coming from protein contaminants. Therefore, since it was not possible to establish the exact origin of these peptides, as a precaution, the above-mentioned keratins were considered contaminants.

Table 1 Classification of the proteins identified by searching Proboscidea database and after peptide BLAST search in “trunk” and “trunk tip” samples. More details are reported in the Supplementary Material (Tables S1 and S2)

Full size table

Proteins related to Viridiplantae, Bacteria, and Nematoda

MS data were also used to investigate separately the Viridiplantae, and the Bacteria/Nematoda databases. By searching the Viridiplantae database a total of eight proteins with at least two peptides (Table 2; the complete lists of proteins and peptides are reported in the Supplementary Tables S3 and S4) were identified. Four proteins were identified in the trunk, whereas the others were in the trunk tip sample. Species validation, carried out as above reported, evidenced that five proteins presented diagnostic peptides of Mesangiospermae, the other three showed marker peptides of Brassicaceae, Arabidopsis thaliana, and Amborella trichopoda, respectively.

Table 2 Classification of the proteins identified by searching Viridiplantae and Bacteria/Nematoda database and after peptide BLAST search in “trunk” and “trunk tip” samples. More details are reported in the Supplementary Material (Tables S3, S4, S5 and S6)

Full size table

By the same approach, investigation of the database including only Bacteria and Nematode, allowed the identification of only one protein (i.e., the cytidylate kinase) with at least two peptides, which was specific of the Mesoplasma florum, a bacterium isolated from plants and insects (Table 2; the complete lists of proteins and peptides are reported in the Supplementary Tables S5 and S6).

Unipept analysis

Peptides related to mammoth

To achieve a comprehensive vision about the taxonomic distribution of all peptides identified by Max Quant, the Unipept search analysis was performed. In the “trunk” sample, unipept analysis of the 774 peptides characterized by searching the Proboscidea database allowed the classification of 651 sequences that were specific for the domain of Eukaryota. Among these sequences, 190 were specific for the family of Elephantidae, in particular specific of Loxodonta africana, and only one sequence specific of Mammut americanum (Fig. 3a).

In the “trunk tip” sample, unipept analysis of the 479 peptides allowed the classification of 409 sequences that were specific for the domain of Eukaryota. 158 sequences were specific for the superorder of Afrotheria, and included 135 sequences which instead were specific for the family of Elephantidae (Fig. 3b), and in particular Loxodonta africana.

Peptides related to Viridiplantae

In the “trunk” sample, among the 496 peptides identified by searching the Viridiplantae database, 396 sequences were classified, by unipept analysis, specific for the clade Viridiplantae. Figure 4a shows the tree-graph results of Unipept investigation. Most of the peptides (386 sequences, 97%) were related to the clade of Streptophyta, whereas about 2% (9 peptides) to the clade of Chlorophyta. Moreover, this approach revealed that among the Streptophyta-related sequences, 93% (360 sequences) belong to the class of Magnoliopsida, and were mainly specific (185 sequences, 51%) of the Brassicaceae family. Among Magnoliopsida, the clade of Petrosavidae was well represented (64 sequences), but also the class of Fabales and Solanales (11 sequences) were identified.

In the “trunk tip” sample, among the 361 peptides identified by searching the Viridiplantae database, 295 sequences were classified, by unipept analysis, specific for the clade Viridiplantae. The tree-graph results of Unipept investigation, reported in Fig. 4b, show the same distribution of the peptides classified in “trunk” sample.

Peptides related to Bacteria and Nematoda

In the “trunk” sample, among the peptides identified by searching the Bacteria/Nematoda database, 359 were specific to Bacteria, whereas 26 sequences were specific to Nematoda phylum. All the peptides related to Nematoda were specific to Rhabditida (Fig. 5a), an order of phytoparasitic and zooparasitic microbivorous nematodes living in soil, and most of them were specific to the Caenorhabditis genus. Only one sequence was referred to Spirurina infraorder. In Bacteria (Fig. 6a) two main phyla were identified: Proteobacteria (42%; corresponding to 151 peptides), and Firmicutes (23%; corresponding to 84 peptides). The remaining peptides were related to other phyla including Actinobacteria (5%; 17 peptides), Bacteroidetes (4%; 13 peptides), Tenericutes (3%; 12 peptides), Cyanobacteria (3%; 11 peptides), and other bacteria phyla represented by a lower number of peptides.

In the “trunk tip” sample, 319 peptides were specific to Bacteria and 25 ones were specific to Nematoda phylum. The tree-graph results of Unipept investigation, reported in Fig. 5b (for Nematoda) and 6b (for Bacteria), show a similar distribution of the peptides classified in “trunk” sample.

Level of deamidation and other chemical modifications

To discriminate the original endogenous components, present in the investigated samples, from components that are instead probably contaminants related to the post-excavation history of the sample, we calculated the deamidation level. As known, asparagine and glutamine residues naturally deamidate over time (Robinson et al. 2001, 2002; Schroeter et al. 2016). Even if different environmental factors, such as temperature, pH, and the inherent properties of proteins, may affect the level of deamidation process, it has been observed that the level of this modification is generally higher in ancient molecules than in modern ones. Therefore, the deamidation levels of asparagine and, mainly, of glutamine residues may be used as biomolecular indicators of deterioration and the natural aging of proteins in archeology and paleo-materials.

The deamidation levels of asparagine and glutamine residues were calculated for all the three types of peptides classified as “original”: i.e., peptides related to Mammoth, Viridiplantae, and Bacteria/Nematoda. These results were compared with the deamidation level of those peptides classified as “contaminants” (Fig. 7), because belonging to the proteins of the c-RAP database. Figure 7 shows that the deamidation level of the “original peptides” ranges from 35 to 63% in both samples investigated. On the contrary, “contaminant peptides” present a deamidation level that is always below 10%. Moreover, taking into account that other forms of spontaneous, and non-enzymatic, modifications could also be a sign of proteins damage due to the exposure to light or oxidative environmental factors (Pattinson 2012; Davies 2016), the level of the oxidation products at tryptophan, tyrosine, cysteine, and methionine residues was calculated for both the “original” and “contaminant” peptides (Stadtman et al. 2005; Pattison et al. 2012; Mikšík et al. 2016; Cannizzo et al. 2012). Interestingly, the comparison of the level of the oxidation products in original and in contaminant peptides confirms the trend already observed for the deamidation (see Supplementary Figure S1).

Characterization of the alpha-1 type I collagen sequence

The dedicated database search of MS data against all the UniProt publicly available entries of col1a1 by the PEAKS X and MaxQuant software allowed the identification of a total of 40 tryptic fragments (see Supplementary Table S8). These tryptic sequences are mainly related to the col1a1 unreviewed entry (UniProt Accession No. G3SSE0; length: 1058 amino acid residues in the mature protein (Exposito et al. 2002; Buckley et al. 2011)) of the modern African elephant (Loxodonta africana), that, therefore, was chosen as the reference sequence. Overall, MS data allowed to confidently characterize about 65% of the Mammuth primigenus col1a1 primary structure (Fig. 8). Most identified sequences were identical to the reference from L. africana, although some differences were also observed. In particular, de novo interpretation of the MS/MS spectrum of the triply-charged ion at m/z 850.4013 (Fig. 9) allowed to deduce the sequence GNDGATGAAGPPGPTGPAGPPGFPGAVGAK which corresponds to the tryptic fragment G¹⁶²NDGATGAAGPPV¹⁷⁴SPTGPAGPPGFPGAVGAK¹⁹² of the col1a1 unreviewed entry (UniProt Accession No. G3SSE0) of L. africana, but carrying the deletion of the valine residue at position n. 174 and the substitution of the serine at position n. 175 with a glycine residue.

GNDGATGAAGPP-GPTGPAGPPGFPGAVGAK (Mammuth primigenus)
¹⁶²GNDGATGAAGPPVSPTGPAGPPGFPGAVGAK¹⁹² (Loxodonta africana)

This result was confirmed by the identification of many MS/MS scans (see Supplementary Table S8). It is important to highlight that the differences observed at positions 174 and 175 are most likely due to an error in the unreviewed entry G3SSE0_LOXAF. Indeed, it is well known that collagen molecules are made up of three alpha chains involved in the formation of a triple-helical structure (Exposito et al. 2010). At the primary structure level, the sequence of alpha chains consists of repeating Gly-Xaa-Yaa triplets, called the collagenous domain or triple helix motif. The triple helix is stabilized by the presence of glycine as every third residue, a high content of proline and hydroxyproline, interchain hydrogen bonds, and electrostatic interactions (Persikov et al. 2005), involving lysine and aspartate (Fallas et al. 2009). The presence of the VS trait in the G3SSE0_LOXAF entry would interrupt the repetition of Gly-Xaa-Yaa triplets, impairing the collagen triple helix; thus it represents a very unlikely mutation in collagen. On the other hand, a glycine residue is reported in the same position in another annotated version of the sequence of collagen alpha-1(I) chain of Loxodonta africana (UPI0005406912 entry; NCBI Reference Sequence: XP_010592644.1).

Moreover, a glycine residue at position 175 was previously detected by Buckley (Buckley et al. 2011; see Fig. 8), that has characterized the col1a1 sequence extracted from the bone powder of a woolly mammoth dredged from the North Sea, a modern African elephant (Loxodonta africana), and an Asian elephant (Elephas maximus) specimens, and is also typical of the col1a1 sequence of M. americanum (SwissProt Accession No. P0C2W8). An additional error in the unreviewed entry G3SSE0_LOXAF was detected. The tryptic fragment T³⁹³GPPGPAG⁴⁰⁰QDGRPGPPGPPGAR⁴¹⁴ (Table S8, Fig. 8) shows as expected, at position 400, the presence of a glycine residue, which instead is lacking in the G3SSE0_LOXAF entry. Again, the absence of the glycine residue at this position would interrupt the repetition of Gly-Xaa-Yaa triplets, impairing the collagen triple helix; thus it represents a mistake in the entry G3SSE0_LOXAF. De novo interpretation of the MS/MS spectra of the triply charged ions at m/z 738.6978 and 828.8729, allowed the identification of two amino acid substitutions in the M. primigenius col1a1 here investigated with respect to the L. africana counterpart. In detail, our experimental MS/MS data allowed to deduce the sequences VGPPGPSGNAGPPGPPGPAGKEGAK, and GPPGSAGTPGKDGLNGLPGPIGPPGPR, respectively (Figs. 10 and 11).

The first one corresponds to the sequence V⁷²³GPPGPSGNAGPPGPPGPAGKEGGK⁷⁴⁷, but shows the substitution of the glycine at position 746 with an alanine residue. The second one instead corresponds to the tryptic fragment G⁹⁸²PPGSAGAPGKDGLNGLPGPIGPPGPR¹⁰⁰⁸, carrying the amino acid substitution Ala⁹⁸⁹ → Thr. It is important to highlight that comparison of the col1a1 sequences coming from different Proboscidea species (Fig. 8) reveals that the presence of an alanine residue at position 746 and a threonine at position 989 appear unique of the col1a1 sequence of the M. primigenus here investigated, and, therefore, could be used to reliably distinguish the Mammuthus primigenius with respect to the other two genera of elephantids (i.e., Elephas and Loxodonta), and the extinct American mastodon (i.e., Mammut americanum).

Furthermore, MS data here obtained allowed to detect the presence of a proline residue at position 701, which appears characteristic of Elephantidae, and could be used as marker to distinguish them from the M. americanum, which instead has an alanine (Fig. 8). Finally, our MS data show that the primary structure of the M. primigenius col1a1 sequence here investigated has a glycine and an alanine at positions 795 and 857, respectively (Fig. 8). These two positions are shared with the unreviewed col1a1 entry of L. africana, and the reviewed entry counterpart of M. americanum. On the contrary, the Elephantidae (i.e., M. primigenius, L. africana and E. maximus) col1a1 sequences reported by Buckley (Buckley et al., 2011) show the presence of an alanine (at position 795) and a serine (at position 857).

Charaterization of the alpha-2 type I collagen sequence

The dedicated database search of MS data against all the UniProt publicly available entries of alpha-2 type I collagen (col1a2) by the PEAKS X and MaxQuant softwares allowed the identification of a total of 31 tryptic fragments (see Supplementary Table S9). These tryptic sequences are mainly related to the col1a2 unreviewed entry (UniProt Accession No. G3TIC0; length: 1040 amino acid residues in the mature protein) of the modern African elephant (Loxodonta africana), that thus was selected as reference sequence. Overall, MS data allowed to confidently characterize about 50% of the Mammuth primigenus col1a2 primary structure (Fig. 12). The characterized sequence was identical to the reference from L. africana, except for an amino acid point difference. In particular, de novo interpretation of the MS/MS spectrum of the double-charged ion at m/z 784.8686 (Fig. 13) allowed to deduce the sequence GSDGEAGSAGPAGPPGLR which corresponds to the tryptic fragment G³⁰³SS³⁰⁵GEAGSAGPAGPPGLR³²⁰ of L. africana, but carrying a substitution of the serine residue at position n. 305 with an aspartic acid. This substitution is supported by the presence of the MS/MS spectrum of the triple-charged ion at m/z 575.6152, corresponding to the sequence RGSDGEAGSAGPAGPPGLR (see Supplementary Table S9). However, although the corresponding amino acid trait with the asparagine at position 305 wasn’t detected, the presence of a deamidated asparagine instead of the aspartic acid, at this position, cannot be excluded. It is interesting to note that the col1a2 sequences of Elephantidae (i.e., M. primigenius, L. africana, and E. maximus) and of M. americanum, as reported by Buckley (Buckley et al. 2011), show a serine residue at position n. 305. Thus the presence of a different amino acid residue at this position could be used as marker to distinguish the Siberian Mammuthus primigenius with respect to the other Proboscidea species.

Furthermore, as previously suggested by Buckley (Buckley et al. 2011), MS data here obtained allowed to detect the presence of a threonine residue at position 781, which seems peculiar of Elephantidae, and could be used as a marker to distinguish them from the M. americanum, which instead has a serine (Fig. 12).

Discussion

Metaproteomic analysis

Metaproteomics analysis of two well-preserved mammoth remains was carried out. In particular, we investigated a mammoth trunk tip, found in 1924 in the permafrost soil on the banks of the Bolshaya Baranikha River in the Kolyma district, and a trunk fragment of a female mammoth discovered in Sanga-Yuryakhsky (Yakutia, Russia), dated as 29,500 years, on the basis of radiocarbon analyses. The structure of the bilobed trunk tip of the mammoth, discovered in the Kolyma district, suggested that it could pluck large bunches of grass and moss much easier than the Indian and African elephants. Probably, the mammoth ate mainly arboreal vegetation during winter, and herbaceous plants during summer (Boeskorov et al. 2016). Paleoecological conditions in the Kolyma region have been studied via the pollen spectra analysis (PSA) of a ground sample that adhered to a rhinoceros corpse and two samples from enclosing sediments. The composition of the pollen was dominated by grasses and shrubs; the taxonomic distribution presented mainly Gramineae (i.e., Poaceae), Asteraceae (wormwoods) and Cyperaceae, fewer Caryophillaceae, and Ranunculaceae. Other taxa observed were Fabaceae, Brassicaceae, Lamiales and other families of plants. Tree-shrub vegetation was highlighted by the presence of pollen of dwarf birches, willows, alder and larch. A similar environment could be attributed to the Yakutia mammoth. In fact, also the trunk fragment of this female mammoth was discovered in the proximity of a palustris area in East Siberia (Tolmachoff 1929).

Most of the results of our metaproteomic analysis for both samples agree with the previous paleobotanical studies, with a considerable identification of Graminaceae (Petrosavidae) peptides, followed by the other families mentioned above. An exception concerns the high percentage of Brassicaceae peptides, although this result could be reasonably related to the predominance of Arabidopsis thaliana entries in the Viridiplantae database. Furthermore, the identification of Chlorophyta species is probably due to the presence of a river in the proximity of the mammoth remains. Interestingly, an Amborella trichopoda protein (i.e., the protein TIC 214) was identified in the trunk tip sample. This plant is considered one of the ancestral angiosperms, and it is typical of a pluvial forest. Altogether, these results supported the hypothesis of a different North Siberia habitat during the Late Pleistocene. Moreover, the composition of microorganism-related peptides found in the trunks surface reflected that observed in the Shandrin mammoth gut (Cucina et al. 2021), with a predominance of Proteobacteria, followed by Firmicutes, probably attributed to the presence of grass more than tree shrub. Interestingly, in the trunks surface the presence of Firmicutes, and particularly of Bacilli, seemed to be higher. According to Omeliansky, who in 1908 carried out bacteriological research on the “Sanga-Yurach”, during the late Pleistocene quite a diverse flora of microorganisms, different kinds of bacteria, mold, and yeast existed, while in the modern trunk, mucosal microflora presented mainly a variety of aporose bacilli (Neustroev et al. 2014). Thus, it is complicated to establish if the presence of these bacteria should be attributed to a posteriori colonization, or if it reflects the presence of ancient microorganisms very similar to the bacilli that colonize the trunk microflora of the modern Proboscidea.

Finally, our metaproteomic analysis revealed that most of the peptides identified in Nematoda database belong to Caenorhabditis, a genus of nematodes that lives in bacteria-rich environments, such as compost piles, decaying dead animals and rotting fruit (Frézal and Félix 2015).

Collagen type I, alpha-1 and alpha-2 chains

By coupling the non-invasive EVA technology and a shotgun proteomic approach based on the high-resolution mass spectrometry, we were able to characterize about 65% of the col1a1 and 50% of the col1a2 sequences of the Siberian woolly mammoth (Mammuthus primigenius), including some amino acid traits of the protein never characterized before. At least for the part of the covered sequence, it appears that the col1a1 and the col1a2 of the wolly mammoth and the unreviewed col1a1 and col1a2 entries of Loxodonta africana reported in the databases (G3SSE0_LOXAF, XP_010592644.1, and G3TICO_LOXAF), are almost identical, showing only two amino acid substitutions for col1a1 and one amino acid substitution for col1a2. Our data, obtained at protein level, confirm the sequencing of the nuclear genome which suggests a difference at amino acid level of about 0.22% among Elephantids (Miller et al. 2008). More in detail, our data highlighted some errors in the unreviewed entry G3SSE0_LOXAF of col1a1 at positions 174, 175, and 400 that would interrupt the repetition of Gly-Xaa-Yaa triplets, impairing the collagen triple helix; thus they represent a very unlikely mutation in collagen.

Instead, the three observed substitutions (i.e., Gly⁷⁴⁶ → Ala, and Ala⁹⁸⁹ → Thr for col1a1 and Ser³⁰⁵ → Asp for col1a2) seem more interesting, because appear unique of the col1a1 and col1a2 sequences of the M. primigenus here investigated, and, therefore, could be used to reliably distinguish the Siberian mammoth with respect to the other two genera of elephantids (i.e., Elephas and Loxodonta), and the extinct American mastodon (i.e., Mammut americanum). However, the limited extent of this analysis may prevent a definitive conclusion, and a larger number of samples together with a combination of different digestion approaches might implement and support these identifications.

Conclusions

Shotgun proteomic analysis of mammoth trunk samples carried out by coupling the EVA film technology and high-resolution mass spectrometry allowed in-depth exploration of the metaproteome composition together with an improved characterization of the primary structure of the alpha-1 and alpha-2 chains of collagen type I.

Metaproteomic analysis allowed to identify proteins specific to the tissue investigated, but also some plant proteins related to the animal environment. Moreover, taxonomic distribution of all the identified peptides evidenced the predominance of some specific taxonomies among Viridiplantae, Bacteria, and Nematoda which are in agreement with the reconstructed habitat of Late Pleistocene Siberia. It should be noted that our results represent an indirect picture of the proteins present in the samples, because the intrinsic nature of the surface sampling of EVA technology. Thus the information obtained might not be fully exhaustive. However, it should be also highlighted that the use of EVA diskettes avoids the destruction of part of a sample of ancient tissue, a practice which is (obviously) discouraged by most museums.

Concerning the characterization of the collagen type I, alpha-1 and alpha-2 sequences, our MS data allowed to improve the previous sequence coverage obtained from the bone powder of a woolly mammoth dredged from the North Sea, as reported by Buckley (Buckley et al. 2011). In addition, we detected some differences between M. primigenius and other Proboscidea, and identified three potentially diagnostic amino acid mutations that could be used to reliably distinguish the Mammuthus primigenius with respect to the other two genera of elephantids (i.e., Elephas and Loxodonta), and the extinct American mastodon (i.e., Mammut americanum).

In conclusion, it is important to evidence, in paleoproteomics, the importance of authentication of ancient environmental proteins and peptides, particularly for open systems, such as skin samples. Overall, the approach here employed, including the authentication method based on a deep analysis of peptides modifications and damage, highlight the importance of this kind of investigation, which could integrate and support the classical paleoclimatology studies, but also improve our understanding of the phylogenetic relationships between extant and extinct species.

References

Asara JM, Schweitzer MH, Freimark LM, Phillips M, Cantley LC (2007) Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry. Science 316(5822):280–285. https://doi.org/10.1126/science.1137614 (PMID: 17431180)
Article CAS PubMed Google Scholar
Boeskorov GG, Mashchenko EN, Plotnikov VV, Shchelchkova MV, Protopopov AV, Solomonov NG (2016) Adaptation of the woolly mammoth Mammuthus primigenius (Blumenbach, 1799) to habitat conditions in the glacial period. Contemp Probl Ecol 9:544–553. https://doi.org/10.1134/S1995425516050024
Article Google Scholar
Buckley M, Collins M (2011) Collagen survival and its use for species identification in Holocene-lower Pleistocene bone fragment. Antiqua 1:1. https://doi.org/10.4081/antiqua.2011.e1
Article Google Scholar
Buckley M, Larkin N, Collins M (2011) Mammoth and Mastodon collagen sequences; survival and utility. Geochim Cosmochim Acta 75(7) 2007–2016, ISSN 0016–7037, https://doi.org/10.1016/j.gca.2011.01.022.
Buckley M, Lawless C, Rybczynski N (2019) Collagen sequence analysis of fossil camels, Camelops and c.f. Paracamelus, from the Arctic and sub-Arctic of Plio-Pleistocene North America, J Proteomics 194: 218–225, ISSN 1874–3919, https://doi.org/10.1016/j.jprot.2018.11.014.
Campbell JL, Le Blanc JC, Kibbey RG (2015) Differential mobility spectrometry: a valuable technology for analyzing challenging biological samples. Bioanalysis 7(7):853–856. https://doi.org/10.4155/bio.15.14
Article CAS PubMed Google Scholar
Campbell JL, Le Blanc JC, Schneider BB (2012) Probing electrospray ionization dynamics using differential mobility spectrometry: the curious case of 4-aminobenzoic acid. Anal Chem 18; 84(18): 7857–7864. https://doi.org/10.1021/ac301529w.
Cannizzo ES, Clement CC, Morozova K, Valdor R, Kaushik S, Almeida LN, Follo C, Sahu R, Cuervo AM, Macian F, Santambrogio L (2012) Age-related oxidative stress compromises endosomal proteostasis. Cell Rep 2(1):136–149. https://doi.org/10.1016/j.celrep.2012.06.005(ISSN2211-1247)
Article CAS PubMed PubMed Central Google Scholar
Cappellini E, Jensen LJ, Szklarczyk D, Ginolhac A, da Fonseca RA, Stafford TW, Holen SR, Collins MJ, Orlando L, Willerslev E, Gilbert MT, Olsen JV (2012) Proteomic analysis of a pleistocene mammoth femur reveals more than one hundred ancient bone proteins. J Proteome Res 11(2):917–926. https://doi.org/10.1021/pr200721u (Epub 2011 Dec 14 PMID: 22103443)
Article CAS PubMed Google Scholar
Cappellini E, Gentry A, Palkopoulou E, Ishida Y, Cram D, Roos AM, Watson M, Johansson US, Fernholm B, Agnelli P, Barbagli F, Littlewood DTJ, Kelstrup CD, Olsen JV, Lister AM, Roca AL, Dalén L, Gilbert MTP (2014) Resolution of the type material of the Asian elephant, Elephas maximus Linnaeus, 1758 (Proboscidea, Elephantidae). Zool J Linn Soc 170:222–232. https://doi.org/10.1111/zoj.12084
Article Google Scholar
Cappellini E, Prohaska A, Racimo F, Welker F, Pedersen MW, Allentoft ME, Damgaard PB, Gutenbrunner P, Dunne J, Hammann S, Roffet-Salque M, Ilardo M, Moreno-Mayar JV, Wang Y, Sikora M, Vinner L, Cox J, Evershed RP, Willerslev E (2018) Ancient biomolecules and evolutionary inference. Annu Rev Biochem 87:1029–1060. https://doi.org/10.1146/annurev-biochem-062917-012002
Article CAS PubMed Google Scholar
Cleland TP, Schroeter ER, Schweitzer MH (2015) Biologically and diagenetically derived peptide modifications in moa collagens. Proc R Soc B 282:20150015. https://doi.org/10.1098/rspb.2015.0015
Article CAS PubMed PubMed Central Google Scholar
Cleland TP, Schroeter ER, Colleary C (2020) Diagenetiforms: a new term to explain protein changes as a result of diagenesis in paleoproteomics. J Proteomics 26:103992. https://doi.org/10.1016/j.jprot.2020.103992
Article CAS Google Scholar
Cucina A, Cunsolo V, Di Francesco A, Saletti R, Zilberstein G, Zilberstein S, Tikhonov A, Bublichenko AG, Righetti PG, Foti S (2021) Meta-proteomic analysis of the Shandrin mammoth by EVA technology and high-resolution mass spectrometry: what is its gut microbiota telling us? Amino Acids 53:1507–1521. https://doi.org/10.1007/s00726-021-03061-0
Article CAS PubMed PubMed Central Google Scholar
D’Amato A, Zilberstein G, Zilberstein S, Compagnoni BL, Righetti PG (2018) Of mice and men: traces of life in the death registries of the 1630 plague in Milano. J Proteomics 180:128–137. https://doi.org/10.1016/j.jprot.2017.11.028 (Epub 2018 Jan 3 PMID: 29305937)
Article CAS PubMed Google Scholar
Exposito JY, Cluzel C, Garrone R, Lethias C (2002) Evolution of Collagens. Anat Rec 268:302–316. https://doi.org/10.1002/ar.10162
Article CAS PubMed Google Scholar
Exposito JY, Valcourt U, Cluzel CL, C, (2010) The Fibrillar Collagen Family Int. J Mol Sci 11:407–426. https://doi.org/10.3390/ijms11020407
Article CAS Google Scholar
Flerov C (1931) A trunk of mammoth (Elephas primigenius Blum.) found in the Kolyma district, Izv. Akad. Nauk SSSR. Ser Matem Est Nauk 6:863–870
Google Scholar
Frézal L, Félix MA (2015) C. elegans outside the Petri dish. eLife 4: e05849. doi: https://doi.org/10.7554/eLife.05849.
Goodman M, Birk DE, Romero-Herrera AE, Lande MA, Dene H, Barnhart MI (1980) Collagen preservation in soft tissue from the Magadan mammoth. FEBS Lett 114(1):30–34. https://doi.org/10.1016/0014-5793(80)80854-4
Article CAS Google Scholar
Greco E, El-Aguizy O, Ali MF, Foti S, Cunsolo V, Saletti R, Ciliberto E (2018) Proteomic analyses on an ancient Egyptian cheese and biomolecular evidence of brucellosis. Anal Chem 90:9673–9676. https://doi.org/10.1021/acs.analchem.8b02535
Article CAS PubMed Google Scholar
Han Y, Ma B, Zhang K (2005) SPIDER: software for protein identification from sequence tags containing de novo sequencing error. J Bioinform Comput Biol 3:697–716. https://doi.org/10.1142/s0219720005001247
Article CAS PubMed Google Scholar
Hendy J, Collins M, Teoh KY, Ashford DA, Oateshi JT, Donoghue ED, Pap I, Minnikin DE, Spigelman M, Buckley M (2016) The challenge of identifying tuberculosis proteins in archaeological tissues. J Archaeol Sci 66:146–153. https://doi.org/10.1016/j.jas.2016.01.003
Article CAS Google Scholar
Hendy J, Welker F, Speller C, Warinner C, Collins MJ (2018) A guide to ancient protein studies. Nat Ecol Evol 2:791–799. https://doi.org/10.1038/s41559-018-0510-x
Article PubMed Google Scholar
Hill RC, Wither MJ, Nemkov T, Barrett A, D’Alessandro A, Dzieciatkowska M, Hansen KC (2015) Preserved proteins from extinct Bison latifrons identified by tandem mass spectrometry; hydroxylysine glycosides are a common feature of ancient collagen. Mol Cell Proteomics 14:1946–1958. https://doi.org/10.1074/mcp.M114.047787
Article CAS PubMed PubMed Central Google Scholar
Kafle A, Coy SL, Wong BM, Fornace AJ Jr, Glick JJ, Vouros P (2014) Understanding gas phase modifier interactions in rapid analysis by differential mobility-tandem mass spectrometry. J Am Soc Mass Spectrom 25(7):1098–1113. https://doi.org/10.1007/s13361-013-0808-5
Article CAS PubMed PubMed Central Google Scholar
Mackie M, Rüther P, Samodova D, Di Gianvincenzo F, Granzotto C, Lyon D, Peggie DA, Howard H, Harrison L, Jensen LJ, Olsen JV, Cappellini E (2018) Palaeoproteomic profling of conservation layers on a 14th century Italian wall painting. Angew Chem Int Ed 57(25):7369–7374. https://doi.org/10.1002/anie.201713020
Article CAS Google Scholar
Manfredi M, Barberis E, Gosetti F, Conte E, Gatti G, Mattu C, Robotti E, Zilberstein G, Koman I, Zilberstein S, Marengo E, Righetti PG (2017) Method for noninvasive analysis of proteins and small molecules from ancient objects. Anal Chem 89(6):3310–3317. https://doi.org/10.1021/acs.analchem.6b03722
Article CAS PubMed Google Scholar
Mesuere B, Debyser G, Aerts M, Devreese B, Van-damme P, Dawyndt P (2015) The Unipept metaproteomics analysis pipeline. Proteomics 15:1437–1442. https://doi.org/10.1002/pmic.201400361
Article CAS PubMed Google Scholar
Mesuere B, Van der Jeugt F, Willems T, Naessens T, Devreese B, Martens L, Dawyndt P (2018) High-throughput metaproteomics data analysis with Unipept: a tutorial. J Proteom 171:11–22. https://doi.org/10.1016/j.jprot.2017.05.022(Epub2017May24PMID:28552653)
Article CAS Google Scholar
Mikšík I, Sedláková P, Pataridis S, Bortolotti F, Gottardo R (2016) Proteins and their modifications in a medieval mummy. Protein Sci 11:2037–2044. https://doi.org/10.1002/pro.3024(Epub2016Sep8.PMID:27543755;PMCID:PMC5079257)
Article Google Scholar
Miller W, Drautz DI, Ratan A, Pusey B, Qi J, Lesk AM, Tomsho LP, Packard MD, Zhao F, Sher A, Tikhonov A, Raney B, Patterson N, Lindblad-Toh K, Lander ES, Knight JR, Irzyk GP, Fredrikson KM, Harkins TT, Sheridan S, Pringle T, Schuster SC (2008) Sequencing the nuclear Mammoth and Mastodon collagen 2015 genome of the extinct woolly mammoth. Nature 20; 456(7220), 387–390 https://doi.org/10.1038/nature07446.
Neustroev MP, Tarabukina NP, Neustroev MM, Fedorova MP, Stepanova AM, Parnikova SI, Baishev AA (2014) Microbiota of fossil animals preserved in yakut permafrost. J Earth Sci Eng 4(8):484–489. https://doi.org/10.17265/2159-581X/2014.08.001
Article Google Scholar
Pattison DI, Rahmanto AS, Davis MJ (2012) Photo-oxidation of proteins. Photochem Photobiol Sci 11:38–53. https://doi.org/10.1039/C1PP05164D
Article CAS PubMed Google Scholar
Petrova EA, Masutin VV, Zhuykova IA (2017) Two incomplete skeletons of woolly mammoth (Mammuthus primigenius) from the late Pleistocene in the Kirov Region. European Russia Russ J Theriol 16(2):157–175. https://doi.org/10.15298/rusjtheriol.16.2.05
Article Google Scholar
Plotnikov VV, Maschenko EN, Pavlov IS, Protopopov AV, Boeskorov GG, Petrova EA (2015) New data on trunk morphology in the woolly mammoth, Mammuthus primigenius (Blumenbach). Paleontol J 49:200–210. https://doi.org/10.1134/S0031030115020070
Article Google Scholar
Presslee S, Slater GJ, Pujos F, Forasiepi AM, Fischer R, Molloy K, Mackie M, Olsen JV, Kramarz A, Taglioretti M, Scaglia F, Lezcano M, Lanata JL, Southon J, Feranec R, Bloch J, Hajduk A, Martin FM, Gismondi RS, Reguero M, de Muizon C, Greenwood A, Chait BT, Penkman K, Collins M, MacPhee RDE (2019) Palaeoproteomics resolves sloth relationships. Nat Ecol Evol 3:1121–1130. https://doi.org/10.1038/s41559-019-0909-z
Article PubMed Google Scholar
Righetti PG, Zilberstein G, D’Amato A (2019) What sherlock sorely missed: the EVA technology for cultural heritage exploration. Expert Rev Proteomics 16(6):533–542. https://doi.org/10.1080/14789450.2019.1624164
Article CAS PubMed Google Scholar
Robinson NE, Robinson AB (2001) Prediction of protein deamidation rates from primary and three-dimensional structure. Proc Natl Acad Sci USA 98:4367–4372. https://doi.org/10.1073/pnas.071066498
Article CAS PubMed PubMed Central Google Scholar
Rohland N, Malaspinas AS, Pollack JL, Slatkin M, Matheus P, Hofreiter M (2007) Proboscidean mitogenomics: chronology and mode of elephant evolution using mastodon as an outgroup. PLoS Biol 5:e207. https://doi.org/10.1371/journal.pbio.0050207
Article CAS PubMed PubMed Central Google Scholar
Rybczynski N, Gosse JC, Harington CR, Wogelius RA, Hidy AJ, Buckley M (2013) Mid-Pliocene warm-period deposits in the High Arctic yield insight into camel evolution. Nature Com 4:1550. https://doi.org/10.1038/ncomms2516
Article CAS Google Scholar
Saletti R, Reina S, Pittalà MGG, Magrì A, Cunsolo V, Foti S, De Pinto V (2018) Post-translational modifcations of VDAC1 and VDAC2 cysteines from rat live mitochondria. Biochim Biophys Acta Bioenergetics 1859:806–816. https://doi.org/10.1016/j.bbabio.2018.06.007
Article CAS PubMed Google Scholar
Schroeter ER, Cleland TP (2016) Glutamine deamidation: an indicator of antiquity, or preservational quality? Rapid Commun Mass Spectrom 30:251–255. https://doi.org/10.1002/rcm.7445
Article CAS PubMed Google Scholar
Schweitzer M, Hill CL, Asara JM, Lane WS, Pincus SH (2002) Identification of immunoreactive material in mammoth fossils. J Mol Evol 55(6):696–705. https://doi.org/10.1007/s00239-002-2365-6 (PMID: 12486528)
Article CAS PubMed Google Scholar
Shevchenko A, Schuhmann A, Thomas H, Wetzel G (2018) Fine Endmesolithic fish caviar meal discovered by proteomics in foodcrusts from archaeological site Friesack 4 (Brandenburg, Germany). PLoS ONE 13(11):e0206483. https://doi.org/10.1371/journal.pone.0206483
Article CAS PubMed PubMed Central Google Scholar
Shoshani J (1998) Understanding proboscidean evolution: a formidable task. Trends Eco Evol 13(1):480–487. https://doi.org/10.1016/S0169-5347(98)01491-8
Article CAS Google Scholar
Shoshani J, Tassy P (2005) Advances in proboscidean taxonomy and classification, anatomy and physiology, and ecology and behavior. Quatern Int 126:5–20. https://doi.org/10.1016/J.QUAINT.2004.04.011
Article Google Scholar
Stadtman ER, Van Remmen H, Richardson A, Wehr NB, Levine RL (2005) Methionine oxidation and aging. Biochim. Biophys. Acta 1703(2):135–140. ISSN 1570–9639, https://doi.org/10.1016/j.bbapap.2004.08.010.
Tanasi D, Cucina A, Cunsolo V, Saletti R, Di Francesco A, Greco E, Foti S (2021) Paleoproteomic profiling of organic residues on prehistoric pottery from Malta. Amino Acids 53:295–312. https://doi.org/10.1007/s00726-021-02946-4
Article CAS PubMed PubMed Central Google Scholar
Tolmachoff IP (1929) The carcasses of the mammoth and rhinoceros found in the frozen ground of Siberia. Trans Am Philos Soc New Series 23(1):11–74. https://doi.org/10.2307/1005437
Article Google Scholar
Warinner C, Hendy J, Speller C, Cappellini E, Fischer R, Trachsel C, Arneborg J, Lynnerup N, Craig OE, Swallow DM, Fotakis A, Christensen RJ, Olsen JV, Liebert A, Montalva N, Fiddyment S, Charlton S, Mackie M, Canci A, Bouwman A, Rühli F, Gilbert MTP, Collins MJ (2014) Direct evidence of milk consumption from ancient human dental calculus. Sci Rep 4:7104. https://doi.org/10.1038/srep07104
Article CAS PubMed PubMed Central Google Scholar
Warinner C, Speller C, Collins MJ (2015) A new era in palaeomicrobiology: prospects for ancient dental calculus as a long-term record of the human oral microbiome. Phil Trans R Soc. https://doi.org/10.1098/rstb.2013.0376
Article Google Scholar
Welker F, Collins MJ, Thomas JA, Wadsley M, Brace S, Cappellini E, Turvey ST, Reguero M, Gelfo JN, Kramarz A, Burger J, Thomas-Oates J, Ashford DA, Ashton PD, Rowsell K, Porter DM, Kessler B, Fischer R, Baessmann C, Kaspar S, Olsen JV, Kiley P, Elliott JA, Kelstrup CD, Mullin V, Hofreiter M, Willerslev E, Hublin JJ, Orlando L, Barnes I, MacPhee RDE (2015) Ancient proteins resolve the evolutionary history of Darwin/’s South American ungulates. Nature 522:81–84. https://doi.org/10.1038/nature14249
Article CAS PubMed Google Scholar
Winter DL, Wilkins MR, William AD (2019) Differential ion mobility-mass spectrometry for detailed analysis of the proteome. Trends Biotechnol 37(2):198–213. https://doi.org/10.1016/j.tibtech.2018.07.018
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the Bio-Nanotech Research and Innovation Tower (BRIT; PON project financed by the Italian Ministry for Education, University and Research MIUR) for the availability of the Orbitrap Fusion mass spectrometer.

Funding

This research was partly supported by the University of Catania—linea PIACERI, grant number ARVEST.

Author information

Authors and Affiliations

Laboratory of Organic Mass Spectrometry, Department of Chemical Sciences, University of Catania, Viale A. Doria 6, 95125, Catania, Italy
Annamaria Cucina, Antonella Di Francesco, Rosaria Saletti, Maria Gaetana Giovanna Pittalà, Salvatore Foti & Vincenzo Cunsolo
Spectrophon Ltd, Oppenheimer 7, 7670107, Rehovot, Israel
Gleb Zilberstein & Svetlana Zilberstein
Zoological Institute, Russian Academy of Sciences, Universitetskaya nab.1, Saint-Petersburg, 199034, Russia
Alexei Tikhonov & Andrey G. Bublichenko
Department of Chemistry, Materials and Chemical Engineering ‘‘Giulio Natta’’, Politecnico di Milano, Via Mancinelli 7, 20131, Milano, Italy
Pier Giorgio Righetti

Authors

Annamaria Cucina
View author publications
You can also search for this author in PubMed Google Scholar
Antonella Di Francesco
View author publications
You can also search for this author in PubMed Google Scholar
Rosaria Saletti
View author publications
You can also search for this author in PubMed Google Scholar
Maria Gaetana Giovanna Pittalà
View author publications
You can also search for this author in PubMed Google Scholar
Gleb Zilberstein
View author publications
You can also search for this author in PubMed Google Scholar
Svetlana Zilberstein
View author publications
You can also search for this author in PubMed Google Scholar
Alexei Tikhonov
View author publications
You can also search for this author in PubMed Google Scholar
Andrey G. Bublichenko
View author publications
You can also search for this author in PubMed Google Scholar
Pier Giorgio Righetti
View author publications
You can also search for this author in PubMed Google Scholar
Salvatore Foti
View author publications
You can also search for this author in PubMed Google Scholar
Vincenzo Cunsolo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vincenzo Cunsolo.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Research involving human participants and/or animals

Our research project checked the ethics committee before starting. Fossils and permafrost paleontological samples from museum collections are not classified as “animal samples".

Informed consent

No informed consent is required for this study.

Additional information

Handling editor: Y. Su.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 1518 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cucina, A., Di Francesco, A., Saletti, R. et al. Meta-proteomic analysis of two mammoth’s trunks by EVA technology and high-resolution mass spectrometry for an indirect picture of their habitat and the characterization of the collagen type I, alpha-1 and alpha-2 sequence. Amino Acids 54, 935–954 (2022). https://doi.org/10.1007/s00726-022-03160-6

Download citation

Received: 30 November 2021
Accepted: 27 March 2022
Published: 17 April 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s00726-022-03160-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Meta-proteomic analysis of two mammoth’s trunks by EVA technology and high-resolution mass spectrometry for an indirect picture of their habitat and the characterization of the collagen type I, alpha-1 and alpha-2 sequence

Abstract

Similar content being viewed by others

Meta-proteomic analysis of the Shandrin mammoth by EVA technology and high-resolution mass spectrometry: what is its gut microbiota telling us?

Zooarchaeology by Mass Spectrometry (ZooMS) Collagen Fingerprinting for the Species Identification of Archaeological Bone Fragments

Comparing extraction method efficiency for high-throughput palaeoproteomic bone species identification

Introduction

Materials and methods

Chemicals

The Kolyma and Sanga-Yuryahskii mammoths

Protein sampling by EVA diskette

Protein extraction protocol

Mass spectrometry analysis

Database search for metaproteomic analysis

Calculation of the level of deamidation and other chemical modifications

Metaproteomics analysis at peptide level

Database search for sequence characterization of collagen

Results

Metaproteomic analysis

Protein identification

Proteins related to Mammoth

Proteins related to Viridiplantae, Bacteria, and Nematoda

Unipept analysis

Peptides related to mammoth

Peptides related to Viridiplantae

Peptides related to Bacteria and Nematoda

Level of deamidation and other chemical modifications

Characterization of the alpha-1 type I collagen sequence

Charaterization of the alpha-2 type I collagen sequence

Discussion

Metaproteomic analysis

Collagen type I, alpha-1 and alpha-2 chains

Conclusions

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Research involving human participants and/or animals

Informed consent

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 1518 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation