Abstract
Vertebrate blood coagulation is controlled by a cascade containing more than 20 proteins. The cascade proteins are found in the blood in their zymogen forms and when the cascade is triggered by tissue damage, zymogens are activated and in turn activate their downstream proteins by serine protease activity. In this study, we examined proteomes of 21 chordates, of which 18 are vertebrates, to reveal the modular evolution of the blood coagulation cascade. Additionally, two Arthropoda species were used to compare domain arrangements of the proteins belonging to the hemolymph clotting and the blood coagulation cascades. Within the vertebrate coagulation protein set, almost half of the studied proteins are shared with jawless vertebrates. Domain similarity analyses revealed that there are multiple possible evolutionary trajectories for each coagulation protein. During the evolution of higher vertebrate clades, gene and genome duplications led to the formation of other coagulation cascade proteins.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Coagulation (blood clotting) is the process of gel formation of the blood at injured and/or damaged tissue sites. Under normal conditions, blood circulates through arteries, capillaries and veins but responds to tissue damage by clot formation. Therefore, it is essential to have a rapid and effective coagulation mechanism to prevent blood loss as well as undesirable coagulation within veins (thrombosis). Clotting is regulated by a complex cascade which involves more than two dozen proteins including serine proteases (Doolittle (2009)).
The cascade is divided into three pathways: the intrinsic pathway, the extrinsic pathway and the common pathway (Grover and Mackman (2019)). The intrinsic pathway is triggered by the internal trauma in the blood vessels. Factor XII (FXII) is activated by exposure to endothelial collagen which then can activate downstream proteins (Chaudhry et al (2021)). As for the extrinsic pathway, damaged endothelium of extravascular cells releases tissue factor (TF) into the blood stream and TF acts as a cofactor of activated Factor VII (FVII) to enhance its protease activity (Smith et al (2015)). These two pathways merge into a common pathway involving the Factor X (FX) and Factor V (FV) complex. This common pathway results in fibrin clots on the wound site (Supplementary Figure S1).
More than half of the proteins which belong to the blood coagulation cascade are serine proteases which are highly similar to each other in terms of function and structure: They serve as activators/deactivators of downstream proteins with their protease activity and possess the same or similar domain arrangements (Doolittle (2009)). It is clear that the serine proteases share common ancestor(s) due to the domain similarities not only in their terminal protease domains but also in their auxiliary Gla, EGF, and Kringle domains (Doolittle (2009); Ponczek et al (2012)).
Serine proteases FIX, FX, Protein C and Protein Z have identical domain arrangements: Gla – EGF – Fxa inhibition – Trypsin. Similarly, cofactors of the system, FV and FVIII, also share similar domain arrangements: Cu-oxidase (4–6 repeats) – F5 F8 type C. Finally, Plasma Kallikrein and Factor XI (FXI) also share the same domain organization: PAN 1 (× 4) – Trypsin. Gene duplications, fusions, fissions and terminal losses can lead to novel domain arrangements (Bornberg-Bauer and Albà (2013); Cromar et al (2014); Klasberg et al (2018)). Therefore, investigating the changes in domain arrangements can reveal the modular evolution of the blood coagulation cascade (Patthy (1985)).
Analogous to blood coagulation in vertebrates, Arthropods have a hemolymph clotting system to prevent hemolymph loss and provide immunity (Theopold et al (2002, 2004)). Even though both systems possess proteins such as serine proteases, cofactors, and clot stabilizers which are similar in terms of their functions, clotting mechanisms differ between the blood coagulation and the hemolymph clotting systems (Theopold et al (2002, 2004)). While the sequence similarities between the components of these two systems are low, it is argued that both systems share a common ancestry considering their functional and organizational similarities (Krem and Cera (2002)). Analysis of both systems on the domain level would help to reveal the evolutionary histories of vertebrate blood coagulation and Arthropod hemolymph clotting cascades.
The evolution of blood coagulation cascade proteins is widely studied (Patthy (1985, 1990); Davidson et al (2003); Jiang and Doolittle (2003); Ponczek et al (2008, 2012, 2020); Doolittle (2009); Kimura et al (2009)). However, as the amount and the quality of genomic data are increasing, the resolution of the evolutionary studies is improving. In this study, we aim to show possible evolutionary trajectories of the fundamental blood coagulation cascade proteins using not only vertebrate proteomes but also invertebrate proteomes. We tried to shed light on the evolution of the blood coagulation cascade using domain alignments between vertebrate and invertebrate proteomes.
Methods
Dataset Creation
We selected 18 proteomes (Table 1) to be able to study the domain arrangements occurring in the blood coagulation cascade in detail. These proteomes were selected to represent most of the vertebrate clades ranging from jawless vertebrates to mammals. Proteomes of three species belonging to Cnidaria and Tunicata clades were also downloaded as outgroups from Ensembl (Hunt et al (2018)), NCBI (Johnson et al (2008)) and UniProt (UniProt Consortium (2018)) databases (Supplementary Table S1). Two Arthropoda species were also used to reveal domain similarities between blood coagulation and hemolymph clotting. The proteomes were cleared of isoforms and only the longest isoforms were retained. The isoformCleaner program from the dw-helper suite (https://zivgitlab.uni-muenster.de/domain-world/dw-helper) was used with settings ‘-r “gene[: =] \\s*([\\S] +)[\\s]*”’ for proteomes from Ensembl and the settings ‘-r “GN[: =] \\s*([\\S] +)[\\s]*”’ for proteomes from Uniprot. NCBI proteomes could not be handled with that program, so a python script (provided together with the study data) has been used. To detect domain arrangements, we used PfamScan (Finn et al (2014)) using Pfam database v32 (El-Gebali et al (2019)). The blood coagulation domains in each studied species are given in Table 1.
Identifying Blood Coagulation Genes
We used OrthoFinder (v2.3.12) to detect orthologous proteins between species (Emms and Kelly (2015)). We performed three OrthoFinder searches using studied proteomes: one for vertebrates to detect orthologues of blood coagulation proteins, one for arthropods including human, mouse and zebrafish proteomes as references, to detect orthologues of hemolymph clotting proteins and one for invertebrates including human, mouse and zebrafish as references, to detect orthologues of BLASTP target proteins for Prothrombin, FVII, FX. Exonerate (v2.2.0) (Slater and Birney (2005)) was run in ‘protein to genome’ mode to verify whether a certain ortholog was found when there were discrepancies between the current data and the literature.
Identifying Potential Origins of Domains
One of the goals of this study was to identify potential origins of domains involved in the blood coagulation cascade. Specific databases were created containing only sequences from our data set belonging to one specific domain. To shed light on the domain evolution of the blood coagulation cascade, domain level reciprocal BLASTP (v2.9.0) (Altschul et al (1990)) searches were executed between these databases. The domains of blood coagulation cascade proteins in the most basal species were blasted against domains in the closest species not possessing the respective proteins.
If the query domain is found within the first three hits (when sorted by e-value) of the reciprocal search, the target is considered a good hit. To perform a BLASTP search, we first built a local database using all respective domains from a target proteome.
Analysis of “missing” Genes
In a few cases, orthology analysis was not be able to identify some proteins in the studied species which should be present according to the current literature and in some cases orthology analysis identify some proteins which should not be present in the studied species. In both cases the same methodology was used: We used the Ensembl genome browser (Howe et al (2021)) to determine which genes are the closest on either side of the missing gene. We then checked those two genes in the target species to identify the potential region where the gene should be. Furthermore, we performed protein to genome exonerate searches to support the results. Additionally, to ensure that the gene is not only missing in our set of species, we also searched the entire clade surrounding that species for potential matches. We used NCBI blastp and tblastn together with the non-redundant databases (nr and nt). As the query we used a known ortholog from our species set that was most closely related to that clade. If all the analyses show clear signs of the absence of the gene (i.e., no synteny, bad exonerate and Blast hit(s)), we consider the gene is lost during the evolution of vertebrates. When the results were not clear, we label the gene as ambiguous.
Phylogenetic Tree
MUSCLE (v5.1) was used to align blood coagulation cascade proteins in the proteomes (Edgar (2004)). We built phylogenetic trees using IQ-TREE (1.6.1) with 1000 bootstrapping runs (Minh et al (2020)). We, then, used FigTree (1.4.4) to visualize the phylogenetic trees (Rambaut (2014)). All software was used with default parameters following the manuals.
Results and Discussion
Conservation and Domain Arrangements of the Proteins
Putative orthologs of blood coagulation proteins were searched in the study species using OrthoFinder and BLASTP. Table 2 lists the presence and absence of the proteins in the given clades. While most of the data were consistent with the current literature, we could not identify one of the cofactors of the cascade, FV, in jawless vertebrates. It was suggested that FX exists in jawless vertebrates along with its cofactor FV (Doolittle (2009)). Even though FX is found in jawless vertebrates, FV could not be found by Orthofinder, BLASTP, or synteny analyses. However, Exonerate analysis yielded short fragments of alignment (see study data). FV, FVIII, hephaestin, and ceruloplasmin, which are multicopper oxidases, are very similar in their sequence (Vasin et al (2013)). Therefore, one of the multicopper oxidase protein family proteins might have been misidentified as FV using the NCBI Trace database (NCBI Trace Archive) which contains uncurated DNA sequences (see below: Origin of Domains and Evolution of Domain Arrangements).
FXII is known to be missing in cetaceans (Robinson et al (1969); Huelsmann et al (2019); Ponczek et al (2020)). In the study presented here, a putative ortholog of FXII was identified in the bottlenose dolphin Tursiops truncatus. However, according to Exonerate results, the genomic region in T. truncates that spans most of the FXII protein has four nonsense mutations (Supplementary Figure S2). Exonerate runs on other cetaceans, namely sperm whale (Physeter catodon) and blue whale (Balaenoptera musculus), showed similar or worse alignments. Therefore, we confirmed that FXII is pseudogenized in cetaceans. According to previous studies, a pseudogene conversion by point mutations has led to this protein prediction (Semba et al (1998)). FXII is activated through inorganic molecules (e.g., soil). Cetaceans and birds have little to no contact to soil, therefore, it has been suggested that factor FXII has lost its importance and was subsequently lost (Juang et al (2020)).
FXI and Plasma Kallikrein (PK) are paralogs that have the same domain arrangement: four tandem apple domains (PAN 1) followed by a Trypsin protease domain (Supplementary Table S2). Both PK and FXI are not present in jawless vertebrates, cartilaginous fish and ray-finned fish confirmed by exonerate, synteny and BLAST results. The last ancestral protein of PK and FXI should have appeared before the divergence of lobe-finned fish. It is found that, while PK is present in almost all tetrapods except for the cetaceans, a PK duplication before the divergence of mammals led to the emergence of FXI as discussed by Ponczek et al (2008, 2020). Figure 1 shows the emergence and loss of the studied blood coagulation proteins on the vertebrate evolutionary tree.
In terms of their domain arrangements, most of the blood coagulation cascade proteins are well conserved in all vertebrates. FIX, FX, Protein C, and Protein Z have identical domain arrangements (Gla – EGF – Fxa inhibition – Trypsin). Both, FXI and PK have the domain arrangement PAN 1 – PAN 1 – PAN 1 – PAN 1 – Trypsin. As for FXII, the domain arrangement is fn2 – EGF – fn1 – EGF – Kringle – Trypsin. Plasminogen has the domain arrangement PAN 1 – Kringle – Kringle – Kringle – Kringle – Kringle – Trypsin. Lastly, the domain arrangement of Prothrombin is Gla – Kringle – Kringle – Thrombin light – Trypsin. Even though serine proteases of the cascade are well conserved, cofactors are diverse in terms of the number of repetitions of the domains. FV had a variable number of LSPR domain repeats among different vertebrates, ranging from 24 in species Balaenoptera musculus to 38 in species Physeter catodon, while FVIII has a different number of domain repeats in Copper oxidases (Supplementary Table S3–4).
Arthropod Hemolymph Clotting
We next investigated whether Arthropod hemolymph coagulation cascade proteins and vertebrate blood coagulation cascade proteins share a common ancestor. Unlike the blood coagulation cascade which is highly conserved among vertebrates, there are diverse proteins and cascades in different Arthropod clades. Therefore, we used hemolymph coagulation proteins from different members of Arthropoda to study this question.
The hemolymph coagulation cascade of Limulus polyphemus (Atlantic horseshoe crab), a non-insect Arthropod, was investigated as its coagulation proteins are widely studied. Hemolymph clotting in L. polyphemus is controlled by a proteolytic cascade similar to vertebrates. The cascade is controlled by coagulation factors B, C, and G, Proclotting enzyme, Coagulogen, Transglutaminase, and Coagulation inhibitors (Tanaka et al (1991); Theopold et al (2004); Schmid et al (2019)). Even though the cascade is similar to the vertebrate coagulation cascade in terms of proteolytic activity, proteins involved in the proteolytic cascade of L. polyphemus have completely different domain arrangements from the vertebrate coagulation cascade proteins. None of the blood coagulation cascade proteins share a single domain with these proteolytic proteins in L. polyphemus except for Trypsin, which is a common serine protease domain (Supplementary Table S5).
As for insect Arthropods, we used Drosophila melanogaster as a reference species. Hemolymph clotting in D. melanogaster is controlled by a small set of proteins: Fondue, Hemolectin, Transglutaminases, and Prophenoloxidases (Lindgren et al (2008); Wang et al (2010); Schmid et al (2019)) (Supplementary Table S6).
The vertebrate and the invertebrate coagulation systems consist of different proteins with different domain arrangements. The only exception from this is the Transglutaminase in Arthropods, which has the same domain arrangement as FXIII A chain in vertebrates: Transglut N – Transglut core – Transglut C – Transglut C. Not only do they share the same domain organization, but they both play an important role on increasing the clotting efficacy. While FXIII A chain stabilizes fibrin clots (Byrnes et al (2015)), Transglutaminase in Drosophila melanogaster and Limulus polyphemus stabilizes clotting molecules proxin, stablin, and fondue (Lindgren et al (2008); Dushay (2009)). Therefore, we built a phylogenetic tree using all proteins from horseshoe crab, fruit fly, zebrafish, mouse and human which have the same domain arrangement as transglutaminases (Fig. 2). Similarity of domain arrangements are a potential indication for homology (Grassi et al (2010)). With high bootstrap values (> 94) and same domain arrangements, vertebrate FXIIIA and arthropod transglutaminases share a common ancestor.
Besides the identical domain arrangements of coagulation FXIII subunit A and Transglutaminase, D. melanogaster hemolectin and von Willebrand factor share three domains: VWD, TIL and C8. Therefore, we built a phylogenetic tree of the proteins which have VWD, TIL and C8 domains with shared domains and similar modularity, it is highly possible that these two proteins share a common ancestry. The constructed phylogenetic tree with bootstrap values can be found in Fig. 3.
Origin of Domains and Evolution of Domain Arrangements
Blood coagulation cascade proteins have similar domain arrangements indicating a possible common evolutionary history. Human coagulation factors FIX, FX, Protein C and Protein Z share the same domain arrangement (Gla – EGF – Fxa inhibition – Trypsin), while FVII has Gla – EGF – Trypsin. Besides the above arrangements, Gla—EGF—Fxa inhibition arrangement is also found within the different domain arrangement of Protein S (Gla – EGF – Fxa inhibition – EGF CA – EGF CA – Laminin G 1 – Laminin G 2). To reveal evolutionary histories of these domain arrangements, we performed reciprocal BLASTP searches using samples of FVII and FX from the most basal species. Unfortunately, reciprocal BLASTP searches on the domain level did not result in reciprocal matches in most of the cases. Next, we extended our search including the best three hits of the BLASTP results. BLASTP searches using Petromyzon marinus and Eptatretus burgeri FVII and FX domains against Oikopleura dioica, Ciona intestinalis, and Stylophora pistillata yielded two proteins in O. dioica: E4WZI2 and E4X518 (Fig. 4).
Protein S and FVII (and by extension FIX, FX, Protein C, Protein Z which have a similar domain arrangement as FVII) only differ in their terminal domains. While FVII has serine protease activity provided by its terminal Trypsin domain, Protein S has EGF CA, Laminin G 1 and Laminin G 2 domains in its terminal position. During the evolution of vertebrates, terminal domain loss and gain events may have led to the emergence of Protein S and FVII/FX. Since jawless vertebrates have only FVII and FX with this domain arrangement, all other serine proteases having this domain arrangement (FIX, Protein C, Protein Z) are likely the result of gene duplications. In humans, FVII, FX, and Protein Z are located close to each other on the forward strand of 13th chromosome (2–9 kbp between each of them) which supports the role of gene duplication in the emergence of these serine proteases.
The same strategy was applied to the domains of Prothrombin and FXII. As for Prothrombin, two proteins from Tunicata, ENSCINP00000019224 and E4WTL6 and one protein from Cnidaria, A0A2B4SCC6, were found to be the best hits for each domain, Gla, Kringle and Trypsin, respectively (Fig. 4). The best hit for the Gla domain, ENSCINP00000019224, was found to be orthologous to the best hit of FVII Gla, E4WZI2, indicating a common origin. As for FXII, we performed reciprocal BLASTP searches between a frog (X. tropicalis), our most basal species with FXII, and the lobe-finned fish (L. chalumnae). Two hits were found for Kringle and Trypsin domains, ENSLACP00000015011 (tissue type plasminogen activator) and ENSLACP00000010869 (urokinase plasminogen activator), respectively. It is proposed that FXII arose from duplication of Hepatocyte Growth Factor Activator (HGFAC) (Ponczek et al (2008, 2020)), even though both proteins share almost identical domain arrangements. Considering reciprocal domain matches between frog FXII and coelacanth plasminogen activators, FXII emerged as a result of a duplication event around ∼400 mya after the divergence of tetrapods.
As for FV and FVIII, it was suggested that a single copy of FV is present and FVIII missing in jawless vertebrates (Doolittle (2009)). However, the identification of FV might be a misidentification of a hephaestin/ceruloplasmin-like protein. Reciprocal BLASTP searches using FV, FVIII, Ceruloplasmin and Hephaestin from Danio rerio, Callorhinchus milii, and Homo sapiens as queries against Petromyzon marinus and Eptatretus burgeri proteomes showed that there is not a reciprocally “perfect match” between FV and any jawless vertebrate protein. Hephaestin has a reciprocal match in the lamprey proteome which is also the best hit for FV. Therefore, this lamprey protein is most likely an ortholog of Hephaestin instead of FV. To further support this finding, we built a gene tree using all possible putative orthologs of FV found by BLASTP (71 candidates for E. burgeri and 31 candidates for P. marinus), together with H. sapiens, D. rerio, and C. milii FV, FVIII, hephaestin, and ceruloplasmin. While FV and FVIII from shark, zebrafish and human are clustered together, hephaestins and other jawless vertebrate proteins are found in different clades (Fig. 5). Besides, NCBI BLASTP and TBLASTN searches using human FV against jawless vertebrates yielded P. marinus hephaestin-like protein. To further support, we investigated synteny using the genomic region of FV in D. rerio and C. milii. In both species, FV is found between the genes ccdc80 and gpr161. However, the orthologs of ccdc80 and gpr161 are found in two different scaffolds in jawless vertebrates (FYBX02010127.1 and FYBX02010267.1 in hagfish and GL476438 and GL480261 in lamprey, respectively). (Supplementary Figure S3). Lastly, we performed protein to genome Exonerate runs using human FV against lamprey and hagfish genomes. Even though there is no good alignment that spans most of the FV protein, there are short sequences in jawless vertebrates that align to smaller fragments of FV (see study data). Searches on the current genomic data of jawless vertebrates, do not yield a concrete result. Considering all these analyses, FV is therefore potentially not present in jawless vertebrates but rather emerged in Gnathostomata after the split from Cyclostomata around ∼600 mya. However, since the available jawless vertebrate genomes are in low quality, this finding could be an artifact of the fragmented genomes.
As for von Willebrand factor, tracing the domain arrangement evolution of the protein is more difficult due to its modular structure. Similar to FV and FVIII, von Willebrand factor is also composed of a highly modular domain arrangement, making it similar to Arthropod Hemolectin (Supplementary Table S2, S6). In all studied animal groups, there are similar modular TIL, VWD, and C8 containing proteins. The phylogenetic tree of TIL, VWD, and C8 domains-containing proteins from human, mouse, zebrafish, horseshoe crab, and fruitfly shows homology between vertebrate von Willebrand factor and Arthropod hemolectin. However, to gain a deeper understanding of the evolution of von Willebrand factor and Hemolectin, future studies should include animals deeper in the phylogeny and fungi species, Fusarium sp. and Aspergillus sp., as their proteomes contain more ancestral versions of the aforementioned domains.
Unfortunately, due to the long evolutionary time frame of over ∼600 mya and low sequence similarity, it is difficult to determine possible evolutionary trajectories of blood coagulation cascade protein domains. However, our work here increases the resolution of Doolittle’s work on vertebrate blood coagulation cascade evolution (Doolittle (2009)).
Conclusion
The blood coagulation cascade is very complex and, although some proteins are missing in early vertebrates, the coagulation proteins are structurally well conserved among all vertebrates. While Arthropod hemolymph clotting seems functionally similar to the blood coagulation cascade, only two proteins in the blood coagulation cascade (FXIII subunit A and von Willebrand factor (vWF)) have similar domain arrangements to hemolymph clotting proteins Transglutaminase and Hemolectin, respectively. The remainder of the blood coagulation proteins, including all the key proteins, are found to have different evolutionary histories compared to Arthropod hemolymph clotting proteins.
Here, not only do we show new data on the emergence of FXIII, Protein C and Protein Z, but we also propose possible evolutionary trajectories for vertebrate coagulation factors which were derived from domain similarity assessments. The first appearance of FV could not be exactly defined. Although a previous publication (Doolittle et al (2008)) showed a potential existence in lamprey, our own analyses in hagfish and lamprey showed no clear candidate in the available jawless genomes. We confirm the origins of the blood coagulation proteins in Tunicata and Cnidaria and describe the evolution of the blood coagulation cascade at various levels: Jawless to jawed vertebrates and teleosts to tetrapods. Analyses on domain re-arrangements suggest that gene and/or genome duplications during the evolution of the vertebrates led to the modern-day coagulation cascade proteins.
Data availability
Data belonging to this study (e.g., proteomes, BLASTP results) have been uploaded to zenodo.org: https://doi.org/10.5281/zenodo.6576868
References
Altschul SF, Gish W, Miller W et al (1990) Basic local alignment search tool. J Mol Biol 215(3):403–410. https://doi.org/10.1016/S0022-2836(05)80360-2
Bornberg-Bauer E, Albà MM (2013) Dynamics and adaptive benefits of modular protein evolution. Curr Opinion Struct Biol 23(3):459–466. https://doi.org/10.1016/j.sbi.2013.02.012
Byrnes JR, Duval C, Wang Y et al (2015) Factor XIIIa-dependent retention of red blood cells in clots is mediated by fibrin α-chain crosslinking. Blood 126(16):1940–1948. https://doi.org/10.1182/blood-2015-06-652263
Chaudhry R, Usama SM, Babiker HM (2021) Physiology, coagulation pathways. In: StatPearls. StatPearls Publishing. https://europepmc.org/article/NBK/nbk482253
Cromar G, Wong KC, Loughran N et al (2014) New tricks for “Old” domains: how novel architectures and promiscuous hubs contributed to the organization and evolution of the ECM. Genome Biol Evolution 6(10):2897–2917. https://doi.org/10.1093/gbe/evu228
Davidson C, Hirt R, Lal K et al (2003) Molecular evolution of the vertebrate blood coagulation network. Thromb Haemost 89(03):420–428. https://doi.org/10.1055/s-0037-1613369
Doolittle RF (2009) Step-by-Step evolution of vertebrate blood coagulation. Cold Spring Harbor Symposia on Quantitative Biol 74:35–40. https://doi.org/10.1101/sqb.2009.74.001
Doolittle RF, Jiang Y, Nand J (2008) Genomic evidence for a simpler clotting scheme in jawless vertebrates. J Mol Evolution 66(2):185–196. https://doi.org/10.1007/s00239-008-9074-8
Dushay MS (2009) Insect hemolymph clotting. Cell Mol Life Sci 66(16):2643–2650. https://doi.org/10.1007/s00018-009-0036-0
Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32(5):1792–1797. https://doi.org/10.1093/nar/gkh340
El-Gebali S, Mistry J, Bateman A et al (2019) The Pfam protein families database in 2019. Nucleic Acids Res 47(D1):427–432. https://doi.org/10.1093/nar/gky995
Emms DM, Kelly S (2015) OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. https://doi.org/10.1186/s13059-015-0721-2
Finn RD, Bateman A, Clements J et al (2014) Pfam: the protein families database. Nucl Acids Res 42(D1):222–230. https://doi.org/10.1093/nar/gkt1223
Grassi L, Fusco D, Sellerio A et al (2010) Identity and divergence of protein domain architectures after the yeast whole-genome duplication event. Mol BioSyst 6(11):2305. https://doi.org/10.1039/c003507f
Grover SP, Mackman N (2019) Intrinsic pathway of coagulation and thrombosis. Arteriosclerosis, Thrombosis Vas Biol 39(3):331–338. https://doi.org/10.1161/ATVBAHA.118.312130
Howe KL, Achuthan P, Allen J et al (2021) Ensembl 2021. Nucleic Acids Res 49(D1):D884–D891. https://doi.org/10.1093/nar/gkaa942
Huelsmann M, Hecker N, Springer MS et al (2019) Genes lost during the transition from land to water in cetaceans highlight genomic changes associated with aquatic adaptations. Sci Adv. https://doi.org/10.1126/sciadv.aaw6671
Hunt SE, McLaren W, Gil L et al (2018) Ensembl variation resources. Database. https://doi.org/10.1093/database/bay119
Jiang Y, Doolittle RF (2003) The evolution of vertebrate blood coagulation as viewed from a comparison of puffer fish and sea squirt genomes. Proc Natl Acad Sci 100(13):7527–7532
Johnson M, Zaretskaya I, Raytselis Y et al (2008) NCBI BLAST: a better web interface. Nucl Acids Res. https://doi.org/10.1093/nar/gkn201
Juang LJ, Mazinani N, Novakowski SK et al (2020) Coagulation factor XII contributes to hemostasis when activated by soil in wounds. Blood Adv 4(8):1737. https://doi.org/10.1182/bloodadvances.2019000425
Kimura A, Ikeo K, Nonaka M (2009) Evolutionary origin of the vertebrate blood complement and coagulation systems inferred from liver EST analysis of lamprey. Develop Comparative Immunol 33(1):77–87. https://doi.org/10.1016/j.dci.2008.07.005
Klasberg S, Bitard-Feildel T, Callebaut I et al (2018) Origins and structural properties of novel and ¡i¿de novo¡/i¿ protein domains during insect evolution. The FEBS J 285(14):2605–2625. https://doi.org/10.1111/febs.14504
Krem MM, Cera ED (2002) Evolution of enzyme cascades from embryonic development to blood coagulation. Trends Biochem Sci 27(2):67–74. https://doi.org/10.1016/S0968-0004(01)02007-2
Lindgren M, Riazi R, Lesch C et al (2008) Fondue and transglutaminase in the Drosophila larval clot. J Insect Physiol 54(3):586–892. https://doi.org/10.1016/j.jinsphys.2007.12.008
Minh BQ, Schmidt HA, Chernomor O et al (2020) IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evolution 37(5):1530–1534. https://doi.org/10.1093/molbev/msaa015
Patthy L (1985) Evolution of the proteases of blood coagulation and fibrinolysis by assembly from modules. Cell 41(3):657–663. https://doi.org/10.1016/S0092-8674(85)80046-5
Patthy L (1990) Evolution of blood coagulation and fibrinolysis. Blood Coag Fibrinol 1:153–166
Ponczek MB, Gailani D, Doolittle RF (2008) Evolution of the contact phase of vertebrate blood coagulation. J Thrombosis Haemostasis 6(11):1876–1883. https://doi.org/10.1111/j.1538-7836.2008.03143.x
Ponczek MB, Bijak MZ, Nowak PZ (2012) Evolution of thrombin and other hemostatic proteases by survey of protochordate, hemichordate, and echinoderm genomes. J Mol Evol 74(5–6):319–331. https://doi.org/10.1007/s00239-012-9509-0
Ponczek MB, Shamanaev A, LaPlace A et al (2020) The evolution of factor XI and the kallikrein-kinin system. Blood Adv 4(24):6135–6147. https://doi.org/10.1182/bloodadvances.2020002456
Rambaut A. 2014. FigTree v1.4.4, A Graphical Viewer of Phylogenetic Trees. Available from http://tree.bio.ed.ac.uk/software/figtree/
Robinson AJ, Kropatkin M, Aggeler PM (1969) Hageman factor (Factor XII) deficiency in marine mammals. Science 166(3911):1420–1422. https://doi.org/10.1126/science.166.3911.1420
Schmid MR, Dziedziech A, Arefin B et al (2019) Insect hemolymph coagulation: kinetics of classically and non-classically secreted clotting factors. Insect Biochem Mol Biol. https://doi.org/10.1016/j.ibmb.2019.04.007
Semba U, Shibuya Y, Okabe H et al (1998) Whale Hageman Factor (Factor XII). Thrombosis Res 90(1):31–37. https://doi.org/10.1016/S0049-3848(97)00307-1
Slater G, Birney E (2005) Automated generation of heuristics for biological sequence comparison. BMC Bioinformat 6(1):31. https://doi.org/10.1186/1471-2105-6-31
Smith SA, Travers RJ, Morrissey JH (2015) How it all starts: Initiation of the clotting cascade. Critical Rev Biochem Mol Biol 50(4):326–336. https://doi.org/10.3109/10409238.2015.1050550
Tanaka S, Aketagawa J, Takahashi S et al (1991) Activation of a limulus coagulation factor G by (1 → 3)-g-d-glucans. Carbohyd Res. https://doi.org/10.1016/0008-6215(91)84095-V
Theopold U, Li D, Fabbri M et al (2002) The coagulation of insect hemolymph. Cell Mol Life Sci CMLS 59(2):363–372. https://doi.org/10.1007/s00018-002-8428-4
Theopold U, Schmidt O, S ̈oderh ̈all K, et al (2004) Coagulation in arthropods: defence, wound closure and healing. Trends in Immunol 25(6):289–294. https://doi.org/10.1016/j.it.2004.03.004
UniProt Consortium T (2018) UniProt: the universal protein knowledgebase. Nucleic Acids Res 46(5):2699. https://doi.org/10.1093/nar/gky092
Vasin A, Klotchenko S, Puchkova L (2013) Phylogenetic analysis of six-domain multi-copper blue proteins. PLoS Currents. https://doi.org/10.1371/currents.tol.574bcb0f133fe52835911abc4e296141
Wang Z, Wilhelmsson C, Hyrsl P et al (2010) Pathogen entrapment by transglutaminase—A conserved early innate immune mechanism. PLoS Pathog 6(2):763. https://doi.org/10.1371/journal.ppat.1000763
Acknowledgements
We thank Nicolas Arning for providing insights into the blood clotting cascade.
Funding
Open Access funding enabled and organized by Projekt DEAL. AC was supported by the Republic of Turkey Ministry of National Education (MoNE1416/YLSY).
Author information
Authors and Affiliations
Corresponding author
Additional information
Handling editor: Darin Rokyta.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Coban, A., Bornberg-Bauer, E. & Kemena, C. Domain Evolution of Vertebrate Blood Coagulation Cascade Proteins. J Mol Evol 90, 418–428 (2022). https://doi.org/10.1007/s00239-022-10071-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-022-10071-3