Abstract
Mammalian blood clotting involves numerous components, most of which are the result of gene duplications that occurred early in vertebrate evolution and after the divergence of protochordates. As such, the genomes of the jawless fish (hagfish and lamprey) offer the best possibility for finding systems that might have a reduced set of the many clotting factors observed in higher vertebrates. The most straightforward way of inventorying these factors may be through whole genome sequencing. In this regard, the NCBI Trace database (http://www.ncbi.nlm.nih.gov/Traces/trace.cgi) for the lamprey (Petromyzon marinus) contains more than 18 million raw DNA sequences determined by whole-genome shotgun methodology. The data are estimated to be about sixfold redundant, indicating that coverage is sufficiently complete to permit judgments about the presence or absence of particular genes. A search for 20 proteins whose sequences were determined prior to the trace database study found all 20. A subsequent search for specified coagulation factors revealed a lamprey system with a smaller number of components than is found in other vertebrates in that factors V and VIII seem to be represented by a single gene, and factor IX, which is ordinarily a cofactor of factor VIII, is not present. Fortuitously, after the completion of the survey of the Trace database, a draft assembly based on the same database was posted. The draft assembly allowed many of the identified Trace fragments to be linked into longer sequences that fully support the conclusion that lampreys have a simpler clotting scheme compared with other vertebrates. The data are also consistent with the hypothesis that a whole-genome duplication or other large scale block duplication occurred after the divergence of jawless fish from other vertebrates and allowed the simultaneous appearance of a second set of two functionally paired proteins in the vertebrate clotting scheme.
Similar content being viewed by others
Notes
The search of the factor V B domain actually detected a large number of almost-perfect tandem 27-nt repeats in the lamprey Trace database. Although both human and mouse factor V have 30 imperfect copies of these tandem nine-amino acid repeats, none occurs in the factor V sequences of chicken or puffer fish; this coincidental but perhaps chance similarity does not bear directly on the problem at hand.
One of the Trace sequences (G52 in Fig. 2; Trace ID 1446326143) exhibited a remarkable 77% identity to a 34-residue segment of human factor VIII (26 identities among the 34 residues) and may be a contaminant. The corresponding region from puffer fish is only 44% identical to the human segment. The same region from chicken is coincidentally 77% identical to the human sequence, but the eight differences are not the same as observed in the lamprey sequence.
Initially we reported that there were 19 GLA domains in the puffer fish (Jiang and Doolittle 2003), but some redundant sequences have apparently been removed from updated versions of that database, and we now count 16.
References
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Banfield DK, MacGillvray RTA (1992) Partial characterization of vertebrate prothrombin cDNAs:amplification and sequence analysis of the B chain of thrombin from nine different species. Proc Natl Acad Sci USA 89:2779–2783
Bohonus VL, Doolittle RF, Pontes M, Strong DD (1986) Complementary DNA sequence of lamprey fibrinogen β chain. Biochemistry 25:6512–6516
Carroll RL (1988) Vertebrate paleontology and evolution. W. H. Freeman, New York
Cripe LD, Moore KD, Kane WH (1992) Structure of the gene for human coagulation factor V. Biochemistry 31:3777–3785
Daimon M, Yamatani K, Igarashi M, Fukase N, Kawanami T, Kato T, Tominaga M, Sasaki H (1995) Fine structure of the human ceruloplasmin gene. Biochem Biophys Res Commun 208:1028–1035
Davidson CJ, Hirt RP, Lal K, Elgar G, Tuddenham EGD, McVey JH (2003a) Molecular evolution of the vertebrate blood coagulation network. J Thromb Haemost 1:1487–1494
Davidson CJ, Tuddenham EG, McVey JH (2003b) 450 Million years of hemostasis. Thromb Haemost 89:420–428
Dehal P, Boore JL (2005) Two rounds of whole genome duplication in the ancestral vertebrate. PLOS Biol 3:e314
Doolittle RF (1961) The comparative biochemistry of blood coagulation. Ph.D. dissertation. Harvard University
Doolittle RF (1987) Of Urfs and Orfs: a primer on how to analyze derived amino acid sequences. University Science Press, Mill Valley, CA
Doolittle RF (1993) The evolution of vertebrate blood coagulation: a case of Yin and Yang. Thromb Haemostasis 70:24–28
Doolittle RF, Feng DF (1987) Reconstructing the evolution of vertebrate blood coagulation from a consideration of the amino acid sequences of clotting proteins. Cold Spring Harbor Symp Quant Biol 52:869–874
Doolittle RF, Feng D-F (1990) Nearest neighbor procedure for relating progressively aligned amino acid sequences. Methods Enzymol 183:659–669
Doolittle RF, Surgenor DM (1962) Blood coagulation in fish. Am J Physiol 203:964–970
Doolittle RF, Oncley JL, Surgenor DM (1962) Species differences in the interaction of thrombin and fibrinogen. J Biol Chem 237:3123–3127
Escriva H, Manzon L, Youson J, Laudet V (2002) Analysis of lamprey and hagfish genes reveals a complex history of gene duplications during early vertebrate evolution. Mol Biol Evol 19:1440–1450
Felsenstein J (1989) PHYLIP—Phylogeny Inference Package (version 3.2). Cladistics 5:164–166
Feng D-F, Doolittle RF (1996) Progressive alignment and phylogenetic tree construction of protein sequences. Methods Enzymol 266:368–372
Gregory TR (2005) Animal Genome Size Database. Available at: http://www.genomesize.com
Hanumanthaiah R, Day K, Jagadeeswaran P (2002) Comprehensive analysis of blood coagulation pathways in Teleostei: evolution of coagulation factor genes and identification in zebrafish factor VIIi. Blood Cells Molecules Dis 29:57–68
Hellman NE, Gitlin JD (2002) Ceruloplasmin metabolism and function. Annu Rev Nutr 22:439–458
Henikoff JG, Henikoff S (1996) Blocks database and its applications. Methods Enzymol 266:88–105
Hughes AL, da Silva J, Friedman R (2001) Ancient genome duplications did not structure the human Hox-bearing chromosomes. Genome Res 11:771–780
Ichinose A, Takeya H, Esping E, Iwanaga S, Kisiel W, Davie EW (1990) Amino acid sequence of human protein Z, a vitamin K-dependent plasma glycoprotein. Biochem Biophys Res Commun 173:1139–1144
Ingram VM (1963) The hemoglobins in genetics and evolution. Columbia University Press, New York
Jagadeeswaran P, Gregory M, Zhou Y, Zon L, Padmanabhan K, Hanumanthaiah R (2000) Characterization of zebrafish full-length prothrombin cDNA and linkage group mapping. Blood Cells Molecules Dis 26:479–489
Jiang Y, Doolittle RF (2003) The evolution of vertebrate blood coagulation as viewed from a comparison of puffer fish and sea squirt genomes. Proc Natl Acad Sci USA 100:7527–7532
Kulman JD, Harris JE, Haldeman BA, Davie EW (1997) Primary structure and tissue distribution of two novel proline-rich γ-carboxyglutamic acid proteins. Proc Natl Acad Sci USA 94:9058–9062
Kulman JD, Harris JE, Nakazawa N, Ogasawara M, Satake M, Davie EW (2006) Vitamin K-dependent proteins in Ciona intestinalis, a basal chordate lacking a blood coagulation cascade. Proc Natl Acad Sci USA 103:15794–15799
Laize V, Martel P, Viegas CSB, Price PA, Cancela ML (2005) Evolution of matrix and bone gamma-carboxyglutamic acid proteins in vertebrates. J Biol Chem 280:26659–26668
Manfioletti G, Brancolini C, Avanzi G, Schneider C (1993) The protein encoded by a growth arrest-specific gene (gas6) is a new member of the vitamin K-dependent proteins related to protein S, a negative co-regulator in the blood coagulation cascade. Mol Cell Biol 13:4976–4985
Ohno S (1970) Evolution by gene duplication. Springer-Verlag, New York
Pan Y (1992) Studies on fibrinogen evolution. Ph.D. dissertation, University of California, San Diego
Pan Y, Doolittle RF (1992) cDNA sequence of a second fibrinogen α chain: an archetypal version alignable with full-length β and γ chains. Proc Natl Acad Sci USA 89:2066–2070
Panopoulou G, Hennig S, Groth D, Krause A, Poustka A, Herwig R, Vingron M, Lehrach H (2003) New evidence for genome-wide duplications at the origin of vertebrates using an amphioxus gene set and completed animal genomes. Genome Res 13:1056–1066
Price PA, Otsuka AA, Poser JW, Kristaponis J, Raman N (1976) Characterization of a gamma-carboxyglutamic acid-containing protein from bone. Proc Natl Acad Sci USA 73:1447–1451
Saitou N, Nei M (1987) The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 4:406–425
Sharman AC, Hay-Schmidt A, Holland PWH (1997) Cloning and analysis of an HMG gene from the lamprey Lampetra fluviatilis:gene duplication in vertebrate evolution. Gene 184:99–105
Sheehan J, Temple M, Gregory M, Hanumanthaiah R, Troyer D, Phan T, Thankavel B, Jagadeeswaran P (2001) Demonstration of the extrinsic coagulation pathway in teleostei: identification of zebrafish coagulation factor VII. Proc Natl Acad Sci USA 98:8768–8773
Sidow A (1996) Gen(om)e duplications in the evolution of early vertebrates. Curr Opin Gen Dev 6:715–722
Smith NGC, Knight R, Hurst LD (1999) Vertebrate genome evolution: A slow shuffle or a big bang?. BioEssays 21:697–703
Strong DD, Moore M, Cottrell BA, Bohonus VL, Pontes M, Evans B, Riley M, Doolittle RF (1985) Lamprey fibrinogen γ chain: cloning, cDNA sequencing and general characterization. Biochemistry 24:92–101
Syed BA, Beaumont NJ, Pate A, Maylor CE, Bayele HK, Joannu CL, Rowe PS, Evans RW, Srai SK (2002) Analysis of the human hephaestin gene and protein: comparative modelling of the N-terminus ecto-domain based upon ceruloplasmin. Prot Eng 15:205–214
Wang Y-Z, Patterson J. Gray JE, Yu C, Cottrell BA. Shimizu A, Graham D, Riley M, Doolittle RF (1989) Complete sequence of the lamprey fibrinogen α chain. Biochemistry 28:9801–9806
Zhang P, Gu Z, Li W-H (2006) Different evolutionary patterns between young duplicate genes in the human genome. Genome Biol 4:R56
Acknowledgments
This work was supported in part by National Institutes of Health Grant HL-81553. We thank Steve Culbertson for assistance in identifying mate-pairs. We are also are grateful to Jonathan Gitlin (Washington University, St. Louis) for sharing his unpublished findings on lamprey ceruloplasmin and to Steve Sommer (City of Hope) for unpublished cDNA sequences of two vitamin K-dependent proteases from lamprey.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Doolittle, R.F., Jiang, Y. & Nand, J. Genomic Evidence for a Simpler Clotting Scheme in Jawless Vertebrates. J Mol Evol 66, 185–196 (2008). https://doi.org/10.1007/s00239-008-9074-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-008-9074-8