Journal of Molecular Evolution

, Volume 36, Issue 1, pp 41–66 | Cite as

Evolution of protein complexity: The blue copper-containing oxidases and related proteins

  • Lars G. Rydén
  • Lois T. Hunt


The blue copper proteins and their relatives have been compared by sequence alignments, by comparison of three-dimensional structures, and by construction of phylogenetic trees. The group contains proteins varying in size from 100 residues to over 2,300 residues in a single chain, containing from zero to nine copper atoms, and with a broad variation in function ranging from electron carrier proteins and oxidases to the blood coagulation factors V and VIII. Difference matrices show the sequence difference to be over 90% for many pairs in the group, yet alignment scores and other evidence suggest that they all evolved from a common ancestor. We have attempted to delineate how this evolution took place and in particular to define the mechanisms by which these proteins acquired an ever-increasing complexity in structure and function. We find evidence for six such mechanisms in this group of proteins: domain enlargement, in which a single domain increases in size from about 100 residues up to 210; domain duplication, which allows for a size increase from about 170 to about 1,000 residues; segment elongation, in which a small segment undergoes multiple successive duplications that can increase the chain size 50-fold; domain recruitment, in which a domain coded elsewhere in the genome is added on to the peptide chain; subunit formation, to form multisubunit proteins; and glycosylation, which in some cases doubles the size of the protein molecule. Size increase allows for the evolution of new catalytic properties, in particular the oxidase function, and for the formation of coagulation factors with multiple interaction sites and regulatory properties. The blood coagulation system is examined as an example in which a system of interacting proteins evolved by successive duplications of larger parts of the genome. The evolution of size, functionality, and diversity is compared with the general question of increase in size and complexity in biology.

Key words

Blue copper proteins Blue oxidases Coagulation factors Discoidins Domain evolution Evolutionary mechanisms Phylogeny 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Adman ET, Stenkamp RE, Sieker LC, Jensen LH (1978) A crystallographic model for azurin at 3 Å resolution. J Mol Biol 123:35–47CrossRefPubMedGoogle Scholar
  2. Adman ET, Turley S, Bramson R, Petratos K, Banner D, Tsernoglou D, Beppu T, Watanabe H (1989) A 2.0-Å structure of the blue copper protein (cupredoxin) fromAlcaligenes faecalis S-6. J Biol Chem 264:87–99PubMedGoogle Scholar
  3. Aitken A (1975) Prokaryote-eukaryote relationships and the amino acid sequence of plastocyanin fromAnabaena variabilis. Biochem J 149:675–683PubMedGoogle Scholar
  4. Ambler RP (1971) Sequence data acquisition for the study of phylogeny. In: Previero A, Pechere J-F, Coletti-Previero M-A (eds) Recent developments in the chemical study of protein structures. INSERM, Paris, pp 289–305Google Scholar
  5. Ambler RP, Tobari J (1985) The primary structures ofPseudomonas AM1 amicyanin and pseudoazurin. Two new sequence classes of blue copper proteins. Biochem J 232:451–457PubMedGoogle Scholar
  6. Astolfi P, Kidd KK, Cavalli-Sforza LL (1981) A comparison of methods for reconstructing evolutionary trees. Syst Zool 30:156–169CrossRefGoogle Scholar
  7. Barker WC, Dayhoff MO (1979) Evolutionary trees from protein sequences: probability of correct reconstruction of fourbranch topologies. Biophysical J 25:158aGoogle Scholar
  8. Barker WC, Hunt LT, George DG (1988) Homology and similarity in amino acid sequences: detection, analysis, and interpretation. In: Silverman BD (ed) Computer simulation of carcinogenic processes. CRC Press, Boca Raton, FL, pp 1–36Google Scholar
  9. Barker WC, George DG, Hunt LT (1990) Protein sequence database. In: Doolittle RF (ed) Methods in enzymology, Vol 183. Academic Press, New York, pp. 31–49.Google Scholar
  10. Bergman C, Gandvik E-K, Nyman PO, Strid L (1977) The amino acid sequence of stellacyanin from the lacquer tree. Biochem Biophys Res Commun 77:1052–1059CrossRefGoogle Scholar
  11. Bergman C (1980) Amino acid sequences studies of two small blue proteins: stellacyanin and umecyanin. PhD thesis (Department of Biochemistry and Biophysics), University of Göteborg, SwedenGoogle Scholar
  12. Blanken RL, Klotz LC, Hinnebusch AG (1982) Computer comparison of new and existing criteria for constructing evolutionary trees from sequence data. J Mol Evol 19:9–19CrossRefPubMedGoogle Scholar
  13. Bonner JT (1988) The evolution of complexity by means of natural selection. Princeton University Press, Princeton, NJGoogle Scholar
  14. Chou PY, Fasman GD (1978a) Empirical predictions of protein conformation. Annu Rev Biochem 47:251–276CrossRefPubMedGoogle Scholar
  15. Chou PY, Fasman GD (1978b) Prediction of the secondary structure of proteins from their amino acid sequence. Adv Enzymol 47:45–148PubMedGoogle Scholar
  16. Cole ST, Grundstrom T, Jaurin B, Robinson JJ, Weiner JH (1982) Location and nucleotide sequence of frdB, the gene coding for the iron-sulphur protein subunit of the fumarate reductase ofEscherichia coli. Eur J Biochem 126:211–216CrossRefPubMedGoogle Scholar
  17. Darlison MG, Guest JR (1984) Nucleotide sequence encoding the iron-sulphur protein subunit of the succinate dehydrogenase ofEscherichia coli. Biochem J 223:507–517PubMedGoogle Scholar
  18. Dayhoff MO, Barker WC, Hunt LT (1983) Establishing homologies in protein sequences. In: Hirs CHW, Timasheff SN (eds) Methods in enzymology, vol 91. Academic Press, New York, pp 524–545Google Scholar
  19. Fitch WM, Margoliash E (1967) Construction of phylogenetic trees: a method based on mutation distances as estimated from cytochrome c sequences is of general applicability. Science 155:279–284PubMedGoogle Scholar
  20. George DG, Hunt LT, Barker WC (1988) The Protein Identification Resource (PIR) software. In: Lesk AM (ed) Computational molecular biology: sources and methods for sequence analysis. Oxford University Press, Oxford, pp 100–115Google Scholar
  21. George DG, Barker WC, Hunt LT (1990) Mutation data matrix and its uses. In: Doolittle RF (ed) Methods in enzymology, vol 183. Academic Press, New York, pp 333–351Google Scholar
  22. Germann UA, Müller G, Hunziker PE, Lerch K (1988) Characterization of two allelic forms ofNeurospora crassa laccase. Amino- and carboxyl-terminal processing of a precursor. J Biol Chem 263:885–896PubMedGoogle Scholar
  23. Goodman M, Romero-Herrera AE, Dene H, Czelusniak J, Tashian RE (1982) Amino acid sequence evidence on the phylogeny of primates and other eutherians. In: Goodman M (ed) Macromolecular sequences in systematic and evolutionary biology. Plenum Press, New York, pp 115–191Google Scholar
  24. Guss JM, Freeman HC (1983) Structure of oxidized poplar plastocyanin at 1.6 Å resolution. J Mol Biol 169:521–563PubMedGoogle Scholar
  25. Guss JM, Merritt EA, Phizackerley RP, Hedman B, Murata M, Hodgson KO, Freeman HC (1988) Phase determination by multiple-wavelength x-ray diffraction: crystal structure of a basic “blue” copper protein from cucumbers. Science 241:806–811PubMedGoogle Scholar
  26. Han T-M, Runnegar B (1992) Megascopic eukaryotic algae from the 2.1-billion-year-old Negaunee Iron-Formation, Michigan. Science 257:232–235PubMedGoogle Scholar
  27. Hasegawa M, Kishino H, Siatou N (1991) On the maximum likelihood method in molecular phylogenetics. J Mol Evol 32:443–445CrossRefPubMedGoogle Scholar
  28. Hedner U, Davie EW (1989) Introduction to hemostasis and the vitamin K-dependent coagulation factors. In: Scriver CR, Beaudet AL, Sly WS, Valle D (eds) Metabolic basis of inherited disease, ed 6, vol 2. McGraw-Hill, New York, pp 2107–2134Google Scholar
  29. Holm L, Saraste M, Wilkström M (1987) Structural models of the redox centres in cytochrome oxidase. EMBO J 6:2819–2823PubMedGoogle Scholar
  30. Hormel S, Adman E, Walsh KA, Beppu T, Titani K (1986) The amino acid sequence of the blue copper protein ofAlcaligenes faecalis. FEBS Lett 197:301–304CrossRefPubMedGoogle Scholar
  31. Hunt LT, George DG, Yeh L-SL (1985) Ragweed allergen Ra3: relationship to some type I copper-binding proteins. J Mol Evol 21:126–132CrossRefGoogle Scholar
  32. Jenny RJ, Pittman DD, Toole JJ, Kriz RW, Aldape RA, Hewick RM, Kaufman RJ, Mann KG (1987) Complete cDNA and derived amino acid sequence of human factor V. Proc Natl Acad Sci USA 84:4846–4850PubMedGoogle Scholar
  33. Kane WH, Davie EW (1986) Cloning of a cDNA coding for human factor V, a blood coagulation factor homologous to factor VIII and ceruloplasmin. Proc Natl Acad Sci USA 83:6800–6804PubMedGoogle Scholar
  34. Kane WH, Davie EW (1988) Blood coagulation factors V and VIII: structural and functional similarities and their relationship to hemorrhagic and thrombotic disorders. Blood 71:539–555PubMedGoogle Scholar
  35. Kane WH, Ichinose A, Hagen FS, Davie EW (1987) Cloning of cDNAs coding for the heavy chain region and connecting region of human factor V, a blood coagulation factor with four types of internal repeats. Biochemistry 26:6508–6514CrossRefPubMedGoogle Scholar
  36. Kanehisa M (1985) IDEAS. Integrated database and extended analysis system for nucleic acids and proteins. User manual. Laboratory of Mathematical Biology, NIH, Bethesda, MDGoogle Scholar
  37. Knoll AH (1992) The early evolution of eukaryotes: a geological perspective. Science 256:622–627PubMedGoogle Scholar
  38. Koschinsky ML, Funk WD, van Oost BA, MacGillivray RTA (1986) Complete cDNA sequence of human preceruloplasmin. Proc Natl Acad Sci USA 83:5086–5090PubMedGoogle Scholar
  39. Kraulis PJ (1991) MolScript—a program to produce both detailed and schematic plots of protein structure. J Appl Crystallogr 24:946–950CrossRefGoogle Scholar
  40. Malkin R, Malmström BG (1970) The state and function of copper in biological systems. Adv Enzymol 33:177–243PubMedGoogle Scholar
  41. Messerschmidt A, Rossi A, Ladenstein R, Huber R, Bolognesi M, Gatti G, Marchesini A, Petruzzelli R, Finazzi-Agró A (1989) X-ray crystal structure of the blue oxidase ascorbate oxidase from zucchini. Analysis of the polypeptide fold and a model of the copper sites and ligands. J Mol Biol 206:513–529CrossRefPubMedGoogle Scholar
  42. Messerschmidt A, Huber R (1990) The blue oxidases, ascorbate oxidase, laccase, and ceruloplasmin. Modelling and structural relationships. Eur J Biochem 187:341–352CrossRefPubMedGoogle Scholar
  43. Mondovi B, Avigliano L (1984) Ascorbate oxidase. In: Lontie R (ed) Copper proteins and copper enzymes, vol 3, CRC Press, Boca Raton, FL pp 101–118Google Scholar
  44. Murata M, Begg GS, Lambrou F, Leslie B, Simpson RJ, Freeman HC, Morgan FJ (1982) Amino acid sequence of a basic blue protein from cucumber seedlings. Proc Natl Acad Sci USA 79:6434–6437PubMedGoogle Scholar
  45. Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48:443–453CrossRefPubMedGoogle Scholar
  46. Norris GE, Anderson BF, Baker EN (1983) Structure of azurin fromAlcaligenes denitrificans at 2.5 Å resolution. J Mol Biol 165:501–521PubMedGoogle Scholar
  47. Ohkawa J, Okada N, Shinmyo A, Takano M (1989) Primary structure of cucumber (Cucumis sativus) ascorbate oxidase deduced from cDNA sequence: homology with blue copper proteins and tissue-specific expression. Proc Natl Acad Sci USA 86:1239–1243PubMedGoogle Scholar
  48. Omoto E, Tavassoli M (1990) Purification and partial characterization of ceruloplasmin receptors from rat liver endothelium. Arch Biochem Biophys 282:34–38CrossRefPubMedGoogle Scholar
  49. Patthy L (1985) Evolution of the proteases of blood coagulation and fibrinolysis by assembly from modules. Cell 41:657–663CrossRefPubMedGoogle Scholar
  50. Poole S, Firtel RA, Lamar E, Rowekamp W (1981) Sequence and expression of the discoidin I gene family inDictyostelium discoideum. J Mol Biol 153:273–289CrossRefPubMedGoogle Scholar
  51. Reinhammar B (1984) Laccase. In: Lontie R (ed) Copper proteins and copper enzymes, vol 3, CRC Press, Boca Raton, FL, pp 1–35Google Scholar
  52. Rogers S, Wells R, Rechsteiner M (1986) Amino acid sequences common to rapidly degraded proteins: the PEST hypothesis. Science 234:364–368PubMedGoogle Scholar
  53. Rydén L (1982) Model of the active site in the blue oxidases based on the ceruloplasmin-plastocyanin homology. Proc Natl Acad Sci USA 79:6767–6771PubMedGoogle Scholar
  54. Rydén L (1984a) Structure and evolution of the small blue proteins. In: Lontie R (ed) Copper proteins and copper enzymes, vol 1, CRC Press, Boca Raton, FL, pp 183–214Google Scholar
  55. Rydén L (1984b) Ceruloplasmin. In: Lontie R (ed) Copper proteins and copper enzymes, vol 3. CRC Press, Boca Raton, FL, pp 37–100Google Scholar
  56. Rydén L (1988) Evolution of blue copper proteins. In: King TP, Mason H (eds) Oxidases and related redox systems. Alan R Liss, New York, pp 349–366Google Scholar
  57. Rydén L, Lundgren J-O (1976) Homology relationships among the small blue proteins. Nature 261:344–346CrossRefPubMedGoogle Scholar
  58. Rydén L, Lundgren J-O (1979) On the evolution of blue proteins. Biochimie 61:781–790PubMedGoogle Scholar
  59. Rydén L, Lundgren J-O (1980) The relationship of the small blue proteins with the copper-containing oxidases. In: Peeters H (ed) Protides of the biological fluids, vol 28. Pergamon Press, Oxford, pp 87–90Google Scholar
  60. Saitou N, Imanishi T (1989) Relative efficiencies of the Fitch-Margoliash, maximum-parsimony, maximum-likelihood, minimum-evolution, and neighbor-joining methods of phylogenetic tree construction in obtaining the correct tree. Mol Biol Evol 6:514–525Google Scholar
  61. Saloheimo M, Niku-Paavola M-L, Knowles JKC (1991) Isolation and structural analysis of the laccase gene from the lignindegrading fungusPhlebia radiata. J Gen Microbiol 137:1537–1544PubMedGoogle Scholar
  62. Saraste M (1990) Structural features of cytochrome oxidase. Q Rev Biophys 23:331–366PubMedCrossRefGoogle Scholar
  63. Schopf JW (1978) The evolution of the earliest cells. Sci Am 239:111–140CrossRefGoogle Scholar
  64. Sourdis J, Nei M (1988) Relative efficiencies of the maximum parsimony and distance-matrix methods in obtaining the correct phylogenetic tree. Mol Biol Evol 5:298–311PubMedGoogle Scholar
  65. Stubbs JD, Lekutis C, Singer KL, Bui A, Yuzuki D, Srinivasan U, Parry G (1990) cDNA cloning of a mouse mammary epithelial cell surface protein reveals the existence of epidermal growth factor-like domains linked to factor VIII-like sequences. Proc Natl Acad Sci USA 87:8417–8421PubMedGoogle Scholar
  66. Takahashi N, Ortel TL, Putnam FW (1984) Single-chain structure of human ceruloplasmin: the complete amino acid sequence of the whole molecule. Proc Natl Acad Sci USA 81:390–394PubMedGoogle Scholar
  67. Trackman PC, Pratt AM, Wolanski A, Tang S-S, Offner GD, Troxler RF, Kagan HM (1990) Cloning of rat aorta lysyl oxidase cDNA: complete codons and predicted amino acid sequence. Biochemistry 29:4863–4870CrossRefPubMedGoogle Scholar
  68. Vehar GA, Keyt B, Eaton D, Rodriguez H, O'Brien DP, Rotblat F, Oppermann H, Keck R, Wood WI, Harkins RN, Tuddenham EGD, Lawn RM, Capon DJ (1984) Structure of human factor VIII. Nature 312:337–342CrossRefPubMedGoogle Scholar
  69. Williams AF, Barclay AN (1988) The immunoglobulin superfamily: domains for cell surface recognition. Ann Rev Immunol 6:381–405Google Scholar
  70. Wood WI, Capon DJ, Simonsen CC, Eaton DL, Gitschier J, Keyt B, Seeburg PH, Smith DH, Hollingshead P, Wion KL, Delwart E, Tuddenham EGD, Vehar GA, Lawn RM (1984) Expression of active human factor VIII from recombinant DNA clones. Nature 312:330–337CrossRefPubMedGoogle Scholar
  71. Young CL, Barker WC, Tomaselli CM, Dayhoff MO (1979) Serine proteases. In: Dayhoff MO (ed) Atlas of protein sequence and structure, vol 5, suppl 3. National Biomedical Research Foundation, Washington, DC, pp 73–93Google Scholar

Copyright information

© Springer-Verlag New York Inc 1993

Authors and Affiliations

  • Lars G. Rydén
    • 1
  • Lois T. Hunt
    • 2
  1. 1.Department of BiochemistryUppsala University, Uppsala Biomedical CenterUppsalaSweden
  2. 2.National Biomedical Research FoundationGeorgetown University Medical CenterWashington DCUSA

Personalised recommendations