Visualization of Genetic Drift Processes Using the Conserved Collagen 1α1 GXY Domain

Abstract

Speciation proceeds by the accumulation of DNA differences in time. The genetic code changes as a result of genetic drift and by selective pressure. In variable domains, exposure to high selective pressure obscures the view on background mutations. Therefore, we characterized and visualized background mutations using the highly conserved collagen 1α1 GXY domain. Typical change routes were identified and the data set showed several indications that changes in the collagen 1α1 GXY domain have taken place randomly within a functionally restricted space. The types of nucleotide and codon group differences are similar across the vertebrate subphylum and gradually become less functionally neutral with increasing distance between species, which offers the opportunity for rapid visualization of evolutionary relations using a single domain. It was concluded that the findings and approach of the study could be important for analytical method development in authenticity research, especially when conserved domains are targeted.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

References

  1. Barbezange C, Jones L, Blanc H, Isakov O, Celniker G, Enouf V, Shomron N, Vignuzzi M, van der Werf S (2018) Seasonal genetic drift of human influenza A virus quasispecies revealed by deep sequencing. Front Microbiol 9:2596

    Article  PubMed  PubMed Central  Google Scholar 

  2. Bellamy G, Bornstein P (1971) Evidence for procollagen, a biosynthetic precursors of collagen. Proc Natl Acad Sci USA 68:1138–1142

    Article  CAS  PubMed  Google Scholar 

  3. Chen L, Liu P, Evans TC, Ettwiller LM (2017) DNA damage is a pervasive cause of sequencing errors, directly confounding variant identification. Science 355:752–756

    Article  CAS  PubMed  Google Scholar 

  4. Darwin C (1859) On the origin of species by means of natural selection, or the preservation of favoured races in the struggle for life. John Murray, Albemarle street, London: printed by W. Clowes and Sons, Stamford street, and Charing Cross, London

    Book  Google Scholar 

  5. Dawson LF, Valiente E, Wren BW (2009) Clostridium difficile—a continually evolving and problematic pathogen. Infect Genet Evol 9:1410–1417

    Article  CAS  PubMed  Google Scholar 

  6. Delport W, Poon AF, Frost SD, Kosakovsky Pond SL (2010) Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics 26(19):2455–2457

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Di Lullo GA, Sweeney SM, Körkkö J, Ala-Kokko L, San Antonio JD (2002) Mapping the ligand-binding sites and disease-associated mutations on the most abundant protein in the human, Type I collagen. J Biol Chem 277:4223–4231

    Article  CAS  PubMed  Google Scholar 

  8. Futuyma DJ (2005) Evolution. Sinauer Associates, Sunderland

    Google Scholar 

  9. Hudson DM, Garibov M, Dixon DR, Popowics T, Eyre DR (2017) Distinct post-translational features of type I collagen are conserved in mouse and human periodontal ligament. J Periodontal Res 52:1042–1049

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Jabbari K, Cacciò S, Païs de Barros JP, Desgrès J, Bernardi G (1997) Evolutionary changes in CpG and methylation levels in the genome of vertebrates. Gene 205:109–118

    Article  CAS  PubMed  Google Scholar 

  11. Kang AH, Dixit SN, Corbett C, Gross J (1975) The covalent structure of collagen. Amino acid sequence of alpha1-CB5 glycopeptide and alpha1-CB4 from chick skin collagen. J Biol Chem 250:7428–7434

    CAS  PubMed  Google Scholar 

  12. Karsdal MA, Leeming DJ, Henriksen K, Bay-Jensen A (2016) Biochemistry of collagens, laminins and elastin. Structure, function and biomarkers. Elsevier Academic Press, Amsterdam. ISBN: 978-0-12-809847-9

    Google Scholar 

  13. Kimura M, Ohta T (1969) The average number of generations until fixation of a mutant gene in a population. Genetics 61:763–771

    CAS  PubMed  PubMed Central  Google Scholar 

  14. Kleinnijenhuis AJ, van Holthoon FL (2018) Domain-specific proteogenomic analysis of collagens to evaluate de novo sequencing results and database information. J Mol Evol 86:293–302

    Article  CAS  PubMed  Google Scholar 

  15. Kleinnijenhuis AJ, van Holthoon FL, Herregods G (2018) Validation and theoretical justification of an LC-MS method for the animal species specific detection of gelatin. Food Chem 243:461–467

    Article  CAS  PubMed  Google Scholar 

  16. Kosakovsky Pond SL, Frost SDW (2005) Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol 22:1208–1222

    Article  CAS  PubMed  Google Scholar 

  17. Krzywinski M et al (2009) Circos: an information aesthetic for comparative genomics. Genome Res 19:1639–1645

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Kumar S, Subramanian S (2002) Mutation rates in mammalian genomes. Proc Natl Acad Sci USA 99:803–808

    Article  CAS  PubMed  Google Scholar 

  19. Lodish H, Berk A, Zipursky SL, Matsudaira P, Baltimore D, Darnell J (2000) Molecular cell biology, 4th edn. New York: W. H. Freeman

    Google Scholar 

  20. Lourenço JM, Glémin S, Chiari Y, Galtier N (2013) The determinants of the molecular substitution process in turtles. J Evol Biol 26:38–50

    Article  PubMed  Google Scholar 

  21. Marini JC et al (2007) Consortium for osteogenesis imperfecta mutations in the helical domain of type I collagen: regions rich in lethal mutations align with collagen binding sites for integrins and proteoglycans. Hum Mutat 28:209–221

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Murphy WJ, Pringle TH, Crider TA, Springer MS, Miller W (2007) Using genomic data to unravel the root of the placental mammal phylogeny. Genome Res 17:413–421

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Nené NR, Mustonen V, Illingworth CJR (2018) Evaluating genetic drift in time-series evolutionary analysis. J Theor Biol 437:51–57

    Article  Google Scholar 

  24. Nuytinck L, Freund M, Lagae L, Pierard GE, Hermanns-Le T, De Paepe A (2000) Classical Ehlers-Danlos syndrome caused by a mutation in type I collagen. Am J Hum Genet 66:1398–1402

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Perelman P et al (2011) A molecular phylogeny of living primates. PLoS Genet 7(3):e1001342. https://doi.org/10.1371/journal.pgen.1001342

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Phillips GO, Williams PA (2011) Handbook of Food Proteins. Woodhead, Cambridge

    Book  Google Scholar 

  27. Robinson M et al (1984) Codon usage can affect efficiency of translation of genes in Escherichia coli. Nucleic Acids Res 12:6663–6671

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Rogozin IB, Belinky F, Pavlenko V, Shabalina SA, Kristensen DM, Koonin EV (2016) Evolutionary switches between two serine codon sets are driven by selection. Proc Natl Acad Sci USA 113:13109–13113

    Article  CAS  PubMed  Google Scholar 

  29. Sanger F (1949) The terminal peptides of insulin. Biochem J 45:563–574

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Slatter DA, Farndale RW (2015) Structural constraints on the evolution of the collagen fibril: convergence on a 1014-residue COL domain. Open Biol 5:1–7

    Article  CAS  Google Scholar 

  31. Stover DA, Verrelli BC (2011) Comparative vertebrate evolutionary analyses of type I collagen: potential of COL1a1 gene structure and intron variation for common bone-related diseases. Mol Biol Evol 28:533–542

    Article  CAS  PubMed  Google Scholar 

  32. Viguet-Carrin S, Garnero P, Delmas PD (2006) The role of collagen in bone strength. Osteoporos Int 17:319–336

    Article  CAS  PubMed  Google Scholar 

  33. Watson JD, Crick FH (1953) Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature 171:737–738

    Article  CAS  PubMed  Google Scholar 

  34. Weir JT, Schluter D (2008) Calibrating the avian molecular clock. Mol Ecol 17:2321–2328

    Article  CAS  PubMed  Google Scholar 

  35. World Conservation Union (2014) IUCN red list of threatened species

  36. Wright S (1929) The evolution of dominance. Am Nat 63:556–561

    Article  Google Scholar 

  37. Yamauchi M, Sricholpech M (2012) Lysine post-translational modifications of collagen. Essays Biochem 52:113–133

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The study was a sequel to a previous proteogenomic study (Kleinnijenhuis and van Holthoon 2018) and was financed by Triskelion.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Anne J. Kleinnijenhuis.

Ethics declarations

Conflict of interest

The author declares no conflicts of interest. The work was financed by Triskelion.

Additional information

Handling editor: Konstantinos Voskarides.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Kleinnijenhuis, A.J. Visualization of Genetic Drift Processes Using the Conserved Collagen 1α1 GXY Domain. J Mol Evol 87, 106–130 (2019). https://doi.org/10.1007/s00239-019-09890-8

Download citation

Keywords

  • Genetic drift
  • Collagen 1α1
  • Molecular evolution
  • Coding DNA
  • Authenticity
  • Functional restriction