Volume 422 of the series Methods in Molecular Biology™ pp 171-184

Computational Reconstruction of Ancestral DNA Sequences

  • Mathauieu BlanchetteAffiliated withMcGill Centre for Bioinformatics, McGill University
  • , Abdoulaye Baniré DialloAffiliated withMcGill Centre for Bioinformatics, McGill University
  • , Eric D. GreenAffiliated withNational Human Genome Research Institute, National Institutes of Health
  • , Webb MillerAffiliated withCenter for Comparative Genomics and Bioinformatics, Penn State
  • , David HausslerAffiliated withHoward Hughes Medical Institute, University of California

* Final gross prices may vary according to local VAT.

Get Access


This chapter introduces the problem of ancestral sequence reconstruction: given a set of extant orthologous DNA genomic sequences (or even whole-genomes), together with a phylogenetic tree relating these sequences, predict the DNA sequence of all ancestral species in the tree. Blanchette et al. (1) have shown that for certain sets of species (in particular, for eutherian mammals), very accurate reconstruction can be obtained. We explain the main steps involved in this process, including multiple sequence alignment, insertion and deletion inference, substitution inference, and gene arrangement inference. We also describe a simulation-based procedure to assess the accuracy of the reconstructed sequences. The whole reconstruction process is illustrated using a set of mammalian sequences from the CFTR region.

Key Words

Ancestral DNA sequence reconstruction multiple sequences alignment mammalian phylogeny; mammalian evolution substitutions and indels reconstruction ancestral sequence reconstruction accuracy