Journal of Molecular Evolution

, Volume 65, Issue 2, pp 137–153

Comprehensive Analysis of Animal TALE Homeobox Genes: New Conserved Motifs and Cases of Accelerated Evolution

Article

DOI: 10.1007/s00239-006-0023-0

Cite this article as:
Mukherjee, K. & Bürglin, T.R. J Mol Evol (2007) 65: 137. doi:10.1007/s00239-006-0023-0

Abstract

TALE homeodomain proteins are an ancient subgroup within the group of homeodomain transcription factors that play important roles in animal, plant, and fungal development. We have extracted the full complement of TALE superclass homeobox genes from the genome projects of seven protostomes, seven deuterostomes, and Nematostella. This was supplemented with TALE homeobox genes from additional species and phylogenetic analyses were carried out with 276 sequences. We found 20 homeobox genes and 4 pseudogenes in humans, 21 genes in mouse, 8 genes in Drosophila, and 5 genes plus one truncated gene in Caenorhabditis elegans. Apart from the previously identified TALE classes MEIS, PBC, IRO, and TGIF, a novel class is identified, termed MOHAWK (MKX). Further, we show that the MEIS class can be divided into two families, PREP and MEIS. Prep genes have previously only been described in vertebrates but are lacking in Drosophila. Here we identify orthologues in other insect taxa as well as in the cnidarian Nematostella. In C. elegans, a divergent Prep protein has lost the homeodomain. Full-length multiple sequence alignment of the protostome and deuterostome sequences allowed us to identify several novel conserved motifs within the MKX, TGIF, and MEIS classes. Phylogenetic analyses revealed fast-evolving PBC class genes; in particular, some X-linked PBC genes in nematodes are subject to rapid evolution. In addition, several instances of gene loss were identified. In conclusion, our comprehensive analysis provides a defining framework for the classification of animal TALE homeobox genes and the understanding of their evolution.

Keywords

HomeoboxTALEMEISPREPTGIFPBCMOHAWKMKXIRO

Supplementary material

239_2006_23_Supp.pdf (19.5 mb)
Supplementary material

Copyright information

© Springer Science+Business Media, LLC 2007

Authors and Affiliations

  1. 1.Department of Biosciences and NutritionKarolinska Institutet, and School of Life SciencesHuddingeSweden
  2. 2.Center for Genomics and BioinformaticsKarolinska InstitutetStockholmSweden
  3. 3.School of Life SciencesHuddingeSweden