Abstract
HOX genes encode transcriptional factors that play a pivotal role in specifying regional identity in nearly every bilateral animal. The birth of HOX gene cluster and its subsequent evolution, either in regulation or function, underlie the evolution of many bilaterian features and hence to the evolutionary radiation of this group. Despite of this importance, evolution of HOX cluster in vertebrates remains largely obscure because the phylogenetic history of these genes is poorly resolved. This has led to the controversy about whether four HOX clusters in human originated through two rounds (2R) of whole-genome duplications or instead evolved by small-scale events early in vertebrate evolution. Recently, the large-scale phylogenetic analysis of triplicate and quadruplicate paralogous regions residing on human HOX-bearing chromosomes provided an unprecedented insight into events that shaped vertebrate genome early in their history. Based on these data and comparative genomic analysis of fruit fly, red floor beetle, and human, this study infers the genic content of minimal HOX locus in the Urbilaterian and reconstructs its duplication history. It appears that four HOX clusters of humans are not remnants of polyploidy events in vertebrate ancestry. Rather, current evidence suggests that one-to-four transition in HOX cluster number occurred by three-step sequential process involving regional duplication events. Therefore, it is concluded that the evolutionary origin of vertebrate novelties, including the complexity of their body, is the consequence of small-scale genetic changes at widely different times over their history.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
One of the most fundamental questions in evolutionary developmental biology (evo-devo) is how the evolution of metazoan genomes led to the astonishing diversification of body forms that we observe on Earth and in the ocean (Mora et al. 2011). Among the bilaterians, vertebrates have unique anatomical aspects and possess the greatest number of cell/tissue types (True and Carroll 2002). It was speculated that the invention of genes with new functions underlies increasing developmental, morphological, and metabolic complexity during vertebrate early history (Carroll 2001). Particularly, in the early years of genomic research prior to the availability of large-scale animal genome sequence data, based on rather inaccurate indicators such as genome size and gene number, it was suggested that whole-genome duplications (WGDs) generated a large amount of raw material to prompt evolution of novelty and complexity in a short time during early vertebrate history (Ohno 1970; Ohno 1973). This notion popularly theorized “2R hypothesis” (two rounds of WGDs) has been widely debated (Abbasi 2008; Hughes 2001; Hughes and Friedman 2003). In the early years of genomic era, the 2R has remained inconclusive owing to the limited amount of sequence data availability, sensitivity of computational programs to noisy data, and lack of methodological rigor (Durand 2003).
HOX paralogon and the history of vertebrate genome
Relied on partial dataset, initial investigations presented several lines of evidence in favor of this hypothesis. The most important one is the occurrence of potentially quadruplicated HOX (HSA 2/7/12/17), MHC (HSA 1/6/9/19), and FGFR (HSA 4/5/8/10) regions in human and other mammalian genomes (Larhammar et al. 2002; Panopoulou and Poustka 2005). Among these large quadrupled regions, human HOX clusters and their nearby genes are probably the most cited example, whose organization was taken as strong support to ancestral tetraploidy in vertebrates (Furlong and Holland 2002). However, over the past few years, with the rapidly increasing availability of complete genome sequences from diverse set of animal species, evolutionary history of human HOX-bearing chromosomes has been subjected to rigorous scrutiny (Abbasi 2010b; Abbasi and Grzeschik 2007; Ambreen et al. 2014; Asrar et al. 2013; Hughes et al. 2001). Three major steps were applied for large-scale computational analysis of gene duplications: (1) identifying multigene families with at least threefold representation on HSA 2/7/12/17, (2) construction of gene family trees, (3) estimation of timing of duplications relative to a phylogeny of organisms, (4) tests of phylogenetic consistency. Combing the results of these methods, a total of 62 families provided results that were inconsistent with 2R hypothesis. Rather, it appeared that members of these families were created by segmental duplications, independent gene duplications, and translocation events, scattered at different times over the history of animals.
Reconstructing the history of mammalian HOX clusters
In addition to resolving the history of human HOX-bearing chromosomes, the history of duplication of HOX clusters themselves has been revealed in this large-scale phylogenetic investigation of human multigene families (Abbasi 2010b; Abbasi and Grzeschik 2007; Ambreen et al. 2014; Asrar et al. 2013). Reconstructing the duplication history of mammalian four HOX clusters remains problematic because the interspecies variation is not sufficient to resolve the cluster duplication events (Bailey et al. 1997; Zhang and Nei 1996). However, Zhang and Nei (1996) analyzed the phylogeny of mammalian HOX clusters and proposed two alternative topologies (((HOXC HOXD) HOXA)HOXB) and ((HOXC HOXD) (HOXA HOXB)). The former topology suggests three separate regional duplication steps (1 → 2 → 4 HOX clusters) whereas the later favors two rounds of whole-genome duplication events (2 + 2 topology).
Multigene families linked to HOX clusters share the same history as the HOX clusters
It was hypothesized that multigene families closely linked to four human HOX clusters share the same evolutionary history as the HOX sequences themselves and are good candidates to resolve the clusters duplication history (Bailey et al. 1997). Taking advantage of the well-annotated and high-quality human genomic sequence map, intragenomic conserved syntenic association was explored around four human HOX clusters. This survey of human HOX-bearing loci pinpointed triplicate/quadruplicate syntenic association of paralogs of at least four distinct gene families (SP, HNRNPA, ITGA, and FMNL) with HOX clusters (Fig. 1). The close physical linkage of members of these families with each of the human HOX clusters makes them an interesting test case to evaluate clusters duplication history (Ambreen et al. 2014). Current availability of an immense amount of protein sequence data from an expanding range of vertebrate and invertebrate species was exploited to conduct a robust and thorough phylogenetic investigation (Fig. 1) (Abbasi 2010b; Abbasi and Grzeschik 2007; Ambreen et al. 2014; Asrar et al. 2013). Intriguingly, results from each of these families provide strong bootstrap support for a tree where the HOXB cluster branched off first, followed by HOXA, and final HOXC/D (Fig. 1). This would be expected if human HOX clusters and members of SP, HNRNPA, ITGA, and FMNL families (HOX paralogous regions) evolved in conjunction through three independent events of segmental duplications (SDs) (Fig. 2).
History of gene syntenic segments spanning human HOX loci. Upper panel schematically depicts the human fourfold paralogy regions spanning HOX loci on HSA2/7/12/17. Paralogous gene sets are color-coded similarly. The “×” symbol denotes the putative gene losses. Lower panel depicts the schematic NJ tree topologies of the multigene family constituting the human HOX paralogy block. A congruent-type (AB)(C)(D) topology for the HOX, ITGA, HNRNP, SP, and FMNL families suggests that human HOX paralogy regions were quadruplicated by three rounds of segmental duplication events. HOX genes are depicted in red and designed 1 through 14. Color codes for non-HOX genes are as follows: Itga (Integrin alpha), light blue; Fmnl (formin-like), purple; Sp (Transcription factor family SP), green; Evx (even-skipped homeobox), light green; Meox (mesenchyme homeobox), dark blue; Hnrnp (heterogeneous nuclear ribonucleoprotein), pink. None of the features of this figure are drawn to scale
Model of HOX locus evolution in vertebrates. A model for the evolutionary history of human HOX gene cluster is proposed based on phylogenetic history of HOX gene clusters and unrelated neighboring genes. Left panel displays in detail the inferred events, whereas, the right panel is a summary flow chart. Vertebrate ancestry contained a single set of coherent HOX gene cluster (depicted in red). Based on comparative syntenic analysis, here we inferred the minimal block of synteny spanning the ancestral HOX gene cluster. This ancestral HOX gene cluster along with at least six neighboring genes (HOX locus) underwent three rounds of segmental tandem (SD) duplication and translocation events generated four copies of ancestral HOX locus. HOX B locus was the first to diverge, next HOX A, and finally the split of HOX C and HOX D occurred. These three SD events are depicted sequentially from top to bottom in both the right and left panel. Gene losses after each SD event are also shown. Color codes in this figure are the same as described in Fig. 1. None of the features of this figure are drawn to scale
Minimal HOX locus in the Urbilaterian
Interchromosomal syntenic association and the tree topology comparison-based predication of HOX clusters evolution demand the constituent genes were linked prior to vertebrate origin. To test this assumption, heres a survey of Drosophila (fruit fly) and recently sequenced Tribolium (red flour beetle) HOX loci was conducted (Richards et al. 2008). Comparative genomic data from fruit fly, red floor beetle, and human, made it possible to reconstruct the genic content of minimal HOX locus in the Urbilaterian (common ancestor of bilaterians) (Fig. 3). From this reconstruction, it can be seen that, canonical HOX genes are clustered with SP, HNRNPA, ITGA, and FMNL genes, at the Urbilaterian HOX locus (Fig. 3). Given the relatively well-preserved genomic organization of HOX loci in three distantly genomes analyzed, the approximate ancestral order of the genes is also predicted (Fig. 3). Thus, intergenomic synteny analysis lending support to the hypothesis based on intragenomic synteny and phylogenetic data that these genes duplicated in concert with HOX clusters through SD events (Fig. 2). It is notable that SP, HNRNPA, ITGA, and FMNL are novel examples of genes which are present on the same genomic segment as the HOX cluster in the last common ancestor of bilaterians. Such an arrangement implies that these genes from distinct families have remained linked with HOX clusters over hundreds of millions of years of evolution, suggesting functional implications. These associations might reflect ancient cis-regulatory constrains with multiple genes sharing cis-regulatory elements or cis-elements of developmental regulators are often located at large distances from transcriptional start site of the gene upon which they act (Parveen et al. 2013).
Cladogram depicts HOX locus architecture for representative animals. At the base is shown, the putative organization of genic content of the bilaterian (protostome and deutrostome) HOX locus. Gene content order of Urbilaterian (ancestral bilaterian animal) HOX locus was inferred by comparative gene order analysis of extant representatives of protostome (Drosophila and Tribolium) and deutrostome (quadruplicated human HOX loci) animals. The left branch depicts the fragmented HOX loci of the Drosophila and cb. The right branch portrays four coherent human HOX loci. Color codes in this figure are the same as described in Fig. 1. None of the features of this figure are drawn to scale
Ancient vertebrate genome was shaped by small-scale events
Did the paralogy regions on human HOX-bearing chromosomes arise through 2R at the origin of vertebrates? If the HOX clusters themselves were the products of ancient whole-genome duplications? These questions were hotly debated but a definitive answer to these questions has remained elusive because of the paucity of extensive genomic sequence data in early years of genomic era. However, recent availability of high quality whole-genome sequence data from diverse set of animal species and accurate gene predication annotation pipelines is ideal to test whether the human HOX-bearing chromosomes and HOX clusters themselves are remnants of two rounds of polyploidy in vertebrate ancestry (Abbasi 2010a). Indeed, the combined application of the mapself comparison approach and comparison with a preduplication species, together with robust and thorough phylogenetic analysis of human multigene gene families provide an unprecedented opportunity to gain insight into the mechanisms that contributed to the evolution of ancestral vertebrate genome. These results are not consistent with WGD hypothesis. Instead, it appears that paralogy blocks residing on human HOX-bearing chromosomes and HOX clusters themselves resulted from small-scale events which include, segmental duplications, independent gene duplication, and translocations. Furthermore, this work discounts the contention that a burst of gene duplication activity took place in the early vertebrate history after the divergence of invertebrate lineage. It appears that current hierarchy of human proteome is created by small-scale events, scattered at different times over the evolutionary history of animal’s life.
Conclusion and future perspectives
Over the past few years, a growing body of evidence suggests that apparently higher morphological and developmental complexity seen in modern vertebrates must be accounted for by inventions other than adding new genes by duplications. These include: (a) evolution of alternative protein domains in ancient genes through changes in parts of protein-coding sequences not essential for current function, (b) alternative spliced forms of same gene, (c) evolution of transcriptional regulation, expressing ancient protein in a novel tissues or developmental compartments/stages, (d) origin of new chimeric genes through exon shuffling, (e) new coding sequences and cis-regulatory elements may emerge de novo from non-coding genomic sequences. The challenge for the future studies is to extend beyond the traditionally well-studied source of gene duplication and portray a comprehensive view of the interplay of all the aforementioned mechanisms that drive vertebrate evolution during their early history.
References
Abbasi AA (2008) Are we degenerate tetraploids? More genomes, new facts. Biol Direct 3:50
Abbasi AA (2010a) Piecemeal or big bangs: correlating the vertebrate evolution with proposed models of gene expansion events. Nat Rev Genet 11:166
Abbasi AA (2010b) Unraveling ancient segmental duplication events in human genome by phylogenetic analysis of multigene families residing on HOX-cluster paralogons. Mol Phylogenet Evol 57:836–48
Abbasi AA, Grzeschik KH (2007) An insight into the phylogenetic history of HOX linked gene families in vertebrates. BMC Evol Biol 7:239
Ambreen S, Khalil F, Abbasi AA (2014) Integrating large-scale phylogenetic datasets to dissect the ancient evolutionary history of vertebrate genome. Mol Phylogenet Evol 78:1–13
Asrar Z, Haq F, Abbasi AA (2013) Fourfold paralogy regions on human HOX-bearing chromosomes: role of ancient segmental duplications in the evolution of vertebrate genome. Mol Phylogenet Evol 66:737–47
Bailey WJ, Kim J, Wagner GP et al (1997) Phylogenetic reconstruction of vertebrate Hox cluster duplications. Mol Biol Evol 14:843–53
Carroll SB (2001) Chance and necessity: the evolution of morphological complexity and diversity. Nature 409:1102–9
Durand D (2003) Vertebrate evolution: doubling and shuffling with a full deck. Trends Genet 19:2–5
Furlong RF, Holland PW (2002) Were vertebrates octoploid? Philos Trans R Soc Lond B Biol Sci 357:531–44
Hughes AL (2001) Evolution of the integrin alpha and beta protein families. J Mol Evol 52:63–72
Hughes AL, da Silva J, Friedman R (2001) Ancient genome duplications did not structure the human Hox-bearing chromosomes. Genome Res 11:771–80
Hughes AL, Friedman R (2003) 2R or not 2R: testing hypotheses of genome duplication in early vertebrates. J Struct Funct Genomics 3:85–93
Larhammar D, Lundin LG, Hallbook F (2002) The human Hox-bearing chromosome regions did arise by block or chromosome (or even genome) duplications. Genome Res 12:1910–20
Mora C, Tittensor DP, Adl S et al (2011) How many species are there on Earth and in the ocean? PLoS Biol 9:e1001127
Ohno, S., 1970. Evolution by gene duplication. Springer-Verlag.
Ohno S (1973) Ancient linkage groups and frozen accidents. Nature 244:259–62
Panopoulou G, Poustka AJ (2005) Timing and mechanism of ancient vertebrate genome duplications—the adventure of a hypothesis. Trends Genet 21:559–67
Parveen N, Masood A, Iftikhar N et al (2013) Comparative genomics using teleost fish helps to systematically identify target gene bodies of functionally defined human enhancers. BMC Genomics 14:122
Richards S, Gibbs RA, Weinstock GM et al (2008) The genome of the model beetle and pest Tribolium castaneum. Nature 452:949–55
True JR, Carroll SB (2002) Gene co-option in physiological and morphological evolution. Annu Rev Cell Dev Biol 18:53–80
Zhang J, Nei M (1996) Evolution of Antennapedia-class homeobox genes. Genetics 142:295–303
Acknowledgments
I am thankful to Faiqa Khalil (National Center for Bioinformatics, Quaid-i-Azam University, Islamabad) for helping me with Figures.
Authors’ contributions
AAA conceived the project and designed the experiments. AAA performed the experiments and analyzed the data. AAA wrote the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Volker G. Hartenstein
Rights and permissions
About this article
Cite this article
Abbasi, A.A. Diversification of four human HOX gene clusters by step-wise evolution rather than ancient whole-genome duplications. Dev Genes Evol 225, 353–357 (2015). https://doi.org/10.1007/s00427-015-0518-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00427-015-0518-z





