Background

Centromeres provide the essential functions for chromosome segregation. They act as regulators of mitosis and meiosis by controlling attachment of chromosomes to the spindle and regulating progression into anaphase [14]. In mammalian cells, the centromere is associated with large blocks of heterochromatin comprised of highly repetitive satellite DNA of unconserved sequence repeats among species. It have been hypothesized that centromere function is therefore not specified directly by DNA sequence, but rather by higher order DNA or chromatin structure [5]. Centromeric DNA and protein components have been suggested to evolve rapidly in evolution and be responsible for the organization of functional centromere of emerging species [6].

CENP-A is a 17 kD histone H3 variant with homologous polypeptides being described from human to yeast [79]. Different approaches have indicated that CENP-A is required for the assembly of kinetochore and activation of the centromere [1012]. The human protein shows more than 60% sequence identity to H3 at the C-terminal histone fold domain. The highly divergent N-termini of CENP-A is localized outside the nucleosome and should serve to interact with other kinetochore proteins including histones [4, 13]. This unique N-terminal charged region forms a major epitope in the induction of specific autoantibodies against centromere in autoimmune patients [14, 15]. The overall homology between homologous CENP-A proteins and the evolutionarily unconserved satellite repeat DNAs at the kinetochore described for many species, constitute a major unresolved enigma in modern biology for understanding the structural organization and function of the centromere.

Results and Discussion

In order to look for CENP-A conserved motifs in closed species, we have cloned CENP-A cDNA from hamster and compare the deduced amino acid sequence with that of mouse and human. Sequencing was done on both strands of DNA and the sequence data was deposited in GenBank with the accession number AJ428867. As shown in Fig. 1, we isolated a 1623nt cDNA containing several polyadenylation signals and a poly-A tail. The coding region was found between nucleotides 98 and 484. The cDNA of hamster CENP-A contains an open reading frame encoding a polypeptide consisting of 129 amino acid residues. Sequence analysis of deduced hamster CENP-A polypeptide showed 79% and 73% homology with those described previously of mouse and human respectively. CENP-A is defined as a histone H3-like protein and as expected the C-terminal histone H3 fold domain of the hamster cDNA (residues 37–129) showed a 84 % homology with those of human and mouse CENP-A cDNAs. However at the N-terminal domain (residues 1–36) the hamster antigen contains 5 deleted amino acid residues in comparison to that of mouse (Fig. 2). Further, in rodents there are also 5 deleted amino acid residues in the N-terminal related to that of the human CENP-A amino acid sequence. Significantly, this include a serine residue at position 7 which have been described in human CENP-A as a Aurora B kinase phosphorylation site involved in completion of cytokinesis [16]. It is plausible that other serine residues found at the N-terminal region of hamster CENP-A may serve for similar function. Overall changes found in the hamster CENP-A make the protein five and eleven amino acid shorter than those of mouse and human polypeptides respectively (Fig. 2). These observations indicate a degree of evolutionary divergence of protein sequences between hamster, mouse and human CENP-A and suggest a significant structural plasticity in an essential kinetochore component.

Figure 1
figure 1

Nucleotide sequence of hamster CENP-A cDNA. Deduced amino acids of 129 encoded codons are shown in black. Numbers in the right indicate nucleotide position. The asterisk designates the stop codon. Two consensus polyadenylation signal sequences are shown in black underlined.

Figure 2
figure 2

Alignment of hamster, mouse and human CENP-A amino acid sequences. The alignment has been shaded with identical residues. The C-terminal histone H3 fold domain (aa 37–129) is highly conserved. In contrast, several residues of the N-terminal tail are variable among the three species compared, including several deletions as shown. The accession numbers are as follows: hamster AJ428867 (this work); mouse AF012708-10 and human U14518-19.

As suggested in a recent report on centromere DNA sequences divergences found in eukaryotes, those changes observed in the N-terminal region of several CENP-A proteins, could be explained for their specific recruitment at nucleosomes containing different DNA satellite repeat [6, 17]. The centromere drive model proposed by Malik et al. [17] imply changes during evolution on centromeric repeat sequences associated with those found on centromeric constitutive antigens such as CENP-A. This specific event could play a role at the distinctive higher-order of chromatin structure at mammalian centromeres [18]. Thus, understanding the relationships between CENP-A primary structure and DNA repeat organization at the centromeres may provide a clue into the structural requirements needed for assembly of a kinetochore.

Conclusions

Hamster CENP-A and its human and murine homologous proteins are highly conserved at the C-terminal domain. However they have significant amino acid differences at the N-terminal tail, including a serine residue at position 7 which is missing in hamster and mouse CENP-A, but it is phosphorylated by Aurora B kinase in humans for a role in cytokinesis. Further, the N-terminal domain of hamster CENP-A is five and eleven amino acid shorter than those homologous proteins of mouse and human respectively. Thus, the divergences found in the N-terminal tail of CENP-A in different species during evolution, according to a current centromere model could be correlated with those variations widely described on centromeric DNA repeat sequences [6, 17].

Materials and Methods

Isolation of a hamster CENP-A cDNA

A CHO cDNA expression library constructed in lambda ZAP (Stratagene) was screened with a human CENP-A cDNA probe resulting in the isolation of several positive clones. One of these resulted to be a hamster full length CENP-A clone described here. cDNA was subcloned in pBluescript vector (Stratagene) and DNA sequence analysis was performed in both strands by the dideoxy method using T7 DNA polymerase (Amersham Biosciences) and 5'-[α-35S]-labeled dATP (NEN Life Science Prod. Zaventem, Belgium). DNA sequences were analysed in EMBL databases using BLAST and Clustaw softwares.