Introduction

Flatworms (platyhelminthes) comprise planarians, trematodes, and cestodes. Planarians are mostly free-living, while the latter two are exclusively parasitic. Parasitic flatworms can cause serious diseases such as schistosomiasis and taeniosis/cysticercosis. Although great progresses have been made to prevent and treat these diseases, people still face various challenges. Schistosomiasis, for example, is still epidemic all over the world associated with high disability and mortality (Chitsulo et al. 2000; Gryseels et al. 2006). Praziquantel (PZQ), as the most effective chemotherapy drug currently used, has limitations such as not being effective enough against juvenile schistosomes to prevent reinfection (Ribeiro-dos-Santos et al. 2006). Moreover, schistosomes subjected to the drug pressure of PZQ can develop resistance (Fallon and Doenhoff 1994). Therefore, it is urgent to find new targets so as to develop specific drugs, effective vaccines, and new strategies to control these parasites.

Triosephosphate isomerase (TIM) is a vital glycolytic enzyme, which is responsible for the reversible isomerization between glyceraldehyde 3-phosphate (GAP) and dihydroxyacetone phosphate (DHAP). Due to its key role in energy metabolism, as well as its well-understood enzymatic and structural features, TIM of a pathogenetic organism is often considered as drug target or vaccine candidate (Gomez-Puyou et al. 1995; Velanker et al. 1997; Wu et al. 2005). TIMs of several parasitic flatworms have been identified (dos Reis et al. 1993; Hooker and Brindley 1996; Jimenez et al. 2000; Wei et al. 2006), and schistosomal TIM been identified as a protective antigen (Wu et al. 2005). Furthermore, Shoemaker et al. (1992) indicated three fragments within Schistosoma mansoni TIM, rather than the whole molecule, might be used as vaccine candidates so as to avoid human autoimmune responses. However, until now, neither drug nor vaccine aimed at parasitic flatworm TIMs has been successfully applied. To compare the TIMs from the two lifestyle flatworms might be helpful to identify new efficient target vaccines or drugs.

Dugesia japonica, Dugesia ryukyuensis, and Schmidtea mediterranea are well-studied triclad planarians. To date, expressed sequence tag (EST) databases of D. ryukyuensis (Ishizuka et al. 2007), D. japonica (Mineta et al. 2003), and S. mediterranea (Alvarado et al. 2002; Zayas et al. 2005) have already been constructed, and the genome sequencing of S. mediterranea is also ongoing. Therefore, in the present work, first the full TIM gene and protein sequences of the three free-living flatworms were identified. Then, they were compared with those of parasitic flatworms as well as other metazoans. Some distinct features between TIMs of the two lifestyle flatworms were found, in particular, a novel parasitic flatworm TIM-specific insert. Its potential as a control target was investigated.

Materials and methods

D. japonica

The triclad planarian D. japonica was collected from a spring pool in a park, and cultured at 20–25°C. The identity was established by its shape, size, and its distribution in China (Kawakatsu et al. 1995). Adult worms used in our experiments had undergone 1 week of starvation.

Acquisition of the cDNA and DNA sequences of D. japonica TIM by experiments

Total RNA was isolated by using a TaKaRa Trizol Kit (TaKaRa Biotechnology (Dalian) Co., Ltd., Dalian, China) according to the manufacturer's instructions. Genomic DNA was extracted by a modified method described by Liu and Jiang (2005).

According to the reported EST sequence of the D. japonica TIM gene (GenBank accession number BP189215), a primer (5′-CTAATATAAAAgATTggACTCg-3′) was designed to performed 3′-RACE experiment. The amplification parameters were as follows: 94°C for 2 min, followed by 32 cycles at 94°C for 1 min, 55°C for 1 min, and 72°C for 1 min, followed by 72°C for 10 min. The amplification product of the expected size was gel isolated and sequenced after cloning into the pMD19-T cloning vector (TaKaRa).

According to the sequences obtained by 3′-RACE and of the known 5′-end, another pair of primers was designed as follows: sense primer, 5′-TCCAATATTTACTCAAAAATATgTC-3′ and antisense primer, 5′-ATgATAATAATATTCACTAAACC-3′, in order that the product contains the entire coding region of the TIM gene. Then, PCR reactions were performed by using cDNA reverse transcribed from total RNA and genomic DNA as templates, respectively. PCR products were cloned and sequenced as described above.

Search and identification of two other planarian TIM sequences

EST database of D. ryukyuensis was downloaded from the Planarian EST Database (http://planaria.bio.keio.ac.jp/) and genomic sequences of S. mediterranea from the ftp server site at Washington University in St. Louis (http://genome.wustl.edu/). Two other EST databases of S. mediterranea (http://planaria.neuro.utah.edu/ and http://www.life.uiuc.edu/planaria) were searched online. Human TIM (GenBank accession number P60174) was used as a query to carry out local or online BLAST searches against each of the four databases at default settings, respectively (Altschul et al. 1997). Then, the best hits were used as queries to reciprocally BLASTP search against the GenBank non-redundant (nr) protein database. If the best hit of the reciprocal BLASTP was annotated as TIM, the obtained sequence was considered as a candidate sequence.

To further confirm the candidate planarian TIM sequences, Pfam program (http://pfam.sanger.ac.uk/) and ClustalW2 at EBI were used to analyze whether they contain the characteristic TIM domain and active sites conserved in TIMs.

Sequence alignment of metazoan TIMs

Metazoan proteins with unambiguous annotations of TIM in Swiss-Prot/TrEMBL (http://www.expasy.org/sprot/) were collected. TIMs of two trematodes, Schistosoma haematobium and Orientobilharzia turkestanicum, as well as TIM fragment of polyclad planarian, Stylochus sp., were collected from GenBank. In addition, TIM fragments of two parasitic nematodes, Wuchereria bancrofti and Necator americanus, were identified from Parasitic Helminth Genomes (the Sanger Center). For the sequences with high similarities from a common lineage (except flatworms and the mammalian hosts of parasitic flatworms), only one sequence was selected as a representative. (For the GenBank accession numbers of these selected sequences, please see Electronic Supplementary Material (ESM 1.pdf).)

Homology modeling and B cell epitope prediction

To generate a protein structure model of parasitic flatworm TIM, the sequence was submitted to CPHmodels-2.0 Server (Lund et al. 2002) and the modeling template was chosen automatically. Then the model was superposed with human TIM (PDB: 1HTI) by using the SSAP Server (http://www.cathdb.info/cgi-bin/cath/GetSsapRasmol.pl).

Based on the parasitic flatworm TIM models, linear, and discontinuous B cell epitopes were predicted by using the DiscoTope 1.2 server and BepiPred 1.0b server, respectively, with default settings (Haste-Andersen et al. 2006; Larsen et al. 2006), and solvent accessible surface areas were also calculated (Fraczkiewicz and Braun 1998).

Results

Identification of TIMs from three planarians, D. japonica, S. mediterranea, and D. ryukyuensis

D. japonica TIM (DjaTIM)

After a ~500-bp DNA sequence of D. japonica TIM, cDNA was obtained through the 3′-RACE experiment, the unknown part of the coding region, and the complete 3′untranslated region (UTR) sequence of this gene were obtained. Then a ~850-bp cDNA amplicon, which contains the full-length coding region, was obtained and sequenced. The electrophoresis patterns of related PCR products are shown in Fig. 1a. The open reading frame (ORF) encodes a 248-aa protein, which is most similar to chicken TIM when BLASTP searching against non-redundant protein database (e value: 6e-103, and ESM 2.pdf). Pfam program predicted that the protein sequence contains TIM domain (from Phe7 to Ile245). Moreover, it contains all the conserved amino acid residues of known TIMs (e.g., Asn12, Lys14, His96, Glu166) and conserved motifs (e.g., 169-WAIGTG-174, which is critical for the substrate binding to the active center). Thus, D. japonica TIM cDNA and protein sequences were identified, and we named the protein DjaTIM.

Fig. 1
figure 1

a Electrophoresis patterns of PCR products of D. japonica TIM. M, marker; Lane 1, genomic DNA; Lane 2, PCR products of our 3′-RACE experiment of TIM; Lanes 3 and 4, DjaTIM cDNA and DNA amplified from total RNA and genomic DNA, respectively. b A diagram illustrating D. japonica TIM gene structure. The coding sequence is showed by black box, which is disrupted by only one intron (from 116 to 167 nt) showed by open box

The DjaTIM DNA sequence was obtained by PCR against genomic DNA, and then compared with the cDNA sequence obtained above. It showed that this gene contains a single 52-bp intron (from 116 nt to 167 nt), at the ends of which the typical GT/AG intron splicing signal can be found (Fig. 1b).

Both DjaTIM cDNA sequence and DNA sequence identified experimentally in this work have been deposited in GenBank (EU288186 and EU288187).

S. mediterranea TIM (SmeTIM)

High scoring homologs were obtained through database BLAST against both EST and genome databases of S. mediterranea. All displayed EST sequences were found to be almost identical in sequence, thus the longest one, PL06021B1D09 (GenBank accession number DN315650), was chosen for further analyses. It contains an ORF encoding a 248-aa protein, which has the highest scoring homology with chicken TIM (e value 5e−105) and has a positional identity of 87% with DjaTIM mentioned above. TIM-conserved residues and motifs were also found in the encoded peptide. Thus, S. mediterranea TIM cDNA and protein sequences were identified, and we named the protein SmeTIM.

The SmeTIM DNA sequence was also obtained by blasting the genome database, and then compared with the cDNA sequence. The result indicated that the SmeTIM gene also includes only one 47-bp intron, which shares a similar size and identical location with the DjaTIM gene. In addition, the genome database searching result indicated there is only one copy of the TIM gene in the S. mediterranea genome.

D. ryukyuensis TIM (DryTIM)

As described for SmeTIM, the D. ryukyuensis TIM cDNA sequence was identified (EST number Dr_sW_016_F10 and GenBank accession number BW639877), which encodes a 248-aa protein (named DryTIM). DryTIM has the highest scoring homolog with other two planarian TIMs (97% and 88% with DjaTIM and SmeTIM, respectively) and the second highest scoring homolog with chicken TIM (e value 1e−101), rather than with those of parasitic flatworms.

A unique tripeptide insert in TIMs of parasitic flatworms

The alignment of 35 metazoan TIM sequences revealed that the three newly identified planarian TIMs are quite conserved with all known metazoan TIMs. However, interestingly, within the region between the two conserved residues, Q147 and W161 (numbered according to S. japonicum TIM), all the collected TIMs of parasitic flatworms (trematode and cestode) exclusively possess a unique tripeptide insert—“SAD” in three trematodes (S. mansoni, S. haematobium, and O. turkestanicum), “SID” in another trematode S. japonicum, and “SKE” in cestode Taenia solium, and neither free-living flatworm TIM nor other known metazoan TIM shares such an insert (Fig. 2). Furthermore, the insert sequence is quite conserved (the first position is a serine and the third one is an acidic amino acid, aspartic acid, or glutamic acid) and can be defined as SXD/E (“X” here stands for alamine, isoleucine, or lysine).

Fig. 2
figure 2

Partial results of multi-sequence alignment of metazoan TIMs. It is showed that parasitic flatworm TIMs possess a unique tripeptide insert. Species in the two boxes belong to the phylum Platyhelminthes. The upper box contains five parasitic flatworms, and the nether box contains four free-living planarians

Comparative protein structure modeling and prediction of B cell epitopes

Our structure models of the five parasitic flatworm TIMs all display the canonical conserved TIM β-barrel structure, and the unique tripeptide insert is located between the secondary structural elements α-helix 5 and β-strand 6 (ESM 3.pdf).

In terms of S. japonicum TIM, the automated homology modeling server chose the chicken TIM as its modeling template (PDB: 8TIM) (its positional identity with S. japonicum TIM is 66.4%). When superposed with the crystal structure of human TIM (PDB: 1HTI), the model shows exactly like structure except the insert region (Fig. 3). The insert region was far from the active site and the dimer interface; furthermore, this region is nearly completely surface-exposed (ESM 4.pdf). Similar results were observed in other four parasitic flatworm TIMs in spite of choosing different templates by the server (data not shown).

Fig. 3
figure 3

Results of superposition of the modeled S. japonicum TIM (through automated comparative protein modeling and shown in solid ribbon presentation) with the TIM A-chain structure of Homo sapiens (1HTI, shown in line ribbon presentation). The tripeptide insert “SID” of S. japonicum TIM and the catalytic residue Glu 169 were marked and shown in CPK and ball and stic view, respectively

Our prediction indicated that the tripeptide insert of parasitic flatworm TIMs is the main part of either a discontinuous B cell epitope (For S. japonicum TIM see ESM 5.pdf) or linear B cell epitopes, or both. All the three cases are suitable for vaccine development.

Discussion

In the present work, for the first time, full TIM gene and deduced protein sequences from three free-living flatworms were identified, which made it possible to compare this important glycolytic enzyme between parasitic and free-living flatworms, as well as with those of other organisms. Based on these data, we investigated the divergence of TIMs between the two different lifestyle flatworms, and then attempted to identify some distinct features as potential targets against parasitic flatworms.

The adaptive evolution divergence of TIMs between the different lifestyle flatworms

The two identified planarian TIM genes each contain only one intron of a similar size and identical location among species. However, TIM genes of S. mansoni (dos Reis et al. 1993) and S. japonicum (of which the intron information was identified from its genome database by us) both contain five introns, of which the first one has similar size and identical location to the intron of planarian TIM genes. All five introns located in the schistosomal TIM gene, also arise in that of mammals and birds (chicken), as well as placozoa (Trichoplax adhaerens), Cnidaria (Nematostella vectensis), and even a choanoflagellate (Monosiga brevicollis), which was considered to be the closest relative of metazoans in protozoa (King et al. 2008). In addition, our phylogenetic analysis excluded the possibility of lateral gene transfer (LGT) between schistosomes and chickens/mammals (data not shown). Therefore, the TIM gene in the last common ancestor of extant flatworms most likely contained at least the five conserved introns, and the absence of four of them in planarian TIM is mostly due to intron loss in the planarian lineage rather than numerous independent intron acquisitions in schistosomes, and various other animals. It is generally considered that parasitic organisms streamline their structures (including gene structure) under high selective pressure, but here we find an inverse process in intron numbers.

Like other TIMs, planarian TIMs are very conserved, sharing high positional identity with other metazoan TIMs. But unexpectedly, their highest scoring homolog is the chicken TIM followed by many other vertebrate TIMs, instead of those of their parasitic relatives (trematode and cestode), or other invertebrates. To test whether this is totally due to the unique tripeptide insert, another alignment was performed after deleting the insert, but the result was almost the same. This might imply a relatively rapid evolution occurred in parasitic flatworm TIMs after the divergence of the two lifestyle flatworms. Such a rapid evolution might be an adaptive evolution for parasitic flatworms to fit the parasitic life.

Morphological and molecular evidence supports that Rhabditophora, which contains all the parasitic flatworms and most planarians (including all the four species studied here), is a monophyly (Carranza et al. 1997; Ruiz-Trillo et al. 1999), namely, they share a last common ancestor. However, our work showed that compared to their free-living relatives, parasitic flatworms contain an additional tripeptide insert in the conserved TIMs. Our sequence analyses indicated that such an insert does not exist in all other known TIMs yet, including those from invertebrates (including Porifera, Cnidaria, Nematoda, Nemertea, Mollusca, Arthropoda, and Echinodermata in our analysis) and vertebrates—almost all the representative lineages of the Metazoa (Fig. 2 and ESM 1.pdf). Therefore, this tripeptide insert is specific to parasitic flatworm TIMs, suggesting that the insertion event must have occurred after the divergence of parasitic flatworms from free-living ones in evolutionary history. The insert sequences themselves are quite conserved among trematodes, and even between trematodes and cestodes, which suggests that the acquisition of the insert was a single evolutionary event and there might be some functional constraints against losing the insert or changing its sequence remarkably. Therefore, this unique insert of TIM might result from adaptive evolution and be of importance to enzymatic activity and even to the parasitic life of these parasitic flatworms.

A universal target against parasitic flatworms

Understanding the modifications that occurred during coevolution with hosts is not only fundamental to the understanding of parasitism, but also highly relevant for the design of antiparasitic drugs and vaccines (Brehm et al. 2006). In term of schistosomiasis, TIM is one of the most important vaccine candidates (Harn et al. 1992; Wu et al. 2005). However, TIM sequences are so conserved between schistosomes, and their mammalian hosts that autoimmunity of the host body would occur if using the whole schistosomal TIM as an antigen, and deficiency of TIM in the host (e.g., human) will result in hemolytic anemia and neuromuscular dysfunction (Daar et al. 1986). To avoid this, Shoemaker et al. (Shoemaker et al. 1992) recommended three regions within S. mansoni TIM to develop vaccines, which are rich in charges, hydrophilic, and share low identity with corresponding regions of the human TIM. Later, the fusion peptide of the first two regions from the S. japonicum TIM has been demonstrated to be an effective antigen and schistosomal specificity by other authors (Lou et al. 2001). Our identified tripeptide insert is within the second region, and the residues of the insert were predicted to be linear and/or discontinuous B cell epitopes. Therefore, we propose the region that contains the unique insert can be the first choice to develop more specific and effective vaccines, especially multiple antigenic peptide (MAP) vaccines, not just against S. mansoni but against all parasitic flatworms. Besides, this parasitic flatworm-specific insert might also be considered to be an ideal target candidate for developing specific chemotherapy drugs against the parasitic flatworm diseases such as schistosomiasis and taeniosis/cysticercosis.