Abstract
Clonorchis sinensis is a biliary tract parasite, which infects over 30 million people in China, Korea and Southeast Asia through the ingestion of undercooked freshwater fish that harbour the infective metacercariae. The genes expressed in C. sinensis adults were identified in order to develop novel drugs, better diagnostics and vaccines for the parasite. The C. sinensis cDNA library was constructed and DNA sequencing was performed with 450 randomly selected clones. Four hundred and fifteen clones contained the amino-acid-encoding sequences. The functions of these genes could be assigned by DNA sequence homology. Basic Local Alignment Search Tool X analysis showed that 277 of the 415 clones were strongly matched (P<10-9) to previously identified proteins, while the remaining 138 fell into the "no database match" category. Among the clones matching previously identified proteins, 220 putatively identified genes were sorted into seven functional categories. These included the genes associated with energy metabolism (38), gene expression/RNA metabolism (21), regulatory/signalling components (14), protein metabolism/sorting (98), the structure/cytoskeleton (29), membrane transporters (ten) and antigenic proteins (ten). The remaining 57 clones were not included in these categories. The dataset included the genes encoding the proteases, a lipid binding protein, the antigen proteins and the other genes of interest from a diagnostics, drug or vaccine development viewpoint. The present expressed sequence tag analysis proved to be an effective tool for examining gene expression and identified several important genes which increase and complement our knowledge of the biology of C. sinensis.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Clonorchis sinensis is an important human parasite in eastern Asia, including China, Taiwan, northern Vietnam and Korea. In Korea, it is well known that human infection of C. sinensis is widely distributed along the rivers and streams (Rim 1998; Crompton et al. 1999). When the fluke infects humans, the bile duct is severely dilated and the ductal wall is thickened due to mucosal hyperplasia and fibrosis. As the infection becomes chronic and heavier in intensity, complications such as obstructive jaundice, dull epigastric pain, biliary stones, ascites and cholangiocarcinoma can develop (Chapman et al. 1999; Kim et al. 1993).
Over the past decade, the world has witnessed the emergence and progress of several genome projects. The last release of the db expressed sequence tags (ESTs) in 1999 contained a prodigious number of entries for parasite genomes including >20,000 ESTs for Brugia malayi, ∼12,500 for Schistosoma mansoni, and over 12,000 for Trypanosoma cruzi. In addition, an increasing number of ESTs for other parasites including Paragonimus westermani, Strongyloides stercoralis, Toxoplasma gondii and Aedes aegypti etc., have been identified (Paul 2000). Analysis of ESTs has proven to be a rapid and efficient method for characterizing the subset of genes which are expressed in a life-stage specific manner in a wide variety of tissues and organisms (Adams et al. 1995). Gaining knowledge on the genome of parasites is increasingly important in order to understand a parasite's biology, the drug resistance mechanism and antigenic variations that determine the escape from a host's immune system (Franco et al. 2000; Tawe et al. 2001). Although a C. sinensis infection is diagnosed by identifying its eggs through a microscopic examination of stools, immunological methods have recently been adopted for its diagnosis (Kang et al. 1969). The assembly of EST data specific to a protein or a DNA sequence can be used to rapidly produce a gene/protein model to understand the physical and biological properties of C. sinensis, and is useful for immunodiagnosis, determining novel target drugs and identifying potential vaccine candidates.
Materials and methods
Collection of parasite
The C. sinensis metacercariae were collected from naturally infected fish (Pseudorasbora parva) caught in the Nakdong River, Korea. These metacercariae were orally administered to the experimental rabbits. Eight weeks later, adult C. sinensis worms were obtained from the rabbit bile ducts.
Construction of the cDNA library
C. sinensis mRNA was purified by using a messenger RNA isolation kit (Stratagene, La Jolla, Calif.). Briefly, live C. sinensis worms recovered from the infected rabbits were homogenized in a denaturing solution of 4 M guanidium isothiocyanate and 0.14 M β-mercaptoethanol, and centrifuged at 12,000 g for 10 min. The supernatant was transferred to a tube containing oligo (dT) cellulose resin. The resin was washed with a high-salt buffer, and the poly (A)+mRNA was eluted with a low-salt buffer. A cDNA library of C. sinensis was constructed using a ZAP Express cDNA Gigapack II gold cloning kit (Stratagene) according to the manufacturer's instructions. Briefly, the first strand cDNA was synthesized on 5 μg of C. sinensis mRNA and the second strand was synthesized by a nick translation. The EcoRI adaptors were ligated to the blunt ends. XhoI restriction enzyme digestion resulted in directional cDNA. The cDNA was then inserted into the predigested ZAP express vector arms. Packaging was carried out in vitro with Gigapack II packaging extract. The library was plaqued on Escherichia coli XL1-Blue MRF′.
Sequencing
The phage library was converted to the plasmid form by a mass excision according to the reported protocol (Stratagene). The obtained phagemid of the library was used to infect the E. coli strain XLOLR. The bacteria were grown for 45 min and then plated at a low density on a medium containing a Luria-Bertani broth including tetracycline (10 mg/l). The bacteria were cultured at 37°C overnight, and individual colonies were selected randomly for plasmid DNA purification and sequencing. All the sequencing reactions contained the T3 sequencing primer, and were read into the 5′ end of each DNA. The reactions were run and analysed on capillary automated sequencing machines (ABI 377 XL90; Applied Biosystem, USA). The machines generated two computer files, a chromatogram file and a plain text file.
The EST nucleotide sequences and completed cDNA sequence data reported in this paper were submitted to the GenBank dbEST database under the accession nos. AF480464, AF480465, AF527454–AF527457, AT006727–AT006731, AT006733–AT006735, AT006737–AT006740, AT006742–AT006744, AT006746, AT006749, AT006753–AT006755, AT006757, AT006760–AT006762, AT006764, AT006765, AT006768, AT006769, AT006772–AT006775, AT006777, AT006778, AT006780, AT006781, AT00678, AT006784–AT006786, AT006790, AT006797, AT006798, AT006800, AT006801, AT006804, AT006806, AT006809, AT006811, AT006817, AT006818, AT006820, AT006823, AT006833, AT006834, AT006838, AT006840, AT006844, AT006846, AT006847, AT006851, AT006853–AT006855, AT006859, AT006862, AT006864, AT006869, AT006870, AT0068 72, AT006873, AT006879 AT006884, AT006888, AT006891, AT006894, AT006898, AT006902–AT006904, AT006906, AT006907, AT006909, AT006911, AT006912, AT006914–AT006929, AT006931– AT006974.
Homology comparisons
Each edited EST was translated in all six reading frames and compared with the non-redundant database at the National Centre for Biotechnology Information using the Basic Local Alignment Search Tool X (BLASTX) program, which compares the translated nucleotide sequences with the protein sequences. The homologies to the negative reading frames, with the exception of clones with the insert in the reverse orientation, were disregarded. Putative identification of the ESTs was based on the BLAST searches and in some cases on the information contained in the MEDLINE database.
Results
Sequencing of ESTs
Of the 450 sequencing reactions attempted, 415 produced readable amino acid-encoding sequences from the C. sinensis adult cDNA library (Table 1). The leading and tailing vectors as well as the poor-quality sequences were trimmed from each text file. The 3′ vector and linker sequences were removed if the poly (A+) tails could have been included in the sequencing results. Three classes of anomalous sequences were also excluded; sequences without the insert, sequences with the reverse inserts and the incorrect adaptor. The insert size of the clones ranged from 400 to 3,000 bp, and the mean size was 627 bp.
Putative identification of EST sequences
BLASTX analysis showed that 277 of the 415 clones were strongly matched (P<10-9) to other proteins. On the basis of database searches, the 277 different ESTs were classified two groups. The remaining 138 clones fell into a "no database match" category (Table 1). The first class comprised 53% (220 clones) of the sequences with an average read of 562±95 bp, and were matched to the genes with the predicted or known functions of C. sinensis and other organisms. Forty-one clones of these cDNAs were matched with C. sinensis and are related to cysteine protease, glutathione-S transferase 28 kDa and the antigen proteins. The other organisms included helminths (Paragonimus westermani, Fasciola hepatica, Schistosoma japonicum, Schistosoma mansoni, Caenorhabditis elegans), mammals (Homo sapiens, Rattus norvegicus, Mus musculus), insects (Drosophila melanogaster) and others. Secondly, 14% (57 clones) of the sequences with an average sequence read of 548±93 bp were matched with C. elegans, H. sapiens, D. melanogaster, M. musculus, and P. westermani with no assigned function. The remaining 33% of the sequence read of 417±10 bp had no significant match in the database. The first class was sorted into seven functional categories, which included the genes associated with energy metabolism (38), gene expression/RNA metabolism (21), regulatory/signalling components (14), protein metabolism/sorting (98), structure/cytoskeleton (29), membrane transporter (ten), and antigen protein (ten) (Table 2). These were matched to either the ribosomal protein, cysteine protease or the heat shock protein. In addition, the genes related to energy metabolism were matched to those for the enzymes taking part in the glycolytic pathway, TCA cycle and oxidative phosphorylation. Among the other functional categories identified, genes encoding the proteins involved with transcription and translation and those associated with regulatory and signalling functions were identified.
Discussion
Even though the identification of novel parasite molecules using conventional methods are tedious, identifying some of these molecules might serve as starting points for various studies. However, the search for C. sinensis gene expression using the EST analysis performed in this study, showed many important genes which increase and complement our knowledge of the biology of C. sinensis.
Energy production in this fluke depends largely on glycolysis (Kang et al. 1969; Hong et al. 2000). Through glycolysis, it takes up approximately 1.13 mg glucose/h per gram wet weight, produces lactate, and forms several types of amino acids as the end product from exogenous glucose (Hong et al. 2000; Han et al. 1961). Therefore, the enzymes involved in glycolysis are essential for energy metabolism. In this work, the ESTs found represented a variety of proteins related to metabolism. It has been suggested that they produce ATPs through the glycolytic pathway or aerobic metabolism (Table 2; AT006919, AT006737, AT006806, AT006873, AF480465, AT006919, 006731, AT006921). However, an initial search of the genomic basis of the fundamental biochemical pathways of C. sinensis revealed that its biosynthetic networks are fairly consistent with those of humans.
The adenylate kinase 1 (AK 1, AF480464) homologous gene was identified from the ESTs. AK 1 is indispensable for Escherichia coli (Cronan et al. 1972) and Schizosaccharomyces pombe growth (Konrad et al. 1993), indicating that it is an essential enzyme for life in a single cell. Under normal conditions, AK 1-knockout mice showed no phenotypic changes. However, under metabolic stress, compromised energetics were detected in the heart and skeletal muscle, suggesting the physiological significance of AK-catalysed phosphoryl transfer between the intracellular compartments in cellular energetic homoeostasis (Qualtieri et al. 1997; Janssen et al. 2001). The nucleoside analogues of AK have been used clinically for treating certain viral infections and malignant diseases (Pucar et al. 2000; Bourdais et al. 1996; Schneider et al. 1998). Therefore, they can be also targeted for drug research with recombinant antigens.
Several genes encoding antioxidant and detoxification enzymes, such as Cu/Zn-superoxide dismutase, and glutathione-S-transferase were identified. These proteins are believed to play a crucial role in protecting the parasite from the host immune effector mechanisms, and are being pursued as drug targets in other parasitic infections (Selkirk et al. 1998; Smooker et al. 1999). Of the proteases identified from EST analysis, the high frequency of cysteine protease expression (30 out of 415 randomly selected clones) suggests it has an important role in the metabolism and/or pathogenesis of clonorchiasis. It has also been proposed as a target for a structure-based approach for drug design (Park et al. 2001; Song and Rege 1991). In addition, one protease inhibitor was also identified. Specific protease inhibitors have also been isolated from filarial nematodes, and are suggested to play a role in inhibiting the enzymes secreted from host immune cells, the blocking of antigen processing and the control of the endogenous proteases involved in parasite development (Zang et al. 1999; Yenbutr et al. 1995). The biological role of the protease inhibitor in C. sinensis requires further investigation. The ESTs identified in this study showed a significant number of homologous genes previously reported from closely related parasitic trematodes. A homologue of the S. japonicum fatty acid binding protein (FABP) was identified in C. sinensis, and it was demonstrated that the FABP with a cross-protective efficacy could be used as a vaccine against trematode infections, such as fasiolasis (Hillyer et al. 1987, 1988; Estuningsih et al. 1997) and schistosomiasis (Tendler et al. 1996). It was also found that the ESTs contained several antigenic protein genes for C. sinensis or other parasites. The recombinant glycine-rich C. sinensis protein and proline-rich antigen were reported previously to be useful for immunodiagnosis, with a high specificity (Yang et al. 2000; Kim et al. 2001).
AT006742 and AT006744 were found to be homologous to the Schistosoma tegumental antigens. Sm 20.8 was reported to be a member of a family of soluble tegument antigens that contained the EF-hand motifs, and were recognized as antigenic targets with protective antisera (Mohamed et al. 1998). The most significant feature of the flatworm tegument is that the ultimate boundary between the parasite and host is a living plasma membrane and its associated polyanionic coating or glycocalyx. This knowledge has revolutionized our understanding of symbiotic relationships, in that it raises a new conceptual level of the significance of the host-parasite relationship. In vivo, the alimentary tracts in schistosomes and the liver fluke play a role primarily in macromolecular digestion and the subsequent absorption of soluble digestive products. However, it is likely that these are augmented by the host-derived sugars and amino acids absorbed by the tegument (Halton et al. 1997). This can also be a target for developing diagnostics with recombinant antigens.
The EST analysis using the various stages of the parasite life cycle, such as the metacercariae, which is the infective stage to the final host, will be useful for investigating gene expression and characterizing the purified proteins of interest in C. sinensis. A particular gene or a class of genes identified within the EST dataset can be used as an appropriate target for diagnostics, vaccines or drug research and for further study of the development of clonorchiasis.
References
Adams MD, Kerlavage AR, Fleischmann RD, Fuldner RA, Bult CJ, Lee NH, Kirkness EF, Weinstock KG, Gocayne JD, White O, et al. (1995) Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence. Nature 377:3–174
Bourdais J, Biondi R, Sarfati S, Guerreiro C, Lascu I, Janin J, Veron M (1996) Cellular phosphorylation of anti-HIV nucleosides. Role of nucleoside diphosphate kinase. J Biol Chem 271:7887–7890
Chapman RW (1999) Risk factors for biliary tract carcinogenesis. Ann Oncol [Suppl] 10:308–311
Crompton DWT (1999) How much human helminthiasis is there in the world? J Parasitol 85:397–403
Cronan JE Jr, Godson GN (1972) Mutants of Escherichia coli with temperature-sensitive lesions in membrane phospholipid synthesis: genetic analysis of glycerol-3-phosphate acyltransferase mutants. Mol Gen Genet 116:199–210
Estuningsih SE, Smooker PM, Wiedosari E, Widjajanti S, Vaiano S, Partoutomo S, Spithill TW (1997) Evalution of antigens of Fasciola gigantica as vaccines against tropical fasciolosis in cattle. Int J Parasitol 27:1419–1428
Franco GR, Valadado AF, Azevedo V, Rabelo EM (2000) The Schistosoma gene discovery program: state of the art. Int J Parasitol 30:453–463
Halton DW (1997) Nutritional adaptations to parasitism within the platyhelminthes. Int J Parasitol 27:693–704
Han SS, Hahn HJ, Seo BS (1961) The uptake of 14C-glucose by Clonorchis sinensis. Korean J Int Med 4:281–285
Hillyer GV, Rosa MI, Alicea H, Hemandez A (1987) Acquired resistance to Fasciola hepatica in cattle using a purified adult worm antigen. Am J Trop Med Hyg 37:363–369
Hillyer GV, De Galanes M, Fosa MI, Montealegre F (1988) Acquired immunity in schistosomiasis with purified Fasciola hepatica cross-reactive antigens. Vet Parasitol 29:265–280
Hong SJ, Seong KY, Sohn WM, Song KY (2000) Molecular cloning and immunological characterization of phosphoglycerate kinase from Clonorchis sinensis. Mol Biochem Parasitol 108:207–216
Janssen E, Dzeja PP, Oerlemans F, Simonetti AW, Heerschap A, de Haan A, Rush PS, Terjung RR, Wieringa B, Terzic A (2001) Adenylate kinase 1 gene deletion disrupts muscle energetic economy despite metabolic rearrangement. EMBO J 19:6371–6381
Kang IK, Lee SH, Seo BS (1969) Study on the 14C-glucose metabolism by Clonorchis sinensis. Paper chromatographic analyses in combination with autoradiography. Korean J Parasitol 7:43–52
Kim TY, Kang SY, Ahn IY, Cho SY, Hong SJ (2001) Molecular cloning and characterization of an antigenic protein with a repeating region from Clonorchis sinensis. Korean J Parasitol 39:57–66
Kim YI, Rim HJ, Park UB (1993) Effect of Clonorchis sinensis infection and dimethylnitrosamine administration on the induction of cholangiocarcinoma in Syrian golden hamster. Korean J Pathol 31:21–30
Konrad M (1993) Molecular analysis of the essential gene for adenylate kinase from the fission yeast Schizosaccharomyces pombe. J Biol Chem 268:11326–11334
Mohamed MM, Shalaby KA, LoVerde PT, Karim AM (1998) Characterization of Sm20.8, a member of a family of schistosome tegumental antigens. Mol Biochem Parasitol 96:15–25
Park SY, Lee KH, Hwang YB, Kim KY, Park SK, Hwang HA, Sakanari JA, Hong KM, Kim SI, Park H (2001) Characterization and large-scale expression of the recombinant cysteine proteinase from adult Clonorchis sinensis. J Parasitol 87:1454–1458
Paul J (2000) Brindley. Parasite genomes. Int J Parasitol 30:327
Pucar D, Janssen E, Dzeja PP, Juranic N, Macura S, Wieringa B, Terzic A (2000) Compromised energetics in the adenylate kinase AK1 gene knockout heart under metabolic stress. J Biol Chem 275:41424–41429
Qualtieri A, Pedace V, Bisconte MG, Bria M, Gulino B, Andreoli V, Brancati C (1997) Severe erythrocyte adenylate kinase deficiency due to homozygous A→G substitution at codon 164 of human AK1 gene associated with chronic haemolytic anaemia. Br J Haematol 99:770–776
Rim HJ (1998) Field investigations on epidemiology and control of fish-borne parasites in Korea. Int J Food Sci Technol 33:157–168
Schneider B, Xu YW, Sellam O, Sarfati R, Janin J, Veron M, Deville-Bonne D (1998) Pre-steady state of reaction of nucleoside diphosphate kinase with anti-HIV nucleotides. J Biol Chem 273:11491–11497
Selkirk ME, Smith VP, Thomas GR, Gounaris K (1998) Resistance of filarial nematode parasites to oxidative stress. Int J Parasitol 28:1315–1332
Smooker PM, Steeper KR, Drew DR, Strugnell RA, Spithill TW (1999) Humoral responses in mice following vaccination with DNA encoding glutathione S-transferase of Fasciola hepatica: effects of mode of vaccination and the cellular compartment of antigen expression. Parasite Immunol 21:357–364
Song CY, Rege AA (1991) Cysteine proteinase activity in various developmental stages of Clonorchis sinensis: a comparative analysis. Comp Biochem Physiol B 99:137–144
Tawe W, Hashmi S, Lustigman S (2001) Identification of filarial vaccine and drug target candidates by EST analysis. Trends Parasitol 17:204–206
Tendler M, Brito CA, Vilar MM, Serra-Preire N, Diogo CM, Almeida MS, Delbem AC, Da Silva JP, Savino W, Garratt RC, Katz N, Simpson AS (1996) Schistosoma mansoni fatty acid-binding protein, Sm 14, is the potential basis of a dual-purpose anti-helminth vaccine. Proc Natl Acad Sci USAA93:269–273
Yang HJ, Park SJ, Im KI, Yong TS (2000) Identification of a Clonorchis sinensis gene encoding a vitellaria antigenic protein containing repetitive sequences. Mol Biochem Parasitol 111:213–216
Yenbutr P, Scott AL (1995) Molecular cloning of a serine proteinase inhibitor from Brugia malayi. Infect Immun 63:1745–1753
Zang X, Yazdanbakhsh M, Jiang H, Kanost MR, Maizels RM (1999) A novel serpin expressed by blood-borne microfilariae of the parasitic nematode Brugia malayi inhibits human neutrophil serine proteinases. Blood 94:1418–1428
Acknowledgement
This study was supported by the Brain Korea 21 Project for Medical Science, Yonsei University.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lee, JS., Lee, J., Park, SJ. et al. Analysis of the genes expressed in Clonorchis sinensis adults using the expressed sequence tag approach. Parasitol Res 91, 283–289 (2003). https://doi.org/10.1007/s00436-003-0962-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00436-003-0962-y