Skip to main content
Log in

Mining of expressed sequence tag libraries of cacao for microsatellite markers using five computational tools

  • Research Article
  • Published:
Journal of Genetics Aims and scope Submit manuscript

Abstract

Expressed sequence tags (ESTs) provide researchers with a quick and inexpensive route for discovering new genes, data on gene expression and regulation, and also provide genic markers that help in constructing genome maps. Cacao is an important perennial crop of humid tropics. Cacao EST sequences, as available in the public domain, were downloaded and made into contigs. Microsatellites were located in these ESTs and contigs using five softwares (MISA, TRA, TROLL, SSRIT and SSR primer). MISA gave maximum coverage of SSRs in cacao ESTs and contigs, although TRA was able to detect higher order (>5-mer) repeats. The frequency of SSRs was one per 26.9 kb in the known set of ESTs. One-third of the repeats in EST-contigs were found to be trimeric. A few rare repeats like 21-mer repeat were also located. A/T repeats were most abundant among the mononucleotide repeats and the AG/GA/TC/CT type was the most frequent among dimerics. Flanking primers were designed using Primer3 program and verified experimentally for PCR amplification. The results of the study are made available freely online database (http://riju.byethost31.com/cocoa/). Seven primer pairs amplified genomic DNA isolated from leaves were used to screen a representative set of 12 accessions of cacao.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aggarwal R. K., Prasad S. H., Varshney R. K., Prasanna R. B., Krishnakumar V., Lalji S. et al. 2007 Identification, characterization and utilization of EST-derived genic microsatellite markers for genome analyses of coffee and related species. Theor. Appl. Genet. 114, 359–372.

    Article  PubMed  CAS  Google Scholar 

  • Araujo J. S., Intorne A. C., Pereire M. G., Lopes U. V. and deSouza Filko G. A. 2007 Development and characterization of novel tetra, tri and dinucletotide repeat microsatellite markers in cacao (Theobrama cacao L.) Mol. Breed. 20, 73–81.

    Article  CAS  Google Scholar 

  • Bilgen M., Karaca M., Onus A. and Ince A. 2004 A software program combining sequence motif searches with keywords for finding repeats containing DNA sequences. Bioinformatics 20, 3379–3386.

    Article  PubMed  CAS  Google Scholar 

  • Borrone J. W., Brown J. S., Kuhn D. N., Motamayor J. C. and Schnell R. J. 2007 Microsatellite markers developed from Theobroma cacao L. expressed sequence tags. Mol. Ecol. Notes 7, 236–239.

    Article  CAS  Google Scholar 

  • Cardle L., Ramsay L., Milbourne D., Macaulay M., Marshall D. and Waugh R. 2000 Computational and experimental characterization of physically clustered simple sequence repeats in plants. Genetics 156, 847–854.

    PubMed  CAS  Google Scholar 

  • Couch J. A., Zintel H. A. and Fritz P. J. 1993 The genome of the tropical tree Theobroma cacao L. Mol. Gen. Genet. 237, 123–128.

    Article  PubMed  CAS  Google Scholar 

  • Dice L. R. 1945 Measures of the amount of ecologic association between species. Ecology 26, 297–302.

    Article  Google Scholar 

  • Ewing B. and Green P. 1998 Basecalling of automated sequencer traces using phred II. error probabilities. Genome. Res. 8, 186–194.

    PubMed  CAS  Google Scholar 

  • Gao L. F., Tang J. F., Li H. W. and Jia J. Z. 2003 Analysis of microsatellites in major crops assessed by computational and experimental approaches. Mol. Breed. 12, 245–261.

    Article  CAS  Google Scholar 

  • Jurka J. and Pethiygoda C. 1995 Simple repetitive DNA sequences from primates: compilation and analysis. J. Mol. Evol. 40, 120–126.

    Article  PubMed  CAS  Google Scholar 

  • Kantety R. V., La Rota M., Matthews D. E. and Sorrells M. E. 2002 Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum, and wheat. Plant Mol. Biol. 48, 501–510.

    Article  PubMed  CAS  Google Scholar 

  • Katti M. V., Ranjekar P. K. and Gupta V. S. 2001 Differential distribution of simple sequence repeats in eukaryotic genome sequences. Mol. Biol. Evol. 18, 1161–1167.

    PubMed  CAS  Google Scholar 

  • Kumpatla S. P. and Mukhopadhyay S. 2005 Mining and survey of simple sequence repeats in expressed sequence tags of dicotyledonous species. Genome 48, 985–998.

    Article  PubMed  CAS  Google Scholar 

  • Lawson M. J. and Zhang L. 2006 Distinct patterns of SSR distribution in the Arabidopsis thaliana and rice genomes. Genome Biol. 7, R14.

    Article  PubMed  Google Scholar 

  • Lima L. S., Gramacho K. P., Gesteira A. S., Lopes U. V., Gaiotto F. A., Zaidan H. A. et al. 2008 Characterization of microsatellites from cacao-Moniliophthora perniciosa interaction expressed sequence tags. Mol. Breed. 22, 315–318.

    Article  CAS  Google Scholar 

  • Martins W., De Sousa D., Proite K., Guimaraes P., Moretzsohn M. and Bertioli D. 2006 New softwares for automated microsatellite marker development. Nucleic Acids Res. 34, e31.

    Article  PubMed  Google Scholar 

  • Metzgar D., Bytof J. and Wills C. 2000 Selection against frameshift mutations limits microsatellite expansion in coding DNA. Genome Res. 10, 72–80.

    PubMed  CAS  Google Scholar 

  • Newcomb R. D., Crowhurst R. N., Gleave A. P., Rikkerink E. H. A., Allan A. C., Beuning L. L. et al. 2006 Analysis of expressed sequence tags from apple. Plant Physiol. 141, 147–166.

    Article  PubMed  Google Scholar 

  • Pearson C., Sinden E. and Richard R. 1998 Trinucleotide repeat DNA structure: Dynamic mutations from dynamic DNA. Curr. Opin. Structl. Biol. 36, 884–889.

    Google Scholar 

  • Powell W. W., Machery G. C. and Provan J. 1996 Polymorphism revealed by simple sequence repeats. Trends Genet. 1, 76–83.

    Google Scholar 

  • Pugh T., Fouet O., Tisterucci A. M., Brottier P., Abouladze M., Deletrez C. et al. 2004 A new cacao linakage map based on codominant markers: development and integration of 201 new microsatellite markers. Theor. Appl. Genet. 108, 1151–1161.

    Article  PubMed  CAS  Google Scholar 

  • Quackenbush J., Cho D., Lee F. L., Hott I., Karamychera S. and Parizi B. 2001 The TIGR gene indices: analysis of gene transient sequences in highly sample eukaryotic species. Nucleic Acids Res. 29, 159–164.

    Article  PubMed  CAS  Google Scholar 

  • Rabello E., Nunes de Souza A., Saito D. and Tsai S.M. 2005 In silico characterization of microsatellites in Eucalyptus spp.: abundance, length variation and transposon associations. Genet. Mol. Biol. 28, 582–588.

    Article  CAS  Google Scholar 

  • Robinson A. J., Love C. G., Batley J., Barker G. and Edwards D. 2004 Simple sequence repeat marker loci discovery using SSR primer. Bioinformatics 20, 1475–1476.

    Article  PubMed  CAS  Google Scholar 

  • Rohlf F. J. 1998 On applications of geometric morphometrics to studies of ontogeny and phylogeny. Sys. Biol. 47, 147–158.

    Article  CAS  Google Scholar 

  • Smith J. S. C., Chin E. C. L., Shu H., Smith O. S., Wall S. J., Senior M. L. et al. 1997 An evaluation of the utility of SSR loci as molecular markers in maize (Zea mays L.): comparisons with data from RFLPs and pedigree. Theor. Appl. Genet. 95, 163–173.

    Article  CAS  Google Scholar 

  • Sneath P. H. A. and Sokal R. R. 1973 Numerical taxonomy, the principles and practice of numerical classification. W. H. Freeman, San Francisco.

    Google Scholar 

  • Sreenu V. B., Kumar P., Nagaraju J. and Nagarajaram H. A. 2007 Simple sequence repeats in mycobacterial genomes. J. Biosci. 32, 3–15.

    Article  PubMed  CAS  Google Scholar 

  • Tang J. F., Gao L., Cao Y. and Jia J. 2006 Homologous analysis of EST-SSRs and transferability of wheat SSR-EST markers across barley, rice and maize. Euphytica 151, 87–93.

    Article  CAS  Google Scholar 

  • Temnykh S., DeClerck G., Lukashova A., Lipovich L., Cartinhour S. and McCouch S. 2001 Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Res. 11, 1441–1452.

    Article  PubMed  CAS  Google Scholar 

  • Thiel T., Michalek V. and Graner A. 2003 Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106, 411–422.

    PubMed  CAS  Google Scholar 

  • Varshney R. K., Graner A. and Sorrells M. E. 2005 Genic microsatellite markers in plants: features and applications. Trends Biotech. 23, 48–55.

    Article  CAS  Google Scholar 

  • Varshney R. K., Thiel T., Stein N., Langridge P. and Graner A. 2002 In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species. Cell Mol. Biol. Lett. 7, 537–546.

    PubMed  CAS  Google Scholar 

  • Verica J., Maximova S., Strem M., Carlson J., Bailey B. and Guiltinan M. 2004 Isolation of ESTs from cacao (Theobroma cacao L.) leaves treated with inducers of the defense response Plant Cell Rep. 23, 404–413.

    Article  PubMed  CAS  Google Scholar 

  • von Stackelberg M., Rensin S. A. and Reskhi R. 2006 Identification of genic moss SSR markers and a comparative analysis of 24 algal and plant gene indices reveal specific rather group specific characteristics of microsatellites. BMC Plant Biol. 6, 9.

    Article  Google Scholar 

  • Yasodha R., Sumathi R., Chezhian P., Kavitha S. and Ghosh M. 2008 Eucalyptus microsatellites mined in silico: survey and evaluation. J. Genet. 87, 21–25.

    Article  PubMed  CAS  Google Scholar 

  • Yu J. K., La Rota M., Kantety R. V. and Sorrells M. E. 2004 EST derived SSR markers for comparative mapping in wheat and rice. Mol. Gen. Genomics 271, 742–751.

    Article  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vadivel Arunachalam.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Riju, A., Rajesh, M.K., Sherin, P.T.P.F. et al. Mining of expressed sequence tag libraries of cacao for microsatellite markers using five computational tools. J Genet 88, 217–225 (2009). https://doi.org/10.1007/s12041-009-0030-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12041-009-0030-1

Keywords

Navigation