Sequencing and Assembly of the Transgenic Papaya Genome

  • Robert VanBuren
  • Ray MingEmail author
Part of the Plant Genetics and Genomics: Crops and Models book series (PGG, volume 10)


Papaya is an excellent model for studying fruit tree genomics. Papaya has a short generation time of 9–12 months, a small genome of 372 Mbp, and little residual within genome heterozygosity, and it produces an abundance of offspring. Papaya is also significant agriculturally, as it produces one of the world’s most nutritious fruits, and it has a pair of incipient sex chromosomes that are well suited for studying sex chromosome evolution. These features spurred the papaya genome sequencing project. The papaya genome was sequenced using 2.8 million whole-genome shotgun reads from a female transgenic SunUp cultivar. The shotgun assembly and physical maps were integrated with a genetic map to generate the draft genome. Analysis of the draft genome revealed fewer genes than any other sequenced plant genome, making it an excellent candidate for studying biosynthesis pathways. Synteny analysis suggests that papaya has no recent genome-wide duplications, which could potentially make it a good model of the ancient angiosperm genome. Most gene families have fewer members in papaya, with several exceptions, including drastic expansion in the MADs-box family. The draft genome sequence of papaya accelerated papaya sex chromosome research and facilitated comparative genomic studies in angiosperm.


Linkage Group Draft Genome Scaffold Sequence Sequence Plant Genome Papaya Ringspot Virus 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Ackerman CM, Yu Q, Kim S, Paull RE, Moore PH, Ming R (2008) B-class MADS-box genes in trioecious papaya: two TM6 paralogs, CpTM6-1 and CpTM6-2, and a PI ortholog CpPI. Planta 227:741–753PubMedCrossRefGoogle Scholar
  2. Alvarez-Buylla ER, Pelaz S, Liljegren SJ, Gold SE, Burgeff C, Ditta GS, Ribas dePouplana L, Martinez-Castill L, Yanofsky MF (2000) An ancestral MADS-box gene duplication occurred before the divergence of plants and animals. Proc Natl Acad Sci USA 97:5328–5333PubMedCrossRefGoogle Scholar
  3. Arumuganathan K, Earle ED (1991) Nuclear DNA content of some important plant species. Plant Mol Biol Rep 9:208–218CrossRefGoogle Scholar
  4. Bemer M, Wolters-Arts M, Grossniklaus U, Angenent GC (2008) The MADS domain protein DIANA acts together with AGAMOUS-LIKE80 to specify the central cell in Arabidopsis ovules. Plant Cell 20:2088–2101PubMedCrossRefGoogle Scholar
  5. Bowers JE, Chapman BA, Rong JK, Paterson AH (2003) Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422:433–438PubMedCrossRefGoogle Scholar
  6. Chandrika UG, Jansz ER, Wickramasinghem SMDN, Warnasuriya ND (2003) Carotenoids in yellow- and red-fleshed papaya (Carica papaya L). J Sci Food Agric 83:1279–1282CrossRefGoogle Scholar
  7. Chen C, Yu Q, Hou S, Li Y, Eustice M, Skelton RL et al (2007) Construction of a sequence-tagged high density genetic map of papaya for comparative structural and evolutionary genomics in Brassicales. Genetics 177:2481–2491PubMedCrossRefGoogle Scholar
  8. Colombo M, Masiero S, Vanzulli S, Lardelli P, Kater MM, Colombo L (2008) AGL23, a type I MADS-box gene that controls female gametophyte and embryo development in Arabidopsis. Plant J 54:1037–1048PubMedCrossRefGoogle Scholar
  9. Cui L, Wall PK, Leebens-Mack JH, Lindsay BG, Soltis DE et al (2006) Widespread genome duplications throughout the history of flowering plants. Genome Res 16:738–749PubMedCrossRefGoogle Scholar
  10. Gramzow L, Ritz MS, Theissen G (2010) On the origin of MADS-domain transcription factors. Trends Genet 26:149–153PubMedCrossRefGoogle Scholar
  11. Hanada K, Zhang X, Borevitz JO, Li WH, Shiu SH (2007) A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection. Genome Res 17:632–640PubMedCrossRefGoogle Scholar
  12. Huang S, Li R, Zhang Z, Li L, Gu X et al (2009) The genome of the cucumber, Cucumis sativus L. Nat Genet 41:1275–1281PubMedCrossRefGoogle Scholar
  13. International Rice Genome Sequencing Project (2005) The map-based sequence of the rice genome. Nature 436:793–800CrossRefGoogle Scholar
  14. Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A et al (2007) The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449:463–465PubMedCrossRefGoogle Scholar
  15. Kohler C, Hennig L, Spillane C, Pien S, Gruissem W, Grossniklaus U (2003) The Polycomb group protein MEDEA regulates seed development by controlling expression of the MADS-box gene PHERES1. Genes Dev 17:1540–1553PubMedCrossRefGoogle Scholar
  16. Leseberg CH, Li A, Kang H, Duvall M, Mao L (2006) Genome-wide analysis of the MADS-box gene family in Populus trichocarpa. Gene 378:84–94PubMedCrossRefGoogle Scholar
  17. Liu Z, Moore PH, Ma H, Ackerman CM, Makandar R, Yu Q, Pearl HM, Kim MS, Charlton JW, Stiles JI, Zee FT, Paterson AH, Ming R (2004) A primitive Y chromosome in papaya marks incipient sex chromosome evolution. Nature 427:348–352PubMedCrossRefGoogle Scholar
  18. Lius S, Manshardt RM, Fitch MMM, Slightom JL, Sanford JC et al (1997) Pathogen-derived resistance provides papaya with effective protection against papaya ringspot virus. Mol Breed 3:161–168CrossRefGoogle Scholar
  19. Manshardt RM, Wenslaff TF (1989) Interspecific hybridization of papaya with other Carica species. J Am Soc Hortic Sci 114:689–694Google Scholar
  20. Ming R, Moore P, Zee F, Abbey C, Ma H, Paterson AH (2001) A papaya BAC library as a foundation for molecular dissection of a tree-fruit genome. Theor Appl Genet 102:892–899CrossRefGoogle Scholar
  21. Ming R, Hou S, Feng Y, Yu Q et al (2008) The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature 452:991–996PubMedCrossRefGoogle Scholar
  22. Nagarajan N, Navajas-Pérez R, Pop M, Alam M, Ming R, Paterson AH, Salzberg SL (2008) Genome-wide analysis of repetitive elements in papaya. Trop Plant Biology 1:191–201CrossRefGoogle Scholar
  23. Olsen A, Ernst H, Leggio LL, Skriver K (2005) NAC transcription factors: structurally distinct, functionally diverse. Trends Plant Sci 10:79–87PubMedCrossRefGoogle Scholar
  24. Parenicova L, de Folter S, Kieffer M, Horner DS, Favalli C, Busscher J, Cook HE, Ingram RM, Kater MM, Davies B, Angenent GC, Colombo L (2003) Molecular and phylogenetic analyses of the complete MADS-box transcription factor family in Arabidopsis: new openings to the MADS world. Plant Cell 15:1538–1551PubMedCrossRefGoogle Scholar
  25. Paterson AH, Bowers JE, Chapman BA (2004) Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci USA 101:9903–9908PubMedCrossRefGoogle Scholar
  26. Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J et al (2009) The Sorghum bicolor genome and the diversification of grasses. Nature 457:551–556PubMedCrossRefGoogle Scholar
  27. Sawasaki Y, Inomata K, Yoshida K (1998) Trans-kingdom conjugation between Agrobacterium tumefaciens and Saccharomyces cerevisiae, a bacterium and a yeast. Plant Cell Physiol 37:103–106CrossRefGoogle Scholar
  28. Schmutz J, Cannon S, Schlueter J, Ma J et al (2010) Genome sequence of the palaeopolyploid soybean. Nature 463:178–183PubMedCrossRefGoogle Scholar
  29. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S et al (2009) The B73 maize genome: complexity, diversity, and dynamics. Science 326:1112–1115PubMedCrossRefGoogle Scholar
  30. Storey WB (1969) Papaya. In: Ferwerda FP, Wit F (eds) Outlines of perennial crop breeding in the tropics. H Veenman & Zonen, Wageningen, pp 389–408Google Scholar
  31. Tang H, Bowers JE, Wang X, Ming R, Alam M, Paterson AH (2008) Synteny and collinearity in plant genomes. Science 320:486–488PubMedCrossRefGoogle Scholar
  32. The Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815CrossRefGoogle Scholar
  33. Timmis JN, Ayliffe MA, Huang CY, Martin W (2004) Endosymbiotic gene transfer: organelle genomes forge eukaryotic chromosomes. Nat Genet Rev 5:123–135CrossRefGoogle Scholar
  34. Tuskan GA et al (2006) The genome of black cottonwood, Populus trichocarpa (Torr & Gray). Science 313:1596–1604PubMedCrossRefGoogle Scholar
  35. Velasco R, Zharkikh A, Affourtit J, Dhingra A et al (2010) The genome of the domesticated apple (Malus × domestica Borkh.). Nat Genet 42:833–839PubMedCrossRefGoogle Scholar
  36. Wall PK, Leebens-Mack J, Muller K, Field D, Altman N, dePamphilis CW (2008) PlantTribes: a gene and gene family resource for comparative genomics in plants. Nucleic Acids Res 36:970–976CrossRefGoogle Scholar
  37. Wang J, Na JK, Yu Q, Gschwend AR, Han J et al (2012) Sequencing papaya X and Yh chromosomes reveals molecular basis of incipient sex chromosome evolution. Proc Natl Acad Sci USA 109:13710–13715PubMedCrossRefGoogle Scholar
  38. Wood TE, Takebayashi N, Barker MS, Mayrose I, Greenspoon PB, Rieseberg LH (2009) The frequency of polyploidy speciation in vascular. Proc Natl Acad Sci USA 106:13875–13879PubMedCrossRefGoogle Scholar
  39. Yu Q, Steiger D, Kramer E, Moore P, Ming R (2008) Floral MADS-Box genes in trioecious papaya: characterization of AG and AP1 subfamily genes revealed a sex-type-specific gene. Trop Plant Biol 1:97–107CrossRefGoogle Scholar
  40. Yu Q, Tong E, Skelton RL, Bowers JE, Jones MR, Murray JE et al (2009) A physical map of the papaya genome with integrated genetic map and genome sequence. BMC Genomics 10:371–382PubMedCrossRefGoogle Scholar
  41. Zhang YJ, Wang LJ (2005) The WRKY transcription factor superfamily: its origin in eukaryotes and expansion in plants. BMC Evol Biol 5:1–12PubMedCrossRefGoogle Scholar
  42. Zhang W, Wai CM, Ming R, Yu Q, Jiang J (2010) Integration of genetic and cytological maps and development of a pachytene chromosome-based karyotype in papaya. Trop Plant Biol 3:166–170CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2014

Authors and Affiliations

  1. 1.Department of Plant BiologyUniversity of Illinois at Urbana-ChampaignUrbanaUSA

Personalised recommendations