Molecular Genetics and Genomics

, Volume 276, Issue 1, pp 1–12

Analysis of papaya BAC end sequences reveals first insights into the organization of a fruit tree genome

  • Chun Wan J. Lai
  • Qingyi Yu
  • Shaobin Hou
  • Rachel L. Skelton
  • Meghan R. Jones
  • Kanako L. T. Lewis
  • Jan Murray
  • Moriah Eustice
  • Peizhu Guan
  • Ricelle Agbayani
  • Paul H. Moore
  • Ray Ming
  • Gernot G. Presting
Original Paper

DOI: 10.1007/s00438-006-0122-z

Cite this article as:
Lai, C.W.J., Yu, Q., Hou, S. et al. Mol Genet Genomics (2006) 276: 1. doi:10.1007/s00438-006-0122-z

Abstract

Papaya (Carica papaya L.) is a major tree fruit crop of tropical and subtropical regions with an estimated genome size of 372 Mbp. We present the analysis of 4.7% of the papaya genome based on BAC end sequences (BESs) representing 17 million high-quality bases. Microsatellites discovered in 5,452 BESs and flanking primer sequences are available to papaya breeding programs at http://www.genomics.hawaii.edu/papaya/BES. Sixteen percent of BESs contain plant repeat elements, the vast majority (83.3%) of which are class I retrotransposons. Several novel papaya-specific repeats were identified. Approximately 19.1% of the BESs have homology to Arabidopsis cDNA. Increasing numbers of completely sequenced plant genomes and BES projects enable novel approaches to comparative plant genomics. Paired BESs of Carica, Arabidopsis, Populus, Brassica and Lycopersicon were mapped onto the completed genomes of Arabidopsis and Populus. In general the level of microsynteny was highest between closely related organisms. However, papaya revealed a higher degree of apparent synteny with the more distantly related poplar than with the more closely related Arabidopsis. This, as well as significant colinearity observed between peach and poplar genome sequences, support recent observations of frequent genome rearrangements in the Arabidopsis lineage and suggest that the poplar genome sequence may be more useful for elucidating the papaya and other rosid genomes. These insights will play a critical role in selecting species and sequencing strategies that will optimally represent crop genomes in sequence databases.

Keywords

Bacterial artificial chromosome Carica papaya Comparative genomics Microsatellite Genome mapping 

Abbreviations

BAC

Bacterial artificial chromosome

BES

BAC end sequence

kb

Kilobase

Mbp

Megabase pairs

MYA

Million years ago

nt

Nucleotide

SSR

Simple sequence repeat

Supplementary material

438_2006_122_MOESM1_ESM.pdf (205 kb)
Supplementary material

Copyright information

© Springer-Verlag 2006

Authors and Affiliations

  • Chun Wan J. Lai
    • 1
  • Qingyi Yu
    • 2
  • Shaobin Hou
    • 3
  • Rachel L. Skelton
    • 2
  • Meghan R. Jones
    • 2
  • Kanako L. T. Lewis
    • 3
  • Jan Murray
    • 2
  • Moriah Eustice
    • 1
    • 2
  • Peizhu Guan
    • 1
    • 2
  • Ricelle Agbayani
    • 1
    • 2
  • Paul H. Moore
    • 4
  • Ray Ming
    • 2
    • 5
  • Gernot G. Presting
    • 1
  1. 1.Department of Molecular Biosciences and BioengineeringUniversity of Hawai‘iHonoluluUSA
  2. 2.Hawaii Agriculture Research CenterAieaUSA
  3. 3.Center for Genomics, Proteomics and Bioinformatics Research InitiativeUniversity of Hawai‘iHonoluluUSA
  4. 4.USDA-ARS, Pacific Basin Agricultural Research CenterHiloUSA
  5. 5.Department of Plant BiologyUniversity of Illinois at Urbana-ChampaignUrbanaUSA

Personalised recommendations