Sequencing and Utilization of the Gossypium Genomes

Paterson, Andrew H.; Rong, Jun-kang; Gingle, Alan R.; Chee, Peng W.; Dennis, Elizabeth S.; Llewellyn, Danny; Dure, Leon S.; Haigler, Candace; Myers, Gerald O.; Peterson, Daniel G.; ur Rahman, Mehboob; Zafar, Yusuf; Reddy, Umesh; Saranga, Yehoshua; Stewart, James M.; Udall, Joshua A.; Waghmare, Vijay N.; Wendel, Jonathan F.; Wilkins, Thea A.; Wright, Robert J.; Zaki, Essam; Hafez, Elsayed E.; Zhu, Jun

doi:10.1007/s12042-010-9051-4

Published: 14 April 2010

Volume 3, pages 71–74, (2010)

Tropical Plant Biology

Abstract

Revealing the genetic underpinnings of cotton productivity will require understanding both the prehistoric evolution of spinnable fibers, and the results of independent domestication processes in both the Old and New Worlds. Progress toward a reference sequence for the smallest Gossypium genome is a logical stepping-stone toward revealing diversity in the remaining seven genomes (A, B, C, E, F, G, K) that permitted Gossypium species to adapt to a wide range of ecosystems in warmer arid regions of the world, and toward identifying the emergent properties that account for the superior productivity and quality of tetraploid cottons. The greatest challenge facing the cotton community is not genome sequencing per se but the conversion of sequence to knowledge.

It is fitting that, during the recently ended International Year of Natural Fibers (http://www.naturalfibres2009.org/), progress in sequencing the genomes of the cotton genus (Gossypium) accelerated rapidly, opening many novel opportunities to advance knowledge of organic evolution. Of singular importance is dissecting the evolution of the ‘lint fiber’ that sustains the textile industry, whose aggregate influence is estimated at ∼$120 billion/yr on US gross domestic product and ∼$500 billion/yr worldwide. “There are only a few cells in the plant kingdom that are as exaggerated in their size or composition as cotton fibers”, and some of these single-celled seed epidermal trichomes “... reach lengths of over 6 cm, or one-third the height of an Arabidopsis plant” (Kim and Triplett 2001).

Cotton is unusual among major crops in having been domesticated independently four times at two different ploidy levels. Spinnable fibers evolved in the Old World A genome lineage in the past 5–7 million years (Senchina et al. 2003; Udall et al. 2006). Domestication of the A genome cottons G. herbaceum and/or G. arboreum may have started before 6000 BC in Pakistan (Moulherat et al. 2002). In parallel, by 3500–2300 BC (Stephens and Moseley 1974), New World aboriginals were utilizing two tetraploid species that arose from natural hybridization between an A genome species and a New World D genome species. A and D genome taxa diverged ∼5–10 million years ago (Senchina et al. 2003; Udall et al. 2006), reuniting by polyploidization ∼1–2 million years ago following trans-oceanic dispersal of an A genome propagule to the New World (Wendel 1989). The ancestral allopolyploid spawned two species that were independently domesticated (G. hirsutum, or ‘Upland’ cotton; and G. barbadense, including forms referred to as ‘Sea Island’, Egyptian, and Pima cotton), and three species known only in the wild, native to the Galapagos (G. darwinii), Hawaii (G. tomentosum), and Brazil (G. mustelinum).

Revealing the genetic underpinnings of cotton productivity will require understanding both the prehistoric evolution of spinnable fibers, and the results of independent domestication processes in both the Old and New Worlds. In particular, the New World D genome (similar to that of extant G. raimondii) played a surprising role in cotton improvement. Although no D genome species produces spinnable fiber, more than half of the genetic differences in fiber traits between the two domesticated tetraploid species map to D-genome chromosomes (Jiang et al. 1998; Rong et al. 2007). Moreover, gene expression in tetraploid cotton fiber shows a similar bias in favor of D-genome alleles (Hovav et al. 2008). These data support the hypothesis that the superior fiber yield and quality of tetraploids may be an emergent property of combining two genomes (Jiang et al. 1998). Indeed, cotton has gone ‘full circle’: evolution of spinnable fibers may have unwittingly provided the Old World A genome a dispersal mechanism by which to transiently colonize the New World and permit the tetraploid to form. In turn, in the post-Columbian era, more productive and finer-quality New World tetraploids have largely supplanted cultivated diploids in the Old World.

Cotton enjoys many opportunities to participate in a bio-based products revolution that may reduce dependence on petrochemicals (National Research Council 2000). Cotton fiber with increased uniformity, durability, and strength might replace synthetic fibers that require ∼230 million barrels of petroleum per year to produce in the USA alone. Cotton seed oil and byproducts of fiber processing are raw materials for biofuel production (Holt et al. 2003).

Discovery and utilization of new Gossypium diversity may be especially important for sustainable cotton production because of the crop's narrow gene pool (Chee et al. 2004; Lubbers et al. 2004). The natural ‘genetic bottleneck’ imposed by polyploid formation has been exacerbated by repeatedly crossing relatively few closely related genotypes to one another to breed new cultivars (May et al. 1995) and by using only a few cultivars to deploy transgenes (Helms 2000). For example, a looming worldwide water crisis (UNESCO 2002) makes it important to identify adaptations that permitted wild cottons to endure periodic drought and temperature extremes (Kohel et al. 1974), and to restore valuable alleles that may have been “left behind” during domestication (Gur and Zamir 2004), so as to create cultivars that produce more with less (water).

DNA sequencing promises to reveal the spectrum of diversity in the Gossypium genus. A high degree of conservation of gene order and sequence suggests that the vast majority of data from diploids will extrapolate to tetraploids (Rong et al. 2004). Accordingly, obtaining a reference sequence of the smallest Gossypium genome (D, ∼900 Mb) is a logical stepping-stone toward characterizing the larger A diploid (∼1700 Mb) and AD tetraploid (∼2500 Mb) genomes (Paterson 2007; Chen et al. 2007). Rapid, low-cost re-sequencing might then be sufficient to reveal diversity in the remaining six genomes (B, C, E, F, G, K) that permitted Gossypium species to adapt to a wide range of ecosystems in warmer, arid regions of the world. The US Department of Energy Joint Genome Institute has completed a 0.4x genome-equivalent ‘pilot study’ of G. raimondii that strongly supports the feasibility of assembling a whole-genome shotgun (WGS) sequence (A.H.P. and X. Wang, unpubl. data), and has begun further sequencing (www.jgi.doe.gov/sequencing/cspseqplans2009.html). Early explorations of the A and AD genomes are also in progress.
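
As a rough illustration of the arithmetic behind these figures, the following sketch converts a genome-equivalent coverage into the approximate amount of raw sequence it implies, and compares the A and AD genomes to the D genome. The genome sizes are the approximate values quoted above; the function names are hypothetical and serve only this illustration.

```python
# Approximate genome sizes in megabases (Mb), as quoted in the text.
GENOME_SIZE_MB = {"D": 900, "A": 1700, "AD": 2500}


def raw_sequence_mb(genome: str, coverage: float) -> float:
    """Return the raw sequence (Mb) implied by a genome-equivalent coverage.

    Hypothetical helper for illustration: coverage multiplied by genome size.
    """
    return coverage * GENOME_SIZE_MB[genome]


def size_relative_to_d(genome: str) -> float:
    """Return how many times larger a genome is than the D genome."""
    return GENOME_SIZE_MB[genome] / GENOME_SIZE_MB["D"]


if __name__ == "__main__":
    # The 0.4x pilot study of G. raimondii (D genome).
    print(f"0.4x of D: ~{raw_sequence_mb('D', 0.4):.0f} Mb of raw sequence")
    for name in ("A", "AD"):
        print(f"{name} is ~{size_relative_to_d(name):.1f}x the size of D")
```

Under these approximate sizes, the 0.4x pilot corresponds to roughly 360 Mb of raw sequence, and the same coverage of the AD tetraploid would require nearly three times as much.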

Because cotton is a leading crop in the implementation of transgenes in agriculture, a reference genome sequence may expedite ongoing development and stewardship of genetically modified (GM) cotton. It will become easy to determine whether each transgene insertion site is in euchromatin or heterochromatin, and to identify any genes inadvertently disrupted. Identification of genomic characteristics associated with favorable expression of transgenic traits might reduce the need for costly empirical testing of numerous transgenic insertions to commercialize one. Unifying principles of useful transgene insertions might be found by comparison to the only transgenic plant sequenced to date, papaya, in which five of six insertions were in nuclear-encoded DNA fragments of chloroplast origin, with four matching topoisomerase I recognition sites (Ming et al. 2008). Using the sequence to identify DNA markers closely linked to transgenes may reduce the undesirable chromatin (and traits) transmitted to elite genotypes from the otherwise-obsolete cottons that are most efficiently transformed.

The greatest challenge facing the cotton community is not genome sequencing per se but the conversion of sequence to knowledge. Completion of the Arabidopsis thaliana sequence was quickly followed by inception of the NSF 2010 project, which has greatly increased knowledge about the functions of Arabidopsis genes at a cost approaching $200 million. While the functions of perhaps half of the cotton genes might be deduced by analogy to those of Arabidopsis (Rong et al. 2005), de novo functional analysis of the remaining cotton genes faces the disadvantages of ∼20 times as much DNA, the necessity of completing its longer life cycle to see effects on the primary organ of commerce (seedborne lint fiber), and a larger body that cannot complete its life cycle in a test tube.

Realizing the potential economic benefits of sequencing the cotton genomes will require investments of at least the same order of magnitude as those made in Arabidopsis. Had Arabidopsis not gone first, the cost of cotton functional genomics would be much higher. Much of the required investment will need to come from the private sector, but few single enterprises have the critical mass of knowledge, skills, and resources needed to accomplish such innovation alone. Cotton is an attractive target for public-private partnership to develop enabling tools that will nurture rapid accumulation of the fundamental information necessary to empower development and commercialization of products and applications across the value chain.

References

Chee P, Lubbers E, May O, Gannaway J, Paterson AH (2004) Changes in genetic diversity of the U.S. Upland cotton. Beltwide Cotton Conference. National Cotton Council, San Antonio
Chen ZJ et al (2007) Toward sequencing cotton (Gossypium) genomes. Plant Physiol 145:1303–1310
Gur A, Zamir D (2004) Unused natural variation can lift yield barriers in plant breeding. PLoS Biology 2:1610–1615
Helms AB (2000) Yield study report. In: Dugger P, Richter D (eds) Proc Beltwide Cotton Prod Conf. Natl. Cotton Council, San Antonio
Holt G, Simonton J, Beruvides M, Canto AM (2003) Engineering economic analysis of a cotton by-product fuel pellet operation. J Cotton Sci 7:205–216
Hovav R, Udall JA, Hovav E, Rapp RA, Flagel L, Wendel JF (2008) Gene expression during cellular differentiation of the single-celled cotton trichome (fiber). Planta 227:319–329
Jiang CX, Wright RJ, El-Zik KM, Paterson AH (1998) Polyploid formation created unique avenues for response to selection in Gossypium (cotton). Proc Natl Acad Sci USA 95:4419–4424
Kim JK, Triplett BA (2001) Cotton fiber growth in planta and in vitro. Models for plant cell elongation and cell wall biogenesis. Plant Physiol 127:1361–1366
Kohel RJ, Richmond TR, Lewis CF (1974) Genetics of flowering response in cotton. VI. Flowering behavior of Gossypium hirsutum L. and G. barbadense L. hybrids. Crop Sci 14:696–699
Lubbers E, Chee P, Gannaway J, Wright R, El-Zik K, Paterson AH (2004) Levels and patterns of genetic diversity in upland cotton. Plant and Animal Genome XII Conference, San Diego
May OL, Bowman DT, Calhoun DS (1995) Genetic diversity of U.S. upland cotton cultivars released between 1980 and 1990. Crop Sci 35:1570–1574
Ming R et al (2008) The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature 452:991–997
Moulherat C, Tengberg M, Haquet J-F, Mille B (2002) First evidence of cotton at Neolithic Mehrgarh, Pakistan: analysis of mineralized fibres from a copper bead. J Archaeol Sci 29:1393–1401
National Research Council (2000) Biobased industrial products: priorities for research and commercialization
Paterson AH (2007) Sequencing the cotton genomes. World Cotton Research Conference. International Cotton Advisory Committee, Lubbock
Rong J-K et al (2004) A 3347-locus genetic recombination map of sequence-tagged sites reveals features of genome organization, transmission and evolution of cotton (Gossypium). Genetics 166:389–417
Rong J, Bowers JE, Schulze SR, Waghmare VN, Rogers CJ, Pierce GJ, Zhang H, Estill JC, Paterson AH (2005) Comparative genomics of Gossypium and Arabidopsis: unraveling the consequences of both ancient and recent polyploidy. Genome Res 15:1198–1210
Rong J-K, Feltus FA, Waghmare VN, Pierce GJ, Chee PW, Draye X, Saranga Y, Wright RJ, Wilkins TA, May OL, Smith CW, Gannaway JR, Wendel JF, Paterson AH (2007) Meta-analysis of polyploid cotton QTLs shows unequal contributions of subgenomes to a complex network of genes and gene clusters implicated in lint fiber development. Genetics 176:2577–2588
Senchina DS, Alvarez I, Cronn RC, Liu B, Rong JK, Noyes RD, Paterson AH, Wing RA, Wilkins TA, Wendel JF (2003) Rate variation among nuclear genes and the age of polyploidy in Gossypium. Mol Biol Evol 20:633–643
Stephens SG, Moseley ME (1974) Early domesticated cottons from archaeological sites in central coastal Peru. Am Antiquity 39:109–122
Udall JA et al (2006) A global assembly of cotton ESTs. Genome Res 16:441–450
UNESCO (2002) Vital water graphics, water use and management. United Nations Education Scientific and Cultural Organization, Paris
Wendel JF (1989) New world tetraploid cottons contain old-world cytoplasm. Proc Natl Acad Sci USA 86:4132–4136

Author information

Authors and Affiliations

Plant Genome Mapping Laboratory, University of Georgia, 111 Riverbend Road Rm 228, Athens, GA, 30602, USA
Andrew H. Paterson, Jun-kang Rong & Alan R. Gingle
Coastal Plain Experiment Station, University of Georgia, Tifton, GA, 31794, USA
Peng W. Chee
CSIRO Plant Ind, Canberra, ACT 2601, Australia
Elizabeth S. Dennis & Danny Llewellyn
Department of Biochemistry, University of Georgia, Athens, GA, 30602, USA
Leon S. Dure III
Departments of Crop Science and Botany, North Carolina State University, Raleigh, NC, 27695, USA
Candace Haigler
LSU AgCenter, Louisiana State University, Baton Rouge, LA, 70803, USA
Gerald O. Myers
Mississippi Genome Exploration Laboratory, Mississippi State University, Mississippi State, MS, 39762, USA
Daniel G. Peterson
Plant Genomics & Molecular Breeding Labs, National Institute for Biotechnology & Genetic Engineering, Faisalabad, Pakistan
Mehboob ur Rahman & Yusuf Zafar
Department of Biology, West Virginia State University, Institute, WV, 25112, USA
Umesh Reddy
The Robert H. Smith Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
Yehoshua Saranga
Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, 72701, USA
James M. Stewart
Department of Plant & Wildlife Sciences, Brigham Young University, Provo, UT, 84602, USA
Joshua A. Udall
Central Institute for Cotton Research, Nagpur, Maharashtra, 440010, India
Vijay N. Waghmare
Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
Jonathan F. Wendel
Department of Plant and Soil Science, Texas Tech University, Lubbock, TX, 79409, USA
Thea A. Wilkins & Robert J. Wright
Nucleic Acids Research Department, Genetic Engineering & Biotechnology Research Institute, Borg El Arab, Post Code 21934, Alexandria, Egypt
Essam Zaki
Mubarak City for Scientific Research and Technology Applications, New Borg El Arab City, 21934, Alexandria, Egypt
Elsayed E. Hafez
Institute of Bioinformatics, Zhejiang University, Hangzhou, People's Republic of China
Jun Zhu

Corresponding author

Correspondence to Andrew H. Paterson.

Additional information

Communicated by: Paul Moore

About this article

Cite this article

Paterson, A.H., Rong, Jk., Gingle, A.R. et al. Sequencing and Utilization of the Gossypium Genomes. Tropical Plant Biol. 3, 71–74 (2010). https://doi.org/10.1007/s12042-010-9051-4

Received: 23 February 2010
Accepted: 15 March 2010
Published: 14 April 2010
Issue Date: June 2010
DOI: https://doi.org/10.1007/s12042-010-9051-4
