Main text

Cotton, the most important cash crop for the textile industry, is also an excellent model system for studying polyploidization, cell elongation, and cell wall biosynthesis (Yang et al. 2017; Liu et al. 2018; Hu et al. 2019; Li et al. 2019a; Li et al. 2019b; Zhang et al. 2019). The small structural variations including SNPs (single nucleotide polymorphism) and InDels (insertion-deletion) have been revealed in the cotton genome by different groups (Fang et al., 2017; Wang et al. 2017; Du et al. 2018; Ma et al. 2018), which have accelerated the rate of unraveling the genetic basis for complex agricultural traits. However, the large-scale variations (inversions and translocations) and their genetic effects remain unclear due to the limitation of genome quality and identification of recombination breakpoints. Recently, Li’s team from the Institute of Cotton Research, CAAS, successfully identified large-scale inversions on the chromosome At08 in upland cotton, and uncovered their genetic effects on cotton population differentiation (Yang et al. 2019). This work represents new progress and advances our understanding of the mechanism for upland cotton divergence after the completion of the cotton genome and cotton variant genomes (Wang et al. 2012; Li et al. 2014; Li et al. 2015; Du et al. 2018; Ma et al. 2018).

Although multiple cotton genome assemblies are currently available (Paterson et al. 2012; Wang et al. 2012; Li et al. 2014; Li et al. 2015; Zhang et al. 2015; Du et al. 2018; Hu et al. 2019; Wang et al. 2019), the large-scale variations between diverse species and their roles in cotton divergence remain unknown. Comparisons among TM-1, Zhongmiansuo24 (ZM24, an important cultivar released in 1995), and the genomes of the diploid ancestors revealed that the variations between the interspecies are more abundant than those between the intraspecies. It is important that the finding of large-scale inversions on the At08 chromosome of upland cotton deepened our traditional understanding that upland cotton possessed a narrow genetic diversity. They found that the inversions could be used to classify a core collection of upland cotton into two groups, which was consistent with classification by a phylogenetic tree and principal component analysis. Using the artificial population, they found that meiotic recombination was suppressed in the inverted region, which was also validated in the natural cotton population. Further analysis showed that the inversion resulted in a decrease of haplotype and genetic diversity and led to cotton population differentiation. These indicate that there is a need to re-evaluate our understanding about how upland cotton divergence has occurred and how this process has specifically influenced extant nucleotide and trait diversity present in upland cotton populations.

Besides, genotype limitation is the most substantial challenge for cotton transformation. Unlike ZM24, TM-1 cannot produce regenerated seedlings through tissue culture to date. The upregulated expression of genes associated with auxin accumulation and transport is beneficial to the cotton transformation efficiency. Unlike auxin, the diminished active GA content in ZM24 calli, rather than as that in difficult to transform TM-1 cultivar, may promote ZM24 to be transformed. These will help guide future studies in improving the efficiency of cotton genetic transformation.

The study also provides a high-quality TM-1 reference genome for upland cotton to date. Meanwhile, a newly published reference genome of ZM24 has been shown to be an effective complement to the TM-1 reference genome. This dataset will benefit the cotton research community and provide better guidance for cotton breeding and functional genomic research.