Rice materials and field planting
Rice cultivars Nipponbare, Kasalath, SSSL14, and 101 accessions in the core collection of rice were cultivated in a paddy field in Nanjing (118.6° E, 32.1° N). Thirty-five wild rice accessions were cultivated at the National Germplasm Guangzhou Wild Rice Nursery in Guangzhou (113.2° E, 23.1° N). The information of these accessions, including the name, latitude, and longitude, is listed in Supplementary Table S1. Rice cultivar SSSL14 is a single-segment substitution line with the genome background of Nipponbare and contains only one donor fragment from Kasalath in the middle of the short arm of chromosome 3. The transgenic plants were grown in the conditions prevalent at Nanjing and Hainan (110.0 °E, 18.5 °N). On the field, the rows were spaced 20 cm apart, and each plant was spaced at a distance of 17 cm according to standard agronomic management practices.
Trait measurement
At the heading stage, 10 target spikelets were collected from five plants of each accession prior to glume opening (at approximately 10:00 a.m.). For each spikelet, the STL, SYL, and TSSL of the fertile floret were measured using a stereo microscope (10 ×, MC50, Guangzhou, China), and the average values (in mm) of the 10 florets were taken for the accessions.
Sample preparation and observation under light microscopy
In order to investigate the cytologic reasons that affect the SYL, we conducted sheet pressing analysis using the styles of Nipponbare, SSSL14, Kasalath, transgenic complementation plants, and overexpression plants. At anthesis, the mature pistils of Nipponbare, SSSL14, Kasalath, and transgenic positive plants (including the complementation and overexpression plants) were observed under a light microscope (Olympus FV1000, Japan). The pistils were then sampled and fixed with 70% (v/v) alcohol, 38% (w/v) formaldehyde, and glacial acetic acid (90:5:5, by volume), washed with tap water three times, and placed in 5 μl of 6% NaOH between a cover glass and a microscope slide and a pressed plate. Longitudinal sections were then observed under a light microscope.
Application of exogenous phytohormones on the top second leaf blades of main stems
During the third or fourth stage of young panicle differentiation (according to the criteria reported by Itoh et al. (2005)), the second leaf blades from the top of the main stem of different Nipponbare and SSSL14 plants were smeared with 10 μM gibberellic acid (GA3; Shanghai Ryon Biotechnology Co., Ltd, Shanghai, China), 20 μM brassinosteroid (BR; Sigma-Aldrich Trading Company Ltd, Shanghai, China), and 1 μM indole-3-acetic acid (IAA; Beijing Solarbio Science & Technology Co., Ltd, Beijing, China) using a hygroscopic cotton ball. Control plants were treated with absorbent cottons containing the same volume of distilled water. The smear concentrations of GA3, BR, and IAA were determined according to a previous report (Zhao et al. 2010). Finally, we sampled the spikelets to measure the STL, SYL, and TSSL at the flowering stage.
Endogenous phytohormone content analysis using high performance liquid chromatography-tandem mass spectrometry (HPLC–MS/MS)
The content of endogenous plant hormones GA1, GA4, BR, and IAA in mature pistils at stage 8 of young panicle differentiation in the Nipponbare and SSSL14 plants was detected utilising the methods reported by Durgbanshi et al. (2005), Forcat et al. (2008) and Zhong et al. (2013).
The phytohormone content of GA1, GA4, and IAA was measured using HPLC (1260, Agilent technologies, USA) and MS (6420A, Agilent technologies, USA). The chromatographic conditions were: Poroshell 120-SB-C18 reversed phase column (150 × 2.1 mm, 2.7 μm); column temperature: 30℃; sample size: 2 μl; mobile phase: A:B = (methanol/0.1% formic acid): (water/0.1% formic acid); flow velocity: 0.3 ml/min; and gradient elution mode: 0–1 min, 20% A; 1–3 min, 20–50% A; 3–9 min, 50–80% A; 9–10.5 min, 80% A; 10.5–10.6 min, 80–20% A; 10.6–13.6 min, 20% A. The mass spectrometry conditions were: ESI− negative ion mode; scan type: multiple-reaction monitoring; air curtain: 15 psi; spray voltage: − 4000 v; atomising pressure: 65 psi; auxiliary pressure: 70 psi; and atomisation temperature: 400 °C.
The BR content was measured using HPLC (1290, Agilent technologies, USA) and tandem MS (SCIEX-6500 Qtrap, Allen-Bradley, USA). The chromatographic conditions were: Poroshell 120-SB-C18 reversed phase column (150 × 2.1 mm, 2.7 μm); column temperature: 35 °C; sample size: 2 μl; mobile phase: A:B = (methanol): (water/0.1% ammonia); flow velocity: 0.35 ml/min; and gradient elution mode: 0–2 min, 80% A; 2–3.5 min, 80–95% A; 3.5–6 min, 95% A; 6–6.1 min, 95–80% A; 6.1–10 min, 80% A. The mass spectrometry conditions were: ESI− positive ion mode; scan type: multiple-reaction monitoring; air curtain: 15 psi; spray voltage: + 4500 v; atomising pressure: 65 psi; auxiliary pressure: 70 psi; and atomisation temperature: 350 ℃.
For the calculation of hormone content in the samples, we used the following formula: hormone content in the sample (ng/g fresh weight) = detection concentration (ng/ml) × volume coefficient (ml)/mass coefficient (g), where the volume coefficient is the volume of the solution used in the final dissolution of the sample and the mass coefficient is the mass of the sample.
Plasmid construction and plant transformation
To produce the complementation construct pCAMIA1300-SYL3, a 4844-kb SYL3 genomic DNA fragment including the 2221-kb upstream sequence, full-length 2223-kb SYL3 sequence, and 400-kb downstream sequence was amplified from Kasalath and cloned into the plant binary vector pCAMBIA1300 using In-Fusion™ Advantage PCR Cloning Kits (Takara, Japan). To generate the overexpression constructs, the full-length coding sequence (CDS) of SYL3 was amplified from Kasalath and cloned into the plant binary vectors pBWA(V)HS with a 35S promoter. A 2321-kb DNA fragment upstream of the SYL3 start codon was amplified from Kasalath and cloned into the pBWD(LB)1C–GUS plus vector to generate the plasmid PROSYL3:GUS.
We used the site-directed mutation method to construct two single nucleotide polymorphism (SNP) constructs (H2 and H3, the allele-specific transformation vectors). Firstly, the SYL3-n CDS was used as the template to be modified by site-directed mutagenesis based on the primers containing the mutated nucleotides. For the H2 vector, the base G in position S1 was replaced by the base A. For the H3 vector, the base G in position S2 was replaced by the base A. The PCR products were cloned into the plasmid pBWA(V)BS using the homologous recombination method. The genotype of SSSL14 is H1, and the genotype of Nipponbare is H4. All the site-directed mutagenesis transgenes were driven by the SYL3-n promoter. The primer sequences used for the vector construct are listed in Supplementary Table S2.
We verified all the resultant constructs by sequencing, and then, the binary vectors were electroporated into the Agrobacterium tumefaciens strain EHA105 and transformed into Nipponbare (e.g. empty vector, complementation vector, pSYL3-k::GUS, H2, and H3), SSSL14 (e.g. empty vector and overexpression vector), and 7001S (e.g. complementation vector) using the method described by Hiei et al. (1994). Positive pSYL3-k::GUS transgenic plants were selected based on the antibiotic resistance, and GUS histochemical staining analysis was conducted using the method described by Jefferson et al. (1987). Images were captured using a stereo light microscope (Leica DFC 420, Leica Microsystems, Germany).
Subcellular localisation
The SYL3-k-RFP in-frame fusion protein construct and the nuclear marker NLS-mCherry construct were co-expressed transiently in rice leaf protoplasts using the polyethylene glycol method (Chiu et al. 1996; Chen et al. 2006). The corresponding amino acid sequence of the NLS protein was MDPKKKRKV. We observed the fluorescence of the samples above at an excitation wavelength of 588 nm and an emitting wavelength of 635 nm under a confocal laser-scanning microscope (Olympus FV1000). The primer sequences are listed in Supplementary Table S2.
Quantitative RT-PCR analysis
Using the RNApure Plant Kit (Beijing CoWin Biotech Co. Ltd, Beijing, China), total RNA was extracted from roots, culms, leaf blades, leaf sheaths, young panicle from stage 3–8, and young pistils at stage 8 of inflorescence developmental course (Ikeda et al. 2004; Itoh et al 2005) sampled from Nipponbare and SSSL14. RNase-free DNase I was used to remove the genomic DNA in the sample. cDNA at a volume of 20 μl was synthesised using 1 μg RNA using the HiScript II 1st Strand cDNA synthesis Kit (Vazyme biotech co., Ltd, Nanjing, China). Quantitative RT-PCR (20 μl reaction volume) was carried out using 0.4 μl of cDNA, 0.4 μM of each gene-specific primer, and the AceQTM qPCR Kit (Vazyme) in the Roche Applied Science LightCycler™ 480 (Roche diagnostics Ltd., Germany). The ubiquitin gene was used as the internal controls. The primers used for qRT-PCR are listed in Supplementary Table S2. The corresponding PCR programme was pre-denaturation at 95 °C for 5 min, denaturation at 94 °C for 10 s, annealing at 60 °C for 30 s, and a total of 40 cycles. The relative expression of the target gene was calculated using the following formula: Exp = 2−ΔCt, where ΔCt = Cttarget gene − Ctinternal control (Livak and Schmittgen 2001).
Pollen fertility observation
We stained the mature pollen grains of variety 9311, 7001SSYL3−n, and 7001SSYL3−k using a 1% I2-KI water solution and examined them under a light microscope (Olympus BH-2) under 100× magnification.
Phylogenetic tree construction
The amino acid sequences of the conserved parts of the MADS-box proteins in Arabidopsis and rice were obtained from the studies conducted by Alvarez-Buylla et al. (2000) and Parenicová et al. (2003) and the Plant Transcription Factor Database (http://plntfdb.bio.uni-potsdam.de/v3.0/). The amino acid sequences of the conserved parts of the MADS-box proteins in wheat and barley were obtained from http://www.ncbi.nlm.nih.gov/. Then, sequence alignment and construction of the neighbour-joining tree were performed using the MEGA5 software (Tamura et al. 2011) based on the maximum likelihood method.
F1 hybrid seed production potential evaluation for the SYL3 alleles in the paddy field
In order to evaluate the potential of the SYL3 gene in the production of F1 hybrid rice seeds, we conducted an actual F1 seed production experiment using isogenic lines 7001SSYL3−n (short SYL) and 7001SSYL3−k (long SYL) as the female parents and variety 9311 as the common male parent in the paddy fields of Jiangpu Experimental station, Nanjing. 7001SSYL3−n is a long day-sensitive male sterile line used in commercial F1 seed production in Eastern China, and 7001SSYL3−k is an isogenic line obtained in this study. The male and female parents were grown in a ratio of 2:6:2, i.e. 4 lines of 9311 plants were planted around 6 lines of female plants. For each combination, 30 m2 of land was utilised for planting. Before artificial pollination at the flowering time, we observed the fertility of the pollen of 7001SSYL3−n and 7001SSYL3−k under a light microscope to ensure that the pollen of the female parents was sterile. We performed artificial supplementary pollination twice per day during pollen dispersal. Thirty days after artificial supplementary pollination, we harvested the seeds from the female plants individually. The potential of the SYL3 allele for hybrid rice seed production was evaluated according to the weight of rice grains harvested from the male sterile plants per 1.5 m2 of area.
Geographical distribution and genetic diversity analyses of the qSYL3 alleles
In order to analyse the molecular variation in the SYL3 alleles, we collected 136 rice accession resources and designed primers to amplify the SYL3 coding regions of these varieties (Supplementary Table S1). The sequencing was completed at Nanjing TsingKe biological technology Co., Ltd (Nanjing, China). The genetic pedigree diagram of the SYL3 alleles in cultivated and wild rice was developed using the median-joining model of NETWORK version 5.0 (Bandelt et al. 1999). After sequencing, the nucleotide diversity of the cultivated rice groups (indica, japonica, and javanica) and wild rice group in 20 genes around SYL3 (Supplementary Table S3) was analysed using the DnaSP 5.0 software (Librado and Rozas 2009).
Coalescent simulations
Coalescent simulation was used to model the bottleneck impact on sequence diversity using Hudson’s ms programme (Hudson 2002). We modelled the divergence of the cultivated rice (O. sativa) population and wild rice (O. rufipogon and O. nivara), with a population bottleneck in the cultivated rice. We used the bottleneck model reported by Zhu et al. (2007) in this study (Supplementary Figure S1). We combined O. rufipogon and O. nivara as a single population as the progenitor of O. sativa following the report by Londo et al. (2006). Based on the bottleneck model, we assumed that a single ancestral population of size Na experienced an instantaneous size shift to a bottlenecked population of size Nb at time t2 generation ago and the bottleneck population expanded instantaneously to the present population of size Np at time t1 generation ago. The parameter d, the duration of the bottleneck, and the parameter Nb were used for the bottleneck. The parameter K, which is the ratio of Nb and d, was used to describe the severity of the bottleneck in domestication (Wright and Gaut 2005).
The parameter d (200, 500, 1000, 1500, 2000, and 3000) was set as per the reports by Zhu et al. (2007) and Asano et al. (2011). We first set Na to be 120,000 to perform the simulation. A grid of 20 K values is listed in Supplementary Table S4, and the corresponding Nb, which is K × d, is also shown. We simulated the average number of SNP patterns in a 100-kb region corresponding with the region from genes 6–17 listed in Supplementary Table S3.