Introduction

Rice (Oryza sativa L.) is the most important cereal crop in the world. Due to a lack of manpower and higher wages, rice growers turn to the direct seeding method (Angaji et al. 2010). Direct-seeded rice is a common production method in southern Louisiana and areas in Texas and California State, USA (Hardke and Scott 2013). In the same time, rice plants suffer from submergence (flooding) and poor seedling establishment.

Flooding is one of the serious problems which affect rice production in South and Southeast Asia, where the majority of the world’s rice is grown, about 20 million hectares of rice land is prone to flooding. Flooding creates hypoxic or anoxic condition resulting in poor germination and seedling establishment, even in some cases leads to plant death within few days of full submergence (IRRI 2016; Singh et al. 2017). There are different categories of flooding; we are interested in submergence during germination also known as anaerobic germination. On this condition, rapid seedling elongation can provide successful establishment, and escape from submergence stress, hence provides required oxygen for normal growth.

For successful establishment and escape from submergence stress, priming technique is involved to enhance the start of germination processes (Silva and Silva 2016). Doley et al. (2018) studied priming effect on 243 rice genotypes for anaerobic germination under 10 cm of flooding. They found that priming rice seeds for 24 h with different solutions enhanced anaerobic germination under flooding compared to control. In addition, priming three rice cultivars for 48 h was the best seed invigoration treatment under well watered condition (Mulbah and Adjetey 2018). Furthermore, Sarkar 2012 studied two near isogenic lines under flooding and non-flooding conditions. His result revealed that seed priming improved the seedling establishment under anaerobic conditions. Recently, it was observed that rice seed priming followed by sun drying can improve anaerobic germination (Senapati et al., 2019).

Angaji et al. (2010) identified a few tolerant genotypes of over 8000 genotypes screened for the tolerance of flooding during germination. Under submergence, successful rice coleoptile elongation depends on hydrolases induction to mobilize endosperm; α-amylases play a central role in this process. Gibberellic acid (GA3) is an important hormone induces α-amylases expression resulting in germination and seedling growth in rice under anaerobic conditions (Lee et al. 2014). Kaneko et al. (2002) also found that active GA3 is important for α-amylases expression in rice endosperm. Moreover, in barly the expression of the α-amylase gene is up-regulated by exogenous GA3 (Gubler et al. 2002). Rice cultivars have different sensitivity to exogenous gibberellin concentrations via seed treatment, reflecting upon seedling performance (Guadagnin et al. 2017). Likewise, a study on rice showed that the most effective concentration was 2000 ppm GA3, which enhances seedlings length of BW196 (Mutinda et al. 2017).

Mining favorable alleles for coleoptile length (CL), coleoptile length gibberellic acid sensitivity (CLGS) and its gibberellic acid sensitivity index (GSI) for water direct-seeded rice would provide breeders to improve traits. In 2004, Jiang et al. (2004) detected five QTLs for anoxia germinability from 81 RILs with phenotypic variation ranged from 10.5 to 19.6% on chromosomes 1, 2, 5 and 7, respectively. Furthermore, they detected three pairs of epistasis loci located on chromosomes 2, 3, 5 and 11 with significant effects ranging from 16.7 to 48.8%. Five putative QTLs controlling flooding tolerance during germination in rice were detected on chromosomes 1, 3, 7 and 9, explaining 17.9–33.5% of the phenotypic variation (Angaji et al. 2010). Septiningsih et al. (2013) identified six QTLs of mapping 175 F2:3 families genotypes, using 118 SSR markers, on chromosomes 2, 5, 6 and 7 associated with a survival rate of seedling under 10 cm depth of water. Baltzar et al. (2014 detected two major QTLs associated with the survival rate of seedling while analysis 300 lines F2:3 derived from the cross of IR64 and the aus landrace Nanhi. One QTL derived from Nanhi detected on chromosome 7 explained 22.3% phenotypic variance, while the other one was detected on chromosome 2 from IR64 with increased effect. Recently, three QTLs associated with anaerobic germination detected by analysis 285 F2:3 genotypes derived from a cross between Tai Nguyen and Anda using 6 K SNP chip (Kim and Reinke 2018). Two QTLs were detected on chromosome 1 and one QTL on chromosome 8 with variance explained percentage ranged from 5.49 to 14.14%. Taking all together, the QTLs reported up to now for anoxic (flooding) conditions are 4, 4, 2, 3, 1, 6, 1, 1, and 1 on chromosomes 1, 2, 3, 5, 6, 7, 8, 9 and 11 respectively.

In our study, two points are new compared with previous research. One is the QTL detection method (we use GWAS for this trait), the other is gibberellin-treated seeds and germinated under 10 cm depth of water. It is the first report mined favorable alleles of coleoptile elongation and its sensitivity to gibberellic acid for water direct-seeded rice by association mapping using 262 SSR markers from the natural population The aims of the present study were to (1) investigate the phenotypic variation of CL, CLGS and GSI under anoxic condition; (2) identify QTLs and mine the favorable alleles for CL, CLGS and GSI by genome-wide association mapping; (3) predict parental combinations for improve CLGS with high GSI according to superior accessions screened in this study.

Materials and Methods

Plant Materials

The seeds of the 358 rice genotypes were collected, stored, and supplied by State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China (Supplementary Table 1).

Field Planting

All the seeds of the tested materials were sowed in the seedling nursery of paddy field in Jiangpu Experimental Station, Nanjing Agricultural University, in mid-May and transplanted in mid-June in 2017. The experiment was evaluated in a randomized block design with three replications. All the recommended package of practices was followed. In 2018, the dates of sowing and transplanting and field managements were equivalent to 2017. The purpose of field planting was to harvest fresh seeds for germination experiments.

Evaluation of Coleoptile Elongation Length and Its Sensitivity to Gibberellic Acid Under 10 cm Depth of Water Condition

Fifty seeds of each accession were used for each treatment (0 ppm-GA3 and 2000 ppm-GA3). Under the control treatment, the seeds were soaked in distilled water for 24 h; while under GA3 treatment, the seeds were soaked in GA3 solution (2000 ppm) for 24 h. Thirty uniformed soaked seed were visually selected out of the 50 and transferred to a paper towel, lined up on 3 cm from the lower edge, covered with two layers of moist filter paper and rolled the paper up, sailed with a rubber band and placed vertically in plastic box (44 cm × 31 cm × 15 cm) and submerged under 10 cm depth water. The plastic boxes were put under the natural conditions for 13 days to allow the seeds germinate and grow (Supplementary Fig. 1). On the fourteenth day, the coleoptile elongation lengths of 10 seedlings in each replicate of each treatment in each accession were measured with a ruler, and recorded as CL (cm) for the distilled water treatment and CLGS (cm) for the GA3 treatment. Coleoptile elongation length sensitivity of an accession to GA3 was designated as gibberellin sensitivity index (GSI) and was determined using the following formulas:

$$GSI\left( {cm/cm} \right) = \frac{CLGS - CL}{{CL}}$$

where CLGS is coleoptile elongation length (cm) under GA3 treatment, and CL is coleoptile elongation length (cm) under distilled water treatment (control).

Phenotypic Data Analysis

The mean, standard deviation, maximum, minimum and coefficient of variation for the CL and CLGS trait were calculated by using XLSTAT: Statistical software for Excel (Version 20.6.5) available from https://www.xlstat.com/en/. Microsoft Excel software 2016 was used to compute the broad-sense heritability using the following formula (Wang et al. 2007):

$$H_{B}^{2} = \delta_{g}^{2} /\left( {\delta_{g}^{2} + \delta_{e}^{2} /n} \right)$$

where \({\delta }_{g}^{2}\) is genetic variance,\({\delta }_{e}^{2}\) is error variance, and n is a number of replicates.

The correlation coefficient was calculated between each of CL, CLGS and GSI by using SPSS statistics 19 (Weaver and Wuensch 2013).

SSR Marker Genotyping

Based on the existing data published on rice molecular mapping, as well as microsatellite data (Temnykh et al. 2000; McCouch et al. 2002; Varshney et al. 2005), 262 pairs of SSR primers distributed on the 12 chromosomes of rice were utilized in genotyping. Leaf blade tissue of a single individual plant in each accession was used to extract genomic DNA using the method described by Dang et al (2019). DNA amplification primers were synthesized by Shanghai Generay Biotech Co. Ltd., Shanghai, China. Every 10 μl PCR mixture contained 1 µl genomic DNA, 0.7 µl of the forward primer and the same amount (0.7 µl) of reverse primer, 10 × Buffer (free MgCl2) 1 µl, dNTPs 0.2 µl, 0.1 µl of Taq polymerase, 0.6 µl MgCl2, and 5.7 µl ddH2O. PCR amplification was performed on a Peltier Thermal Cycler (PTC-100™, MJ Research™ Incorporated, USA) under denaturation of 94℃ for 5 min; 34 cycles of denaturation at 94℃ for 30 s, annealing at 55 ~ 61℃ (depending on the primer used) for 1 min, with extension at 72℃ for 1 min, and, finally, an extension at 72℃ for 10 min. Visualization of the resultant PCR products was done on an 8% polyacrylamide gel run for 1 h at 150 V and observed through silver staining.

Population Genetic Structure Analysis

Using STRUCTURE version 2.2 (Falush et al. 2007) the genetic clusters in the 358 accessions were identified. A mean log-likelihood value over five runs set each K (K from 2 to 10) with random starting points. The length of the burn-in period was set to 50,000 iterations and defined a run of 100,000 Markov Chain Monte Carlo (MCMC) replicates after burn-in was used. If the mean log-likelihood value was positively correlated with the model parameter K; a suitable value for K could not be determined. In this situation, the optimal K value was determined through an ad hoc statistic (∆K) based on the rate of change in [LnP (D)] between successive K values (Evanno et al. 2005). Non- admixed individuals in each genetic group were determined using a Q-matrix assignment greater than 0.9. Power Marker version 3.25 (Liu and Muse 2005) was used to determine the number of alleles per locus, major allele frequency, genetic diversity per locus, and polymorphism information content (PIC) values. The genetic distance was calculated based on 262 molecular markers using Nei’s distance (Nei et al. 1982) and phylogenetic reconstruction was performed using a neighbor-joining method as implemented in Power Marker with the tree viewed using MEGA 4.0 (Tamura et al. 2007). Locus-by-locus analysis of molecular variance (AMOVA) (Weir and Cockerham 1984) based on genetic groups delimited by the Bayesian clustering method in the program Arlequin 3.5 (Excoffier and Lischer 2010) was performed to statistically verify the geographical structure using SSR and standard multi-locus frequency data. The genetic differentiation coefficient or fixation index (Fst) between subpopulations was calculated using the method proposed by Weir and Hill (2002). The calculation process was performed in Arlequin 3.5 software.

Linkage Disequilibrium Analysis

To evaluate the linkage disequilibrium (LD) level, TASSEL 2.1 (Bradbury et al. 2007) software was used in which each pair of SSR loci was evaluated, in all rice accessions and clusters arising from STRUCTURE analysis. The D' value was used to measure the degree of LD between sites (non-alleles). The formula for calculating the D' value is given as (Hedrick 1987):

$$D^{\prime} = \mathop \sum \limits_{i - 1}^{u} \mathop \sum \limits_{j = 1}^{v} p_{i} q_{j} \left| {D^{\prime}_{ij} } \right|$$

where u and v represent the number of alleles of the two loci, pi and qj the frequency of the i-th allele at position A and the frequency of the j-th allele at position B, respectively.

\(\left|{D}_{ij}^{^{\prime}}\right|\) is the absolute value of Lewontin’s (1964) normalized measure \({D}_{ij}^{^{\prime}}\)

$$D^{\prime}_{ij} = \frac{{D_{ij} }}{{D_{ij}^{max} }}$$

where \({D}_{ij}^{max}\) is the maximum amount of disequilibrium possible between the i-th allele at locus A and the j-th allele at locus B.

Genome Wide Association Mapping

Genome wide association mapping using General Linear Model (GLM, Q) and Mixed Linear Model (MLM, Q+K) was performed using TASSEL 3.0 to calculate the associations between the target trait and markers (Bradbury et al. 2007). The Q matrix was obtained from the analysis results of Structure 2.2, and genetic relatedness (K) matrix was obtained by the software TASSEL 3.0. A false discovery rate (FDR) of 0.001 was used as a threshold for multiple testing according to the correction method published by Benjamini and Hochberg (1995). In this study, marker loci with phenotypic variation explained (PVE) > 7% were considered for further analysis. The phenotypic effect values of the alleles amplified were calculated based on the null allele (not amplified) method described by Breseghello and Sorrells (2006).

Results

Phenotypic Variations of CL, CLGS and GSI

The phenotypic data of the CL and CLGS followed a normal distribution as showed in Fig. 1, which is also confirmed by Kurtosis and Skewness values for both years (Table 1). The mean value for CL over 358 accessions was 2.59 cm with a range from 0.82 to 3.82 cm in 2017. The coefficient of variance was 20.62% and broad sense heritability was 98.50%. In 2018, the results for CL were similar to those of the previous year (Table 1). On the other hand, the mean value for CLGS was 3.04 cm with a range from 1.25 to 4.76 cm in 2017. The coefficient of variation was 19.61% with H2b of 95.92%. Also, the results for CLGS obtained in 2018 were similar to those of the previous year (Table 1). The broad-sense heritability for CL and CLGS was higher than 90% in both years, indicating that the phenotypic variations of the two traits were mainly controlled by genetic factors.

Fig. 1
figure 1

Frequency distribution of coleoptile length (cm) in 358 accessions; a for CL and b for CLGS in cm under 10 cm depth of water

Table 1 Phenotypic characteristic for CL and CLGS

Gibberellic acid treatment increased the coleoptile elongation length by 0.45 cm/cm and 0.46 cm/cm averaged over 358 accessions in 2017 and 2018, respectively, compared with those of water treatment. The GSI ranged from 0 to 2.12 cm/cm in 2017, while the range in 2018 was from 0 to 2.39 cm/cm (Fig. 2), indicating there exist variations in coleoptile elongation length sensitivity to GA3 among the 358 genotypes used. According to the performances of both CL and GSI grown in the 10 cm depth of water, 6 accessions were considered as superior germplasms for water direct-seeded rice (Table 2). The most sensitive accession to GA3 is Gaoliangqing with GSI of 2.26 cm/cm, followed by Wuxiangjing14 (0.91 cm/cm), Changdaotou (0.87 cm/cm), Hongdao35 (0.79 cm/cm), Zhenghan2 (0.71 cm/cm) and Huajing5 (0.61 cm/cm).

Fig. 2
figure 2

Frequency distribution of gibberellic acid sensitivity index (cm/cm) of 358 accessions

Table 2 Superior accessions and their performance under 10 cm depth of water

Figure 3 shows the difference in coleoptile elongation length between 0 ppm-GA3 treatment and 2000 ppm- GA3 treatment under 10 cm depth of water in accessions Changdaotou. It can be seen from Fig. 3 that the deference between CL and CLGS are clear.

Fig. 3
figure 3

Difference in coleoptile length and its sensitivity to gibberellic acid under10 cm; a for accession Changdaotou (seeds soaked in water), b for accession Changdaotou (seeds soaked in GA solution), scale is 1 cm

The correlation coefficients between CL, CLGS and GSI are presented in Table 3. The result revealed positive and highly significant between CLGS and GSI. While the correlation coefficient between CL and GSI was negative and highly significant.

Table 3 Estimates of correlation coefficients between CL, CLGS, DCL and GSI

Genetic Diversity of the Entire Population Revealed by SSR Markers

The genetic diversity of the 358 accessions was determined using 262 SSR markers distributed on the 12 chromosomes in rice. Totally 2474 marker alleles were identified with average of 9.443 alleles per locus (ranged from 2 to 25) (Supplementary Table 2). The gene diversity value averaged over 262 loci was 0.731 with a range from 0.100 (RM7163 on chromosome 11) to 0.937 (RM7545 on chromosome10). The polymorphic information content (PIC) value averaged over 262 loci was 0.702 with a range from 0.095 (RM7163 on chromosome 11) to 0.933 (RM7545 on chromosome 10). While 33 markers showed PIC value less than 0.5, the PIC value of 92 markers were more or equal 0.8, and 137 markers were in between 0.5 and 0.8. These results indicate high genetic diversity in the population used.

Population Genetic Structure

Genetic structure analysis of the entire populations showed an increase in likelihood function LnP (K) value with the increase of subpopulations (Supplementary Fig. 2a). Supplementary Fig. 2b shows that ΔK value reached maximum at K = 6. Therefore, the entire population can be divided into 6 sub-populations. A neighbour-joining tree of the 358 accessions was constructed based on Nei’s genetic distance (Supplementary Fig. 2d), and the results were consistent with the results from the Structure analysis. Using the criterion of Q value > 0.9, each accession was sorted into the corresponding subpopulation. 325 accessions entered into 6 subpopulations (known as SP1, SP2, SP3, SP4, SP5 and SP6) (Supplementary Fig. 2), and the remaining 33 accessions entered into an admixture subpopulation. The numbers of accessions SP1, SP2, SP3, SP4, SP5 and SP6 were 52, 75, 38, 24, 70 and 66, respectively (Supplementary Table 1).

By checking the resources of the 358 accessions, it was found that the 6 subpopulations divided above had different geographic origins or ecotypes. Accessions in SP1 were all from Vietnam (Indica rice). SP2 contains accessions from middle china and a few numbers of northeast accessions (Temperate japonica). Most of the accessions in SP3 are modern cultivars bred in the north-central of Jiangsu province (Temperate japonica). SP4 has accessions from middle-east China (Temperate japonica). SP5 accessions were mainly from south Jiangsu province (Temperate japonica) and SP6 had tall, late-maturing accessions and a small number of northeast accessions in the Taihu Lake Basin (Temperate japonica), as showed in Supplementary Table 1.

The results of the analysis of molecular variance (AMOVA) indicated that 46.2% of the total genetic variation occurred between the subpopulations, whereas 53.8% occurred within the subpopulations (Table 4). These results indicate a high degree of genetic differentiation across the six subpopulations.

Table 4 Analysis of molecular variance (AMOVA) for 6 subpopulation of rice varieties

Genetic Diversity of the Six Subpopulations

The basic genetic information of each subpopulation is shown in Table 5. SP6 has the highest number of alleles per locus (4.057), the highest genetic diversity (0.524), d while SP3 has the lowest numbers of alleles per locus (2.031), the lowest genetic diversity (0.276), among the 6 subpopulations (Table 5). Compared with the entire population, the genetic parameters of each subpopulation were significantly reduced, indicating that the alleles of partial loci were fixed during the process of differentiation of each subpopulation.

Table 5 Summary statistics for each subpopulation

Pairwise Fst Values and Nei’s Genetic Distance Among the Subpopulations

The \({F}_{st}\) values, which reflected the genetic differentiation extent between two subpopulations, for the 15 pairs of subpopulations are shown below the diagonal (Table 6). The \({F}_{st}\) value between SP2 and SP5 was the lowest (0.376), while that between SP3 and SP4 was the highest (0.632). Nei’s genetic distance between SP2 and SP5 was short (0.528), while the distance between SP3 and SP4 was long (0.771) (Table 6). The results in Table 6 indicate that the pairwise \({F}_{st}\) value can reflect the genetic distance between subpopulations.

Table 6 Pairwise estimates of Fst and Nei's genetic distance among the 6 subpopulations

Ratios of Significant Linkage Disequilibrium Pairwise Loci and Decay Distances in the 6 Subpopulations

The ratio of significant linkage disequilibrium (LD) pairwise loci (P˂0.01) was the lowest (0.17%) in SP4 and was the highest (3.33%) in SP6 (Table 7). The highest mean of D' value was 0.61 (SP4) and the lowest value was 0.57 in both SP5 and SP6, suggesting that the accessions of these subpopulations have been subjected to extreme artificial selection. The decay rate of D' in each subpopulation (Supplementary Fig. 3) follows the logarithmic regression equation \(y=b\mathrm{ln}x+c\). The LD decay distance was 82.21, 98.33, 85.12, 79.46, 93.19 and 92.69 cM for subpopulations SP1, SP2, SP3, SP4, SP5 and SP6, respectively. The shortest distance was 79.46 cM in SP4, whilst SP2 has the longest distance (98.33 cM). These results indicated that the accessions in SP4 have been subjected to more recombination and the accessions in SP2 have been subjected to extreme artificial selection.

Table 7 Comparison of the D' value of LD for pairwise loci in all subpopulations

SSR Marker Loci Associated with CL, Favorable Alleles and Their Carrier Accessions

Twenty three marker loci were detected using the GLM model and two SSR loci were detected using MLM model in both years with PVE more than 7% (one SSR marker locus common between the two models). All markers were distributed on all chromosomes except chromosome 5 and chromosome 7 (Table 8). The range of PVE was from 7.19% (RM1013 on chromosome 9) to 18.22% (RM6327 on chromosome 11) in 2017 and the results were similar in 2018.

Table 8 SSR marker loci associated significantly (FDR = 0.001) with CL in 358 rice accessions

Table 9 shows the top 39 positive favorable alleles of the significant association loci with PEV more than 0.5 cm and their typical carrier materials (which carrying the desired alleles with the highest phenotypic value) for CL in both years. The PEV for those alleles ranged from 0.504 cm of RM3754-80pb (typical carrier accession Songjing12) to 1.594 cm of RM6327-215pb (typical carrier accession Wanqu429).

Table 9 Top 39 positive favorable alleles, phenotypic effect value and typical carrier materials for CL

Six marker loci showed positive average allele effect (AAE +), without negative allele effect (PVE more than 7%); RM6327 was the highest with AAE + equal to1.396 cm, followed by RM106 with AAE + 0.627 cm, RM3513 with AAE + 0.503 cm, RM1358 with AAE + 0.371 cm, RM5356 with AAE + 0.296 cm and RM128 with AAE + 0.274 cm (Table 10).

Table 10 Positive (negative) average allele effect of each locus for CL

Based on phenotypic effect value of marker-alleles which have positive effect on CL, the best parental combinations were selected from the top 20 accessions. Seven parental combinations predicted to improve CL; and the predicted phenotypic effect ranged from 0.850 cm to 0.940 cm (Table 11).

Table 11 Parental combination, coleoptile length, number of alleles and predicted phenotypic effect value (CL)

SSR Marker Loci Associated with CLGS, Favorable Alleles and Their Carrier Accessions

Twenty-one SSR loci for CLGS were detected using GLM mode and two SSR loci using MLM model in both years with PVE more than 7% (one SSR marker locus common between the two models). Overall, 22 SSR markers were distributed on all chromosomes except chromosome 12 (Table 12). The range of PVE was from 7.06% (RM3688 on chromosome 2) to 17.24% (RM3773 on chromosome 10) in 2017 and the results were similar in 2018.

Table 12 SSR marker loci associated significantly (FDR = 0.001) with CLGS

Table 13 shows the top 56 positive favorable alleles of the significant association loci with the PEV more than 0.5 cm (PVE more than 7%) and their typical carrier accessions for CLGS. The PEV for those alleles ranged from 1.087 cm for RM562-180 (typical carrier accession Xiaoqingmang) to 0.506 cm for RM283-150 (typical carrier accession Zhenghan2).

Table 13 Top 56 positively favorable alleles, phenotypic effect value and typical carrier materials for CLGS

Three markers showed positive average allele effect (AAE), without negative allele effect (PVE more than 7%); RM562 was the highest one with AAE 0.721 cm, followed by RM434 with AAE 0.538 cm and RM3453 with AAE 0.494 cm (Table 14).

Table 14 Positive (negative) average allele effect of each locus for CLGS

Comparing the association analysis for CL and CLGS, the result showed that RM3754 (chromosome 8) was co-associated with both CL and CLGS analysis with PVE 8.62% and 7.91% for CL and CLGS in 2017, respectively; and the result was similar in 2018. RM3754-80 bp shows phenotypic effect value 0.504 cm for CL and the typical carrier is Songjing12. While RM3754-100 bp shows PEV 0.550 cm for CLGS and the typical carrier is Changdaotou, and RM3754-85 bp shows PEV 0.519 cm and typical carrier Zhenghan2. In 2017, positive AAE value of RM3754 was 0.155 cm and 0.211 cm for CL and CLGS, respectively; and the result was similar in 2018.

Based on PEV of marker-alleles which have positive effects, the best parental combinations were selected from the top 20 accessions for CLGS (Table 15). Seven parental combinations were predicted to improve CLGS ranged from 0.814 to 0.922 cm. Among all, Changdaotou and Hongdao35 were selected before as superior accessions.

Table 15 Parental combination, coleoptile length, number of alleles and predicted phenotypic effect value (CLGS)

SSR Marker Loci Associated with GSI, Favorable Alleles and Their Carrier Accessions

Seventeen SSR loci for GSI were detected using GLM and MLM model in years 2017 and in 2018 with PVE more than 10%. The 17 SSR marker loci were distributed on all chromosomes except chromosome 7 and 8 (Table 16). The range of PVE was from 10.19% (RM112 on chromosome 2) to 36.69% (RM297 on chromosome 1) in 2017 and the results were similar in 2018.

Table 16 SSR marker loci associated significantly (FDR = 0.001) with GSI

Table 17 shows the top 29 positive favorable alleles of the significant association loci with PEV more than 0.1 cm/cm and their typical carrier accessions for GSI in years 2017 and in 2018. The PEV for those alleles ranged from 0.100 cm/cm for RM6869-125pb (typical carrier accession Gaoliangqing) to 0.270 cm/cm for RM6869-110pb (typical carrier accession Yangdao).

Table 17 Top 29 positive favorable alleles, phenotypic effect value and typical carrier materials for GSI

Two markers showed positive average allele effect (AAE), without negative allele effect (PVE more than 10%); RM304 was the highest one with AAE 0.212 cm/cm, followed by RM297 with AAE 0.105 cm/cm. Among 17 markers, the marker RM304 was showing the highest positively AAE (Table 18).

Table 18 Positive (negative) average allele effect of each locus for GSI

The results showed that 7 markers were associated with both GSI and CL traits. Among all positive favorable alleles, RM3513-80 bp shows phenotypic effect value 0.616 cm for CL and the typical carrier is Haidongqing, and the same marker allele shows phenotypic effect value 0.104 cm/cm for GSI and the typical carrier is Gaoliangqing.

Furthermore, we found that RM1182 was associated with both GSI and CLGS traits. Among all positive alleles, RM1182-145 bp showed phenotypic effect value 0.729 cm for CLGS and the typical carrier was Maijieqing. RM1182-150 bp showed phenotypic effect value 0.578 cm for CLGS and the typical carrier was Shuangchengnuo. While RM1182-165 bp showed phenotypic effect value 0.106 cm/cm for GSI and the typical carrier was Wuxiangjing14.

Based on PEV of marker-alleles which have positive effects on GSI, the best parental combinations were selected from the top 20 accessions for GSI Seven parental combinations were predicted to improve GSI from 0.154 to 0.160 cm/cm (Table 19).

Table 19 Parental combination, coleoptile length, number of alleles and predicted phenotypic effect value (GSI)

Comparing the parental combination accessions which selected for CL, CLGS and GSI, the accession Hongdao35 had been found to share in both CLGS and GSI parental combinations; also it was one of the superior accessions. In addition to, all parental combination accessions selected were temperate japonica; and these accessions were categorized under three subpopulations Sp2, Sp3 and SP5.

Discussion

Treating the seeds with gibberellic acid can enhance the coleoptile elongation length under submergence condition, which is considered as key of survival under anoxic conditions for water direct-seeded rice (Gubler et al. 2002; Kaneko et al. 2002; Lee et al. 2014; Mutinda et al. 2017).

In this study, there were great variations in the traits under investigation. The mean value for CL ranged from 0.82 to 3.82 cm in 2017, as well as 2018, the same result was obtained by Hsu and Tung (2015). While mean value for CLGS ranged from 1.25 to 4.76 cm 2017 and the results were similar in 2018, which is consistent with the results of others (Guadagnin et al. 2017; Mutinda et al. 2017). Furthermore, there was a positive correlation between CLGS with GSI, moreover, a wide range for GSI indicating the existence of genotype sensitivity. Hence, the broad-sense heritability was higher than 90% in both years for CL and CLGS, which means that the genetic effect is mainly controlling both CL and CLGS comparing to the environmental effect (Visscher et al. 2008).

The six superior accessions were found in this study belonging to temperate japonica. Previous studies reported that coleoptile performance of temperate japonica varieties (as sub-species of japonica varieties) was better than indica varieties under anaerobic conditions (Lasanthi-Kudahettige et al. 2007; Hsu and Tung 2015).

The AMOVA results (46.16% genetic variability among subpopulations and 53.84% within subpopulations) revealed that the rice genotypes under our study highly variable and suitable for conducting association mapping as demonstrated in previous studies (Adeyemo et al. 2005; Agrama and Eizenga 2008; Jaiswal et al. 2012; Bergamaschi and Lama 2015). These accessions probably had a complex breeding history involving intercrossing and introgression between germplasm from diverse backgrounds, overlaid with strong selection pressure for agronomic and quality characteristics (Mather et al. 2004).

In association mapping, the LD used is present in the germplasm set under study. As well, LD might not only be influenced by recombination but also by various other forces (Flint-Garcia et al. 2003). Contrasting to the previous studies, LD was decaying in our study at more than 70 cM, this can be attributed due to outcrossing and recombination events that have been used in breeding programs (Garris et al. 2003; Lu et al. 2005; Olsen et al. 2006; Dang et al. 2014).

Association mapping is a very prevalent method for the explanation of the genetic basis of complex traits in plants. Different statistical approaches had been designed to deal with the superior marker-phenotype association that could be caused by the population structure. GLM depends only on Q matrix generated during the study of population structure while MLM accounts for both population structure and the kinship. Generally GLM will detect a higher number of significant marker-trait associations than in MLM; while MLM is more accurate in claiming associations than GLM (Korte and Farlow 2013). QQ plots (GLM & MLM) were generated to demonstrate that population structure is only controlling the confounding factors that could bias the results (Wei et al 2017) as shown in Fig. 4.

Fig. 4
figure 4

QQ plot for CL, CLGS and GSI; a GLM Model and b MLM Model

For coleoptile elongation length under control treatment (CL), RM6327 on chromosome11 explained the maximum phenotypic variation, 16 accessions out of 358 (4.47%) showed an excellent alleles RM6327-215 bp, with the largest phenotypic effect values (1.609 cm in 2017 and 1.578 cm in 2018) and the typical carrier accession was Wanqu428.

Exogenous GA3 plays an important role in rice coleoptile elongation under submergence, anoxia or hypoxia (Kota- Noguchi et al. 2008). In this study, for CLGS, RM 562 on chromosome 1 explained the maximum phenotypic value, 14 accessions out of 358 (3.92%) possesses the excellent alleles RM562-180 bp; with the largest phenotypic effect (1.087 cm in both 2017 and 2018) and the typical carrier accession was Xiaoqingmang.

The differences detected between GSI and CLGS verified that they are functioned differently (Zhang et al. 2017). This result indicated that GSI explained the different genetic mechanisms of coleoptile treated with GA3 under anoxic condition. Additionally, RM297 was detected with the highest PVE (36.69% and 34.72% in 2017 and in 2018, respectively), indicated that this was chromosome segment controlling GSI and considered as a promising marker which can increase GSI. Wang et al. (2012) and Zhao et al. (2017) had been detected similar results with PVE value exceeded 20%.

Pleiotropy is the well-established phenomenon of a single gene affecting multiple traits. It has long played a central role in theoretical, experimental, and clinical research in genetics, development, molecular biology, evolution, and medicine (Paaby and Rockman 2012). Seven markers were detected in this study to have a pleiotropic effect for CL and CLGS. A similar result had been found in wheat by Chai et al. (2019); while Ookawa et al. (2010) used the pleiotropy phenomenon for improving rice lodging resistance and yield.

Taking all together, 24, 22 and 17 marker loci were associated significantly with CL, CLGS and GSI. Among all, 34 marker loci were novel in current study; the rest (20) of marker loci were detected in previous studies (Itoh et al. 2001; Kikuchi et al. 2003; Sakai et al. 2003; Toojinda et al. 2003; Huang et al. 2003; Jan et al. 2004; Ling et al. 2004; Angaji 2008; Lo et al. 2008; Narsai et al. 2009; Angaji et al. 2010; Magome et al. 2013; Septiningsih et al. 2013; Hsu and Tung 2015; Tomita and Ishii 2018).

Improving rice coleoptile length under anaerobic condition, all favorable alleles might be pyramided as much as possible into one variety. Crosses between accessions which have favorable alleles (as hybridization parents) should improve target trait. Pyramiding best favorable alleles into new cultivar might need multi round crossing (Cheng et al. 2015). The results of this study provided basic marker information and accession information for breeding cultivars suitable for anaerobic conditions (water direct-seeded rice).

In conclusion, there is a phenotypic variation for coleoptile length under control treatment (CL), coleoptile length under GA3 treatment (CLGS) and molecular marker allele diversity among 358 accessions. Twenty four markers loci significantly associated with CL and 22 markers loci associated with CLGS (PVE > 7%). Thirty nine favorable alleles for CL and 56 favorable alleles for CLGS (PEV > 0.5 cm) were detected across two years by GLM and MLM analysis models. While, 17 markers loci significantly associated with GSI (PVE > 10%), with 29 favorable alleles were detected across two years by GLM and MLM analysis models. Twelve, thirteen and twenty three typical carrier accessions for CL, CLGS and GSI, respectively, possessing the favorable alleles could be used to improve those traits under anoxic condition.