Toward integration of genomic selection with crop modelling: the development of an integrated approach to predicting rice heading dates
- 1k Downloads
It is suggested that accuracy in predicting plant phenotypes can be improved by integrating genomic prediction with crop modelling in a single hierarchical model.
Accurate prediction of phenotypes is important for plant breeding and management. Although genomic prediction/selection aims to predict phenotypes on the basis of whole-genome marker information, it is often difficult to predict phenotypes of complex traits in diverse environments, because plant phenotypes are often influenced by genotype–environment interaction. A possible remedy is to integrate genomic prediction with crop/ecophysiological modelling, which enables us to predict plant phenotypes using environmental and management information. To this end, in the present study, we developed a novel method for integrating genomic prediction with phenological modelling of Asian rice (Oryza sativa, L.), allowing the heading date of untested genotypes in untested environments to be predicted. The method simultaneously infers the phenological model parameters and whole-genome marker effects on the parameters in a Bayesian framework. By cultivating backcross inbred lines of Koshihikari × Kasalath in nine environments, we evaluated the potential of the proposed method in comparison with conventional genomic prediction, phenological modelling, and two-step methods that applied genomic prediction to phenological model parameters inferred from Nelder–Mead or Markov chain Monte Carlo algorithms. In predicting heading dates of untested lines in untested environments, the proposed and two-step methods tended to provide more accurate predictions than the conventional genomic prediction methods, particularly in environments where phenotypes from environments similar to the target environment were unavailable for training genomic prediction. The proposed method showed greater accuracy in prediction than the two-step methods in all cross-validation schemes tested, suggesting the potential of the integrated approach in the prediction of phenotypes of plants.
KeywordsRoot Mean Square Error Markov Chain Monte Carlo Genomic Prediction Markov Chain Monte Carlo Algorithm Local Outlier Factor
Backcross inbred line
Crop (DVR) model based on an MCMC algorithm
Crop (DVR) model based on a Nelder–Mead algorithm
Days to heading
Developmental rate model for rice heading date prediction
Extended Bayesian LASSO
Genomic best linear unbiased prediction
A hierarchical model integrating EBL with the crop (DVR) model
Leave-one-‘combination of an environment and a line’-out cross-validation
Markov chain Monte Carlo
Quantitative trait locus
Root mean squared errors
Two-step approach based on C-Bay and EBL
Two-step approach based on C-Nel and EBL
The authors thank Seishi Ninomiya and Ryo Ohsawa for the contribution to the conception and design of this study. This study was supported by JSPS KAKENHI Grant Numbers 19208003 and 25252002 and by a Grant-in-Aid for JSPS Fellows (14J10661).
The data files, scripts and results are available at https://github.com/Onogi/HeadingDatePrediction. The data files contain heading and emergence dates and environmental information (daily mean air temperature and photoperiod).
Compliance with ethical standards
Conflict of interest
The authors declare that they have no competing interests.
- Bishop CM (2006) Pattern recognition and machine learning. Section 10.2.1 Variational distribution. New York: SpringerGoogle Scholar
- Breunig MM, Kriegel HP, Ng RT, Sander J (2000) LOF: identifying density-based local outliers. In: Chen W, Naughton JF, Bernstein PA (eds) Proceedings of the ACM SIGMOD International Conference on Management Data, ACM, pp 93–104Google Scholar
- Crossa J, Deloscampos G, Perez P, Gianola D, Burgueno J, Araus JL, Makumbi D, Singh RP, Dreisigacker S, Yan J, Arief V, Banziger M, Braun HJ (2010) Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 186:713–724CrossRefPubMedPubMedCentralGoogle Scholar
- Ghahramani Z, Beal MJ (2001) Propagation algorithms for variational Bayesian learning. In: Leen TK, Dietterich TG, Tresp V (eds) Advances in neural information processing systems 13, MIT press, pp 507–513Google Scholar
- Onogi A (2015) Documents for VIGoR. https://github.com/Onogi/VIGoR
- R Development Core Team (2011) R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. ISBN 3-900051-07-0. http://www.R-project.org/
- Reymond M, Muller B, Leonardi A, Charcosset A, Tardieu F (2003) Combining quantitative trait Loci analysis and an ecophysiological model to analyze the genetic variability of the responses of maize leaf growth to temperature and water deficit. Plant Physiol 131:664–675CrossRefPubMedPubMedCentralGoogle Scholar
- Soltani A, Sinclair TR (2012) Modeling physiology of crop development, growth and yield. Chapter 6 phenology–temperature. CABI, MA, USAGoogle Scholar
- Yano M, Katayose Y, Ashikari M, Yamanouchi U, Monna L, Fuse T, Baba T, Yamamoto K, Umehara Y, Nagamura Y, Sasaki T (2000) Hd1, a major photoperiod sensitivity quantitative trait locus in rice, is closely related to the Arabidopsis flowering time gene CONSTANS. Plant Cell 12:2473–2484CrossRefPubMedPubMedCentralGoogle Scholar