Something Borrowed, Something New: Precise Prediction of Outcomes from Diverse Genomic Profiles
Precise outcome predictions at an individual level from diverse genomic data is a problem of great interest as the focus on precision medicine grows. This typically requires estimation of subgroup-specific models which may differ in their mean and/or variance structure. Thus in order to accurately predict outcomes for new individuals, it’s necessary to map them to a subgroup from which the prediction can be derived. The situation becomes more interesting when some predictors are common across subgroups and others are not. We describe a series of statistical methodologies under two different scenarios that can provide this mapping, as well as combine information that can be shared across subgroups, with information that is subgroup-specific. We demonstrate that prediction errors can be markedly reduced as compared to not borrowing strength at all. We then apply the approaches in order to predict colon cancer survival from DNA methylation profiles that vary by age groups, and identify those significant methylation sites that are shared across the age groups and those that are age-specific.
Funding and Acknowledgements
J.S.R. was partially funded by NIH grants R01-CA160593A1, R01-GM085205 and NSF grant DMS 1513266. E.K. was partially funded by NIH grant (put in details here). E.K. and D.S. were partially funded by Bankhead-Coley Team Science grant 2BT02 and ACS Institutional Research Grant 98-277-10. J.S.R., E.K. and D.S. were partially funded by NIH grant UL1-TR000460. The authors declare that they have no competing financial interests.
- 2.George, A.W., P.M. Visscher, and C.S. Haley. 2000. Mapping quantitative trait loci in complex pedigrees: a two-step variance component approach. Genetics 156: 2081–2092.Google Scholar
- 4.Henderson, C.R. 1948. Estimation of general, specific and maternal combining abilities in crosses among inbred lines of swine. Ph. D. Thesis, Iowa State University, Ames, Iowa.Google Scholar
- 5.Henderson, C.R. 1984. Application of linear models in animal breeding. Technical Report, (University of Guelph, Ontario).Google Scholar
- 9.Jiang, J., J.S., Rao, Fan, J., and Ngyuen, T. 2015. Classified mixed prediction. Technical Report, University of Miami, Division of Biostatisics.Google Scholar
- 18.Rao, J.S., Kobetz, E. and Coppede, F. 2016. PRISM regression models: The anatomical and genetic to gender and age-related changes of DNA methylation in colorectal cancer (submitted).Google Scholar
- 24.West, L., S.J. Vidwans, N.P. Campbell, J. Shrager, G.R. Simon, R. Bueno, P.A. Dennis, G.A. Otterson, and R. Salgia. 2012. A novel classification of lung cancer into molecular subtypes. PLoS ONE 7: e31906. https://doi.org/10.1371/journal.pone.0031906.