Abstract
Complex disease is different from Mendelian disorders. Its development usually involves the interaction of multiple genes or the interaction between genes and the environment (i.e. epistasis). Although the high-throughput sequencing technologies for complex diseases have produced a large amount of data, it is extremely difficult to analyze the data due to the high feature dimension and the combination in the epistasis analysis. In this work, we introduce machine learning methods to effectively reduce the gene dimensionality, retain the key epistatic effects, and effectively characterize the relationship between epistatic effects and complex diseases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Moore JH (2005) A global view of epistasis. Nat Genet 37(1):13–14
Cheverud JM, Routman EJ (1995) Epistasis and its contribution to genetic variance. Genetics 139(3):1455–1461
Mackay TFC (2014) Epistasis and quantitative traits: using model organisms to study gene-gene interactions. Nat Rev Genet 15(1):22
Bateson W (1903) Mendel’s principles of heredity in mice. Nature 67(1744):512–512
Ellis RJ, Wang Z, Genes N et al (2019) Predicting opioid dependence from electronic health records with machine learning. BioData Min 12:3. https://doi.org/10.1186/s13040-019-0193-0
Liu Q, Zou Y, Liu X (2019) A self-organizing memory neural network for aerosol concentration prediction. Comput Model Eng Sci 119(3):617–637
Neto HA, Tavares WL, Ribeiro DC et al (2019) On the utilization of deep and ensemble learning to detect milk adulteration. BioData Min 12:13. https://doi.org/10.1186/s13040-019-0200-5
Ritchie MD, Motsinger AA, Bush WS et al (2007) Genetic programming neural networks: a powerful bioinformatics tool for human genetics. Appl Soft Comput 7(1):471–479
Kim D, Li R, Dudek SM et al (2014) Knowledge-driven genomic interactions: an application in ovarian cancer. BioData Min 7(1):20
Motsinger-Reif AA, Dudek SM, Hahn LW et al (2008) Comparison of approaches for machine-learning optimization of neural networks for detecting gene-gene interactions in genetic epidemiology. Genet Epidemiol 32(4):325–340
Bureau A, Dupuis J, Hayward B et al (2003) Mapping complex traits using random forests. BMC Genet 4(Suppl 1):S64
Listgarten J (2004) Predictive models for breast cancer susceptibility from multiple single nucleotide polymorphisms. Clin Cancer Res 10(8):2725–2737
Pedregosa F et al (2011) Scikit-learn: machine learning in Python. JMLR 12:2825–2830
Urbanowicz RJ, Kiralis J, Sinnott-Armstrong NA et al (2012) GAMETES: a fast, direct algorithm for generating pure, strict, epistatic models with random architectures. BioData Min 5(1):16
Wright MN, Ziegler A, König IR (2016) Do little interactions get lost in dark random forests? BMC Bioinformatics 17(1):145
http://www.bdmb-web.cn/index.php?m=content&c=index&a=show&catid=38&id=90
Krogh A, Vedelsby J (1995) Neural network ensembles, cross validation, and active learning. Adv Neural Inf Process Syst 7(10):231–238
https://scikit-learn.org/stable/modules/cross_validation.html
Hearst MA, Dumais ST, Osman E et al (1998) Support vector machines. IEEE Intell Syst 13(4):18–28
Cover TM, Hart PE (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13(1):21–27
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Kakuba G, Mango JM, Anthonissen MJH (2019) Convergence properties of local defect correction algorithm for the boundary element method. Comput Model Eng Sci 119(1):207–225
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Liu, L., Wong, KC. (2021). Epistasis Analysis: Classification Through Machine Learning Methods. In: Wong, KC. (eds) Epistasis. Methods in Molecular Biology, vol 2212. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-0947-7_21
Download citation
DOI: https://doi.org/10.1007/978-1-0716-0947-7_21
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-0946-0
Online ISBN: 978-1-0716-0947-7
eBook Packages: Springer Protocols