Abstract
In the area of genomics, Machine Learning (ML) is used to understand how genotype can influence phenotype, such as variations linked to genetic disease. This advanced use of ML was enabled by the recent explosion in automatic data collections through high-throughput sequencing technologies and the increased compute power through public cloud providers. However, the opportunity to process more data than ever before and train more sophisticated methods also means that the discipline of genomics increasingly becomes a digital domain, requiring new algorithms and different analysis strategies.
In this chapter, we discuss real world examples on how ML is used in clinical genomics to identify novel disease genes, prioritize pathogenic variants and as chatbots, supporting genetic counselling. We also discuss ML-applications in the biosecurity space where pathogen strains are grouped into evolutionary networks or the most effective CRISPR binding sites are recommended by ML. The chapter concludes by discussing the funding landscape for ML in the genomics space and provides an outlook on the ML-heavy future applications cases such as Gene Therapy, tracking Antimicrobial Resistance and personalized disease risk prediction for cardiovascular health.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Bibliography
AEHRC Pathling—Advanced FHIR® analytics server [Online]. Available at: https://pathling.csiro.au/. Accessed 13 Jan 2021
AWS Marketplace (2019) AWS Marketplace: VariantSpark Notebook [Online]. Available at: https://aws.amazon.com/marketplace/pp/AEHRC-VariantSpark-Notebook/B07YVND4TD. Accessed 13 Jan 2021
Bauer DC, Metke-Jimenez A, Maurer-Stroh S et al (2020a) Interoperable medical data: the missing link for understanding COVID-19. Transbound Emerg Dis. Jul 68(4):1753–1760
Bauer DC, Tay AP, Wilson LOW et al (2020b) Supporting pandemic response using genomics and bioinformatics: a case study on the emergent SARS-CoV-2 outbreak. Transbound Emerg Dis 67(4):1453–1462
Bayat A, Hosking B, Jain Y, Hosking C, Twine N, Bauer DC (2019) BitEpi: a fast and accurate exhaustive higher-order epistasis search. BioRxiv
Bayat A, Szul P, O’Brien AR et al (2020) VariantSpark: cloud-based machine learning for association study of complex phenotype and large-scale genomic data. GigaScience 9(8)
CSIRO (2020) Working against the new coronavirus. [Online]. Available at: https://www.csiro.au/en/Research/Health/Infectious-dieases-coronavirus/coronavirus. Accessed 5 Mar 2020
Danskine G (2020) Australia’s health sector is seeing a surge of collaboration – interconnections – the Equinix Blog [Online]. Available at: https://blog.equinix.com/blog/2020/09/15/australias-health-sector-is-seeing-a-surge-of-collaboration/?lang=ja. Accessed 13 Jan 2021
Global Innovation Index (2020) Global innovation index | Who will finance innovation? [Online]. Available at: https://www.globalinnovationindex.org/Home. Accessed 13 Jan 2021
Guo R, Zhao Y, Zou Q, Fang X, Peng S (2018) Bioinformatics applications on apache spark. GigaScience 7(8)
Hail T (2021) Hail. Hail Team
Khera AV, Chaffin M, Aragam KG et al (2018) Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat Genet 50(9):1219–1224
Koopman B, Bradford D, Hansen D (2020) Exemplars of artificial intelligence and machine learning in healthcare: improving the safety, quality, efficiency and accessibility of Australia’s healthcare system. CSIRO
Meng X, Bradley J, Yavuz B et al (2015) MLlib: machine learning in apache spark. arXiv
O’Brien AR, Wilson LOW, Burgio G, Bauer DC (2019) Unlocking HDR-mediated nucleotide editing by identifying high-efficiency target sites using machine learning. Sci Rep 9(1):2788
OECD (2017) OECD science, technology and industry scoreboard 2017: the digital transformation. OECD
OUTBREAK Project (2020) Antimicrobial resistance – stop the rise of superbugs – OUTBREAK [Online]. Available at: https://outbreakproject.com.au/. Accessed 13 Jan 2021
Schwab K (2019) The global competitiveness report 2019
Stephens ZD, Lee SY, Faghri F et al (2015) Big data: astronomical or genomical? PLoS Biol 13(7):e1002195
Wiewiórka MS, Messina A, Pacholewska A, Maffioletti S, Gawrysiak P, Okoniewski MJ (2014) SparkSeq: fast, scalable and cloud-ready tool for the interactive genomic data analysis with nucleotide precision. Bioinformatics 30(18):2652–2653
Wilson LOW, Hetzel S, Pockrandt C, Reinert K, Bauer DC (2019) VARSCOT: variant-aware detection and scoring enables sensitive and personalized off-target detection for CRISPR-Cas9. BMC Biotechnol 19(1):40
Wilson LOW, Reti D, O’Brien AR, Dunne RA, Bauer DC (2018) High activity target-site identification using phenotypic independent CRISPR-Cas9 Core functionality. CRISPR J 1(2):182–190
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Bauer, D.C., Wilson, L.O.W., Twine, N.A. (2022). Artificial Intelligence in Medicine: Applications, Limitations and Future Directions. In: Raz, M., Nguyen, T.C., Loh, E. (eds) Artificial Intelligence in Medicine. Springer, Singapore. https://doi.org/10.1007/978-981-19-1223-8_5
Download citation
DOI: https://doi.org/10.1007/978-981-19-1223-8_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-1222-1
Online ISBN: 978-981-19-1223-8
eBook Packages: MedicineMedicine (R0)