Skip to main content

Metagenome-Based Disease Classification with Deep Learning and Visualizations Based on Self-organizing Maps

  • Conference paper
  • First Online:
Future Data and Security Engineering (FDSE 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11814))

Included in the following conference series:

Abstract

Machine learning algorithms have recently revealed impressive results across a variety of biology and medicine domains. The applications of machine learning in bioinformatics include predicting of biological processes (for example, prediction tasks on gene function), prevention of diseases and personalized treatment. In the last decade, deep learning has gained an impressive success on a variety of problems such as speech recognition, image classification, and natural language processing. Among various methodological variants of deep learning networks, the Convolutional Neural Networks (CNN) have been extensively studied, especially in the field of image processing. Moreover, Data visualization is considered as an indispensable technique for the exploratory data analysis and becomes a key for discoveries. In this paper, a novel approach based on visualization capabilities of Self-Organizing Maps and deep learning is proposed to not only visualize metagenomic data but also leverage advances in deep learning to improve the disease prediction. Several solutions are also introduced to reduce negative affects of overlapped points to enhance the performance. The proposed approach is evaluated on six metagenomic datasets using species abundance. The results reveal that the proposed visualization not only shows improvements in the performance but also allows to visualize biomedical signatures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://github.com/JustGlowing/minisom.

References

  1. Sudarikov, K., et al.: Methods for the metagenomic data visualization and analysis. Curr. Issues Mol. Biol., 37–58 (2017). ISSN: 14673037

    Google Scholar 

  2. Jiang, L., et al.: Exploring the influence of environmental factors on bacterial communities within the rhizosphere of the cu-tolerant plant, Elsholtzia splendens. Scientific Report (2016). ISSN: 2045–2322. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5080579/

  3. Morton, J.T., et al.: Balance trees reveal microbial niche differentiation (2017). https://doi.org/10.1128/mSystems.00162-16

  4. R Development Core Team: A Language and Environment for Statistical Computing (2008). ISBN: 3-900051-07-0

    Google Scholar 

  5. Ondov, B.D., et al.: Interactive metagenomic visualization in a Web browser. BMC Bioinform., 385 (2011)

    Google Scholar 

  6. Kerepesi, C., et al.: AmphoraNet: the webserver implementation of the AMPHORA2 metagenomic workflow suite. Gene, 538–540 (2013). https://doi.org/10.1016/j.gene.2013.10.015

    Article  Google Scholar 

  7. Rudis, B., Almossawi, A., Ulmer, H.: Package ‘metricsgraphics’, CRAN repository (2015). https://CRAN.R-project.org/package=metricsgraphics

  8. Pasolli, E., et al.: Machine learning meta-analysis of large metagenomic datasets: tools and biological insights. PLoS Comput. Biol. (2016)

    Google Scholar 

  9. Warnes, G.R., et al.: Package ‘gplots’, CRAN Repository (2016). https://CRAN.R-project.org/package=gplots

  10. Bik, H.: Phinch: an interactive, exploratory data visualization framework for metagenomic datasets (2014). https://doi.org/10.6084/m9.figshare.951915.v1

  11. Cheng, J.: Package ‘d3heatmap’, CRAN repository (2016). https://CRAN.R-project.org/package=d3heatmap

  12. Jiang, X., et al.: Manifold learning reveals nonlinear structure in metagenomic profiles. In: 2012 IEEE International Conference on Bioinformatics and Biomedicine (2012)

    Google Scholar 

  13. Alshawaqfeh, M., et al.: Consistent metagenomic biomarker detection via robust PCA. Biology Direct (2016)

    Google Scholar 

  14. Meyer, F., et al.: The metagenomics RAST server - a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinform. (2011)

    Google Scholar 

  15. Nguyen, T.H., et al.: Disease classification in metagenomics with 2D embeddings and deep learning. In: The Annual French Conference in Machine Learning (CAP 2018) (2018)

    Google Scholar 

  16. Kingma, D.P., et al.: Adam: a method for stochastic optimization, CoRR abs/1412.6980 (2014)

    Google Scholar 

  17. Sarica, A., Cerasa, A., Quattrone, A.: Random forest algorithm for the classification of neuroimaging data in Alzheimer’s disease: a systematic review. PMC (2017). https://doi.org/10.3389/fnagi.2017.00329

  18. Ma, H., Xu, C.-F., Shen, Z., Yu, C.-H., Li, Y.-M.: Application of machine learning techniques for clinical predictive modeling: a cross-sectional study on nonalcoholic fatty liver disease in China. BioMed Res. Int. (2018). https://doi.org/10.1155/2018/4304376

    Google Scholar 

  19. LaPierre, N., Ju, C.J., Zhou, G., Wang, W.: MetaPheno: a critical evaluation of deep learning and machine learning in metagenome-based disease prediction. PubMed (2019). https://doi.org/10.1016/j.ymeth.2019.03.003

    Article  Google Scholar 

  20. Kohonen, T.: The self-organising map. In: Proceedings of the IEEE (1990)

    Google Scholar 

  21. Karlsson, F.H., et al.: Gut metagenome in European women with normal, impaired and diabetic glucose control. Nature 498, 99–103 (2013)

    Article  Google Scholar 

  22. Qin, N., et al.: Alterations of the human gut microbiome in liver cirrhosis. Nature 513, 59–64 (2014)

    Article  Google Scholar 

  23. Pasolli, E., Truong, D.T., Malik, F., Waldron, L., Segata, N.: Machine learning meta-analysis of large metagenomic datasets: tools and biological insights. PLoS Comput. Biol. 12, e1004977 (2016)

    Article  Google Scholar 

  24. Le Chatelier, E., et al.: Richness of human gut mi- crobiome correlates with metabolic markers. Nature 500, 541–546 (2013)

    Article  Google Scholar 

  25. Qin, J., et al.: A human gut microbial gene catalogue established by metagenomic sequencing. Nature 464, 59–65 (2010)

    Article  Google Scholar 

  26. Truong, D.T., et al.: MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat. Methods 12, 902–903 (2015)

    Article  Google Scholar 

  27. Zeller, G., et al.: Potential of fecal microbiota for early-stage detection of colorectal cancer. Mol. Syst. Biol. 10, 766 (2014)

    Article  Google Scholar 

  28. Qin, J., et al.: A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 490, 55–60 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thanh Hai Nguyen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nguyen, T.H. (2019). Metagenome-Based Disease Classification with Deep Learning and Visualizations Based on Self-organizing Maps. In: Dang, T., Küng, J., Takizawa, M., Bui, S. (eds) Future Data and Security Engineering. FDSE 2019. Lecture Notes in Computer Science(), vol 11814. Springer, Cham. https://doi.org/10.1007/978-3-030-35653-8_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-35653-8_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-35652-1

  • Online ISBN: 978-3-030-35653-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics