Skip to main content

Advertisement

Log in

A Primer on Persistent Homology of Finite Metric Spaces

  • Methods and Software
  • Published:
Bulletin of Mathematical Biology Aims and scope Submit manuscript

Abstract

Topological data analysis (TDA) is a relatively new area of research related to importing classical ideas from topology into the realm of data analysis. Under the umbrella term TDA, there falls, in particular, the notion of persistent homology PH, which can be described in a nutshell, as the study of scale-dependent homological invariants of datasets. In these notes, we provide a terse self-contained description of the main ideas behind the construction of persistent homology as an invariant feature of datasets, and its stability to perturbations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Azumaya G (1950) Corrections and supplementaries to my paper concerning Krull-Remak-Schmidts theorem. Nagoya Math J 1:117124

    Article  MathSciNet  MATH  Google Scholar 

  • Bardin J, Spreemann G, Hess K (2018) Topological exploration of artificial neuronal network dynamics. Netw Neurosci 1–19

  • Bartlett CW, Yeon Cheong S, Hou L, Paquette J, Yee Lum P, Jäger G, Battke F, Vehlow C, Heinrich J, Nieselt K, Sakai R, Aerts J, Ray WC (2012) An eqtl biological data visualization challenge and approaches from the visualization community. BMC Bioinform 13(8):S8

    Article  Google Scholar 

  • Bauer U (2015) Ripser. https://github.com/Ripser/ripser

  • Bauer U, Kerber M, Reininghaus J (2012) PHAT (Persistent homology algorithm toolbox). https://bitbucket.org/phat-code/phat

  • Brown EN, Frank LM, Tang D, Quirk MC, Wilson MA (1998) A statistical paradigm for neural spike train decoding applied to position prediction from ensemble firing patterns of rat hippocampal place cells. J Neurosci 18(18):7411–7425

    Article  Google Scholar 

  • Burago D, Burago Y, Ivanov S (2001) A course in metric geometry, vol 33 of AMS graduate studies in Math. American Mathematical Society

  • Cámara PG (2017) Topological methods for genomics: present and future directions. Curr Opin Syst Biol 1:95–101 (Future of systems biology genomics and epigenomics)

  • Cámara PG, Levine AJ, Rabadán R (2016a) Inference of ancestral recombination graphs through topological data analysis. PloS Comput Biol 12(8):1–25, 08

  • Cámara PG, Rosenbloom DIS, Emmett KJ, Levine AJ, Rabadán R (2016b) Topological data analysis generates high-resolution, genome-wide maps of human recombination. Cell Syst 3(1):83–94

    Article  Google Scholar 

  • Carlsson G (2009) Topology and data. Bull Am Math Soc 46(2):255–308

    Article  MathSciNet  MATH  Google Scholar 

  • Carlsson G (2014) Topological pattern recognition for point cloud data. Acta Numer 23:289368

    Article  MathSciNet  MATH  Google Scholar 

  • Carlsson G, de Silva V (2010) Zigzag persistence. Found Comput Math 10(4):367–405

    Article  Google Scholar 

  • Carlsson G, Mémoli F (2010) Characterization, stability and convergence of hierarchical clustering methods. J Mach Learn Res 11:1425–1470

    MathSciNet  MATH  Google Scholar 

  • Carlsson G, Mémoli F (2013) Classifying clustering schemes. Found Comput Math 13(2):221–252

    Article  Google Scholar 

  • Carlsson G, de Silva V, Morozov D (2009) Zigzag persistent homology and real-valued functions. In: Proceedings of the twenty-fifth annual symposium on computational geometry, SCG ’09, pp 247–256, New York, NY, USA. ACM

  • Crawley-Boevey W (2012) Decomposition of pointwise finite dimensional persistence modules. J Algebra Appl 14(05):1550066

    Article  MathSciNet  MATH  Google Scholar 

  • Chan JM, Carlsson G, Rabadán R (2013) Topology of viral evolution. Proc Natl Acad Sci 110(46):18566–18571

    Article  MathSciNet  MATH  Google Scholar 

  • Chazal F, Cohen-Steiner D, Guibas LJ, Mémoli F, Oudot SY (2009) Gromov–Hausdorff stable signatures for shapes using persistence. In: Proceedings of the symposium on geometry processing, SGP ’09, pp 1393–1403, Aire-la-Ville, Switzerland, Switzerland. Eurographics Association

  • Chowdhury S, Dai B, Mémoli F (2018) The importance of forgetting: limiting memory improves recovery of topological characteristics from neural data. PloS One 13(9):1–20, 09

  • Chazal F, Oudot SY, Glisse M, De Silva V (2016) The structure and stability of persistence modules. In: SpringerBriefs in mathematics. Springer, Berlin

  • Dabaghian Y, Mémoli F, Frank L, Carlsson G (2012) A topological paradigm for hippocampal spatial map formation using persistent homology. PloS Comput Biol 8(8):1–14, 08

  • de Cecco L, Nicolau M, Giannoccaro M, Daidone MG, Bossi P, Locati L, Licitra L, Canevari S (2015) Head and neck cancer subtypes with biological and clinical relevance: meta-analysis of gene-expression data. Oncotarget 6(11):9627–9642

    Article  Google Scholar 

  • Edelsbrunner H, Harer J (2010) Computational topology–an introduction. American Mathematical Society, Providence

    MATH  Google Scholar 

  • Emmett KJ, Rabadán R (2014) Characterizing scales of genetic recombination and antibiotic resistance in pathogenic bacteria using topological data analysis. In: Ślzak D, Tan A-H, Peters JF, Schwabe L (eds) Brain Informatics and Health. Springer, Cham, pp 540–551

    Chapter  Google Scholar 

  • Emmett K, Rabadán R (2016) Quantifying reticulation in phylogenetic complexes using homology. In: Proceedings of the 9th EAI international conference on bio-inspired information and communications technologies (formerly BIONETICS), BICT’15, pp 193–196, ICST, Brussels, Belgium, Belgium (2016) ICST. Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering

    Google Scholar 

  • Emmett K, Schweinhart B, Rabadán R (2016) Multiscale topology of chromatin folding. In: Proceedings of the 9th EAI international conference on bio-inspired information and communications technologies (formerly BIONETICS), BICT’15, pp 177–180, ICST, Brussels, Belgium, Belgium (2016) ICST. Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering

    Google Scholar 

  • Fasy BT, Kim J, Lecci F, Maria C, Rouvreau V (2014) TDA : statistical tools for topological data analysis. https://cran.r-project.org/web/packages/TDA/index.html. Accessed 1 Dec 2018

  • Frattini V, Pagnotta SM, Tala Fan JJ, Russo MV, Lee SB, Garofano L, Zhang J, Shi P, Lewis G, Sanson H, Frederick V, Castano AM, Cerulo L, Rolland DCM, Mall R, Mokhtari K, Elenitoba-Johnson KSJ, Sanson M, Huang X, Ceccarelli M, Lasorella A, Iavarone A (2018) A metabolic function of FGFR3-TACC3 gene fusions in cancer. Nature 553:222–227

    Article  Google Scholar 

  • Giusti C, Ghrist R, Bassett DS (2016) Two’s company, three (or more) is a simplex. J Comput Neurosci 41(1):1–14

    Article  MathSciNet  Google Scholar 

  • Ghrist RW (2014) Elementary applied topology, vol 1. Createspace, Seattle

    Google Scholar 

  • GUDHI Geometry Understanding in Higher Dimensions (2014) INRIA. http://gudhi.gforge.inria.fr/

  • Hatcher A (2000) Algebraic topology. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  • Jacobson N (2012) Basic algebra II. Dover books on mathematics, 2nd edn. Dover Publications, Mineola

  • Jardine N, Sibson R (1972) Mathematical taxonomy. Wiley series in probability and mathematical statistics, Wiley, New York

    MATH  Google Scholar 

  • Kenet T, Bibitchkov D, Tsodyks M, Grinvald A, Arieli A (2003) Spontaneously emerging cortical representations of visual attributes. Nature 425:954–956

    Article  Google Scholar 

  • Kyeong S, Park S, Cheon K, Kim J, Song D, Kim E (2015) A new approach to investigate the association between brain functional connectivity and disease characteristics of attention-deficit/hyperactivity disorder: topological neuroimaging data analysis. PloS One 10(9):1–15, 09

  • Lee Y, Barthel SD, Dlotko P, Moosavi SM, Hess K, Smit B (2018) High-throughput screening approach for nanoporous materials genome using topological data analysis: application to zeolites. J Chem Theory Comput 14(8):4427–4437 PMID: 29986145

    Article  Google Scholar 

  • Lesnick M (2015) The theory of the interleaving distance on multidimensional persistence modules. Found Comput Math 15(3):613–650

    Article  MathSciNet  MATH  Google Scholar 

  • Lewis R (2014) CTL. https://github.com/appliedtopology/ctl

  • Lesnick M, Rabadán R, Rosenbloom DIS (2018) Quantifying genetic innovation: mathematical foundations for the topological study of reticulate evolution. CoRR, arXiv:1804.01398

  • Lee J, Wang J, Sa JK, Ladewig E, Lee H, Lee I, Kang HJ, Rosenbloom DIS, Cámara PG, Liu Z, Nieuwenhuizen P, Jung SW, Choi SW, Kim J, Chen AH, Kim K, Shin SY, Seo YJ, Oh J, Shin YJ, Park C, Kong D, Seol HJ, Blumberg AJ, Lee J, Iavarone A, Park W, Rabadán R, Nam D (2017) Spatiotemporal genomic architecture informs precision oncology in glioblastoma. Nat Genet 49:594–599

    Article  Google Scholar 

  • McNaughton BL, Barnes CA, O’Keefe J (1983) The contributions of position, direction, and velocity to single unit activity in the hippocampus of freely-moving rats. Exp Brain Res 52(1):41–49

    Article  Google Scholar 

  • Morozov D (2012) Dionysus. http://www.mrzv.org/software/dionysus/

  • Munkres JR (1996) Elements of algebraic topology. Avalon Publishing, New York

    MATH  Google Scholar 

  • Nanda V (2002) Perseus, the persistent homology software. http://www.sas.upenn.edu/~vnanda/perseus

  • Nicolau M, Levine AJ, Carlsson G (2011) Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proce Natl Acad Sci 108(17):7265–7270

    Article  Google Scholar 

  • Olin A, Henckel E, Chen Y, Lakshmikanth T, Pou C, Mikes J, Gustafsson A, Bernhardsson AK, Zhang C, Bohlin K, Brodin P (2018) Stereotypic immune system development in newborn children. Cell 174(5):1277–1292.e14

    Article  Google Scholar 

  • Phinyomark A, Ibez-Marcelo E, Petri G (2017) Resting-state fmri functional connectivity: big data preprocessing pipelines and topological data analysis. IEEE Trans Big Data 3(4):415–428

    Article  Google Scholar 

  • Phinyomark A, Petri G, Ibáñez-Marcelo E, Osis ST, Ferber R (2018) Analysis of big data in gait biomechanics: current trends and future directions. J Med Biol Eng 38(2):244–260

    Article  Google Scholar 

  • Reninghaus J (2014) DIPHA (a Distributed Persistent Homology Algorithm. https://github.com/DIPHA/dipha/

  • Rucco M, Merelli E, Herman D, Ramanan D, Petrossian T, Falsetti L, Nitti C, Salvi A (2015) Using topological data analysis for diagnosis pulmonary embolism. J Theor Appl Comput Sci 9(1):41–55

    Google Scholar 

  • Saggar M, Sporns O, Gonzalez-Castillo J, Bandettini PA, Carlsson G, Glover G, Reiss AL (2018) Towards a new approach to reveal dynamical organization of the brain using topological data analysis. Nat Commun

  • Selinger C, Tisoncik-Go J, Menachery VD, Agnihothram S, Law GL, Chang J, Kelly SM, Sova P, Baric RS, Katze MG (2014) Cytokine systems approach demonstrates differences in innate and pro-inflammatory host responses between genetically distinct mers-cov isolates. BMC Genom 15(1):1161

    Article  Google Scholar 

  • Sexton H, Vejdemo-Johansson M (2008) jPlex. http://comptop.stanford.edu/programs/jplex/

  • Sgouralis I, Nebenfhr A, Maroulas V (2017) A bayesian topological framework for the identification and reconstruction of subcellular motion. SIAM J Imaging Sci 10(2):871–899

    Article  MathSciNet  MATH  Google Scholar 

  • Singh G, Mémoli F, Carlsson G (2007) Topological methods for the analysis of high dimensional data sets and 3D object recognition. In: Botsch M, Pajarola R, Chen B, Zwicker M (eds) Eurographics symposium on point-based graphics. The Eurographics Association

  • Singh G, Mémoli F, Ishkhanov T, Sapiro G, Carlsson G, Ringach DL (2008) Topological analysis of population activity in visual cortex. J Vis 8(8):11

    Article  Google Scholar 

  • Spivak DI (2014) Categ Theory Sci. The MIT Press, Cambridge

    Google Scholar 

  • Tausz A, Vejdemo-Johansson M, Adams H (2014) JavaPlex: a research software package for persistent (co)homology. In: Hong H, Yap C (eds) Proceedings of ICMS 2014, lecture notes in computer science 8592, pp 129–136. http://appliedtopology.github.io/javaplex/

  • Tierny J, Favelier G, Levine J, Gueunet C, Michaux M (2017) Topology toolkit. https://topology-tool-kit.github.io/index.html

  • Torres BY, Oliveira JHM, Thomas AT, Rath P, Cumnock K, Schneider DS (2016) Tracking resilience to infections by mapping disease space. PLOS Biol 14(4):1–19, 04

  • TGDA@OSU TRIPODS Mini course on persistent homology (2018). https://mbi.osu.edu/event/?id=1217

  • Tralie C, Saul N (2019) Scikit-tda: topological data analysis for python

  • Wadhwa R, Dhawan A, Williamson D, Scott J (2018) TDAstats. https://cran.r-project.org/package=TDAstats

  • Xia K, Wei GW (2014) Persistent homology analysis of protein structure, flexibility, and folding. Int J Numer Methods Biomed Eng 30(8):814–844

    Article  MathSciNet  Google Scholar 

  • Xia K, Feng X, Tong Y, Wei GW (2015) Persistent homology for the quantitative prediction of fullerene stability. J Comput Chem 36(6):408–422

    Article  Google Scholar 

  • Yao Y, Sun J, Huang X, Bowman GR, Singh G, Lesnick M, Guibas LJ, Pande VS, Carlsson G (2009) Topological methods for exploring low-density states in biomolecular folding pathways. J Chem Phys 130(14):144115

    Article  Google Scholar 

  • Zhang K, Ginzburg I, McNaughton BL, Sejnowski TJ (1998) Interpreting neuronal population activity by reconstruction: unified framework with application to hippocampal place cells. J Neurophys 79(2):1017–1044 PMID: 9463459

    Article  Google Scholar 

Download references

Acknowledgements

These notes are meant to supplement the lectures given by the first author during the TGDA@OSU TRIPODS Summer School held at MBI during May 2018. Videos of the lectures are available at TRIPODS (2018). We acknowledge NSF support through project CCF #1740761.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kritika Singhal.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mémoli, F., Singhal, K. A Primer on Persistent Homology of Finite Metric Spaces. Bull Math Biol 81, 2074–2116 (2019). https://doi.org/10.1007/s11538-019-00614-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11538-019-00614-z

Keywords

Navigation