Abstract
Topological data analysis (TDA) is a relatively new area of research related to importing classical ideas from topology into the realm of data analysis. Under the umbrella term TDA, there falls, in particular, the notion of persistent homology PH, which can be described in a nutshell, as the study of scale-dependent homological invariants of datasets. In these notes, we provide a terse self-contained description of the main ideas behind the construction of persistent homology as an invariant feature of datasets, and its stability to perturbations.
Similar content being viewed by others
References
Azumaya G (1950) Corrections and supplementaries to my paper concerning Krull-Remak-Schmidts theorem. Nagoya Math J 1:117124
Bardin J, Spreemann G, Hess K (2018) Topological exploration of artificial neuronal network dynamics. Netw Neurosci 1–19
Bartlett CW, Yeon Cheong S, Hou L, Paquette J, Yee Lum P, Jäger G, Battke F, Vehlow C, Heinrich J, Nieselt K, Sakai R, Aerts J, Ray WC (2012) An eqtl biological data visualization challenge and approaches from the visualization community. BMC Bioinform 13(8):S8
Bauer U (2015) Ripser. https://github.com/Ripser/ripser
Bauer U, Kerber M, Reininghaus J (2012) PHAT (Persistent homology algorithm toolbox). https://bitbucket.org/phat-code/phat
Brown EN, Frank LM, Tang D, Quirk MC, Wilson MA (1998) A statistical paradigm for neural spike train decoding applied to position prediction from ensemble firing patterns of rat hippocampal place cells. J Neurosci 18(18):7411–7425
Burago D, Burago Y, Ivanov S (2001) A course in metric geometry, vol 33 of AMS graduate studies in Math. American Mathematical Society
Cámara PG (2017) Topological methods for genomics: present and future directions. Curr Opin Syst Biol 1:95–101 (Future of systems biology genomics and epigenomics)
Cámara PG, Levine AJ, Rabadán R (2016a) Inference of ancestral recombination graphs through topological data analysis. PloS Comput Biol 12(8):1–25, 08
Cámara PG, Rosenbloom DIS, Emmett KJ, Levine AJ, Rabadán R (2016b) Topological data analysis generates high-resolution, genome-wide maps of human recombination. Cell Syst 3(1):83–94
Carlsson G (2009) Topology and data. Bull Am Math Soc 46(2):255–308
Carlsson G (2014) Topological pattern recognition for point cloud data. Acta Numer 23:289368
Carlsson G, de Silva V (2010) Zigzag persistence. Found Comput Math 10(4):367–405
Carlsson G, Mémoli F (2010) Characterization, stability and convergence of hierarchical clustering methods. J Mach Learn Res 11:1425–1470
Carlsson G, Mémoli F (2013) Classifying clustering schemes. Found Comput Math 13(2):221–252
Carlsson G, de Silva V, Morozov D (2009) Zigzag persistent homology and real-valued functions. In: Proceedings of the twenty-fifth annual symposium on computational geometry, SCG ’09, pp 247–256, New York, NY, USA. ACM
Crawley-Boevey W (2012) Decomposition of pointwise finite dimensional persistence modules. J Algebra Appl 14(05):1550066
Chan JM, Carlsson G, Rabadán R (2013) Topology of viral evolution. Proc Natl Acad Sci 110(46):18566–18571
Chazal F, Cohen-Steiner D, Guibas LJ, Mémoli F, Oudot SY (2009) Gromov–Hausdorff stable signatures for shapes using persistence. In: Proceedings of the symposium on geometry processing, SGP ’09, pp 1393–1403, Aire-la-Ville, Switzerland, Switzerland. Eurographics Association
Chowdhury S, Dai B, Mémoli F (2018) The importance of forgetting: limiting memory improves recovery of topological characteristics from neural data. PloS One 13(9):1–20, 09
Chazal F, Oudot SY, Glisse M, De Silva V (2016) The structure and stability of persistence modules. In: SpringerBriefs in mathematics. Springer, Berlin
Dabaghian Y, Mémoli F, Frank L, Carlsson G (2012) A topological paradigm for hippocampal spatial map formation using persistent homology. PloS Comput Biol 8(8):1–14, 08
de Cecco L, Nicolau M, Giannoccaro M, Daidone MG, Bossi P, Locati L, Licitra L, Canevari S (2015) Head and neck cancer subtypes with biological and clinical relevance: meta-analysis of gene-expression data. Oncotarget 6(11):9627–9642
Edelsbrunner H, Harer J (2010) Computational topology–an introduction. American Mathematical Society, Providence
Emmett KJ, Rabadán R (2014) Characterizing scales of genetic recombination and antibiotic resistance in pathogenic bacteria using topological data analysis. In: Ślzak D, Tan A-H, Peters JF, Schwabe L (eds) Brain Informatics and Health. Springer, Cham, pp 540–551
Emmett K, Rabadán R (2016) Quantifying reticulation in phylogenetic complexes using homology. In: Proceedings of the 9th EAI international conference on bio-inspired information and communications technologies (formerly BIONETICS), BICT’15, pp 193–196, ICST, Brussels, Belgium, Belgium (2016) ICST. Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering
Emmett K, Schweinhart B, Rabadán R (2016) Multiscale topology of chromatin folding. In: Proceedings of the 9th EAI international conference on bio-inspired information and communications technologies (formerly BIONETICS), BICT’15, pp 177–180, ICST, Brussels, Belgium, Belgium (2016) ICST. Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering
Fasy BT, Kim J, Lecci F, Maria C, Rouvreau V (2014) TDA : statistical tools for topological data analysis. https://cran.r-project.org/web/packages/TDA/index.html. Accessed 1 Dec 2018
Frattini V, Pagnotta SM, Tala Fan JJ, Russo MV, Lee SB, Garofano L, Zhang J, Shi P, Lewis G, Sanson H, Frederick V, Castano AM, Cerulo L, Rolland DCM, Mall R, Mokhtari K, Elenitoba-Johnson KSJ, Sanson M, Huang X, Ceccarelli M, Lasorella A, Iavarone A (2018) A metabolic function of FGFR3-TACC3 gene fusions in cancer. Nature 553:222–227
Giusti C, Ghrist R, Bassett DS (2016) Two’s company, three (or more) is a simplex. J Comput Neurosci 41(1):1–14
Ghrist RW (2014) Elementary applied topology, vol 1. Createspace, Seattle
GUDHI Geometry Understanding in Higher Dimensions (2014) INRIA. http://gudhi.gforge.inria.fr/
Hatcher A (2000) Algebraic topology. Cambridge University Press, Cambridge
Jacobson N (2012) Basic algebra II. Dover books on mathematics, 2nd edn. Dover Publications, Mineola
Jardine N, Sibson R (1972) Mathematical taxonomy. Wiley series in probability and mathematical statistics, Wiley, New York
Kenet T, Bibitchkov D, Tsodyks M, Grinvald A, Arieli A (2003) Spontaneously emerging cortical representations of visual attributes. Nature 425:954–956
Kyeong S, Park S, Cheon K, Kim J, Song D, Kim E (2015) A new approach to investigate the association between brain functional connectivity and disease characteristics of attention-deficit/hyperactivity disorder: topological neuroimaging data analysis. PloS One 10(9):1–15, 09
Lee Y, Barthel SD, Dlotko P, Moosavi SM, Hess K, Smit B (2018) High-throughput screening approach for nanoporous materials genome using topological data analysis: application to zeolites. J Chem Theory Comput 14(8):4427–4437 PMID: 29986145
Lesnick M (2015) The theory of the interleaving distance on multidimensional persistence modules. Found Comput Math 15(3):613–650
Lewis R (2014) CTL. https://github.com/appliedtopology/ctl
Lesnick M, Rabadán R, Rosenbloom DIS (2018) Quantifying genetic innovation: mathematical foundations for the topological study of reticulate evolution. CoRR, arXiv:1804.01398
Lee J, Wang J, Sa JK, Ladewig E, Lee H, Lee I, Kang HJ, Rosenbloom DIS, Cámara PG, Liu Z, Nieuwenhuizen P, Jung SW, Choi SW, Kim J, Chen AH, Kim K, Shin SY, Seo YJ, Oh J, Shin YJ, Park C, Kong D, Seol HJ, Blumberg AJ, Lee J, Iavarone A, Park W, Rabadán R, Nam D (2017) Spatiotemporal genomic architecture informs precision oncology in glioblastoma. Nat Genet 49:594–599
McNaughton BL, Barnes CA, O’Keefe J (1983) The contributions of position, direction, and velocity to single unit activity in the hippocampus of freely-moving rats. Exp Brain Res 52(1):41–49
Morozov D (2012) Dionysus. http://www.mrzv.org/software/dionysus/
Munkres JR (1996) Elements of algebraic topology. Avalon Publishing, New York
Nanda V (2002) Perseus, the persistent homology software. http://www.sas.upenn.edu/~vnanda/perseus
Nicolau M, Levine AJ, Carlsson G (2011) Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proce Natl Acad Sci 108(17):7265–7270
Olin A, Henckel E, Chen Y, Lakshmikanth T, Pou C, Mikes J, Gustafsson A, Bernhardsson AK, Zhang C, Bohlin K, Brodin P (2018) Stereotypic immune system development in newborn children. Cell 174(5):1277–1292.e14
Phinyomark A, Ibez-Marcelo E, Petri G (2017) Resting-state fmri functional connectivity: big data preprocessing pipelines and topological data analysis. IEEE Trans Big Data 3(4):415–428
Phinyomark A, Petri G, Ibáñez-Marcelo E, Osis ST, Ferber R (2018) Analysis of big data in gait biomechanics: current trends and future directions. J Med Biol Eng 38(2):244–260
Reninghaus J (2014) DIPHA (a Distributed Persistent Homology Algorithm. https://github.com/DIPHA/dipha/
Rucco M, Merelli E, Herman D, Ramanan D, Petrossian T, Falsetti L, Nitti C, Salvi A (2015) Using topological data analysis for diagnosis pulmonary embolism. J Theor Appl Comput Sci 9(1):41–55
Saggar M, Sporns O, Gonzalez-Castillo J, Bandettini PA, Carlsson G, Glover G, Reiss AL (2018) Towards a new approach to reveal dynamical organization of the brain using topological data analysis. Nat Commun
Selinger C, Tisoncik-Go J, Menachery VD, Agnihothram S, Law GL, Chang J, Kelly SM, Sova P, Baric RS, Katze MG (2014) Cytokine systems approach demonstrates differences in innate and pro-inflammatory host responses between genetically distinct mers-cov isolates. BMC Genom 15(1):1161
Sexton H, Vejdemo-Johansson M (2008) jPlex. http://comptop.stanford.edu/programs/jplex/
Sgouralis I, Nebenfhr A, Maroulas V (2017) A bayesian topological framework for the identification and reconstruction of subcellular motion. SIAM J Imaging Sci 10(2):871–899
Singh G, Mémoli F, Carlsson G (2007) Topological methods for the analysis of high dimensional data sets and 3D object recognition. In: Botsch M, Pajarola R, Chen B, Zwicker M (eds) Eurographics symposium on point-based graphics. The Eurographics Association
Singh G, Mémoli F, Ishkhanov T, Sapiro G, Carlsson G, Ringach DL (2008) Topological analysis of population activity in visual cortex. J Vis 8(8):11
Spivak DI (2014) Categ Theory Sci. The MIT Press, Cambridge
Tausz A, Vejdemo-Johansson M, Adams H (2014) JavaPlex: a research software package for persistent (co)homology. In: Hong H, Yap C (eds) Proceedings of ICMS 2014, lecture notes in computer science 8592, pp 129–136. http://appliedtopology.github.io/javaplex/
Tierny J, Favelier G, Levine J, Gueunet C, Michaux M (2017) Topology toolkit. https://topology-tool-kit.github.io/index.html
Torres BY, Oliveira JHM, Thomas AT, Rath P, Cumnock K, Schneider DS (2016) Tracking resilience to infections by mapping disease space. PLOS Biol 14(4):1–19, 04
TGDA@OSU TRIPODS Mini course on persistent homology (2018). https://mbi.osu.edu/event/?id=1217
Tralie C, Saul N (2019) Scikit-tda: topological data analysis for python
Wadhwa R, Dhawan A, Williamson D, Scott J (2018) TDAstats. https://cran.r-project.org/package=TDAstats
Xia K, Wei GW (2014) Persistent homology analysis of protein structure, flexibility, and folding. Int J Numer Methods Biomed Eng 30(8):814–844
Xia K, Feng X, Tong Y, Wei GW (2015) Persistent homology for the quantitative prediction of fullerene stability. J Comput Chem 36(6):408–422
Yao Y, Sun J, Huang X, Bowman GR, Singh G, Lesnick M, Guibas LJ, Pande VS, Carlsson G (2009) Topological methods for exploring low-density states in biomolecular folding pathways. J Chem Phys 130(14):144115
Zhang K, Ginzburg I, McNaughton BL, Sejnowski TJ (1998) Interpreting neuronal population activity by reconstruction: unified framework with application to hippocampal place cells. J Neurophys 79(2):1017–1044 PMID: 9463459
Acknowledgements
These notes are meant to supplement the lectures given by the first author during the TGDA@OSU TRIPODS Summer School held at MBI during May 2018. Videos of the lectures are available at TRIPODS (2018). We acknowledge NSF support through project CCF #1740761.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Mémoli, F., Singhal, K. A Primer on Persistent Homology of Finite Metric Spaces. Bull Math Biol 81, 2074–2116 (2019). https://doi.org/10.1007/s11538-019-00614-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11538-019-00614-z