Abstract
We summarize some areas of great potential for data analytics based on ultrametric topology. These areas include search and discovery, in a way that is computationally efficient and scalable, as well as with demonstrated effectiveness. Further areas discussed are the analysis and synthesis of narrative. We conclude with a preliminary description of work on emotion that has developed out of our work on narrative.
Similar content being viewed by others
References
P. Contreras and F. Murtagh, “Fast, linear time hierarchical clustering using the Baire metric,” J. Classification 29, 118–143 (2012).
Tim Cooks, “Delivery,” http://amazon.co.uk/o/ASIN/B004PLNO08/ (June 2009).
Tim Cooks, “The deception of success,” http://amazon.com/o/ASIN/B006LLEY2M/ (December 2011).
Tim Cooks, “Playing with controversy,” http://amazon.com/o/ASIN/B008I2LDG2/ (February 2012).
Tim Cooks, “Roadkill Casserole,” http://amazon.com/o/ASIN/B0076IJPT6/ (February 2012).
R. D’Abrusco, A. Staiano, G. Longo, E. De Filippis, M. Brescia, M. Paolillo and E. De Filippis, “Mining the SDSS archive. I. Photometric redshift in the nearby universe,” Astrophys. J. 663, 752–764 (2007).
DreamBank, “Repository of dream reports”, www.dreambank.net (2004).
P. Hall, J. S. Marron and A. Neeman, “Geometric representation of high dimension low sample size data,” J. Royal Statist. Society Series B 67, 427–444 (2005).
Hongbo Chen and Ben He, “Automated essay scoring by capturing relative writing quality,” Computer J., in press, (published online 15 October 2013).
S. C. Johnson, “Hierarchical clustering schemes,” Psychometrika 2, 241–254 (1967).
A. Yu. Khrennikov, Information Dynamics in Cognitive, Psychological, Social and Anomalous Phenomena (Kluwer, 2004).
B. Le Roux and H. Rouanet, Geometric Data Analysis: From Correspondence Analysis to Structured Data Analysis (Kluwer Academic Publ., Dordrecht, 2004).
A. Levy, Basic Set Theory (Dover, 1979, reprinted 2002).
J. Mayika et al., “Big data: The next frontier for innovation, competition, and productivity,” pp. 156 (McKinsey Global Institute, May 2011).
R. McKee, Story: Substance, Structure, Style, and the Principles of Screenwriting (Methuen, 1999).
F. Murtagh, “On ultrametricity, data coding, and computation,” J. Classification 21, 167–184 (2004).
F. Murtagh, Correspondence Analysis and Data Coding with R and Java (Chapman and Hall/CRC Press, 2005).
F. Murtagh, “Identifying the ultrametricity of time series,” Europ. Phys. J. B 43, 573–579 (2005).
F. Murtagh, “Symmetry in data mining and analysis: a unifying view based on hierarchy,” Proc. Steklov Inst. Math. 265, 177–198 (2009).
F. Murtagh, “Ultrametric model of mind, II: Application to text content analysis,” p-Adic Numbers Ultram. Anal. Appl. 4(3), 207–221 (2012).
F. Murtagh, “Mapping of emotion,” video based on Scene 43 of Casablanca movie, YouTube (2012); http://www.youtube.com/watch?v=J4EFLp4yGZo.
F. Murtagh, “Ultrametric component analysis with application to text analysis,” report, 48 pp. (2013).
F. Murtagh, G. Downs and P. Contreras, “Hierarchical clustering of massive, high dimensional data sets by exploiting ultrametric embedding,” SIAM J. Scient. Comput. 30(2), 707–730 (2008).
F. Murtagh, A. Ganz and S. McKie, “The structure of narrative: the case of film scripts,” Pattern Recogn. 42, 302–312 (2009).
F. Murtagh, A. Ganz, S. McKie, J. Mothe and K. Englmeier, “Tag clouds for displaying semantics: The Case of filmscripts,” Inform. Visual. J. 9, 253–262 (2010).
F. Murtagh, A. Ganz and J. Reddington, “New methods of analysis of narrative and semantics in support of interactivity,” Entertainment Comput. 2, 115–121 (2011).
F. Murtagh and P. Contreras, “Fast, linear time, m-adic hierarchical clustering for search and retrieval using the Baire metric, with linkages to generalized ultrametrics, hashing, formal concept analysis, and precision of data measurement,” p-Adic Numbers Ultram. Anal. Appl. 4, 45–56 (2012).
J. Reddington, F. Murtagh and D. Cowie, “Computational properties of fiction writing and collaborative work,” Advances in Intelligent Data Analysis XII, Lecture Notes in Computer Science 8207, 369–379 (Springer, 2013).
J. Séguéla and G. Saporta, “A comparison between latent semantic analysis and correspondence analysis”, presentation, CARME Conferences (Rennes, France, 9 February 2011); http//carme2011.agrocampus-ouest.fr/slides/Seguela_Saporta.pdf.
H. A. ’Simon, The Sciences of the Artificial (MIT Press, 3rd edn., 1996).
Apache Solr, open source enterprise search platform from the Apache Lucene project, http://lucene.apache.org/solr.
M. Tonti, Emotions and the Unconscious: Modeling and Measuring the Affective Salience of the Mind (PhD, Università del Salento, 2012).
T. Veale, Exploding the Creativity Myth: The Computational Foundations of Linguistic Creativity (Bloomsbury Academic, 2012).
Author information
Authors and Affiliations
Corresponding author
Additional information
Based on talk at the International Workshop on p-Adic Methods for Modelling of Complex Systems, Bielefeld, April 15–19, 2013.
The text was submitted by the author in English.
Rights and permissions
About this article
Cite this article
Murtagh, F. The new science of complex systems through ultrametric analysis: Application to search and discovery, to narrative and to thinking. P-Adic Num Ultrametr Anal Appl 5, 326–337 (2013). https://doi.org/10.1134/S2070046613040067
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S2070046613040067