Marginality: A Numerical Mapping for Enhanced Exploitation of Taxonomic Attributes

  • Josep Domingo-Ferrer
Conference paper

DOI: 10.1007/978-3-642-34620-0_33

Volume 7647 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Domingo-Ferrer J. (2012) Marginality: A Numerical Mapping for Enhanced Exploitation of Taxonomic Attributes. In: Torra V., Narukawa Y., López B., Villaret M. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2012. Lecture Notes in Computer Science, vol 7647. Springer, Berlin, Heidelberg

Abstract

Hierarchical attributes appear in taxonomic or ontology-based data (e.g. NACE economic activities, ICD-classified diseases, animal/plant species). Such taxonomic data are often exploited as if they were flat nominal data without hierarchy, which loses substantial information and analytical power. We introduce marginality, a numerical mapping for taxonomic data that makes many of the algorithms and analytical techniques designed for numerical data applicable to such data. We show how to compute descriptive statistics such as the mean, the variance and the covariance on marginality-mapped data. We also define a mathematical distance between records that include hierarchical attributes, based on marginality-based variances. This distance paves the way to reusing, on taxonomic data, clustering and anonymization techniques designed for numerical data.
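
The abstract does not state how marginality is computed, so the following is only an illustrative sketch of the general idea of mapping taxonomic values to numbers. It assumes, purely for illustration, that a value's score is the sum of its tree-path distances (edge counts) to every value in the sample, and that the lowest-scoring value plays the role of a central value. The toy taxonomy and the names PARENT, tree_distance and marginality_scores are hypothetical and are not taken from the paper.

```python
from collections import Counter

# Hypothetical toy taxonomy (child -> parent); not taken from the paper.
PARENT = {
    "animal": None,
    "mammal": "animal",
    "bird": "animal",
    "dog": "mammal",
    "cat": "mammal",
    "eagle": "bird",
}


def path_to_root(node):
    """Return the list of nodes from `node` up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def tree_distance(a, b):
    """Number of edges on the path between a and b in the taxonomy."""
    steps_from_a = {n: i for i, n in enumerate(path_to_root(a))}
    for j, n in enumerate(path_to_root(b)):
        if n in steps_from_a:  # first common ancestor reached from b
            return steps_from_a[n] + j
    raise ValueError("nodes are not in the same tree")


def marginality_scores(sample):
    """Assumed score: total tree distance from each distinct value
    to every value in the sample (repeats counted)."""
    counts = Counter(sample)
    return {
        v: sum(c * tree_distance(v, w) for w, c in counts.items())
        for v in counts
    }


sample = ["dog", "dog", "cat", "eagle"]
scores = marginality_scores(sample)
# Under this assumption, the least marginal value acts as a central value.
central = min(scores, key=scores.get)
```

Under these assumptions, values deep in a sparsely populated branch of the hierarchy receive larger scores than values close to most of the sample, which is the kind of hierarchy-aware ordering that a flat nominal treatment cannot provide.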

Keywords

Hierarchical attributes, Classification, Taxonomic data, Ontologies, Descriptive statistics, Numerical mapping, Anonymization

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Josep Domingo-Ferrer (1)
  1. Dept. of Computer Engineering and Mathematics, UNESCO Chair in Data Privacy, Universitat Rovira i Virgili, Tarragona, Spain