Empirical Analysis of Database Privacy Using Twofold Integrals

  • Jordi Nin
  • Vicenç Torra
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3801)

Abstract

Record linkage is a technique for linking records from different files that correspond to the same individual. Traditional record linkage methods require the files to share some common variables to permit such links. In this paper we study the possibility of applying record linkage techniques when the files do not share variables. In this case we establish links based on structural information. To extract this structural information, we use twofold integrals.
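
The abstract names the twofold integral without defining it. As a rough illustration only, the sketch below follows the definition usually given for this operator in the fuzzy-integral literature: one fuzzy measure is applied in a Sugeno-like (max-min) inner step and another in a Choquet-like (weighted sum) outer step. The function name `twofold_integral`, the representation of measures as Python callables on frozensets, and the example values are all assumptions made here, not taken from the paper, and the sketch does not show how the paper builds links from the results.

```python
from typing import Callable, Dict, FrozenSet, Hashable

# A fuzzy measure is modelled (assumption) as a callable on frozensets
# with mu(empty set) = 0 and mu(whole set) = 1, monotone w.r.t. inclusion.
Measure = Callable[[FrozenSet[Hashable]], float]


def twofold_integral(f: Dict[Hashable, float], mu_c: Measure, mu_s: Measure) -> float:
    """Twofold integral of f (values in [0, 1]) w.r.t. mu_c and mu_s.

    Hypothetical helper: mu_s plays the Sugeno-like role, mu_c the
    Choquet-like role.
    """
    # Order elements so that f(x_s(1)) <= ... <= f(x_s(n)).
    order = sorted(f, key=f.get)
    total = 0.0
    running_max = 0.0  # max over j <= i of min(f(x_s(j)), mu_s(A_s(j)))
    for i, x in enumerate(order):
        upper = frozenset(order[i:])           # A_s(i) = {x_s(i), ..., x_s(n)}
        next_upper = frozenset(order[i + 1:])  # A_s(i+1); empty for the last i
        running_max = max(running_max, min(f[x], mu_s(upper)))
        total += running_max * (mu_c(upper) - mu_c(next_upper))
    return total


# Illustrative inputs (made up for this sketch).
values = {"a": 0.2, "b": 0.5, "c": 0.9}


def by_size(subset: FrozenSet[Hashable]) -> float:
    # Cardinality-based measure |A| / n.
    return len(subset) / len(values)


def all_or_nothing(subset: FrozenSet[Hashable]) -> float:
    # 1 for every non-empty set, 0 for the empty set.
    return 1.0 if subset else 0.0


# With mu_s = all_or_nothing the result is the Choquet integral w.r.t. mu_c.
choquet_like = twofold_integral(values, by_size, all_or_nothing)
# With mu_c = all_or_nothing the result is the Sugeno integral w.r.t. mu_s.
sugeno_like = twofold_integral(values, all_or_nothing, by_size)
```

The two calls at the end exercise the limiting cases in which the operator collapses to a Choquet integral and to a Sugeno integral, which is a convenient sanity check for any implementation of this kind.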

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Jordi Nin (1)
  • Vicenç Torra (1)

  1. Bellaterra, Spain
