On the Declassification of Confidential Documents

  • Daniel Abril
  • Guillermo Navarro-Arribas
  • Vicenç Torra
Conference paper

DOI: 10.1007/978-3-642-22589-5_22

Volume 6820 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Abril D., Navarro-Arribas G., Torra V. (2011) On the Declassification of Confidential Documents. In: Torra V., Narakawa Y., Yin J., Long J. (eds) Modeling Decision for Artificial Intelligence. MDAI 2011. Lecture Notes in Computer Science, vol 6820. Springer, Berlin, Heidelberg

Abstract

We introduce the anonymization of unstructured documents to settle the base of automatic declassification of confidential documents. Departing from known ideas and methods of data privacy, we introduce the main issues of unstructured document anonymization and propose the use of named entity recognition techniques from natural language processing and information extraction to identify the entities of the document that need to be protected.

Keywords

Privacy Declassification Anonymization Named Entity Recognition 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Daniel Abril
    • 1
  • Guillermo Navarro-Arribas
    • 2
  • Vicenç Torra
    • 1
  1. 1.Institut d’Investigació en Intel·ligència Artificial (IIIA)Consejo Superior de Investigaciones Científicas (CSIC)Spain
  2. 2.Dep. Enginyeria de la Informació i de les Comunicacions (DEIC)Universitat Autònoma de Barcelona (UAB)Spain