Semantic Document Model to Enhance Data and Knowledge Interoperability

  • Saša Nešić
Part of the Annals of Information Systems book series (AOIS, volume 6)


To enable document data and knowledge to be efficiently shared and reused across application, enterprise, and community boundaries, desktop documents should be completely open and queryable resources, whose data and knowledge are represented in a form understandable to both humans and machines. At the same time, these are the requirements that desktop documents need to satisfy in order to contribute to the visions of the Semantic Web. With the aim of achieving this goal, we have developed the Semantic Document Model (SDM), which turns desktop documents into Semantic Documents as uniquely identified and semantically annotated composite resources, that can be instantiated into human-readable (HR) and machine-processable (MP) forms. In this paper, we present the SDM along with an RDF and ontology-based solution for the MP document instance. Moreover, on top of the proposed model, we have built the Semantic Document Management System (SDMS), which provides a set of services that exploit the model. As an application example that takes advantage of SDMS services, we have extended MS Office with a set of tools that enables users to transform MS Office documents (e.g., MS Word and MS PowerPoint) into Semantic Documents, and to search local and distant semantic document repositories for document content units (CUs) over Semantic Web protocols.


Machine Processable Resource Description Framework Ontological Concept Digital Document Document Annotation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



Research reported in this paper has been partially financed by the European Commission in the NEPOMUK (IST-FP6-027705) project.


  1. 1.
    Berger, L.P., Luckmann, T.: The Social Construction of Reality: A Treatise its the Sociology of Knowledge. Anchor Books, pp. 51–55, 59–61 New York (1966)Google Scholar
  2. 2.
    Berners-Lee, T., Hendler, J. and Lassila, O.: The Semantic Web. Scientific Am., 2001, pp. 34–43.Google Scholar
  3. 3.
    Decker, S., Frank, M.: The social semantic desktop. WWW2004 Workshop Application Design, Development and Implementation Issues in the Semantic Web (2004)Google Scholar
  4. 4.
    Dill, S., Eiron, N., Gibson, D., Gruhl, D., Guha, R., Jhingran, A., Kanungo, T., McCurley, K.S., Rajagopalan, S., Tomkins, A., Tomlin, J.A., Zien, J.Y.: A case for automated large-scale semantic annotation. J. Web Semantics 1 (1), (2003)Google Scholar
  5. 5.
    Ecma International: Standard ECMA-376, Office Open XML File Formats, Dec. 2006.Google Scholar
  6. 6.
    Eriksson, H.: The semantic-document approach to combining documents and ontologies. Int’l Journal of Human-Computer Studies, 65(7), pp. 642–639. (2007)Google Scholar
  7. 7.
    Euzenat, J., Shvaiko, P.: Ontology Matching. Springer-Verlag, Berlin (2007)MATHGoogle Scholar
  8. 8.
    Groza, T., Handschuh, S., Moller, K. and Decker, S. SALT -Semantically Annotated LATEX for Scientific Publications. In: 4th European Semantic Web Conference, (2007)Google Scholar
  9. 9.
    Jovanović, J., Gašević, D., Devedžić, V.: Ontology-based Automatic Annotation of Learning Content. International Journal on Semantic Web and Information Systems. 2(2), pp. 91–119. (2006)Google Scholar
  10. 10.
    Handschuh, S., Groza, T., Moller, M., Grimnes, G., Sauermann L., Jazayeri, M., Mesnage, C., Reif, G., Gudjonsdottir, R.: The Nepomuk Project On the Way to the Social Semantic Desktop. In: I-Semantic 07’, pp. 201–211 (2007)Google Scholar
  11. 11.
    Harth, A., Decker, S.: Optimized Index Structures for Querying RDF from the Web. 3rd Latin American Web Congress (2005)Google Scholar
  12. 12.
    Lanfranchi, V., Ciravegna1, F., Petrelli, D.: Semantic Web-based document: editing and browsing in AktiveDoc. In: 2nd European Semantic Web Conference, Heraklion, Greece (2005)Google Scholar
  13. 13.
    Nešić, S., Gašević, D., Jazayeri, M.: An Ontology-Based Framework for Authoring Assisted by Recommendation. In: 7th ICALT Conference, pp. 227–231. (2007)Google Scholar
  14. 14.
    Nešić, S., Jovanović, J., Gašević, D., Jazayeri, M.: Ontology Based Content Model for Scalable Content Reuse. In: 4th ACM K-CAP Conf.2007, pp. 195–196. (2007)Google Scholar
  15. 15.
    Nešić, S., Gašević, D., Jazayeri, M.: Extending MS Office for Sharing Document Content Units Over the Semantic Web. In: 8th International Conference on Web Engineering (2008).Google Scholar
  16. 16.
    OASIS Consortium: Open Document Format for Office Applications, Version 1.1 (2007)Google Scholar
  17. 17.
    OpenDoc Programmers’ Guide, Addison Wesley Publishing Company, 1995. ISBN 0-202-47954-0.Google Scholar
  18. 18.
    Priestley, M. DITA XML: a reuse by reference architecturefor technical documentation. In: 19th International Conference on Computer Documentation, pp. 152–156. (2001)Google Scholar
  19. 19.
    Prudhommeaux, E., Seaborne, A.: SPARQL Query Language for RDF. (2007)
  20. 20.
    Stabb, S., Studer, R.: Handbook on Ontologies, Springer, Berlin, (2004)Google Scholar
  21. 21.
    Sintek, M., Elst, L., Scerri, S., Handschuh, S.: Distributed Knowledge Representation on the Social Semantic Desktop: Named Graphs, Views and Roles in NRL. In: 4th European Semantic Web Conference, pp. 594–608. Innsbruck (2007)Google Scholar
  22. 22.
    Tallis, M. SemanticWord processing for content authors. In: Knowledge Markup and Semantic Annotation Workshop at 2nd K-CAP conf., Sanibel, Florida USA, (2003)Google Scholar
  23. 23.
    Uren, V., Cimiano, P., Iria, J., Handschuh, S., Vargas-Vera, M., Motta, E., Ciravegna, F.: Semantic annotation for knowledge management: Requirements and a survey of the state of the art. J. Web Semantics: Science, Services and Agents on the World Wide Web, 4(1), pp. 14–28 (2006)Google Scholar
  24. 24.
    Verbert, K., Gašević, D., Jovanović, J., Duval, E.: Ontology-based Learning Content Repurposing: The ALOCoM Framework. Intl Journal on E-Learning, 5(1), pp. 67–74. (2006)Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  1. 1.Faculty of InformaticsUniversity of LuganoLuganoSwitzerland

Personalised recommendations