International Conference on Knowledge Engineering and the Semantic Web

Knowledge Engineering and Semantic Web pp 16-31 | Cite as

UIMA2LOD: Integrating UIMA Text Annotations into the Linked Open Data Cloud

  • Claudia Bretschneider
  • Heiner Oberkampf
  • Sonja Zillner
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 518)

Abstract

The LOD cloud is becoming the de-facto standard for sharing and connecting pieces of data, information and knowledge on the Web. As of today, means for the seamless integration of structured data into the LOD cloud are available. However, algorithms for integrating information enclosed in unstructured text sources are missing. In order to foster the (re)use of the high percentage of unstructured text, automatic means for the integration of their content are needed. We address this issue by proposing an approach for conceptual representation of textual annotations which distinguishes linguistic from semantic annotations and their integration. Additionally, we implement a generic UIMA pipeline that automatically creates a LOD graph from texts that (1) implements the proposed conceptual representation, (2) extracts semantically classified entities, (3) links to existing LOD datasets and (4) generates RDF graphs from the extracted information. We show the application and benefits of the approach in a case study on a medical corpus.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Augenstein, I., Padó, S., Rudolph, S.: LODifier: generating linked data from unstructured text. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 210–224. Springer, Heidelberg (2012). http://dblp.uni-trier.de/db/conf/esws/eswc2012.html#AugensteinPR12 CrossRefGoogle Scholar
  2. 2.
    Berners-Lee, T.: Linked Data - Design Issues, July 2006. http://www.w3.org/DesignIssues/LinkedData.html
  3. 3.
    Ciccarese, P., Ocana, M., Garcia-Castro, L.J., Das, S., Clark, T.: An open annotation ontology for science on web 3.0. J. Biomedical Semantics 2(S–2), S4 (2011). http://dblp.uni-trier.de/db/journals/biomedsem/biomedsem2S.html#CiccareseOGDC11 CrossRefGoogle Scholar
  4. 4.
    Hellmann, S., Lehmann, J., Auer, S., Brümmer, M.: Integrating NLP using linked data. In: Alani, H., et al. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 98–113. Springer, Heidelberg (2013). http://svn.aksw.org/papers/2013/ISWC_NIF/public.pdf CrossRefGoogle Scholar
  5. 5.
    Kawamura, T., Ohsuga, A.: Toward an ecosystem of LOD in the field: LOD content generation and its consuming service. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part II. LNCS, vol. 7650, pp. 98–113. Springer, Heidelberg (2012). http://dblp.uni-trier.de/db/conf/semweb/iswc2012-2.html#KawamuraO12 CrossRefGoogle Scholar
  6. 6.
    Liu, H., Wu, S.T.I., Tao, C., Chute, C.G.: Modeling UIMA type system using web ontology language - towards interoperability among UIMA-based NLP tools. In: Proceedings of Workshop on Managing Interoperability and compleXity in Health Systems (MIX-HS), pp. 31–36 (2012). http://dblp.uni-trier.de/db/conf/cikm/mixhs2012.html#LiuWTC12
  7. 7.
    Oberkampf, H., Bretschneider, C., Zillner, S., Bauer, B., Hammon, M.: Knowledge-based extraction of measurement-entity relations from german radiology reports. In: IEEE International Conference on Healthcare Informatics (ICHI) (2013)Google Scholar
  8. 8.
    Oberkampf, H., Zillner, S., Bauer, B., Hammon, M.: An OGMS-based model for clinical information (MCI). In: Proceedings of International Conference on Biomedical Ontology, pp. 97–100 (2013). http://www2.unb.ca/csas/data/ws/icbo2013/papers/ec/icbo2013_submission_56.pdf
  9. 9.
    Radiological Society of North America: Radlex (2012). http://rsna.org/RadLex.aspx
  10. 10.
    Ramakrishnan, C., Kochut, K.J., Sheth, A.P.: A framework for schema-driven relationship discovery from unstructured text. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 583–596. Springer, Heidelberg (2006). http://dx.doi.org/10.1007/11926078_42 CrossRefGoogle Scholar
  11. 11.
    Rizzo, G., Troncy, R., Hellmann, S., Brümmer, M.: In: Workshop on Linked Data on the Web (LDOW), Lyon, FranceGoogle Scholar
  12. 12.
    Verspoor, K., Baumgartner Jr., W., Roeder, C., Hunter, L.: Abstracting the types away from a UIMA type system. From Form to Meaning: Processing Texts Automatically, 249–256 (2009)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Claudia Bretschneider
    • 1
    • 2
  • Heiner Oberkampf
    • 2
  • Sonja Zillner
    • 2
    • 3
  1. 1.Center for Information and Language ProcessingUniversity MunichMunichGermany
  2. 2.Siemens AG, Corporate TechnologyMunichGermany
  3. 3.School of International Business and EntrepreneurshipSteinbeis UniversityBerlinGermany

Personalised recommendations