Combining a REST Lexical Analysis Web Service with SPARQL for Mashup Semantic Annotation from Text

  • Eetu Mäkelä
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8798)

Abstract

Current automatic annotation systems are often monolithic, holding internal copies of both the machine-learned annotation models and the reference vocabularies they use. This is particularly problematic for frequently changing references such as person and place registries, because the information in the copy quickly grows stale. This paper presents arguments and experiments supporting the notion that sufficient accuracy and recall can both be obtained simply by combining a sufficiently capable lexical analysis web service with queries against a primary SPARQL store, even for highly inflected languages, which are often problematic.
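As a rough illustration of the combination the abstract describes, the following minimal Python sketch reduces inflected word forms to base forms and then builds a SPARQL query that looks those base forms up as labels in a vocabulary store. It is not the paper's implementation: the toy lemma table stands in for a real lexical analysis web service, and the function names, the use of `skos:prefLabel`, and the case-insensitive exact match are all assumptions made for the example. No network calls are made; the sketch only constructs the query text.

```python
from typing import Callable

# Hypothetical stand-in for a lexical analysis web service: maps a few
# inflected Finnish word forms to their base (nominative) forms.
TOY_LEMMAS = {
    "helsingissä": "helsinki",
    "helsingin": "helsinki",
    "turusta": "turku",
}


def toy_lemmatize(token: str) -> str:
    """Return the base form of a token, or the lowercased token if unknown."""
    return TOY_LEMMAS.get(token.lower(), token.lower())


def escape_literal(value: str) -> str:
    """Escape backslashes and double quotes for a SPARQL string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def build_label_query(
    text: str,
    lang: str,
    lemmatize: Callable[[str], str] = toy_lemmatize,
) -> str:
    """Build a SPARQL query matching the base forms of the words in `text`
    against concept labels (assumed here to be skos:prefLabel values)."""
    # Naive whitespace tokenisation with trailing punctuation stripped.
    tokens = (t.strip(".,;:!?") for t in text.split())
    lemmas = sorted({lemmatize(t) for t in tokens} - {""})
    values = " ".join(f'"{escape_literal(lemma)}"' for lemma in lemmas)
    return (
        "PREFIX skos: <http://www.w3.org/2004/02/skos/core#>\n"
        "SELECT ?concept ?label WHERE {\n"
        f"  VALUES ?lemma {{ {values} }}\n"
        "  ?concept skos:prefLabel ?label .\n"
        f'  FILTER(LANG(?label) = "{lang}" && LCASE(STR(?label)) = ?lemma)\n'
        "}"
    )
```

Calling `build_label_query("Asun Helsingissä, mutta olen Turusta.", "fi")` yields a query whose `VALUES` clause contains the base forms `"helsinki"` and `"turku"` rather than the inflected surface forms. Because the lookup runs against the live store at query time, the annotator does not need its own copy of the vocabulary.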

Keywords

Semantic Annotation, Instance Registry, Nominative Form, Lexical Analysis, SPARQL Endpoint
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. Semantic Computing Research Group (SeCo), Aalto University, Espoo, Finland
