Honey Bee Versus Apis Mellifera: A Semantic Search for Biological Data

  • Felicitas Löffler
  • Kobkaew Opasjumruskit
  • Naouel Karam
  • David Fichtmüller
  • Uwe Schindler
  • Friederike Klan
  • Claudia Müller-Birn
  • Michael Diepenbroek
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10577)

Abstract

While literature portals in the biomedical domain already enhance their search applications with ontological concepts, data portals offering biological primary data still rely on classical keyword search. Like publications, biological primary data are described by metadata such as author, title, location and time, which are stored in a separate file in XML format. Here, we introduce a semantic search for biological data based on these metadata files. The search runs over 4.6 million datasets from GFBio, the German Federation for Biological Data (https://www.gfbio.org), a national infrastructure for long-term preservation of biological data. The semantic search method used is query expansion: instead of looking only for the originally entered keywords, the search terms are expanded with related concepts from different biological vocabularies. Hosting our own Terminology Service with vocabularies tailored to the datasets, we demonstrate how ontological concepts are integrated into the search and how they improve the search results.
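The idea of query expansion described in the abstract can be illustrated with a minimal sketch. This is not the GFBio implementation: the in-memory vocabulary, the toy metadata records, and the function names `expand_query` and `search` are all hypothetical and chosen only for illustration. A real system would look up related concepts in a terminology service and query a search index rather than scan a list.

```python
from typing import Dict, List, Set

# Hypothetical toy vocabulary (assumption): each label maps to related labels.
# A production system would obtain these from a terminology service instead.
VOCABULARY: Dict[str, Set[str]] = {
    "honey bee": {"apis mellifera", "honeybee"},
    "apis mellifera": {"honey bee", "honeybee"},
}

# Hypothetical metadata records (assumption), reduced to an id and a title.
RECORDS: List[Dict[str, str]] = [
    {"id": "ds1", "title": "Foraging behaviour of Apis mellifera in orchards"},
    {"id": "ds2", "title": "Honey bee colony counts"},
    {"id": "ds3", "title": "Soil moisture time series"},
]


def expand_query(query: str, vocabulary: Dict[str, Set[str]]) -> List[str]:
    """Return the normalised query term followed by its related terms (sorted)."""
    term = query.strip().lower()
    related = sorted(vocabulary.get(term, set()) - {term})
    return [term] + related


def search(query: str, records: List[Dict[str, str]],
           vocabulary: Dict[str, Set[str]]) -> List[str]:
    """Return ids of records whose title contains any of the expanded terms."""
    terms = expand_query(query, vocabulary)
    return [
        record["id"]
        for record in records
        if any(term in record["title"].lower() for term in terms)
    ]


if __name__ == "__main__":
    # Keyword-only search: an empty vocabulary means no expansion happens.
    print(search("honey bee", RECORDS, {}))
    # Expanded search: the scientific name is matched as well.
    print(search("honey bee", RECORDS, VOCABULARY))
```

In this toy setting, the keyword-only search finds just the record that literally contains "honey bee", while the expanded search also retrieves the record that uses the scientific name.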

Keywords

Semantic search, Query expansion, Biological data, Life sciences, Biodiversity

Notes

Acknowledgements

This work was funded by the Deutsche Forschungsgemeinschaft (DFG) within the scope of the GFBio project.

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Felicitas Löffler (1)
  • Kobkaew Opasjumruskit (1)
  • Naouel Karam (2)
  • David Fichtmüller (3)
  • Uwe Schindler (4)
  • Friederike Klan (1)
  • Claudia Müller-Birn (2)
  • Michael Diepenbroek (4)
  1. Heinz-Nixdorf Endowed Chair for Distributed Information Systems, Friedrich Schiller University Jena, Jena, Germany
  2. Institute of Computer Science, Freie Universität Berlin, Berlin, Germany
  3. Botanic Garden and Botanical Museum (BGBM), Freie Universität Berlin, Berlin, Germany
  4. MARUM, University of Bremen, Bremen, Germany
