Supporting Description of Research Data: Evaluation and Comparison of Term and Concept Extraction Approaches

  • Conference paper
Digital Libraries for Open Knowledge (TPDL 2018)

Abstract

The importance of research data management is widely recognized. Dendro is an ontology-based platform that allows researchers to describe datasets using generic and domain-specific descriptors from ontologies. Selecting or building the right ontologies for each research domain or group requires meetings between curators and researchers to capture the main concepts of their research. Envisioning a tool that assists curators by automatically extracting key concepts from research documents, we propose two concept extraction methods and compare them with a term extraction method. To compare the three approaches, we use an ontology previously created by human curators as ground truth.
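As an illustration of comparing extracted terms against a curator-built ontology, the sketch below scores a list of candidate terms against a set of ground-truth labels. The abstract does not state which metrics or matching rules were used, so precision, recall, and F1 with exact matching after case and whitespace normalization are assumptions made here, and the function names and example labels are hypothetical.

```python
def normalize(term: str) -> str:
    """Lowercase a term and collapse runs of whitespace (assumed matching rule)."""
    return " ".join(term.lower().split())


def evaluate(extracted: list[str], ground_truth: list[str]) -> dict[str, float]:
    """Score extracted terms against ground-truth ontology labels.

    Assumption: a term counts as correct only if it exactly matches a
    ground-truth label after normalization.
    """
    extracted_set = {normalize(t) for t in extracted}
    truth_set = {normalize(t) for t in ground_truth}
    matched = extracted_set & truth_set

    precision = len(matched) / len(extracted_set) if extracted_set else 0.0
    recall = len(matched) / len(truth_set) if truth_set else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical labels, not taken from the paper's ontology.
ground_truth = ["Vehicle Model", "Simulation Run", "Driving Cycle", "Road Gradient"]
extracted = ["vehicle model", "driving  cycle", "fuel", "experiment"]

scores = evaluate(extracted, ground_truth)
print(scores)
```

Running the same function once per extraction approach, against the same ground-truth labels, gives scores that can be compared side by side.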

Corresponding author

Correspondence to Cláudio Monteiro.

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Cite this paper

Monteiro, C., Lopes, C.T., Silva, J.R. (2018). Supporting Description of Research Data: Evaluation and Comparison of Term and Concept Extraction Approaches. In: Méndez, E., Crestani, F., Ribeiro, C., David, G., Lopes, J. (eds) Digital Libraries for Open Knowledge. TPDL 2018. Lecture Notes in Computer Science, vol 11057. Springer, Cham. https://doi.org/10.1007/978-3-030-00066-0_44

  • DOI: https://doi.org/10.1007/978-3-030-00066-0_44

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00065-3

  • Online ISBN: 978-3-030-00066-0

  • eBook Packages: Computer Science, Computer Science (R0)
