Metadata Extraction from Conference Proceedings Using Template-Based Approach

  • Conference paper
Semantic Web Evaluation Challenges (SemWebEval 2015)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 548)

Abstract

The paper describes a number of procedures for extracting metadata from the CEUR Workshop Proceedings (cf. http://ceur-ws.org), built on a rule-based approach and pattern matching, and the conversion of the extracted metadata to a Linked Open Data (LOD) dataset in the framework of the ESWC 2015 Semantic Publishing Challenge (cf. http://github.com/ceurws/lod/wiki/SemPub2015).
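
As a rough illustration of what rule-based extraction followed by conversion to LOD can look like, the sketch below matches one pattern against a table-of-contents fragment and emits N-Triples lines. It is not the authors' implementation: the HTML fragment, the regular expression, the function names, and the `http://example.org/workshop/` base URI are all hypothetical, and only the Python standard library is used. The Dublin Core namespace is the one listed in the notes.

```python
import re

# Hypothetical markup, loosely modelled on a proceedings table of contents.
SAMPLE_HTML = """
<li><a href="paper-01.pdf"><span class="title">An Example Paper</span></a>
<span class="authors">A. Author, B. Author</span></li>
"""

# One extraction rule (assumed template): a PDF link wrapping a title span,
# followed by a span holding a comma-separated author list.
PAPER_RE = re.compile(
    r'<a href="(?P<file>[^"]+\.pdf)">\s*'
    r'<span class="title">(?P<title>[^<]+)</span>\s*</a>\s*'
    r'<span class="authors">(?P<authors>[^<]+)</span>'
)

DC = "http://purl.org/dc/elements/1.1/"
BASE = "http://example.org/workshop/"  # placeholder base URI, not a real dataset


def extract_papers(html):
    """Apply the rule to the page and return one dict per matched paper."""
    papers = []
    for match in PAPER_RE.finditer(html):
        papers.append({
            "file": match.group("file"),
            "title": match.group("title").strip(),
            "authors": [a.strip() for a in match.group("authors").split(",")],
        })
    return papers


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_ntriples(papers, base=BASE):
    """Serialise the extracted records as N-Triples lines."""
    lines = []
    for paper in papers:
        subject = f"<{base}{paper['file']}>"
        lines.append(f'{subject} <{DC}title> "{escape_literal(paper["title"])}" .')
        for author in paper["authors"]:
            lines.append(f'{subject} <{DC}creator> "{escape_literal(author)}" .')
    return lines


if __name__ == "__main__":
    for line in to_ntriples(extract_papers(SAMPLE_HTML)):
        print(line)
```

A real pipeline would need several such rules to cope with markup that varies between volumes, and would more plausibly build the graph with an RDF library than with string formatting.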

Notes

  1. Cf. http://github.com/ailabitmo/ceur-ws-lod.

  2. Cf. http://purl.org/ontology/bibo/.

  3. Cf. http://xmlns.com/foaf/0.1/.

  4. Cf. http://purl.org/dc/elements/1.1/.

  5. Cf. http://swrc.ontoware.org/ontology.

  6. Cf. http://dbpedia.org/resource/.

  7. Cf. http://vocab.ox.ac.uk/projectfunding.

  8. Cf. http://nlp.stanford.edu/software/CRF-NER.shtml.

  9. Cf. http://ceur-ws.org/Vol-1155/paper-06.pdf.

  10. Cf. http://nlp.stanford.edu/software/lex-parser.shtml.

  11. Cf. http://grablib.org/.

  12. Cf. https://github.com/RDFLib.

  13. Cf. https://pypi.python.org/pypi/pdfminer/.

  14. Cf. https://pypi.python.org/pypi/BeautifulSoup/3.2.1.

  15. Cf. http://dblp.uni-trier.de.

  16. Cf. http://prefix.cc.

Acknowledgments

This work has been partially financially supported by the Government of the Russian Federation, Grant #074-U01.

Author information

Corresponding author

Correspondence to Liubov Kovriguina.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Kovriguina, L., Shipilo, A., Kozlov, F., Kolchin, M., Cherny, E. (2015). Metadata Extraction from Conference Proceedings Using Template-Based Approach. In: Gandon, F., Cabrio, E., Stankovic, M., Zimmermann, A. (eds) Semantic Web Evaluation Challenges. SemWebEval 2015. Communications in Computer and Information Science, vol 548. Springer, Cham. https://doi.org/10.1007/978-3-319-25518-7_13

  • DOI: https://doi.org/10.1007/978-3-319-25518-7_13

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25517-0

  • Online ISBN: 978-3-319-25518-7

  • eBook Packages: Computer Science, Computer Science (R0)
