Extraction and Characterization of Citations in Scientific Papers
- Cite this paper as:
- Bertin M., Atanassova I. (2014) Extraction and Characterization of Citations in Scientific Papers. In: Presutti V. et al. (eds) Semantic Web Evaluation Challenge. SemWebEval 2014. Communications in Computer and Information Science, vol 475. Springer, Cham
We propose a hybrid method for the extraction and characterization of citations in scientific papers that combines machine learning with rule-based approaches. Our protocol consists of metadata extraction, bibliography parsing, processing of section titles, and fine-grained semantic annotation of the texts at the sentence level. This allows us to generate Linked Open Data from a set of research papers in XML.
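To make the rule-based part of such a pipeline concrete, the following is a minimal sketch, not the authors' implementation. It assumes a simplified XML layout (`article`, `sec`, `p` elements), numeric citation markers such as `[1]` or `[2, 3]`, a hypothetical base URI under `example.org`, and the CiTO `cites` property as the output vocabulary; none of these details come from the abstract. The machine learning component is omitted entirely.

```python
import re
import xml.etree.ElementTree as ET

# Assumptions for illustration only: the XML schema, the base URI, and the
# choice of the CiTO "cites" property are not taken from the paper.
CITES = "http://purl.org/spar/cito/cites"
BASE = "http://example.org/paper/"  # hypothetical base URI

SAMPLE_XML = """<article id="p1">
  <sec><title>Introduction</title>
    <p>Citation analysis is well studied [1]. We extend earlier work [2, 3]. This sentence has no citation.</p>
  </sec>
</article>"""

# Matches numeric markers such as [1] or [2, 3].
MARKER = re.compile(r"\[(\d+(?:\s*,\s*\d+)*)\]")


def citation_triples(xml_text):
    """Return N-Triples lines linking a paper to each reference it cites."""
    root = ET.fromstring(xml_text)
    paper = BASE + root.get("id")
    triples = []
    for paragraph in root.iter("p"):
        text = (paragraph.text or "").strip()
        # Naive sentence split on terminal punctuation followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", text):
            for match in MARKER.finditer(sentence):
                for number in re.split(r"\s*,\s*", match.group(1)):
                    triples.append(
                        f"<{paper}> <{CITES}> <{paper}/ref/{number}> ."
                    )
    return triples


if __name__ == "__main__":
    for line in citation_triples(SAMPLE_XML):
        print(line)
```

A rule of this kind only locates citation markers inside sentences. Characterizing each citation, for example by its rhetorical function, would require the finer-grained sentence-level annotation that the abstract describes.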