Nine million book items and eleven million citations: a study of book-based scholarly communication using OpenCitations
Books have been widely used to share information and contribute to human knowledge. However, the quantitative use of books as a method of scholarly communication is relatively unexamined compared to journal articles and conference papers. This study uses the COCI dataset (a comprehensive open citation dataset provided by OpenCitations) to explore books’ roles in scholarly communication. The COCI data we analyzed includes 445,826,118 citations from 46,534,705 bibliographic entities. By analyzing such a large amount of data, we provide a thorough, multifaceted understanding of books. Among the investigated factors are (1) temporal changes to book citations; (2) book citation distributions; (3) years to citation peak; (4) citation half-life; and (5) characteristics of the most-cited books. Results show that books have received less than 4% of total citations, and have been cited mainly by journal articles. Moreover, 97.96% of books have been cited fewer than ten times. Books take longer than other bibliographic materials to reach peak citation levels, yet are cited for the same duration as journal articles. Most-cited books tend to cover general (yet essential) topics, theories, and technological concepts in mathematics and statistics.
KeywordsBook citation Scholarly communication Citation analysis OpenCitations COCI Open citation data
This paper was supported by Sungkyun Research Fund (S-2018-2538-000), Sungkyunkwan University, 2018.
- Leydesdorff, L., & Felt, U. (2012a). “Books” and “book chapters” in the book citation index (BKCI) and science citation index (SCI, SoSCI, A&HCI). Proceedings of the American Society for Information Science and Technology banner,49(1), 1–7. https://doi.org/10.1002/meet.14504901027.CrossRefGoogle Scholar
- Moed, H. F. (2005). Citation analysis of scientific journals and journal impact measures. Current Science,89(12), 1990–1996.Google Scholar
- OpenCitations (2018). COCI CSV dataset of all the citation data. Figshare. https://doi.org/10.6084/m9.figshare.6741422.v3.
- Peroni, S., & Shotton, D. (2018a). The OpenCitations Data Model. Figshare. https://doi.org/10.6084/m9.figshare.3443876.
- Peroni, S., & Shotton, D. (2019). Opencitations, a scholarly infrastructure organisation dedicated to open scholarship. arXiv. https://arxiv.org/abs/1906.11964.
- Torres-Salinas, D., Robinson-García, N., Cabezas-Clavijo, Á., & Jiménez-Contreras, E. (2014). Analyzing the citation characteristics of books: Edited books, book series and publisher types in the book citation index. Scientometrics,98(3), 2113–2127. https://doi.org/10.1007/s11192-013-1168-4.CrossRefGoogle Scholar
- Zhu, Y., Yan, E., & Song, I.-Y. (2017). The use of a graph-based system to improve bibliographic information retrieval: System design, implementation, and evaluation. Journal of the Association for Information Science and Technology,68(2), 480–490. https://doi.org/10.1002/asi.23677.CrossRefGoogle Scholar
- Zhu, Y., Yan, E., Peroni, S., & Che, C. (2019). Crossref metadata of COCI bibliographic resources as of November 2018 and LCC categories of the ISBN entities in the dataset. Zenodo. https://doi.org/10.5281/zenodo.3241744.