Abstract
Every researcher must conduct a literature review, and the document management needs of researchers working on various research topics vary. However, there are two significant challenges today. First, traditional methods like the tree hierarchy of document folders and tag-based management are no longer effective with the enormous volume of publications. Second, although their bib information is available to everyone, many papers can be accessed only through paid services. This study attempts to develop an interactive tool for personal literature management solely based on their bibliographic records. To make such a tool possible, we developed a principled “human-in-the-loop latent space learning” method that estimates the management criteria of each researcher based on his or her feedback to calculate the positions of documents in a two-dimensional space on the screen. Since a set of bibliographic records forms a graph, our model is naturally designed as a graph-based encoder-decoder model that connects the graph and the space. The experiments with ten researchers from humanities, science, and engineering domains show that the proposed framework gives much superior results to a typical graph convolutional encoder-decoder model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bates, M.J.: The design of browsing and berrypicking techniques for the online search interface. Online Rev. 13(5), 407–424 (1989)
Deng, Y., Yuan, Y., Fu, H., Qu, A.: Query-augmented active metric learning. J. Am. Stat. Assoc. 1–14 (2022)
Dong, Y., Chawla, N.V., Swami, A.: metapath2vec: scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 135–144 (2017)
Fire, M., Guestrin, C.: Over-optimization of academic publishing metrics: observing Goodhart’s law in action. GigaScience 8(6), giz053 (2019)
Francese, E.: Usage of reference management software at the university of Torino. Usage Ref. Manage. Softw. Univ. Torino 4, 145–174 (2013)
Fu, X., Zhang, J., Meng, Z., King, I.: MAGNN: metapath aggregated graph neural network for heterogeneous graph embedding. In: Proceedings of The Web Conference 2020, pp. 2331–2341 (2020)
Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855–864 (2016)
Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc. Ser. C (Applied Statistics) 28(1), 100–108 (1979)
Hsu, Y.L., Tsai, Y.C., Li, C.T.: FinGAT: financial graph attention networks for recommending top-k profitable stocks. IEEE Transactions on Knowledge and Data Engineering (2021)
Hu, X., Yoo, I.: A comprehensive comparison study of document clustering for a biomedical digital library medline. In: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL2006), pp. 220–229. IEEE (2006)
Huang, X., Qian, S., Fang, Q., Sang, J., Xu, C.: Meta-path augmented sequential recommendation with contextual co-attention network. ACM Trans. Multimedia Comput. Commun. Appl. (TOMM) 16(2), 1–24 (2020)
Iwayama, M., Tokunaga, T.: Hierarchical Bayesian clustering for automatic text classification. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, vol. 2, pp. 1322–1327 (1995)
Josh, N., Pepe, A.: 65 out of the 100 most cited papers are paywalled. https://www.authorea.com/users/8850/articles/125400-65-out-of-the-100-most-cited-papers-arepaywalled (2019). Accessed 30 June 2022
Jugovac, M., Jannach, D.: Interacting with recommenders-overview and research directions. ACM Trans. Interact. Intell. Syst. (TiiS) 7(3), 1–46 (2017)
Kang, Y., Hou, A., Zhao, Z., Gan, D.: A hybrid approach for paper recommendation. IEICE Trans. Inf. Syst. 104(8), 1222–1231 (2021)
Kingma, D.P., Welling, M.: Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013)
Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)
Kipf, T.N., Welling, M.: Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: International Conference on Machine Learning, pp. 1188–1196. PMLR (2014)
Lee, J., Lee, K., Kim, J.G.: Personalized academic research paper recommendation system. arXiv preprint arXiv:1304.5457 (2013)
Liang, D., Krishnan, R.G., Hoffman, M.D., Jebara, T.: Variational autoencoders for collaborative filtering. In: Proceedings of the 2018 World Wide Web Conference, pp. 689–698 (2018)
Ma, X., Wang, R.: Personalized scientific paper recommendation based on heterogeneous graph representation. IEEE Access 7, 79887–79894 (2019)
Mikawa, K., Goto, M.: Regularized distance metric learning for document classification and its application. J. Jpn. Ind. Manage. Assoc. 66(2E), 190–203 (2015)
Nadagouda, N., Xu, A., Davenport, M.A.: Active metric learning and classification using similarity queries. arXiv preprint arXiv:2202.01953 (2022)
Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710 (2014)
Elahi, M.: Adaptive active learning in recommender systems. In: Konstan, J.A., Conejo, R., Marzo, J.L., Oliver, N. (eds.) UMAP 2011. LNCS, vol. 6787, pp. 414–417. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22362-4_40
Salehi, A., Davulcu, H.: Graph attention auto-encoders. arXiv preprint arXiv:1905.10715 (2019)
Salha-Galvan, G., Hennequin, R., Chapus, B., Tran, V.A., Vazirgiannis, M.: Cold start similar artists ranking with gravity-inspired graph autoencoders. In: Fifteenth ACM Conference on Recommender Systems, pp. 443–452 (2021)
Scharpf, P., Schubotz, M., Youssef, A., Hamborg, F., Meuschke, N., Gipp, B.: Classification and clustering of arxiv documents, sections, and abstracts, comparing encodings of natural and mathematical language. In: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, pp. 137–146 (2020)
Sherkat, E., Nourashrafeddin, S., Milios, E.E., Minghim, R.: Interactive document clustering revisited: a visual analytics approach. In: 23rd International Conference on Intelligent User Interfaces, pp. 281–292 (2018)
Sun, Y., Han, J.: Meta-path-based search and mining in heterogeneous information networks. Tsinghua Sci. Technol. 18(4), 329–338 (2013)
Tang, J., Qu, M., Mei, Q.: PTE: predictive text embedding through large-scale heterogeneous text networks. In: Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1165–1174 (2015)
Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077 (2015)
Vlaev, I.: Local choices: rationality and the contextuality of decision-making. Brain Sci. 8(1), 8 (2018)
Waheed, W., Imran, M., Raza, B., Malik, A.K., Khattak, H.A.: A hybrid approach toward research paper recommendation using centrality measures and author ranking. IEEE Access 7, 33145–33158 (2019)
Wang, J., Wu, S., Vu, H.Q., Li, G.: Text document clustering with metric learning. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 783–784 (2010)
Wang, W., Tang, T., Xia, F., Gong, Z., Chen, Z., Liu, H.: Collaborative filtering with network representation learning for citation recommendation. IEEE Trans. Big Data 8(5), 1233–1246 (2020)
Wei, C.P., Chiang, R.H., Wu, C.C.: Accommodating individual preferences in the categorization of documents: a personalized clustering approach. J. Manag. Inf. Syst. 23(2), 173–201 (2006)
Yang, L., Jin, R., Sukthankar, R.: Bayesian active distance metric learning. arXiv preprint arXiv:1206.5283 (2012)
Acknowledgement
This work was partially supported by Kumagai Gumi Co., Ltd., JSPS KAKENHI Grant Number 22H00508, 22K17944 and 21H03552. This work was approved by the IRB of University of Tsukuba. We are grateful to Masao Takaku for his valuable comments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Watanabe, S., Ito, H., Matsubara, M., Morishima, A. (2022). Bibrecord-Based Literature Management with Interactive Latent Space Learning. In: Tseng, YH., Katsurai, M., Nguyen, H.N. (eds) From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries. ICADL 2022. Lecture Notes in Computer Science, vol 13636. Springer, Cham. https://doi.org/10.1007/978-3-031-21756-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-21756-2_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21755-5
Online ISBN: 978-3-031-21756-2
eBook Packages: Computer ScienceComputer Science (R0)