Content-Based Hybrid Deep Neural Network Citation Recommendation Method

Wang, Leipeng; Rao, Yuan; Bian, Qinyu; Wang, Shuo

doi:10.1007/978-981-15-7984-4_1

Leipeng Wang^10,11,12,
Yuan Rao^10,11,12,
Qinyu Bian^10,11,12 &
…
Shuo Wang^10,11,12

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1258))

Included in the following conference series:

International Conference of Pioneering Computer Scientists, Engineers and Educators

1445 Accesses
6 Citations

Abstract

The rapid growth of scientific papers makes it difficult to query related papers efficiently, accurately and with high coverage. Traditional citation recommendation algorithms rely heavily on the metadata of query documents, which leads to the low quality of recommendation results. In this paper, DeepCite, a content-based hybrid neural network citation recommendation method is proposed. First, the BERT model was used to extract the high-level semantic representation vectors in the text, then the multi-scale CNN model and BiLSTM model were used to obtain the local information and the sequence information of the context in the sentence, and the text vectors were matched in depth to generate candidate sets. Further, the depth neural network was used to rerank the candidate sets by combining the score of candidate sets and multi-source features. In the reranking stage, a variety of Metapath features were extracted from the citation network, and added to the deep neural network to learn, and the ranking of recommendation results were optimized. Compared with PWFC, ClusCite, BM25, RW, NNRank models, the results of the Deepcite algorithm presented in the ANN datasets show that the precision (P@20), recall rate (R@20), MRR and MAP indexesrise by 2.3%, 3.9%, 2.4% and 2.1% respectively. Experimental results on DBLP datasets show that the improvement is 2.4%, 4.3%, 1.8% and 1.2% respectively. Therefore, the algorithm proposed in this paper effectively improves the quality of citation recommendation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Livne, A., Gokuladas, V., Teevan, J., et al.: CiteSight: supporting contextual citation recommendation using differential search. In: Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 807–816 (2014)
Google Scholar
Yang, C., et al.: CARES: a ranking-oriented CADAL recommender system. In: Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 203–212. ACM (2009)
Google Scholar
Sugiyama, K., Kan, M.: Scholarly paper recommendation via user’s recent research interests. In: Proceedings of the 10th Annual Joint Conference on Digital Libraries, pp. 29–38. ACM (2010)
Google Scholar
Li, S., et al.: Conference paper recommendation for academic conferences. IEEE Access 6, 17153–17164 (2018)
Article Google Scholar
Torres, R., McNee, S.M., Abel, M., et al.: Enhancing digital libraries with TechLens. In: Proceedings of the 4th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 228–236. ACM (2004)
Google Scholar
Gori, M., Pucci, A.: Research paper recommender systems: a random-walk based approach, In: IEEE/WIC/ACM International Conference on Web Intelligence, pp. 778–781. IEEE (2006)
Google Scholar
Meng, F., et al.: A unified graph model for personalized query oriented reference paper recommendation. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 1509–1512. ACM (2013)
Google Scholar
Jardine, J., Teufel, S.: Topical PageRank: a model of scientific expertise for bibliographic search. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 501–510 (2014)
Google Scholar
Sun, Y., et al.: Pathsim: meta path-based top-k similarity search in heterogeneous information networks. Proc. VLDB Endow. 4(11), 992–1003 (2011)
Article Google Scholar
Ren, X., et al.: ClusCite: effective citation recommendation by information network-based clustering, In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 821–830 (2014)
Google Scholar
Guo, L., et al.: Exploiting fine-grained co-authorship for personalized citation recommendation. IEEE Access 5, 12714–12725 (2017)
Article Google Scholar
Mu, D., et al.: Query-focused personalized citation recommendation with mutually reinforced ranking. IEEE Access 6, 3107–3119 (2018)
Google Scholar
Huang, E.H, Socher, R., Manning, C.D., et al: Improving word representations via global context and multiple word prototypes. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1, pp. 873–882. Association for Computational Linguistics (2012)
Google Scholar
Bengio, Y., Ducharme, R., Vincent, P., et al.: A neural probabilistic language model. J. Mach. Learn. Res. 3(Feb), 1137–1155 (2003)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Ebesu, T., Fang, Y.: Neural citation network for context-aware citation recommendation. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1093–1096. ACM (2017)
Google Scholar
Peters, M.E., Neumann, M., Iyyer, M., et al: Deep contextualized word representations. arXiv preprint arXiv:1802.05365 (2018)
Devlin, J., Chang, M.W., Lee, K., et al.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Huang, P.S., He, X., Gao, J., et al.: Learning deep structured semantic models for web search using click through data. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 2333–2338. ACM (2013)
Google Scholar
Hu, B., Lu, Z., Li, H., Chen, Q.: Convolutional neural network architectures for matching natural language sentences. In: Proceedings of the NIPS, pp. 2042–2050 (2014)
Google Scholar
Palangi, H., et al.: Semantic modelling with long-short-term memory for information retrieval. arXiv preprint arXiv:1412.6629 (2014)
Bhagavatula, C., Feldman, S., Power, R., et al.: Content-based citation recommendation. arXiv preprint arXiv:1802.08301 (2018)
Sun, Y., Han, J., Yan, X., et al.: Pathsim: meta path-based top-k similarity search in heterogeneous information networks. Proc. VLDB Endow. 4(11), 992–1003 (2011)
Article Google Scholar
Lichtenwalter, R.N., Lussier, J.T., Chawla, N.V.: New perspectives and methods in link prediction. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 243–252. ACM (2010)
Google Scholar
Robertson, S.E., Walker, S.: Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In: Croft, B.W., van Rijsbergen, C.J. (eds.) SIGIR 1994, pp. 232–241. Springer, London (1994). https://doi.org/10.1007/978-1-4471-2099-5_24
Chapter Google Scholar
Guo, L., Cai, X., Hao, F., et al.: Exploiting fine-grained co-authorship for personalized citation recommendation. IEEE Access 5, 12714–12725 (2017)
Article Google Scholar
Tang, J., Zhang, J., Yao, L., et al.: Arnetminer: extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 990–998. ACM (2008)
Google Scholar

Download references

Acknowledgment

The research work is supported by “Shenzhen Science and Technology Project” (JCYJ20180306170836595); “National key research and development program in China” (2019YFB2102300); “the World-Class Universities (Disciplines) and the Characteristic Development Guidance Funds for the Central Universities of China” (PY3A022); “Ministry of Education Fund Projects” (No. 18JZD022 and 2017B00030); “Basic Scientific Research Operating Expenses of Central Universities” (No. ZDYF2017006); “Xi’an Navinfo Corp.& Engineering Center of Xi’an Intelligence Spatial-temporal Data Analysis Project” (C2020103); “Beilin District of Xi’an Science & Technology Project” (GX1803).

Author information

Authors and Affiliations

Shenzhen Research Institute of Xi’an Jiaotong University, Shenzhen, 518057, China
Leipeng Wang, Yuan Rao, Qinyu Bian & Shuo Wang
Lab of Social Intelligence and Complex Data Processing, Software School, Xi’an Jiaotong University, Xi’an, 710049, China
Leipeng Wang, Yuan Rao, Qinyu Bian & Shuo Wang
Shanxi Joint Key Laboratory for Artifact Intelligence (Sub-Lab of Xi’an Jiaotong University), Xi’an, 710049, China
Leipeng Wang, Yuan Rao, Qinyu Bian & Shuo Wang

Authors

Leipeng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Rao
View author publications
You can also search for this author in PubMed Google Scholar
Qinyu Bian
View author publications
You can also search for this author in PubMed Google Scholar
Shuo Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuan Rao .

Editor information

Editors and Affiliations

North University of China, Taiyuan, China
Pinle Qin
Harbin Institute of Technology, Harbin, China
Hongzhi Wang
Harbin University of Science and Technology, Harbin, China
Guanglu Sun
National Academy of Guo Ding Institute of Data Science, Beijing, China
Zeguang Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, L., Rao, Y., Bian, Q., Wang, S. (2020). Content-Based Hybrid Deep Neural Network Citation Recommendation Method. In: Qin, P., Wang, H., Sun, G., Lu, Z. (eds) Data Science. ICPCSEE 2020. Communications in Computer and Information Science, vol 1258. Springer, Singapore. https://doi.org/10.1007/978-981-15-7984-4_1

Download citation

DOI: https://doi.org/10.1007/978-981-15-7984-4_1
Published: 20 August 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7983-7
Online ISBN: 978-981-15-7984-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics