Skip to main content

Personalized Citation Recommendation Using an Ensemble Model of DSSM and Bibliographic Information

  • Chapter
  • First Online:
Artificial Intelligence Supported Educational Technologies

Part of the book series: Advances in Analytics for Learning and Teaching ((AALT))

  • 923 Accesses

Abstract

With the tremendous proliferation of scientific literature and research papers published every year, fulfilling a comprehensive literature overview became a tedious and time-consuming task. Citation recommendation is considerably important to improve the efficiency and quality of literature search. It scales down the information overload in academia by using the content of the paper and citation information to automatically recommend papers relevant to the students or respectively researcher’s preferences. In this paper, we propose a novel personalized citation recommendation system comprised of a query-based recommendation module and a graph-based ranking module. The query-based recommendation module relies on Deep Semantic Similarity Model (DSSM) to rank papers based on their semantic similarity to a query text. The graph-based ranking module uses a heterogeneous graph that incorporates the citation and content information within papers, to rank the candidate papers based on their relevance to a query text and the corresponding author. The fusion of the results from both modules provides the final recommendation list. Our intensive experiments on the ACL Analogy dataset (AAN) prove that our model significantly outperforms other state-of-the-art techniques in terms of MAP and MRR. Also it shows a better Recall in the top ranked papers over the best performing baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Alkhatib, W., Herrmann, L. A., & Rensing, C. (2017). Onto. KOM-towards a minimally supervised ontology learning system based on word embeddings and convolutional neural networks. KEOD, 2, 17–26.

    Google Scholar 

  • Bethard, S., & Jurafsky, D. (2010). Who should I cite: Learning literature search models from citation behavior. In Proceedings of the 19th ACM international conference on information and knowledge management (pp. 609–618). New York: ACM Press.

    Chapter  Google Scholar 

  • Bird, S., Dale, R., Dorr, B. J., Gibson, B., Joseph, M. T., Kan, M.-Y., et al. (2008). The acl anthology reference corpus: A reference dataset for bibliographic research in computational linguistics. Sixth International Conference on Language Resources and Evaluation, LREC, 2008, 1755–1759.

    Google Scholar 

  • Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2016). Enriching word vectors with subword information. arXiv preprint arXiv, 1607, 04606.

    Google Scholar 

  • Cai, X., Han, J., Li, W., Zhang, R., Pan, S., & Yang, L. (2018). A three-layered mutually reinforced model for personalized citation recommendation. IEEE Transactions on Neural Networks and Learning Systems, 29, 6026–6037.

    Article  Google Scholar 

  • Chakraborty, T., Modani, N., Narayanam, R., & Nagar, S. (2015). Discern: A diversified citation recommendation system for scientific queries. In 2015 IEEE 31st international conference on data engineering (pp. 555–566). Piscataway, NJ: IEEE.

    Chapter  Google Scholar 

  • Chandrasekaran, K., Gauch, S., Lakkaraju, P., & Luong, H. P. (2008). Concept-based document recommendations for citeseer authors. In International conference on adaptive hypermedia and adaptive web-based systems (pp. 83–92). New York: Springer.

    Chapter  Google Scholar 

  • Ebesu, T., & Fang, Y. (2017). Neural citation network for context-aware citation recommendation. In Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval (pp. 1093–1096). New York: ACM Press.

    Chapter  Google Scholar 

  • Ester, M., Kriegel, H.-P., Sander, J., Xu, X., & others. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. Kdd, 96, 226–231.

    Google Scholar 

  • Gormally, C. B. (2009). Effects of inquiry-based learning on students’ science literacy skills and confidence. International journal for the scholarship of teaching and learning, 3(2), n2.

    Article  Google Scholar 

  • Healey, M. (2005). Linking research and teaching exploring disciplinary spaces and the role of inquiry-based learning. In Reshaping the university: New relationships between research, scholarship and teaching (pp. 67–68). New York: McGraw-Hill.

    Google Scholar 

  • Hearst, M. A. (1992). Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th conference on computational linguistics-volume 2 (pp. 539–545). Stroudsburg, PA: COLING.

    Chapter  Google Scholar 

  • Hernando, A., Bobadilla, J., & Ortega, F. (2016). A non negative matrix factorization for collaborative filtering recommender systems based on a Bayesian probabilistic model. Knowledge-Based Systems, 97, 188–202.

    Article  Google Scholar 

  • Huang, P.-S., He, X., Gao, J., Deng, L., Acero, A., & Heck, L. (2013). Learning deep structured semantic models for web search using clickthrough data. In Proceedings of the 22nd ACM international conference on conference on information and knowledge management (pp. 2333–2338). New York: ACM Press.

    Google Scholar 

  • Huang, W., Wu, Z., Liang, C., Mitra, P., & Giles, C. L. (2015). A neural probabilistic model for context based citation recommendation. In Twenty-Ninth AAAI conference on artificial intelligence. Palo Alto, CA: AAAI Press.

    Google Scholar 

  • Jinha, A. E. (2010). Article 50 million: An estimate of the number of scholarly articles in existence. Learned Publishing, 23, 258–263.

    Article  Google Scholar 

  • Kang, Z., Peng, C., & Cheng, Q. (2016). Top-n recommender system via matrix completion. In Thirtieth AAAI conference on artificial intelligence. Palo Alto, CA: AAAI Press.

    Google Scholar 

  • Le, Q., & Mikolov, T. (2014). Distributed representations of sentences and documents. In International conference on machine learning (pp. 1188–1196). Beijing, China: JMLR.

    Google Scholar 

  • Mahdisoltani, F., Biega, J., & Suchanek, F. (2014). Yago3: A knowledge base from multilingual wikipedias. In 7th biennial conference on innovative data systems research. Asilomar, CA: CIDR.

    Google Scholar 

  • McNee, S. M., Albert, I., Cosley, D., Gopalkrishnan, P., Lam, S. K., Rashid, A. M., et al. (2002). On the recommending of citations for research papers. In Proceedings of the 2002 ACM conference on computer supported cooperative work (pp. 116–125). New Orleans, LA: CSCW.

    Chapter  Google Scholar 

  • Meng, F., Gao, D., Li, W., Sun, X., & Hou, Y. (2013). A unified graph model for personalized query-oriented reference paper recommendation. In Proceedings of the 22nd ACM international conference on information and knowledge management (pp. 1509–1512). New York: ACM Press.

    Google Scholar 

  • Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38, 39–41.

    Article  Google Scholar 

  • Nascimento, C., Laender, A. H., Silva, A. S., & Gonccalves, M. A. (2011). A source independent framework for research paper recommendation. In Proceedings of the 11th annual international ACM/IEEE joint conference on digital libraries (pp. 297–306). New York: ACM Press.

    Chapter  Google Scholar 

  • Ohta, M., Hachiki, T., & Takasu, A. (2011). Related paper recommendation to support online-browsing of research papers. In Fourth international conference on the applications of digital information and web technologies (ICADIWT 2011) (pp. 130–136). Piscataway, NJ: IEEE.

    Chapter  Google Scholar 

  • Pan, L., Dai, X., Huang, S., & Chen, J. (2015). Academic paper recommendation based on heterogeneous graph. In Chinese computational linguistics and natural language processing based on naturally annotated big data (pp. 381–392). Cham, Switzerland: Springer.

    Chapter  Google Scholar 

  • Rose, S., Engel, D., Cramer, N., & Cowley, W. (2010). Automatic keyword extraction from individual documents. In Text mining: Applications and theory (pp. 1–20). Hoboken, NJ: Wiley.

    Google Scholar 

  • Speer, R., & Havasi, C. (2012). Representing general relational knowledge in ConceptNet 5. In N. Calzolari, K. Choukri, T. Declerck, M. Uğur Doğan, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis Proceedings of the Eight international conference on language resources and evaluation (LREC’12) (pp. 3679–3686). European Language Resources Association (ELRA). Istanbul, Turkey: LREC.

    Google Scholar 

  • Torres, R., McNee, S. M., Abel, M., Konstan, J. A., & Riedl, J. (2004). Enhancing digital libraries with TechLens+. In Proceedings of the 4th ACM/IEEE-CS joint conference on digital libraries (pp. 228–236). New York: ACM Press.

    Chapter  Google Scholar 

  • Wang, J., Song, D., Wang, Q., Zhang, Z., Si, L., Liao, L., et al. (2015). An entity class-dependent discriminative mixture model for cumulative citation recommendation. In Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval (pp. 635–644). New York: ACM Press.

    Chapter  Google Scholar 

  • Ware, M., & Mabe, M. (2015). The STM report: An overview of scientific and scholarly journal publishing. Oxford, UK: STM: International Association of Scientific.

    Google Scholar 

  • Yang, C., Wei, B., Wu, J., Zhang, Y., & Zhang, L. (2009). CARES: A ranking-oriented CADAL recommender system. In Proceedings of the 9th ACM/IEEE-CS joint conference on digital libraries (pp. 203–212). New York: ACM Press.

    Chapter  Google Scholar 

  • Zhai, C., & Lafferty, J. (2001). Model-based feedback in the KL-divergence retrieval model. In Tenth international conference on information and knowledge management (CIKM 2001) (pp. 403–410). New York: ACM Press.

    Google Scholar 

  • Zhang, Y., Yang, L., Cai, X., & Dai, H. (2018). A novel personalized citation recommendation approach based on gan. In International symposium on methodologies for intelligent systems (pp. 268–278). New York: ACM Press.

    Google Scholar 

Download references

Acknowledgments

This work has been co-funded by the German Federal Ministry of Education and Research (BMBF) within the framework of the Software Campus project “PIOBRec” [01IS17050].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christoph Rensing .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Alkhatib, W., Rensing, C. (2020). Personalized Citation Recommendation Using an Ensemble Model of DSSM and Bibliographic Information. In: Pinkwart, N., Liu, S. (eds) Artificial Intelligence Supported Educational Technologies. Advances in Analytics for Learning and Teaching. Springer, Cham. https://doi.org/10.1007/978-3-030-41099-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-41099-5_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-41098-8

  • Online ISBN: 978-3-030-41099-5

  • eBook Packages: EducationEducation (R0)

Publish with us

Policies and ethics