Leveraging MRC Framework for Research Contribution Patterns Identification in Citation Sentences

Zhao, Yang; Zhang, Zhixiong; Xiao, Yue

doi:10.1007/978-981-99-8088-8_16

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14458))

Included in the following conference series:

International Conference on Asian Digital Libraries

270 Accesses

Abstract

Research contributions convey the essence of academic papers, highlighting their novel knowledge and understanding compared to prior research. In this study, we address the challenge of identifying research contribution patterns from citation sentences by leveraging a Machine Reading Comprehension (MRC) framework. The MRC approach formulates the extraction of contribution patterns as a question-answering task, utilizing natural language queries to extract contribution patterns (CONTRIBUTION, INFLUENCE, and FIELD) from the context.

Our method outperforms the SOTA NER approach in 2022: W²NER, achieving significant performance improvements of +23.76% and +31.92% in F1 scores for label and entity recognition, respectively. In addition, through manual validation and comparison with ChatGPT annotation results, we demonstrate that the accuracy of our approach is 21.65% higher in identifying research contribution patterns. Moreover, the MRC framework handles nested entities and resolves reference disambiguation more accurately, providing a robust solution for complex citation sentences.

Overall, our work presents an advanced approach for identifying research contribution patterns from citation sentences, showcasing its potential to enhance information retrieval and understanding within the scientific community.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 44.99; Price excludes VAT (USA)

Softcover Book: USD 59.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Chen, H., Nguyen, H., Alghamdi, A.: Constructing a high-quality dataset for automated creation of summaries of fundamental contributions of research articles. Scientometrics (2022). https://doi.org/10.1007/s11192-022-04380-z
Article Google Scholar
Zhao, Y., Zhang, Z., Wang, Y., Lin, X.: Identifying research contributions based on semantic analysis of citation sentences: a case study of the 2021 Physiology or Medicine Nobel Prize laureates. In: Proceedings of the 19th International Conference on Scientometrics and Informetrics (2023)
Google Scholar
Levy, O., Seo, M., Choi, E., Zettlemoyer, L.: Zero-shot relation extraction via reading comprehension. http://arxiv.org/abs/1706.04115 (2017)
McCann, B., Keskar, N.S., Xiong, C., Socher, R.: The natural language decathlon: multitask learning as question answering. http://arxiv.org/abs/1806.08730 (2018)
Li, X., et al.: Entity-relation extraction as multi-turn question answering. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics (2019)
Google Scholar
Rajpurkar, P., Jia, R., Liang, P.: Know what you don’t know: unanswerable questions for SQuAD. http://arxiv.org/abs/1806.03822 (2018)
Rajpurkar, P., Zhang, J., Lopyrev, K., Liang, P.: SQuAD: 100,000+ questions for machine comprehension of text. http://arxiv.org/abs/1606.05250 (2016)
Li, X., Feng, J., Meng, Y., Han, Q., Wu, F., Li, J.: A unified MRC framework for named entity recognition. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5849–5859 (2020)
Google Scholar
Gupta, S., Manning, C.D.: Analyzing the dynamics of research by extracting key aspects of scientific papers. In: Proceedings of 5th International Joint Conference on Natural Language Processing, pp. 1–9 (2011)
Google Scholar
Chen, L., Fang, H.: An automatic method for extracting innovative ideas based on the scopus® database. Knowl. Org. 46, 171–186 (2019)
Article Google Scholar
Le, X., et al.: CiteOpinion: evidence-based evaluation tool for academic contributions of research papers based on citing sentences. J. Data Inf. Sci. 4, 26–41 (2019)
Google Scholar
Ferrod, R., Di Caro, L., Schifanella, C.: Structured semantic modeling of scientific citation intents. In: Verborgh, R., et al. (eds.) The Semantic Web. LNCS, vol. 12731, pp. 461–476. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-77385-4_27
Chapter Google Scholar
Yousif, A., Niu, Z., Nyamawe, A.S., Hu, Y.: Improving citation sentiment and purpose classification using hybrid deep neural network model. In: Hassanien, A.E., Tolba, M.F., Shaalan, K., Azar, A.T. (eds.) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2018. AISC, vol. 845, pp. 327–336. Springer, Cham (2019). https://doi.org/10.1007/978-3-319-99010-1_30
Chapter Google Scholar
Cohan, A., Ammar, W., van Zuylen, M., Cady, F.: Structural scaffolds for citation intent classification in scientific publications. In: Proceedings of NAACL-HLT, pp. 3586–3596 (2019)
Google Scholar
Berrebbi, D., Huynh, N., Balalau, O.: GraphCite: citation intent classification in scientific publications via graph embeddings. In: Companion Proceedings of the Web Conference 2022, pp. 779–783 (2022)
Google Scholar
Xing, X., Fan, X., Wan, X.: Automatic generation of citation texts in scholarly papers: a pilot study. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 6181–6190 (2020)
Google Scholar
Wu, J.-Y., Shieh, A.T.-W., Hsu, S.-J., Chen, Y.-N.: Towards generating citation sentences for multiple references with intent control. http://arxiv.org/abs/2112.01332 (2021)
Lahiri, A., Sanyal, D.K., Mukherjee, I.: CitePrompt: using prompts to identify citation intent in scientific papers. arXiv preprint arXiv:2304.12730 (2023)
Devlin, J., Chang, M.W., Lee, K. and Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, pp. 4171–4186 (2019)
Google Scholar
Li, J., et al.: Unified named entity recognition as word-word relation classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 10965–10973 (2022)
Google Scholar
Wei, X., et al.: Zero-shot information extraction via chatting with ChatGPT. http://arxiv.org/abs/2302.10205 (2023)
Gilardi, F., Alizadeh, M., Kubli, M.: ChatGPT outperforms crowd-workers for text-annotation tasks. http://arxiv.org/abs/2303.15056 (2023)

Download references

Acknowledgments

This work was supported by the major project of the National Social Science Foundation of China “Big Data-driven Semantic Evaluation System of Science and Technology Literature” (Project No. 21&ZD329).

Author information

Authors and Affiliations

National Science Library, Chinese Academy of Science, No. 33, Beisihuan Xilu, Beijing, China
Yang Zhao & Zhixiong Zhang
Department of Information Resources Management, School of Economics and Management, University of Chinese Academy of Sciences, No. 33 Beisihuan Xilu, Beijing, China
Yang Zhao & Zhixiong Zhang
School of Information Resource Management, Renmin University of China, No. 59, Zhongguancun Street, Haidian District, Beijing, China
Yue Xiao

Authors

Yang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zhixiong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yue Xiao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhixiong Zhang .

Editor information

Editors and Affiliations

Nanyang Technological University, Singapore, Singapore
Dion H. Goh
Academia Sinica, Taipei, Taiwan
Shu-Jiun Chen
Mahidol University, Tambon Salaya, Amphoe Phutthamonthon, Thailand
Suppawong Tuarob

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhao, Y., Zhang, Z., Xiao, Y. (2023). Leveraging MRC Framework for Research Contribution Patterns Identification in Citation Sentences. In: Goh, D.H., Chen, SJ., Tuarob, S. (eds) Leveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration. ICADL 2023. Lecture Notes in Computer Science, vol 14458. Springer, Singapore. https://doi.org/10.1007/978-981-99-8088-8_16

Download citation

DOI: https://doi.org/10.1007/978-981-99-8088-8_16
Published: 30 November 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8087-1
Online ISBN: 978-981-99-8088-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics