Abstract
This paper presents a method to extract citation types from scientific articles, viewed as an intrinsic part of emerging trend detection (ETD) in scientific literature. There are two main contributions in this work: (1) Definition of six categories (types) of citations in the literature that are extractable, human-understandable, and appropriate for building the interest and utility functions in emerging trend detection models, and (2) A method to classify citation types using finite-state machines which does not require user-interactions or explicit knowledge. The experimental comparative evaluations show the high performance of the method and the proposed ETD model shows the crucial role of classified citation types in the detection of emerging trends in scientific literature.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kontostathis, A., Galitsky, L., Pottenger, W.M., Roy, S., Phelps, D.J.: A survey of emerging trend detection in textual data mining. In: Berry, M. (ed.) A Comprehensive Survey of Text Mining, ch. 9. Springer, Heidelberg (2003)
Pottenger, W.M., Yang, T.-H.: Detecting emerging concepts in textual data mining. Computational information retrieval, 89–105 (2001)
Swan, R., Allan, J.: Automatic generation of overview timelines. In: SIGIR 2000: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, pp. 49–56. ACM Press, New York (2000)
Small, H.: Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society of Information Science 24, 265–269 (1973)
Lawrence, S., Giles, C.L., Bollacker, K.: Digital libraries and autonomous citation indexing. IEEE Computer 32(6), 67–71 (1999)
Kostoff, R.N., del Rio, J.A., Humenik, J.A., Garcia, E.O., Ramirez, A.M.: Citation mining: integrating text mining and bibliometrics for research user profiling. Journal of the American Society for Information Science and Technology 52(13), 1148–1156 (2001)
Gevry, D.R.: Detection of emerging trends: Automation of domain expert practices (2002)
Nanba, H., Okumura, M.: Towards multi-paper summarization using reference information. In: Proceedings of 16th International Joint Conference on Artificial Intelligence – IJCAI 1999, pp. 926–931 (1999)
Teufel, S.: Argumentative Zoning: Information Extraction from Scientific Text. PhD thesis, University of Edinburgh (1999)
Pham, S.B., Hoffmann, A.G.: A new approach for scientific citation classification using cue phrases. In: Australian Conference on Artificial Intelligence, pp. 759–771 (2003)
Weinstock, M.: Citation indexes. Encyclopedia of Library and Information Science 5, 16–41 (1971)
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2), 257–286 (1989)
McCallum, A., Freitag, D., Pereira, F.: Maximum entropy Markov models for information extraction and segmentation. In: Proceedings of the 17th International Conference on Machine Learning, pp. 591–598 (2000)
Darroch, J.N., Ratcliff, D.: Generalized iterative scaling for log-linear models. The Annals of Mathematical Statistics, 1470–1480 (1972)
Wordnet:Alexical database for the english language, http://wordnet.princeton.edu
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Le, MH., Ho, TB., Nakamori, Y. (2006). Detecting Citation Types Using Finite-State Machines. In: Ng, WK., Kitsuregawa, M., Li, J., Chang, K. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2006. Lecture Notes in Computer Science(), vol 3918. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11731139_32
Download citation
DOI: https://doi.org/10.1007/11731139_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33206-0
Online ISBN: 978-3-540-33207-7
eBook Packages: Computer ScienceComputer Science (R0)