Abstract
In this paper, we present an analysis of a relationship between temporal trends of automatically extracted terms in medical research document and their similarities on a structured vocabulary. In order to obtain the temporal trends, we used our temporal pattern extraction method that combines an automatic term extraction, an importance index of the terms, and clustering for the values in each period. By using a set of medical research documents that were published every year, we extracted temporal patterns of the automatically extracted terms. Then, we calculated their similarities on the medical taxonomy by defining a distance on the tree structure. For analyzing the relationship between the terms included in the patterns and the similarity of the terms on the taxonomy, the differences of the averaged similarities of the terms in each pattern are compared between the two trends of the temporal patterns.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abe, H., Tsumoto, S.: Trend detection from large text data. In: Proceedings of the 2010 IEEE International Conference on Systems, Man and Cybernetics, pp. 310–315. IEEE (2010)
Nakagawa, H.: Automatic term recognition based on statistics of compound nouns. Terminology 6(2), 195–210 (2000)
Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval. Document Retrieval Systems, 132–142 (1988)
Keogh, E., Chu, S., Hart, D., Pazzani, M.: Segmenting time series: A survey and novel approach. In: Data Mining in Time Series Databases, pp. 1–22. World Scientific (2003)
Liao, T.W.: Clustering of time series data: a survey. Pattern Recognition 38, 1857–1874 (2005)
Medical subject headings, http://www.nlm.nih.gov/mesh/
Srinivasan, P.: Meshmap: a text mining tool for medline. In: Proc. of AMAI Symposium 2001, pp. 642–646 (2001)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tsumoto, S., Abe, H. (2012). Comparing Similarity of Concepts Identified by Temporal Patterns of Terms in Biomedical Research Documents. In: Li, T., et al. Rough Sets and Knowledge Technology. RSKT 2012. Lecture Notes in Computer Science(), vol 7414. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31900-6_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-31900-6_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31899-3
Online ISBN: 978-3-642-31900-6
eBook Packages: Computer ScienceComputer Science (R0)