Abstract
In order to select the best interestingness measure appropriate for evaluating the correlation between syndrome elements and symptoms, 60 objective interestingness measures were selected from different subjects. Firstly, a hypothesis for a good measure was proposed. Based on the hypothesis, an experiment was designed to evaluate the measures. The experiment was based on the clinical record database of past dynasties including 51,186 clinical cases. The selected dataset in this study had 44,600 records. Han and Re were selected as the experimental syndrome elements. Three indicators calculated according to the distances between two syndrome elements were obtained in the experiment and were combined into one indicator. The Z score, φ-coefficient and Kappa were selected from 60 measures after the experiment. The Z score and φ- coefficient were selected according to subjective interestingness. Finally, the φ- coefficient was selected as the best measure for its low computational complexity. The method introduced in this paper may be used in other similar territories. Further research of traditional Chinese medicine can be made based on the conclusion made in this paper.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhu, W.: Standardization Research of Differentiation System of Symptoms and Signs and Syndrome in TCM. Tianjin Journal of TCM 19, 1–3 (2002)
Wang, Y., Zhang, Q., Zhang, Z.: Extraction of Syndrome Elements and Destination. Journal of Shandong University of Chinese Medicine 30, 6–7 (2006)
Sun, Z., Xi, G., Yi, J., Zhao, D.: Select informative symptoms combination for diagnosing syndrome. Journal of Biological Systems 15, 27–38 (2007)
Wang, J., Chu, F., Li, J., Yao, K., Zhong, J., Zhou, K., He, Q., Sun, X.: Study on syndrome element characteristics and its correlation with coronary angiography in 324 patients with coronary heart disease. Chinese Journal of Integrative Medicine 14, 274–280 (2008)
Tan, S., Tillisch, K., Bolus, S., Olivas, T., Spiegel, B., Naliboff, B., Chang, L., Mayer, E.: Traditional Chinese medicine based subgrouping of irritable bowel syndrome patients. Am. J. Chin. Med. 33, 365–379 (2005)
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Record 22, 207–216 (1993)
Aggarwal, C.C., Yu, P.S.: A new framework for itemset generation. Association for Computing Machinery, Inc., New York, 10036-5701 (1998)
Brijs, T., Vanhoof, K., Wets, G.: Defining interestingness for association rules. International Journal of Information Theories and Applications 10, 370–376 (2003)
Brin, S., Motwani, R., Silverstein, C.: Beyond market baskets: Generalizing association rules to correlations. ACM SIGMOD Record 26, 265–276 (1997)
McGarry, K.: A survey of interestingness measures for knowledge discovery. The Knowledge Engineering Review 20, 39–61 (2005)
Tan, P., Kumar, V., Srivastava, J.: Selecting the right objective measure for association analysis. Information Systems 29, 293–313 (2004)
Luo, K., Wu, J.: Evaluating Criterion of Association Rules. Control and Decision 18, 277–280 (2003)
Yi, W., Wei, J., Wang, M.: Mining Efficient Association Rules. Computer Engineering & Science 27, 91–94 (2005)
Chen, J., Gao, Y.: Evaluating Criterion of Association Rules Using Efficiency. Computer Engineering and Applications 45, 141–142 (2009)
Huang, Y.: Clinical Epidemiology. People’s Medical Publishing House, Beijing (2006)
Pecina, P.: A machine learning approach to multiword expression extraction. In: Towards a Shared Task for Multiword Expressions (MWE 2008), pp. 54–57 (2008)
Zhou, X., Liu, B., Wu, Z., Feng, Y.: Integrative mining of traditional Chinese medicine literature and MEDLINE for functional gene networks. Artificial Intelligence in Medicine 41, 87–104 (2007)
Piatetsky-Shapiro, G.: Discovery, analysis, and presentation of strong rules. Knowledge Discovery in Databases, 229–248 (1991)
Lenca, P., Meyer, P., Vaillant, B., Lallich, S.: A multicriteria decision aid for interestingness measure selection. Departement LUSSI, ENST Bretagne, Technical Report LUSSI-TR-2004-01-EN (2004)
Geng, L., Hamilton, H.: Interestingness measures for data mining: A survey. ACM Computing Surveys (CSUR) 38, Article 9 (2006)
Zhang, Q., Wang, Y., Zhang, Z., Zhang, Q., Song, G.: The Establishment and Statistics on the Clinical Records Database of the Past Dynasties. Journal of Shandong University of Chinese Medicine 29, 298–299 (2005)
Zhang, Q., Wang, Y., Zhang, L., Yu, D., Wang, Y.: Independent Symptoms with the Least Intension. Journal of Beijing University of Traditional Chinese Medicine, 5–10 (2010)
Han, J., Kamber, M.: Data Mining: Concepts and Techniques, 2nd edn. The Morgan Kaufmann Series in Data Management Systems. Morgan Kaufmann Publishers, San Francisco (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, L., Zhang, Qm., Wang, Yg., Yu, Dl. (2012). Selecting an Appropriate Interestingness Measure to Evaluate the Correlation between Syndrome Elements and Symptoms. In: Cao, L., Huang, J.Z., Bailey, J., Koh, Y.S., Luo, J. (eds) New Frontiers in Applied Data Mining. PAKDD 2011. Lecture Notes in Computer Science(), vol 7104. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28320-8_32
Download citation
DOI: https://doi.org/10.1007/978-3-642-28320-8_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28319-2
Online ISBN: 978-3-642-28320-8
eBook Packages: Computer ScienceComputer Science (R0)