Abstract
At the International Research and Educational Institute for Integrated Medical Sciences (IREIIMS) project, we are collecting complete medical data sets to determine relationships between medical data and health status. Since the data include many items which will be categorized differently, it is not easy to generate useful rule sets. Sometimes rare rule combinations are ignored and thus we cannot determine the health status correctly. In this paper, we analyze the features of such complex data, point out the merit of categorized data mining and propose categorized rule generation and health status determination by using combined rule sets.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abe, A., Naya, F., Kogure, K., Hagita, N.: Rule Acquisition from small and heterogeneous data set, Technical Report of JSAI, SIG-KBS-A304-32, pp. 189–194 (2004) (in Japanese)
Abe, A., Hagita, N., Furutani, M., Furutani, Y., Matsuoka, R.: Possibility of Integrated Data Mining of Clinical Data. Data Science Journal 6 (Supplement), 104–115 (2007)
Abe, A., Hagita, N., Furutani, M., Furutani, Y., Matsuoka, R.: An interface for medical diagnosis support. In: Apolloni, B., Howlett, R.J., Jain, L. (eds.) KES 2007, Part II. LNCS (LNAI), vol. 4693, pp. 909–916. Springer, Heidelberg (2007)
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD Int’l Conf. on Management of Data, pp. 207–216 (1993)
Džroski, S., Lavrač, N. (eds.): Relational Data Mining. Springer, Heidelberg (2001)
Ichise, R., Numao, M.: A Graph-based Approach for Temporal Relationship Mining, Technical Report of JSAI, SIG-FAI-A301, pp. 121–126 (2003)
Ichise, R., Numao, M.: First-Order Rule Mining by Using Graphs Created from Temporal Medical Data. In: Tsumoto, S., Yamaguchi, T., Numao, M., Motoda, H. (eds.) AM 2003. LNCS (LNAI), vol. 3430, pp. 112–125. Springer, Heidelberg (2005)
Kobayashi, T., Kawakubo, T.: Prospective Investigation of Tumor Markers and Risk Assessment in Early Cancer Screening. Cancer 73(7), 1946–1953 (1994)
Ohsawa, Y., Okazaki, N., Matsumura, N.: A Scenario Development on Hepatics B and C, Technical Report of JSAI, SIG-KBS-A301, pp. 177–182 (2003)
Osawa, Y., McBurney, P. (eds.): Chance Discovery. Springer, Heidelberg (2003)
Tsumoto, S.: Mining Diagnostic Rules from Clinical Databases Using Rough Sets and Medical Diagnostic Model. Information Sciences 162(2), 65–80 (2004)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufman, San Francisco (1993)
Zheng, Z., Webb, G.I.: Stochastic Attribute Selection Committees. In: Antoniou, G., Slaney, J.K. (eds.) AI 1998. LNCS, vol. 1502, pp. 321–332. Springer, Heidelberg (1998)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abe, A., Hagita, N., Furutani, M., Furutani, Y., Matsuoka, R. (2008). Data Mining of Multi-categorized Data. In: Raś, Z.W., Tsumoto, S., Zighed, D. (eds) Mining Complex Data. MCD 2007. Lecture Notes in Computer Science(), vol 4944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68416-9_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-68416-9_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68415-2
Online ISBN: 978-3-540-68416-9
eBook Packages: Computer ScienceComputer Science (R0)