SDI: Shape Distribution Indicator and Its Application to Find Interrelationships Between Physical Activity Tests and Other Medical Measures

  • Ashkan Sami
  • Ryoichi Nagatomi
  • Makoto Takahashi
  • Takeshi Tokuyama
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4304)


Comprehensibility is driving force in medical data mining results since doctors utilize the outputs and give the final decision. Another important issue specific to some data sets, like physical activity, is their uniform distribution due to tile analysis that was performed on them In this paper, we propose a novel data mining tool named SDI (Shape Distribution Indicator) to give a comprehensive view of co-relations of attributes together with an index named ISDI to show the robustness of SDI outputs. We apply SDI to explore the relationship of the Physical Activity data and symptoms in medical test dataset for which popular data mining methods fail to give an appropriate output to help doctors decisions. In our experiment, SDI found several useful relationships.


Physical Activity Benign Prostatic Hyperplasia Association Rule International Prostate Symptom Score Frequent Itemsets 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Cios, K.J., Moore, G.W.: Uniqueness of medical data mining. Artificial Intelligence in Medicine 26(1-2), 1–24 (2002)CrossRefGoogle Scholar
  2. 2.
    Sami, A.: Obstacles and Misunderstandings Facing Medical Data Mining. In: Li, X., Zaïane, O.R., Li, Z. (eds.) ADMA 2006. LNCS (LNAI), vol. 4093, pp. 856–863. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  3. 3.
    American Society of Consultant Pharmacologists, Health Trends (1998) (last visted 3/2006),
  4. 4.
    StatSoft, Inc: Electronic Statistics Textbook. StatSoft, Tulsa, OK (2006),
  5. 5.
    Okamura, K., Usami, T., Nagahama, K., Maruyama, S., Mizuta, E.: Quality of life assessment of urination in elderly Japanese men and women with some medical problems using International Prostate Symptom Score and King’s Health Questionnaire. European Urology 41(4), 411–419 (2002)CrossRefGoogle Scholar
  6. 6.
    Agrawal, R., Gehrke, J., Gunopulos, D., Raghavan, P.: Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications. SIGMOD Record 27(2), 94–105 (1998)CrossRefGoogle Scholar
  7. 7.
    Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proc. 2000 ACM SIGMOD Int. Conf. Management of Data (SIGMOD 2000), Dallas, TX, May 2000, pp. 1–12 (2000)Google Scholar
  8. 8.
    Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Proc. of Fourth International Conference on Knowledge Discovery and Data Mining (1998)Google Scholar
  9. 9.
    Li, W., Han, J., Pei, J.: Cmar: Accurate and efficient classification based on multiple class-association rules. In: Proc. of First IEEE International Conference on Data Mining, pp. 369–376 (2001)Google Scholar
  10. 10.
    Dong, G., Zhang, X., Wong, L., Li, J.: Caep: Classification by aggregating emerging patterns. In: Arikawa, S., Furukawa, K. (eds.) DS 1999. LNCS (LNAI), vol. 1721, pp. 30–42. Springer, Heidelberg (1999)CrossRefGoogle Scholar
  11. 11.
    Barry, M.J., Fowler Jr., F.J., O’Leary, M.P., Bruskewitz, R.C., Holtgrewe, H.L., Mebust, W.K., Cockett, A.T.: The American Urological Association symptom index for benign prostatic hyperplasia. Journal of Urology 148(5), 1549–1557 (1992)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Ashkan Sami
    • 1
  • Ryoichi Nagatomi
    • 2
  • Makoto Takahashi
    • 1
  • Takeshi Tokuyama
    • 3
  1. 1.Graduate School of EngineeringTohoku UniversityJapan
  2. 2.Department of Medicine and Science in Sports and Exercise, Graduate School of MedicineTohoku UniversityJapan
  3. 3.Graduate School of Information SciencesTohoku UniversityJapan

Personalised recommendations