Skip to main content
Log in

Application of big data analysis with decision tree for the foot disorder

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

In medical field, massive data sets is generated by rapid development of hospital information system. For analysis of these medical big data, this study showed analysis process of the clinical data to acquire significant information effectively between the foot disorder groups and biomechanical parameters related to symptom by developing a prediction model of the decision tree. The first clinical health records of 1523 patients diagnosed with foot disorder were used for analysis, in total 6610 records. The dependent variable in the analysis data was consisted of five complex disorder groups, and the independent variable was composed of 24 attributes. The decision tree was applied to analyze pattern of the foot disorder. The measured prediction rate was Correct: 72.96 % and Wrong: 27.04 % in the training data, and Correct: 68.66 % and Wrong: 31.34 % in the test data. As a result of analysis on the five foot complex foot disorder groups by using C5.0 algorithm, 12 rules were generated. To improve accuracy of classification, the detailed preprocessing and other data mining algorithms will be applied from now on.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2001)

    Google Scholar 

  2. Chen, M., Mao, S., Liu, Y.: Big data: a survey. Mob. Netw. Appl. 19(2), 171–209 (2014)

    Article  MathSciNet  Google Scholar 

  3. Seo, K.S.: Database White Paper. Korea Database Agency, Seoul (2013)

    Google Scholar 

  4. Rebentrost, P., Mohseni, M., Lloyd, S.: Quantum support vector machine for big data classification. Phys. Rev. Lett. 113(13), 130503 (2014)

    Article  Google Scholar 

  5. Murdoch, T.B., Detsky, A.S.: The inevitable application of big data to health care. Jama 309(13), 1351–1352 (2013)

    Article  Google Scholar 

  6. Mckinsey Global Institute: Big data: the next frontier for innovation, competition, and productivity. http://bigdatawg.nist.gov/MGI_big_data_full_report.pdf (2011). Accessed June 2011

  7. Islam, A.R., Chung, T.S.: An improved frequent pattern tree based association rule mining technique. In: 2011 International Conference on Information Science and Applications (ICISA), IEEE, pp. 1–8 (2011)

  8. Lincoln, T.L., Suen, P.W.: Common rotational variations in children. J. Am. Acad. Orthop. Surg. 11(5), 312–320 (2003)

    Google Scholar 

  9. Chang, M.Y., Shih, C.C., Chiang, D.A., Chen, C.C.: Mining a small medical data set by integrating the decision tree and t-test. J. Softw. 6(12), 2515–2520 (2011)

    Article  Google Scholar 

  10. Hui, Y., Rongqun, Z., Xianwen, L.: Classification of wetland from TM imageries based on decision tree. WSEAS Trans. Inf. Sci. Appl. 6(6), 1155–1164 (2009)

    Google Scholar 

  11. Song, J.Y., Kim, H.K.: A study of decision tree in detecting intrusions. J. Korean Data Anal. Soc. 12, 983–996 (2010)

    MathSciNet  Google Scholar 

  12. Breault, J.L., Goodall, C.R., Fos, P.J.: Data mining a diabetic data warehouse. Artif. Intell. Med. 26(1), 37–54 (2002)

    Article  Google Scholar 

  13. Kim, Y.M.: A study on analysis of factors on in-hospital mortality for community-acquired pneumonia. J. Korean Data Inf. Sci. Soc. 22(3), 389–400 (2011)

    Google Scholar 

  14. Stoean, R., Stoean, C., Lupsor, M., Stefanescu, H., Badea, R.: Evolutionary-driven support vector machines for determining the degree of liver fibrosis in chronic hepatitis C. Artif. Intell. Med. 51(1), 53–65 (2011)

    Article  Google Scholar 

  15. Lim, K.H., Ryu, K.S., Park, S.H., Shon, H.S., Ryu, K.H.: Short-term mortality prediction of recurrence patients with ST-segment elevation myocardial infarction. J. Korea Soc. Comput. Inf. 17(10), 145–154 (2012)

    Article  Google Scholar 

  16. Duru, N.: An application of apriori algorithm on a diabetic database. In: Khosla, R., Howlet, R.J., Jain, L.C. (eds.) Knowledge-Based Intelligent Information and Engineering Systems, pp. 398–404. Springer, Berlin (2005)

    Chapter  Google Scholar 

  17. Park, M., Choi, S., Shin, A.M., Chul, H.: Analysis of the characteristics of the older adults with depression using data mining decision tree analysis. J. Korean Acad. Nurs. 43, 1–10 (2013)

    Article  Google Scholar 

  18. Huh, M.H., Lee, Y.G.: Data Mining Modeling and Case, 2nd edn. Hannarae, Seoul (2008)

    Google Scholar 

  19. Raghupathi, W.: Data mining in health care. In: Kudyba, S. (ed.) Healthcare Informatics: Improving Efficiency and Productivity, pp. 211–223. CRC Press, Boca Raton (2010)

    Chapter  Google Scholar 

  20. Raghupathi, W., Raghupathi, V.: Big data analytics in healthcare: promise and potential. Health Inf. Sci. Syst. 2(1), 3 (2014)

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by National Research Foundation of Korea (NRF) Grant funded by the Korea government (MSIP) (NRF-2013R1A2A2A04016782).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jung-Ja Kim.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Choi, JK., Jeon, KH., Won, Y. et al. Application of big data analysis with decision tree for the foot disorder. Cluster Comput 18, 1399–1404 (2015). https://doi.org/10.1007/s10586-015-0480-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-015-0480-6

Keywords

Navigation