Data Mining and Analytics for Exploring Bulgarian Diabetic Register

  • Conference paper
Data Analytics and Management in Data Intensive Domains (DAMDID/RCDL 2017)

Abstract

This paper discusses the need to build diabetic registers in order to monitor the development of the disease and to assess prevention and treatment plans. It presents the automatic generation of a nationwide Diabetes Register in Bulgaria from outpatient records submitted to the National Health Insurance Fund in 2010–2014, updated with data from outpatient records for 2015–2016. The construction relies on advanced automatic analysis of free clinical texts and on business analytics technologies for storing, maintaining, searching, querying and analyzing data. Original frequent pattern mining algorithms make it possible to discover maximal frequent itemsets of simultaneous diseases for diabetic patients. We show how comorbidities identified for patients in the prediabetes period can help to define alerts about specific risk factors for Diabetes Mellitus type 2, and thus might contribute to prevention. We also claim that the synergy of modern analytics and data mining tools transforms a static archive of clinical patient records into a sophisticated knowledge discovery and prediction environment.
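To make the notion of a maximal frequent itemset of simultaneous diseases concrete, the following is a minimal sketch, not the authors' original algorithms, which are not described in this abstract. It assumes each patient is represented as a set of ICD-10 codes and uses a plain Apriori-style level-wise search; the function names `frequent_itemsets` and `maximal_itemsets`, the toy patient data and the support threshold are all hypothetical.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support count} for every itemset that occurs in at
    least min_support transactions (Apriori-style level-wise search)."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count how many transactions contain each single item.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Candidate k-itemsets: unions of two frequent (k-1)-itemsets.
        candidates = {a | b for a, b in combinations(list(current), 2)
                      if len(a | b) == k}
        # Prune candidates that have an infrequent (k-1)-subset.
        candidates = {c for c in candidates
                      if all(frozenset(s) in current
                             for s in combinations(c, k - 1))}
        # Count support of the surviving candidates.
        current = {}
        for c in candidates:
            support = sum(1 for t in transactions if c <= t)
            if support >= min_support:
                current[c] = support
        frequent.update(current)
        k += 1
    return frequent


def maximal_itemsets(frequent):
    """Keep only the frequent itemsets that have no frequent proper superset."""
    return {s: c for s, c in frequent.items()
            if not any(s < other for other in frequent)}


# Hypothetical toy data: one set of ICD-10 codes per patient.
# E11 = type 2 diabetes mellitus, I10 = essential hypertension,
# E78 = disorders of lipoprotein metabolism, E66 = obesity.
patients = [
    {"E11", "I10", "E78"},
    {"E11", "I10", "E78"},
    {"E11", "I10"},
    {"E11", "E66"},
    {"E11", "E66", "I10"},
]

maximal = maximal_itemsets(frequent_itemsets(patients, min_support=2))
for itemset, support in maximal.items():
    print(sorted(itemset), support)
```

On this toy input the maximal itemsets are {E11, E78, I10} and {E11, E66}, each supported by two patients. A brute-force search like this is only practical for small examples; a register-scale collection would need a more scalable miner.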

Notes

  1. International Classification of Diseases and Related Health Problems 10th Revision. http://apps.who.int/classifications/icd10/browse/2015/en.

  2. WHO, BMI Classification. http://apps.who.int/bmi/index.jsp?introPage=intro_3.html.


Acknowledgements

This research is partially supported by grant IZIDA 02/4 (SpecialIZed Data MIning MethoDs Based on Semantic Attributes), funded by the Bulgarian National Science Fund in 2017–2019. The authors also acknowledge the support of Medical University – Sofia, the National Health Insurance Fund and the Bulgarian Ministry of Health.

Author information

Corresponding author

Correspondence to Svetla Boytcheva.

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Cite this paper

Boytcheva, S., Angelova, G., Angelov, Z., Tcharaktchiev, D. (2018). Data Mining and Analytics for Exploring Bulgarian Diabetic Register. In: Kalinichenko, L., Manolopoulos, Y., Malkov, O., Skvortsov, N., Stupnikov, S., Sukhomlin, V. (eds) Data Analytics and Management in Data Intensive Domains. DAMDID/RCDL 2017. Communications in Computer and Information Science, vol 822. Springer, Cham. https://doi.org/10.1007/978-3-319-96553-6_2

  • DOI: https://doi.org/10.1007/978-3-319-96553-6_2

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-96552-9

  • Online ISBN: 978-3-319-96553-6

  • eBook Packages: Computer Science, Computer Science (R0)
