Skip to main content

A Data Analytics Pipeline for Smart Healthcare Applications

  • Conference paper
  • First Online:
Sustained Simulation Performance 2017

Abstract

The rapidly increasing availability of healthcare data is becoming the driving force for the adoption of data-driven approaches. However, due to a large amount of heterogeneous dataset including images (MRI, X-ray), texts (doctor’s note) and sounds, doctors still struggle against temporal and accuracy limitations when processing and analyzing such big data using conventional machines and approaches. Employing advanced machine learning techniques on big healthcare data anlaytics supported by Petascale high performance computing resources is expected to remove those limitations and help find unseen healthcare insights. This paper introduces a data analytics pipeline consisting of data curation (including cleansing, annotation, and integration) and data analytics processes, necessary to develop smart healthcare applications. In order to show its practical use, we present sample applications such as diagnostic imaging, landmark extraction and casenote generation using deep learning models, for orthodontic treatments in dentistry. Eventually, we will build smart healthcare infrastructure and system that fully automate the set of the curation and analytics processes. The developed system will dramatically reduce doctor’s workload and is smoothly expanded to other fields.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    IOTN [1] is one of the severity measures for malocclusion and jaw abnormality, which determines whether orthodontic treatment is necessary.

  2. 2.

    International Statistical Classification of Diseases and Related Health Problems.

References

  1. Brook, P.H., Shaw, W.C.: The development of an index of orthodontic treatment priority. Eur. J. Orthod. 11(3), 309–320 (1989)

    Article  Google Scholar 

  2. Caytiles, R.D., Park, S.: A study of the design of wireless medical sensor network based u-healthcare system. Int. J. Bio-Sci. Bio-Technol. 6(3), 91–96 (2014)

    Article  Google Scholar 

  3. Filipe, L., Fdez-Riverola, F., Costa, N., et al.: Wireless body area networks for healthcare applications. Protocol stack review. Int. J. Distrib. Sens. Netw. (2015). http://dx.doi.org/10.1155/2015/213705

    Google Scholar 

  4. Sharma, M., Bilgic, M.: Evidence-based uncertainty sampling for active learning. Data Min. Knowl. Discov. 31(1), 164–202 (2017)

    Article  MathSciNet  Google Scholar 

  5. Seung, H.S., Opper, M., Sompolinsky, H.: Query by committee. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory (1992)

    Book  Google Scholar 

  6. Settles, B., Craven, M.: An analysis of active learning strategies for sequence labeling tasks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP08 (2008)

    Google Scholar 

  7. Ma, Z., Yang, Y., Nie, F., Sebe, N., Yan, S., Hauptmann, A.: Harnessing lab knowledge for real-world action recognition. Int. J. Comput. Vis. 109(1–2), 60–73 (2014)

    Article  MATH  Google Scholar 

  8. Gomez-Cabrero, D., Abugessaisa, I., Maier, D., et al.: Data integration in the era of omics: current and future challenges. BMC Syst. Biol. 8(2) (2014). http://dx.doi.org/10.1186/1752-0509-8-S2-I1

  9. Doan, A., Halevy, A., Ives, Z.: Principles of Data Integration. Elsevier, Amsterdam (2012)

    Google Scholar 

  10. Sewitch, M.J., Leffondré, K., Dobkin, P.L.: Clustering patients according to health perceptions: relationships to psychosocial characteristics and medication nonadherence. J. Psychosom. Res. 56(3), 323–332 (2004)

    Article  Google Scholar 

  11. Mould, D.: Models for disease progression: new approaches and uses. Clin. Pharmacol. Ther. 92(1), 125–131 (2012)

    Article  Google Scholar 

  12. Schulze, M.B., Hoffmann, K., Boeing, H., et al.: An accurate risk score based on anthropometric, dietary, and lifestyle factors to predict the development of type 2 diabetes. Diabetes Care 30(3), e89 (2007)

    Article  Google Scholar 

  13. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, vol. 1 (2012)

    Google Scholar 

  14. Lakhani, P., Sundaram, B.: Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology (2017). http://dx.doi.org/10.1148/radiol.2017162326

  15. Gulshan, V., Peng, L., Coram, M., et al.: Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. J. Am. Med. Assoc. 316(22), 2402–2410 (2016)

    Article  Google Scholar 

  16. Janowczyk, A., Madabhushi, A.: Deep learning for digital pathology image analysis: a comprehensive tutorial with selected use cases. J. Pathol. Inf. 7(292), 29 (2016)

    Article  Google Scholar 

  17. Avendi, M.R., Kheradvar, A., Jafarkhani, H.: A combined deep-learning and deformable-model approach to fully automatic segmentation of the left ventricle in cardiac MRI. Med. Image Anal. 30, 108–119 (2016)

    Article  Google Scholar 

  18. Jimbocho Orthodontic clinic: www.jimbocho-ortho.com. Accessed June 2017

  19. Grau, V., Alcaniz, M., Juan, M., Knoll, C.: Automatic localization of cephalometric landmarks. J. Biomed. Inform. 34(3), 146–156 (2001)

    Article  Google Scholar 

  20. Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: a neural image caption generator. In: CVPR15 (2015)

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank Prof. Kazunori Nozaki in Osaka University Dental Hospital, for managing and providing medical dataset for experiments. We also thank Prof. Chihiro Tanikawa in Department of Orthodontics & Dentofacial Orthopedics, Osaka University Dental Hospital, for lending her expertise on the orthodontic treatments in dentistry.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chonho Lee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Lee, C., Murata, S., Ishigaki, K., Date, S. (2017). A Data Analytics Pipeline for Smart Healthcare Applications. In: Resch, M., Bez, W., Focht, E., Gienger, M., Kobayashi, H. (eds) Sustained Simulation Performance 2017 . Springer, Cham. https://doi.org/10.1007/978-3-319-66896-3_12

Download citation

Publish with us

Policies and ethics