Skip to main content

An Approach to Extract Meaningful Data from Unstructured Clinical Notes

  • Conference paper
  • First Online:
Inventive Systems and Control

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 204))

Abstract

Clinical notes occupy a major place in electronic health records (EHR). Performing analysis in unstructured clinical notes is always more complicated when compared to structured data. Structured data always provide more details and data to a layman than unstructured data. To perform analysis in the unstructured type of data, it is always necessary to extract data from the unstructured data before working on the data. In this work, a method has been proposed to uncover the entities related to the specific clinical terms as clinical notes are completely involved in the extraction of the data. The main focus of this research will be on the method for discovering the patterns of the relationship that exist among the various clinical entities, which includes the tests, treatments, and the diagnosis, focusing mainly on the clinical notes. It is always difficult to extract data from the unstructured data, and when it comes to clinical notes, the value of data is more. There cannot be any data loss as every word in the clinical note matters. The proposed approach uses natural language processing techniques along with text and rule mining to extract data from the unstructured data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. T. Vijayakumar, R. Vinothkanna, Capsule network on font style classification. J. Artif. Intell. 2(02), 64–76 (2020)

    Google Scholar 

  2. S. Manoharan, A smart image processing algorithm for text recognition information extraction and vocalization for the visually challenged. J. Innov. Image Process. (JIIP) 1(01), 31–38 (2019)

    Article  Google Scholar 

  3. L.B. Sally, R.K. Adam , R.S. Bharanidharan, Y.Y. Gordon, H. Michael , N. Shamim, Predicting mortality in critical care patients with fungemia using structured and unstructured data. pp. 1140–1148. (2019)

    Google Scholar 

  4. K.B. To, L.M. Napolitano, Common complications in the critically ill patient. Surgical Clinics North Amer. 92(6), 1519–1557 (2018)

    Article  Google Scholar 

  5. S.V. Desai, T.J. Law, D.M. Needham, Long-term complications of critical care. Critical Care Med. 39(2), 371–379 (2019)

    Article  Google Scholar 

  6. N.A. Halpern, S.M. Pastores, J.M. Oropello, V. Kvetan, Critical care medicine in the United States: addressing the intensivist shortage and image of the specialty. Critical Care Med. 41(12), 2754–2761 (2017)

    Article  Google Scholar 

  7. E.W. Johnson, M.M. Ghassemi, S. Nemati, K.E. Niehaus, D.A. Clifton, G.D. Clifford, Machine learning and decision support in critical care. Proc. IEEE 104(2), 444–466 (2016)

    Article  Google Scholar 

  8. O. Badawi et al., Making big data useful for health care: a summary of the inaugural MIT critical data conference. JMIR Med. Informat. 2(2), e22 (2014)

    Article  Google Scholar 

  9. C.K. Reddy, C.C. Aggarwal, Healthcare Data Analytics, vol. 36 (CRC Press, Boca Raton, FL, USA, 2015).

    Book  Google Scholar 

  10. D. Gotz, H. Stavropoulos, J. Sun, F. Wang, ICDA: A platform for intelligent care delivery analytics. in Proceedings AMIA Annual Symposium, pp. 264–273. (2012)

    Google Scholar 

  11. Perer, J. Sun, Matrixrow: temporal network visual analytics to track symptom evolution during disease progression. in Proceedings AMIA Annual Symposium, pp. 716–725. (2012)

    Google Scholar 

  12. Y. Mao, W. Chen, Y. Chen, C. Lu, M. Kollef, T. Bailey, An integrated data mining approach to real-time clinical monitoring and deterioration warning. in Proceedings 18th ACM SIGKDD International Conference Knowledge Discovery Data Mining, pp. 1140–1148. (2012)

    Google Scholar 

  13. J. Wiens, E. Horvitz, J.V. Guttag, Patient risk strati_cation for hospital-associated C. Diff as a time-series classifcation task. in Proceedings Advanced Neural Information Processing Systems, pp. 467–475. (2012)

    Google Scholar 

  14. S. Saria, D. Koller, A. Penn, Learning individual and population level traits from clinical temporal data. in Neural Information Processing Systems (NIPS), Predictive Models Personalized Med. Workshop (2019)

    Google Scholar 

  15. R. Dürichen, M.A.F. Pimentel, L. Clifton, A. Schweikard, D.A. Clifton, Multitask Gaussian processes for multivariate physiological time-series analysis. IEEE Trans. Biomed. Eng. 62(1), 314–322 (2015)

    Article  Google Scholar 

  16. M. Ghassemi et al., A multivariate timeseries modeling approach to severity of illness assessment and forecasting in ICU with sparse, heterogeneous clinical data. in Proceedings AAAI Conference Artificial Intelligence, pp. 446–453. (2015)

    Google Scholar 

  17. H.V. Batal, G.F. Cooper, M. Hauskrecht, A pattern mining approach for classifying multivariate temporal data.in Proceedings IEEE International Conference Bioinformatics Biomedicine (BIBM), pp. 358–365. (2011)

    Google Scholar 

  18. T.A. Lasko, Effcient inference of Gaussian-process-modulated renewal processes with application to medical event data. in Proceedings Uncertainty Artificial Intelligence, pp. 469–476. (2014)

    Google Scholar 

  19. L.C. Barajas, R. Akella, Dynamically modeling patient's health state from electronic medical records: a time series approach. in Proceedings 21st ACM SIGKDD International Conference Knowledge Discovery Data Mining, pp. 69–78. (2015)

    Google Scholar 

  20. X. Wang, D. Sontag, F. Wang, Unsupervised learning of disease progression models. in Proceedings 20th ACM SIGKDD International Conference Knowledge Discovery Data Mining, pp. 85–94 (2014)

    Google Scholar 

  21. M.J. Cohen, A.D. Grossman, D. Morabito, M.M. Knudson, J. Butte, G.T. Manley, Identifcation of complex metabolic states in critically injured patients using bioinformatic cluster analysis. Crit. Care 14(1), 1 (2010)

    Article  Google Scholar 

  22. J.L. Zhou, V.A. Narayan, J. Ye, Modeling disease progression via fused sparse group lasso.in Proceedings 18th ACM SIGKDD International Conference Knowledge Discovery Data Mining, pp. 1095–1103. (2012)

    Google Scholar 

  23. E. Choi, N. Du, R. Chen, L. Song, J. Sun, Constructing disease network and temporal progression model via context-sensitive hawkes process. in Proceedings IEEE International Conference Data Mining (ICDM), pp. 721–726. (2015)

    Google Scholar 

  24. R. Pivovarov, A.J. Perotte, E. Grave, J. Angiolillo, C.H. Wiggins, N. Elhadad, Learning probabilistic phenotypes from heterogeneous EHR data. J. Biomed. Informat. 58, 156–165 (2015)

    Article  Google Scholar 

  25. https://www.who.int/publications/i/item/9789241506823

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Sukanya Varshini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Varshini, K.S., Uthra, R.A. (2021). An Approach to Extract Meaningful Data from Unstructured Clinical Notes. In: Suma, V., Chen, J.IZ., Baig, Z., Wang, H. (eds) Inventive Systems and Control. Lecture Notes in Networks and Systems, vol 204. Springer, Singapore. https://doi.org/10.1007/978-981-16-1395-1_44

Download citation

Publish with us

Policies and ethics