Entity Resolution for Maintaining Electronic Medical Record Using OYSTER

  • Tanya Gupta
  • Varad Deshpande
Conference paper
Part of the EAI/Springer Innovations in Communication and Computing book series (EAISICC)


With the advancement of technology, the world has witnessed the digitization of various fields including education, healthcare, agriculture, manufacturing, etc. Healthcare is a very important field which has witnessed the generation of huge amounts of data in the past few decades due to a steep rise in population, and hence the ever-increasing use of online databases for storing every minute detail. Doctors no more rely on written prescriptions or documents for examining the health conditions of a patient. Electronic medical records have enabled doctors to monitor each patient’s medical history with ease. However, the rate at which the data pertaining to healthcare is increasing has led to the search for new and better alternatives that enhance the feasibility and scalability of already existing digital storage systems. This chapter is intended to provide an insight on how Entity Resolution can be put to use in healthcare for maintaining electronic medical records using the open-source software, OYSTER. Also, this chapter will throw light on how performing Entity Resolution using OYSTER has an edge over the currently used systems for storing personal medical information in hospitals.


Entity resolution Electronic Medical Record OYSTER Identity capture Clusters Records Entities 



We are grateful to Dr. John R. Talburt, Professor of Information Science at University of Arkansas, Little Rock, USA, for teaching us the concepts of entity resolution and getting handy with OYSTER. We would also like to thank him for providing us with the sample data for getting the results. Further we would like to thank Prof. Neha Katre and Prof. Vinaya Sawant, Department of Information Technology, Dwarkadas J. Sanghvi College of Engineering, Mumbai for reviewing our work and making it better and more presentable.


  1. 1.
    S.A. Asabe, N.D. Oye, M. Goji, Hospital patient database management. COMPUSOFT Int. J. Adv. Comput. Technol. 2(3), 65–73 (2013)Google Scholar
  2. 2.
    H.S. Lau, C. Florax, A.J. Porsius, A. de Boer, The completeness of medication histories in hospital medical records of patients admitted to general internal medicine wards. Br. J. Clin. Pharmacol. 49, 597–603 (2001)CrossRefGoogle Scholar
  3. 3.
    T.J. Hannan, Electronic medical record. Canad. Med. Assoc. J. 1–15 (2008)Google Scholar
  4. 4.
    K. Häyrinen, K. Saranto, P. Nykänen, Definition, structure, content, use and impacts of electronic health records: a review of the research literature. Int. J. Med. Inform. 77(5), 291–304 (2008)CrossRefGoogle Scholar
  5. 5.
    R.H. Miller, I. Sim, Physicians’ use of electronic medical records: barriers and solutions. Health Affairs 23(2), 116–126 (2004)CrossRefGoogle Scholar
  6. 6.
    I. Bhattacharya, L. Getoor, Iterative record linkage for cleaning and integration, in Proc. SIGMOD-04 DMKD Workshop, 2004Google Scholar
  7. 7.
    W. Cohen, J. Richman, Learning to match and cluster large high-dimensional data sets for data integration, in Proc. KDD-02, 2002, pp. 475–480Google Scholar
  8. 8.
    W. Cohen, P. Ravikumar, S. Fienberg. A comparison of string metrics for matching names and records, in Proc. KDD-03 Workshop on Data Cleaning, Record Linkage, and Object Consolidation, 2003, pp. 13–18Google Scholar
  9. 9.
    G. Cheng, X. Danyun, Y. Qu, C3D+P: a summarization method for interactive entity resolution. J. Web. Semant. 35(4), 203–213 (2015)CrossRefGoogle Scholar
  10. 10.
    G. Cheng, T. Tran, Y. Qu, RELIN: relatedness and informativeness-based centrality for entity summarization, in Proceedings of the Tenth International Semantic Web Conference, Part I, ed. by L. Aroyo, C. Welty, H. Alani, J. Taylor, A. Bernstein, L. Kagal, N. F. Noy, E. Blomqvist, (Springer, Berlin, 2011), pp. 114–129Google Scholar
  11. 11.
    J.R. Talburt, Y. Zhou, A practical guide to entity resolution with OYSTER: handbook of data quality (2013), pp. 235–270CrossRefGoogle Scholar
  12. 12.
    J.L. Fernández-Alemán, I.C. Señor, P.Á.O. Lozoya, A. Toval, Security and privacy in electronic health records: a systematic literature review. J. Biomed. Inform. 46(3), 541–562 (2013)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • Tanya Gupta
    • 1
  • Varad Deshpande
    • 1
  1. 1.Dwarkadas J. Sanghvi College of EngineeringMumbai UniversityMumbaiIndia

Personalised recommendations