Big Data Analytics in a Public General Hospital

  • Ricardo S. SantosEmail author
  • Tiago A. Vaz
  • Rodrigo P. Santos
  • José M. Parente de Oliveira
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10122)


Obtaining information and knowledge from big data has become a common practice today, especially in health care. However, a number of challenges make the use of analytics in health care data difficult. The aim of this paper is to present the big data analytics framework defined and implemented at an important Brazilian public hospital, which decided to use this technology to provide insights that will help improve clinical practices. The framework was validated by a use case in which the goal is to discover the behavior patterns of nosocomial infections in the institution. The architecture was defined, evaluated, and implemented. The overall result was very positive, with a relatively simple process for use that was able to produce interesting analytical results.


Big data Data analysis Medical information Decision support system Nosocomial infections 



The authors would like to thank CAPGEMINI Brazil for their financial support; the EMC Brazil for providing the computer servers and technical support; and Hospital de Clínicas de Porto Alegre for their assistance with the data supply and technical information. Finally, we would like to thank for the team involved in this project: Ulisses Souza, Raul Hara, Margarita Bastos, Kefreen Batista, Jean Michel, Luis Macedo, Lais Cervoni, Juliano Pessini and the staff of CAPGEMINI.


  1. 1.
    White, S.E.: A review of big data in health care: challenges and opportunities. Open Access Bioinf. 6, 13–18 (2014)CrossRefGoogle Scholar
  2. 2.
    Raghupathi, W., Raghupathi, V.: Big data analytics in healthcare: promise and potential. Health Inf. Sci. Syst. 2(3), 1–10 (2014)Google Scholar
  3. 3.
    Dao, T.K., Zabaneh, F., Holmes, J., Disrude, L., Price, M., Gentry, L.: A practical data mining method to link hospital microbiology and an infection control database. Am. J. Infect. Control. 36(3), S18–S20 (2008)CrossRefGoogle Scholar
  4. 4.
    Murdoch, T.B., Detsky, A.S.: The inevitable application of big data to health care. J. Am. Med. Assoc. 309(13), 1351–1352 (2013)CrossRefGoogle Scholar
  5. 5.
    Dhar, V.: Big data and predictive analytics in health care. Big Data 2(3), 113–116 (2014)CrossRefGoogle Scholar
  6. 6.
    Kamal, N., Wiebe, S., Engbers, J.D., Hill, M.D.: Big data and visual analytics in health and medicine: from pipe dream to reality. J. Health Med. Inf. 5(5), 1–2 (2014)Google Scholar
  7. 7.
    Peek, N., Holmes, J.H., Sun, J.: Technical challenges for big data in biomedicine and health: data sources, infrastructure, and analytics. IMIA Yearb. Med. Inf. 9(1), 42–47 (2014)CrossRefGoogle Scholar
  8. 8.
    Rumsfeld, J.S., Joynt, K.E., Maddox, T.M.: Big data analytics to improve cardiovascular care: promise and challenges. Nat. Rev. Cardiol. 13, 350–359 (2014)CrossRefGoogle Scholar
  9. 9.
    Ellaway, R.H., Pusic, M.V., Galbraith, R.M., Cameron, T.: Developing the role of big data and analytics in health professional education. Med. Teach. 36, 216–222 (2014)CrossRefGoogle Scholar
  10. 10.
    Khan, H.A., Ahmad, A., Mehboob, R.: Nosocomial infections and their control strategies. Asian Pac. J. Trop. Biomed. 5(7), 509–514 (2015)CrossRefGoogle Scholar
  11. 11.
    Breathnach, A.S.: Nosocomial infections and infection control. Medicine 41(11), 649–653 (2013)CrossRefGoogle Scholar
  12. 12.
    Minelli, M., Chambers, M., Dhiraj, A.: Big data analytics: emerging business intelligence and analytic trends for today’s business. Wiley, Hoboken (2013)CrossRefGoogle Scholar
  13. 13.
    Cooper, A.: What is analytics? Definition and essential characteristics. CETIS Analytics Ser. 1(5), 1–10 (2012). Google Scholar
  14. 14.
    Aasman, J.: Why data lakes require semantics. Information Management Online, 11 June 2015Google Scholar
  15. 15.
    GoPivotal: Greenplum Database: Getting Started, Version 4.2, Rev. A01. Product Documentation (2014)Google Scholar
  16. 16.
    GoPivotal: Pivotal Hadoop: Getting Started, Version 3.0. Product Documentation (2016)Google Scholar
  17. 17.
    Kuiler, E.W.: From big data to knowledge: an ontological approach to big data analytics. Rev. Policy Res. 31(4), 311–318 (2014)CrossRefGoogle Scholar
  18. 18.
    Kim, J., Wang, G., Bae, S.T.: A survey of big data technologies and how semantic computing can help. Int. J. Seman. Comput. 8(1), 99–117 (2014)CrossRefGoogle Scholar
  19. 19.
    Berman, J.: Principles of Big Data. Morgan Kaufmann, Waltham (2013)Google Scholar
  20. 20.
    Gruber, T.R.: Toward principles for the design of ontologies used for knowledge sharing. Int. J. Hum Comput Stud. 43, 907–928 (1995)CrossRefGoogle Scholar
  21. 21.
    Dean, M., Schreiber, G.: Web Ontology Language (OWL) Reference. Product Documentation, Version OWL2 (2009).

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Ricardo S. Santos
    • 1
    Email author
  • Tiago A. Vaz
    • 3
  • Rodrigo P. Santos
    • 3
  • José M. Parente de Oliveira
    • 2
  1. 1.Universidade de Mogi das CruzesMogi das CruzesBrazil
  2. 2.Aeronautics Institute of TechnologySão José dos CamposBrazil
  3. 3.Hospital de Clínicas de Porto AlegrePorto AlegreBrazil

Personalised recommendations