MapReduce based integration of health hubs: a healthcare design approach
- 4 Downloads
The increasing population in Asia brings up the need for integration of healthcare for efficient and timely manageable treatment for different diseases. Healthcare domain is one of the most important and challenging fields in terms of data collection and analysis. This domain always provide lots of opportunities to explore the hidden knowledge in accessing health records. With the growth of unstructured data in large volume that leads towards the solution by the NoSQL data management tool to manage the huge amount of data. This framework proposes a MapReduce Approach (MRA) for data management in healthcare industry with join based expectation maximization algorithm for NoSQL data management solution, which scales the data with accurate modality. This approach also simplifies the way to integrate healthcare data from different models in the distributed environment from different health hubs. Experimental results show that the proposed approach works in a scalable manner to integrate and match the unstructured data of different health data sources. Examples are illustrated with suitable methodology and further research scope is pinpointed.
KeywordsNoSQL database MapReduce Expectation maximization HDFS Health data
Compliance with ethical standards
Conflict of interest
The author(s) declare(s) that there is no conflict of interest regarding the publication of this paper.
This article does not contain any studies with human participants or animals performed by any of the authors.
- 2.Hermon R, Williams PAH. Big data in healthcare: What is it used for?. 2014.Google Scholar
- 7.Apache. Hadoop. http://hadoop.apache.org/, 2006.
- 12.Chouvarda I, Philip NY, Natsiavas P, Kilintzis V, Sobnath D, Kayyali R, Maglaveras N. WELCOME—innovative integrated care platform using wearable sensing and smart cloud computing for COPD patients with comorbidities. In Engineering in medicine and biology society (EMBC), 2014 36th annual international conference of the IEEE. 2014: 3180–3183.Google Scholar
- 13.Kelders SM, van Gemert-Pijnen JE. Using log-data as a starting point to make e-health more persuasive. Persuasive. 2013:99–109.Google Scholar
- 17.Maglaveras N, Kilintzis V, Koutkias V, Chouvarda I. Integrated care and connected health approaches leveraging personalised health through big data analytics. pHealth. 2016: 117–122.Google Scholar
- 22.Bellazzi R. Big data and biomedical informatics: a challenging opportunity. Yearbook Med Inform. 2014;9(1):8.Google Scholar
- 24.Ramesh D, Suraj P, Saini L. Big data analytics in healthcare: a survey approach. 2016 International Conference on Microelectronics, Computing and Communications (MicroCom), 2016: 1–6. IEEE.Google Scholar
- 25.Ahmed S, Usman Ali M, Ferzund J, Sarwar MA, Rehman A, Mehmood A. Modern data formats for big bioinformatics data analytics. arXiv preprint arXiv:1707.05364. 2017.Google Scholar
- 27.Mall R et al. Representative subsets for big data learning using k-NN graphs. Big data (big data), 2014 IEEE international conference on. IEEE, 2014.Google Scholar
- 28.Rama Satish KV, Kavya NP. Big data processing with harnessing hadoop-MapReduce for optimizing analytical workloads. Contemporary computing and informatics (IC3I), 2014 international conference on. IEEE, 2014.Google Scholar
- 30.Mohamed, Marwa Hussien, and Mohamed Helmy Khafagy. Hash semi cascade join for joining multi-way map reduce. SAI intelligent systems conference (IntelliSys), 2015, pp. 355–361. IEEE, 2015.Google Scholar
- 31.Afrati FN, Ullman JD. Optimizing joins in a map-reduce environment. Proceedings of the 13th international conference on extending database technology. ACM, 2010, 99–110.Google Scholar
- 32.Blanas S, Patel JM, VukErcegovac JR, Shekita EJ, Tian Y. A comparison of join algorithms for log processing in mapreduce. Proceedings of the 2010 ACM SIGMOD International Conference on Management of data. 975–986. ACM, 2010.Google Scholar
- 33.Ferrera P, de Prado I, Palacios E, Fernandez-Marquez JL, Di Marzo Serugendo G. Tuple MapReduce: beyond classic MapReduce. Data Mining (ICDM), 2012 IEEE 12th international conference on. 260-269. IEEE, 2012.Google Scholar
- 34.David M, Benjelloun O, Garcia Molina H. Generic entity resolution with data confidences. Stanford University 2006.Google Scholar
- 35.Akbarinia R, Ayat N, Afsarmanesh H, Valduriez P. Entity resolution for uncertain data. BDA 2012.Google Scholar
- 36.Ayat N, Akbarinia R, Afsarmanesh H, Valduriez P. Entity resolution for uncertain data. 2012.Google Scholar