Analysis of Diabetes and Heart Disease in Big Data Using MapReduce Framework

Saluja, Manpreet Kaur; Agarwal, Isha; Rani, Urvija; Saxena, Ankur

doi:10.1007/978-981-15-5113-0_3

Manpreet Kaur Saluja²⁰,
Isha Agarwal²⁰,
Urvija Rani²⁰ &
…
Ankur Saxena²⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1165))

1214 Accesses
4 Citations

Abstract

MapReduce is a programming algorithm which is used for generating a simplified and relative collection of large datasets such as Big Data. In our paper, we relate MapReduce to the healthcare system to make the effective and spontaneous decision making. In life science, an enormous amount of data is generated on a daily basis. But most space is occupied by routinely generated data of non-communicable diseases like blood pressure, diabetes, etc. Diabetes and heart-related diseases are one of the most commonly found around the world. These non- communicable diseases are usually caused by changing lifestyle, food that we eat, stress is becoming the causes of growth in the non-communicable diseases, and so with an increasing number of patients comes, the number of patients records to be handled, this huge amount of data is made useful through a technique called Big Data which contains one such algorithm called MapReduce. By applying MapReduce, the huge clinical data is filtered in different categories which can be easily read for future reference. The main purpose of our paper is to focus on filtering various parameters of diabetes as well as heart diseases, putting clinical data into developing medical intelligence for creating a patient-centered healthcare system. Considering our dataset, 45–54 Age group of people have maximum prone to diabetes and when it comes to heart-related disorders, people above the age of 65 are mostly suffering from this disease.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

T. Mehta, N. Mangla G. Guragon, A survey paper on big data analytics using MapReduce and hive on Hadoop framework a survey paper on big data analytics using MapReduce and hive on HadoopFramework (2016)
Google Scholar
I.J. Anuradha, A brief introduction on big data 5Vs characteristics and Hadoop technology. Proc. Comput. Sci. 48, 319–324 (2015)
Article Google Scholar
R. Beakta, Big data and Hadoop: a review paper. Int. J. Comput. Sci. Inform. Tech. 2 (2015)
Google Scholar
R.C. Shobha, B. Rama, MapReduce with Hadoop for Simplified Analysis of Big Data. Int. J. Adv. Res. Comput. Sci. (2017)
Google Scholar
D. Buono, M. Danelutto, S. Lametti, Map, reduce and MapReduce, the skeleton way, in The Proceedings of International Conference on Computational Science, ICCS 2010, Procedia Computer Science 1 (2012), pp. 2095–2103
Google Scholar
J. Dittrich, J.A. Quiane-Ruiz, Efficient big data processing in Hadoop MapReduce, in The Proceedings of the VLDB Endowment (vol 5, 12, 2012)
Google Scholar
T. Plantenga, Y. Choe, A. Yoshimura, Using performance measurements to improve MapReduce algorithms. Proced. Comput. Sci. 9, 1920–1929 (2012)
Article Google Scholar
T. Sandholm, K. Lai, MapReduce optimization using regulated dynamic prioritization, in Proceedings of the 11th International Joint Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’ 09), Seattle, USA (2009), pp. 299–310
Google Scholar
A.C. Alexander, L. Wang, Big data analytics in heart attack prediction. J. Nurs. Care 06. https://doi.org/10.4172/2167-1168.1000393 (2017)
V.H. Bhat, P.G. Rao, P.D. Shenoy, Efficient prediction model for diabetic database using soft computing techniques architecture Springer-Verlag, Berlin Heidelberg (2009), pp. 328–335
Google Scholar
R. Sharma, Kuppuswamy’s socioeconomic status scale—revision for 2011 and formula for real-time updating. Indian J. Pediatr. 79, 961–962 (2012)
Article Google Scholar
S. Sirsat, M. Sahana, R. Khan, Analysis of Research Data using MapReduce WordCount Algorithm (2015). https://doi.org/10.17148/ijarcce.2015.4542
S. Jain, A. Saxena, Analysis of Hadoop and MapReduce tectonics through hive big data. Int. J. Contr. Theory Appl. 9(14), 3811–3911 (2016)
Google Scholar
A. Saxena, N. Kaushik, N. Kaushik, Implementing and analyzing big data techniques with Spring framework in Java & J2EE, in Second International Conference on Information and Communication Technology for Competitive Strategies (ICTCS) ACM Digital Library (2016)
Google Scholar
A. Matsunaga, M. Tsugawa, J. Fortes, CloudBLAST: Combining MapReduce and virtualization on distributed resources for bioinformatics applications, in Proceedings of the IEEE Fourth International Conference on eScience (eScience’08), Indianapolis, USA (2008), pp. 222–229
Google Scholar
A. Saxena, N. Kaushik, N. Kaushik, A. Dwivedi, Implementation of cloud computing and big data with Java based web application, in Proceedings of the 10th INDIACom; INDIACom-2016; IEEE Conference ID: 37465 2016 3rd International Conference on “Computing for Sustainable Global Development. 16–18 March, 2016. Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA), pp 3043–3047 (2016)
Google Scholar
A. Chhawchharia, A. Saxena, Execution of big data using MapReduce technique and HQL, in Proceedings of the 11th INDIACom; INDIACom-2016; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”. 1–3 March 2017. Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA) (2017)
Google Scholar
M. Chand, C. Shakya, G.S. Saggu, D. Saha, I.K. Shreshtha, A. Saxena, Analysis of big data using apache spark, in Proceedings of the 11th INDIACom; INDIACom-2016; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 1–3rd March, 2017. Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA) (2017)
Google Scholar
J. Polo, D. Carrera, Y. Bacerra, V. Beltran, J. Torres, E. Ayguadé: Performance management of accelerated MapReduce workloads in heterogeneous clusters, in Proceedings of the 39th International Conference on Parallel Processing (ICPP’10), San Diego, USA (2010), pp. 653–662
Google Scholar
S. Sendre, S. Singh, L. Anand, V. Sharma, A. Saxena, Decimation of duplicated images using MapReduce in Bigdata, in Proceedings of the 11th INDIACom; INDIACom-2016; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 1–3 March, 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA) (2017)
Google Scholar
Y. Luo, Z. Guo, Y. Sun, B. Plale, J. Qiu, W.W. Li, A hierarchical framework for cross-domain MapReduce execution, in Proceedings of the 2nd International Workshop on Emerging computational methods for the life sciences (ECMLS’11), San Jose, USA (2011), pp. 15–22
Google Scholar
S. Jain, A. Saxena, Integration of spring in Hadoop for data processing, in Proceedings of the 11th INDIACom; INDIACom-2016; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 1–3rd March 2017 Bharati Vidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA) (2017)
Google Scholar
Z. Fadika, E. Dede, J. Hartog, M. Govindaraju, MARLA: MapReduce for heterogeneous clusters, in Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid’12), Ottawa, Canada (2012), pp. 49–56
Google Scholar
K. Yesugade, V. Bangre, S. Sinha, S. Kak, A. Saxena, Analyzing human behaviour using data analytics in booking a type hotel, in Proceedings of the 11th INDIACom; INDIACom-2016; IEEE Conference ID: 40353 2017 4th International Conference on “Computing for Sustainable Global Development”, 1–3rd March, 2017 BharatiVidyapeeth’s Institute of Computer Applications and Management (BVICAM), New Delhi (INDIA) (2017)
Google Scholar
W.-T. Tsai, P. Zhong, J. Elston, X. Bai, Y. Chen, Service replication with MapReduce in clouds, in Proceedings of the 10th International Symposium on Autonomous Decentralized System (ISADS’11), Kobe, Japan, (2011), pp. 381–388
Google Scholar
R. Buyya, C.S. Yeo, S. Venugopal, J. Broberg, I. Brandic, Cloud computing and emerging IT platforms: vision, hype, and reality for delivering computing as the 5th utility. Fut. Gen. Comput. Syst. 25(6), 599–616 (2009)
Article Google Scholar
F. Tian, K. Chen: Towards optimal resource provisioning for running MapReduce programs in public clouds, in Proceedings of the 4th IEEE International Conference on Cloud Computing (CLOUD’11), Washington DC, USA, (2011), pp. 155–162
Google Scholar
J. Dean, S. Ghemawat, MapReduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)
Article Google Scholar
C. Vecchiola, X. Chu, R. Buyya, Aneka: a software platform for .NET-based cloud computing, in W. Gentzsch, L. Grandinetti, G. Joubert, High Speed and Large-Scale Scientific Computing
Google Scholar
Amsterdam, The Netherlands: IOS Press, (2009), pp. 267–295
Google Scholar
Z. Fadika, M. Govindaraju, LEMO-MR: Low overhead and elastic MapReduce implementation optimized for memory and CPU-intensive applications, in Proceedings of the 2nd International Conference on Cloud Computing Technology and Science (CloudCom’10), Indianapolis, USA (2010), pp. 1–8
Google Scholar
Y. Geng, S. Chen, Y. Wu, R. Wu, G. Yang, W. Zheng: Locationaware MapReduce in virtual cloud, in Proceedings of the 40th International Conference on Parallel Processing (ICPP’11), Taipei, Taiwan, (2011), pp. 275–284
Google Scholar
A. Verma, L. Cherkasova, R. Campbell, Resource provisioning framework for MapReduce jobs with performance goals, in Middleware
Google Scholar

Download references

Acknowledgements

We wish to show our gratitude and sincere thanks to our mentor Dr. Ankur Saxena, Amity Institute of Biotechnology, Noida, for his guidance, assistance, insights, and expertise. We also express our sincere thanks to the Amity Institute of Biotechnology, Amity University, Noida, for providing us this opportunity and a great platform to work on.

Author information

Authors and Affiliations

Amity Institute of Biotechnology, Amity University, Noida, Uttar Pradesh, India
Manpreet Kaur Saluja, Isha Agarwal, Urvija Rani & Ankur Saxena

Authors

Manpreet Kaur Saluja
View author publications
You can also search for this author in PubMed Google Scholar
Isha Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Urvija Rani
View author publications
You can also search for this author in PubMed Google Scholar
Ankur Saxena
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ankur Saxena .

Editor information

Editors and Affiliations

Maharaja Agrasen Institute of Technology, Rohini, Delhi, India
Deepak Gupta
Maharaja Agrasen Institute of Technology, Rohini, Delhi, India
Ashish Khanna
CHRIST (Deemed to be University), Bengaluru, Karnataka, India
Siddhartha Bhattacharyya
Department of Information Technology, Faculty of Computers and Information, Cairo University, Giza, Egypt
Aboul Ella Hassanien
Department of Computer Science, Shaheed Sukhdev College of Business Studies, University of Delhi, Rohini, Delhi, India
Sameer Anand
Department of Computer Science, Shaheed Sukhdev College of Business Studies, University of Delhi, Rohini, Delhi, India
Ajay Jaiswal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saluja, M.K., Agarwal, I., Rani, U., Saxena, A. (2021). Analysis of Diabetes and Heart Disease in Big Data Using MapReduce Framework. In: Gupta, D., Khanna, A., Bhattacharyya, S., Hassanien, A.E., Anand, S., Jaiswal, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1165. Springer, Singapore. https://doi.org/10.1007/978-981-15-5113-0_3

Download citation

DOI: https://doi.org/10.1007/978-981-15-5113-0_3
Published: 02 August 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5112-3
Online ISBN: 978-981-15-5113-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics