Lessons Learned: Performance Tuning for Hadoop Systems

Trivedi, Manan; Nambiar, Raghunath

doi:10.1007/978-3-319-54334-5_9

Manan Trivedi¹⁵ &
Raghunath Nambiar¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 10080))

Included in the following conference series:

Technology Conference on Performance Evaluation and Benchmarking

1175 Accesses
2 Citations

Abstract

Hadoop has become a strategic data platform for by mainstream enterprises, adopted because it offers one of the fastest paths for businesses take to unlock value from big data while building on existing investments. Hadoop is a distributed framework based on Java that is designed to work with applications implemented using MapReduce modeling. This distributed framework enables the platform to pass the load to thousands of nodes across the whole Hadoop cluster. The nature of distributed frameworks also allows node failure without cluster failure. The Hadoop market is predicted to grow at a compound annual growth rate (CAGR) over the next several years. Several tools and guides describe how to deploy Hadoop clusters, but very little documentation tells how to increase performance of Hadoop clusters after they are deployed. This document provides several BIOS, OS, Hadoop, and Java tunings that can increase the performance of Hadoop clusters. These tunings are based on lessons learned from Transaction Processing Performance Council Express (TPCx) Benchmark HS (TPCx-HS) testing on a Cisco UCS® Integrated Infrastructure for Big Data cluster. TPCx-HS is the industry’s first standard for benchmarking big data systems. It was developed by TPC to provide verifiable performance, price-to-performance, and availability metrics for hardware and software systems that use big data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

IDC Worldwide Big Data Technology and Services Forecast (2015)
Google Scholar
Nambiar, R., Poess, M., Dey, A., Cao, P., Magdon-Ismail, T., Da Ren, Q., Bond, A.: Introducing TPCx-HS: the first industry standard for benchmarking big data systems. In: Nambiar, R., Poess, M. (eds.) TPCTC 2014. LNCS, vol. 8904, pp. 1–12. Springer, Cham (2015). doi:10.1007/978-3-319-15350-6_1
Chapter Google Scholar
Nambiar, R.: A standard for benchmarking big data systems. In: IEEE Big Data Conference, pp. 18–20 (2014)
Google Scholar
TPCx-HS specification. http://www.tpc.org/tpcx-hs/

Download references

Author information

Authors and Affiliations

Cisco Systems, Inc., 275 East Tasman Drive, San Jose, California, 95134, USA
Manan Trivedi & Raghunath Nambiar

Authors

Manan Trivedi
View author publications
You can also search for this author in PubMed Google Scholar
Raghunath Nambiar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Raghunath Nambiar .

Editor information

Editors and Affiliations

Cisco Systems, Inc., San Jose, California, USA
Raghunath Nambiar
Oracle Corporation, Redwood City, California, USA
Meikel Poess

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Trivedi, M., Nambiar, R. (2017). Lessons Learned: Performance Tuning for Hadoop Systems. In: Nambiar, R., Poess, M. (eds) Performance Evaluation and Benchmarking. Traditional - Big Data - Internet of Things. TPCTC 2016. Lecture Notes in Computer Science(), vol 10080. Springer, Cham. https://doi.org/10.1007/978-3-319-54334-5_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-54334-5_9
Published: 18 February 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54333-8
Online ISBN: 978-3-319-54334-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics