Performance Characterization of Big Data Systems with TPC Express Benchmark HS

  • Manan Trivedi
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10661)


TPC Express Benchmark HS (TPCx-HS) is the industry’s first standard for benchmarking big data systems. A large big data deployment has many moving parts: the infrastructure (compute, storage, memory, and network), the platform, and the application. In this paper, we characterize in detail how each of these components affects performance.


Industry standards, Performance, Hadoop, Spark



Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. Cisco Systems, Inc., San Jose, USA
