Abstract
The digital revolution, rapidly decreasing storage costs, and the remarkable results achieved by state-of-the-art machine learning (ML) methods are driving widespread adoption of ML approaches. While there have been notable recent efforts to benchmark ML methods for canonical tasks, none of them address the challenges arising from the increasing pervasiveness of end-to-end ML deployments. The challenges involved in successfully applying ML methods in diverse enterprise settings extend far beyond efficient model training.
In this paper, we present our work in benchmarking advanced data analytics systems and lay the foundation towards an industry standard machine learning benchmark. Unlike previous approaches, we aim to cover the complete end-to-end ML pipeline for diverse, industry-relevant application domains rather than evaluating only training performance. To this end, we present reference implementations of complete ML pipelines including corresponding metrics and run rules, and evaluate them at different scales in terms of hardware, software, and problem size.
Notes
- 4. Self-Monitoring, Analysis and Reporting Technology.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Rabl, T. et al. (2020). ADABench - Towards an Industry Standard Benchmark for Advanced Analytics. In: Nambiar, R., Poess, M. (eds) Performance Evaluation and Benchmarking for the Era of Cloud(s). TPCTC 2019. Lecture Notes in Computer Science(), vol 12257. Springer, Cham. https://doi.org/10.1007/978-3-030-55024-0_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-55023-3
Online ISBN: 978-3-030-55024-0