Abstract
We present FoodBroker, a new data generator for benchmarking graph-based business intelligence systems and approaches. It covers two realistic business processes and their involved master and transactional data objects. The interactions are correlated in controlled ways to enable non-uniform distributions for data and relationships. For benchmarking data integration, the generated data is stored in two interrelated databases. The dataset can be arbitrarily scaled and allows comprehensive graph- and pattern-based analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Angles, R., et al.: The linked data benchmark council: a graph and RDF industry benchmarking effort. ACM SIGMOD Rec. 43(1), 27–31 (2014)
Boncz, P.: LDBC: benchmarks for graph and RDF data management. In: Proceedings of the 17th International Database Engineering and Applications Symposium. ACM (2013)
Chakrabarti, D., Zhan, Y., Faloutsos, C.: R-mat: a recursive model for graph mining. In: SDM, vol. 4, pp. 442–446. SIAM (2004)
Dominguez-Sal, D., Urbón-Bayes, P., Giménez-Vañó, A., Gómez-Villamor, S., Martínez-Bazán, N., Larriba-Pey, J.L.: Survey of graph database performance on the HPC scalable graph analysis benchmark. In: Shen, H.T., Pei, J., Özsu, M.T., Zou, L., Lu, J., Ling, T.-W., Yu, G., Zhuang, Y., Shao, J. (eds.) WAIM 2010. LNCS, vol. 6185, pp. 37–48. Springer, Heidelberg (2010)
Ghazal, A., et al.: Bigbench: towards an industry standard benchmark for big data analytics. In: Proceedings of the 2013 international conference on Management of data. ACM
Gupta, A.: Generating large-scale heterogeneous graphs for benchmarking. In: Rabl, T., Poess, M., Baru, C., Jacobsen, H.-A. (eds.) WBDB 2012. LNCS, vol. 8163, pp. 113–128. Springer, Heidelberg (2014)
Holzschuher, F., Peinl, R.: Performance of graph query languages: comparison of cypher, gremlin and native access in Neo4j. In: Proceedings of the Joint EDBT/ICDT 2013 Workshops. ACM (2013)
OLAP Council.: APB-1 OLAP Benchmark. http://www.olapcouncil.org/research/bmarkly.htm
Park, Y., et al.: Graph databases for large-scale healthcare systems: a framework for efficient data management and data services. In: IEEE 30th International Conference on Data Engineering Workshops (ICDEW) (2014)
Petermann, A., Junghanns, M., Müller, R., Rahm, E.: BIIIG : enbabling business intelligence with integrated instance graphs. In: IEEE 30th International Conference on Data Engineering Workshops (ICDEW) (2014)
Pham, M.-D., Boncz, P., Erling, O.: S3G2: a scalable structure-correlated social graph generator. In: Nambiar, R., Poess, M. (eds.) TPCTC 2012. LNCS, vol. 7755, pp. 156–172. Springer, Heidelberg (2013)
Transaction Processing Performance Council.: TPC Benchmarks. http://www.tpc.org/information/benchmarks.asp
Vasilyeva, E., et al.: Leveraging flexible data management with graph databases. In: 1st International Workshop on Graph Data Management Experiences and Systems. ACM (2013)
Vicknair, C., et al.: A comparison of a graph database and a relational database: a data provenance perspective. In: Proceedings of the 48th annual Southeast regional conference. ACM (2010)
Acknowledgments
This work is partly funded within the EU program Europa fördert Sachsen of the European Social Fund.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Petermann, A., Junghanns, M., Müller, R., Rahm, E. (2015). FoodBroker - Generating Synthetic Datasets for Graph-Based Business Analytics. In: Rabl, T., Sachs, K., Poess, M., Baru, C., Jacobson, HA. (eds) Big Data Benchmarking. WBDB 2014. Lecture Notes in Computer Science(), vol 8991. Springer, Cham. https://doi.org/10.1007/978-3-319-20233-4_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-20233-4_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20232-7
Online ISBN: 978-3-319-20233-4
eBook Packages: Computer ScienceComputer Science (R0)