Skip to main content

FoodBroker - Generating Synthetic Datasets for Graph-Based Business Analytics

  • Conference paper
  • First Online:
Book cover Big Data Benchmarking (WBDB 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8991))

Included in the following conference series:

Abstract

We present FoodBroker, a new data generator for benchmarking graph-based business intelligence systems and approaches. It covers two realistic business processes and their involved master and transactional data objects. The interactions are correlated in controlled ways to enable non-uniform distributions for data and relationships. For benchmarking data integration, the generated data is stored in two interrelated databases. The dataset can be arbitrarily scaled and allows comprehensive graph- and pattern-based analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/dbs-leipzig/foodbroker.

  2. 2.

    http://www.biiig.org.

References

  1. Angles, R., et al.: The linked data benchmark council: a graph and RDF industry benchmarking effort. ACM SIGMOD Rec. 43(1), 27–31 (2014)

    Article  Google Scholar 

  2. Boncz, P.: LDBC: benchmarks for graph and RDF data management. In: Proceedings of the 17th International Database Engineering and Applications Symposium. ACM (2013)

    Google Scholar 

  3. Chakrabarti, D., Zhan, Y., Faloutsos, C.: R-mat: a recursive model for graph mining. In: SDM, vol. 4, pp. 442–446. SIAM (2004)

    Google Scholar 

  4. Dominguez-Sal, D., Urbón-Bayes, P., Giménez-Vañó, A., Gómez-Villamor, S., Martínez-Bazán, N., Larriba-Pey, J.L.: Survey of graph database performance on the HPC scalable graph analysis benchmark. In: Shen, H.T., Pei, J., Özsu, M.T., Zou, L., Lu, J., Ling, T.-W., Yu, G., Zhuang, Y., Shao, J. (eds.) WAIM 2010. LNCS, vol. 6185, pp. 37–48. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  5. Ghazal, A., et al.: Bigbench: towards an industry standard benchmark for big data analytics. In: Proceedings of the 2013 international conference on Management of data. ACM

    Google Scholar 

  6. Gupta, A.: Generating large-scale heterogeneous graphs for benchmarking. In: Rabl, T., Poess, M., Baru, C., Jacobsen, H.-A. (eds.) WBDB 2012. LNCS, vol. 8163, pp. 113–128. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  7. Holzschuher, F., Peinl, R.: Performance of graph query languages: comparison of cypher, gremlin and native access in Neo4j. In: Proceedings of the Joint EDBT/ICDT 2013 Workshops. ACM (2013)

    Google Scholar 

  8. OLAP Council.: APB-1 OLAP Benchmark. http://www.olapcouncil.org/research/bmarkly.htm

  9. Park, Y., et al.: Graph databases for large-scale healthcare systems: a framework for efficient data management and data services. In: IEEE 30th International Conference on Data Engineering Workshops (ICDEW) (2014)

    Google Scholar 

  10. Petermann, A., Junghanns, M., Müller, R., Rahm, E.: BIIIG : enbabling business intelligence with integrated instance graphs. In: IEEE 30th International Conference on Data Engineering Workshops (ICDEW) (2014)

    Google Scholar 

  11. Pham, M.-D., Boncz, P., Erling, O.: S3G2: a scalable structure-correlated social graph generator. In: Nambiar, R., Poess, M. (eds.) TPCTC 2012. LNCS, vol. 7755, pp. 156–172. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  12. Transaction Processing Performance Council.: TPC Benchmarks. http://www.tpc.org/information/benchmarks.asp

  13. Vasilyeva, E., et al.: Leveraging flexible data management with graph databases. In: 1st International Workshop on Graph Data Management Experiences and Systems. ACM (2013)

    Google Scholar 

  14. Vicknair, C., et al.: A comparison of a graph database and a relational database: a data provenance perspective. In: Proceedings of the 48th annual Southeast regional conference. ACM (2010)

    Google Scholar 

Download references

Acknowledgments

This work is partly funded within the EU program Europa fördert Sachsen of the European Social Fund.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to André Petermann .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Petermann, A., Junghanns, M., Müller, R., Rahm, E. (2015). FoodBroker - Generating Synthetic Datasets for Graph-Based Business Analytics. In: Rabl, T., Sachs, K., Poess, M., Baru, C., Jacobson, HA. (eds) Big Data Benchmarking. WBDB 2014. Lecture Notes in Computer Science(), vol 8991. Springer, Cham. https://doi.org/10.1007/978-3-319-20233-4_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-20233-4_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-20232-7

  • Online ISBN: 978-3-319-20233-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics