Realistic Synthetic Data for Testing Association Rule Mining Algorithms for Market Basket Databases

* Final gross prices may vary according to local VAT.

Get Access

Abstract

We investigate the statistical properties of the databases generated by the IBM QUEST program. Motivated by the claim (also supported empirical evidence) that item occurrences in real life market basket databases follow a rather different pattern, we propose an alternative model for generating artificial data.