Knowledge Discovery in Databases: PKDD 2007

Volume 4702 of the series Lecture Notes in Computer Science pp 398-405

Realistic Synthetic Data for Testing Association Rule Mining Algorithms for Market Basket Databases

  • Colin Cooper, Department of Computer Science, King’s College, London WC2R 2LS
  • Michele Zito, Department of Computer Science, University of Liverpool, Liverpool, L69 3BX


We investigate the statistical properties of the databases generated by the IBM QUEST program. Motivated by the claim (also supported by empirical evidence) that item occurrences in real-life market basket databases follow a rather different pattern, we propose an alternative model for generating artificial data.