False-Negative Frequent Items Mining from Data Streams with Bursting

  • Zhihong Chong
  • Jeffrey Xu Yu
  • Hongjun Lu
  • Zhengjie Zhang
  • Aoying Zhou
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3453)

Abstract

False-negative frequent items mining from a high speed transactional data stream is to find an approximate set of frequent items with respect to a minimum support threshold, s. It controls the possibility of missing frequent items using a reliability parameter δ. The importance of false-negative frequent items mining is that it can exclude false-positives and therefore significantly reduce the memory consumption for frequent itemsets mining. The key issue of false-negative frequent items mining is how to minimize the possibility of missing frequent items. In this paper, we propose a new false-negative frequent items mining algorithm, called Loss-Negative, for handling bursting in data streams. The new algorithm consumes the smallest memory in comparison with other false-negative and false-positive frequent items algorithms. We present theoretical bound of the new algorithm, and analyze the possibility of minimization of missing frequent items, in terms of two possibilities, namely, in-possibility and out-possibility. The former is about how a frequent item can possibly pass the first pruning. The latter is about how long a frequent item can stay in memory while no occurrences of the item comes in the following data stream for a certain period. The new proposed algorithm is superior to the existing false-negative frequent items mining algorithms in terms of the two possibilities. We demonstrate the effectiveness of the new algorithm in this paper.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Zhihong Chong
    • 1
  • Jeffrey Xu Yu
    • 2
  • Hongjun Lu
    • 3
  • Zhengjie Zhang
    • 1
  • Aoying Zhou
    • 1
  1. 1.Fudan UniversityChina
  2. 2.Chinese University of Hong KongChina
  3. 3.Hong Kong University of Science and TechnologyChina

Personalised recommendations