Skip to main content

Mining Data Stream from a Higher Level of Abstraction: A Class Window Approach

  • Conference paper
  • 1067 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 254))

Abstract

All present techniques for mining data stream can detect concept drift, outlier and pattern by using separate techniques directly on data stream and an applicable single technique from a higher level of abstraction for detecting all these has not been developed. The aim here is to develop a technique which can detect concept drift, outlier and pattern using a single model. In order to achieve this goal, we are switching from the traditional thinking of applying all techniques directly on data stream. Here, we have focused on the class labels of the data which is found using classification. This technique gives us a higher level of abstraction of the data and consumes a lower amount of memory. Moreover, we are proposing the idea to keep track of very old data using a bit table. Our technique can also store the timestamp of the pattern or drift.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Domingos, P., Hulten, G.: A General Framework for Mining Massive Data Streams. In: Knowledge Discovery From Sensor Data, ch. 2, pp. 9–14. CRC Press (2009)

    Google Scholar 

  2. Rodrigues, P.P., Gama, J., Lopes, L.: Requirements for Clustering Streaming Sensors. In: Knowledge Discovery From Sensor Data, ch. 4, pp. 36–51. CRC Press

    Google Scholar 

  3. Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. Springer Science+Business Media, LLC (2007)

    Google Scholar 

  4. Scholz, M., Klinkenberg, R.: Boosting Classifiers for Drifting Concepts

    Google Scholar 

  5. Lin, C.-H., Chiu, D.-Y., Wu, Y.-H., Chen, A.L.P.: Mining Frequent Itemsets from Data Streams with a Time-Sensitive Sliding Window (2005)

    Google Scholar 

  6. Kuncheva, L.I.: Classifier Ensembles for Detecting Concept Change in Streaming Data: Overview and Perspectives. In: Second Workshop SUEMA, ECAI 2008, Partas, Greece, pp. 5–9 (2008)

    Google Scholar 

  7. Aggarwal, C.C., Han, J., Jianyong, Yu, P.S.: On Demand Classification of Data Streams. In: KDD 2004, Seattle, Washington, USA, August 22–25 (2004)

    Google Scholar 

  8. Li, H.-F., Lee, S.-Y.: Mining frequent itemsets over data streams using efficient window sliding techniques (2007)

    Google Scholar 

  9. Wu, X., Kumar, V., Ross Quinlan, J., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G.J., Ng, A., Liu, B., Yu, P.S., Zhou, Z.-H., Steinbach, M., Hand, D.J., Steinberg, D.: Top 10 algorithms in data mining. Springer-Verlag London Limited (2007)

    Google Scholar 

  10. Gama, J., Rodrigues, P.P., Castillo, G.: Evaluating Algorithms that Learn from Data Streams (2007)

    Google Scholar 

  11. Li, I.-H., Liao, I.-E., Pang, W.-Z.: Mining classification rules in the presence of concept drift with an incremental genetic algorithm. Journal of Theoretical and Applied Information Technology (2008)

    Google Scholar 

  12. Han, J., Kamber, M.: Data Mining Concepts and Techniques, 2nd edn., University of Illinois (2006)

    Google Scholar 

  13. Aberer, K., Hauswirth, M., Salehi, A.: Infrastructure for data processing in large-scale interconnected sensor networks In: Distributed Information Systems Lab, Ecole Polytechnique Fédérale de Lausanne, Switzerland, Digital Enterprise Research Institute, National University of Ireland, Galway, Ireland

    Google Scholar 

  14. Chandrasekaran, S., Cooper, O., Deshpande, A., Franklin, M.J., Hellerstein, J.M., Hong, W., Krishnamurthy, S., Madden, S., Raman, V., Reiss, F., Shah, M.: TelegraphCQ: Continuous Dataflow Processing for an Uncertain World In: CIDR (2003)

    Google Scholar 

  15. Thakkar, H., Mozafari, B., Zaniolo, C.: A Data Stream Mining System

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Abdullah-Al-Mamun, Anowarul Abedin, M., Al Arman, M., Mottalib, M.A., Huq, M.R. (2011). Mining Data Stream from a Higher Level of Abstraction: A Class Window Approach. In: Abd Manaf, A., Sahibuddin, S., Ahmad, R., Mohd Daud, S., El-Qawasmeh, E. (eds) Informatics Engineering and Information Science. ICIEIS 2011. Communications in Computer and Information Science, vol 254. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25483-3_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25483-3_38

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25482-6

  • Online ISBN: 978-3-642-25483-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics