Skip to main content
Log in

A Holistic Approach for Resource-aware Adaptive Data Stream Mining

  • Published:
New Generation Computing Aims and scope Submit manuscript

Abstract

Mining data streams is a field of increasing interest due to the importance of its applications and dissemination of data stream sources. Most of the streaming techniques developed so far have not addressed the need for resource-aware computing in data stream analysis. The fact that streaming information is often generated or received onboard resource-constrained computational devices such as sensor nodes and mobile devices motivates the need for resource-awareness in data stream processing systems. In this paper, we propose a generic framework that enables resource-awareness in streaming computation using algorithm granularity settings in order to change the resource consumption patterns periodically. This generic framework is applied to a novel threshold-based micro-clustering algorithm to test its validity and feasibility. We have termed this algorithm as RA-Cluster. RA-Custer is the first data stream clustering algorithm that can adapt to the changing availability of different resources. The experimental results show the applicability of the framework and the algorithm in terms of resource-awareness and accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Aggarwal, C., Han, J., Wang, J. and Yu, P.S., “A Framework for Clustering Evolving Data Streams,” in Proc. of VLDB, 2003.

  2. Bhargava, R., Kargupta, H. and Powers, M., “Energy Consuption in Data Analysis for On-board and Distributed Applications,” in Proc. of the ICML workshop on Machine Learning Technologies for Autonomous Space Applications, 2003.

  3. Castano, B., Judd, M., Anderson, R.C. and Estlin, T., “Machine Learning Challenges in Mars Rover Traverse Science,” in Proc. of the ICML Workshop on Machine Learning Technologies for Autonomous Space Applications, 2003.

  4. Chi, Y., Yu, P.S., Wang, H. and Muntz, R.R., “A Load Schedding Scheme for Classifying Data Streams,” in Proc. of SIAM SDM, 2005.

  5. Gaber, M.M., Zaslavsky, A. and Krishnaswamy, S., “A Cost-efficient Model for Ubiquitous Data Stream Mining,” in Proc. of IPMU, 2004.

  6. Gaber, M.M., Zaslavsky, A. and Krishnaswamy, S., “Mining Data Streams: A Review,” ACM SIGMOD Record, 34, 2, June, 2005.

  7. Hettich, S., Blake, C.L. and Merz, C.J., UCI Repository of Machine Learning Databases, 1998.

  8. Srivastava, A. and Stroeve, J., “Onboard Detection of Snow, Ice, Clouds and Other Geophysical Processes Using Kernel Methods,” in Proc. of the ICML workshop on Machine Learning Technologies for Autonomous Space Applications, 2003.

  9. Tanner, S., Alshayeb, M., Criswell, E., Iyer, M., McDowell, A., McEniry, M. and Regner, K., “EVE: On-board Process Planning and Execution,” in Earth Science Technology Conference, Pasadena, 2002.

  10. Teng, W., Chen, M. and Yu, P.S., “Resource-aware Mining with Variable Granularities in Data Streams,” in Proc. of SIAM SDM, 2004.

  11. Zhang, T., Ramakrishnan, R. and Livny, M., “BIRCH: An Efficient Data Clustering Method for Very Large Databases,” ACM SIGMOD Record, 25, 2, June, 1996.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamed Medhat Gaber.

About this article

Cite this article

Gaber, M.M., Yu, P.S. A Holistic Approach for Resource-aware Adaptive Data Stream Mining. New Gener. Comput. 25, 95–115 (2006). https://doi.org/10.1007/s00354-006-0005-1

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00354-006-0005-1

Keywords

Navigation