Skip to main content
Log in

Exploration of the Performance of a Data Mining Application via Hardware Based Monitoring

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

The performance of software on modern architectures has grown more and more difficult to predict and analyze, as modern microprocessors have grown more complex. The execution of a program now entails the complex interaction of code, compiler and processor architecture. The current generation of microprocessors is optimized to an existing set of commercial and scientific benchmarks but new applications such as data mining are becoming a significant part of the workload. In this paper we explore the use of performance monitoring hardware to analyze the execution of C4.5, a data mining application, on the IBM Power2 architecture. We see how the data gathered by the hardware can be used to identify potential changes that can be made to the program and the processor micro-architecture to improve performance. We then go on to evaluate changes to C4.5 and to the micro-architecture. Based on our experience, we identify issues that limit the use of performance monitoring hardware in user level tuning and in extending its use to high performance computing environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. S. W. White and S. Dhawan. POWER2: Next generation of the RISC system/6000 family. IBM Journal of Research and Development, 38(1):493–502, 1994.

    Google Scholar 

  2. Compaq, Alpha 21264 Hardware Reference Manual, Order No. EC-RJRZA-TE, 1999.

  3. Intel, The IA-32 Intel Architecture Software Developer's Manual Volume 3: System Programming Guide, Order No. 245472, 2003.

  4. E. H. Welbon, C. C. Chan-Nui, D. J. Shippy, and D. A. Hicks. The POWER2 performance monitor. IBM Journal of Research and Development, 38(5):545–554, 1994.

    Google Scholar 

  5. J. Mäki. POWER2 hardware performance monitor tools. In http://www.csc.fi/~jmaki/rs2hpm-paper, 1995.

  6. R. Bergeron. Measurement of a scientific workload using the IBM hardware performance monitor. Proceedings of Supercomputing '98, 1998.

  7. J. Ross Quinlan. C4.5: Programs for Machine Learning, Morgan Kaufmann, 1992.

  8. C. L. Blake and C. J. Merz. UCI Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html], University of California, Department of Information and Computer Science, Irvine, CA, 1998.

    Google Scholar 

  9. M. Rosenblum, S. Herrod, E. Witchel, and A. Gupta. Complete computer simulation: The SimOS approach. IEEE Parallel and Distributed Technology, pp. 34–43, Winter 1995.

  10. T. Keller, A. M. Maynard, R. Simpson, and P. Bohrer. SimOS-PPC full system simulator. http://www.cs.umtexas.edu/users/cart/SimOS.

  11. D. C. Burger, T. M. Austin, and S. Bennett. Evaluating future microprocessors-the SimpleScalar tool set. UW Computer Sciences Technical Report #1308, University of Wisconsin, July 1996.

  12. M. Thoennes. Interaction of a superscalar architecture and a data mining application. IBM Research Report RC22198, IBM TJ Watson Research Center, Yorktown Heights, NY, September 2001.

    Google Scholar 

  13. M. Thoennes and C. Weems. Performance characterization of data mining application via hardware-based monitoring. ITCOM 2001: Commercial Applications for High Performance Computing Proceedings, SPIE, pp. 109–117, August 2001.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Thoennes, M.S., Weems, C.C. Exploration of the Performance of a Data Mining Application via Hardware Based Monitoring. The Journal of Supercomputing 26, 25–42 (2003). https://doi.org/10.1023/A:1024411917202

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1024411917202

Navigation