Abstract
The performance of software on modern architectures has grown more and more difficult to predict and analyze, as modern microprocessors have grown more complex. The execution of a program now entails the complex interaction of code, compiler and processor architecture. The current generation of microprocessors is optimized to an existing set of commercial and scientific benchmarks but new applications such as data mining are becoming a significant part of the workload. In this paper we explore the use of performance monitoring hardware to analyze the execution of C4.5, a data mining application, on the IBM Power2 architecture. We see how the data gathered by the hardware can be used to identify potential changes that can be made to the program and the processor micro-architecture to improve performance. We then go on to evaluate changes to C4.5 and to the micro-architecture. Based on our experience, we identify issues that limit the use of performance monitoring hardware in user level tuning and in extending its use to high performance computing environments.
Similar content being viewed by others
References
S. W. White and S. Dhawan. POWER2: Next generation of the RISC system/6000 family. IBM Journal of Research and Development, 38(1):493–502, 1994.
Compaq, Alpha 21264 Hardware Reference Manual, Order No. EC-RJRZA-TE, 1999.
Intel, The IA-32 Intel Architecture Software Developer's Manual Volume 3: System Programming Guide, Order No. 245472, 2003.
E. H. Welbon, C. C. Chan-Nui, D. J. Shippy, and D. A. Hicks. The POWER2 performance monitor. IBM Journal of Research and Development, 38(5):545–554, 1994.
J. Mäki. POWER2 hardware performance monitor tools. In http://www.csc.fi/~jmaki/rs2hpm-paper, 1995.
R. Bergeron. Measurement of a scientific workload using the IBM hardware performance monitor. Proceedings of Supercomputing '98, 1998.
J. Ross Quinlan. C4.5: Programs for Machine Learning, Morgan Kaufmann, 1992.
C. L. Blake and C. J. Merz. UCI Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html], University of California, Department of Information and Computer Science, Irvine, CA, 1998.
M. Rosenblum, S. Herrod, E. Witchel, and A. Gupta. Complete computer simulation: The SimOS approach. IEEE Parallel and Distributed Technology, pp. 34–43, Winter 1995.
T. Keller, A. M. Maynard, R. Simpson, and P. Bohrer. SimOS-PPC full system simulator. http://www.cs.umtexas.edu/users/cart/SimOS.
D. C. Burger, T. M. Austin, and S. Bennett. Evaluating future microprocessors-the SimpleScalar tool set. UW Computer Sciences Technical Report #1308, University of Wisconsin, July 1996.
M. Thoennes. Interaction of a superscalar architecture and a data mining application. IBM Research Report RC22198, IBM TJ Watson Research Center, Yorktown Heights, NY, September 2001.
M. Thoennes and C. Weems. Performance characterization of data mining application via hardware-based monitoring. ITCOM 2001: Commercial Applications for High Performance Computing Proceedings, SPIE, pp. 109–117, August 2001.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Thoennes, M.S., Weems, C.C. Exploration of the Performance of a Data Mining Application via Hardware Based Monitoring. The Journal of Supercomputing 26, 25–42 (2003). https://doi.org/10.1023/A:1024411917202
Issue Date:
DOI: https://doi.org/10.1023/A:1024411917202