A Proposal for Error Handling in OpenMP

  • Alejandro Duran
  • Roger Ferrer
  • Juan José Costa
  • Marc Gonzàlez
  • Xavier Martorell
  • Eduard Ayguadé
  • Jesús Labarta
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4315)

Abstract

OpenMP has focused on performance for numerical applications, but when we try to shift this focus to other kinds of applications, such as Web servers, we find one important gap. In these applications performance is important, but reliability is even more important, and OpenMP has no recovery mechanism. In this paper we present a novel proposal to address this gap.

To add error handling to OpenMP, we propose some extensions to the current OpenMP specification: a directive and a clause that define a scope for error handling (where the error can occur) and specify the behaviour for handling specific errors.
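The idea of an error scope paired with a handling behaviour can be illustrated in plain C. This is only a minimal sketch under stated assumptions: it does not reproduce the directive or clause syntax proposed in the paper, which the abstract does not give. Every name in it (`on_error_policy`, `run_in_error_scope`, `flaky_work`, `count_failures`) is hypothetical. The only OpenMP construct used is the standard `parallel for` with a `reduction` clause, and the code behaves the same when compiled without OpenMP support.

```c
/* Hypothetical illustration: none of these names come from the paper
 * or from the OpenMP specification. */

/* Behaviour to apply when the work inside the scope reports an error. */
typedef enum { ON_ERROR_SKIP, ON_ERROR_RETRY } on_error_policy;

/* A unit of work: returns 0 on success, non-zero on error. */
typedef int (*work_fn)(int item, int attempt);

/* Runs fn on one item inside an "error scope". With ON_ERROR_RETRY the
 * work is re-run up to max_retries times; with ON_ERROR_SKIP the first
 * error is returned to the caller unchanged. */
static int run_in_error_scope(work_fn fn, int item,
                              on_error_policy policy, int max_retries)
{
    int attempt = 0;
    int status = fn(item, attempt);
    while (status != 0 && policy == ON_ERROR_RETRY && attempt < max_retries) {
        attempt++;
        status = fn(item, attempt);
    }
    return status;
}

/* Example work (an assumption for the demo): every third item fails on
 * its first attempt and succeeds on any later attempt. */
static int flaky_work(int item, int attempt)
{
    return (item % 3 == 0 && attempt == 0) ? -1 : 0;
}

/* Processes n items in parallel and counts those still failing after
 * the policy has been applied. */
static int count_failures(on_error_policy policy, int n)
{
    int failures = 0;
    #pragma omp parallel for reduction(+:failures)
    for (int i = 0; i < n; i++) {
        if (run_in_error_scope(flaky_work, i, policy, 1) != 0)
            failures++;
    }
    return failures;
}
```

With nine items, `count_failures(ON_ERROR_SKIP, 9)` counts items 0, 3 and 6 as failed, while `count_failures(ON_ERROR_RETRY, 9)` recovers all of them. A language-level extension like the one proposed would presumably let the compiler and runtime manage the scope and the recovery action instead of hand-written wrappers like these.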

We present some usage examples, along with an evaluation of the impact of this proposal on OpenMP applications. We show that this impact is low enough to consider the proposal worthwhile for OpenMP.

Keywords

Error Recovery, Error Handling, Callback Function, Error Context, Error Support
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Alejandro Duran (1)
  • Roger Ferrer (1)
  • Juan José Costa (1)
  • Marc Gonzàlez (1)
  • Xavier Martorell (1)
  • Eduard Ayguadé (1)
  • Jesús Labarta (1)

  1. Barcelona Supercomputing Center (BSC), Departament d’Arquitectura de Computadors, Universitat Politècnica de Catalunya, Barcelona, Spain
