Skip to main content

Interactive Supercomputing for Experimental Data-Driven Workflows

  • Conference paper
  • First Online:
Tools and Techniques for High Performance Computing (HUST 2019, SE-HER 2019, WIHPC 2019)

Abstract

Large scale experimental facilities such as the Swiss Light Source and the free-electron X-ray laser SwissFEL at the Paul Scherrer Institute, and the particle accelerators and detectors at CERN are experiencing unprecedented data generation growth rates. Consequently, management, processing and storage requirements of data are increasing rapidly. Historically, online and on-demand processing of data generated by the instruments used to be tightly-coupled with a dedicated, domains-specific, site-local IT infrastructure. Cost and performance scaling of these facilities not only pose technical but also planning and scheduling challenges. Supercomputing ecosystems optimize cost and scaling for computing and storage resources but typically exploit a shared batch access model, which is optimized for high utilization of compute resources. In comparison, in public clouds, on-demand service delivery models address the concept of elasticity while maintaining isolation with performance trade-offs. Furthermore, these on-demand access models allow for different degrees of privileges to users for managing IT infrastructure services, in contrast with shared, bare-metal supercomputing ecosystems. This paper outlines an approach for enabling interactive, on-demand supercomputing for experimental data-driven workflows, which are characterised by a managed but bursty data and computing requirements. We present a delegated batch reservation model, controlled by the customer and provisioned by the supercomputing site, that allows scientists at the experimental facility to couple generation of data to the allocation of compute, data and network resources at the supercomputing centre. Scientists are then able to manage resources both at the experimental and supercomputing facilities interactively for managing their scientific workflows. Prototype implementation demonstrates that this rather simple co-designed extension to a supercomputing classic batch scheduling system with a controlled degree of privilege can be easily incorporated to the experimental facilities existing IT resource management and scheduling pipelines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.cscs.ch/user-lab/overview/.

References

  1. Alam, S.R., Gilly, L., McMurtrie, C., Schulthess, T.C.: CSCS and the Piz Daint System, pp. 149–174, May 2019

    Google Scholar 

  2. Alam, S.R., Martinasso, M., Schulthess, T.C.: Hybrid cloud and HPC services for extreme data workflows. In: Extreme Data: Demands, Technologies, and Services - A Community Workshop (2018)

    Google Scholar 

  3. Benedicic, L., Cruz, F.A., Madonna, A., Mariotti, K.: Portable, high-performance containers for HPC. CoRR, abs/1704.03383 (2017)

    Google Scholar 

  4. Cameron, D., et al.: The advanced resource connector for distributed LHC computing. PoS (2008)

    Google Scholar 

  5. Martinasso, M., Gila, M., Bianco, M., Alam, S.R., McMurtrie, C., Schulthess, T.C.: RM-replay: a high-fidelity tuning, optimization and exploration tool for resource management. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2018 (2018)

    Google Scholar 

  6. Milne, C., et al.: SwissFEL: the Swiss X-ray free electron laser. Appl. Sci. 7(7), 720 (2017)

    Article  Google Scholar 

  7. Paul Scherrer Institut: cSAXS X12SA: Coherent Small-Angle X-ray Scattering. https://www.psi.ch/en/sls/csaxs. Accessed 20 Sept 2019

  8. SchedMD: Slurm workload manager - scontrol. https://slurm.schedmd.com/scontrol.html. Accessed 20 Sept 2019

  9. SchedMD: Slurm workload manager - user permissions. https://slurm.schedmd.com/user_permissions.html. Accessed 20 Sept 2019

Download references

Acknowledgements

We would like to thank our colleagues at PSI for their insightful remarks and their input for co-designing the early prototype. The work presented in this paper is partly funded by a swissuniversities P-5 grant called SELVEDAS (Services for Large Volume Experiment-Data Analysis utilising Supercomputing and Cloud technologies at CSCS).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Siew Hoon Leong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Klein, M., Martinasso, M., Leong, S.H., Alam, S.R. (2020). Interactive Supercomputing for Experimental Data-Driven Workflows. In: Juckeland, G., Chandrasekaran, S. (eds) Tools and Techniques for High Performance Computing. HUST SE-HER WIHPC 2019 2019 2019. Communications in Computer and Information Science, vol 1190. Springer, Cham. https://doi.org/10.1007/978-3-030-44728-1_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-44728-1_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-44727-4

  • Online ISBN: 978-3-030-44728-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics