Abstract
As researchers across scientific domains rapidly adopt advanced scientific computing methodologies, access to advanced cyberinfrastructure (CI) becomes a critical requirement in scientific discovery. Lowering the entry barriers to CI is a crucial challenge in interdisciplinary sciences requiring frictionless software integration, data sharing from many distributed sites, and access to heterogeneous computing platforms. In this paper, we explore how the challenge is not merely a factor of availability and affordability of computing, network, and storage technologies but rather the result of insufficient interfaces with an increasingly heterogeneous mix of computing technologies and data sources. With more distributed computation and data, scientists, educators, and students must invest their time and effort in coordinating data access and movements, often penalizing their scientific research. Investments in the interfaces’ software stack are necessary to help scientists, educators, and students across domains take advantage of advanced computational methods. To this end, we propose developing a science data fabric as the standard scientific discovery interface that seamlessly manages data dependencies within scientific workflows and CI.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
The Quilt - Advanced Regional Networking in Support of Research and Education. https://www.thequilt.net/
Scientific and Engineering Research Facilities: 1999. https://wayback.archive-it.org/5902/20150628160048/http://www.nsf.gov/statistics/nsf04334/pdfstart.htm
Scientific and Engineering Research Facilities: 2001. https://wayback.archive-it.org/5902/20150629121928/http://www.nsf.gov/statistics/nsf02307/sectb.htm
Scientific and Engineering Research Facilities at Colleges and Universities: 1998. https://wayback.archive-it.org/5902/20150627201815/http://www.nsf.gov/statistics/nsf01301/
Scientific and Engineering Research Facilities at Colleges and Universities: 1998 - Appendix E. https://wayback.archive-it.org/5902/20150629135427/http://www.nsf.gov/statistics/nsf01301/appe.htm
ESnet6 Maps (2022). https://www.es.net/welcome-esnet6/esnet6-maps/
Banda, T.: Research and Education Networks in Africa, August 2020
Bohr, M.: A 30 Year retrospective on Dennard’s MOSFET scaling paper. IEEE Solid-State Circuits Soc. Newslett. 12(1), 11–13 (2007). https://doi.org/10.1109/N-SSC.2007.4785534
Chalker, A., Hillegas, C.W., Sill, A., Broude Geva, S., Stewart, C.A.: Cloud and on-premises data center usage, expenditures, and approaches to return on investment: a survey of academic research computing organizations. In: Practice and Experience in Advanced Research Computing, pp. 26–33. ACM, Portland OR USA, July 2020. https://doi.org/10.1145/3311790.3396642
Chen, J., Ghafoor, S., Impagliazzo, J.: Producing competent HPC graduates. Commun. ACM 65(12), 56–65 (2022). https://doi.org/10.1145/3538878
FCC: FCC National Broadband Map (2023). https://broadbandmap.fcc.gov/home
GEANT: GÉANT Connectivity Map (2023). https://map.geant.org/
Gibbons, M.: Computing and Networking Capacity Increases at Academic Research Institutions (2013)
Holland, T.M.: ATAK Improves Situational Awareness for California Fire Department. https://insights.samsung.com/2019/10/16/atak-improves-situational-awareness-for-california-fire-department/, October 2019
Ian: Answer to "How computationally powerful is an Arduino Uno board?", November 2012. https://robotics.stackexchange.com/a/538
Internet2: Operations and Support (2023). https://internet2.edu/network/operations-and-support/
Luettgau, J.: Maps of the Top500 Supercomputers over Time, November 2022. https://doi.org/10.5281/zenodo.7606369
Luettgau, J., Kirkpatrick, C.R., Scorzelli, G., Pascucci, V., Tarcea, G., Taufer, M.: NSDF-catalog: lightweight indexing service for democratizing data delivering. In: IEEE ACM International Conference on Utility and Cloud Computing (UCC2022) (2022)
Luettgau, J., Olaya, P., Zhou, N., Scorzelli, G., Pascucci, V., Taufer, M.: NSDF-Cloud: enabling ad-hoc compute clusters across academic and commercial clouds. In: Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing, pp. 279–280. ACM, Minneapolis MN USA, June 2022. https://doi.org/10.1145/3502181.3533710
NVIDIA: ADA GPU Architecture V1.01 (2022). https://images.nvidia.com/aem-dam/Solutions/geforce/ada/nvidia-ada-gpu-architecture.pdf
Olaya, P., et al.: Building trust in earth science findings through data traceability and results explainability. IEEE Trans. Parallel Distrib. Syst. 34(2), 704–717 (2023). https://doi.org/10.1109/TPDS.2022.3220539
Olaya, P., et al.: NSDF-FUSE: a testbed for studying object storage via FUSE file systems. In: Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing, pp. 277–278. ACM, Minneapolis MN USA, Jun 2022. https://doi.org/10.1145/3502181.3533709
RedCLARA: Network Maps, Mar 2020. https://www.redclara.net/index.php/en/recursos/publicaciones-para-difusion/mapas-de-la-red
Sony Entertainment: Announcement of the Playstation 4, April 2013. https://web.archive.org/web/20130424075309/http://scei.co.jp/corporate/release/130221a_e.html
Tarcea, G., et al.: The materials commons data repository. In: 2022 IEEE 18th International Conference on E-Science (e-Science), Salt Lake City, UT, USA, pp. 405–406. IEEE, October 2022. https://doi.org/10.1109/eScience55777.2022.00060
TEIN: Network Maps (2020). https://www.tein.asia/sub/?mc=2030
Top500: Top500 Supercomputing Sites (2019). http://www.top500.org/
Vince Weaver: The GFLOPS/W of the various machines in the VMW Research Group (2023). https://web.eece.maine.edu/vweaver/group/green_machines.html
Acknowledgment
This research was supported by the National Science Foundation (NSF) under grant numbers #1841758, #2028923, #2103845, and #2138811; the Advanced Cyberinfrastructure Coordination Ecosystem: Services and Support (ACCESS) program, under allocation TG-CIS210128; Chameleon Cloud under allocation CHI-210923; and IBM through a Shared University Research Award.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Luettgau, J., Scorzelli, G., Pascucci, V., Taufer, M. (2023). Development of Large-Scale Scientific Cyberinfrastructure and the Growing Opportunity to Democratize Access to Platforms and Data. In: Streitz, N.A., Konomi, S. (eds) Distributed, Ambient and Pervasive Interactions. HCII 2023. Lecture Notes in Computer Science, vol 14036. Springer, Cham. https://doi.org/10.1007/978-3-031-34668-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-031-34668-2_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34667-5
Online ISBN: 978-3-031-34668-2
eBook Packages: Computer ScienceComputer Science (R0)