Porting a global ocean model onto a shared-memory multiprocessor: Observations and guidelines

Procassini, Richard J.; Whitman, Scott R.; Dannevik, William P.

doi:10.1007/BF01206241

Porting a global ocean model onto a shared-memory multiprocessor: Observations and guidelines

Published: September 1993

Volume 7, pages 287–321, (1993)
Cite this article

The Journal of Supercomputing Aims and scope Submit manuscript

Richard J. Procassini¹,
Scott R. Whitman¹ &
William P. Dannevik¹

33 Accesses
3 Citations
Explore all metrics

Abstract

A three-dimensional global ocean circulation model has been modified to run on the BBN TC2000 multiple instruction stream/multiple data stream (MIMD) parallel computer. Two shared-memory parallel programming models have been used to implement the global ocean model on the TC2000: the TCF (TC2000 Fortran) fork-join model and the PFP (Parallel Fortran Preprocessor) split-join model. The method chosen for the parallelization of this global ocean model on a shared-memory MIMD machine is discussed. The performance of each version of the code has been measured by varying the processor count for a fixed-resolution test case. The statically scheduled PFP version of the code achieves a higher parallel computing efficiency than does the dynamically scheduled TCF version of the code. The observed differences in the performance of the TCF and PFP versions of the code are discussed. The parallel computing performance of the shared-memory implementation of the global ocean model is limited by several factors, most notably load imbalance and network contention. The experience gained while porting this large, “real world” application onto a shared-memory multiprocessor is also presented to provide insight to the reader who may be contemplating such an undertaking.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Parallelizing the dual revised simplex method

Article Open access 14 December 2017

Containerization technologies: taxonomies, applications and challenges

Article 08 June 2021

MT-3000: a heterogeneous multi-zone processor for HPC

Article 24 May 2022

References

Beeler, M. 1990.Inside the TC2000 Computer, First Release. BBN Advanced Computers, Inc., Cambridge, Mass., pp. 59–62.
Google Scholar
Chervin, R.M. 1990. Private commun.
Crowther, W., Goodhue, J., Starr, E., Thomas, R., Milliken, W., and Blackadar, T. 1985. Performance measurements on a 128-node Butterfly parallel processor. InProc., 1985 Internat. Conf. on Parallel Processing (St. Charles, Ill., Aug. 20–23), IEEE Comp. Soc. Press, pp. 531–539.
Google Scholar
Cvetanovic, Z. 1987. The effects of problem partitioning, allocation, and granularity on the performance of multipleprocessor systems.IEEE Trans. Comp., C-36, 4 (Apr.): 421–432.
Google Scholar
Iqbal, M.A., Saltz, J.H., and Bokhari, S.H. 1986. Performance tradeoffs in static and dynamic load balancing strategies. InProc., 1986 Internat. Conf. on Parallel Processing (St. Charles, Ill., Aug. 19–22), IEEE Comp. Soc. Press, pp. 1040–1046.
Google Scholar
Lapidus, L., and Pinder, G.F. 1982.Numerical Solution of Partial Differential Equations in Science and Engineering. Wesley-Interscience Pub., New York, pp. 385–421.
Google Scholar
LeBlanc, T.J. 1988. Problem decomposition and communication tradeoffs in a shared-memory multiprocessor. InNumerical Algorithms for Modern Parallel Computer Achitectures, IMA Vols. in Mathematics and Its Applications, vol. 16, Springer-Verlag, pp. 145–162.
LeBlanc, T.J., Scott, M.L., and Brown, C.M. 1988. Large-scale parallel programming: Experience with the BBN Butterfly parallel processor.ACM/SIGPLAN Notices, 23, 9 (Sept.): 161–172.
Google Scholar
Moy, B. 1989.Uniform System Programming in Fortran, Revision Beta 2.0 BBN Advanced Computers, Inc., Cambridge, Mass.
Google Scholar
NERSC Documentation Group. 1990.Autotasking Document. Nat. Energy Reseach Supercomputer Center, Livermore, Calif.
Google Scholar
Quigley, A. 1990.TC2000 FORTRAN Reference, Revision 2.0 BBN Advanced Compuers, Inc., Cambridge, Mass., pp. 331–384.
Google Scholar
Rettberg, R., and Thomas, R. 1986. Contention is no obstacle to shared-memory multiprocessing.CACM, 29, 12 (Dec): 1202–1212.
Google Scholar
Semtner, A.J. 1986. Finite-difference formulation of a world ocean model. InAdvaced Physical Oceanographic Numerical Modeling, Reidel Pub. Co., pp. 187–202.
Semtner, A.J., and Chervin, R.M. 1988. A simulation of the global ocean circulation with resolved eddies.J. Geophys. Res., 93, C12 (Dec.): 15,502–15,522.
Google Scholar
Warren, K., Gorda, B., and Brooks, E.D. III. 1991. Programming in PFP. Rep. UCRL-MA-107028, Lawrence Livermore National Laboratory, Livermore, Calif.
Google Scholar
Whitman S.R. 1991. Utilizing scalable shared memory multiprocessors for computer graphics rendering. Ph.D. diss., The Ohio State Univ., Columbus, Oh.
Google Scholar

Download references

Author information

Authors and Affiliations

Lawrence Livermore National Laboratory, University of California, 94550, Livermore, CA
Richard J. Procassini, Scott R. Whitman & William P. Dannevik

Authors

Richard J. Procassini
View author publications
You can also search for this author in PubMed Google Scholar
Scott R. Whitman
View author publications
You can also search for this author in PubMed Google Scholar
William P. Dannevik
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Procassini, R.J., Whitman, S.R. & Dannevik, W.P. Porting a global ocean model onto a shared-memory multiprocessor: Observations and guidelines. J Supercomput 7, 287–321 (1993). https://doi.org/10.1007/BF01206241

Download citation

Received: 15 September 1991
Accepted: 15 March 1993
Issue Date: September 1993
DOI: https://doi.org/10.1007/BF01206241

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Porting a global ocean model onto a shared-memory multiprocessor: Observations and guidelines

Abstract

Access this article

Similar content being viewed by others

Parallelizing the dual revised simplex method

Containerization technologies: taxonomies, applications and challenges

MT-3000: a heterogeneous multi-zone processor for HPC

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Porting a global ocean model onto a shared-memory multiprocessor: Observations and guidelines

Abstract

Access this article

Similar content being viewed by others

Parallelizing the dual revised simplex method

Containerization technologies: taxonomies, applications and challenges

MT-3000: a heterogeneous multi-zone processor for HPC

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation