Optimization of Parallel FDTD Computations Based on Structural Redeployment of Macro Data Flow Nodes

Smyk, Adam; Tudruj, Marek

doi:10.1007/11752578_65

Adam Smyk²⁰ &
Marek Tudruj²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3911))

Included in the following conference series:

International Conference on Parallel Processing and Applied Mathematics

818 Accesses

Abstract

This paper shows methodology, which enables profiling macro data flow graphs (MDFG) that represent computation and communication patterns for the Finite Difference Time Domain (FDTD) problem in irregular computational areas. MDFG optimization is performed in three phases: simulation area partitioning with generation of initial MDFG, macro data nodes merging with static load balancing to obtain given number of macro nodes and communication optimization to minimize (balance) inter-node data transmissions, computational cells redeployment to take into account computational system restrictions. Efficiency of computations for several communication systems (MPI, RDMA RB, SHMEM) is discussed. Experimental results obtained by simulation are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bharadwaj, D.G., Mani, V., Robertazi, T.G.: Scheduling Divisible Loads in Parallel and Distributed Systems. IEEE Computer Society Press, Los Alamitos (1996)
Google Scholar
Dutt, S., Deng, W.: VLSI Circuit Partitioning by Cluster-Removal using Iterative Improve-ment Techniques. In: Proc. IEEE International Conference on Computer-Aided Design, pp. 350–355 (1997)
Google Scholar
Fiduccia, C.M., Mattheyses, R.M.: A Linear Time Heuristic for Improving Network Partitions. In: Proc. Nineteenth Design Automation Conference, pp. 175–181 (1982)
Google Scholar
Garey, M., Johnson, D., Stockmeyer, L.: Some simplified NP-complete graph problems. Theoretical Computer Science 1, 237–267 (1976)
Article MathSciNet MATH Google Scholar
Karypis, G., Kumar, V.: Unstructured Graph Partitioning and Sparse Matrix Ordering, Technical Report, Department of Computer Science, University of Minesota (1995), http://www.cs.umn.edu/~kumar
Khan, M.S., Li, K.F.: Fast Graph Partitioning Algorithms. In: Proceedings of IEEE Pacific Rim Conference on Communications, Computers, and Signal Processing, Victoria, B.C., Canada, May 1995, pp. 337–342 (1995)
Google Scholar
Kerighan, B.W., Lin, S.: An efficient heuristic procedure for partitioning graphs. AT&T Bell Labs. Tech. J. 49, 291–307 (1970)
Article Google Scholar
Kirkpatrick, S., Gelatt, C.D., Vecchi, M.P.: Optimization by simulated annealing. Science 220(4598), 671–680 (1983)
Article MathSciNet MATH Google Scholar
Lin, H.X., van Gemund, A.J.C., Meijdam, J.: Scalability analysis and parallel execution of unstructured problems. In: Eurosim 1996 Conference (1996)
Google Scholar
Sedgewick, R.: Algorithms in C, Part 5: Graph Algorithms, 3rd edn., pages 368. Addison-Wesley Professional, Reading (2001)
Google Scholar
Smyk, A., Tudruj, M.: RDMA Control Support for Fine-Grain Parallel Computations. In: PDP 2004, La Coruna, Spain (2004)
Google Scholar
Smyk, A., Tudruj, M.: Parallel Implementation of FDTD Computations Based on Macro Data Flow Paradigm. In: PARELEC 2004, September 7-10, Dresden, Germany (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Polish-Japanese Institute of Information Technology, 86 Koszykowa Str., 02-008, Warsaw, Poland
Adam Smyk
Institute of Computer Science, Polish Academy of Sciences, 21 Ordona Str., 01-237, Warsaw, Poland
Marek Tudruj

Authors

Adam Smyk
View author publications
You can also search for this author in PubMed Google Scholar
Marek Tudruj
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computational and Information Sciences, Czestochowa University of Technology, Poland
Roman Wyrzykowski
Computer Science Department,, University of Tennessee, 37996-3450, Knoxville, TN, USA
Jack Dongarra
Poznan Supercomputing and Networking Center, Poland
Norbert Meyer
Informatics & Mathematical Modeling, Technical University of Denmark, 2800, Lyngby, DK, Denmark
Jerzy Waśniewski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Smyk, A., Tudruj, M. (2006). Optimization of Parallel FDTD Computations Based on Structural Redeployment of Macro Data Flow Nodes. In: Wyrzykowski, R., Dongarra, J., Meyer, N., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2005. Lecture Notes in Computer Science, vol 3911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11752578_65

Download citation

DOI: https://doi.org/10.1007/11752578_65
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34141-3
Online ISBN: 978-3-540-34142-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics