Abstract
This work presents a parallel implementation of the implicitly restarted Arnoldi/Lanczos method for the solution of eigenproblems approximated by the finite element method. The implicitly restarted Arnoldi/Lanczos uses a restart scheme in order to improve the convergence of the desired portion of the spectrum, addressing issues such as memory requirements and computational costs related to the generation and storage of the Krylov basis. The presented implementation is suitable for distributed memory architectures, especially PC clusters. In the parallel solution, a subdomain by subdomain approach was implemented and overlapping and non-overlapping mesh partitions were tested. Compressed data structures in the formats CSRC and CSRC/CSR were used to store the coefficient matrices. The parallelization of numerical linear algebra operations present in both Krylov and implicitly restarted methods are discussed. Numerical examples are shown, in order to point out the efficiency and applicability of the proposed method.
Similar content being viewed by others
References
P_ARPACK (1996) An efficient portable large scale eigenvalue package for distributed memory parallel architectures
Arbenz P, Becka M, Geus R, Hetmaniuk U, Mengotti T (2006) On a parallel multilevel preconditioned maxwell eigensolver. Parallel Comput 32: 157–165
Arbenz P, Hetmaniuk U, Lehoucq RB, Tuminaro R (2005) A comparison of eigensolvers for large-scale 3D modal analysis using AMG-preconditioned iterative solvers. Int J Numer Methods Eng 1: 1–21
Baker CG, Hetmaniuk UL, Lehoucq RB, Tornquist HK (2008) Anasazi software for the numerical solution of scale eigenvalue problems. ACM Trans Math Softw 5: 1–22
Balay S, Gropp WD, McInnes LC, Smith BF (1997) Efficient management of parallelism in object oriented numerical software libraries. In: Arge E, Bruaset AM, Langtangen HP (eds) Modern software tools in scientific computing. Birkhäuser, Boston, pp 163–202
Betcke T (2006) A GSVD formulation of a domain decomposition method for planar eigenvalue problems. IMA J Numer Anal 27: 451–478
Bitzarakis S, Papadrakis M, Kotsopulos A (1997) Parallel solution techniques in computational structural mechanics. Comput Methods Appl Mech Eng 148(1): 75–104
Daniel JW, Gragg WB, Kaufman L, Stewart GW (1976) Reorthogonalization and stable algorithms for updating the Gram-Schmidt QR factorization. Math Comput 30: 772–795
Emad N, Petiton S, Edjlali G (2005) Multiple explicitly restarted Arnoldi method for solving large eigenproblems. SIAM J Sci Comput 27(1): 253–277
Ferreira IA (2008) Parallel solution of a thermo-chemo-mechanical model for early age concrete. PhD thesis, Universidade Federal do Rio de Janeiro, COPPE (in Portuguese)
Francis JGF (1961) The QR transformation—part 1. Comput J 4: 265–271
Francis JGF (1962) The QR transformation—part 2. Comput J 4: 332–345
Gensenberger M (2010) Improving the parallel performance of a domain decomposition preconditioning technique in the Jacobi-Davidson method for large scale eigenvalue problems. J Appl Numer Math 60(11): 1083–1099
Golub GH, Loan CFV (1996) Matrix computations, 3rd edn. The Jonh Hopkins University Press, Baltimore
Gropp W (1996) A high-performance, portable implementation of the mpi message passing interface standard. Parallel Comput 22(6): 789–828. doi:10.1016/0167-8191(96)00024-5
Knyazev AV, Skorokhodov L (1994) The preconditioned gradient-type iterative methods in a subspace for partial generalized symmetric eigenvalue problem. SIAM J Numer Anal 31: 451–478
Hernandez V, Roman JE, Tomas A (2007) Parallel Arnoldi eigensolvers with enhanced scalability via global communications rearrangement. Parallel Comput 33(7–8): 521–540
Hernández V, Román JE, Tomás A, Vidal V (2007) SLEPc users manual. Scalable library for eigenvalue problem computations. Universidad Politecnica de Valencia
Hestenes M, Stiefel E (1952) Methods of conjugate gradients for solving linear systems. J Res Natl Bur Stand 49(6): 409–436
Lehoucq RB (1995) Analysis and implementation of an implicitly restarted Arnoldi iteration. PhD thesis, Rice University, Houston, TX
Lehoucq RB (2001) Implicitly restarted Arnoldi method and subspace iteration. SIAM J Matrix Anal Appl 23(2): 551–562
Lehoucq RB (2005) On the convergence of an implicitly restarted Arnoldi method. Technical report, Sandia National Laboratories Albuquerque, NM
Manteuffel TA (1978) Adaptive procedure for estimating parameters for the nonsymmetric Tchebychev iteration. Numer Math 31: 183–208
Meerbergen K, Spence A (1997) Implicitly restarted Arnoldi and purification for the shift-invert transformation. Math Comput 66: 667–689
Nour-Omid B, Parlett BN, Ericsson T, Jensen PS (1987) How to implement the spectral transformation. Math Comput 48(178): 663–673
Paige CC (1970) Pratical use of the symmetric Lanczos process with re-orthogonalization. BIT 10: 183–195
Papadrakis M, Bitzarakis S (1996) Domain decomposition methods for serial and parallel processing. Adv Eng Softw 25: 291–307
Rao ARM (2005) MPI-based parallel finite element approaches for implicit nonlinear dynamic analysis employing sparse PCG solvers. Adv Eng Softw 36: 181–198
Rebollo TC, Vera EC (2004) Study of a non-overlapping domain decomposition method: Poisson and Stokes problems. Appl Numer Math 48: 169–194
Ribeiro FLB, Coutinho ALGA (2005) Comparison between element, edge and compressed storage schemes for iterative solutions in finite element analyses. Int J Numer Methods Eng 63: 569–588
Ribeiro FLB, Ferreira IA (2007) Parallel implementation of the finite element method using compressed data structures. Comput Mech 1: 187–204
Saad Y (1984) Chebyshev acceleration techniques for solving nonsymmetric eigenvalue problems. Math Comput 42: 567–588
Saad Y (1994) Sparsekit: a basic tool kit for sparse matrix computations. Technical report, Computer Science Department, University of Minessota
Saad Y (1996) Iterative methods for sparse linear systems. PWS Publishing Company, Boston
Saad Y, Schultz M (1986) Gmres: a generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J Sci Stat Comput 7: 856–869
Shahnaz R, Usman A, Chughtai I (2006) Implementation and evaluation of parallel sparse matrix-vector products on distributed memory parallel computers. In: IEEE international conference on cluster computing, 1–6. http://doi.ieeecomputersociety.org/10.1109/CLUSTR.2006.311878
Sorensen DC (1992) Implicit application of polynomial filters in a k-step Arnoldi method. SIAM J Matrix Anal Appl 13(1): 357–385
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ainsworth, G.O., Ribeiro, F.L.B. & Magluta, C. A parallel subdomain by subdomain implementation of the implicitly restarted Arnoldi/Lanczos method. Comput Mech 48, 563–577 (2011). https://doi.org/10.1007/s00466-011-0607-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00466-011-0607-4