Reducing Memory Requirements in Scientific Computing and Optimal Control
Abstract
In high accuracy numerical simulations and optimal control of time-dependent processes, often both many timesteps and fine spatial discretizations are needed. Adjoint gradient computation, or post-processing of simulation results, requires the storage of the solution trajectories over the whole time, if necessary together with the adaptively refined spatial grids. In this paper we discuss various techniques to reduce the memory requirements, focusing first on the storage of the solution data, which are typically double precision floating point values. We highlight advantages and disadvantages of the different approaches. Moreover, we present an algorithm for the efficient storage of adaptively refined, hierarchic grids, and the integration with the compressed storage of solution data.
Keywords
Optimal Control Problem Proper Orthogonal Decomposition Adjoint Equation Memory Bandwidth Normalize Root Mean Square ErrorNotes
Acknowledgements
The authors gratefully acknowledge support by the DFG Research Center Matheon, project F9.
References
- 1.Alliez, P., Desbrun, M.: Progressive compression for lossless transmission of triangle meshes. In: Proceedings of 28th Annual Conference on Computer Graphics and Interactive Techniques, pp. 195–202. ACM (2001)Google Scholar
- 2.Alliez, P., Gotsman, C.: Recent advances in compression of 3d meshes. In: Dodgson, N., Floater, M., Sabin, M. (eds.) Advances in Multiresolution for Geometric Modelling, Mathematics and Visualization, pp. 3–26. Springer, Berlin/Heidelberg (2005). doi:10.1007/3-540-26808-1_1. http://www.dx.doi.org/10.1007/3-540-26808-1_1
- 3.Avilés, M., Morán, F., García, N.: Progressive lower trees of wavelet coefficients: efficient spatial and SNR scalable coding of 3D models. In: Advances in Mulitmedia Information Processing-PCM 2005 pp. 61–72 (2005)Google Scholar
- 4.Bank, R.E., Sherman, A.H., Weiser, A.: Some refinement algorithms and data structures for regular local mesh refinement. In: Stepleman, R., et al. (eds.) Scientific Computing. Applications of Mathematics and Computing to the Physical Sciences, vol. I. IMACS/North-Holland, Amsterdam (1983)Google Scholar
- 5.Becker, R., Rannacher, R.: An optimal control approach to a posteriori error estimation in finite element methods. Acta Numerica 10(1), 1–102 (2001)MathSciNetMATHGoogle Scholar
- 6.Becker, R., Meidner, D., Vexler, B.: Efficient numerical solution of parabolic optimization problems by finite element methods. Optim. Methods Softw. 22(5), 813–833 (2007). http://www.dx.doi.org/10.1080/10556780701228532
- 7.Burtscher, M., Ratanaworabhan, P.: FPC: a high-speed compressor for double-precision floating-point data. IEEE Trans. Comput. 58(1), 18–31 (2009)MathSciNetCrossRefGoogle Scholar
- 8.Castrillón-Candás, J.E., Amaratunga, K.: Spatially adapted multiwavelets and sparse representation of integral equations on general geometries. SIAM J. Sci. Comput. 24(5), 1530–1566 (2003)MathSciNetCrossRefMATHGoogle Scholar
- 9.Cignoni, P., Rocchini, C., Scopigno, R.: Metro: measuring error on simplified surfaces. Tech. Rep., Paris, France (1996)Google Scholar
- 10.Comas, A.: Time-domain decomposition preconditioners for the solution of discretized parabolic optimal control problems. Ph.D. thesis, Rice University (2005)Google Scholar
- 11.Dahmen, W.: Wavelet methods for PDEs – some recent developments. J. Comput. Appl. Math. 128(1), 133–185 (2001)MathSciNetCrossRefMATHGoogle Scholar
- 12.Denis, L., Satti, S., Munteanu, A., Cornelis, J., Schelkens, P.: Scalable intraband and composite wavelet-based coding of semiregular meshes. IEEE Trans. Multimedia 12(8), 773–789 (2010)CrossRefGoogle Scholar
- 13.Deuflhard, P., Bornemann, F.: Scientific Computing with Ordinary Differential Equations, vol. 42. Springer, Berlin (2002)MATHGoogle Scholar
- 14.Deuflhard, P., Nowak, U.: Extrapolation integrators for quasilinear implicit ODEs. In: Deuflhard, P., Engquist, B. (eds.) Large Scale Scientific Computing. Progress in Scientific Computing, vol. 7, pp. 37–50. Birkhäuser, Basel (1987)CrossRefGoogle Scholar
- 15.Deuflhard, P., Leinen, P., Yserentant, H.: Concepts of an adaptive hierarchical finite element code. IMPACT Comput. Sci. Eng. 1(1), 3–35 (1989)CrossRefMATHGoogle Scholar
- 16.Goeman, B., Vandierendonck, H., De Bosschere, K.: Differential FCM: increasing value prediction accuracy by improving table usage efficiency. In: The 7th International Symposium on High-Performance Computer Architecture, 2001. HPCA, pp. 207–216. IEEE (2001)Google Scholar
- 17.Götschel, S., Weiser, M.: Lossy compression for PDE-constrained optimization: Adaptive error control. ZIB Report, pp. 13–27 (2013)Google Scholar
- 18.Götschel, S., Weiser, M., Schiela, A.: Solving optimal control problems with the Kaskade 7 finite element toolbox. In: Dedner, A., Flemisch, B., Klöfkorn, R. (eds.) Advances in DUNE, pp. 101–112. Springer, New York (2012)CrossRefGoogle Scholar
- 19.Götschel, S., Nagaiah, C., Kunisch, K., Weiser, M.: Lossy compression in optimal control of cardiac defibrillation. J. Sci. Comput. 60(1), 35–59 (2014)MathSciNetCrossRefMATHGoogle Scholar
- 20.Griewank, A.: Achieving logarithmic growth of temporal and spatial complexity in reverse automatic differentiation. Optim. Methods Softw. 1(1), 35–54 (1992)MathSciNetCrossRefGoogle Scholar
- 21.Griewank, A., Walther, A.: Algorithm 799: revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation. ACM Trans. Math. Softw. 26(1), 19–45 (2000)CrossRefMATHGoogle Scholar
- 22.Griewank, A., Walther, A.: Evaluating Derivatives: Principles and Techniques of Algorithmic Differentiation. SIAM, Philadelphia (2008)CrossRefGoogle Scholar
- 23.Heinkenschloss, M.: A time-domain decomposition iterative method for the solution of distributed linear quadratic optimal control problems. J. Comput. Appl. Math. 173(1), 169–198 (2005)MathSciNetCrossRefMATHGoogle Scholar
- 24.Hesse, H.K.: Multiple shooting and mesh adaptation for PDE constrained optimization problems. Ph.D. thesis, University Heidelberg (2008)Google Scholar
- 25.Hesse, H.K., Kanschat, G.: Mesh adaptive multiple shooting for partial differential equations. part I: linear quadratic optimal control problems. J. Numer. Math. 17(3), 195–217 (2009)Google Scholar
- 26.Heuveline, V., Walther, A.: Online checkpointing for parallel adjoint computation in PDEs: application to goal-oriented adaptivity and flow control. In: Euro-Par 2006 Parallel Processing, pp. 689–699. Springer, Berlin (2006)Google Scholar
- 27.Hinze, M., Sternberg, J.: A-revolve: an adaptive memory-reduced procedure for calculating adjoints; with an application to computing adjoints of the instationary Navier-Stokes system. Optim. Methods Softw. 20(6), 645–663 (2005)MathSciNetCrossRefMATHGoogle Scholar
- 28.Hinze, M., Volkwein, S.: Error estimates for abstract linear-quadratic optimal control problems using proper orthogonal decomposition. Comput. Optim. Appl. 39, 319–345 (2008)MathSciNetCrossRefMATHGoogle Scholar
- 29.Hinze, M., Pinnau, R., Ulbrich, M., Ulbrich, S.: Optimization with PDE Constraints. Springer, Berlin (2009)MATHGoogle Scholar
- 30.Ibarria, L., Lindstrom, P., Rossignac, J., Szymczak, A.: Out-of-core compression and decompression of large n-dimensional scalar fields. In: Computer Graphics Forum, vol. 22, pp. 343–348. Wiley Online Library (2003)Google Scholar
- 31.Isenburg, M.: Compressing polygon mesh connectivity with degree duality prediction. In: Graphics Interface Conference Proceedings, pp. 161–170 (2002)Google Scholar
- 32.Isenburg, M., Snoeyink, J.: Mesh collapse compression. In: Proceedings of SIBGRAPI’99, pp. 27–28 (1999)Google Scholar
- 33.Isenburg, M., Snoeyink, J.: Early-split coding of triangle mesh connectivity. In: Graphics Interface Proceedings, pp. 89–97. Canadian Information Processing Society, Toronto, ON (2006)Google Scholar
- 34.Ito, K., Kunisch, K.: Receding horizon optimal control for infinite dimensional systems. ESAIM Control Optim. Calc. Var. 8(1), 741–760 (2002)MathSciNetCrossRefMATHGoogle Scholar
- 35.Iverson, J., Kamath, C., Karypis, G.: Fast and effective lossy compression algorithms for scientific datasets. In: Euro-Par 2012 Parallel Processing, pp. 843–856. Springer, Berlin (2012)Google Scholar
- 36.JavaView Homepage: www.javaview.de (2014)
- 37.Jörres, C., Vossen, G., Herty, M.: On an inexact gradient method using proper orthogonal decomposition for parabolic optimal control problems. Comput. Optim. Appl. 55(2), 1–10 (2013)MathSciNetCrossRefGoogle Scholar
- 38.Kälberer, F., Polthier, K., Reitebuch, U., Wardetzky, M.: Freelence - coding with free valences. Comput. Graph. Forum 24(3), 469–478 (2005)CrossRefGoogle Scholar
- 39.Kälberer, F., Polthier, K., von Tycowicz, C.: Lossless compression of adaptive multiresolution meshes. In: Proceedings of Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI), vol. 22 (2009)Google Scholar
- 40.Kammann, E., Tröltzsch, F., Volkwein, S.: A method of a-posteriori error estimation with application to proper orthogonal decomposition. Tech. Rep. (2011)Google Scholar
- 41.Khodakovsky, A., Guskov, I.: Compression of normal meshes. In: Geometric Modeling for Scientific Visualization, pp. 189–206. Springer, Berlin (2003)Google Scholar
- 42.Khodakovsky, A., Schröder, P., Sweldens, W.: Progressive geometry compression. In: SIGGRAPH’00 Proceedings, pp. 271–278 (2000)Google Scholar
- 43.Kunisch, K., Volkwein, S.: Galerkin proper orthogonal decomposition methods for parabolic problems. Numer. Math. 90, 117–148 (2001)MathSciNetCrossRefMATHGoogle Scholar
- 44.Kunisch, K., Wagner, M.: Optimal control of the bidomain system (I): The monodomain approximation with the Rogers-McCulloch model. Nonlinear Anal. Real World Appl. 13(4), 1525–1550 (2012). doi:10.1016/j.nonrwa.2011.11.003. http://www.sciencedirect.com/science/article/pii/S1468121811003099
- 45.Lakshminarasimhan, S., Shah, N., Ethier, S., Klasky, S., Latham, R., Ross, R., Samatova, N.F.: Compressing the incompressible with ISABELA: in-situ reduction of spatio-temporal data. In: Euro-Par 2011 Parallel Processing, pp. 366–379. Springer, Berlin (2011)Google Scholar
- 46.Lakshminarasimhan, S., Shah, N., Ethier, S., Ku, S.H., Chang, C.S., Klasky, S., Latham, R., Ross, R., Samatova, N.F.: ISABELA for effective in situ compression of scientific data. Concurr. Comput. Pract. Exper. 25, 524–540 (2013)CrossRefGoogle Scholar
- 47.Lee, H., Alliez, P., Desbrun, M.: Angle-analyzer: a triangle-quad mesh codec. In: Drettakis, G., Seidel, H.-P. (eds.) Eurographics 2002 Conference Proceedings, pp. 383–392 (2002)Google Scholar
- 48.Lindstrom, P., Isenburg, M.: Fast and efficient compression of floating-point data. IEEE Trans. Vis. Comput. Graph. 12(5), 1245–1250 (2006). http://www.dx.doi.org/10.1109/TVCG.2006.143
- 49.Martin, G.: Range encoding: an algorithm for removing redundancy from a digitised message. In: Presented at Video & Data Recording Conference, Southampton (1979)Google Scholar
- 50.Nagaiah, C., Kunisch, K., Plank, G.: Numerical solution for optimal control of the reaction-diffusion equations in cardiac electrophysiology. Comput. Optim. Appl. 49, 149–178 (2011). doi:10.1007/s10589-009-9280-3. http://www.dx.doi.org/10.1007/s10589-009-9280-3
- 51.Nielsen, B.F., Ruud, T.S., Lines, G.T., Tveito, A.: Optimal monodomain approximations of the bidomain equations. Appl. Math. Comput. 184(2), 276–290 (2007)MathSciNetCrossRefMATHGoogle Scholar
- 52.Nocedal, J., Wright, S.J.: Numerical Optimization. Springer, New York (2006)MATHGoogle Scholar
- 53.Payan, F., Antonini, M.: An efficient bit allocation for compressing normal meshes with an error-driven quantization. Comput. Aided Geom. Des. 22(5), 466–486 (2005)MathSciNetCrossRefMATHGoogle Scholar
- 54.Peng, J., Kim, C.S., Jay Kuo, C.C.: Technologies for 3d mesh compression: a survey. J. Vis. Commun. Image Represent. 16(6), 688–733 (2005). doi:10.1016/j.jvcir.2005.03.001. http://www.dx.doi.org/10.1016/j.jvcir.2005.03.001
- 55.Rogers, J.M., McCulloch, A.D.: A collocation-Galerkin finite element model of cardiac action potential propagation. IEEE Trans. Biomed. Eng. 41, 743–757 (1994)CrossRefGoogle Scholar
- 56.Rossignac, J.: Edgebreaker: connectivity compression for triangle meshes. IEEE Trans. Vis. Comput. Graph. 5, 47–61 (1999)CrossRefGoogle Scholar
- 57.Sazeides, Y., Smith, J.E.: The predictability of data values. In: Proceedings of 30th Annual IEEE/ACM International Symposium on Microarchitecture, 1997, pp. 248–258. IEEE (1997)Google Scholar
- 58.Schröder, P., Sweldens, W.: Spherical wavelets: efficiently representing functions on the sphere. In: SIGGRAPH ‘95 Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques, pp. 161–172. ACM (1995)Google Scholar
- 59.Shafaat, T.M., Baden, S.B.: A method of adaptive coarsening for compressing scientific datasets. In: Kågström, B., Elmroth, E., Dongarra, J., Wasniewski, J. (eds.) Applied Parallel Computing. State of the Art in Scientific Computing. 8th International Workshop, PARA 2006, Umeå, Sweden, 18–21 June 2006, Revised Selected Papers. Lecture Notes in Computer Science, vol. 4699, pp. 774–780. Springer, New York (2007)Google Scholar
- 60.Shannon, C.E.: A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423 (1948)MathSciNetCrossRefMATHGoogle Scholar
- 61.Shapiro, J.M.: Embedded image coding using zerotrees of wavelet coefficients. IEEE Trans. Signal Process. 41, 3445–3462 (1993)CrossRefMATHGoogle Scholar
- 62.Sternberg, J., Hinze, M.: A memory-reduced implementation of the Newton-CG method in optimal control of nonlinear time-dependent PDEs. Optim. Methods Softw. 25(4), 553–571 (2010)MathSciNetCrossRefMATHGoogle Scholar
- 63.Stumm, P., Walther, A.: Multi-stage approaches for optimal offline checkpointing. SIAM J. Sci. Comput. 31(3), 1946–1967 (2009)MathSciNetCrossRefMATHGoogle Scholar
- 64.Stumm, P., Walther, A.: New algorithms for optimal online checkpointing. SIAM J. Sci. Comput. 32(1), 836–854 (2010)MathSciNetCrossRefMATHGoogle Scholar
- 65.Sweldens, W.: The lifting scheme: a construction of second generation wavelets. SIAM J. Math. Anal. 29(2), 511–546 (1998)MathSciNetCrossRefMATHGoogle Scholar
- 66.Szymczak, A.: Optimized edgebreaker encoding for large and regular triangle meshes. In: DCC ‘02 Proceedings, p. 472. IEEE Computer Society, Washington, DC (2002)Google Scholar
- 67.Teran, R.I., Thole, C.A., Lorentz, R.: New developments in the compression of LS-DYNA simulation results using FEMZIP. In: 6th European LS-DYNA Users’ Conference (2007). https://www.dynalook.com/european-conf-2007/new-developments-in-the-compression-of-ls-dyna.pdf
- 68.Thole, C.A.: Compression of LS-DYNA3DTM simulation results using FEMZIP©. 3. LS-DYNA Anwenderforum (2004)Google Scholar
- 69.Touma, C., Gotsman, C.: Triangle mesh compression. In: Graphics Interface Conference Proceedings, pp. 26–34 (1998)Google Scholar
- 70.Tröltzsch, F., Volkwein, S.: POD a-posteriori error estimates for linear-quadratic optimal control problems. Comput. Optim. Appl. 44, 83–115 (2009)MathSciNetCrossRefMATHGoogle Scholar
- 71.Tutte, W.: A census of planar triangulations. Can. J. Math. 14, 21–38 (1962)MathSciNetCrossRefMATHGoogle Scholar
- 72.Unat, D., Hromadka, T., Baden, S.: An adaptive sub-sampling method for in-memory compression of scientific data. In: Data Compression Conference, 2009 (DCC ‘09), pp. 262–271. IEEE (2009)CrossRefGoogle Scholar
- 73.Valette, S., Prost, R.: Wavelet-based progressive compression scheme for triangle meshes: wavemesh. IEEE Trans. Vis. Comput. Graph. 10(2), 123–129 (2004)CrossRefGoogle Scholar
- 74.van der Vorst, H.A.: Bi-CGSTAB: a fast and smoothly converging variant of bi-cg for the solution of nonsymmetric linear systems. SIAM J. Sci. Stat. Comput. 13, 631–644 (1994)CrossRefGoogle Scholar
- 75.von Tycowicz, C., Kälberer, F., Polthier, K.: Context-based coding of adaptive multiresolution meshes. Comput. Graph. Forum 30(8), 2231–2245 (2011). doi:10.1111/j.1467-8659.2011.01972.x. http://www.dx.doi.org/10.1111/j.1467-8659.2011.01972.x
- 76.Walther, A.: Program reversal schedules for single-and multi-processor machines. Ph.D. thesis, Institute of Scientific Computing, Technical University Dresden, Germany (1999)Google Scholar
- 77.Wang, Q., Moin, P., Iaccarino, G.: Minimal repetition dynamic checkpointing algorithm for unsteady adjoint calculation. SIAM J. Sci. Comput. 31(4), 2549–2567 (2009)MathSciNetCrossRefMATHGoogle Scholar
- 78.Weiser, M., Götschel, S.: State trajectory compression for optimal control with parabolic PDEs. SIAM J. Sci. Comput. 34(1), A161–A184 (2012)CrossRefMATHGoogle Scholar
- 79.Yserentant, H.: On the multi-level splitting of finite element spaces. Numer. Math. 49(4), 379–412 (1986)MathSciNetCrossRefMATHGoogle Scholar