Deep Learning Driven Self-adaptive Hp Finite Element Method

Paszyński, Maciej; Grzeszczuk, Rafał; Pardo, David; Demkowicz, Leszek

doi:10.1007/978-3-030-77961-0_11

Maciej Paszyński¹³,
Rafał Grzeszczuk¹³,
David Pardo^14,15,16 &
…
Leszek Demkowicz¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12742))

Included in the following conference series:

International Conference on Computational Science

2991 Accesses
3 Citations

Abstract

The finite element method (FEM) is a popular tool for solving engineering problems governed by Partial Differential Equations (PDEs). The accuracy of the numerical solution depends on the quality of the computational mesh. We consider the self-adaptive hp-FEM, which generates optimal mesh refinements and delivers exponential convergence of the numerical error with respect to the mesh size. Thus, it enables solving difficult engineering problems with the highest possible numerical accuracy. We replace the computationally expensive kernel of the refinement algorithm with a deep neural network in this work. The network learns how to optimally refine the elements and modify the orders of the polynomials. In this way, the deterministic algorithm is replaced by a neural network that selects similar quality refinements in a fraction of the time needed by the original algorithm.

You have full access to this open access chapter, Download conference paper PDF

HiDeNN-FEM: a seamless machine learning approach to nonlinear finite element analysis

Article 11 April 2023

Design of Efficient Finite Elements Using Deep Learning Approach

Design of Efficient Quadrature Scheme in Finite Element Using Deep Learning

Keywords

1 Introduction

The self-adaptive hp-Finite Element Method (FEM) has been developed for many years by the community of applied mathematicians working in the field of numerical analysis [3,4,5,6, 9]. They require extremely high numerical accuracy, which is difficult to obtain by other numerical methods. In this paper, we refer to the iterative Algorithm 1 proposed by [3], and we introduce the simplified, one-step, Algorithm 2 as a kernel for the selection of the optimal refinements for the interiors of elements. The edge refinements are adjusted by taking the minimum of the corresponding orders of interiors. We further propose how to replace Algorithm 2 with a Deep Neural Network (DNN).

The DNN can make similar quality decisions about mesh refinements as Algorithm 2, while the online computational time is reduced. The main motivation for this work is the following observation. We have noticed that making random of 10% of the decision about element refinements made by the self-adaptive hp-FEM algorithm does not disturb the algorithm’s exponential convergence. Thus, the possibility of teaching the deep neural network making decisions optimal up to 90% is enough to keep the exponential convergence.

2 Self-adaptive hp-FEM with Neural Network

We focus on the L-shape domain model problem [5, 6] to illustrate the self-adaptive applicability hp-FEM algorithm for the solution of a model problem with a singular point. The gradient of the solution tends to infinity, and intensive mesh refinements are needed to approximate this behavior properly.

We describe in Algorithm 1 the self-adaptive hp-algorithm, initially introduced by [3]. It utilizes Algorithm 2 for the selection of the optimal refinements over element K. This algorithm delivers exponential convergence of the numerical error with respect to the mesh size, which has been verified experimentally by multiple numerical examples [3, 4].

Our goal is to replace Algorithm 2 with a deep neural network. The left column in Fig. 2 presents the optimal distribution of refinements, as provided by the deterministic algorithm. We can see that all the h refinements (breaking of elements) are performed towards the point singularity. We also see that the p refinements are surrounding the singularity as layers with a different color. They change from red, light and dark pink (\(p=6,7,8\)), through brown (\(p=5\)), yellow (\(p=4\)), green (\(p=3\)), blue (\(p=2\)) and dark blue (\(p=1\)) close to the singularity.

The refinements performed by the iterative Algorithm 1 are executed first closer to the singularity. With the iterations, the differences between the coarse and fine mesh solution tend to zero [3].

Dataset. We propose the following samples to train the DNN:

Input variables: coarse mesh solution \(u_{hp} \in V_{hp}\) for element K, the element sizes and coordinates, the norm of the fine mesh solution over element K, the maximum norm of the fine mesh solution over elements

Output variables: Optimal refinement \(V_{opt}^K\) for element K.

We construct the dataset by executing the deterministic Algorithm 1 for the model L-shape domain problem. We perform 50 iterations of the hp-adaptivity, generating over 10,000 deterministic element refinements, resulting in 10,000 samples. We repeat this operation for rotated boundary conditions (4) by the following angles: 10, 20, 30, 40, 50, 60, 70, 80, and 90\(^\circ \). Each rotation changes the solution and the samples. We obtain a total of 100,000 samples. We randomly select 90% of the samples for training and use the remaining 10% as a test set. We further sample the training data and use 10% of training data as a validation set. After one-hot encoding the categorical variables, each sample is represented by a 136-dimensional vector. Since it is much more common for the deterministic algorithm to make specific h refinement decisions (nref) for the L-shape domain problem, the dataset is imbalanced. To mitigate this, we apply supersampling of underrepresented nref classes. DNN Architecture. We use a feed-forward DNN[1] with 12 fully-connected layers. After 8 layers, the network splits into 6 branches, 4 layers each: the first branch decides about the optimal nref parameter - h refinement, the remaining branches decide about modifying the polynomial orders - p refinement. Experiments have shown that further expanding of the network makes it prone to overfitting [8]. Splitting the network into branches assures sufficient parameter freedom for each variable. This approach also simplifies the model: there is no need to train a DNN for each variable. Since all possible decisions are encoded as categorical variables, we use cross-entropy as the loss function. We encoded the input data as a 136-dimensional normalized vector, as detailed in Table 1. We assume that the polynomial degree will not exceed \(n=11\). We train the network for up to 200 epochs with validation loss-based early stopping on an Nvidia Tesla v100 GPGPU with 650 tensor cores available in ACK Cyfronet PROMETHEUS cluster [2]. To minimize the loss function, we use the Adam optimizer [7], with the learning rate set to \(10e^{-3}\). We apply kernel L2-penalty throughout the training as a means of regularization and dropout [10] with probability 0.5. The network converges after approximately 110 epochs.

Table 1. Dimensionality of specific input features to the DNN, encoded in a single 136-dimensional vector. Polynomial coefficients that do not exist in a given polynomial order are always 0.

Full size table

DNN Performance. The network achieved over 92% accuracy on the test set. We run three tests to assess whether such a network can be used in the hp-FEM.

First numerical experiment is to reproduce the deterministic Algorithm 2 for the original L-shape domain problem, presented in Figs. 1, 2 and left panel in Fig. 3. Both deterministic and DNN-driven algorithms provide exponential convergence. The verification phase shows that the DNN makes up to 50% of incorrect decisions when the element sizes go down to \(10^{-7}\) and less, see the right panel in Fig. 3. Thus, at the zoom of 100,000 times, we see some differences in Fig. 2. Despite that, the algorithm still converges exponentially.

Second Numerical Experiment. We run the self-adaptive hp-FEM algorithm, and we provide zeros as the coarse mesh solution degrees of freedom. We get the same convergence. This second test shows that the DNN is not sensitive with respect to the coarse mesh solution and that the norm of the fine mesh solution, the maximum norm, and the coordinates and dimensions of the elements are enough to make proper decisions. The DNN looks at the fine mesh solution’s norms at the given and neighboring elements and, based on these data in decides whether the element is to be broken and how it should be broken. Thus, we can replace Algorithm 2 and the coarse mesh solution phase with the DNN. Left panel in Fig. 4 presents the execution times of particular parts of the hp-FEM algorithm. The removal of the coarse mesh solution phase and replacing Algorithm 2 by the DNN saves up to 50% of the execution times.

Third Numerical Experiment. The third test, illustrated in Fig. 5, concerns the L-shape domain algorithm with boundary conditions rotated 45\(^\circ \) (no samples for this case were provided in the training set). The DNN hp-FEM also provides exponential convergence in this case.

Fourth Numerical Experiment. The last test, illustrated in the right panel in Fig. 4 concerns randomly disturbed mesh, different from the training set. The DNN captures both top and bottom singularities. It produces hp refinements towards the bottom singularity and p refinements towards the top singularity. The resulting accuracy was 1% of the relative error after ten iterations.

3 Conclusions

We replaced the algorithm selecting optimal refinements in the self-adaptive hp-FEM by a deep neural network. We obtained over 92% of correct answers, the same accuracy of the final mesh, and exponential convergence of the mesh refinement algorithm. A very interesting observation is that DNN requires coordinates of elements (to recognize the adjacency between elements), the dimensions of elements (to recognize the refinement level), the \(H^1\) norm of the solution over the element, and the maximum norm of the solutions over elements. The DNN by “looking” at the norms over adjacent elements, recognizes with 92% accuracy the proper p-refinement of the element. The replacement of the coarse mesh solver (line 2 in Algorithm 1) and the optimal refinements selection (Algorithm 2) by the DNN allows for a 50% reduction of the computational time.

The DNN used is available at

http://home.agh.edu.pl/paszynsk/dnn_hp2d/dnn_hp2d.tar.gz

References

Bebis, G., Georgiopoulos, M.: Feed-forward neural networks. IEEE Potent. 13(4), 27–31 (1994)
Article Google Scholar
Bubak, M., Kitowski, J., Wiatr, K. (eds.): eScience on Distributed Computing Infrastructure. LNCS, vol. 8500. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10894-0
Book Google Scholar
Demkowicz, L.: Computing with hp-Adaptive Finite Elements, vol. 1. Chapman & Hall/CRC Applied Mathematics & Non-linear Science (2006)
Google Scholar
Demkowicz, L., Kurtz, J., Pardo, D., Paszyński, M., Rachowicz, W., Zdunek, A.: Computing with hp-Adaptive Finite Elements, vol. 2. Chapman & Hall/CRC Applied Mathematics & Non-linear Science (2007)
Google Scholar
Guo, B., Babuška, I.: The hp version of the finite element method, part i: the basic approximation results. Comput. Mech. 1(1), 21–41 (1986)
Article Google Scholar
Guo, B., Babuška, I.: The hp version of the finite element method, part ii: general results and applications. Computat. Mech. 1(1), 203–220 (1986)
Article Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Salman, S., Liu, X.: Overfitting mechanism and avoidance in deep neural networks. arXiv preprint arXiv:1901.06566 (2019)
Schwab, C.: p-and hp-Finite Element Methods. The Clarendon Press, Oxford University Press, New York (1998)
MATH Google Scholar
Srivastava, N.: Improving neural networks with dropout. Univ. Toronto 182(566), 7 (2013)
Google Scholar

Download references

Acknowledgement

This work was partially supported by National Science Centre grant no. 2016/21/B/ST6/01539. The visit of Maciej Paszyński at the Odens Institute has been supported by J. T. Oden Research Faculty Fellowship. Additionally, this work was supported by National Science Centre, Poland grant no. 2017/26/M/ST1/ 00281.

Author information

Authors and Affiliations

AGH University of Science and Technology, Kraków, Poland
Maciej Paszyński & Rafał Grzeszczuk
The University of the Basque Country, Bilbao, Spain
David Pardo
Basque Center for Applied Mathematics, Bilbao, Spain
David Pardo
IKERBASQUE, Bilbao, Spain
David Pardo
Oden Institute, The University of Texas at Austin, Austin, USA
Leszek Demkowicz

Authors

Maciej Paszyński
View author publications
You can also search for this author in PubMed Google Scholar
Rafał Grzeszczuk
View author publications
You can also search for this author in PubMed Google Scholar
David Pardo
View author publications
You can also search for this author in PubMed Google Scholar
Leszek Demkowicz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Maciej Paszyński .

Editor information

Editors and Affiliations

AGH University of Science and Technology, Krakow, Poland
Maciej Paszynski
Ludwig-Maximilians-Universität München, Munich, Germany
Dieter Kranzlmüller
University of Amsterdam, Amsterdam, The Netherlands
Valeria V. Krzhizhanovskaya
University of Tennessee at Knoxville, Knoxville, TN, USA
Jack J. Dongarra
University of Amsterdam, Amsterdam, The Netherlands
Peter M. A. Sloot

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this paper

Cite this paper

Paszyński, M., Grzeszczuk, R., Pardo, D., Demkowicz, L. (2021). Deep Learning Driven Self-adaptive Hp Finite Element Method. In: Paszynski, M., Kranzlmüller, D., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A. (eds) Computational Science – ICCS 2021. ICCS 2021. Lecture Notes in Computer Science(), vol 12742. Springer, Cham. https://doi.org/10.1007/978-3-030-77961-0_11

Download citation

DOI: https://doi.org/10.1007/978-3-030-77961-0_11
Published: 09 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77960-3
Online ISBN: 978-3-030-77961-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Deep Learning Driven Self-adaptive Hp Finite Element Method

Abstract

Similar content being viewed by others

HiDeNN-FEM: a seamless machine learning approach to nonlinear finite element analysis

Design of Efficient Finite Elements Using Deep Learning Approach

Design of Efficient Quadrature Scheme in Finite Element Using Deep Learning

Keywords

1 Introduction

2 Self-adaptive hp-FEM with Neural Network

3 Conclusions

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Deep Learning Driven Self-adaptive Hp Finite Element Method

Abstract

Similar content being viewed by others

HiDeNN-FEM: a seamless machine learning approach to nonlinear finite element analysis

Design of Efficient Finite Elements Using Deep Learning Approach

Design of Efficient Quadrature Scheme in Finite Element Using Deep Learning

Keywords

1 Introduction

2 Self-adaptive hp-FEM with Neural Network

3 Conclusions

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation