1 Introduction

Unstructured meshes are popular in numerical simulations such as computational fluid dynamics (CFD) due to their flexibility and ease of generation [1, 2]. In finite element analysis (FEA), the unstructured triangular mesh is also widely used and studied [3]. However, the quality of unstructured meshes remains a major concern in their applications, since mesh quality affects simulation accuracy directly and significantly [4,5,6]. Initial mesh elements generated by automatic mesh generators often have poor quality, so a mesh optimization process is indispensable [7].

Topological optimization and smoothing are the two most widely used methods to improve unstructured mesh quality in mesh generation practice. Topological optimization methods such as the Delaunay edge/face swap [8, 9] improve mesh quality by swapping the diagonal of quadrilaterals in which the Delaunay criterion is violated. Only triangular and tetrahedral meshes can be optimized by the Delaunay edge/face swap, and the mesh quality cannot be maximized because node positions are not changed. Node insertion/deletion and local mesh reconstruction are other types of topological optimization methods [10,11,12]. These methods are effective in extreme cases, such as when the valence of an interior mesh node is too large or too small.

Unlike topological optimization, smoothing methods move the interior nodes to their optimal positions iteratively while keeping the topology unchanged. Smoothing methods can improve mesh quality substantially because the node displacements can be relatively large, and they can be applied to all interior nodes in most cases.

Smoothing methods mainly include optimization-based smoothing methods and heuristic smoothing methods. Compared with optimization-based smoothing methods, heuristic smoothing methods are more efficient but yield lower mesh quality. The balance between smoothing efficiency and mesh quality has been pursued in previous studies.

The Laplacian smoothing method [13] is a heuristic method that iteratively moves each interior node to the arithmetic average of its surrounding neighbor nodes. Due to its extremely high efficiency and satisfactory mesh quality, the Laplacian method is widely adopted in mesh generation practice. However, the Laplacian smoothing method may produce low-quality or even invalid elements, as well as deformation and shrinkage when smoothing 3D surface meshes [14]. Improvements have been proposed to avoid these problems, and other heuristic methods have been developed. For example, Zhou [15] proposed the angle-based smoothing method, which iteratively moves each interior free node to adjust the angles of the elements adjacent to it. It turned out that the angle-based method generates higher-quality meshes than the Laplacian method. Vartziotis [14, 16, 17] developed the geometric element transformation method (GETMe), which transforms poor-quality elements into regular ones and, unlike other node-moving methods, moves all nodes of an element simultaneously, thus improving overall quality.

On the other hand, the optimization-based method [18,19,20] defines local or global objective functions, such as inverse shape quality or skewness, and minimizes the objective function with iterative methods such as the conjugate gradient solver or the Newton solver, thereby increasing mesh quality over the iterations. This method generally produces the highest-quality meshes, at the highest computational cost.

Recently, due to their good generalization of nonlinear relations, artificial neural networks (ANNs) have been widely studied and applied across disciplines, such as solving PDEs [21, 22], surrogate modeling [23, 24], computational mechanics [25,26,27,28,29], mesh adaptation [30,31,32], and mesh generation [33,34,35,36,37,38]. Applying ANN methods in computation-intensive fields can improve efficiency significantly while maintaining reasonable accuracy. The ANN-based mesh smoothing method (NN-Smoothing) proposed by Guo [38] predicts the optimal node positions with an ANN trained on samples extracted from meshes smoothed by the optimization-based method. The NN-Smoothing method imitates the optimization-based smoothing method in generating high-quality meshes while maintaining the high efficiency of the Laplacian method. In this method, however, the training samples are generated, optimized, and normalized by cumbersome manual work. In addition, seven separate neural networks are trained to handle inputs with different dimensions.

To eliminate the manual work of preprocessing training samples and to unify the networks into a single network capable of handling different input dimensions, this paper proposes a new smoothing method that combines the advantages of the heuristic Laplacian method and the optimization-based method, based on deep reinforcement learning (DRL) under the Deep Deterministic Policy Gradient (DDPG) [39] framework. Within the DDPG framework, the actor-network predicts the optimal position of each interior free node from its surrounding ring nodes. At the same time, a critic-network is established that evaluates the action taken by the actor-network by estimating its long-term reward, which is defined by the mesh quality. Training the networks maximizes the cumulative long-term reward, which in turn maximizes the mesh quality.

This paper is organized as follows: in Sect. 2, we briefly review popular mesh quality optimization methods including topological optimization methods and smoothing methods. The mesh smoothing based on deep reinforcement learning is illustrated in Sect. 3. Training algorithms and hyperparameters are studied in Sect. 4. Training, validations, and comparison between different methods on 2-dimensional and 3-dimensional surface meshes will be presented in Sects. 5 and 6. Numerical simulations on smoothed meshes and perturbed meshes are carried out and results are compared in Sect. 7. Finally, conclusions and possible future work of the method are proposed in Sect. 8.

2 Brief review of unstructured mesh quality optimization methods

Mesh quality is so important to CFD and FEA numerical simulations that mesh quality optimization is almost indispensable, either integrated into the mesh generation process or carried out afterward. Topological optimization and smoothing are two major types of mesh quality optimization methods.

The Delaunay edge swap [8, 9] is one type of topological optimization that swaps the diagonal of a quadrilateral which violates the Delaunay criterion. As Fig. 1 shows, point D lies in the circumcircle of \(\Delta ABC\), which violates the Delaunay criterion; point B likewise lies in the circumcircle of \(\Delta ACD\). Therefore, an edge swap is performed in quadrilateral ABCD, in which the diagonal AC is replaced by BD. After the edge swap, the thin triangle \(\Delta ACD\) is eliminated, thus improving mesh quality.

Fig. 1
figure 1

Delaunay edge swap [8, 9]

The Laplacian smoothing method [13] computes the optimal free node position (v*) as the arithmetic average of its surrounding ring nodes (v1–v5), which can be expressed as Eq. (1). The smoothing process for a single free node and its surrounding cells is shown in Fig. 2.

$$ x^{ * } = \frac{1}{N}\sum\limits_{i = 1}^{N} x_{i} , \qquad y^{ * } = \frac{1}{N}\sum\limits_{i = 1}^{N} y_{i} $$
(1)
Fig. 2
figure 2

Laplacian smoothing
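
For illustration, the following is a minimal Python sketch of Eq. (1); the paper's implementation is in C++, and the function name and example coordinates here are hypothetical.

```python
import numpy as np

def laplacian_position(ring_nodes):
    """Eq. (1): new free-node position as the arithmetic average of the
    surrounding ring nodes, given as an (N, 2) array of (x, y) coordinates."""
    return np.asarray(ring_nodes, dtype=float).mean(axis=0)

# Example with five ring nodes v1-v5 (coordinates assumed)
ring = [(0.0, 0.0), (1.0, 0.2), (1.2, 1.0), (0.5, 1.5), (-0.3, 0.9)]
print(laplacian_position(ring))  # -> [0.48 0.72]
```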

The angle-based smoothing method [15] tries to optimize the included angles of the neighboring cells that share the same free node by moving that node. As shown in Fig. 3, the free node v is moved to v* to make the included angles \(\alpha_{1}\) and \(\alpha_{2}\) equal.

Fig. 3
figure 3

Angle-based smoothing [15]

Smoothing based on the geometric element transformation method (GETMe) [14, 16, 17] improves the regularity and quality of each element by a two-step regularizing element transformation. Unlike other smoothing methods, which move one node at a time, GETMe moves all nodes of an element simultaneously. Figure 4 shows the clockwise and counterclockwise two-step transformation of a triangular element.

Fig. 4
figure 4

Triangle geometric transformation using θ = 20 of GETMe smoothing [14]

The optimization-based method [18,19,20] improves mesh quality by minimizing an objective function defined by certain mesh quality metrics, such as the inverse aspect ratio, inverse included angle, or inverse shape quality. Iterative solvers such as the conjugate gradient or Newton solver are adopted to find the minimum value of the objective function (Fig. 5).

Fig. 5
figure 5

Optimization-based smoothing method

The neural network-based smoothing (NN-Smoothing) [38] predicts the optimal node position from its surrounding ring nodes, as Laplacian smoothing does, using a fully connected artificial neural network, as Fig. 6 shows. By training on samples extracted from meshes optimized by the optimization-based method, the NN-Smoothing method imitates the optimization-based method in mesh quality while retaining high efficiency.

Fig. 6
figure 6

Neural network-based smoothing method [38]

3 Deep reinforcement learning-based smoothing

3.1 Deep reinforcement learning framework

Deep learning based on artificial neural networks is capable of approximating general nonlinear relations. Reinforcement learning is good at observing and exploring the environment and providing guidance to complete a certain task. Deep reinforcement learning (DRL) combines the strengths of both generalization and exploration.

Typical DRL algorithm procedures are shown in Fig. 7. The goal of reinforcement learning is to train an agent to complete a task within an unknown environment. The agent receives observations and a reward from the environment and sends actions to the environment. The reward is a measure of how successful an action is to complete the task goal. The policy is a mapping that selects actions based on the observations from the environment, which could be a function approximator with tunable parameters, such as a deep neural network. The learning algorithm continuously updates the policy parameters based on the actions, observations, and rewards. The goal of the learning algorithm is to find an optimal policy that maximizes the cumulative reward received during the task.

Fig. 7
figure 7

Schematic graph of DRL

DRL algorithms can be categorized into policy-based and value-based models. Policy-based methods, such as the policy gradient method, determine the action according to a probability distribution and can output continuous actions, while value-based methods, such as the Deep Q-Network, choose the action with the largest value and therefore only work in discrete action spaces.

In this paper, a new unstructured mesh smoothing method based on DRL (DRL-Smoothing) is proposed. We implement the policy-based Deep Deterministic Policy Gradient (DDPG) DRL method [39] which works well in continuous action space to deal with continuous node coordinates. The DDPG agent can explore the environment and find the optimal strategy to complete the mesh smoothing task. Details of the DRL-Smoothing method will be discussed in the next subsection.

3.2 DRL-based mesh smoothing

To complete the mesh smoothing task, we define the environment, state, action, reward, and policy in the DDPG algorithm as follows.

Environment includes the mesh and the dynamics of the mesh.

Mesh: including all mesh nodes and cell topology information.

Dynamics of the mesh: including mesh nodes relocation and updating functions.

State:

The current free node v and its surrounding ring nodes (v1–v5) as Fig. 6a shows.

The interior free nodes are selected and smoothed one by one, in their storage order, within one training episode, and the ring nodes (v1–v2–v3–v4–v5) are constructed in a clockwise/counter-clockwise order as shown in Fig. 6a. To eliminate the impact of the absolute location of the nodes and increase generalizability, normalization is applied to the coordinates of the ring nodes to transform the ring polygon into a unit-length box [38], as shown in Fig. 8.

Fig. 8
figure 8

State normalization process [38]
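
A minimal sketch of this normalization is given below, assuming a translate-and-scale mapping into the unit box; the exact transformation used in Ref. [38] may differ in detail, and the function names are hypothetical.

```python
import numpy as np

def normalize_state(ring_nodes):
    """Map the ring polygon into a unit-length box: shift the minimum corner
    to the origin and scale by the largest extent (aspect ratio preserved).
    Returns the normalized coordinates plus the shift/scale needed to map a
    predicted position back to mesh coordinates."""
    ring = np.asarray(ring_nodes, dtype=float)
    lo = ring.min(axis=0)
    scale = (ring.max(axis=0) - lo).max()
    return (ring - lo) / scale, lo, scale

def denormalize(point, lo, scale):
    """Map a point from the normalized frame back to mesh coordinates."""
    return np.asarray(point, dtype=float) * scale + lo
```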

Action:

The predicted optimal free node position v*, as Fig. 6a shows.

Having the prior knowledge that Laplacian smoothing can improve mesh quality, we incorporate this knowledge by defining the new node position as the sum of the Laplacian smoothing node position and the action vector \(\left( {a_{x} ,a_{y} } \right)\), which can be expressed as:

$$ x^{new} = x^{ * } + a_{x} , \qquad y^{new} = y^{ * } + a_{y} $$
(2)

where \(\left( {x^{ * } ,y^{ * } } \right)\) are node positions calculated by the Laplacian smoothing method in Eq. (1).
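
A short sketch of Eq. (2), reusing the hypothetical `laplacian_position` helper from the sketch in Sect. 2:

```python
def apply_action(ring_nodes, action):
    """Eq. (2): the new node position is the Laplacian position of Eq. (1)
    offset by the action vector (a_x, a_y) predicted by the actor-network."""
    x_star, y_star = laplacian_position(ring_nodes)
    a_x, a_y = action
    return x_star + a_x, y_star + a_y
```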

Reward:

Weighted mesh quality metrics.

The minimum equiangular skewness (\(Q_{\min }\)) and the average equiangular skewness (\(Q_{avg}\)) are weighted to construct the reward function R as shown in Eq. (3).

$$ R = \lambda Q_{\min } + \left( {1 - \lambda } \right)Q_{avg} $$
(3)

The equiangular skewness quality (\(Q\)) is defined as follows:

$$ Q_{i} = 1.0 - \max \left( {\frac{{\theta_{e} - \theta_{\min } }}{{\theta_{e} }},\frac{{\theta_{\max } - \theta_{e} }}{{180 - \theta_{e} }}} \right) $$
(4)

where \(\theta_{e} = 60^{ \circ }\) for triangles, \(\theta_{e} = 90^{ \circ }\) for quadrilaterals, \(\theta_{\min }\) and \(\theta_{\max }\) are the minimum and maximum included angles for a single cell. The range of \(Q\) is from 0 to 1.0. When skewness quality \(Q = 1.0\), the quality is the best and the corresponding reward should be the largest. When skewness quality \(Q = 0.0\), the quality is the worst and the reward is the smallest. The influence of the weighted coefficient \(\lambda \) will be discussed in Sect. 4.2.

In 2-dimensional meshes, when the intersection check fails or the predicted node lies outside the surrounding polygon formed by its ring nodes, a penalty of − 1 is given to the reinforcement learning agent. The geometric intersection check between two line segments is straightforward and is not detailed in this paper.
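
The reward computation can be sketched as follows (Python for illustration; the cell set over which \(Q_{\min }\) and \(Q_{avg}\) are taken is assumed to be the cells around the free node, and the helper names are hypothetical):

```python
import numpy as np

def triangle_angles(p0, p1, p2):
    """Interior angles (in degrees) of a triangle given its vertex coordinates."""
    pts = [np.asarray(p, dtype=float) for p in (p0, p1, p2)]
    angles = []
    for i in range(3):
        u = pts[(i + 1) % 3] - pts[i]
        v = pts[(i + 2) % 3] - pts[i]
        c = np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))
        angles.append(np.degrees(np.arccos(np.clip(c, -1.0, 1.0))))
    return angles

def skewness_quality(angles, theta_e=60.0):
    """Eq. (4): equiangular skewness quality in [0, 1]; 1.0 is best."""
    t_min, t_max = min(angles), max(angles)
    return 1.0 - max((theta_e - t_min) / theta_e,
                     (t_max - theta_e) / (180.0 - theta_e))

def reward(cell_qualities, lam=1.0, valid=True):
    """Eq. (3), plus the -1 penalty for an invalid update (intersection check
    fails or the node leaves its surrounding polygon)."""
    if not valid:
        return -1.0
    q = np.asarray(cell_qualities, dtype=float)
    return lam * q.min() + (1.0 - lam) * q.mean()

# A 30-60-90 triangle: Q = 1 - max(30/60, 30/120) = 0.5
print(skewness_quality([30.0, 60.0, 90.0]))  # -> 0.5
```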

In the DDPG agent, a critic-network is established to process the reward signal and provide guidance for training the actor-network. The critic-network consists of a state path, an action path, and a common path after concatenation. As Fig. 9 shows, the state path is a 3-layer fully connected (FC) network with 64, 16, and 8 neurons in its hidden layers. The action path is a 1-layer FC network with 8 neurons. The common path also has 1 FC layer, with 4 neurons. A ReLU activation function follows each fully connected layer.

Fig. 9
figure 9

Critic neural network of the DDPG agent
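
A sketch of this critic in PyTorch is given below; the layer sizes follow Fig. 9, while the final scalar output layer and the state/action dimensions (16 and 2, matching the 2D actor input) are assumptions.

```python
import torch
import torch.nn as nn

class Critic(nn.Module):
    """DDPG critic (Fig. 9): state path 64-16-8, action path 8, common path 4,
    with ReLU after each fully connected layer."""
    def __init__(self, state_dim=16, action_dim=2):
        super().__init__()
        self.state_path = nn.Sequential(
            nn.Linear(state_dim, 64), nn.ReLU(),
            nn.Linear(64, 16), nn.ReLU(),
            nn.Linear(16, 8), nn.ReLU(),
        )
        self.action_path = nn.Sequential(nn.Linear(action_dim, 8), nn.ReLU())
        self.common = nn.Sequential(nn.Linear(16, 4), nn.ReLU(), nn.Linear(4, 1))

    def forward(self, state, action):
        s = self.state_path(state)
        a = self.action_path(action)
        return self.common(torch.cat([s, a], dim=-1))  # Q(S, A)
```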

Policy

The actor artificial neural network predicts the optimal node positions according to its ring nodes. The actor-network is also called the policy network, which is represented by a fully connected multilayer perceptron in this paper as shown in Fig. 10.

Fig. 10
figure 10

Actor neural network of the DDPG agent

Previous work [38] established 7 neural networks with different input dimensions and trained each network separately to handle the varying number of ring nodes. In addition, training samples and neural networks had to be prepared and trained separately for triangular and quadrilateral grids.

In this paper, we fix the input dimension of the actor-network, considering at most 8 surrounding ring nodes. If the number of ring nodes is larger than 8, we use Laplacian smoothing for those nodes. Zero-padding is adopted when the number of ring nodes is less than 8. Thus, the input dimension is fixed at 16. The influence of the input dimension of the actor-network and the zero-padding strategy is discussed in Sect. 4.2. We use 2 fully connected hidden layers, with 32 and 16 neurons respectively. The output layer consists of 2 neurons that output the action vector (ax, ay); the optimized node coordinates are then obtained by Eq. (2). A ReLU activation function follows each hidden layer to improve nonlinear generalization, which is defined as follows:

$$ \sigma \left( x \right) = \begin{cases} 0, & x < 0 \\ x, & x \ge 0 \end{cases} $$
(5)
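
A PyTorch sketch of this actor-network is shown below; whether any bounding activation is applied to the output action is not stated in the paper, so none is added here.

```python
import torch.nn as nn

class Actor(nn.Module):
    """Actor (policy) network of Fig. 10: 16 inputs (8 ring nodes x 2 coordinates),
    hidden layers of 32 and 16 neurons with ReLU (Eq. (5)), and 2 outputs (a_x, a_y)."""
    def __init__(self, state_dim=16, action_dim=2):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 32), nn.ReLU(),
            nn.Linear(32, 16), nn.ReLU(),
            nn.Linear(16, action_dim),
        )

    def forward(self, state):
        return self.net(state)
```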

3.3 Three-dimensional surface mesh smoothing based on the DRL method

Traditional surface mesh smoothing methods project the free nodes to the geometry after the relocation of every node, which is very inefficient. The DRL-Smoothing method can be applied to 3-dimensional (3D) surface mesh with minor modifications and only one projection is required.

As in 2D mesh smoothing, we adopt the same DDPG framework and extend the state definition, the normalization (Fig. 8), and the action definition of Eq. (2) to the 3D case. Specifically, a 3D triangle is normalized so that it fits into a unit bounding box, and 3D actions are defined by directly extending Eq. (2) to 3D. For the actor-network, we use 3 fully connected hidden layers, with 32, 16, and 16 neurons respectively.
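
A minimal sketch of these 3D extensions, under the same assumptions as the 2D sketches (translate-and-scale normalization, Laplacian position plus action offset):

```python
import numpy as np

def normalize_ring_3d(ring_nodes):
    """Fit the 3D ring nodes into a unit bounding box (extension of Fig. 8)."""
    ring = np.asarray(ring_nodes, dtype=float)  # shape (V, 3)
    lo = ring.min(axis=0)
    scale = (ring.max(axis=0) - lo).max()
    return (ring - lo) / scale, lo, scale

def apply_action_3d(ring_nodes, action):
    """Eq. (2) extended to 3D: Laplacian position plus (a_x, a_y, a_z)."""
    ring = np.asarray(ring_nodes, dtype=float)
    return ring.mean(axis=0) + np.asarray(action, dtype=float)
```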

4 Training algorithm and hyperparameters

4.1 Training algorithm

As an actor-critic agent, the DDPG agent maintains four artificial neural networks: online actor \(\mu \left( S \right)\), online critic \(Q\left( {S,A} \right)\), target actor \(\mu^{\prime}\left( S \right)\), and target critic \(Q^{\prime}\left( {S,A} \right)\).

The actor-network takes observation S and outputs the corresponding action that maximizes the long-term reward. The critic-network takes observation S and action A as inputs and outputs the corresponding expectation of the long-term reward.

The online networks are used and updated in real time, while the target networks are periodically updated based on the parameters of the latest online networks to improve stability. The detailed training algorithm of the DDPG agent is as follows [39].

figure a
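
Since the algorithm listing above is reproduced as an image, the core update step of standard DDPG [39] is summarized in the following Python/PyTorch sketch; it is a generic illustration under assumed tensor shapes, not a transcription of the paper's listing.

```python
import torch
import torch.nn.functional as F

def ddpg_update(batch, actor, critic, actor_t, critic_t,
                actor_opt, critic_opt, gamma=0.99, tau=1e-3):
    """One DDPG update on a replay-buffer minibatch (s, a, r, s_next, done)."""
    s, a, r, s_next, done = batch

    # Critic: regress Q(s, a) towards y = r + gamma * Q'(s', mu'(s'))
    with torch.no_grad():
        y = r + gamma * (1.0 - done) * critic_t(s_next, actor_t(s_next))
    critic_loss = F.mse_loss(critic(s, a), y)
    critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

    # Actor: maximize the critic's value of the actor's own action
    actor_loss = -critic(s, actor(s)).mean()
    actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()

    # Soft update of the target networks with smoothing factor tau
    for target, online in ((actor_t, actor), (critic_t, critic)):
        for p_t, p in zip(target.parameters(), online.parameters()):
            p_t.data.mul_(1.0 - tau).add_(tau * p.data)
```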

The general training procedure is depicted in the flowchart as shown in Fig. 11.

Fig. 11
figure 11

General training procedure of the DRL-Smoothing method

4.2 Training hyperparameters

In the DDPG algorithm, the target-network update smoothing factor is \(\tau = 10^{ - 3}\), the random experience minibatch size is M = 64, the experience buffer size is R = \(10^{6}\), and the discount factor is \(\gamma = 0.99\). The Ornstein–Uhlenbeck noise model is adopted to encourage exploration.

We adopt the Adam optimizer with a learning rate of \(10^{-4}\) for the actor-network and \(5 \times 10^{ - 3}\) for the critic-network to improve learning results. \(L_{2}\) regularization with a factor of \(10^{-3}\) is adopted to avoid overfitting.
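
These settings correspond to, for example, the following PyTorch configuration (a sketch: `weight_decay` is used as a stand-in for the \(L_{2}\) regularization factor, the `Actor`/`Critic` classes are the ones sketched above, and the Ornstein–Uhlenbeck parameters θ and σ are typical defaults rather than values reported in the paper):

```python
import numpy as np
import torch

actor, critic = Actor(), Critic()
actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-4, weight_decay=1e-3)
critic_opt = torch.optim.Adam(critic.parameters(), lr=5e-3, weight_decay=1e-3)

class OUNoise:
    """Ornstein-Uhlenbeck exploration noise added to the predicted action."""
    def __init__(self, dim, theta=0.15, sigma=0.2, dt=1.0):
        self.theta, self.sigma, self.dt = theta, sigma, dt
        self.x = np.zeros(dim)

    def sample(self):
        self.x += (-self.theta * self.x * self.dt
                   + self.sigma * np.sqrt(self.dt) * np.random.randn(*self.x.shape))
        return self.x.copy()
```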

To obtain better training convergence and accumulated reward, we consider the influence of the weighted coefficient \(\lambda \), the dimension of the neural network, the input dimension of the actor-network, the activation function, and the padding strategy. In this section, we generate a triangular mesh with a node valence of 6 for all nodes and perturb the node coordinates randomly to establish the training mesh and validation mesh. The initial mesh, the perturbed training mesh, and the perturbed validation mesh are depicted in Fig. 12, and information on the meshes is listed in Table 1.

(1) Reward weighted coefficient \(\lambda \)

Fig. 12
figure 12

Training and validation mesh for hyperparameters study

Table 1 Information of the perturbed training and validation mesh

To determine the weighted coefficient \(\lambda \) in Eq. (3), we fix the input dimension of the actor-network at 12 to match the node valence (V = 6) of the training mesh. We use 2 hidden layers with 32 and 16 neurons, and a ReLU activation function follows each hidden layer. Padding and Laplacian smoothing are not adopted in this case. We choose 5 different values for \(\lambda \) and compare the training results. Figure 13 compares the normalized episode reward convergence histories. In this figure, the rewards are normalized by the converged reward to make them comparable across different weighted coefficients. The figure shows that the reward convergence history is only slightly affected by \(\lambda \).

To determine the best pre-trained model and its corresponding \(\lambda \), we validate the pre-trained models with different \(\lambda \) and smooth the validation mesh for 5 iterations. Table 2 shows the mesh quality after smoothing. The data shows that the mesh quality improves significantly after smoothing for all cases. When \(\lambda =1.0\), the reward is equal to the minimum skewness quality, and the smoothed mesh has the highest minimum quality and highest average quality. Therefore, we choose \(\lambda =1.0\) in this paper hereafter.

(2) Dimensions of the actor-network

Fig. 13
figure 13

Episode reward convergence history of different reward weighted coefficient \(\lambda \)

Table 2 Validation results when weighted coefficient \(\lambda \) varies

The dimension of the neural network affects the generalization and accuracy of the network. Overfitting and underfitting are two common problems in training neural networks. A large, deep neural network has good accuracy but is harder to train to convergence and more prone to overfitting, while a small network may underfit. To train the best neural network, the dimension of the network is one of the important hyperparameters and should be dealt with carefully.

In this paper, the input of the actor-network is the 2-dimensional coordinates of the ring nodes; a fully connected multilayer perceptron (MLP) is suitable for this model since the amount of sample data is moderate.

The optimal dimension of a neural network is usually determined by repeated experiments. It is recommended to start with a small configuration, such as 1–5 layers and 1–100 neurons, and then gradually add more layers and neurons if underfitting occurs, or reduce them if overfitting occurs. In addition, batch normalization, dropout, regularization, and other methods can be introduced to reduce overfitting in practice.

To determine the best dimension of the actor-network, we train and validate the neural networks with different neural network dimensions on the training and validation mesh shown in Fig. 12.

We compare 8 cases with different dimensions, as Table 3 shows. In these cases, we fix the input dimension of the actor-network at 12 to match the node valence (V = 6) of the training mesh and choose the ReLU activation function. We change the number of hidden layers and the number of neurons in each layer, train the different actor-networks, and compare the episode reward and the quality of the smoothed validation mesh. In Table 3, ‘[4, 4, 4, 4]’ means there are 4 hidden layers with 4 neurons each; ‘[32, 8, 4]’ means there are 3 hidden layers with 32, 8, and 4 neurons respectively.

Table 3 Training episode reward and the mesh quality for validation smoothed mesh

Figure 14 shows the convergence history of episode reward. Table 3 shows the training reward when episode = 1000 and the mesh quality for validation smoothed mesh.

Fig. 14
figure 14

Training reward convergence history for different network dimensions

In Case 1 and Case 2, we increase the number of hidden layers from 1 to 4, and only with 4 hidden layers does the neural network achieve better training and validation performance. In Case 3, we increase the number of neurons in the single hidden layer from 4 to 8, 16, 32, 64, and up to 128 to achieve better performance. From these analyses, we conclude that adding more hidden layers or more neurons improves training and validation performance.

Therefore, we add more layers and more neurons to the actor-network. In Cases 4–8, we achieve converged results with both 2 and 3 hidden layers. Table 3 shows that the episode reward and the mesh quality for Cases 4–8 are very close. However, the convergence histories differ significantly. Among all the results, Case 5 and Case 8 have the best convergence performance. Given that Case 5 is less computationally intensive, we use 2 hidden layers with 32 and 16 neurons, as in Case 5, in the remainder of this paper.

(3) Input dimension N and padding strategy

For triangular meshes, the optimum valence of each interior free node is 6; for quadrilateral meshes, the optimum valence is 4. A valence larger or smaller than the optimum value may cause bad mesh quality. In complex meshes, the node valence varies from node to node; previous work [38] used 7 separate neural networks with different input dimensions.

In this paper, we use a single network with fixed input dimension 2*N (for 2D mesh). For free nodes that have more than N ring nodes, i.e., the node valence V > N, we adopt the Laplacian smoothing instead. For free nodes that have less than N ring nodes, i.e., the node valence V < N, we adopt a padding strategy to keep the input dimension fixed. Laplacian smoothing and padding will be automatically switched according to the number of ring nodes.

Zeroes are added if the actual number of ring nodes (node valence V) is less than N. Then, the padding dimension is D = 2(N–V) for 2-dimensional cases.
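
The switching and padding logic can be sketched as follows (Python for illustration; `N_MAX` corresponds to N and the helper name is hypothetical):

```python
import numpy as np

N_MAX = 8  # maximum number of ring nodes handled by the actor-network (2D)

def build_state_or_fallback(ring_nodes):
    """Return (state, use_actor): zero-pad the input to 2*N_MAX entries when the
    valence V < N_MAX; fall back to Laplacian smoothing when V > N_MAX."""
    ring = np.asarray(ring_nodes, dtype=float)
    V = len(ring)
    if V > N_MAX:
        return None, False                # caller applies Laplacian smoothing
    state = np.zeros(2 * N_MAX)
    state[:2 * V] = ring.reshape(-1)      # padding dimension D = 2 * (N_MAX - V)
    return state, True
```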

We train different actor-networks with different input dimensions when N varies from 6 to 10. For the training mesh in this section, the node valence is V = 6, therefore, the padding dimension D increases from 0 to 8 accordingly. Figure 15 compares the episode reward convergence history. It shows that the converged rewards are close for different padding dimensions but the robustness of the training process decreases when the padding dimension increases.

Fig. 15
figure 15

Episode reward convergence history of actor-network with different input dimensions

After training, we validate the actor-network with the validation mesh shown in Fig. 12c. Table 4 shows the skewness quality of the mesh smoothed by the actor-network with different input dimensions. As the table shows, when the padding dimension D < 4, the performance of the actor-network is not affected by the padding, and the smoothed mesh has the highest quality when N = 8. When the padding dimension D > 4, the padding may degrade the performance of the actor-network and the quality of the smoothed mesh.

Table 4 Validation results when the actor-network input dimension varies

For most 2D and 3D surface meshes, the node valence rarely exceeds 8. Therefore, the input size of the actor-network is fixed at 16 for 2D meshes and 24 for 3D meshes, respectively. Zero-padding and Laplacian smoothing are switched automatically according to the node valence.

(4) Activation functions

The activation function is important for nonlinear fitting and plays an important role in the performance of the neural network. The ReLU function is widely adopted; it helps alleviate the vanishing-gradient problem and the convergence problems of deep networks. Besides, compared with sigmoid and tanh, ReLU avoids expensive calculations and is faster to evaluate.

In this paper, other activation functions are also adopted and tested, namely Leaky ReLU, tanh, and ELU, as shown in Fig. 16.

Fig. 16
figure 16

Four types of activation functions considered in this paper
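
For reference, the four activation functions can be written as follows (the Leaky ReLU slope and ELU coefficient are common defaults, not values taken from the paper):

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def leaky_relu(x, slope=0.01):
    return np.where(x >= 0, x, slope * x)

def elu(x, alpha=1.0):
    return np.where(x >= 0, x, alpha * (np.exp(x) - 1.0))

def tanh(x):
    return np.tanh(x)
```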

For the training and validation mesh shown in Fig. 12, we train the neural networks with 4 different activation functions. The actor-network consists of 2 hidden layers with 32 and 16 neurons respectively. Figure 17 shows the convergence history of different activation functions. Table 5 shows the training episode reward and the mesh quality for validation smoothed mesh.

Fig. 17
figure 17

Training reward convergence history for different activation functions

Table 5 Training episode reward and the mesh quality for validation smoothed mesh

The data shows that converged results are obtained with all 4 activation functions. Table 5 shows that the differences in episode reward and mesh quality are very small, while the differences in convergence history are relatively more obvious. In terms of computational cost and convergence, ReLU has the best performance. Therefore, we choose ReLU as the activation function in this paper.

5 Training and validation cases on 2-dimensional meshes

5.1 Training cases

We train the DDPG agent with a perturbed mesh in a rectangular domain. The perturbed mesh (as shown in Fig. 18b) is generated by randomly perturbing the interior nodes of an initial regular triangular mesh (as shown in Fig. 18a). The information of the initial regular mesh and perturbed training mesh is illustrated in Table 6.

Fig. 18
figure 18

Mesh before and after training

Table 6 Information of the initial regular mesh and training mesh

The smoothed mesh after training is shown in Fig. 18c which clearly shows that the smoothed mesh recovers regularity. Since only one smoothing iteration is conducted during the training episode, the smoothed mesh is still worse than the initial regular mesh.

The episode reward convergence history in Fig. 19 shows that the training converged within 700 episodes.

Fig. 19
figure 19

Episode reward convergence history of the training process

Table 7 compares the mesh quality of the mesh optimized by DDPG training and by the Laplacian smoothing method. Three smoothing iterations are carried out for this comparison. It shows that mesh skewness quality increases after training. The DDPG training results are better than Laplacian smoothing for minimum skewness quality \(\left( {Q_{\min } } \right)\) while the average quality \((Q_{{{\text{avg}}}} )\) is almost equivalent. This demonstrates the effectiveness of the DDPG training procedure.

Table 7 Comparison of mesh quality of DDPG training results and the Laplacian smoothing

5.2 Validation cases

To validate the pre-trained policy network, we adopt four cases to further analyze the performance of the DRL-Smoothing method. The first case is taken from Ref. [38] and represents common FEA simulation meshes. The other cases represent a 2D cylinder, a 2D NACA0012 airfoil, and a 2D 30P30N three-element airfoil, which are commonly used in CFD simulations. The number of free nodes and the quality of the initial validation meshes are shown in Table 8. The minimum equiangular skewness quality of the 4 initial meshes is smaller than 0.25, a common threshold for judging mesh usability, which indicates the poor quality of the initial meshes.

Table 8 Information of the initial validation meshes

The validation process only uses the actor-network, so the procedure is simple and direct, as Fig. 20 shows. In both the training and validation processes, the free nodes are selected one by one according to the storage order of the mesh node coordinates.

Fig. 20
figure 20

Validation procedure of the pre-trained actor neural network
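
A sketch of this validation loop is given below, reusing the hypothetical helpers and the `Actor` network from the sketches in Sects. 3 and 4 (whether the action is applied in the normalized frame and then mapped back, as done here, is an assumption):

```python
import numpy as np
import torch

def smooth_mesh(mesh, actor, n_iter=3):
    """One smoothing pass per iteration over all interior free nodes with the
    pre-trained actor-network; `mesh` is a hypothetical container exposing
    free_nodes(), ring_nodes(node) and move(node, position)."""
    for _ in range(n_iter):
        for node in mesh.free_nodes():                     # storage order
            ring = np.asarray(mesh.ring_nodes(node), dtype=float)
            if len(ring) > 8:                              # valence > 8: Laplacian
                mesh.move(node, ring.mean(axis=0))
                continue
            norm_ring, lo, scale = normalize_state(ring)   # Fig. 8 normalization
            state = np.zeros(16)
            state[:2 * len(ring)] = norm_ring.reshape(-1)  # zero-padding
            with torch.no_grad():
                a = actor(torch.tensor(state, dtype=torch.float32)).numpy()
            new_pos = norm_ring.mean(axis=0) + a           # Eq. (2), normalized frame
            mesh.move(node, new_pos * scale + lo)          # back to mesh coordinates
```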

Six mesh quality optimization methods introduced in Sect. 2, including the Laplacian smoothing, the angle-based smoothing, the GETMe, the NN-Smoothing, the optimization-based method, and the DRL-Smoothing proposed in this paper, are compared in terms of quality and efficiency. All the methods are implemented in the C++ programming language to ensure the time costs are comparable. We use the optimization-based smoothing method from the Mesquite mesh quality improvement toolkit [20]; its objective function is the \(L_{1}\) norm of the aspect ratio gamma quality metric. Since multiple iterations are required to optimize the initial mesh, we run the cases for 1 to 10 iterations and check the mesh quality.

Figures 21, 22, 23 and 24 show the initial mesh and the mesh smoothed by the DRL-Smoothing method for three iterations for the four validation cases. All the validation meshes are more complex than the training mesh, which demonstrates the robustness and extensibility of the pre-trained policy network.

Fig. 21
figure 21

Validation case 1 for the DRL-Smoothing method

Fig. 22
figure 22

Validation case 2 for the DRL-Smoothing method

Fig. 23
figure 23

Validation case 3 for the DRL-Smoothing method

Fig. 24
figure 24

Validation case 4 for the DRL-Smoothing method

Figure 25 shows the comparison of mesh quality for the six methods when smooth iteration increases from 1 to 10.

Fig. 25
figure 25

Mesh quality comparison of different smoothing methods with iterations increase from 1 to 10

Figure 25 shows that the mesh quality reaches its maximum when the number of smoothing iterations increases to around 4. If more iterations are run, the mesh quality stays the same or even degrades in some cases. Since the objective function of the optimization-based method is the L1 norm of the mesh quality, it attains a higher average quality than the other methods.

As shown in Table 9, when the number of smoothing iterations is around 4, the DRL-Smoothing method generally outputs higher-quality meshes in all cases compared with the Laplacian smoothing, the angle-based smoothing, and the GETMe smoothing. In general, the performance of the DRL-Smoothing is similar to that of the NN-Smoothing method. The exception is case 3, where the DRL-Smoothing method is slightly inferior to the NN-Smoothing in minimum skewness, although the average skewness is almost equivalent. This indicates that the actor-network may require further training on different meshes to increase its generalizability.

Table 9 Comparison of mesh quality of different smoothing methods when iteration = 4 for different cases

Since the time cost of a single smoothing iteration is very small, to measure the time cost more accurately and assess the efficiency of the above smoothing methods, we run 100 and 1000 smoothing iterations of each method on case 4 and record the time cost, which is listed in Table 10. We also calculate and compare the mesh quality after 4 and 100 smoothing iterations, and the results are listed in the same table. The data shows that, for all methods, mesh quality degrades when the number of smoothing iterations is too large. Even for the optimization-based method, the minimum skewness degrades when iteration = 100.

Table 10 Comparison of time cost of different smoothing methods

The data shows that the time cost increases proportionally with the number of smoothing iterations. The Laplacian method is the most efficient, while the optimization-based method is the least efficient. The Laplacian method is 10 × faster than angle-based smoothing, 50 × faster than NN-Smoothing, DRL-Smoothing, and GETMe, and nearly 500 × faster than the optimization-based method.

The DRL-Smoothing is as efficient as the NN-Smoothing and both methods are slightly more efficient than the GETMe method. The NN-Smoothing uses seven separate neural networks to deal with different numbers of ring nodes, while the DRL-Smoothing method uses only one actor-network. The efficiency of the NN-Smoothing and DRL-Smoothing methods mainly depends on the dimensions of the neural network which is chosen to be the same as Ref. [38] in this paper.

6 Training and validation cases on 3-dimensional surface meshes

6.1 Training cases

We generate a 3-dimensional triangular surface mesh for a sphere as shown in Fig. 26a and perturb the nodes to get the training mesh as shown in Fig. 26b. The mesh after 3 DRL smoothing iterations is shown in Fig. 26c. The result indicates that the regularity and quality of the smoothed mesh improve significantly. The mesh information and quality are listed in Table 11.

Fig. 26
figure 26

3D triangular surface mesh for DRL training

Table 11 Information of the initial regular mesh, perturbed training mesh, and smoothed mesh

6.2 Validation cases

We adopt 4 three-dimensional surface meshes to validate the DRL-Smoothing method. The geometries are downloaded from the Digital Shape Workbench v5.0 of AIM@SHAPE [40] and represent commonly used 3D visualization models.

Figures 27, 28, 29 and 30 show the initial perturbed mesh and the smoothed mesh after 4 smoothing and projection iterations. Table 12 shows the mesh quality of the initial meshes and the smoothed meshes. The results show that the mesh smoothness and mesh quality of the surface meshes improve significantly, which demonstrates the extensibility of the DRL-Smoothing.

Fig. 27
figure 27

Validation case 1 for the DRL-Smoothing method on the 3D surface mesh

Fig. 28
figure 28

Validation case 2 for the DRL-Smoothing method on the 3D surface mesh

Fig. 29
figure 29

Validation case 3 for the DRL-Smoothing method on the 3D surface mesh

Fig. 30
figure 30

Validation case 4 for the DRL-Smoothing method on the 3D surface mesh

Table 12 Mesh quality of the initial meshes and the smoothed meshes

Note that the 3D surface mesh smoothing examples are only used to demonstrate the extensibility of the DRL-Smoothing method. The DRL-Smoothing cannot accurately preserve the geometrical features of the 3D shape, such as sharp corners.

7 Numerical simulations with smoothed meshes

In this section, we adopt three typical numerical simulation cases in computational fluid dynamics, i.e., subsonic flow past the NACA0012 airfoil, transonic flow past the RAE2822 airfoil, and subsonic flow past the 30P30N three-element airfoil, and validate the numerical results with wind tunnel experiment data, to demonstrate the usability and effectiveness of the meshes optimized by the novel DRL-based mesh smoothing method.

The freestream conditions and mesh parameters of the three cases are listed in Table 13. The growth rate of the boundary layer anisotropic quadrilateral mesh is 1.2. To simulate the boundary layer more accurately, the normal initial grid spacing on the wall is determined so that the y+ is approximately equal to 2.

Table 13 Freestream conditions and mesh parameters of the three cases

Figure 31 shows the meshes optimized with our DRL-based smoothing method. Table 14 shows the maximum skewness and average skewness of the 6 meshes which indicates that the regularity and quality improve significantly after smoothing.

Fig. 31
figure 31

Perturbed mesh and corresponding smoothed mesh of the three airfoil simulation cases

Table 14 Mesh quality of the perturbed meshes and the smoothed meshes

Numerical simulations are carried out using the in-house code HyperFLOW [41, 42] developed by the authors' team. The simulations are based on a second-order finite volume discretization of the Reynolds-averaged Navier–Stokes (RANS) equations. Inviscid fluxes are discretized with Roe's flux difference method, and viscous interface gradients are calculated from the difference of face-normal variables [43]. Cell gradients are reconstructed by the cell-based Green–Gauss method, and the Spalart–Allmaras one-equation model is adopted to model turbulence.

Figure 32 compares the lift and drag coefficients computed on the smoothed mesh and on the perturbed mesh with experimental data for the NACA0012 airfoil. The differences in the lift curves near the stall angles of attack indicate that the perturbed mesh may lead to an inaccurate prediction of stall. Besides, the drag polar curves show that the numerical results on the perturbed mesh over-predict drag significantly.

Fig. 32
figure 32

Aerodynamic coefficients of NACA0012 under different angles of attack

Figure 33 compares the pressure coefficient (Cp) distribution and Mach contours on the smoothed mesh and the perturbed mesh for the RAE2822 airfoil. The Cp distribution on the upper surface of the airfoil shows a clear difference between the perturbed mesh and the smoothed mesh. The isolines in the Mach contours are also smoother for the smoothed mesh, indicating better flow resolution.

Fig. 33
figure 33

Pressure coefficient distribution and Mach contour of RAE2822 simulation with perturbed and smoothed meshes

Table 15 compares the force coefficients of the CFD results with experimental data. The lift and drag data show that the numerical results on the smoothed mesh are closer to the experimental data. The numerical results on the perturbed mesh overestimate the drag coefficient but underestimate the lift coefficient.

Table 15 Comparison of force coefficients between simulation and experiment

Figure 34 compares the pressure coefficient (Cp) distribution on the smoothed mesh and the perturbed mesh for the 30P30N airfoil. In the sub-figures, the flap, the slat, and the main wing are compared separately. The sub-figures show that the CFD results agree well with the experimental data in general, with slight differences between the perturbed mesh and the smoothed mesh. As in the RAE2822 airfoil case, the Mach contours in Fig. 35 show that the isolines are smoother for the smoothed mesh, which is closer to the real physical behavior.

Fig. 34
figure 34

Pressure coefficient distribution of 30P30N simulation with perturbed and smoothed meshes

Fig. 35
figure 35

Mach contour of 30P30N simulation with perturbed and smoothed meshes

8 Conclusions and future work

In this paper, a new unstructured mesh smoothing method based on deep reinforcement learning (DRL-Smoothing) is proposed. The DDPG framework is adopted and adapted to complete the mesh smoothing task. Different from previous work that trained separate neural networks on labeled data samples, this paper establishes and trains a single unified actor-network with a fixed input dimension by semi-supervised reinforcement learning. The DRL-Smoothing method predicts each interior free node position in the Laplacian smoothing manner, but the training of the actor-network maximizes the long-term reward and thus the mesh quality. Therefore, the DRL-Smoothing method can output high-quality meshes while maintaining high efficiency.

Training and validation results on a simple rectangular-domain case show that the pre-trained policy network can be extended to more complex problems, such as the 2D cylinder, 2D NACA0012 airfoil, and 2D 30P30N three-element airfoil mesh optimization cases. Quality comparisons of the six mesh smoothing methods show that the DRL-Smoothing method generally ranks among the top of the six: it outperforms the Laplacian method and the angle-based method in all cases while having performance similar to the NN-Smoothing method in the last two validation cases. Time cost tests show that the proposed DRL-Smoothing method is as efficient as the NN-Smoothing method and that both are slightly more efficient than the GETMe smoothing method. Training and validation on 3D surface meshes demonstrate the extensibility of the DRL-Smoothing method. Numerical simulations on 2D perturbed and smoothed meshes are carried out and compared, which demonstrates the influence of mesh quality on simulation accuracy.

Compared with the previous NN-Smoothing method, the DRL-Smoothing can achieve similar efficiency and quality performance and reduce the complexity of coding and repetitive manual work of preparing training samples.

Future work could extend the current method to quadrilateral meshes and 3D volume mesh smoothing. Including mesh nodes further away from the current node may reduce the number of iterations required to smooth the mesh and may also yield better smoothing performance; further investigation is required to verify this. Meanwhile, a more effective and robust actor-network could be trained by using more complicated training meshes and fine-tuning the hyperparameters of the DRL framework. Investigating the combination of r-adaptivity [44] and machine learning methods in mesh optimization is also worthwhile.

The source code, pre-trained model, training data, and validation data can be accessed at the GitHub repository (https://github.com/nianhuawong/DRL_Smoothing).