Support vector machine guided reproducing kernel particle method for image-based modeling of microstructures

Wang, Yanran; Baek, Jonghyuk; Tang, Yichun; Du, Jing; Hillman, Mike; Chen, Jiun-Shyan

doi:10.1007/s00466-023-02394-9

Support vector machine guided reproducing kernel particle method for image-based modeling of microstructures

Original Paper
Open access
Published: 18 October 2023

Volume 73, pages 907–942, (2024)
Cite this article

Download PDF

You have full access to this open access article

Computational Mechanics Aims and scope Submit manuscript

Support vector machine guided reproducing kernel particle method for image-based modeling of microstructures

Download PDF

Yanran Wang¹,
Jonghyuk Baek¹,
Yichun Tang²,
Jing Du²,
Mike Hillman³ &
…
Jiun-Shyan Chen ORCID: orcid.org/0000-0002-6871-8815¹

878 Accesses
Explore all metrics

Abstract

This work presents an approach for automating the discretization and approximation procedures in constructing digital representations of composites from micro-CT images featuring intricate microstructures. The proposed method is guided by the Support Vector Machine (SVM) classification, offering an effective approach for discretizing microstructural images. An SVM soft margin training process is introduced as a classification of heterogeneous material points, and image segmentation is accomplished by identifying support vectors through a local regularized optimization problem. In addition, an Interface-Modified Reproducing Kernel Particle Method (IM-RKPM) is proposed for appropriate approximations of weak discontinuities across material interfaces. The proposed method modifies the smooth kernel functions with a regularized Heaviside function concerning the material interfaces to alleviate Gibb's oscillations. This IM-RKPM is formulated without introducing duplicated degrees of freedom associated with the interface nodes commonly needed in the conventional treatments of weak discontinuities in the meshfree methods. Moreover, IM-RKPM can be implemented with various domain integration techniques, such as Stabilized Conforming Nodal Integration (SCNI). The extension of the proposed method to 3-dimension is straightforward, and the effectiveness of the proposed method is validated through the image-based modeling of polymer-ceramic composite microstructures.

Eighty Years of the Finite Element Method: Birth, Evolution, and Future

Article Open access 13 June 2022

Development of an ABAQUS plugin tool for periodic RVE homogenisation

Article Open access 21 May 2018

A physics-informed neural network technique based on a modified loss function for computational 2D and 3D solid mechanics

Article 28 November 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In recent years, a variety of non-destructive imaging techniques, such as micro-X-ray computed tomography (micro-CT), have emerged as powerful alternatives to obtain detailed information about the microstructure and internal deformation of composite materials [1,2,3,4]. Nevertheless, modeling microstructures remains challenging owing to their geometrical and topological complexities and heterogeneity, making the body-fitted mesh generation for mesh-based methods extremely tedious and time-consuming, especially in the three-dimension model construction. An example of a 2D micro-CT image slice of a polymer-ceramic composite specimen (polymer matrix reinforced by ceramic particles) is shown in Fig. 1a with a resolution of 8 μm, and a body-fitted FEM mesh is generated for a selected Region of Interest (ROI) of 200 by 200 pixels in Fig. 1b to demonstrate the meshing complexity. As can be seen, body-fitted meshing requires significant mesh refinement near material interfaces of inclusions with complex geometries. This adds complexity to mesh generation and demands extensive mesh density in the discretization, yielding 37,454 elements and 112,538 nodes in this demonstration example.

Various image segmentation techniques have been developed over the past several decades, including region-based and classification-based methods [5]. Global and local thresholding [6] is a simple region-based method that uses a threshold value to separate objects from the background, but it can lead to poor results if the threshold is not chosen correctly. The region growing method [7] is another region-based approach that relies on user-selected seed pixels and offers advantages over thresholding, but the numerical results can be sensitive to the selection of initial seed points. Clustering-based methods [8], such as K-mean clustering, hierarchical clustering, and Gaussian Mixture Models, are region-based, unsupervised algorithms that partition images into local regions or clusters based on the similarity of their attributes. They can perform image segmentation directly with image pixel information without labeled data, but appropriate selection of features and number of clusters are essential to the effectiveness of many clustering methods. On the contrary, classification-based methods generally adopt a global approach for image segmentation, whereby an automatic pattern recognition process is utilized in the context of supervised learning based on manually segmented training datasets. K-Nearest Neighbor is a simple, non-parametric supervised learning model that makes predictions based on the k-nearest neighbors in the training data, but it usually requires a large amount of training data to suppress high variance problems [9]. Tree-based algorithms, including decision trees and random forests, are another widely used supervised learning techniques in which the training data is partitioned into smaller subsets without much data pre-processing and with high interpretability and computational efficiency. However, these methods may have limited ability to predict unseen data, restricted decision boundary expressiveness, and can be sensitive to imbalanced data [10]. Recently, deep learning algorithms have enabled to develop state-of-the-art image segmentation methods, especially those based on convolutional neural networks, which can automatically learn features from raw images with minimal human interaction. However, these methods require large amounts of labeled datasets with extensive training and are mathematically more challenging to interpret due to the highly non-linear relationships between input features and output labels [11, 12]. Other techniques besides machine learning algorithms have also been widely adopted for image segmentation. The level set method originated from Osher and Sethian [13] uses an auxiliary function to represent and track the evolution of interfaces in images [14]. While it is capable of handling irregular shapes and complex topologies, identification of zero level set can be time consuming. Fast Fourier Transform explores the frequency domain of images where quick computations can be performed on frequency components of images. This algorithm is highly efficient as it operates in low-dimensional frequency domain. However, Fast Fourier Transform provides limited direct spatial localization information of images features, and it works more optimally for certain image processing task like image registration [15] and periodic patterns reconstruction and segmentation [16].

The present work employs the Support Vector Machine (SVM) algorithms as the image segmentation method to guide the numerical model generation. SVM is a classification-based machine learning algorithm built on solid mathematical foundation and optimization frameworks [17, 18]. Compared to other supervised algorithms, SVM is advantageous because it generates a unique maximum-margined global hyperplane for separating training datasets, providing a global solution for data classification. Additionally, it is not sensitive to the underlying probabilistic distribution of the training dataset, ensuring high performance for limited, noisy, or imbalanced datasets [19]. One apparent limitation of the standard SVM is that it requires $O({l}^{3})$ operations, where $l$ is the length of the training dataset, to solve a complex quadratic programming problem (QPP) with inequality constraints. Various approaches have been proposed to overcome this limitation, such as the training decomposition method [20] and the reduced support vector machine algorithm [21], which significantly improves SVM’s training speed. Additionally, more efficient formulations of SVM have been introduced, such as the Least Square SVM algorithm [22] and the Lagrangian SVM algorithm [23]. The Least Square SVM algorithm optimizes a dual problem directly using a least-square loss function, replacing the hinge loss function in the original SVM’s formulation to reformulate the complex QPP as a linear system of equations. In contrast, the Lagrangian SVM algorithm utilizes an implicit Lagrangian for the dual of the standard quadratic program of a linear SVM, leading to the minimization of an unconstrained differentiable convex function in the space of dimensionality equal to the number of training datasets. Both mentioned algorithms eliminate the necessity of complicated programming problem solvers, making them feasible for classifying large datasets. In addition to the binary SVM classifier, extensive research has been done to extend SVM to multi-class classification. The one-vs-all method, one-vs-one method, error-correcting output codes, and directed acyclic graphs are among the most widely used approaches to handle multi-class classification with SVM [24]. The traditional binary SVM algorithm is adopted in this work for its effective applicability to the two-phase materials.

Numerical modeling of heterogeneous materials remains challenging for both mesh-based methods discretized with body-fitted discretization and meshfree methods formulated with smooth approximations. For the Finite Element Method (FEM), incomplete handling of discontinuities in mesh construction can lead to suboptimal convergence [25], and aligning meshes with interfaces is a non-trivial task for composites with complex microstructures and significant variations in constituent moduli. The Finite Cell Method is a high-order embedded domain technique [26] that provides simple yet effective modification of traditional FEM to bypass the necessity of exhaust body-fitted meshing for geometrically and topologically complex microstructures. Korshunova et al. [27, 28] presented image-based numerical characterization and validation of additively manufactured structures using Finite Cell Method and numerical homogenization. Special numerical integration schemes need to be considered to differentiate between inside and outside the physical domain for the Finite Cell Method. The meshfree methods utilize point-wise discretization instead of carefully constructed body-fitted meshes. However, meshfree methods such as element-free Galerkin (EFG) [29] and reproducing kernel particle method (RKPM) [30,31,32] typically suffer from Gibb's-like oscillation in the approximation when modeling weak continuities in composite materials, as their smooth approximation functions with overlapping local supports fail to capture gradient jump conditions across material interfaces [33]. Considerable effort has been dedicated to developing effective techniques for dealing with interface discontinuities. Since the proposed work is under the Galerkin meshfree framework, the review of methods developed based on mesh-based context to address interface discontinuities is omitted here. Reviews on some key non body-fitted FEM developments for interface discontinuities can be found in [34, 35].

Two primary approaches in meshfree methods have been proposed for handling material interface weak discontinuities. The first approach involves introducing discontinuities in the meshless approximation function. Krongauz and Belytschko proposed two types of jump enrichment functions into the conventional Moving Least Squares or Reproducing Kernel (RK) approximation of the field variables [33]. The enrichment functions introduce discontinuous derivatives into solutions along material interfaces, but additional unknowns must be solved in this method. Chen et al. [36] introduced the jump enrichment functions into the RK shape function based upon enforcing the consistency conditions, which is termed the interface-enriched reproducing kernel approximation. However, coupling interface-enriched RK shape functions with the standard RK shape function requires duplicated unknowns. In addition, Masuda and Noguchi introduced a discontinuous derivative basis functions to replace the conventional polynomial basis function used in Moving Least-Squares approximations [37]. Another class of methods introduces modifications to the weak formulation to consider the effects of discontinuity in a weak sense. Codes and Moran treated material interface discontinuities by a Lagrange Multiplier technique so that the approximations are disjoint across the interfaces while the Lagrange Multiplier imposes the interface continuity constraints into the variational formulation of the meshfree discretization [38]. This approach introduces additional degrees of freedom to be solved associated with the Lagrange Multiplier, and stability conditions need additional attention. On the other hand, the discontinuous Galerkin formulation has also been considered, where the continuity of a field variable and its resulting interface flux or traction across interfaces are imposed in the weak form [39, 40]. Wang et al. proposed a Discontinuous Galerkin reformulation of the EFG and RKPM to address interface discontinuity problems of composite materials [41]. This approach avoids duplicated unknowns, and by decomposing the domain into patches, the gradient jump of the dependent variable is captured by the boundary of the adjacent patches while the continuity condition is realized weakly through an augmented variational form with associated flux or traction crossing material interfaces. Additionally, other meshfree methods have also been proposed for non-body-fitted discretization of heterogeneous media, such as the immersed methods [42, 43]. However, these methods require special care of interface oscillations due to the employment of volumetric constraints on the foreground and background discretization.

The current work introduces a novel Interface-Modified Reproducing Kernel Particle Method (IM-RKPM) to properly handle weak discontinuities in composite materials across material interfaces. The proposed approach utilizes signed distance functions obtained from SVM classified micro-CT images to introduce regularized weak discontinuities to the kernel functions for arbitrary interface geometries. No duplicated unknowns, special enrichment functions, or complicated reformulation of the RK shape functions are required in the proposed approach, offering automated model construction capabilities for modeling complex microstructures.

The remainder of the paper is organized as follows. Section 2 provides basic equations for the model problem and the Reproducing Kernel Particle Method, and the associated numerical domain integration techniques are also discussed in this section. A brief introduction of the SVM formulation and the proposed SVM and RK-guided procedures for image segmentation of heterogenous materials are introduced in Sect. 3. Section 4 presents an interface-modified kernel function and the Interface-Modified Reproducing Kernel Particle Method formulation to introduce weak discontinuities in the image-based modeling of composite microstructures. In Sect. 5, two numerical examples for image-based modeling of microstructures are demonstrated, and the paper concludes with a discussion and summary in Sect. 6. Throughout the paper, the following abbreviations and symbols have been introduced:

SVM: Support Vector Machine
IM-RKPM: Interface-Modified Reproducing Kernel Particle Method
SCNI: Stabilized Conforming Nodal Integration
micro-CT: micro-X-ray Computed Tomography
ROI: Region of Interest
QPP: Quadratic Programming Problem
FEM: Finite Element Method
EFG: Element-Free Galerkin
RKPM: Reproducing Kernel Particle Method
SVM-RK: Support Vector Machine Guided Reproducing Kernel
B2: Quadratic B Spline Kernel Function
B3: Cubic B Spline Kernel Function
SEM: Scanning Electron Microscopy
EDS: Energy Dispersive X-ray Spectroscopy
PK: Power Kernel Function
${\mathbb{S}}^{\mathrm{RK}}$: SVM-RK discretization node set containing total number of $NP$ nodes
${\Psi }_{I}\left({\varvec{x}}\right)$: RK shape function with support centered at node ${{\varvec{x}}}_{I}$ and evaluated at point ${\varvec{x}}$
${\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{{\varvec{I}}}\right)$: RK kernel function with a compact support $a$ defined over a node $I$’s associated subdomain ${\Omega }_{I}$
${{\varvec{H}}}^{T}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$: Vector of monomial basis functions to the order of $n$
${\varvec{M}}\left({\varvec{x}}\right)$: Moment matrix
$\widetilde{\nabla }{\Psi }_{I}\left({{\varvec{x}}}_{L}\right)$: Smoothed RK shape function gradient evaluated at nodal integration point ${{\varvec{x}}}_{L}$
$\widetilde{{\varvec{B}}}\left({{\varvec{x}}}_{{\varvec{L}}}\right)$: Gradient matrix associated with smoothed nodal RK shape function gradient
$\mathbf{D}={\left\{\left({{\varvec{x}}}_{i},{y}_{i}\right)\right\}}_{i=1}^{l}$: Labeled SVM training data set containing $l$ pairs of training data ${{\varvec{x}}}_{i}\in {\mathbb{R}}^{d}$ and corresponding response label ${y}_{i}\in \{-\mathrm{1,1}\}$
${\varvec{w}}$: $d$-dimensional weight vector
$b$: A scalar bias
$h\left({\varvec{x}};\left\{{\varvec{w}},b\right\}\right)$: $d$-dimensional separating hyperplane
${\left\{{{\varvec{x}}}_{i}^{SV}\right\}}_{i=1}^{{N}_{SV}}$: The set of support vectors with a total number of ${N}_{sv}$ data points
${\xi }^{*}$: Margin
${h}^{*}\left({\varvec{x}}\right)$: Optimal separation hyperplane that maximizes the margin
$\nabla {\epsilon }_{i}$: Slack variable corresponding to the ${i}^{th}$ data point
$C$: Penalty weight parameter
${\varvec{K}}\left({{\varvec{x}}}_{i},{{\varvec{x}}}_{j}\right)$: Kernel function used in non-linear SVM formulation
$\gamma $: Gaussian radial basis kernel scale
$S\left({\varvec{x}}\right)$: SVM classification score function
${\mathbb{S}}^{0}$: A set of training data points located at image pixel centroids with a total number of $N{P}_{0}$ points
${\mathbb{S}}^{+}$: A subset of ${\mathbb{S}}^{0}$ that contains elements with non-negative classification scores
${\mathbb{S}}^{-}$: A subset of ${\mathbb{S}}^{0}$ that contains elements with negative classification scores
${\left\{{{\varvec{x}}}_{K}^{+}, {{\varvec{x}}}_{K}^{-}\right\}}_{K=1}^{N{P}_{IF}}$: Interface-searching node pairs with ${{\varvec{x}}}_{K}^{\pm }\in {\mathbb{S}}^{\pm }$ as a near-interface master/slave node
${d}_{K}^{*}$: A scaler line search step for identifying interface node ${{\varvec{x}}}_{K}^{*}$
${{\varvec{R}}}_{K}$: Interface node line search direction for identifying interface node ${{\varvec{x}}}_{K}^{*}$
${\mathbb{S}}^{IF}$: Interface node set, containing a total number of $N{P}^{*}$ identified interface node ${{\varvec{x}}}^{*}$
${\mathbb{S}}^{C+}$: Master candidate node set, a subset of ${\mathbb{S}}^{+}$ containing near-interface master nodes
${\mathbb{S}}^{C-}$: Slave candidate node set, a subset of ${\mathbb{S}}^{-}$ containing near-interface slave nodes
$\widetilde{S}\left({\varvec{x}}\right)$: RK shape function interpolated classification score function
${s}_{I}$: SVM classification score value for node ${{\varvec{x}}}_{I}\in {\mathbb{S}}^{0}$
${\Psi }_{I}^{0}\left({\varvec{x}}\right)$: RK shape function centered at node ${{\varvec{x}}}_{I}\in {\mathbb{S}}^{0}$
$\widetilde{H}$: Regularized Heaviside function for interface modification
${\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$: Modified RK kernel function with a compact support $a$
$\overline{\xi }({\varvec{x}})$: A distance measure normalized with respected to the nodal spacing
$\overline{{\varvec{M}} }\left({\varvec{x}}\right)$: Modified moment matrix
${\overline{\Psi } }_{I}\left({\varvec{x}}\right)$: Interface modified reproducing kernel shape function
$\left\{{{\varvec{x}}}_{K}^{*+},{{\varvec{x}}}_{K}^{*-}\right\}$: Mirrored node pair for the interface node ${{\varvec{x}}}_{K}^{*}$

2 Basic equations

2.1 Model problem

Let a model elasticity problem be defined on a domain $\Omega $ with its boundary assigned as $\partial\Omega =\partial {\Omega }_{g}\cup \partial {\Omega }_{t}$, $\partial {\Omega }_{g}\cap \partial {\Omega }_{t}= \varnothing $, where the subscripts $g$ and $t$ denote the Dirichlet and Neumann boundaries, respectively. The strong form for heterogeneous elastic media can be described as:

$$ \begin{aligned} \nabla \cdot {\varvec{\sigma}} + {\varvec{s}} & = 0\;\;{\text{in }}\;\;{\Omega } \\ {\varvec{u}} & = \overline{\user2{u}}\;\;{\text{ on }}\;\;\partial {\Omega }_{g} \\ {\varvec{n}} \cdot {\varvec{\sigma}} & = {\varvec{t}}\;\;{\text{ on}}\;\;{ }\partial {\Omega }_{t} \\ \end{aligned} $$

(1)

where ${\varvec{u}}$ represents the unknown displacement field, ${\varvec{\sigma}}$ is the Cauchy stress tensor, ${\varvec{s}}$ is the body force vector, $\overline{{\varvec{u}} }$ and ${\varvec{t}}$ are the prescribed displacement vector and the applied surface traction vector on the Dirichlet and Neumann boundaries, respectively, and ${\varvec{n}}$ is the unit outward normal of the Neumann boundaries. The elastic constitutive relationship for heterogeneous materials is represented as:

$${\varvec{\sigma}}\left({\varvec{x}}\right)={\varvec{C}}\left({\varvec{x}}\right):{\varvec{\varepsilon}}\left({\varvec{u}}\left({\varvec{x}}\right)\right)$$

(2)

Here ${\varvec{C}}\left({\varvec{x}}\right)$ is the elasticity tensor defined as:

$$ {\varvec{C}}\left( {\varvec{x}} \right) = \left\{ {\begin{array}{*{20}c} {{\varvec{C}}^{1} , \quad \varvec{x} \in {\Omega }^{1} } \\ {{\varvec{C}}^{2} , \quad \varvec{x}\in {\Omega }^{2} } \\ \end{array} } \right. $$

(3)

where ${\Omega }^{i}$ are material sub-domains to be segmented by the SVM classification of microstructure image pixels.

The weak formulation is to find ${\varvec{u}}\left({\varvec{x}}\right)\in U\subset {H}_{g}^{1}$, such that for all weight function ${\varvec{v}}\left({\varvec{x}}\right)\in V\subset {H}_{0}^{1}$,

$${\int }_{\Omega }{\varvec{\varepsilon}}\left({\varvec{v}}\right):{\varvec{\sigma}}\left({\varvec{u}}\right)d\Omega ={\int }_{\Omega }{\varvec{v}}\cdot {\varvec{s}}d\Omega +{\int }_{\partial {\Omega }_{h}}{\varvec{v}}\cdot {\varvec{t}}d\Gamma $$

(4)

The Galerkin formulation seeks the trial solution function ${{\varvec{u}}}^{h}\in {U}^{h}\subset U$, so that for all weight function ${{\varvec{v}}}^{h}\in {V}^{h}\subset V$,

$${\int }_{\Omega }{\varvec{\varepsilon}}\left({{\varvec{v}}}^{h}\right):{\varvec{\sigma}}\left({{\varvec{u}}}^{h}\right)d\Omega ={\int }_{\Omega }{{\varvec{v}}}^{h}\cdot {\varvec{s}}d\Omega +{\int }_{\partial {\Omega }_{h}}{{\varvec{v}}}^{h}\cdot {\varvec{t}}d\Gamma $$

(5)

2.2 Reproducing kernel approximation

Let a closed domain $\overline{\Omega }=\Omega \cup \partial\Omega \subset {\mathbb{R}}^{d}$ be discretized by a set of $NP$ nodes denoted by ${\mathbb{S}}^{{{\text{RK}}}} = \left\{ {{\varvec{x}}_{1} ,{\varvec{x}}_{2} , \ldots ,{\varvec{x}}_{NP} | {\varvec{x}}_{I} \in {\overline{\Omega }}} \right\}$, and let the approximation of a field variable ${\varvec{u}}\left({\varvec{x}}\right)$ in $\overline{\Omega }$ be denoted by ${{\varvec{u}}}^{h}\left({\varvec{x}}\right)$. The RK approximation of the field variable ${\varvec{u}}\left({\varvec{x}}\right)$ based on the discrete points in the set ${\mathbb{S}}^{\mathrm{RK}}$ is formulated as follows:

$${u}_{i}^{h}\left({\varvec{x}}\right)={\sum }_{I=1}^{NP}{\Psi}_{I}\left({\varvec{x}}\right){d}_{iI}$$

(6)

where ${\Psi }_{I}$ denotes the RK shape function with support centered at the node ${{\varvec{x}}}_{I}$ and ${d}_{iI}$ is the nodal coefficient in ${i}^{th}$ dimension to be sought. Moreover, let a node $I$ be associated with a subdomain ${\Omega }_{I}$, over which a kernel function ${\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{{\varvec{I}}}\right)$ with a compact support $a$ is defined, such that $\overline{\Omega }\subset {\bigcup }_{I\in {\mathbb{S}}^{\mathrm{RK}}}{\Omega }_{I}$ holds. The RK approximation function is constructed as [30,31,32, 44]:

$$ \begin{aligned} \Psi _{I} \left( \user2{x} \right) & \!=\! C\left( {\user2{x};\user2{x} \!-\! \user2{x}_{I} } \right)\phi _{a} \left( {\user2{x} \!-\! \user2{x}_{I} } \right) \!=\! \left( {\mathop \sum \limits_{{\left| \upalpha \right| \le n}} \left( {\user2{x} \!-\! \user2{x}_{I} } \right)^{\upalpha } b_{\upalpha } \left( \user2{x} \right)} \right)\phi _{a} \left( {\user2{x} \!-\! \user2{x}_{I} } \right) \\ & \equiv \user2{H}^{{\text{T}}} \left( {\user2{x} \!-\! \user2{x}_{I} } \right)\user2{b}\left( \user2{x} \right)\phi _{a} \left( {\user2{x} \!-\! \user2{x}_{I} } \right)\end{aligned} $$

(7)

$${{\varvec{H}}}^{T}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)=[1, {x}_{1}-{x}_{1I}, {x}_{2}-{x}_{2I}, \dots , {\left({x}_{3}-{x}_{3I}\right)}^{n}]$$

(8)

where $\upalpha$ is a multi-index notation such that ${\upalpha }=(\upalpha_{1},\upalpha_{2}, \dots ,{{\upalpha }}_{d})$ with a length defined as $\left|{\upalpha }\right|={{\upalpha }}_{1}+{{\upalpha }}_{2}+\cdots +{{\upalpha }}_{d}$, and ${{\varvec{x}}}^{\alpha }\equiv {{\varvec{x}}}_{1}^{{\alpha }_{1}}\cdot {{\varvec{x}}}_{2}^{{\alpha }_{2}},\dots ,{{\varvec{x}}}_{d}^{{\alpha }_{d}}$, ${b}_{{\upalpha }}={b}_{{{\upalpha }}_{1}{{\upalpha }}_{2}\cdots {{\upalpha }}_{d}}$. The term $C\left({\varvec{x}};{\varvec{x}}-{{\varvec{x}}}_{I}\right)= {{\varvec{H}}}^{\mathrm{T}}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right){\varvec{b}}\left({\varvec{x}}\right)$ is called the correction function of the kernel ${\phi }_{a}\left({{\varvec{x}}-{{\varvec{x}}}_{I}}\right)$ designed to introduced completeness to the RK approximation. The terms ${\left\{{\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)}^{{\upalpha }}\right\}}_{\left|{\upalpha }\right|\le n}$ form a set of basis functions, and ${{\varvec{H}}}^{T}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$ is the corresponding vector of basis functions to the order $n$. The vector ${\varvec{b}}\left({\varvec{x}}\right)$ is the coefficient vector of ${\left\{{b}_{\alpha }\left({\varvec{x}}\right)\right\}}_{\left|\alpha \right|\le n}$ and is solved by enforcing the following discrete reproducing conditions [45]:

$${\sum }_{I=1}^{NP}{\Psi }_{I}\left({\varvec{x}}\right){{\varvec{x}}}_{I}^{{\upalpha }}={{\varvec{x}}}^{{\upalpha }},\quad \left| {\upalpha } \right| \le n $$

(9)

or equivalently,

$$ \begin{aligned} & \mathop \sum \limits_{I = 1}^{NP} {\Psi }_{I} \left( {\varvec{x}} \right)\left( {{\varvec{x}} - {\varvec{x}}_{I} } \right)^{{\upalpha }} = \delta_{{0{\upalpha }}} ,\quad \left| {\upalpha } \right| \le n \\ & {\text{Or}} \\ & \mathop \sum \limits_{I = 1}^{NP} {\Psi }_{I} \left( {\varvec{x}} \right){\varvec{H}}\left( {{\varvec{x}} - {\varvec{x}}_{I} } \right) = {\varvec{H}}\left( \varvec{0} \right) \\ \end{aligned} $$

(10)

where ${\varvec{H}}\left(\varvec{0}\right)={\left[\mathrm{1,0},\dots ,0\right]}^{T}$ according to Eq. (8). After inserting Eq. (7) into Eq. (10), ${\varvec{b}}\left({\varvec{x}}\right)$ is obtained as:

$${\varvec{b}}\left({\varvec{x}}\right)={{\varvec{M}}}^{-1}\left({\varvec{x}}\right){\varvec{H}}\left(\varvec{0}\right){\phi }_{a}\left({{\varvec{x}} - {\varvec{x}}_{I} }\right)$$

(11)

where ${\varvec{M}}\left({\varvec{x}}\right)$ is the moment matrix and is formulated as:

$${\varvec{M}}\left({\varvec{x}}\right)=\sum_{I=1}^{NP}{\varvec{H}}\left({{\varvec{x}} - {\varvec{x}}_{I} }\right){{\varvec{H}}}^{\mathrm{T}}\left({{\varvec{x}} - {\varvec{x}}_{I} }\right){\phi }_{a}({{\varvec{x}} - {\varvec{x}}_{I} })$$

(12)

Finally, the RK shape function is obtained as:

$${\Psi }_{I}\left({\varvec{x}}\right)={{\varvec{H}}}^{\mathbf{T}}\left(\varvec{0}\right){{\varvec{M}}}^{-1}\left({\varvec{x}}\right){\varvec{H}}\left({{\varvec{x}} - {\varvec{x}}_{I} }\right){\phi }_{a}\left({{\varvec{x}} - {\varvec{x}}_{I} }\right)$$

(13)

Examples of 1-dimensional kernel functions are shown in Fig. 2, and the 1- and 2-dimensional RK shape functions constructed based on cubic B-spline kernel function and linear basis functions are shown in Fig. 3. The locality and the smoothness of the RK approximation functions are determined by the kernel function, while the order of completeness in the approximation is determined by the order of basis functions $n$. Interested readers are referred to [30,31,32, 44, 46, 47] for basic properties of reproducing kernel approximation.

2.3 Numerical domain integration

Due to the rational nature and arbitrary local supports, introducing RK approximations in the Galerkin weak form requires special attention. The conventional Gauss integration on background integration cells leads to a sub-optimal convergence unless significantly high-order quadrature rules are used, which is computationally infeasible especially in three-dimension [44]. More recent quadrature integration methods have been developed to handle broken integration cells commonly encountered in non-geometrically conforming meshes, such as the moment fitting based methods [48, 49] and smart octree based methods [50]. Those quadrature methods can achieve similar accuracy with low order quadrature points compared to the conventional Gauss integration and handle complex and irregular integration domains. However, it would be natural for meshfree methods to use nodal-based numerical integrations. Several nodal-based integration techniques have been proposed, such as stabilized conforming and non-conforming nodal integrations [51,52,53,54,55,56] and variationally consistent integration [57], along with various stabilization methods [58,59,60]. The stabilized conforming nodal integration methods is utilized in this work and is summarized in this section.

2.3.1 Stabilized conforming nodal integration

One solution that significantly eases the computational cost of Gauss domain integration is to use the discretized nodes as integration points [58], referred to as the direct nodal integration. The direct nodal integration technique is appealing because of its simplicity and efficiency, as it does not require a background integration mesh, which makes numerical approximation truly "mesh-free." Nevertheless, since it is similar to a one-point quadrature rule, the under-integration of the weak form results in improper zero energy modes in most situations [51, 58].

Chen et al. [51] introduced the Stabilized Conforming Nodal Integration (SCNI) method as an enhancement of the direct nodal integration by fulfilling the consistency conditions between the approximations and the numerical integration of the weak form known as the integration constraints. The SCNI method is formulated to exactly meet the first-order integration constraint and also to remedy the rank deficiency in the direct nodal integration method by introducing the following smoothed gradient in the Galerkin approximation:

$$ \tilde{\nabla }\Psi_{I} \left( {{\varvec{x}}_{L} } \right) = \frac{1}{{W_{L} }}\mathop \smallint \limits_{{\Omega_{L} }} \nabla \Psi_{I} \left( {\varvec{x}} \right){\text{d}}\Omega = \frac{1}{{W_{L} }}\mathop \smallint \limits_{{\partial \Omega_{L} }} \Psi_{I} \left( {\varvec{x}} \right){\varvec{n}}{\text{d}}\Gamma , \quad W_{L} = \mathop \smallint \limits_{{\Omega_{{\text{L}}} }} d\Omega $$

(14)

where ${\Omega }_{L}$ denotes the nodal representative conforming smoothing cells, and ${\varvec{n}}$ represents the unit outward normal of the smoothing cell boundaries. A convenient way of generating conforming smoothing cells is to create the Voronoi diagram according to the domain boundaries and nodal coordinates, as illustrated in Fig. 4, where the boundary integral in the smoothed gradient in Eq. (14) is carried out by the cell boundary quadrature points.

The associated gradient matrix $\widetilde{{\varvec{B}}}\left({{\varvec{x}}}_{{\varvec{L}}}\right)$ of RK approximation evaluated at nodal integration point ${{\varvec{x}}}_{{\varvec{L}}}$ is now expressed in terms of smoothed gradient as:

$$ \begin{aligned} \tilde{\user2{B}}_{I} \left( {{\varvec{x}}_{L} } \right) & = \left[ {\begin{array}{*{20}c} {\tilde{b}_{I1} \left( {{\varvec{x}}_{L} } \right)} & 0 \\ 0 & {\tilde{b}_{I2} \left( {{\varvec{x}}_{L} } \right)} \\ {\tilde{b}_{I2} \left( {{\varvec{x}}_{L} } \right)} & {\tilde{b}_{I1} \left( {{\varvec{x}}_{L} } \right)} \\ \end{array} } \right] \\ \tilde{b}_{Ii} \left( {{\varvec{x}}_{L} } \right) & = \frac{1}{{W_{L} }}\mathop \int \limits_{{{\Gamma }_{L} }}^{{}} {\Psi }_{I} \left( {\varvec{x}} \right)n_{i} \left( {\varvec{x}} \right)d{\Gamma } \\ \end{aligned} $$

(15)

The stiffness matrix is then integrated by nodal integration with the smoothed gradient as:

$${\varvec{K}}= \sum_{L=1}^{NP}{\widetilde{{\varvec{B}}}}^{{\varvec{T}}}({{\varvec{x}}}_{L}){\varvec{C}}({{\varvec{x}}}_{L})\widetilde{{\varvec{B}}}({{\varvec{x}}}_{L}){W}_{L}$$

(16)

It is noted that Voronoi cells conformed to the material interfaces without confining to the existing pixel points can be constructed, as demonstrated in Fig. 4. In such construction, the centers of those Voronoi cells can be viewed as the integration points that are not coincided with the image pixel points. A naturally stabilized nodal integration [56] can be added to SCNI for additional stabilization. Details of constructing those Voronoi cells near the material interface are given in Appendix A.

3 Support vector machine (SVM) classification of micro-CT images and model discretization

3.1 Support vector machine (SVM) classification algorithm

The Support Vector Machine (SVM) is a class of supervised machine learning algorithms that assigns labels to objects through training [61]. Let a labeled classification dataset containing $l$ sets of data be denote as $\mathbf{D}={\left\{\left({{\varvec{x}}}_{i},{y}_{i}\right)\right\}}_{i=1}^{l}$, where ${{\varvec{x}}}_{i}\in {\mathbb{R}}^{d}$ is the $d$-dimensional data points, and ${y}_{i}$ is the label corresponding to the ith data point. Since the primary focus of this work is bi-material classification, ${y}_{i}$ is assumed to be either $-1$ or $+1$, representing the negative (matrix) and positive (inclusion) classes, respectively. If the given dataset is perfectly linearly separable, the SVM classification process can be described as to find a separating hyperplane in the form of a linear discriminant function in $d$-dimension:

$$h\left({\varvec{x}};\left\{{\varvec{w}},b\right\}\right)={{\varvec{w}}}^{T}{\varvec{x}}+b$$

(17)

where ${\varvec{w}}$ denotes a $d$-dimensional weight vector and $b$ denotes a scalar bias. Additionally, $h({\varvec{x}};\left\{{\varvec{w}},b\right\})$ serves as a linear classifier for class prediction following the decision rule:

$$ y = \left\{ {\begin{array}{*{20}c} { + 1 \quad if\; h\left( {{\varvec{x}};\left\{ {{\varvec{w}},b} \right\}} \right) > 0} \\ { - 1\quad if \;h\left( {{\varvec{x}};\left\{ {{\varvec{w}},b} \right\}} \right) < 0} \\ \end{array} } \right. $$

(18)

Therefore, the weight vector ${\varvec{w}}$ is orthogonal to the defined hyperplane, and for each ${{\varvec{x}}}_{i}\in \mathbf{D}$, the relative distance in terms of ${\varvec{w}}$ to the defined hyperplane can be expressed as:

$${\xi }_{i}=\frac{{y}_{i}\left({{\varvec{w}}}^{T}{{\varvec{x}}}_{i}+b\right)}{\Vert {\varvec{w}}\Vert }$$

(19)

The margin of the linear classifier is identified by selecting a collection of the data points that achieve a minimum distance to $h({\varvec{x}};\left\{{\varvec{w}},b\right\})$, which are called the support vectors ${\left\{{{\varvec{x}}}_{i}^{SV}\right\}}_{i=1}^{{N}_{SV}}$, and are defined as:

$${\left\{{{\varvec{x}}}_{i}^{SV}\right\}}_{i=1}^{{N}_{SV}}=\underset{{\varvec{x}}_{i}\in {\varvec{D}}}{\mathrm{argmin}}\left\{\frac{{y}_{i}\left({{\varvec{w}}}^{T}{{\varvec{x}}}_{i}+b\right)}{\Vert {\varvec{w}}\Vert }\right\}$$

(20)

Note that if the distances of all support vectors to the hyperplane are normalized to be $1$, the margin can be defined as ${\xi }^{*}=\frac{1}{\Vert {\varvec{w}}\Vert }$. Thus, the goal of training the SVM with a linear classifier as Eq. (17) can be described as to find the optimal hyperplane ${h}^{*}({\varvec{x}})$ as follows:

$$ \begin{aligned} & h^{*} \left( {\varvec{x}} \right) = h\left( {{\varvec{x}};\left\{ {{\varvec{w}}^{*} ,b^{*} } \right\}} \right), \\ & {\varvec{w}}^{*} , b^{*} = \mathop {{\text{argmax}}}\limits_{{{\varvec{w}},b}} \left\{ {\frac{1}{{\Vert {\varvec{w}}}\Vert }} \right\}, \\ & {\text{subject }}\;\;{\text{to}}:{ }\;{ }y_{i} \left( {{\varvec{w}}^{T} {\varvec{x}}_{i} + b} \right) \ge 1,{ }\;\;\;{ }\forall {\varvec{x}}_{i} \in {\mathbf{D}} \\ \end{aligned} $$

(21)

The constrained optimization problem described in Eq. (21) can be formulated as an equivalent convex constrained minimization problem:

$$ \begin{aligned} & \mathop {\min }\limits_{{{\varvec{w}},b}} J\left( {\varvec{w}} \right) = \frac{1}{2}{\Vert {\varvec{w}} \Vert }^{2} , \\ & {\text{subject to}}:\;\;{ }y_{i} \left( {{\varvec{w}}^{T} {\varvec{x}}_{i} + b} \right) \ge 1,{ }\;\;\;{ }\forall {\varvec{x}}_{i} \in {\mathbf{D}} \\ \end{aligned} $$

(22)

which is called the primal formulation of SVM with linear classifier [18]. Instead of directly solving the primal convex minimization problem, it is computationally more efficient to solve the dual problem, formulated using the Lagrange multipliers. To construct the dual problem, a Lagrange multiplier ${\lambda }_{i}$ is introduced for each linear constraint based on the Karush–Kuhn–Tucker (KKT) conditions [62]:

$${\lambda }_{i}\left({y}_{i}\left({{\varvec{w}}}^{T}{{\varvec{x}}}_{i}+b\right)-1\right)=0, {\lambda }_{i}\ge 0$$

(23)

Then the objective of the dual problem can be formulated as:

$$\underset{{\varvec{w}},b}{\mathrm{min}}L=\frac{1}{2}{\Vert {\varvec{w}}\Vert }^{2}-\sum_{i=1}^{l}{\lambda }_{i}\left({y}_{i}\left({{\varvec{w}}}^{T}{{\varvec{x}}}_{i}+b\right)-1\right)$$

(24)

By finding the stationary point of the Lagrangian $L$ with respect to ${\varvec{w}}$, the optimal weight vector ${\varvec{w}}$ can be expressed in terms of a linear combination of the data points, data label, and Lagrange multipliers:

$${\varvec{w}}=\sum_{i=1}^{l}{\lambda }_{i}{y}_{i}{{\varvec{x}}}_{i}$$

(25)

Additionally, a new constrain arises when minimizing $L$ with respect to the bias, which indicates that the sum of the labeled Lagrange multipliers must be equal to zero. Therefore, by substituting Eq. (25) into Eq. (24), the dual problem’s training objective can be formulated as:

$$ \begin{aligned} & \mathop {\max }\limits_{{\varvec{\lambda}}} L_{dual} = \mathop \sum \limits_{i = 1}^{l} \lambda_{i} - \frac{1}{2}\mathop \sum \limits_{i = 1}^{l} \mathop \sum \limits_{j = 1}^{l} \lambda_{i} \lambda_{j} y_{i} y_{j} {\varvec{x}}_{i}^{T} {\varvec{x}}_{j} , \\ & {\text{subject to}}: \lambda_{i} \ge 0, \mathop \sum \limits_{i = 1}^{l} \lambda_{i} y_{i} = 0,\quad {\text{for}}\;\;{ }i = 1,2,3, \ldots ,l \\ \end{aligned} $$

(26)

where Eq. (26) forms a well-known convex quadratic programming problem (QPP).

Nevertheless, it is possible that no such hyperplane can be found through Eq. (26) as the real-world data sets are rarely perfectly separable. To deal with cases with overlapping classes, a non-negative slack variable $\nabla {\epsilon }_{i}$ is introduced to each data point ${{\varvec{x}}}_{i}\in \mathbf{D}$, such that the linear constraint in Eqs. (21) and (22) is modified as:

$$ \begin{aligned}&{y}_{i}\left({{\varvec{w}}}^{T}{{\varvec{x}}}_{i}+b\right)\ge 1-\nabla {\epsilon }_{i}, \\ &\quad \forall {{\varvec{x}}}_{i}\in {\mathbf{D}},\nabla {\epsilon }_{i}\ge 0,\forall i=\mathrm{1,2},3,\ldots ,l \end{aligned}$$

(27)

It is worth noting that the magnitude of $\nabla {\epsilon }_{i}$ affects the correctness of the classification of its corresponding data point ${{\varvec{x}}}_{i}$: when $\nabla {\epsilon }_{i}\ge 1$, the data point will be misclassified as it appears on the wrong side of the hyperplane. As a result, for non-separable data sets, SVM introduces a “soft margin” concept into the training process, and the new training objective function can be described as:

$$ \begin{aligned} & \mathop {\min }\limits_{{{\varvec{w}},b,\left\{ {\nabla \epsilon_{i} } \right\}}} \left\{ {\frac{1}{2}\|{\varvec{w}}\|^{2} + C\mathop \sum \limits_{i = 1}^{l} \nabla \epsilon_{i} } \right\}, \\ & {\text{subject to}}: \;\; y_{i} \left( {{\varvec{w}}^{T} {\varvec{x}}_{i} + b} \right) \ge 1 - \nabla \epsilon_{i} ,{ }\quad \forall {\varvec{x}}_{i} \in {\mathbf{D}}, \\ & \nabla \epsilon_{i} \ge 0,\quad \forall i = 1,2,3, \ldots ,l \\ \end{aligned} $$

(28)

where $C$ is a weight parameter that penalizes the cost of misclassification and $\sum_{i=1}^{l}\nabla {\epsilon }_{i}$ gives the loss due to the deviation from the separable cases with the introduction of slack variables. Moreover, the penalty weight parameter $C$ controls the trade-off between maximizing the hyperplane’s margin and minimizing the misclassification’s loss. Therefore, the selection of $C$ depends on the nature of problems and datasets at hand. Figure 5 presents the hard- and soft-margined linear SVM classifiers trained on a binary dataset. For the hard-margined SVM classifier, the support vectors are located exactly on the maximum lower and upper margins, while the soft-margined SVM classifier relaxes the linear separability constraints, which allows some support vectors to cross over the decision boundary. Note that although the dataset in Fig. 5 is linearly separable, if the linear separability is strictly enforced, as for the hard-margined case, the resulting margin is significantly smaller than the one for the soft-margined case.

By introducing Lagrange multipliers ${\lambda }_{i}$ and ${\beta }_{i}$, corresponding to each of the constraints in Eq. (28), a Lagrangian of Eq. (28) can be formulated as:

$$L=\frac{1}{2}{\Vert {\varvec{w}}\Vert }^{2}+C\sum_{i=1}^{l}\nabla {\epsilon }_{i}-\sum_{i=1}^{l}{\lambda }_{i}\left({y}_{i}\left({{\varvec{w}}}^{T}{{\varvec{x}}}_{i}+b\right)-1+\nabla {\epsilon }_{i}\right)-\sum_{i=1}^{l}{\beta }_{i}\nabla {\epsilon }_{i}$$

(29)

By minimizing the $L$ with respect to ${\varvec{w}}$, $b$, and $\nabla {\epsilon }_{i}$, respectively, one additional relationship connecting the penalty weight parameter $C$ to ${\lambda }_{i}$ and ${\beta }_{i}$ is obtained, in addition to the ones acquired in the linearly separable cases:

$${C=\lambda }_{i}+{\beta }_{i}$$

(30)

Therefore, the dual objective can be described as:

$$ \begin{gathered} \mathop {\max }\limits_{{\varvec{\lambda}}} L_{dual} = \mathop \sum \limits_{i = 1}^{l} \lambda_{i} - \frac{1}{2}\mathop \sum \limits_{i = 1}^{l} \mathop \sum \limits_{j = 1}^{l} \lambda_{i} \lambda_{j} y_{i} y_{j} {\varvec{x}}_{i}^{T} {\varvec{x}}_{j} , \hfill \\ {\text{subject to}}: \; 0 \le \lambda_{i} \le C, \mathop \sum \limits_{i = 1}^{l} \lambda_{i} y_{i} = 0, \quad {\text{for}}\;\;{ }i = 1,2,3, \ldots ,l \hfill \\ \end{gathered} $$

(31)

Note that the objective function achieved for the inseparable cases is the same as the one obtained from the linearly separable cases in Eq. (26), except one additional constraint on the Lagrange multiplier ${\lambda }_{i}$.

For complicated data sets, the linear classifier is often found inadequate. One strategy is to introduce non-linear transformation function $\phi $, which maps the data points ${\varvec{x}}$ to a higher dimension so that the projected data points $\phi ({\varvec{x}})$ are approximately linearly separable in the higher dimensional feature space. However, upscaling the dimensionality usually leads to high and impractical computational costs. Since the Lagrange dual formulation in Eq. (31) only depends on the dot product between two vectors in the feature space, the SVM can utilize the “kernel trick” to include high-degree polynomial features. The idea of the kernel trick is to represent $l$ data point ${\varvec{x}}$ by a $l$ by $l$ kernel matrix ${\varvec{K}}$ that contains elements ${k}_{i,j}={\varvec{K}}\left({{\varvec{x}}}_{i},{{\varvec{x}}}_{j}\right)=\langle \phi \left({{\varvec{x}}}_{i}\right),\phi ({{\varvec{x}}}_{j})\rangle $, which performs pairwise similarity comparisons between the original low dimensional data points without an explicit definition of the transformation function $\phi $ for mapping data to high dimensions. More detailed introduction to the requirement and existence of kernel function can be found in [19]. Therefore, the dual formulation of the training objective for non-linear SVM can be described by replacing ${{\varvec{x}}}_{i}^{T}{{\varvec{x}}}_{j}$ in Eq. (31) by a kernel function ${\varvec{K}}\left({{\varvec{x}}}_{i},{{\varvec{x}}}_{j}\right)$:

$$ \begin{aligned} & \mathop {\max }\limits_{{\varvec{\lambda}}} L_{dual} = \mathop \sum \limits_{i = 1}^{l} \lambda_{i} - \frac{1}{2}\mathop \sum \limits_{i = 1}^{l} \mathop \sum \limits_{j = 1}^{l} \lambda_{i} \lambda_{j} y_{i} y_{j} {\varvec{K}}\left( {{\varvec{x}}_{i} ,{\varvec{x}}_{j} } \right), \\ & {\text{subject }}\;{\text{to}}: 0 \le \lambda_{i} \le C, \mathop \sum \limits_{i = 1}^{l} \lambda_{i} y_{i} = 0,\quad {\text{for}}\;{ }i = 1,2,3, \ldots ,l \\ \end{aligned} $$

(32)

Note that Eq. (32) can be viewed as a generalized Lagrange dual formulation of SVM since for linear cases, the kernel function can be expressed as the data point dot product as:

$$\mathrm{Linear}:{\varvec{K}}\left({{\varvec{x}}}_{i},{{\varvec{x}}}_{j}\right)={{\varvec{x}}}_{i}^{T}{{\varvec{x}}}_{j}$$

(33)

Other widely used kernel functions are polynomial and Gaussian radial basis kernel functions, which are illustrated in Eq. (34) and (35), respectively.

$$\mathrm{Polynomial}: {\varvec{K}}\left({{\varvec{x}}}_{i},{{\varvec{x}}}_{j}\right)={\left(1+{{\varvec{x}}}_{i}^{T}{{\varvec{x}}}_{j}\right)}^{q}, q=\mathrm{2,3},\dots ,$$

(34)

$$\text{Gaussian radial basis}: {\varvec{K}}\left({{\varvec{x}}}_{i},{{\varvec{x}}}_{j}\right)={e}^{-\gamma {\Vert {{\varvec{x}}}_{i}-{{\varvec{x}}}_{j}\Vert }^{2}}, \gamma >0$$

(35)

Figure 6 demonstrates the transformation of the 2-dimensional training data to 3-dimension using a Gaussian kernel. It is clear that the 3-dimensional data points become linearly separable by a 2-dimensional hyperplane, and the resulting separating hyperplane can be projected back to the 2-dimensional space, which becomes a nonlinear decision boundary.

Moreover, the SVM produces a classification score by predicting new datasets that provide information about the material class and reveal the location of material interfaces. The classification score is a signed distance measure for an observation point ${\varvec{x}}$ to its nearest decision boundary, with a score of zero denoting ${\varvec{x}}$ is precisely on the decision boundary. Therefore, the classification score acts as a guide in identifying material boundaries in the image, facilitating more accurate numerical model generation. The classification score for predictions at ${\varvec{x}}$ to the positive class is defined as:

$$S\left({\varvec{x}}\right)=\sum_{j=1}^{{N}_{SV}}{\lambda }_{j}{y}_{j}{\varvec{K}}\left({{\varvec{x}}}_{j},{\varvec{x}}\right)+b$$

(36)

where ${N}_{SV}$ is the total number of support vectors and $({\lambda }_{1},{\lambda }_{2},\dots ,{\lambda }_{{N}_{SV}},b)$ are the trained SVM parameters.

In summary, the SVM algorithms have shown promising performances in many application fields [63]. During the hyperplane selection process, SVMs utilize different kernel functions to transform the low-dimensional, non-linear, and possibly non-separable training data to higher-dimensional feature spaces, which allows the data to be linearly separated. In addition, the selected high-dimensional hyperplane can be projected back down to the original space where the training data belong, providing non-linear decision boundaries between the separated classes [64]. As a result, SVMs not only aid in classifying different material pixels from micro-CT images of heterogeneous materials but also inherently identify material interfaces. Here, we use SVMs to guide numerical model discretization for the proposed Interface-modified Reproducing Kernel Particle Method, which will be introduced in the later sections.

3.2 From micro-CT images to numerical models

In this work, the sample images are taken from micro-CT, which is an imaging technique that generates three-dimensional images of an object's microstructure with (sub)micron resolution using an X-ray tube with cone-beam geometry as a source and a rotating sample holder [65]. The overall processes of generating Support Vector Machine Guided Reproducing Kernel (SVM-RK) numerical model from micro-CT images are summarized in Fig. 7.

3.2.1 Training data preparation and training the SVM

The training data points are located at the centroid of each pixel cell in the sample image, and the physical coordinates of those data points are assigned as the training data for the SVM. To supervise the SVM's training, response labels $y$, are created by segmenting the sample image using Otsu's method [66]. Otsu's method selects a global threshold that maximizes inter-class intensity variance from the zeroth- and the first-order cumulative moments of the sample image's intensity-level histogram. Figure 8 illustrates an alumina-epoxy composite micro-CT image slice, where the white areas in the sample image indicate the alumina inclusion material and the grey areas represent the epoxy material in the matrix. A ROI of 30 $\times $ 30 pixels (area in the red box) containing 4 irregularly shaped alumina particles is selected to demonstrate the SVM training and numerical model generation processes. Note that only the pixel centroid material class assignments and physical coordinates are provided as labeled training data, and the goal of training is to identify material class at arbitrary locations within the image domain for numerical integration purposes.

Specific hyperparameters of the training must be determined beforehand to facilitate SVM classification, which are summarized in Table 1. The kernel scale determines the extent to which each data point affects the shape of the decision boundaries, which is selected as 0.25. Furthermore, a penalty weight parameter $C$ of $500$ is chosen to ensure that the resulting separation hyperplane resembles the material interface. The Gaussian radial basis function is selected as the kernel function based on the geometries of the inclusions, as they are distinctive small particles. The selections of the kernel scale and penalty weight parameter are optimized utilizing an iterative Bayesian optimization process, and the objective function for the Bayesian optimization process is to minimize the fivefold cross validation classification loss. A total of 30 iterations are performed, and once the iterations are completed, the hyperparameter configuration associated with the smallest validation classification loss is selected as the optimal set of hyperparameters for the SVM model. Additionally, a standardization in which the training data points are normalized to have a zero mean and a standard deviation of unity is performed. The standardization of the training data is critical because SVM training is based on the relative distances between the training data points, and without standardization, larger-scale training data may dominate in distance determination, leading to a biased model.

Table 1 SVM hyperparameters selected for training the sample image

Full size table

3.2.2 RK interface nodes

In the Interface-modified RK approximation to be proposed in Sect. 4, a set of interface nodes are included to introduce proper weak discontinuity across material interfaces. In this section, we present an approach to generate interface nodes, utilizing the score $S\left({\varvec{x}}\right)$ (Eq. (36)) that is produced during the SVM classification and can be interpreted as a scaled signed distance function.

Let ${\mathbb{S}}^{0}\equiv {\left\{{{\varvec{x}}}_{I}\right\}}_{I=1}^{N{P}_{0}}$ be the set of training data points located at the image pixel centroids in the image domain $\overline{\Omega }$, and define ${\mathbb{S}}^{+}\equiv \left\{{\varvec{x}}\in {\mathbb{S}}^{0} | S\left({\varvec{x}}\right)\ge 0\right\}$ and ${\mathbb{S}}^{-}\equiv \left\{{\varvec{x}}\in {\mathbb{S}}^{0} | S\left({\varvec{x}}\right)<0\right\}$. Consequently, defining a set of interface-searching node pairs ${\left\{{{\varvec{x}}}_{K}^{+}, {{\varvec{x}}}_{K}^{-}\right\}}_{K=1}^{N{P}_{IF}}$ in which ${{\varvec{x}}}_{K}^{\pm }\in {\mathbb{S}}^{\pm }$ is a near-interface master/slave node (see Remark 3.1 for details). The search of an interface node ${{\varvec{x}}}_{K}^{*}\equiv {{\varvec{x}}}_{K}^{+}+{d}_{K}^{*}{{\varvec{R}}}_{K}$ can be defined as follows:

Find ${d}_{K}^{*}\in {\mathbb{R}}$ such that:

$$S\left({{\varvec{x}}}_{K}^{+}+{d}_{K}^{*}{{\varvec{R}}}_{K}\right)=0, \forall K=1\cdots N{P}^{*}$$

(37)

where ${{\varvec{R}}}_{K}=$ $({{\varvec{x}}}_{K}^{-}-{{\varvec{x}}}_{K}^{+})/\Vert {{\varvec{x}}}_{K}^{-}-{{\varvec{x}}}_{K}^{+}\Vert $ is the line search direction. The resulting RK node set is then ${\mathbb{S}}^{RK}\equiv {\mathbb{S}}^{0}\cup {\mathbb{S}}^{IF}$ with ${\mathbb{S}}^{IF}\equiv {\left\{{{\varvec{x}}}_{K}^{*}\right\}}_{K=1}^{N{P}^{*}}$, which serves as the SVM-RK discretized model.

Remark 3.1

The interface-searching node pairs ${\left\{{{\varvec{x}}}_{K}^{+}, {{\varvec{x}}}_{K}^{-}\right\}}_{K=1}^{N{P}_{IF}}$ can be determined in various ways. In this work, the following approach is taken: given the set of support vectors ${\left\{{{\varvec{x}}}_{L}^{SV}\right\}}_{L=1}^{{N}_{SV}}$, define the master candidate node set ${\mathbb{S}}^{C+}\equiv \left\{{{\varvec{x}}}_{I}\in {\mathbb{S}}^{+} | \Vert {{\varvec{x}}}_{I}-{{\varvec{x}}}_{L}^{SV}\Vert \le \xi \ell, \forall L=1\cdots {N}_{SV}\right\}$, in which $\ell$ and $\xi $ denote the image voxel size and a scaling factor, respectively. In this work $\xi =1.5$ is used. The corresponding nearest slave nodes ${{\varvec{x}}}_{K}^{-}$ are found such that, for ${{\varvec{x}}}_{K}^{+}\in {\mathbb{S}}^{C+}$,

$${{\varvec{x}}}_{K}^{-}=\underset{{{\varvec{x}}}_{I}\in {\mathbb{S}}^{C-}}{\mathrm{argmin}}\Vert {{\varvec{x}}}_{I}-{{\varvec{x}}}_{K}^{+}\Vert $$

(38)

with the slave candidate node set ${\mathbb{S}}^{C-}\equiv \left\{{{\varvec{x}}}_{I}\in {\mathbb{S}}^{-} | \Vert {{\varvec{x}}}_{I}-{{\varvec{x}}}_{L}^{SV}\Vert \le \xi \ell, \forall L=1\cdots {N}_{SV}\right\}$. Note that Eq. (38) can result in multiple ${{\varvec{x}}}_{K}^{-}$ for one ${{\varvec{x}}}_{K}^{+}$ and lead to the master–slave pairs. Figure 9a illustrates an example of the master and slave candidate nodes plotted along with the support vectors. The corresponding candidate node pairs are shown in Fig. 9b.

Remark 3.2

The solution of ${d}_{K}^{*}$ in Eq. (37) can be determined iteratively by the Newton–Raphson method. For the ${\left(\nu +1\right)}^{th}$ iteration, the increment $\Delta {d}^{\nu +1}$ for ${d}_{K}^{*\nu +1}=\Delta {d}_{K}^{*\nu +1}+{d}_{K}^{*\nu }$ is obtained as follows:

$$\Delta {d}_{K}^{*\nu +1}=-S\left({{\varvec{x}}}_{K}^{*\nu }\right)/\left(\frac{\partial S\left({{\varvec{x}}}_{{\varvec{K}}}^{*\nu }\right)}{\partial {{\varvec{x}}}_{K}^{*\nu }}\cdot {{\varvec{R}}}_{K}\right)$$

(39)

Remark 3.3.

One may consider interpolating the score function with SVM predicted nodal score values without constructing Eq. (36) as follows:

$$\widetilde{S}\left({\varvec{x}}\right)=\sum_{I=1}^{N{P}_{0}}{\Psi }_{I}^{0}\left({\varvec{x}}\right){s}_{I}$$

(40)

where ${\Psi }_{I}^{0}\left({\varvec{x}}\right)$ and ${s}_{I}$ are the RK shape function constructed on ${\mathbb{S}}^{0}$ and SVM score value (signed distance) for node ${{\varvec{x}}}_{I}$, respectively. The RK shape function can serve as a filter for potentially noisy predicted score values to provide smoothing on the zig-zag material interface determined directly from image pixels. Equation (40) is used for numerical implementation in the current work.

Remark 3.4.

To ensure relative even distribution of nodes around interfaces, a MATLAB built-in function “uniqetol” with a relative tolerance 0.01 is applied to the interface node set ${\mathbb{S}}^{IF}$. In addition, ${{\varvec{x}}_{I}}\in {\mathbb{S}}^{0}$ is removed if $\Vert {{\varvec{x}}_{I}}-{{\varvec{x}}}_{K}^{*}\Vert <\zeta \ell$ for all ${{\varvec{x}}}_{K}^{*}\in {\mathbb{S}}^{IF}$, and $\zeta =1/3$ is selected in this work. An example of rearranged RK nodes is illustrated in Fig. 10. The material interfaces are represented by a simple line connection in Fig. 10 for visualization purposes; the interface-modified RK approximation to be discussed next requires only the interface point locations and the signed distance of each discrete point obtained from SVM. Note that discretization far from the material interfaces can be made coarser to improve computational efficiency.

3.3 Image-based SVM-RK model validation

3.3.1 Validation with a synthetic image

A synthetic two-phase image containing the known locations of inclusions is generated, as illustrated in Fig. 11. The synthetic image has a dimension of 10 mm × 10 mm and a resolution of 244 × 244 pixels. To account for uncertainties in the imaging process, Gaussian noise is added to the original image, and the manufactured testing image is scaled down to 100 × 100 pixels to lower the resolution, especially around the material interfaces. Figure 12 demonstrates the manufactured noisy testing image, which will serve as the input image for the image-based SVM-RK model generation. The accuracy of the obtained interface nodes is determined by a normalized mean square error of the discretized material interfaces as:

$$MSE=\frac{1}{NC\cdot L}{\left(\sum_{j=1}^{NC}\sum_{K=1}^{N{P}^{*}}{\left(\Vert {{\varvec{x}}}_{K}^{*}-{{\varvec{c}}}_{j}\Vert -{R}_{j}\right)}^{2}\right)}^{\frac{1}{2}}$$

(41)

where ${{\varvec{c}}}_{j}$ and ${R}_{j}$ represent the center coordinates and radius of the inclusion to which the interface node ${{\varvec{x}}}_{K}^{*}\in {\mathbb{S}}^{IF}$ belongs, $N{P}^{*}$ is the total number of generated interface nodes in the set ${\mathbb{S}}^{IF}$, and $NC$ and $L$ denote the total number of inclusions in the synthetic image and the x-dimension of the synthetic image, respectively.

As previously discussed, for implementation the score function is interpolated using the RK shape function in Eq. (40), and the locality and smoothness of the RK shape function may differ depending on the size and order of continuity of the kernel function chosen for its construction. Therefore, various RK kernel support sizes and kernel functions with different orders of continuity are employed to study their effects on the accuracy of the SVM-RK interface node generation algorithm. Figure 13 illustrates the obtained interface nodes overlaid with the synthetic image using a cubic B spline RK kernel (B3, ${\mathrm{C}}^{2}\mathrm{ continuity})$ with a normalized support size of 2 and linear bases. In addition, results of the interface node generated with various support sizes and kernel function continuities can be found in Tables 2 and 3, respectively. Upon comparison of results in Tables 2 and 3, it can be observed that the proposed SVM-RK interface node search algorithm converges less than an average of 5 iterations for all instances, and the resulting interface nodes achieve average scores (Eq. (40)) to the order of ${10}^{-12}$. Additionally, the normalized mean square error of the proposed image-based RK discretization model generation process is approximated 0.65% for all scenarios. Moreover, the results show that the generated interface nodes are not sensitive to the choices of the kernel support size and kernel continuity (tent function with ${\mathrm{C}}^{0}$ continuity, quadratic B spline function (B2) with ${\mathrm{C}}^{1}$ continuity, and B3 with ${\mathrm{C}}^{2}$ continuity) used in constructing the score function (Eq. (40)).

Table 2 Results of interface node search algorithm with various kernel support sizes

Full size table

Table 3 Results of interface node search algorithm with various RK kernel functions

Full size table

3.3.2 Validation of the image-based SVM-RK discrete model with Scanning Electron Microscopy (SEM) images

To analyze the quality of the proposed image-based RK discretization model generation procedure, a comparison is made between the constructed digital surface model from micro-CT and a surface image obtained from Scanning Electron Microscopy (SEM) with a spatial resolution of 1.5 μm for the same specimen as a comparison reference. SEM uses an electron beam to scan the surface of a material, producing a high-resolution image that reveals details such as surface topography, crystalline structure, chemical composition, and electrical behavior of the top 1 μm portion of a specimen [67]. The inclusion materials in SEM are identified based on the Energy Dispersive X-ray Spectroscopy (EDS), a chemical analysis technique that detects X-rays emitted by the material in response to the electron beam to form an elemental mapping of the SEM-scanned specimen surface [68]. The micro-CT input image for constructing the numerical model is selected accordingly near the surface of the same specimen, and a ROI around $2.82$ mm by $2.37$ mm is chosen to match the SEM scanned area, which is highlighted in the red box in Fig. 14.

An SVM-RK discretization model is created from the input 2D slice of the micro-CT image using the proposed method, as illustrated in Fig. 15, containing discretized nodes in the epoxy matrix, alumina inclusions, and on the identified material interfaces. The constructed SVM-RK discretization model is superimposed over the original micro-CT image, and the result is shown in Fig. 16. Furthermore, the alumina inclusions enclosed by the identified interface nodes in the constructed SVM-RK discretization model are highlighted and overlaid onto the SEM surface image, which is contrasted with the EDS-layered SEM image shown in Fig. 17. As can be seen, the obtained image-based RK discretization model agrees well with the input micro-CT image in detail.

Remark 3.5

It is worth noting that the surface of the specimen was polished to enhance imaging quality for SEM, which may cause slight alterations to the distribution of surface particles. Figure 18 illustrates the minor discrepancies between the SEM and micro-CT images. As the blue boxes indicate, misalignments can be observed for certain inclusion particles. Additionally, smaller particles were not captured in the SEM scan, as highlighted in the red boxes. Consequently, the inclusion particles identified by the SVM-RK discretization model exhibit slight variances compared to the EDS elemental mapping result, as illustrated in Fig. 17. Nevertheless, they are mostly consistent, particularly for the larger and more distinctive inclusion particles.

4 Interface-modified reproducing kernel approximation guided by support vector machine

4.1 Interface-modified kernel functions

With the material interface segmented by the SVM, the weak discontinuities across the material interfaces are to be introduced by modifying the regular RK kernel function with a regularized Heaviside function $\widetilde{H}$ as follows:

$${\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)={\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)\widetilde{H}\left({\overline{\xi }}_{I}\left({\varvec{x}}\right)\right)$$

(42)

where ${\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$ is a modified kernel function, and $\widetilde{H}\left(\cdot \right)$ and ${\overline{\xi }}_{I}\left({\varvec{x}}\right)$ in Eq. (42) are defined as:

$$\widetilde{H}\left(\cdot \right)=\mathrm{max}\left(0,\mathrm{tanh}\left(\cdot \right)\right)$$

(43)

and

$${\overline{\xi }}_{I}\left({\varvec{x}}\right)=\left\{\begin{array}{cc}-\frac{S\left({\varvec{x}}\right)}{c},& S\left({{\varvec{x}}}_{I}\right)<0\\ +\frac{S\left({\varvec{x}}\right)}{c},& S\left({{\varvec{x}}}_{I}\right)>0\end{array}\right.$$

(44)

where $S\left({\varvec{x}}\right)$ is the score function, and $c$ denotes a scaling factor that has a length of the order of nodal spacing. Note that $S({\varvec{x}})$ is a signed distance of an evaluation point to its nearby interface, which is given from the output of the SVM-RK image segmentation and is readily available for evaluation of regularized Heaviside function $\widetilde{H}$. This normalized distance measure $\overline{\xi }({\varvec{x}})$ is applicable to general n-dimensional image data. The kernel functions associated with nodes away from the interfaces have been scaled to zero at the material interfaces by the regularized Heaviside function, and the kernel functions associated with the interface nodes are not scaled. As will be discussed in the next section, the “reproduced” RK shape functions via the reproducing conditions given in Eqs. (9)–(10) reveals a weak (C⁰) continuity of the approximated function at the material interface due to the Heaviside scaling in Eq. (42) regardless of the continuity of kernel function associated with nodes at the material interfaces. Hence, cubic B spline (B3) kernel functions with C² continuity or power kernel (PK) function with C⁰ continuity [69] can be considered for the kernel function associated with the interface nodes:

$${\phi }_{a}^{B3}\left(z\right)=\left\{\begin{array}{l@{\quad}l}\frac{2}{3}-4{\left|z\right|}^{2}+4{\left|z\right|}^{3}& for\,\, 0\le \left|z\right|\le \frac{1}{2}\\\frac{4}{3}-4\left|z\right|+4{\left|z\right|}^{2}-\frac{4}{3}{\left|z\right|}^{3}& for\,\,\frac{1}{2}\le \left|z\right|\le 1\\ 0 &otherwise\end{array}\right.z=\frac{{\varvec{x}}-{{\varvec{x}}}_{I}}{a}$$

(45)

$${\phi }_{a}^{PK}\left(z\right)=\left\{\begin{array}{l@{\quad}l}{\left(1-z\right)}^{\alpha } & for\,\, 0\le z\le 1\\ 0 & otherwise\end{array}\right.z=\frac{{\varvec{x}}-{{\varvec{x}}}_{I}}{a}$$

(46)

Figure 19 shows the un-modified kernel functions ${\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$, regularized Heaviside function $\widetilde{H}({\overline{\xi }}_{I}\left({\varvec{x}}\right))$, the interface-modified kernel functions ${\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$ and their derivatives ${\overline{\phi }}_{a,x}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$ in 1D with different choices of interface kernels. The blue, red, and black kernel functions are associated with the interface node, nodes within the support of interface nodes, and nodes away from the interface, respectively.

Remarks 4.1.

1. One can observe that after the interface modification, the influence domains of all nodes, except the interface node, terminate at the interface location, which naturally introduces a weak discontinuity to interface-modified kernel functions, even for the case of the smooth B-spline kernel at the interface.

2. The added computational cost to perform the proposed kernel modifications is marginal because only a scaling is applied to the original kernel functions to construct kernel modifications, and this can be done effortlessly for arbitrary spatial dimensions.

4.2 Interface-modified RK (IM-RK) approximation

Let us consider the RK discretization node set ${\mathbb{S}}^{RK}$ in the SVM-RK discretized numerical model. Recall that interface nodes in ${\mathbb{S}}^{RK}$ are contained in the set ${\mathbb{S}}^{IF}$. Let ${\mathbb{S}}^{RK}\backslash {\mathbb{S}}^{IF}$ denotes the set of all RK discrete points excluding those on the interfaces, then the RK shape function can be written as follows:

$$\left\{\begin{array}{l}\begin{array}{l}{\Psi }_{I}\left({\varvec{x}}\right)=C\left({\varvec{x}};{\varvec{x}}-{{\varvec{x}}}_{I}\right){\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)\\\qquad={({\varvec{H}}}^{\mathrm{T}}\left({\varvec{x}}-{{\varvec{x}}}_{\mathbf{I}}\right) \varvec{b}(\varvec{x})){\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right), \forall I\in {\mathbb{S}}^{IF}\\ \end{array}\\{\Psi }_{I}\left({\varvec{x}}\right)=C\left({\varvec{x}};{\varvec{x}}-{{\varvec{x}}}_{I}\right){\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)\\\qquad={({\varvec{H}}}^{\mathrm{T}}\left({\varvec{x}}-{{\varvec{x}}}_{\mathbf{I}}\right) \varvec{b}(\varvec{x})){\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right), \forall I\in {\mathbb{S}}^{RK}\backslash {\mathbb{S}}^{IF}\end{array}\right.$$

(47)

where ${\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$ is the regular kernel functions without interface modification and ${\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$ is the interface-modified kernel function defined in Eq. (42). The unknown coefficient vector ${\varvec{b}}({\varvec{x}})$ is obtained by imposing the ${n}^{th}$order reproducing conditions, as shown in Eq. (10). Substituting Eq. (47) into Eq. (10) yields:

$${\varvec{b}}\left({\varvec{x}}\right)={\overline{{\varvec{M}}}}^{-1}\left({\varvec{x}}\right){\varvec{H}}(\varvec{0})$$

(48)

where $\overline{{\varvec{M}}}\left({\varvec{x}}\right)$ is the modified moment matrix:

$$\overline{{\varvec{M}}}\left({\varvec{x}}\right)=\sum_{I\in {\mathbb{S}}^{IF}}{{\varvec{H}}}^{\mathrm{T}}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right){\varvec{H}}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right){\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)+ \sum_{I\in {\mathbb{S}}^{RK}\backslash {\mathbb{S}}^{IF}}{{\varvec{H}}}^{\mathrm{T}}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right){\varvec{H}}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right){\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right)$$

(49)

By substituting Eq. (48) into Eq. (47), the interface modified reproducing kernel shape function is obtained as:

$${\overline{\Psi } }_{I}\left({\varvec{x}}\right)=\left\{\begin{array}{l}{{\varvec{H}}}^{T}\left(\varvec{0}\right){\overline{{\varvec{M}}} }^{-1}(\varvec{x}) \varvec{H}({\varvec{x}}-{{\varvec{x}}}_{I}){\phi }_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right), \forall I\in {\mathbb{S}}^{IF}\\ {{{\varvec{H}}}^{T}\left(\varvec{0}\right){\overline{{\varvec{M}}} }^{-1}({\varvec{x}}){\varvec{H}}({\varvec{x}}-{{\varvec{x}}}_{I})\overline{\phi }}_{a}\left({\varvec{x}}-{{\varvec{x}}}_{I}\right), \forall I\in {\mathbb{S}}^{RK}\backslash {\mathbb{S}}^{IF}\end{array}\right.$$

(50)

Finally, the IM-RK approximation of the displacement field ${{\varvec{u}}}^{h}({\varvec{x}})$ is expressed as:

$${u}_{i}^{h}\left({\varvec{x}}\right)= \sum_{I=1}^{NP}{\overline{\Psi } }_{I}\left({\varvec{x}}\right){d}_{iI}$$

(51)

As shown in Eq. (51), no duplicated degrees of freedom associated with the interface nodes are added (such as interface enrichments) when using the IM-RK to approximate the displacement field. Figure 20 compares the 1D traditional RK and IM-RK shape functions and their derivatives in a domain [0, 10] with a material interface at x = 5. The red-colored node in Fig. 20 is the interface node, and the shape functions colored black, blue, and red are associated with nodes outside the support of the interface node, nodes within the influence of the interface node, and the interface node, respectively.

Remarks 4.2.

1. Owing to the regularization function $\widetilde{H}({\overline{\xi }}_{I}\left({\varvec{x}}\right))$ introduced in Eqs. (43)–(44), nodes on different sides of the interface lose communication, leading to weak discontinuities in the IM-RK shape functions. This is true even when a smooth cubic B-spline kernel is used for all nodes, including the interface nodes, as shown in Fig. 20. Same weak discontinuity properties exist in high dimensions, as shown in Fig. 21, regardless of the smoothness of interface kernel functions. Figure 22 illustrates the IM-RK interpolation of a function:

$$f\left({\varvec{x}}\right)=\left\{\begin{array}{l}{e}^{{c}_{1}{\Vert {\varvec{x}}\Vert }^{2}}, if \Vert {\varvec{x}}\Vert \le R\\ {e}^{{c}_{2}{\Vert {\varvec{x}}\Vert }^{2}}+{(e}^{{c}_{1}{R}^{2}}-{e}^{{c}_{2}{R}^{2}}), if \Vert {\varvec{x}}\Vert >R\end{array}\right.$$

where ${c}_{1}=0.5$, ${c}_{2}=0.1$, and the interface is a circular arc with $R=0.8$. IM-RK shape functions with smooth cubic B-spline kernels can effectively capture weak discontinuities along the interface in the interpolated function and represent its discontinuous x-directional derivative field, while both the interpolated function and its x-directional derivative fields are smooth across the interface when using the standard RK shape functions with B3 kernels.

2. Since all kernel functions vanish at the interface, except for the kernel functions defined on the interface nodes, the resulting IM-RK approximation functions possess weak Kronecker delta properties, as have been discussed in [46]. These weak Kronecker delta properties, however, do not exist in high dimensions because the supports of interface nodes overlap except for interface nodes located on domain boundaries.

In this work, the meshfree method using the above IM-RK approximation as the approximation function for the test and trial functions under the Galerkin framework is named the Interface-Modified Reproducing Kernel Particle Method (IM-RKPM). Two numerical examples are presented in Appendix B to validate the accuracy and convergence of proposed IM-RKPM.

4.3 IM-RK shape functions for the image-based numerical model

Figures 23 and 24 respectively show the IM-RK shape functions of non-interface nodes near the interfaces and the IM-RK shape functions of the interface nodes, constructed on the image-based SVM-RK discretized model shown in Fig. 10.

In Figs. 23 and 24, the non-zero IM-RK shape functions are color-coded by different color blocks, and the maximum shape function is shown at each plotting point in the top view. By observing the results in Fig. 23, the shape functions are truncated across arbitrarily shaped interfaces. The interface nodes’ shape functions, however, provide support coverage to the nodes located on both sides of the interface with C⁰ continuity along the interfaces’ normal direction for embedding weak discontinuities normal to the interface.

5 Image-based numerical results

5.1 Compression-shear test on 2D composite microstructure

In this numerical example, a compression-shear test is conducted on a composite constructed based on the image shown in Fig. 25. The image consists of 200 $\times $ 200 pixels with a pixel size of 8 μm. The physical dimensions of the specimen are 1.6 mm in width and height. The bottom edge of the specimen is fixed in both x- and y- directions, while the top edge is prescribed with a total displacement of $-0.01$ mm in both x- and y- directions. In addition, two vertical edges of the specimen are assigned as traction-free. The material constants for the alumina inclusion materials are: ${E}_{1}=320 \, \mathrm{ GPa}, {\nu }_{1}=0.23$, while the epoxy material is with material constants: ${E}_{1}=3.66\, \mathrm{ GPa}, {\nu }_{1}=0.358$.

The problem at hand is examined using the proposed IM-RKPM and compared with the results produced using ANSYS [72], a commercially available FEM software with a much refined body-fitted mesh. All numerical analyses are performed under the 2D plane strain condition. The proposed IM-RKPM uses Nitsche’s method [70] to apply Dirichlet boundary conditions. The numerical model of the test image is constructed following Sect. 3.2, as shown in Figs. 26, and the IM-RKPM approximation functions presented in Sect. 4 using cubic B-spline kernel with normalized support size of 2 and linear bases. The model employed for FEM analysis is manually traced from the inclusion geometries of the test image, resulting in a slight variation between the FEM and IM-RKPM discretization near interfaces. The FEM approximation involves a fine body-fitted mesh that comprises of 37,454 elements and 112,538 nodes, as illustrated in Fig. 27. On the other hand, the IM-RKPM approximation uses only 11,316 nodes to discretize the image domain, which is approximately one-tenth of the number of nodes used in the FEM model. Figure 28 demonstrates the IM-RKPM and FEM approximated displacement solutions, respectively, and it is observed that both IM-RKPM and FEM predict similar displacements.

Figure 29 shows the strains predicted by IM-RKPM and FEM, respectively. As shown in Fig. 29, the strains of IM-RKPM display sharp transitions across the material interfaces and concentrated strains around the corners of the material interfaces, comparable to the results obtained using the FEM approximation. Furthermore, both the IM-RKPM and FEM approximations of strain solutions show some coalescence of the strain concentration around some closely positioned inclusions, as indicated by the boxed areas in Fig. 29. The results show that the proposed IM-RKPM, accompanied by the SVM-based RK discretization with simple interface modified RK approximation functions, is capable of modeling composite materials with complicated microstructure and arbitrarily shaped inclusions with accuracy comparable to that obtained from a much refined and laborious FEM model.

5.2 Uniaxial tensile test on 3D composite microstructure

In this example, a three-dimensional image-based SVM-RK model is constructed, and a uniaxial tensile test is conducted on the specimen’s numerical model with the same material properties and essential boundary treatment by Nitsche’s method [70] as those used in Example 5.1. The input of imaged-based 3D numerical model generation is performed by stacking 30 slices of ROI of 30 by 30 pixels extracted from reconstructed micro-CT 2D images of a specimen’s internal microstructure along the z-direction into a volumetric data matrix, as illustrated in Fig. 30. The size of the test volume is $0.24\;{\text{mm}} \times 0.24\;{\text{mm}} \times 0.24\;{\text{mm}}$, corresponding to an input image voxel size of $8 \;\upmu {\text{m}}$. The uniaxial tension is applied to the two surfaces with surface normal in the z-direction under prescribed displacements in the z-direction while without constraints in the x–y displacements.

Although the training data points are now in ${\mathbb{R}}^{3}$, the discrete model generation procedures remain the same as those for 2D, which corresponds to the physical coordinates of voxel centroids, as detailed in Sect. 3.2. The training response labels are obtained by stacking the segmented ROIs using Otsu’s method into a binary volumetric data label matrix. It is important to note that both matrices that contain the training data points and training response labels are concatenated before being fed into the SVM, which means that the combined training data set can be represented as $\mathbf{D}={\left\{{{\varvec{x}}}_{i},{y}_{i}\right\}}_{i=1}^{l}$, where ${{\varvec{x}}}_{i}\in {\mathbb{R}}^{3},{y}_{i}\in \{-\mathrm{1,1}\}$, and $l=\mathrm{27,000}$ for the present case. For the SVM training, the same hyperparameters specified in Sect. 3.2.1 are utilized. Figure 31 demonstrates the SVM material classification results, the support vectors resulting from training, and the RK interpolated decision boundaries. It is worth mentioning that the resulting RK interpolated separating hyperplane, which is the material interface determined by the SVM training and RK interpolation, exhibits a smoother appearance in contrast to the inclusion geometries represented by binary label data volumetric matrix. This outcome is not surprising, given that SVM considers all three dimensions of the training data points to identify an appropriate separating hyperplane, which is more realistic and resembles a specimen image stack with higher resolution (i.e., with a smaller voxel size).

Figure 32 illustrates the results of identified interface nodes and the 3D RK discrete model of the test volume, where the black (small) nodes, blue nodes, and red nodes represent the epoxy material points, alumina material points, and points on the material interfaces, respectively. The 3D SVM-RK discretization model contains in total 17,648 discretized nodes, among which 2330 nodes are on the material interfaces.

The produced 3D SVM-RK discrete model is utilized for a uniaxial tensile test in the z-direction. The model’s bottom surface is fixed in all three directions, while a z-directional displacement of 0.01 mm is prescribed at the top surface of the model. The other surfaces of the specimen are assigned traction-free boundary conditions. The proposed IM-RKPM is employed for the numerical solution, and Fig. 33 depicts the displacement solution in all three directions. The predicted normal strains are plotted in Fig. 34, to which a transparency filter is applied such that strain with a large magnitude is not visible. As shown in Fig. 34, the regions with the relatively small magnitude of strains are consistent with the shapes of the alumina inclusions, which is expected because the alumina inclusions are significantly stiffer than the surrounding epoxy matrix. In Figs. 35, 36, and 37, the normal strains are plotted on multiple slices and are compared with the slices of the inclusion contours. The results show that distinctive strain transitions in all three dimensions can be observed across interfaces. In addition, some strain concentrations are observed between two nearby inclusions and around the corner of the inclusions. Overall, this example demonstrates the capability of the proposed SVM-RK image-based model and IM-RKPM in modeling composite material with arbitrarily shaped inclusions in three dimensions.

6 Conclusion

A Support Vector Machine (SVM) guided model discretization and reproducing kernel approximation, utilizing micro-CT images of heterogeneous materials as input, is introduced in this work. The trained SVM-RK model generates classification scores for RKPM model discretization from the image, enabling their use as inputs for (1) interface node generation and (2) the interface kernel modification to construct a modified RK approximation of weak discontinuities. The SVM classification scores, representing the signed distances, enables identification of material phase, interface discretization, and interface surface normals, allowing automatic construction of RK approximation with weak discontinuities and interface-conforming gradient smoothing cells for SCNI based domain integration. The proposed image-based SVM-RK model generation process was validated through a synthetic image and a high-resolution surface image obtained from the SEM.

The resulting Interface-Modified Reproducing Kernel Particle Method (IM-RKPM) effectively remedy Gibb's oscillations commonly seen in the conventional RKPM for modeling problems with weak discontinuities. The proposed method incorporates a regularized Heaviside function defined on the SVM classification score to achieve RK approximation with interface weak discontinuity while avoiding Gibb-type oscillations. These procedures involved in the proposed SVM guided RK approximation with interface weak discontinuities are fully automatic in 3-dimensions and without the need of using duplicated degrees of freedom on the interface nodes common in other interface-enriched meshfree methods [33, 36]. In addition, this IM-RKPM with interface weak discontinuities can be constructed by kernel functions with arbitrary smoothness/roughness while achieving optimal convergence as demonstrated in the numerical examples using both Gauss integration and SCNI.

Finally, the effectiveness of the proposed automated SVM-RK model generation process in conjunction with the IM-RKPM method is demonstrated through numerical examples based on test micro-CT images in both 2- and 3-dimensions. Notably, the 3D example shows that the proposed approach is applicable for 3D simulations where the SVM-RK model precisely represents the geometry of the inclusion particles, trained based on stacked image slices.

It is worth mentioning that while the present work utilizes the standard binary SVM library for two-phase materials, it is possible to explore multi-class SVM algorithms, or more advanced machine learning algorithms, such as the convolutional neural network, for multi-phase materials segmentation.

References

Schilling PJ, Karedla BPR, Tatiparthi AK, Verges MA, Herrington PD (2005) X-ray computed microtomography of internal damage in fiber reinforced polymer matrix composites. Compos Sci Technol 65(14):2071–2078. https://doi.org/10.1016/J.COMPSCITECH.2005.05.014
Article Google Scholar
Croom B, Wang W-M, Li J, Li X (2016) Unveiling 3D deformations in polymer composites by coupled micro X-ray computed tomography and volumetric digital image correlation. Exp Mech 56:999–1016. https://doi.org/10.1007/s11340-016-0140-7
Article Google Scholar
Tang Y, Su K, Man R, Hillman MC, Du J (2021) Investigation of internal cracks in epoxy-alumina using in situ mechanical testing coupled with micro-CT. JOM 73:2452. https://doi.org/10.1007/s11837-021-04714-x
Article Google Scholar
Liu G, Tang Y, Hattar K, Wang Y, Winders W, Haque A, Du J (2023) An investigation of fracture behaviors of NBG-18 nuclear graphite using in situ mechanical testing coupled with micro-CT. J Mater Res. https://doi.org/10.1557/s43578-023-00929-7
Article Google Scholar
Norouzi A, Rahim MSM, Altameem A, Saba T, Rad AE, Rehman A, Uddin M (2014) Medical image segmentation methods, algorithms, and applications. IETE Tech Rev 31(3):199–213. https://doi.org/10.1080/02564602.2014.906861
Article Google Scholar
Sahoo PK, Soltani S, Wong AKC (1988) A survey of thresholding techniques. Comput Vis Gr Image Process 41(2):233–260. https://doi.org/10.1016/0734-189X(88)90022-9
Article Google Scholar
Haralick RM, Shapiro LG (1985) Image segmentation techniques. Comput Vis Gr Image Process 29(1):100–132. https://doi.org/10.1016/S0734-189X(85)90153-7
Article Google Scholar
Saxena A, Prasad M, Gupta A, Bharill N, Patel OP, Tiwari A, Er MJ, Ding W, Lin CT (2017) A review of clustering techniques and developments. Neurocomputing 267:664–681. https://doi.org/10.1016/J.NEUCOM.2017.06.053
Article Google Scholar
Guo G, Wang H, Bell D, Bi Y, Greer K (2003) KNN model-based approach in classification. Lect Notes Comput Sci (including Subser Lect Notes Artif Intell Lect Notes Bioinf) 2888:986–996. https://doi.org/10.1007/978-3-540-39964-3_62
Article Google Scholar
Bishop CM, Nasrabadi NM (2006) Pattern recognition and machine learning, Vol. 4. Springer, New York, p 738
Minaee S, Boykov Y, Porikli F, Plaza A, Kehtarnavaz N, Terzopoulos D (2022) Image segmentation using deep learning: a survey. IEEE Trans Pattern Anal Mach Intell 44:1. https://doi.org/10.1109/TPAMI.2021.3059968
Article Google Scholar
Shen D, Wu G, Suk H-I (2017) Deep Learning in Medical Image Analysis. Annu Rev Biomed Eng 1:221–248. https://doi.org/10.1146/annurev-bioeng-071516-044442
Article Google Scholar
Osher S, Sethian JA (1988) Fronts propagating with curvature-dependent speed: algorithms based on Hamilton–Jacobi formulations. J Comput Phys 79:12–49. https://doi.org/10.1016/0021-9991(88)90002-2
Article MathSciNet Google Scholar
Chen JS, Basava RR, Zhang Y, Csapo R, Malis V, Sinha U, Hodgson J, Sinha S (2016) Pixel-based meshfree modelling of skeletal muscles. Comput Methods Biomech Biomed Eng Imaging Vis 4(2):73–85. https://doi.org/10.1080/21681163.2015.1049712
Article Google Scholar
Srinivasa Reddy B, Chatterji BN (1996) An FFT-based technique for translation, rotation, and scale-invariant image registration. IEEE Trans Image Process 5(8):1266–1271. https://doi.org/10.1109/83.506761
Article Google Scholar
Wettimuny R, Penumadu D (2004) Application of fourier analysis to digital imaging for particle shape analysis. J Comput Civ Eng 18(1):2–9. https://doi.org/10.1061/(ASCE)0887-3801(2004)18:1(2)
Article Google Scholar
Cortes C, Vapnik V, Saitta L (1995) Support-vector networks. Mach Leaming 20:273–297
Article Google Scholar
Vapnik V (1995) The nature of statistical learning theory, 2nd ed. Springer, New York
Book Google Scholar
Cervantes J, Garcia-Lamont F, Rodríguez-Mazahua L, Lopez A (2020) A comprehensive survey on support vector machine classification: applications, challenges and trends. Neurocomputing 408:189–215. https://doi.org/10.1016/J.NEUCOM.2019.10.118
Article Google Scholar
Osuna E, Freund R, Girosit F (1997) Training support vector machines: an application to face detection. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition, pp 130–136. https://doi.org/10.1109/CVPR.1997.609310.
Lee Y-J, Mangasarian OL (2001) RSVM: reduced support vector machines. In: Proceedings of the 2001 SIAM international conference on data mining (SDM), pp 1–17. https://doi.org/10.1137/1.9781611972719.13.
Suykens JAK, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9:293–300. https://doi.org/10.1023/A:1018628609742
Article Google Scholar
Mangasarian OL, Musicant DR (2001) Lagrangian support vector machines. J Mach Learn Res 1:161–177
MathSciNet Google Scholar
Hsu C-W, Lin C-J (2002) A comparison of methods for multiclass support vector machines. IEEE Trans Neural Networks 13(2):415. https://doi.org/10.1109/72.991427
Article Google Scholar
Babuska I (1970) The finite element method for elliptic equations with discontinuous coefficients. Computing 5:207–218. https://doi.org/10.1007/BF02248021
Article MathSciNet Google Scholar
Parvizian J, Düster A, Rank E (2007) Finite cell method : h- and p-extension for embedded domain problems in solid mechanics. Comput Mech 41(1):121–133. https://doi.org/10.1007/s00466-007-0173-y
Article MathSciNet Google Scholar
Korshunova N, Jomo J, Lékó G, Reznik D, Balázs P, Kollmannsberger S (2020) Image-based material characterization of complex microarchitectured additively manufactured structures. Comput Math with Appl 80(11):2462–2480. https://doi.org/10.1016/J.CAMWA.2020.07.018
Article MathSciNet Google Scholar
Korshunova N, Alaimo G, Hosseini SB, Carraturo M, Reali A, Niiranen J, Auricchio F, Rank E, Kollmannsberger S (2021) Image-based numerical characterization and experimental validation of tensile behavior of octet-truss lattice structures. Addit Manuf 41:101949. https://doi.org/10.1016/j.addma.2021.101949
Article Google Scholar
Belytschko T, Lu YY, Gu L (1994) Element-free Galerkin methods. Int J Numer Methods Eng 37(2):229–256. https://doi.org/10.1002/nme.1620370205
Article MathSciNet Google Scholar
Liu WK, Jun S, Zhang YF (1995) Reproducing kernel particle methods. Int J Numer Methods Fluids 20(8–9):1081–1106. https://doi.org/10.1002/FLD.1650200824
Article MathSciNet Google Scholar
Liu WK, Jun S, Li S, Adee J, Belytschko T (1995) Reproducing kernel particle methods for structural dynamics. Int J Numer Methods Eng 38(10):1655–1679. https://doi.org/10.1002/NME.1620381005
Article MathSciNet Google Scholar
Chen JS, Pan C, Wu CT, Liu WK (1996) Reproducing kernel particle methods for large deformation analysis of non-linear structures. Comput Methods Appl Mech Eng 139(1–4):195–227. https://doi.org/10.1016/S0045-7825(96)01083-3
Article MathSciNet Google Scholar
Krongauz Y, Belytschko T (1998) EFG approximation with discontinuous derivatives. Int J Numer Methods Eng 41(7):1215–1233. https://doi.org/10.1002/(sici)1097-0207(19980415)41:7%3c1215::aid-nme330%3e3.0.co;2-%23
Article MathSciNet Google Scholar
Fries TP, Belytschko T (2010) The extended/generalized finite element method: an overview of the method and its applications. Int J Numer Methods Eng 84(3):253–304. https://doi.org/10.1002/NME.2914
Article MathSciNet Google Scholar
Jirásek M (2000) Comparative study on finite elements with embedded discontinuities. Comput Methods Appl Mech Eng 188(1):307–330. https://doi.org/10.1016/S0045-7825(99)00154-1
Article Google Scholar
Chen JS, Kotta V, Lu H, Wang D, Moldovan D, Wolf D (2004) A variational formulation and a double-grid method for meso-scale modeling of stressed grain growth in polycrystalline materials. Comput Methods Appl Mech Eng 193(12–14):1277–1303. https://doi.org/10.1016/j.cma.2003.12.020
Article MathSciNet Google Scholar
Masuda S, Noguchi H (2006) Analysis of structure with material interface by Meshfree method. Comput Model Eng Sci 11(3):131–144. https://doi.org/10.3970/CMES.2006.011.131
Article Google Scholar
Cordes LW, Moran B (1996) Treatment of material discontinuity in the Element-Free Galerkin method. Comput Methods Appl Mech Eng 139(1–4):75–89. https://doi.org/10.1016/S0045-7825(96)01080-8
Article Google Scholar
Cockburn B, Karniadakis GE, Shu CW (2000) The development of discontinuous Galerkin methods. In: Cockburn B, Karniadakis GE, Shu CW (eds) Discontinuous Galerkin methods. Lecture notes in computational science and engineering, vol 11. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-59721-3_1
Zienkiewicz OC, Taylor RL, Sherwin SJ, Peiró J (2003) On discontinuous Galerkin methods. Int J Numer Methods Eng 58(8):1119–1148. https://doi.org/10.1002/NME.884
Article MathSciNet Google Scholar
Wang D, Sun Y, Li L (2009) A discontinuous Galerkin Meshfree modeling of material interface. Comput Model Eng Sci 45(1):57–82
MathSciNet Google Scholar
Huang T-H, Chen J-S, Tupek MR, Beckwith FN, Koester JJ, Eliot Fang H (2021) A variational multiscale immersed meshfree method for heterogeneous materials. Comput Mech 67:1059–1097. https://doi.org/10.1007/s00466-020-01968-1
Article MathSciNet Google Scholar
Wang J, Zhou G, Hillman M, Madra A, Bazilevs Y, Du J, Su K (2021) Consistent immersed volumetric Nitsche methods for composite analysis. Comput Methods Appl Mech Eng 385:1142. https://doi.org/10.1016/j.cma.2021.114042
Article MathSciNet Google Scholar
Chen J-S, Hillman M, Chi W (2017) Meshfree methods: progress made after 20 years. Am Soc Civ Eng 143(4):1. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001176
Article Google Scholar
Chen JS, Pan C, Roque CMOL, Wang HP (1998) A Lagrangian reproducing kernel particle method for metal forming analysis. Comput Mech 22(3):289–307. https://doi.org/10.1007/S004660050361
Article Google Scholar
Chen J-S, Han W, You Y, Meng X (2003) A reproducing kernel method with nodal interpolation property. Int J Numer Methods Eng 56:935–960. https://doi.org/10.1002/nme.592
Article MathSciNet Google Scholar
Chen J, Liu W, Hillman M, Chi S, Lian Y, Bessa M (2017) Reproducing kernel approximation and discretization. In: Encyclopedia of computational mechanics, 2nd ed. Wiley
Joulaian M, Hubrich S, Düster A (2016) Numerical integration of discontinuities on arbitrary domains based on moment fitting. Comput Mech 57(6):979–999. https://doi.org/10.1007/s00466-016-1273-3
Article MathSciNet Google Scholar
Hubrich S, Stolfo PD, Kudela L, Kollmannsberger S, Rank E, Schröder A, Düster A (2017) Numerical integration of discontinuous functions: moment fitting and smart octree. Comput Mech 60(5):863–881. https://doi.org/10.1007/s00466-017-1441-0
Article MathSciNet Google Scholar
Kudela L, Zander N, Kollmannsberger S, Rank E (2016) Smart octrees: accurately integrating discontinuous functions in 3D. Comput Methods Appl Mech Eng 306:406–426. https://doi.org/10.1016/J.CMA.2016.04.006
Article MathSciNet Google Scholar
Chen JS, Wu CT, Yoon S, You Y (2001) Stabilized conforming nodal integration for Galerkin mesh-free methods. Int J Numer Methods Eng 50(2):435–466. https://doi.org/10.1002/1097-0207(20010120)50:2%3c435::AID-NME32%3e3.0.CO;2-A
Article Google Scholar
Yoo JW, Moran B, Chen JS (2004) Stabilized conforming nodal integration in the natural-element method. Int J Numer Methods Eng 60(5):861–890. https://doi.org/10.1002/NME.972
Article Google Scholar
Chen JS, Yoon S, Wu CT (2002) Non-linear version of stabilized conforming nodal integration for Galerkin mesh-free methods. Int J Numer Methods Eng 53(12):2587–2615. https://doi.org/10.1002/NME.338
Article Google Scholar
Chen JS, Hu W, Puso M, Wu Y, Zhang X (2007) Strain Smoothing for Stabilization and Regularization of Galerkin Meshfree Methods. Lecture Notes in Computational Science and Engineering. Springer, Berlin, pp 57–75. https://doi.org/10.1007/978-3-540-46222-4_4
Book Google Scholar
Guan PC, Chi SW, Chen JS, Slawson TR, Roth MJ (2011) Semi-Lagrangian reproducing kernel particle method for fragment-impact problems. Int J Impact Eng 38(12):1033–1047. https://doi.org/10.1016/j.ijimpeng.2011.08.001
Article Google Scholar
Hillman M, Chen J-S (2016) An accelerated, convergent, and stable nodal integration in Galerkin meshfree methods for linear and nonlinear mechanics. Int J Numer Methods Eng 107:603–630. https://doi.org/10.1002/nme.5183
Article MathSciNet Google Scholar
Chen J-S, Hillman M, Rüter M (2013) An arbitrary order variationally consistent integration for Galerkin meshfree methods. Int J Numer Methods Eng 95:387–418. https://doi.org/10.1002/nme.4512
Article MathSciNet Google Scholar
Beissel S, Belytschko T (1996) Nodal integration of the element-free Galerkin method. Comput Methods Appl Mech Eng 139(1–4):49–74. https://doi.org/10.1016/S0045-7825(96)01079-1
Article MathSciNet Google Scholar
Liu GR, Zhang GY, Wang YY, Zhong ZH, Li GY, Han X (2007) A nodal integration technique for meshfree radial point interpolation method (NI-RPIM). Int J Solids Struct 44(11–12):3840–3860. https://doi.org/10.1016/J.IJSOLSTR.2006.10.025
Article Google Scholar
Puso MA, Chen JS, Zywicz E, Elmer W (2008) Meshfree and finite element nodal integration methods. Int J Numer Methods Eng 74(3):416–446. https://doi.org/10.1002/NME.2181
Article MathSciNet Google Scholar
Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the fifth annual workshop on computational learning theory, pp 144–152. https://doi.org/10.1145/130385.130401
Kuhn HW, Tucker AW (2013) Nonlinear Programming. Traces Emerg Nonlinear Program, pp 247–258. https://doi.org/10.1007/978-3-0348-0439-4_11.
Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 2:121–167. https://doi.org/10.1023/A:1009715923555
Article Google Scholar
Noble WS (2006) What is a support vector machine? Nat Biotechnol 24:1565–1567. https://doi.org/10.1038/nbt1206-1565
Article Google Scholar
Vásárhelyi L, Kónya Z, Kukovecz RV (2020) Microcomputed tomography–based characterization of advanced materials: a review. Mater Today Adv 8:1. https://doi.org/10.1016/J.MTADV.2020.100084
Article Google Scholar
Otsu N (1979) A threshhold selection method from gray level histograms. IEEE Trans Syst Man Cybern C(1):62–66
Vernon-Parry KD (2000) Scanning electron microscopy: an introduction. III-Vs Rev 13(4):40–44. https://doi.org/10.1016/S0961-1290(00)80006-X
Article Google Scholar
Shindo D, Oikawa T (2002) Energy dispersive X-ray spectroscopy. In: Analytical electron microscopy for materials science. Springer, Tokyo. https://doi.org/10.1007/978-4-431-66988-3_4
Roth MJ, Chen J-S, Danielson KT, Slawson TR (2016) Hydrodynamic meshfree method for high-rate solid dynamics using a Rankine-Hugoniot enhancement in a Riemann-SCNI framework. Int J Numer Methods Eng 108(12):1525–1549. https://doi.org/10.1002/nme.5266
Article MathSciNet Google Scholar
Fernández-Méndez S, Huerta A (2004) Imposing essential boundary conditions in mesh-free methods. Comput Methods Appl Mech Eng 193(12–14):1257–1275. https://doi.org/10.1016/J.CMA.2003.12.019
Article MathSciNet Google Scholar
Mura T (2013) Micromechanics of Defects in Solids. Springer, Berlin
Google Scholar
Ansys® Workbench, Release 2023 R1, Help System, Workbench User's Guide, ANSYS, Inc.

Download references

Acknowledgements

This research is supported by the National Science Foundation (#1826221). The authors are grateful to the program manager, Dr. Siddiq Qidwai, for his encouragement and support. Appreciation is extended to Dr. Timothy Stecko of Penn State Center for Quantitative Imaging for technical assistance with micro-CT scanning and Ms. Julie Anderson for technical assistance with SEM imaging.

Author information

Authors and Affiliations

Department of Structural Engineering, University of California San Diego, La Jolla, CA, 92093‑0085, USA
Yanran Wang, Jonghyuk Baek & Jiun-Shyan Chen
Department of Mechanical Engineering, Pennsylvania State University, University Park, PA, 16802, USA
Yichun Tang & Jing Du
Karagozian and Case, Inc, Glendale, CA, 91203, USA
Mike Hillman

Authors

Yanran Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jonghyuk Baek
View author publications
You can also search for this author in PubMed Google Scholar
Yichun Tang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Du
View author publications
You can also search for this author in PubMed Google Scholar
Mike Hillman
View author publications
You can also search for this author in PubMed Google Scholar
Jiun-Shyan Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jiun-Shyan Chen.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Construction of interface-conforming gradient smoothing cells

Since the interface locations are determined by the RK interpolated score function in Eq. (40), the outward unit normal of the interfaces at an interface point ${{\varvec{x}}}_{{\varvec{K}}}^{*}$ can be calculated as follows:

$${\varvec{n}}\left({{\varvec{x}}}_{{\varvec{K}}}^{*}\right)={\left.\frac{\nabla \widetilde{S}\left({\varvec{x}}\right)}{\Vert \nabla \widetilde{S}\left({\varvec{x}}\right)\Vert }\right|}_{{\varvec{x}}={{\varvec{x}}}_{K}^{*}}$$

(52)

To construct interface-conforming gradient smoothing cells for SCNI domain integration, a mirroring technique is utilized. For all interface nodes ${{\varvec{x}}}_{K}^{*}\in {\mathbb{S}}^{IF}$, the mirrored node pair $\left\{{{\varvec{x}}}_{K}^{*+},{{\varvec{x}}}_{K}^{*-}\right\}$ is obtained as follows:

$$ {\varvec{x}}_{K}^{* \pm } = {\varvec{x}}_{K}^{*} \pm \epsilon \user2{\mathop{n}\limits^{\rightharpoonup} }\left( {{\varvec{x}}_{K}^{*} } \right) $$

(53)

where $\epsilon $ is a small perturbation number, and the interface normal $\user2{\mathop{n}\limits^{\rightharpoonup} }\left( {{\varvec{x}}_{K}^{*} } \right)$ is defined in Eq. (52). In this work, $\epsilon ={10}^{-3}\ell$ is chosen, where $\ell$ is the image voxel size. As an illustration example, mirrored nodes are shown in the black box in Fig.

38a, and the resulting gradient smoothing cells are shown in Fig. 38b. Note that these mirrored node pairs are only used to generate the “interface conforming” Voronoi cells; they are not the RK nodes, and they don’t carry degrees of freedom.

This approach allows for the use of conventional techniques, such as Voronoi tessellation, which will result in the two smoothing cells adjacent to either side of the interface having a common boundary along the interface location as shown in Fig. 38b. In addition, the material class for the smoothing cells can be assigned according to the centered mirrored nodes’ material classes. As a result, smoothing cells away from the material interfaces are uniformly arranged, and the material interfaces are well represented by the adjacent two layers of smoothing cells.

Appendix B: Verification of IM-RKPM

2.1 Numerical example 1: one-dimensional composite rod problem

A one-dimensional composite rod with a centered material interface is fixed at the left and is subjected to a displacement of 1 on the right end, as demonstrated in Fig.

39. The rod is also subjected to a polynomial body force up to the third order $b\left(x\right)={a}_{0}+{a}_{1}x+{a}_{2}{x}^{2}+{a}_{3}{x}^{3}$. The Young’s modulus of the two sections of the rod are set as ${E}_{1}=10000,\mathrm{ for }\,x\in [\mathrm{0,5}]$ and ${E}_{2}=1000,\mathrm{ for }\,x\in ]\mathrm{5,10}]$. The exact solution to this problem is provided in [33].

The example is analyzed with two body force cases: $\left(1\right) b=0$; $\left(2\right) b\left(x\right)=25x-7.5{x}^{2}+0.5{x}^{3}$. The 1D problem domain is discretized with 11 uniformly distributed nodes, and the problem is approximated using a linear basis in both standard RKPM and IM-RKPM with a constant normalized nodal support size of 2. SCNI and 5-point Gauss integration are selected as numerical integration methods for case $(1)$ and case $(2)$, respectively. Figures

40 and

41 demonstrate the approximated displacement and strain solutions using RKPM and IM-RKPM for case $(1)$, respectively. The results show that RKPM strain solution exhibits Gibb’s-like oscillations and fails to reproduce the exact weak discontinuity at the material interface. On the other hand, IM-RKPM with SCNI can precisely capture the displacement and strain field to the machine’s precision. Similar behaviors are observed for case (2), as illustrated in Figs.

42 and

43, IM-RKPM significantly reduces the oscillations of the strain solution and can accurately capture the weak discontinuity across the material interface.

In addition, the convergence behaviors of IM-RKPM with the cubic B spline and the power interface kernels and standard RKPM are investigated in terms of the normalized displacement and energy error norms as follows with high-order Gauss quadrature rule:

$${\Vert {\varvec{u}}-{{\varvec{u}}}^{h}\Vert }_{0}=\sqrt{\frac{{\int }_{\Omega }{\left({{\varvec{u}}}^{\mathrm{exact}}\left({\varvec{x}}\right)-{{\varvec{u}}}^{h}\left({\varvec{x}}\right)\right)}^{2}d\Omega }{{\int }_{\Omega }{({{\varvec{u}}}^{\mathrm{exact}}\left({\varvec{x}}\right))}^{2}d\Omega }}$$

(54)

$${\Vert {\varvec{u}}-{{\varvec{u}}}^{h}\Vert }_{E}=\sqrt{\frac{{\int }_{\Omega }\left({{\varvec{\varepsilon}}}^{\mathrm{exact}}\left({\varvec{x}}\right)-{{\varvec{\varepsilon}}}^{h}\left({\varvec{x}}\right)\right) \cdot \left({{\varvec{\sigma}}}^{\mathrm{exact}}\left({\varvec{x}}\right)-{{\varvec{\sigma}}}^{h}\left({\varvec{x}}\right)\right)d\Omega }{{\int }_{\Omega }{{\varvec{\varepsilon}}}^{\mathrm{exact}}\left({\varvec{x}}\right)\cdot {{\varvec{\sigma}}}^{\mathrm{exact}}\left({\varvec{x}}\right)d\Omega }}$$

(55)

As illustrated in Fig. 44, standard RKPM exhibits a suboptimal convergence rate of $1$ for the displacement norms and 0.5 for the energy norm, while the accuracy and convergence rates are substantially improved in IM-RKPM, restoring the optimal convergence rates of $2$ and $1$, independent to the continuity of the interface kernel functions.

2.2 Numerical example 2: 2D circular inclusion in an infinite plate

An infinite plate with a circular inclusion subjected to a constant dilatational eigenstrain ${\varepsilon }^{*}=0.01$, as shown in Fig.

45, is analyzed.

The material constants selected for the inclusion material are: ${\lambda }_{1}=497.16, {\mu }_{1}=390.63$, and matrix material are: ${\lambda }_{2}=656.79, {\mu }_{2}=338.35$, where $\lambda $ and $\mu $ are Lamé parameters. Due to the symmetry of the domain and loading conditions, only the upper right quadrant of the domain is modeled. The length of each side of the finite quarter domain is $5$, the radius of the circular inclusion is $R=1$, and an analytical displacement field is prescribed on the boundaries. The analytical solutions in cylindrical coordinates can be found in [71].

The example is modeled as a plane strain axisymmetric problem. Both $5 \times 5$ Gauss integration and SCNI method are employed as the numerical integration schemes, and RK approximation with linear basis and a normalized support size of $2$ are utilized throughout the numerical analysis. Figure

46 demonstrates an example of domain discretization and background integration cell arrangement for the Gauss integration. The approximated radial displacement, radial strain, and hoop strain solutions using RKPM, IM-RKPM with cubic B spline interface kernels, and IM-RKPM with fourth-order power interface kernels, accompanied with $224$ non-uniform nodes and $5\times 5$ Gauss integration, are plotted along the line $y=x$ in Fig.

47. Like the 1D composite rod example, the RKPM solution of the radial strain and hoop strain are both oscillatory near the interface. IM-RKPM, on the other hand, effectively alleviates the oscillations in the strain solutions. In addition, a convergence study is performed for both RKPM and IM-RKPM with different interface kernels, and the results are shown in Fig.

48. The IM-RKPM recovers the optimal convergence rates with the Gauss domain integration for both smooth and C⁰ interface kernels.

Next, the same problem is solved using the computationally efficient SCNI. Figure

49 demonstrates an arrangement of 211 non-uniformly distributed nodes and conforming strain smoothing cells for SCNI. The numerical solutions obtained by RKPM and IM-RKPM with different interface kernels are plotted in Fig. 50. Similar to the Gauss integration, the standard RKPM with SCNI again experiences strain oscillations near the interface. However, solutions obtained by the IM-RKPM with different interface kernel functions are consistent with the ones obtained with Gauss integration in Fig. 47. By observing results in the convergence plots shown in Fig.

51, IM-RKPM with SCNI has displacement and strain solutions to converge optimally. These results show that the proposed IM-RKPM performs well with different selections of numerical domain integration techniques.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Y., Baek, J., Tang, Y. et al. Support vector machine guided reproducing kernel particle method for image-based modeling of microstructures. Comput Mech 73, 907–942 (2024). https://doi.org/10.1007/s00466-023-02394-9

Download citation

Received: 18 May 2023
Accepted: 07 September 2023
Published: 18 October 2023
Issue Date: April 2024
DOI: https://doi.org/10.1007/s00466-023-02394-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Support vector machine guided reproducing kernel particle method for image-based modeling of microstructures

Abstract

Similar content being viewed by others

Eighty Years of the Finite Element Method: Birth, Evolution, and Future

Development of an ABAQUS plugin tool for periodic RVE homogenisation

A physics-informed neural network technique based on a modified loss function for computational 2D and 3D solid mechanics

1 Introduction

2 Basic equations

2.1 Model problem

2.2 Reproducing kernel approximation

2.3 Numerical domain integration

2.3.1 Stabilized conforming nodal integration

3 Support vector machine (SVM) classification of micro-CT images and model discretization

3.1 Support vector machine (SVM) classification algorithm

3.2 From micro-CT images to numerical models

3.2.1 Training data preparation and training the SVM

3.2.2 RK interface nodes

Remark 3.1

Remark 3.2

Remark 3.3.

Remark 3.4.

3.3 Image-based SVM-RK model validation

3.3.1 Validation with a synthetic image

3.3.2 Validation of the image-based SVM-RK discrete model with Scanning Electron Microscopy (SEM) images

Remark 3.5

4 Interface-modified reproducing kernel approximation guided by support vector machine

4.1 Interface-modified kernel functions

Remarks 4.1.

4.2 Interface-modified RK (IM-RK) approximation

Remarks 4.2.

4.3 IM-RK shape functions for the image-based numerical model

5 Image-based numerical results

5.1 Compression-shear test on 2D composite microstructure

5.2 Uniaxial tensile test on 3D composite microstructure

6 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendices

Appendix A: Construction of interface-conforming gradient smoothing cells

Appendix B: Verification of IM-RKPM

2.1 Numerical example 1: one-dimensional composite rod problem

2.2 Numerical example 2: 2D circular inclusion in an infinite plate

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation