1 Introduction and motivation

Semiconductor technology is undergoing a continuously increasing advancement in the aggressive scaling of device length [1]. In this scenario, three-dimensional (3D) device modeling and numerical simulation techniques play a critical role in the prediction of the electrical performance of the system under investigation. In the case of novel memory devices, due to the different undergoing physical phenomena, a self-consistent multi-physics approach is preferred with respect to the simulation of independent phenomena such as chemical reactions, electrical conduction and material properties modification. To respond to this need the software FEMOS (Finite Element Modeling Oriented Simulator) has been designed: FEMOS is a general-purpose modular numerical code based on the use of the Galerkin Finite Element Method (GFEM) implemented in a fully 3D framework through shared libraries using an objected-oriented programming language (C++). The first results of this new approach have been recently presented in [2, 3] and [4]. In the present work we extend the FEMOS computational platform in the study of the Drift Diffusion model (DD) [5, 6] and focus on the issue of endowing the simulation tool of a consistent, stable and accurate procedure for the approximation of electron and hole current densities in the device. This is of utmost importance in: (i) visualization and post-processing; (ii) evaluation of conduction currents at device terminals; and (iii) inclusion in the DD model of generation phenomena due to Impact Ionization (II). Here, our attention is devoted to (iii), because of the critical role of II in the convergence and numerical stability of the iterative algorithm used to solve the DD system (see [7], Chapter 3 and [8]), although the methods we propose for the treatment of (iii) can also be profitably employed for (i) and (ii).

To allow a consistent treatment of the generation term due to II within the FE procedure, we propose two novel discrete models for electron and hole current densities over the computational grid. The two methods provide a constant approximation of the current density inside each mesh element and for this reason they can be easily implemented in any simulation environment not necessarily employing the GFEM but utilizing, instead, the Box Integration Method (BIM) that is widely employed in contemporary device simulation tools. We refer to [9, 10] for an introduction to the mathematical structure of the BIM; we refer also to [5], Chapter 6, for a detailed discussion of the implementation of the BIM in the context of a finite box geometrical discretization, and to [11] for a general overview of the BIM in the solution of a general transport model including electro-thermal effects.

The first scheme (method A) is based on the use of a primal-mixed formulation of the DD model written as a function of the quasi-Fermi potential gradient (see [12] and [13]). The second scheme (method B) is a modification of the standard DD formula based on the introduction of an artificial diffusion matrix (see also [14]). Method A can be classified as a Galerkin FE approximation of the density current (written in generalized ohmic form) where the harmonic average of the electrical conductivity is used instead of the standard average (cf. [15, 16]), while method B is a genuine 3D extension of the classic 1D Scharfetter-Gummel (SG) difference formula for the computation of the current density over a 1D element [17], having interpreted this latter formula as the result of a stabilized Galerkin FE approximation with the use of an ‘optimal’ artificial diffusion (cf. [18]). The proposed finite element models are validated in the numerical study of 3D device structures (p-n junction diode and MOS transistors) under on-state and off-state regimes. Results clearly indicate that method A provides the best performance in terms of physical accuracy and numerical stability, and demonstrate the ability of the simulation model in accurately computing the I-V characteristic of the device until the onset of drain-to-bulk junction breakdown.

The outline of the article is as follows. In Section 2 we review the DD model and the algorithms for its finite element discretization. In Section 3 we illustrate the novel methods proposed to calculate the current densities in the device. In Section 4 we carry out their extensive validation by comparing the obtained results with a reference simulation suite in the study of 3D p-n and MOS structures including the II generation term. Concluding remarks and perspectives are addressed in Section 5.

2 Physical models, solution map and finite element discretization

In this section we present the mathematical model of the problem, the functional iteration for its decoupled solution and the finite element approximation of the linearized subproblems obtained through decoupling.

2.1 Drift-Diffusion model

The Drift Diffusion model (DD) is described by the Poisson equation coupled with the continuity equations for the mobile carriers, electrons and holes (see [57]):

$$\begin{aligned} & \nabla\cdot\mathbf{D} = q(p-n+D), \end{aligned}$$
(1a)
$$\begin{aligned} & -q\frac{\partial n}{\partial t} + \nabla\cdot\mathbf{J}_{n} = q (R -G), \end{aligned}$$
(1b)
$$\begin{aligned} & q\frac{\partial p}{\partial t} + \nabla\cdot\mathbf{J}_{p} = -q (R -G), \end{aligned}$$
(1c)
$$\begin{aligned} & \mathbf{D} = \epsilon\mathbf{E}, \end{aligned}$$
(1d)
$$\begin{aligned} & \mathbf{E} = - \nabla\varphi, \end{aligned}$$
(1e)
$$\begin{aligned} & \mathbf{J}_{n} = q \mu_{n} n \mathbf{E} + q D_{n} \nabla n, \end{aligned}$$
(1f)
$$\begin{aligned} & \mathbf{J}_{p} = q \mu_{p} p \mathbf{E} - qD_{p} \nabla p, \end{aligned}$$
(1g)

where D, E, \(\mathbf{J}_{n}\) and \(\mathbf{J}_{p}\) are the electric displacement, electric field and electron and hole current densities, while ϵ is the dielectric constant, φ the electrostatic potential, p and n the hole and electron concentrations, D the total net doping, q the elementary charge, \(\mu_{n}\) and \(\mu_{p}\) the electron and hole mobilities, \(D_{n}\) and \(D_{p}\) the electron and hole diffusivities, while R and G are the net recombination and generation rates. Adopted models for \(\mu_{n}\) and \(\mu_{p}\) account for an appropriate description of the scattering mechanism from lattice and impurities and the degradation due to high electric fields. Adopted models for R account for two-particle (Shockley Read and Hall model) and three-particle net recombination mechanisms (Auger model) while the function G is a pure generation term and accounts for particle rate per unit volume due to Impact Ionization (Van Overstraeten-De Man model) with the projection of electric field along current density direction as a driving force. We refer to [5], Chapter 4 for an exhaustive discussion of recombination/generation mechanisms in semiconductors and associated model parameters used in numerical simulations.

2.2 Geometrical notation and boundary conditions

The DD equation system (1a)-(1g) is solved in a 3D computational domain Ω that represents the geometrical model of the semiconductor device under investigation. In our analysis stationary conditions are always assumed (i.e., \(\partial n/\partial t = \partial p/\partial t = 0\)), without compromising the validity of the proposed approximation methods.

The device domain Ω is the union of two open disjoint subsets, \(\Omega_{\mathrm{Si}}\) (doped silicon part), and \(\Omega_{\mathrm{ox}}\) (oxide part) which it is assumed to be a perfect insulator. The device boundary Ω is divided into three disjoint subsets, \(\Gamma_{D}\), \(\Gamma_{N}\) and \(\Gamma _{\mathrm{int}} = \partial\Omega_{\mathrm{Si}} \cap\partial\Omega_{\mathrm{ox}}\). The subset \(\Gamma_{D}\) includes the ideal ohmic contacts of the device, i.e., equipotential surfaces where Dirichlet boundary conditions for the dependent variables φ, n and p are applied. On the subset \(\Gamma_{N}\) homogeneous Neumann boundary conditions are applied to prevent any exchange of electric and current flux with the surrounding environment. On the interface subset \(\Gamma_{\mathrm{int}}\) continuity of electric potential and of the normal component of the electric displacement is assumed, while tangentially flowing electron and hole current fluxes are enforced. This latter condition reflects the fact that the oxide region is assumed to behave as a perfect insulator and prevents from simulating current injection from the semiconductor into the oxide region.

The complete set of boundary conditions is:

φ = φ D , n = n D on  Γ D , p = p D , D n = 0 , J n n = 0 on  Γ N , J p n = 0 , D n = 0 , J n n = 0 on  Γ int , J p n = 0 ,
(2)

where \(\varphi_{D}\), \(p_{D}\) and \(n_{D}\) are boundary data for the dependent variables (computed as described in [5], Chapter 5), n is the outward unit normal vector over \(\partial\Omega= \Gamma_{D} \cup\Gamma_{N}\) and over \(\Gamma_{\mathrm{int}}\), while is the jump operator across the interface \(\Gamma_{\mathrm{int}}\).

2.3 The Gummel solution map

The highly nonlinear coupled nature of the DD system makes its analytical treatment very difficult, if not impossible. For this reason, numerical schemes must be used to compute an approximate solution. To this purpose, we introduce the Maxwell-Boltzmann (MB) statistics for electrons and holes:

$$\begin{aligned} & n = n_{i} \exp \bigl( (\varphi-\varphi_{n})/V_{\mathrm{th}} \bigr), \end{aligned}$$
(3a)
$$\begin{aligned} & p = n_{i} \exp \bigl( (\varphi_{p} - \varphi)/V_{\mathrm{th}} \bigr), \end{aligned}$$
(3b)

where \(n_{i}\) is the intrinsic concentration in the semiconductor material, \(\varphi_{n}\) and \(\varphi_{p}\) are the quasi-Fermi potentials for electrons and holes while \(V_{\mathrm{th}}\) is the thermal voltage. The Gummel’s decoupled algorithm for the iterative solution of the DD system (2.1) consists of solving first the Poisson equation (1a) with respect to the dependent variable φ, having expressed n and p with the Maxwell-Boltzmann statistics (3a) and (3b). This makes (1a) exponentially nonlinear with respect to the electric potential so that to update this latter dependent variable the Newton method is employed [19]. The continuity equations (1b)-(1c) are then successively solved in linear form with respect to the dependent variables n and p inside the adopted decoupled Gummel map algorithm [20]. The lagging procedure introduced and analyzed in [6] is used to treat the recombination/generation terms in such a way that each decoupled continuity equation enjoys a continuous maximum principle. This ensures the well-posedness of each subproblem and the strict positivity of the carrier densities as physically required. Then, once a step of the Gummel iteration is completed, the MB relations (3a) and (3b) are inverted to update the quasi-Fermi potentials using the computed iterates for φ, n and p, until convergence. Linear convergence of the Gummel algorithm is theoretically proved in [6], although practical computational experience often shows superlinear convergence behavior.

2.4 Finite element approximation

The simulation domain Ω is divided in a discrete partition \(\mathcal{T}_{h}\) made by elements K, each element K being a tetrahedron such that \(\overline{\Omega} = \bigcup_{K \in\mathcal{T}_{h}} \overline{K}\). Then, each differential problem in the Gummel decoupled algorithm of Section 2.3 is written in weak form (see [21], Chapter 5) and numerically approximated using the displacement-based Galerkin Finite Element Method (GFEM) with piecewise linear conforming elements for potential and carrier densities (see [21], Chapter 6). In the particular case of the linearized continuity equations, to avoid numerical instabilities due to possible dominance of the drift term, the variant of the GFEM denoted Edge Averaged Finite Element (EAFE) method proposed and analyzed in [2225] is used in the numerical approximation. The EAFE scheme enjoys several good properties (numerical stability, robustness and convergence with respect to the mesh discretization parameter). In particular, in the 2D case if \(\mathcal{T}_{h}\) is a Delaunay triangulation, then the stiffness matrix associated with (1c) and (1b) is a M-matrix [24]. As a consequence, the computed solution (electron and hole density) satisfies the Discrete Maximum Principle (DMP) and is strictly positive over the computational domain. Unfortunately this property is no longer valid in a 3D framework because the Delaunay condition on the mesh is not enough to guarantee the M-matrix property, so that a more restrictive condition, sufficient to ensure the DMP, is presented in [22] (cf. Eq. (2.5), Lemma 2.1). In the numerical experiments shown in Section 4 we have used this latter condition to check the geometrical quality of the triangulation \(\mathcal{T}_{h}\) in the critical case where the generation term due to impact ionization phenomena becomes the dominating driving force in the simulation of charge transport in the device.

3 Finite element models for the current density

The construction of a stable and accurate approximate current density field in a primal-based FE formulation is not a trivial task because of possible numerical problems arising from differentiation and cancellation in the DD transport relations (1f)-(1g). Throughout the remainder of this section we assume that the solution of the DD system (1a)-(1g) is carried out using the Gummel decoupled algorithm described in Section 2.3. We first introduce a simple finite element formula for the representation of the current density over each mesh element which is a straightforward implementation of (1f)-(1g). Then, we propose two novel finite element methodologies for current density discretization. The novel approaches have a much simpler implementation than other more sophisticated formulations (such as the dual-mixed hybridized FEM with exponential fitting, see [16, 26]) and are completely compatible for use in the classical Box Integration Method (BIM) [9, 10]. Of the three examined methods, one reveals to be particularly robust and accurate, as demonstrated by several tests discussed in Section 4. In the remainder of the article, for a given element K in \(\mathcal{T}_{h}\) we denote the volume of K by \(\operatorname {vol}(K)\); moreover, with the subscript K we refer to a quantity defined in the interior of K while the subscript h refers to a quantity defined at the vertices of K. For a given function \(f : K \rightarrow\mathbb{R}\) we define \(\langle f \rangle_{K}:=\int_{K} f \,dK /\operatorname {vol}(K)\) the mean integral value of f over K. We also assume that carrier mobility (and the associated diffusivity through Einstein’s relation) is constant in K, while the electric potential is linear (so that the associated electric field is a constant vector in K).

3.1 The DD formula

The simplest FE model for the current densities (method DDFE) is obtained by substituting into the transport equations (1f)-(1g) the functions n, p and φ with their corresponding FE approximations \(n_{h}\), \(p_{h}\) and \(\varphi_{h}\), and then by computing the integral average of the resulting expressions over the element K. This yields

$$\begin{aligned} & \mathbf{J}_{n,K} = q \mu_{n} \langle n \rangle_{K} \mathbf{E}_{K} + q D_{n} \nabla n_{h,K}, \end{aligned}$$
(4a)
$$\begin{aligned} & \mathbf{J}_{p,K} = q \mu_{p} \langle p \rangle_{K} \mathbf{E}_{K} - q D_{p} \nabla p_{h,K}. \end{aligned}$$
(4b)

It is immediate to see that the discrete current densities (4a) and (4b) automatically reproduce the limiting cases of pure diffusive flow (\(\mathbf{E}_{K} = \mathbf{0}\)) and pure ohmic flow (\(n_{h,K} = \mathrm {constant}\) and \(p_{h,K} = \mathrm {constant}\)). In the case of thermal equilibrium (\(\mathbf{J}_{n,K} = \mathbf{J}_{p,K} = \mathbf{0}\)), we can anticipate computational difficulties with the use of (4a) and (4b) because of exact cancellation of drift and diffusive current contributions. Thus, by a continuity argument, we also see that method DDFE does not seem appropriate in the numerical treatment of the subthreshold current regime, where currents are not exacty equal to zero but are very small. The two following formulations are designed to overcome this limitation.

3.2 Method A

To describe method A we consider the case of electron continuity equation because a completely similar treatment holds for the hole continuity equation. Using (3a) into (1f) yields the equivalent ohmic form of the electron current density

$$ \mathbf{J}_{n} = \sigma_{n} \mathbf{E}_{n}, $$
(5a)

where:

$$\begin{aligned} & \sigma_{n} = q \mu_{n} n = q \mu_{n} n_{i} \exp \bigl( (\varphi-\varphi _{n})/V_{\mathrm{th}} \bigr) , \end{aligned}$$
(5b)
$$\begin{aligned} & \mathbf{E}_{n} = - \nabla\varphi_{n} \end{aligned}$$
(5c)

are the electrical conductivity and the effective electric field experienced by electrons, respectively. We notice that the use of the ohmic representation yields a nonlinear gradient form of the DD current density. To construct the finite element model for \(\mathbf{J}_{n}\) as in (5a), we use the primal-mixed (PM) FEM introduced and analyzed in [12] and recently extended to the case of advective-diffusive operators in [13]. In the PM FEM of lowest order, the approximate current density is constant over each \(K \in\mathcal{T}_{h}\) while the approximate quasi-Fermi potential is piecewise linear and continuous over \(\mathcal{T}_{h}\). Let us introduce the finite element spaces of piecewise constant and piecewise linear continuous functions over \(\mathcal{T}_{h}\):

$$\begin{aligned} & Q_{h}= \bigl\{ w \in L^{2}(\Omega) : w|_{K}\in \mathbb {P}_{0}(K)\ \forall K \in \mathcal{T}_{h} \bigr\} , \end{aligned}$$
(5d)
$$\begin{aligned} & V_{h}= \bigl\{ u \in C^{0}(\overline{\Omega}) : u|_{K} \in \mathbb {P}_{1}(K)\ \forall K \in\mathcal{T}_{h} \bigr\} \end{aligned}$$
(5e)

and the approximate electrical conductivity \(\sigma_{n,h}:= q \mu_{n} n_{i} \exp ( (\varphi_{h} -\varphi_{n,h})/V_{\mathrm{th}} )\) where \(\varphi_{n,h} \in V_{h}\) and \(\varphi_{h} \in V_{h}\) are the finite element discrete analogues of \(\varphi_{n}\) and φ, respectively. Then, the PM-FE approximation of (5a) reads: find \(\mathbf{J}_{n,h} \in [Q_{h} ]^{3}\) such that

$$ \int_{\Omega}\sigma_{n,h}^{-1} \mathbf{J}_{n,h} \cdot\mathbf{q}_{h} \,d\Omega + \int _{\Omega}\nabla\varphi_{n,h} \cdot\mathbf{q}_{h} \,d\Omega= 0 \quad \forall\mathbf{q}_{h} \in [Q_{h} ]^{3}, $$
(5f)

where \(\mathbf{J}_{n,h} \in [Q_{h} ]^{3}\) is the finite element discrete analogue of \(\mathbf{J}_{n}\). Since functions in \([Q_{h} ]^{3}\) are discontinuous, relation (5f) amounts to

$$ \int_{K} \sigma_{n,h}^{-1} \mathbf{J}_{n,h} \cdot\mathbf{q}_{h} \,dK + \int _{K} \nabla\varphi_{n,h} \cdot\mathbf{q}_{h} \,dK = 0 \quad \forall\mathbf{q}_{h} \in \bigl[\mathbb {P}_{0}(K) \bigr]^{3}. $$
(5g)

Using in (5g) the standard basis functions for \([\mathbb {P}_{0}(K) ]^{3}\) we obtain

$$ [\mathbf{J_{n,h}}]_{i} = - \mathcal{H}_{K} ( \sigma_{n,h} ) \frac{\partial\varphi_{n,h}}{\partial x_{i}}, \quad i = 1,2,3, \forall K \in \mathcal{T}_{h}, $$
(5h)

where \(\mathcal{H}_{K} ( \sigma_{n,h} ) := ( \langle \sigma_{n,h}^{-1} \rangle_{K} )^{-1}\) is the harmonic average of \(\sigma_{n,h}\) over the element K.

We notice that the approximation of the current density with a constant vector may appear inadequate because of the exponential dependence of the electron conductivity on the quasi-Fermi potential (cf. (5b)), except in the case of a constant electron concentration. It can be proved (see [12]) that \(\| \mathbf{J}_{n} - \mathbf{J}_{n,h} \|_{L^{2}(\Omega)} = \mathcal{O}(h)\), where h denotes the mesh parameter, so that an appropriate reduction of the grid size may be required to achieve a desired fixed tolerance.

To numerically compute in a simple and accurate manner the harmonic average of the electrical conductivity, we use the following quadrature rule (see also [27, 28])

$$ \biggl(\frac{\int_{K} \sigma^{-1}_{n,h} \,dK}{\operatorname {vol}(K)} \biggr)^{-1} \simeq \biggl(\frac{\int_{e \ast} \sigma^{-1}_{n,h} \,de}{|e^{\ast}|} \biggr)^{-1} $$
(5i)

\(e^{*}\) being the edge of ∂K where the maximum drop of \(\sigma_{n,h}\) occurs and \(|e^{\ast}|\) its Euclidean length. Upon introducing the linear dimensionless potential \(\Phi_{n} := (\varphi_{h} - \varphi_{n,h} )/V_{\mathrm{th}}\) and the two vertices of K: \(\mathbf{x}_{m}\) s.t. \(\Phi_{n}(\mathbf{x}_{m})=\Phi_{n,m} := \min_{K}(\Phi_{n})\) and \(\mathbf{x}_{M}\) s.t. \(\Phi_{n}(\mathbf{x}_{M})=\Phi_{n,M}:= \max_{K}(\Phi_{n})\), we define \(e^{\ast}\) to be the edge of ∂K which connects \(\mathbf{x}_{m}\) and \(\mathbf{x}_{M}\), so that (5i) yields

$$ \int_{K} \sigma_{n,h}^{-1} \,dK \simeq q \mu_{n} n_{i} \exp(\Phi_{n,m}) \mathcal{B}( \Phi_{n,m}-\Phi_{n,M}), $$
(5j)

where \(\mathcal{B}(Z): = Z/(\exp(Z) -1)\) is the inverse of the Bernoulli function, such that \(\mathcal{B}(0)=1\). Using MB statistics (3a), relation (5j) can be written as

$$ \int_{K} \sigma_{n,h}^{-1} \,dK \simeq q \mu_{n} n_{m} \mathcal{B}(\Phi_{n,m}- \Phi_{n,M}), $$
(5k)

where \(n_{m}: = n_{i} e^{\Phi_{n,m}}\). Expression (5j) can be also written as

$$\int_{K} \sigma_{n,h}^{-1} \,dK \simeq q \mu_{n} n_{i} \exp(\Phi_{n,m}) \exp( \Phi_{n,M}) \exp(-\Phi_{n,M}) \mathcal{B}(\Phi_{n,m}- \Phi_{n,M}) $$

which, using again (3a) and the property \(\exp(Z)\mathcal{B}(Z) = \mathcal{B}(-Z)\), becomes

$$ \int_{K} \sigma_{n,h}^{-1} \,dK \simeq q \mu_{n} n_{M} \mathcal{B}(\Phi_{n,M}- \Phi_{n,m}), $$
(5l)

where \(n_{M} :=n_{i} e^{\Phi_{n,M}}\). Combining (5k) and (5l), we find

$$ \mathbf{J}_{n,K} = - q \mu_{n} \biggl[ \frac{ n_{m} \mathcal{B}(-\Delta\Phi_{n,\mathrm{max}}) + n_{M}\mathcal{B}(\Delta\Phi_{n,\mathrm{max}})}{2} \biggr]\nabla\varphi_{n,h}, $$
(5m)

where \(\Delta\Phi_{n,\mathrm{max}} := \Phi_{n,M} - \Phi_{n,m}\). In the case where \(\Delta\Phi_{n,\mathrm{max}}\) is sufficiently large, we see that the 1D harmonic average of \(n_{h}\) along the edge \(e^{\ast}\) has the effect of ‘favoring’ the smallest nodal concentration in the element K rather than the largest one, so that we approximately have

$$ \mathbf{J}_{n,K} \simeq- q \mu_{n} n_{m} \max _{\partial K} \frac{(\varphi_{h} - \varphi_{n,h})}{2 V_{\mathrm{th}}} \nabla\varphi_{n,h}. $$
(5n)

If, instead, \(\Delta\Phi_{n,\mathrm{max}}\) is small then the 1D harmonic average behaves appreciably as the standard average and we obtain

$$ \mathbf{J}_{n,K} \simeq- q \mu_{n} \frac{n_{m} + n_{M}}{2} \nabla\varphi_{n,h}. $$
(5o)

Following the same procedure as above also for the hole current density we have

$$ \mathbf{J}_{p,K} = - q \mu_{p} \biggl[ \frac{ p_{m} \mathcal{B}(-\Delta\Phi_{p,\mathrm{max}}) + p_{M} \mathcal{B}(\Delta\Phi_{p,\mathrm{max}})}{2} \biggr]\nabla\varphi_{p,h}, $$
(5p)

where \(\Phi_{p}:= (\varphi_{p,h}-\varphi_{h})/V_{\mathrm{th}}\) and \(\Delta\Phi_{p,\mathrm{max}}:= \max_{K} \Phi_{p} - \min_{K} \Phi_{p}\).

3.3 Method B

In the previous section the discrete form of the current density is constructed by starting from the equivalent ‘ohmic’ representation in terms of the quasi Fermi potential, and then by performing a suitable approximation of the electrical conductivity over the finite element K. Here, we continue along the same line of thought, but starting from the classic DD format (1f)-(1g), with the intent of using the method of Streamline Upwind artificial diffusion proposed in [18] for the advective-diffusive model to stabilize the computation in the presence of a high electric field.

3.3.1 The 1D SG method as an artificial diffusion scheme

In the 1D setting the artificial diffusion technique consists of replacing the electron diffusion coefficient \(D_{n}\) with the modified quantity

$$ D_{n,h} = D_{n} + D_{n} \Phi (\mathbb {P}e|_{K} ), $$
(6a)

where Φ is a suitable nonnegative stabilization function of the 1D local Pèclet number \(\mathbb {P}e|_{K} = (h |\partial_{x} \varphi_{h}| )/(2V_{\mathrm{th}}) = |\Delta\varphi|/(2 V_{\mathrm{th}})\), h and Δφ being the length of the 1D interval and the potential drop over the interval, respectively. The local Pèclet number gives a measure of how much the drift term dominates over the diffusion term in the transport mechanism. If \(\mathbb {P}e|_{K} > 1\) the problem is locally drift (advection)-dominated and in such a case we need introduce an extra amount of diffusion in (6a) (given by \(D_{n} \Phi(\mathbb {P}e|_{K})\)) to prevent the occurrence of unphysical spurious oscillations in the computed solution, which might even lead to a negative electron concentration. If \(\mathbb {P}e|_{K} < 1\) the problem is locally diffusion-dominated and there is no need of adding an extra diffusion, so that the standard GFEM is enough for obtaining an accurate and numerically stable solution. Based on the last observation, the function Φ has to satisfy the property of consistency

$$ \lim_{\mathbb {P}e|_{K} \to0} \Phi(\mathbb {P}e|_{K}) = 0\quad \forall K\in\mathcal{T}_{h}. $$
(6b)

The 1D approximation of the electron current density to be used in a stabilized GFEM is thus given by the following relation

$$ J_{n,h}(n_{h}) |_{K} = q \mu_{n} \langle n \rangle_{K} E_{h,K} + q D_{n} \bigl( 1 + \Phi(\mathbb {P}e|_{K})\bigr) ) \partial_{x} n_{h} \quad \forall K\in\mathcal{T}_{h}. $$
(6c)

To design in a physically sound and consistent manner the optimal stabilization function Φ, we assume the modified method to exactly satisfy some limiting cases that often occur in practical important electronic applications. Using a 3D notation, for sake of generality and because this will be used in later extension, the considered cases are:

  1. C1.

    Constant carrier concentrations (only drift contribution): \(\mathbf{J}_{n} = q\mu_{n} n \mathbf{E}\).

  2. C2.

    Constant potential (\(\mathbf{E}=0\), only diffusive contribution): \(\mathbf{J}_{n} = qD_{n}\nabla n\).

  3. C3.

    Constant quasi Fermi potential (no current flow): \(\mathbf{J}_{n} = -q \mu_{n} n \nabla\varphi_{n} = \mathbf{0}\).

Notice that case C3 implies that

$$ n=C e^{\varphi/ V_{\mathrm{th}}}, $$
(6d)

where C is an arbitrary constant such that \(C = \exp(-\overline{\varphi}_{n}/V_{\mathrm{th}})\), \(\overline{\varphi}_{n}\) being a given constant value. Thanks to assumption (6b) the stabilized current (6c) automatically satisfies cases C1 and C2. Case C3 is recovered by imposing (in the 1D setting)

$$ J_{n,h}\bigl(\Pi_{1}^{K} \bigl(Ce^{\varphi/ V_{\mathrm{th}}}\bigr)\bigr) = 0, $$
(6e)

where \(\Pi_{1}^{K}\) is the \(\mathbb {P}_{1}\)-interpolant over the element K. Using (6e) in (6c), noting that \(E_{K} = -\partial_{x} \varphi_{h}\) and using Einstein’s relation, we obtain the following relation for the stabilization function

$$ \Phi(\mathbb {P}e|_{K}) = \frac{\langle n \rangle_{K}}{V_{\mathrm{th}}} \frac{\partial_{x} \varphi_{h}}{\partial_{x} \Pi_{1}^{K}(e^{\varphi/ V_{\mathrm{th}}}) } -1. $$
(6f)

Enforcing relation (6d) at the two vertices \(x_{i}\) of the interval, \(i=1,2\), yields

$$ \Phi(\mathbb {P}e|_{K}) = \sigma \mathbb {P}e|_{K} \frac{e^{\varphi _{1}/V_{\mathrm{th}}}+e^{\varphi_{2}/V_{\mathrm{th}}}}{ e^{\varphi_{2}/V_{\mathrm{th}}}-e^{\varphi_{1}/V_{\mathrm{th}}}} -1 = \sigma \mathbb {P}e|_{K} \frac{e^{2 \sigma \mathbb {P}e|_{K}}+1}{e^{2\sigma \mathbb {P}e|_{K}}-1} -1, $$
(6g)

where \(n_{1}\) and \(n_{2}\) are the two nodal values of \(n_{h}\) while \(\sigma: = \operatorname{sign}(\Delta\varphi)\). Setting for brevity \(X := 2 \sigma \mathbb {P}e|_{K}\), relation (6g) becomes

$$\begin{aligned} \Phi(X) & = \frac{X}{2} \biggl( \frac{e^{X}}{e^{X}-1} + \frac {1}{e^{X}-1} \biggr) -1 = \frac{1}{2} \bigl( \mathcal{B}(-X) + \mathcal{B}(X) \bigr) -1 \\ & = \frac{1}{2} \bigl( X + \mathcal{B}(X) + \mathcal{B}(X) \bigr) -1 = \mathcal{B}(X) +\frac{X}{2} -1, \end{aligned}$$

and replacing the definition of X we obtain for both \(\Delta\varphi> 0\) and \(\Delta\varphi< 0\)

$$ \Phi(\mathbb {P}e|_{K}) = \mathcal{B}(2\mathbb {P}e|_{K}) + \mathbb {P}e|_{K} -1 $$
(6h)

which, upon substitution into (6c), recovers the well known 1D Scharfetter-Gummel scheme [17]. This latter method was originally proposed as an ‘exponential fitting’ difference scheme by Allen and Southwell in [29] and subsequently referred to as an artificial diffusion stabilized scheme with ‘optimal viscosity’ by Brooks and Hughes in [18].

3.3.2 The 3D SG Artificial Diffusion method

In a 3D framework a straightforward extension of the 1D Scharfetter-Gummel stabilization (6g)-(6h) can be obtained by introducing a \(3\times3\) diagonal stabilizing tensor \(\underline{\underline{\boldsymbol{\Phi}}}^{K}\) defined on each element \(K \in\mathcal{T}_{h}\) as follows

$$ \underline{\underline{\boldsymbol{\Phi}}}^{K}_{ ii} = \frac{\langle \Pi_{1}^{K}(e^{(\varphi_{h}-\varphi_{M})/V_{\mathrm{th}}}) \rangle_{K} \partial _{x_{i}}\varphi}{\partial_{x_{i}}\Pi_{1}^{K} (e^{(\varphi_{h}-\varphi_{M})/V_{\mathrm{th}}})V_{\mathrm{th}}} -1,\quad i = 1,2,3, $$
(6i)

where we have used, as a reference value for the potential reference to avoid overflow exceptions in the machine evaluation of (6i), the maximum \(\varphi_{M}\) of \(\varphi_{h}\) in K.

The 3D approximate electron current density is then

$$ \mathbf{J}_{n,K} = q \mu_{n} \langle n \rangle_{K} \mathbf{E}_{K} + qD_{n}(\underline{ \underline{\mathcal{I}}}+\underline{\underline {\boldsymbol{\Phi}}}_{K}) \nabla n_{h}, $$
(6j)

where \(\underline{\underline{\mathcal{I}}}\) is the \(3\times3\) identity tensor. In a completely similar manner we have the relation for the hole current density

$$ \mathbf{J}_{p,K} = q \mu_{p} \langle p \rangle_{K} \mathbf{E}_{K} - qD_{p}(\underline{ \underline{\mathcal{I}}}+ \underline{\underline{\boldsymbol{\Phi}}}_{K}) \nabla p_{h}, $$
(6k)

where

$$ \underline{\underline{\boldsymbol{\Phi}}}^{K}_{ ii} = \frac{\langle \Pi_{1}^{K}(e^{(\varphi_{m}-\varphi_{h})/V_{\mathrm{th}}})\rangle_{K} \partial _{x_{i}}\varphi}{\partial_{x_{i}} \Pi_{1}^{K}(e^{(\varphi_{m}-\varphi_{h})/V_{\mathrm{th}}})V_{\mathrm{th}}} -1, \quad i = 1,2,3, $$
(6l)

\(\varphi_{m}\) being the minimum of φ over K. It is immediate to check that the two proposed novel approximations (6j) and (6k) satisfy all cases C1, C2 and C3 in Section 3.3.1.

4 Numerical experiments

This section is devoted to the illustration of several numerical experiments. Specifically, Section 4.1 is aimed to check the correctness of the implementation of the 3D-EAFE method within the FEMOS code and to compare the performance of this finite element approach with a reference-commercial tool based on the BIM [30]. Section 4.2 is aimed to compare the performance of the various methods for current calculation of Section 3 against physical expectation. The conclusion of these tests is that method A turns out to be the most stable and accurate, so that Section 4.3 employs such method in a self-consistent simulation of semiconductor devices in the presence of II generation terms. 3D ideal and realistic devices commonly employed in semiconductor industry are taken into consideration: (1) a p-n ideal diode; (2) a p and n-MOS ideal devices; and (3) a p-MOS device resulting from a realistic 2D-process simulation accounting for non ideal doping profiles. Device (3) represents a severe benchmark to highlight the accuracy and the stability of the different current density calculation methods.

4.1 Accuracy of the 3D-EAFE discretization

Several tests have been performed in order to assess the correctness of the EAFE approach in the simulation of semiconductor devices in 3D framework. For reading benefits we just report a benchmark with a reference commercial tool based on n-p-channel MOSFET structure of Figure 1(a) (\(Gate Length:100 \mbox { nm}\); \(Gate Ox:30 \mbox { nm}\) and contacts \(S=Source\), \(Gate\), \(D=Drain\), \(Body\)) with analytical doping profiles: Figure 1(b) reports the doping profile for the n-channel device (the p-channel device is obtained just by the inverting the doping type). The agreement with the reference simulation tool is very good as reported in Figure 2(a) for the n-channel device and in Figure 2(b) for the p-channel device for several \(Drain\) bias conditions (electron and hole mobilities have been calculated using the Canali model [31]). We point out that the calculation of the current at the contacts is carried out by extending to the 3D case the approach proposed in [32] for the 2D case: this novel formulation, known as residual method, is based on the approach proposed in [33]. We also notice that the 3D-profiles of carriers are indistinguishable between FEMOS and Ref. [30] as reported in Figure 3 in the case of electrons in the n-channel device at \(V_{\mathrm{Gate}}=2 \mbox { V}\) and \(V_{\mathrm{Drain}}=0.1 \mbox { V}\).

Figure 1
figure 1

MOS structure: Left: 3D tetrahedral mesh. Right: Doping profile in the case of the n-channel device.

Figure 2
figure 2

MOSFET forward bias: comparison between FEMOS (line) and Ref. [ 30 ] (symbol). Left: n-channel Id-Vg. Right: p-channel Id-Vg.

Figure 3
figure 3

n -MOSFET electron concentration at \(\pmb{V_{\mathrm{Gate}}=2 \mbox { V}}\) and \(\pmb{V_{\mathrm{Drain}}=0.1 \mbox { V}}\) . Left: FEMOS calculation. Right: Ref. [30] calculation.

4.2 Evaluation of the performance of current calculation methods

To evaluate the simulation performance of the methods for current calculation introduced in Section 3 we have firstly considered a diode structure with a p-n junction geometrically represented by the cubic region Ω= ( 0 , 0.3 ) 3 μ m 3 (see Figure 4(a)). A Gaussian implantation of donors with a peak of 1019 cm−3 and a depth of 0.15μm (see Figure 4(b)) is made over a p-type region with a constant acceptor profile of 1018 cm−3 magnitude. One contact is defined for each of the doped regions: for the n-type part a rounded-shape contact is used (Top), while the p-type part is contacted at the bottom face (Body). As mentioned in Section 2.2, contacts are considered of pure Ohmic-type with appropriate Dirichlet boundary conditions: the Top is maintained at ground while the Body is ramped at 0.8 V. In Figure 5 the results of the calculation of the current density vector field for electrons and holes in the semiconductor bulk are represented through streamlines connecting the Body with the Top contact. As expected the current calculation obtained with method DDFE (eq. (4a) and (4b)) is affected by a critical behavior, in particular close to and inside the n-junction as shown in Figure 5(a) and Figure 5(b) where instability has to be ascribed to numerical cancellation of the drift and diffusion contributions. Results get definitely better by employing method B using Eqs. (6j) and (6k) where the improvement can be appreciated in Figure 5(e) and Figure 5(f). However, a careful inspection of the hole current density reveals that some numerical instability is still evident inside the n-junctions. The extension of the 1D SG scheme to 3D provided by method A in Eq. (5h) and (5p) results in the streamlines presented in Figure 5(c) and Figure 5(d): no spurious instability can be observed anymore and our calculations are in excellent agreement with the results of a commercial code (not shown here). The comparison between the different current computational methods has also been carried out on a p-channel MOSFET. The doping profiles have been obtained by using a 2-D process simulator with implantation and diffusion steps [30] with the purpose to have a realistic doping as reported in Figure 6(a) with a \(Gate Length: 180 \mbox { nm}\) and a \(Gate Ox: 4.5 \mbox { nm}\). The presence of floating non-compensated p-type regions in the channel body increases the computational difficulties. The 2D doping profile has been then extruded in three spatial dimensions, and the generated mesh is shown in Figure 6(b) where the device contact has been highlighted with purple color (the body contact is not shown in the picture but is located in the bottom face of the silicon region). Regarding the calculation of \(\mathbf{J}_{p}\), the numerical difficulties found with method DDFE are still confirmed as clearly depicted in Figure 7(a) not only inside the p-type region but also around the floating regions present in the body (the visualization are referred to the bias conditions \(V_{G}=-1 \mbox { V}\) and \(V_{D}=-0.1 \mbox { V}\)). The marginality found using formula (6k) is increased as reported in Figure 7(c): this comes by the fact that the evaluation of the coefficient in (6i) is again undergoing numerical problems related to roundoff error. However method B is giving a much better current density evaluation with respect to the pure application of the Drift-Diffusion approach at the element level. We conclude this section by noting that, again, the best description of the expected physical behavior of the device is obtained by adopting method A, that turns out to provide an accurate and stable 3D extension of the 1D SG formula, as clearly demonstrated by Figure 7(b).

Figure 4
figure 4

Diode structure: Left: mesh. Right: 2D cut of the doping profile.

Figure 5
figure 5

Electron (left column) and hole (right column) current densities of Diode test case at \(\pmb{V_{\mathrm{body}}=0.8 \mbox { V}}\) . Top: Method DDFE. Middle: Method A. Bottom: Method B.

Figure 6
figure 6

p -MOSFET structure: Left: 2D cut of the doping profile. Right: Mesh.

Figure 7
figure 7

Hole current density calculation of p -MOSFET at \(\pmb{V_{G}=-1.0 \mbox { V}}\) and \(\pmb{V_{D}=-0.1 \mbox { V}}\) : Top left: Method DDFE. Top right: Method A. Bottom: Method B.

4.3 Application of method A to the study of a reverse-biased MOS with II

In this section we implement method A for current calculation inside the Gummel map algorithm to evaluate the II generation term by projecting the electric field along the obtained current streamlines. Since we are using the Van Overstraeten-de Man model for II we have run the simulation not beyond the limits of validity of this model, i.e. when the avalanche field is in the range of \(6 \cdot10^{5} \mbox { Vcm}^{-1}\). Computations are performed by keeping all the contacts at ground and applying a reverse bias to the \(Drain\) contact. Figure 8(a) and Figure 8(b) compare the FEMOS simulations with those obtained using [30] for both n and p channel device under condition of reverse bias. The current calculated at \(Drain\) and \(Source\) contact are quite aligned with the reference tool while the \(Bulk\) component is lower due to the different generation value of II terms (not shown here). Figure 9 and Figure 10 visualize the increase of the II generation term with the increasing of the reverse bias in the case of the n-MOS and p-MOS device. The drain-to-bulk breakdown is starting in the \(Drain\)-\(Gate\) overlap region and proceed until it reaches the drain contact where the current streamlines are more dense as shown in Figure 11(b). A final insight into the simulation of the n-MOS device using method A is offered by Figure 11 which reports the visualization of \(\mathbf{J}_{n}\) in the off and on states: in accordance to physical expectation, in the on-state the electron current density is confined under the gate from drain-to-source while in the off-state it is flowing from source-to-drain and from source-to-body.

Figure 8
figure 8

MOSFET in reverse bias (all contacts are grounded while Drain is reverse biased): comparison between FEMOS (line) and Ref. [ 30 ] (symbol). Left: n-channel device. Right: p-channel device.

Figure 9
figure 9

n -MOSFET: evolution of the II generation term (all contacts are grounded except drain). Left: \(V_{D}=3.4 \mbox { V}\). Right: \(V_{D}=4.8 \mbox { V}\).

Figure 10
figure 10

p -MOSFET: evolution of the II generation term (all contacts are grounded except drain). Left: \(V_{D}=-3.4 \mbox { V}\). Right: \(V_{D}=-4.8 \mbox { V}\).

Figure 11
figure 11

n -MOSFET: electron current density streamlines obtained with method A. Left: On-state at \(V_{G}=1 \mbox { V}\) and \(V_{D}=0.1 \mbox { V}\). Right: Off-state at \(V_{G}=0 \mbox { V}\) and \(V_{D}=0.85 \mbox { V}\).

5 Conclusions

In this article we have addressed the problem of representing in 3D the carrier current density by extending the beneficial properties emanating from the classic 1D Scharfetter-Gummel difference formula. To this purpose, we have adopted the Galerkin Finite Element Method for the numerical simulation of the Drift-Diffusion model in the 3D case, and we have proposed two novel methods for current density evaluation, denoted method A and B, to which, for comparison, we have added also the basic method DDFE using the DD formula. The three schemes compute a piecewise constant approximation of the current density over a tetrahedral partition of the device domain.

Method DDFE turns out to provide the worst results in the test experiments. Method B is a 3D extension of the method of optimal artificial diffusion and gives reasonably accurate results. Method A is based on a primal-mixed formulation endowed with a suitable quadrature formula for the approximate evaluation of the harmonic average of the electrical conductivity. It is by far the best of the three considered approaches, providing simulation results in excellent agreement with a commercial software. The three FE formulations are numerically verified in the study of realistic 3D device structures, also including the presence of Impact Ionization phenomena. Even in this latter case, method A is able to correctly describe the complex carrier flow patterns inside the device bulk and to track the I-V curve of the device up to the avalanche breakdown region.

Despite the fact that the proposed formulations are illustrated and validated in the study of the classic DD transport model in semiconductors, they can be applied in a straightforward manner to the numerical treatment of general conservation laws for advective-diffusive fluxes where the advective term is in gradient form, as is the case for ion electrodiffusion in electrochemistry and biology with the Poisson-Nernst-Planck model [34] and hydrodynamic and quantum-corrected charge transport in semiconductors [3537].