1 Introduction

The study of new semiconductor materials is important from a fundamental science point of view and for the ever increasing number of applications in optoelectronics. Concrete progress requires accurate and simple modelling that can predict optical properties and become a tool for numerical characterization and device design from the ultra-violet to the THz range and Mid Infrared Ranges (Jepsen et al. 2013; Gu et al. 2014; Krier et al. 2012; De la Mare et al. 2009; Zhuang et al. 2008). Furthermore, understanding the properties of new bulk semiconductors has attracted renewed interest in the solar cell material arena to avoid environmentally unfriendly materials such as Cd (Vidal et al. 2010).

The importance of the Coulomb interaction leading to many body and correlation effects is now a well-established fact from bulk to quantum dots and the associated nonlinear effects become significantly pronounced when a semiconductor is highly excited with light fields or electrically injected carriers (Chemla and Shah 2001). Strong light fields create electron–hole pairs, which in turn constitute quantum mechanical many body systems interacting in various ways, e.g. band gap renormalization, band filling and Coulomb enhancement, screening and dephasing arising from the attractive and repulsive scattering processes from electrons, holes, phonons and impurity defects. At low temperatures and small carrier densities, excitonic effects dominate the optical absorption of bulk semiconductors and the excitons are bleached as the temperature and excitation densities increase due to a combination of screening, band filling and dephasing effects. But even at higher temperatures and densities, the Coulomb interaction is important (Chemla and Shah 2001) and plasma theories have delivered excellent approximations for the absorption and gain in this case and have been successfully applied to different isotropic bulk materials (Banyai and Koch 1986) and superlattices treated as effective anisotropic media (Pereira 1995, 2016). Photoluminescence is crucial to characterise new materials and devices under development and this paper combines analytical approximations for the optical absorption and luminescence of semiconductors with Trust Region-Reflective (TRR) methods, delivering a robust numerical characterization method to be used in the characterisation of new bulk semiconductors per direct comparison with experimental spectra. It further extends recent applications of the theory (Oriaku and Pereira 2017) to the case of dilute nitride semiconductors and confirms results for the s-shape of the luminescence peak as a function of temperature.

2 Mathematical formalism

An interesting feature of dilute semiconductor material is the anomalous energy emission peaks at low temperatures, following an unusual s-shape behaviour that is associated with disorder and localization effects, which have been seen in both dilute bismides (Mazzucato et al. 2014) and nitrides (Krier et al. 2012). A simple and efficient method to describe this and other light emission effects in those materials is used here (Oriaku and Pereira 2017), based on analytical solutions for the Photon Green’s functions approach (Pereira and Henneberger 1996, 1998; Michler et al. 1998), delivering a microscopic, fully quantum mechanical solution. Note that, in spite of its success to accurately explain experiments such as both single beam and nonlinear pump-probe photoluminescence (Michler et al. 1998), as well as being a powerful tool to design optical devices and solar cells (Aeberhard 2011).

Nonequilibrium Green’s Functions (NEGF) techniques that include Photon Green’s functions mentioned above and semiconductor Bloch equations, can be applied to both intersubband (Pereira 2011, 2016; Pereira and Faragai 2014) and interband transitions (Pereira et al. 1994; Grempel et al. 1996), but usually requires intensive numerical methods. Therefore, in this project, we did not fully calculate the dephasing attributed to electron–phonon, electron-impurity and electron-alloy disorder scattering and that leads to measured s-shape-like features in dilute semiconductor samples (Imhof et al. 2010; Kesaria et al. 2015). The scattering processes cited above can all be described by selfenergies (Pereira 1995; Wacker 2002; Schmielau and Pereira 2009). Instead, we used Trust Region-Reflective (TRR) methods (Hung 2012) to obtain the values of carrier density, homogeneous and inhomogenous broadening that best characterize experimental results and as a future step, we shall use these numbers to compare and contrast with Nonequilibrium Green’s Functions (NEGF) calculations to help determine the relative influence of each scattering/dephasing mechanisms.

The quantum mechanical Poynting vector describing light emission can be expressed in terms of the Photon Green’s Function, leading to the optical power density spectrum, which can be directly compared with photoluminescence experiments (Oriaku and Pereira 2017).

$$\begin{aligned} I\left( \omega \right) & = \frac{{I_{0} }}{{1 + exp\left( {\beta \left( {\hbar \omega - \mu } \right)} \right)}}\left\{ {\sum\limits_{n = 1}^{\sqrt g } {\frac{4\pi }{n}\left( {\frac{1}{{n^{2} }} - \frac{{n^{2} }}{{g^{2} }}} \right)\delta_{\varGamma } \left( {\xi - e_{n} } \right)} } \right. \\ & \quad \left. { + 2\pi \mathop \smallint \limits_{0}^{\infty } \frac{{sinh\left( {\pi g\sqrt x } \right)}}{{cosh\left( {\pi g\sqrt x } \right) - \cos \left( {\sqrt {4g - g^{2} x} } \right)}}\delta_{\varGamma } \left( {\xi - x} \right)dx} \right\}, \\ \end{aligned}$$
(1)

where \(I_{0} = \frac{{\hbar \omega^{2} e^{2} \left| \varPi \right|^{2} }}{{\pi e_{0} c^{3} a_{0}^{3} }}\), \(e_{{n = - \left( {n^{ - 1} - ng^{ - 1} } \right)^{2} }}\), \(\xi = \left( {\hbar \omega - E_{g} } \right)/e_{0}\), \(g = \left( {\kappa a_{0} } \right)^{ - 1}\) and \(a_{0} ,e_{0}\) denote, respectively the exciton Bohr radius and binding energy.

Fluctuations in the alloy composition are described here by a Gaussian distribution in the dilute Bi mole fraction x. If \(x_{0}\) is the nominal Bi mole fraction, and \(I\left( {x,\omega } \right)\) is the expression in Eq. 8, the inhomogeneously broadened spectrum reads

$$I_{inh} \left( \omega \right) = \frac{1}{{\sqrt {2\pi \sigma } }}\mathop \smallint \limits_{{x_{0} - 3\sigma }}^{{x_{0} + 3\sigma }} I\left( {x,\omega } \right)e^{{ - \left( {\frac{{x - x_{0} }}{\sigma }} \right)^{2} }} dx$$
(2)

Thus, in order to compare our calculations with experimental data, we need three main parameters: the carrier density, the inhomogeneous broadening parameter and the homogeneous dephasing: \(\left( {\rho ,\sigma ,\varGamma } \right)\).

3 Numerical method and results

In our model, \(u_{T} \left( {\rho ,\sigma ,\varGamma } \right)\) is the luminescence peak energy function at temperature T. In the cases that we have investigated, the carrier density, inhomogeneous and homogeneous broadening are restricted to the following intervals (normalized units are used here) \(\in \left[ {1.5 \times 10^{14} ,10^{17} } \right]\), \(\sigma \in \left[ {10^{ - 3} ,2.5 \times 10^{ - 3} } \right]\) and \(\varGamma \in \left[ {1,2} \right]\).

Experimental data such as in Refs. Krier et al. (2012) and Mazzucato et al. (2014) provides the luminescence peak energy at different temperatures for InAsN. Thus, by applying the least squares method, we can estimate the parameters \(\left( {\rho ,\sigma ,\varGamma } \right)\) so that the residual between the theoretical function and the experimental data is minimized. Since the scales of the parameters are different by several orders of magnitude, we introduce auxiliary variables, \(\acute{\rho } = log\left( \rho \right),\acute{\sigma } = 10^{4} \sigma ,\acute{\varGamma } = 10\varGamma .\) That leads to a transformed function, \(u_{T} \left( {\rho ,\sigma ,\varGamma } \right) = \acute{u}_{T} \left( {\acute{\rho },\acute{\sigma },\acute{\varGamma }} \right).\) Krier et al. (2012) provides a series of data points \(d = \left( {d_{1} ,d_{2} , \ldots d_{N} } \right)\) measured at \(T = \left( {T_{1} , \ldots ,T_{N} } \right)\). Therefore, the problem becomes:

$$\mathop {min}\limits_{{x = \left( {\rho ,\sigma ,\varGamma } \right)}} \mathop \sum \limits_{i = 1}^{N} \left( {\acute{u}_{{t_{1} }} - d_{i} } \right)^{2} .$$
(3)

In order to address this problem, we selected the function lsqnonlin in Matlab, which applies Trust Rion-Reflective (TRR) methods. Compared with other techniques, such as line-searching methods, TRR delivers more accurate and less time costly results. In a nutshell, the reasoning underlying the TRR approach is as follows: Suppose we wish to minimize \(F\left( x \right).\) In order to find a point \(x_{i + 1}\) with a smaller function value, we proceed by minimizing the following quadratic model:

$$\mathop {min}\limits_{s \in N} m\left( s \right) = g^{T} s + \frac{1}{2}s^{T} Hs,$$
(4)

where \(N\) is a neighbourhood of \(x_{i}\) and is called the trust region. \(g\) and \(H\) are the gradient and Hessian of \(F.\) particular, \(s \in N\) is equivalent to \(\left\| {D_{i} s} \right\| < \Delta_{i}\), where \(D_{i}\) is a diagonal scaling matrix and \(\Delta_{i}\) is a positive scalar. Then the current point is updated to \(x_{i} + s\) if \(F\left( {x_{i} + s} \right) < F\left( {x_{i} } \right);\) otherwise, the current point remains unchanged, \(\left( {x_{i + 1} = x_{i} } \right)\), the region of trust is shrunk and the trial step computation is repeated. In other words, we set the new step size \(\Delta_{i + 1} \in \left( {0,\left. {\tau \Delta_{i} } \right]} \right.\), given a scaling factor \(0 < \tau < 1\).

In order to solve for \(s\) in Eq. (4), there are several good algorithms. However, they require time proportional to several factorisations of \(H\). Therefore, for trust-region problems, a more efficient approach is needed. The approximation approach followed in Optimization Toolbox solvers is to restrict the trust-region subproblem to a two-dimensional subspace S (Branch et al. 1999; Byrd et al. 1988). Once the subspace \(S\) has been computed, the work to solve Eq. (4) is trivial even if full eigenvalue/eigenvector information is needed, since in the subspace, the problem is only two-dimensional. The dominant work has now been shifted to the determination of the subspace.

The two-dimensional subspace \(S\) is determined with the aid of a pre-condition conjugate gradient described below. The solver defines \(S\) as the linear space spanned by \(s_{1}\) and \(s_{2}\), where \(s_{1}\) is the direction of the gradient \(g\) and \(s_{2}\) is either an approximate Newton direction, i.e., a solution to

$$H \cdot s_{2} = - g,$$
(5)

or a direction of negative curvature,

$$s_{2}^{T} \cdot H \cdot s_{2} < \, 0.$$
(6)

The philosophy behind this choice of \(S\) is to force global convergence (via the steepest descent directive or negative curvature direction) and achieve fast local convergence (via the Newton step, when it exists).

In our Numerical Experiments for Ref. Mazzucato et al. (2014), confirming the results found in Ref. Oriaku and Pereira (2017), the initial values for \(\left( {\rho ,\sigma ,\varGamma } \right)\) were \(\left[ {1E15,2E - 3,1} \right]\). After nine iterations, the trust region shrank and the optimal values remained the same with the initial. These initial values were obtained by try and error during several runs. Therefore, we conclude that the local minimum for the function is found.

Figures 1 and 2 show results of the comparison between theory and experiments in Ref. Krier et al. (2012), extending the findings of Ref. Oriaku and Pereira (2017) for the InAsN case.

Fig. 1
figure 1

Theoretical versus experimental (inset) luminescence for a InAsN sample at various temperatures. From a to m the temperatures are: 4, 20, 40, 60, 80, 100, 120, 150, 180, 210, 240, 270 and 300 K. The low energy feature (around 0.28 meV) at high temperatures is due to the presence of CO2 in the optical path of the measurements. The experimental data for comparison has been extracted from Krier et al. (2012). The density used in the calculations is N = 1014 carriers/cm3

Fig. 2
figure 2

Plot of peak PL energy E PL (eV) versus T (K) for the mole fractions a x = 0.3%, b x = 0.62%, c x = 1.6% of InAs1−xNx on InAs, Good agreement can be observed between the theory (black) and experiments from Krier et al. (2012). The carrier density used is n = 1015 cm−3

It is interesting to see that lower quality interfaces lead to more scattering and thus to a more pronounced s-shape as shown in Fig. 3 where InAsN is grown on GaAs.

Fig. 3
figure 3

Plot of peak PL energy E PL (eV) versus T (K) for InAs1−xNx on GaAs. The experimental data has been extracted from Krier et al. (2012). Good agreement can be observed between the theory (black) and experiments from Krier et al. (2012). The carrier density used is n = 1015 cm−3

In summary, the automated numerical method developed validated our previous work that took days of trial and error attempt for a single data set. It can now be applied to a number of new materials, serve as guideline to interpret experimental data and to guide the choice of microscopic models that better deliver agreement with experiments. The comparison with experiments for the InAsN nitride case further validates our method as a powerful tool to support the characterization and development of new materials and devices.