1 Introduction

The \(III\text{-}V\) ternary and quaternary semiconducting materials have attracted a great scientific and technological research interest last decade, thanks to their unique and exceptional electrical and optical properties missing in Silicon. Among these properties one can note high mobility of electrons, which found in \(InGaAs\) componds, make them promising candidates to manufacture MOSFETs components (Xuan et al. 2008). Also, the possibility to tune the gap energy value in these materials provides a large panel of optoelectronic applications, such as photodetectors, long wavelength semiconductor lasers (Agrawal 1986), light-emitting diodes (LEDs) (Tarbi and Mani 2015), and optic fiber for high-speed data communications (Fashinger et al. 1994) devices. Further arguments concern the development of supplementary electronic heterostructures (Akiba et al. 1979).

\(InGaAsP\) quaternary system we suggest to study here is an \(III\text{-}V\) semiconducting material, which crystalizes in the bulk zinc-blende (ZB) crystal structure. Its chemical synthesis can be performed by using epitaxial crystal growth techniques, on the surface of an \(InP\) substrat, over a wide range of chemical \(x\) and \(y\) compositions (Seifert and Runge 2016; Manasevit 1968). Physical properties of the \(InGaAsP\) alloys such as the lattice parameter and the energy gap can be changed by modifying many control parameters such as chemical composition ratio \(x\) and \(y\) (Vurgaftman et al. 2001), temperature and even pressure (Degheidy and Elkenany 2015). The alloy prepared on the \(InP\) substrate covers a range of energies gap ranging from \(0.75\) to \(1.35\; {\text{eV}}\), whilst, it takes values between \(1.42\) and \(1.9\;{\text{eV}}\) when it prepared on \(GaAs\) substrate (Houston 1981).

This shows their great strength which lies in the almost unlimited number of possible alloys between the four elements \(Ga, As, In\) and \(P\) (Adachi 1982). In particular conditions, one can observe associated with a good matching between lattice parameters of both materials \((InGaAsP/InP)\), a direct band-gap. However, most theoretical calculations of these physical properties are computed by Virtual Crystal Approximation (VCA) and the Vegard’s law (Shim et al. 2000).

This research aims to assume an ANN-LM estimation model for the prediction and the representation of the physical behavior of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\) \((0 \le x \le 0.47\); \(0 \le {\text{y}} \le 1\)) with a high performance compared to the empirical equations used. To show their abilities to model physical phenomena, the predicted results using ANN-LM, and the values computed by equations based on the adoption of interpolation will be compared by the experimental values of the lattice parameters and of the energy gap, by the error indicators \(MSE,\) \(RMSE,\) and \(R^{2}\).

Beyond the state of the art, we presume the above method can be used in several applications. In fact, it offers a complementary method to provide clues about problems concerning complex physical phenomena for existed material. It can be even used for studding and developing new materials. Thus, this pre-manufacturing possibility provides an optimal parameter needed with a low cost and time. In the particular case of photovoltaic cells, the predictions provided by the ANN-LM estimation model method, can be helpful to unveil and expect the temperature-dependent energy-gap fluctuations in the \(InGaAsP/InP\) material. In fact, the so called thermalization loss process, as shown in Fig. 1 below, is responsible of heat transfer to lattice and thus the temperature of the network and disturbs \(E_{g} (T)\).

Fig. 1
figure 1

Schematic view of the thermalization loss effect on the band-gap. The different steps are numbered according to time order

2 Theoretical analysis

2.1 Lattice parameters of quaternary material \(In_{1 - x} Ga_{x} As_{y} P_{1 - y} /InP\): Vergard’s law

The quaternaries \(A_{x} B_{1 - x} C_{y} D_{1 - y}\) consist of two group III elements and two group V elements. To calculate their band parameters, the interpolation approximation (Glisson et al. 1978) is used. It is given by (Krijn 1991):

$$G_{ABCD}^{\prime \prime } \left( {x,y} \right) = \frac{{x\left( {x - 1} \right)\left[ {\left( {1 - y} \right)G_{ABD}^{\prime } \left( x \right) + yG_{ABC}^{\prime } \left( x \right)} \right] + y\left( {1 - y} \right)\left[ {xG_{ACD}^{\prime } \left( y \right) + \left( {1 - x} \right)G_{BCD}^{\prime } \left( y \right)} \right]}}{{x\left( {1 - x} \right) + y(1 - y)}}$$
(1)

The band parameter of the ternary \({\text{A}}_{{1 - {\text{x}}}} {\text{B}}_{\text{x}} {\text{C}}\) can be expressed as (Krijn 1991):

$$G_{ABC}^{'} \left( x \right) = \left( {1 - x} \right)G_{AC} + xG_{BC} - x(1 - x)C_{ABC}$$
(2)

With \(G_{AC}\) and \(G_{BC}\) are the values of the binary \(AC\) and \(BC\) respectively, and \(C_{ABC}\) is the appropriate bowing.

The Vegard’s law is an approximation derived from the linear combination of the lattice constants of the \(GaAs\), \(GaP\), \(InAs\) and \(InP\) binary compounds, which constitute the quaternary material \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\) (Mei 2007), it is given by the following relation:

$$a\left( {{\text{x}},{\text{y}}} \right) = {\text{xy}} a({\text{GaAs}}) + {\text{x}}(1 - {\text{y}})a({\text{GaP}}) + (1 - {\text{x}}){\text{y}}a({\text{InAs}}) + (1 - {\text{x}})(1 - {\text{y}})a({\text{InP}})$$
(3)

With: a(GaAs) = 5.6536 Å, a(GaP) = 5.4512 Å, a(InAs) = 6.0590 Å, and a(InP) = 5.8696 Å, the relationship becomes:

$$a(x,y) = 0.1894y - 0.4184x + 0.0130xy + 5.8696$$
(4)

The lattice mismatch can be calculated by (Vurgaftman et al. 2001):

$$f = \frac{a(x,y) - a(InP)}{{a_{InP} }}$$
(5)

In order to obtain a perfect match between the material \(InGaAsP\) and the substrate \(InP\), the condition below must be imposed.

$$a\left( {x,y} \right) = a(InP)$$
(6)

With \(a\left( {x,y} \right)\) represents the lattice parameter of the material \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\), and \(a(InP)\) indicates the lattice parameter of the \(InP\) substrate. In the absence of a mismatch, we consider \(y \approx 2.2x\) (Vurgaftman et al. 2001; Houston 1981). Equation (4) becomes a single dimension:

$$a\left( y \right) = 5.8696 - 0.0008y + 0.0059y^{2}$$
(7)

With \(a\) is measured in Å. The lattice parameter may undergo a change due to the increase in temperature, according to the Adachi relation (Degheidy and Elkenany 2015):

$$a\left( T \right) = a\left( {300\;{\text{K}}} \right)\left[ {1 + \alpha_{th} \left( {1 - 300\;{\text{K}}} \right)} \right]$$
(8)

where: \(\alpha_{th}\) is linear thermal expansion coefficient (Table 1). The pressure can also change the lattice parameter, according to the relationship given by Adachi (Degheidy and Elkenany 2015):

$$a\left( p \right) = a(p = 0)\left[ {1 + \left( {\frac{B'}{B}} \right)p} \right]^{{\frac{ - 1}{3B'}}}$$
(9)

where \(a(p)\) is the lattice parameter at different pressures, \(B\) is the bulk modulus and \(B\prime\) is the pressure derivative of the bulk modulus for the associated binary semiconductors (see Table 1).

Table 1 Varshni’s parameter and hydrostatic pressure of the binary semiconductors which constitute the \(InGaAsP\) quaternary (Degheidy and Elkenany 2015)

2.2 Energy gap of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\)

The Electroreflectance (ER) allows to determine the energy gap of the quaternary alloy \(InGaAsP\) adapted to \(InP\). The measurement of the photoluminescence spectrum also makes it possible to specify the energy gap by measuring the peak energy spectrum (Yamazoe et al. 1981). The difference between the two methods is that the (ER) spectrum is not sensitive to the effects of impurities, defects and errors in the alloy (Yamazoe et al. 1981).

There are models that have been developed for the prediction and computation of the band structures based on the pseudopotential method (Jones and Lettington 1969). The theoretical approaches that explain the data remain modest. Numerous articles and books have dealt with the methods of computation of the heterostructure and bulk band structures (Vurgaftman et al. 2001; Yu and Cardona 1996). The energy gap as a function of the compositions of \(x\) and \(y\) is not linear, the modeling in this case becomes complex. The simplification of this relationship has been achieved by geometric methods based on the use of the four ternary boundaries (Nahory et al. 1978). The energy gap at room temperature (300 K) is in the form:

$$E_{g} \left( {x,y} \right) = (1.35 + 0.668x - 1.17y + 0.758x^{2} + 0.18y^{2} - 0.069xy - 0.322x^{2} y + 0.03xy^{2} )\;{\text{eV}}$$
(10)

This relationship shows a strong inclination with respect to the experimental measurements (Nahory et al. 1978; Nakajima et al. 1978). The solution for finding a very good agreement is to use the empirical fit of the curve drawn by the experimental points. The equation of adjustment is given by the following relation:

$${\text{E}}_{\text{g}} \left( {\text{y}} \right) = (1.35 - 0.72{\text{y }} + 0.12{\text{y}}^{2} ){\text{eV}}\;{\text{at}}\;(T = 300\;{\text{K)}}$$
(11)

The model has been improved by adding coefficient represents the effective bowing parameter. The relation becomes (Yamazoe et al. 1980):

$${\text{E}}_{\text{g}} \left( {\text{y}} \right) = 1.35.(1 - {\text{y}}) + 0.75{\text{y}} - {\text{C}}_{\text{e}} \cdot {\text{y}} \cdot (1 - {\text{y}})$$
(12)

With: \(C_{e} = 0.138\). Operation of the material at a higher or lower temperature than ambient temperature affects its electrical properties and performance. As a result, Model 11 becomes far from accurate. The energy gap of the \(InGaAsP\) material at 0 K is given by the relation (Waqas et al. 2016):

$${\text{E}}_{\text{g}} \left( {\text{y}} \right) = (1.421 - 0.72{\text{y }} + 0.12{\text{y}}^{2} ){\text{eV}}\;{\text{at}}\;(T = 0\;{\text{K}})$$
(13)

Taking into account that the energy gap of the \(InP\) varies between \(1.35 \;{\text{eV}}\) and \(1.421 \;{\text{eV}}\) when the temperature varies from \(300\) to \(0\;{\text{K}}\). The difference between the energy gap Eqs. (11) and (13) is reflected in the first term which represents the energy gap of the \(InP\). The coefficients of composition remain the same. On the other hand, the semi-empirical law of Varshni presents the equation which predicts the forbidden energy as a function of temperature in the semiconductor (Varshni 1967):

$$E_{g} (T) = E_{g0} - \frac{{\alpha T^{2} }}{T + \beta }\;\left( {\text{Kelvin}} \right)$$
(14)

With: \(E_{g0}\) is the bandgap energy at temperature 0 K. \(\alpha\) and \(\beta\) are adjustment constants. The energy gap as a function of temperature and composition is deduced from relation (13) and (14):

$$E_{g} \left( {T,y} \right) = \left[ {\left( {E_{g0} - \frac{{\alpha T^{2} }}{T + \beta }} \right) - 0.72y + 0.12y^{2} } \right]\;\left( {\text{eV}} \right)$$
(15)

The good estimation of the energy gap also makes it possible to calculate the refractive index of the alloy \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\) at \(T = 300\;{\text{K}}\). According to the semi-empirical modified single oscillator model (Fiedler and Schlachetzki 1987), the relation can be written as:

$$n^{2} = 1 + \frac{{E_{d} }}{{E_{0} }} + \frac{{E_{d} }}{{E_{0}^{3} }}E^{2} + \frac{\eta }{\pi }E^{4} ln\left( {\frac{{2E_{0}^{2} - E^{2} - E_{g}^{2} }}{{E_{g}^{2} - E^{2} }}} \right)$$
(16)
$${\text{With:}}\quad \quad \eta = \frac{{\pi E_{d} }}{{2E_{0}^{3} (E_{0}^{2} - E_{g}^{2} )}}$$
(17)

\(E_{0}\) is the energy of the single-effect oscillator, \(E_{d}\) is the dispersion energy, and \(E = h\upsilon\) is the photon energy. The energies \(E_{0}\) and \(E_{d}\) are calculated with the interpolation method (Nahory et al. 1978). At ambient temperature, these parameters are in the form of approximations (Waqas et al. 2016):

$$E_{0} \left( y \right) = 3.391 - 1.652y + 0.863y^{2} - 0.123y^{3}$$
(18)
$$E_{d} \left( y \right) = 28.91 - 9.278y + 5.626y^{2}$$
(19)

The direct and indirect energy gap of the material subjected to the pressure \(E_{g}^{d,id} \left( {T,p} \right)\) at the points of high symmetry \(\varGamma\), \(X\) and \(L\) is obtained by using the following empirical relation (Degheidy and Elkenany 2015):

$$E_{g}^{d,id} \left( {T,p} \right) = E_{g} \left( {T,p = 0} \right)^{d,id} + bp + cp^{2}$$
(20)

where: \(b\) and \(c\) are the pressure coefficients, cited in Table 1 (Degheidy and Elkenany 2015).

2.3 Artificial neural network optimized by the Levenberg–Marquardt algorithm (ANN-LM)

The Artificial Neural Network ANN is a strong mathematical tool for modeling and prediction, it is a black box based on learning from examples. It is characterized by the generalization of the process in a flexible way and with a great speed. It is capable of generating a particular, nonlinear and complex function between inputs and outputs by adjusting the biases and weights that represent the network’s digital knowledge (Mekki et al. 2016). ANN modeling does not provide a tangible model that ensures the physical understanding of the underlying process in the material.

The Multilayer Perceptron MLP is a feed forward network that guarantees multi-layer transmission via neurons (Tarbi et al. 2017) as shown in Fig. 2. The processing of the information of each node has been represented in Fig. 2 and calculated by the mathematical Eq. (21). The output of this node serves as input of another node of the following layer (Kumari et al. 2016; Dkhichi et al. 2016):

Fig. 2
figure 2

Method of processing information by the node

$$y = f\left( {\mathop \sum \limits_{i = 1}^{n} (x_{ji} \times w_{ji} + b_{j} )} \right)$$
(21)

With: \(y\) is the output of the current node of the hidden layer \(j\). \(n\) is the number of the inputs of the current layer, i.e. the number of nodes in the previous hidden layer, \(x_{ji}\) an input of the current hidden layer \(j\) from the previous layer \(i\), \({\text{w}}_{\text{ji}}\) the weight that joins layer \(j\) and the layer \(i\), \(f\) is the activation function. The \(LogSig\), \(TanSig\) and \(Purelin\) activations functions are defined by:

$$LogSig(x) = \frac{1}{{1 + e^{ - x} }}$$
(22)
$$TanSig(x) = \frac{{e^{x} - e^{ - x} }}{{e^{x} + e^{ - x} }}$$
(23)
$$Purelin(x) = x$$
(24)

Experimental measurements are used for the formation of ANN-LM. The back-propagation (BP) algorithm used to minimize the error function is the Levenberg–Marquardt iterative learning algorithm, due to its precision and fast convergence (Dkhichi et al. 2014). The performance function such as the mean square error (MSE) stops the training process when it takes values smaller than the predefined threshold. The formation also stops when it has reached the desired number of epochs (Afram et al. 2017). The variance indicators used in this work for the evaluation of the quality of the prediction are the RMSE (Geyikçi et al. 2012), and the determination coefficient R-square (Chen et al. 2017) defines respectively:

$$RMSE = \left( {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} (y_{pred} - y_{exp} )^{2} } \right)^{1/2}$$
(25)
$$R^{2} = 1 - \left( {\frac{{\mathop \sum \nolimits_{i = 1}^{n} (y_{pred - } y_{exp} )^{2} }}{{\mathop \sum \nolimits_{i = 1}^{n} (y_{avg,exp - } y_{exp} )^{2} }}} \right)$$
(26)

where \(n\) is the number of targets or the number of experimental responses; \(y_{pred}\) and \(y_{exp}\) are the predicted and target data, respectively, and \(y_{avg, exp}\) is the average of the target values. In this work, the physical properties of the quaternary alloy \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\) where \(x\) and \(y\) denote the composition of Gallium and Arsenic respectively, are studied using the ANN-LM model.

3 Result and discussion

3.1 Estimation of the lattice parameter of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y} / InP\)

ANN-LM models were used to predict the lattice parameter ‘ a’. Some networks are better adjusted than others. The procedure for choosing the optimal neural network is to analyze their performance using RMSE, as presented in Tables 2 and 3. The optimal artificial neural network is shown in Fig. 3, its architecture contains four layers: The first is the ‘input layer’, the last is the ‘output layer’, and between these two layers, there are two ‘hidden layers’. In our case, two hidden layers give more accurate results than a single layer .

Table 2 Procedure for selecting the optimal transfer function for predicting the lattice parameter by the ANN method
Table 3 Procedure for selecting the optimal number of neurons with the best transfer function \({\text{TangSig}}/ {\text{Purlin}}\) for ANN training
Fig. 3
figure 3

Topology of the multilayer feedforward artificial neural network for the identification and the prediction of the lattice parameter \(' {\text{a'}}\). x and y are the chemical compositions of Gallium and Arsenic respectively

By default, it separates the experimental data into three categories: \(70\%\) of the data for the training, \(15\%\) were used for the test, and \(15\%\) for validation. The model is optimized by the Levenberg Maquardt algorithm ‘Trainlm’. Neural networks with several hidden layers (more than three layers) lead to the problem of ‘over fitting’, generally they do not give optimum performance of prediction (Aish et al. 2015; Gazzaz et al. 2012).

The execution of various iterations makes it possible to specify the optimal and desirable scenario, since there is no theoretical rule for determining the number of neurons and the hidden layers of the neural network. Table 2 shows the transfer function choice methodology: The number of neurons was temporarily fixed at \(10\), the transfer function was changed by calculating the RMSE each time. The optimal function found is \(TangSig/Purelin\), because it marks the smallest RMSE of among the \(9\) combinations. The second step is to choose the number of neurons optimized: We did the simulation using the optimal function \(TangSig / Purelin\), changing the number of neurons regularly. The smallest RMSE and the largest \(R^{2}\) correspond to \(4\) neurons (\(RMSE_{ANN} = 0.0014\) and \(R_{ANN}^{2} = 95.84\%\)).

We notice from the Fig. 4 that the values \((a_{ANN} )\) are very close to the values \(\left( {a_{exp} } \right)\). The difference between the values \((a_{exp} )\), and the values estimated by the Vegard’ law \((a_{vegard} )\) becomes acute when the composition of Arsenic is equal to \(0.531\). The RMSE and \(R^{2}\) reach by the ANN-LM \((RMSE_{(ANN)} = 0.0014,and R_{ANN}^{2} = 95.84\% )\) is much better compared to those completed by the Vegard’s law \((RMSE_{vegard} = 0.006, and R_{vegard}^{2} = 17.73\% )\). The horizontal dotted line corresponds to the lattice parameter of the \(InP\) substrate \(a(InP).\)

Fig. 4
figure 4

Comparison between the experimental measurements \(({\text{a}}_{ \exp } )\), the data calculated by the Vegard’s law \(({\text{a}}_{\text{vegard}} )\) described by Eq. (4), and data predicted by ANN-LM \(({\text{a}}_{\text{ANN}} )\)

when the values of the lattice parameters \(a(x, y)\) converge to the value \(a(InP)\) (i.e. the horizontal line), we can say that there is an agreement between the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\) and the \(InP\) substrate.

The figure shows the graphical representation of the correlation between the estimated data by the ANN-LM method and by the Vegard’s law (Fig. 5) and the measured data of the lattice parameter \(a\left( {In_{1 - x} Ga_{x} As_{y} P_{1 - y} } \right)\). It shows that in the case of the use of Vegard’s law, the correlation coefficient is very far from \(1\). Contrariwise, when estimating by ANN-LM model, the slope becomes close to \(1\). These results show that there is a perfect correlation between the values estimated by the ANN-LM model and the targets, and they confirm the robustness of the ANN-LM to predict values close to experimental values, and this, in the case of the absence or existence of the accordance between the quaternary and the substrate, that is to say, whatever the value of \(x\) and \(y\).

Fig. 5
figure 5

Correlation between the estimated data of the lattice parameter \(\text{a}(\text{In}_{1 - \text{x}} \text{Ga}_{\text{x}} \text{As}_{\text{y}} \text{P}_{1 - \text{y}})\) and the measured data by: The ANN-LM method (a), and the Vegard’s law (Eq.  4 ). The adjustment of the points for each case is represented by the function Y

On the other hand, the Fig. 6 shows the curve of the lattice parameter as a function of the composition \(y\). It makes it possible to compare the experimental values, the values estimated by the ANN-LM model, and the values calculated by the Eq. (7) deduced on the basis of the hypothesis of the total agreement between the quaternary and the substrate (i.e. \(y \approx 2.2x\)). The difference between the targets and the values calculated by Eq. (7) is given by an important RMSE \((RMSE = 0.0071)\) larger than the RMSE calculated from the Eq. (4).

Fig. 6
figure 6

Lattice parameter versus the composition y for the \({\text{InGaAsP}}\) alloy adapted to \({\text{InP}}\). The circles are the experimental measurements (Nahory et al. 1978), the stars in blue show the values calculated by empirical Eq. ( 7 ), and the red points show the values predicted by ANN-LM

Thus, we notice that the use of the assumption of the total agreement, (i.e. \(y \approx 2.2x\)) increases the error, because of the convergence of the values calculated by Eq. (7) to the horizontal dotted line as shown in Fig. 6.

3.2 Energy gap of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y} / InP\)

The optimal ANN-LM topology for estimating energy gap \(Eg(x,y)\) as a function of composition \(x\) and \(y\) comprises an input layer with two neurons \((x, y)\), an output layer that displays gap energy, and two hidden layers that quickly and intelligently calculate the complex interconnections of neurons.

This topology was selected from several recombinations formed, based on the maximization of \(R^{2}\) and the minimization of RMSE. Table 4 allows selecting the optimal transfer function, and Table 5 allows selecting the optimal number of neurons.

Table 4 Procedure for selecting the optimal transfer function for predicting the energy gap by the ANN-LM method
Table 5 Procedure for selecting the optimal number of neurons with the best transfer function \({\text{Purlin}}/ {\text{Purlin}}\) for ANN-LM training

On the other hand, the decrease in the energy gap by increasing the \(y\) composition, as clearly seen in Fig. 7, is caused by the progressive and regular replacement of Phosphorus atom sites \(As - Ga - P\), by Arsenic atom sites \(As - Ga - As\). The electronic configuration of the Arsenic is \(4s^{2} 4p^{3}\), and for the Phosphorus is \(3s^{2} 3p^{3}\), which implies an increase in the atomic radius \((R_{P} \left( {\AA} \right) < R_{As} ({\AA})),\) and the decrease of the ionization energy and the electronegativity \(\chi ({\text{P}}) = 2.19\) and \(\chi ({\text{As}}) = 2.18\) according to Pauling scale. These parameters are responsible for changing the charge transfer mechanism in the material.

Fig. 7
figure 7

The energy gap versus the composition for the \(\text{In}_{1 - \text{x}} \text{Ga}_{\text{x}} \text{As}_{\text{y}} \text{P}_{1 - \text{s}}\) alloy adapted to \({\text{InP}}\). a, b Show the measures predicted by ANN-LM and the experimental data (Nahory et al. 1978) above and below the three-dimensional representation of the band gap area \({\text{E}}({\text{x}},{\text{y}})\) of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y}\) plotted by Eq. ( 10 ). c Represents the energy gap \({\text{Eg}}({\text{y}})\) versus composition \({\text{y}}\). The circles are the experimental measurements, the stars in blue show the values calculated by Eq. ( 10 ), and the red points show the values predicted by ANN-LM

Figure 7a shows the incorporation of most experimental data with the data found by ANN-LM model. The energies gap with the two compositions \(Eg(x,y)\) calculated by the Eq. (10) are further away from the target data as shown in Fig. 7b and 7c. The values of the RMSE error indicator of the ANN-LM network are small compared to those generated from the data computed by Eq. (10) \((RMSE_{ANN} = 0.0112 and RMSE_{(EQ10)} = 0.0341).\) Likewise, the coefficient of determination was estimated at \(99.32\%\) for the ANN-LM model, it is higher than the coefficient of determination of the data calculated by Eq. (10), which is equal to \(93.74\%\), and which indicates an interesting deviation between the values calculated by the Eqs. (10) and the target data. This comparative study shows that ANN-LM is a powerful and preeminent tool for modeling nonlinear processes such as energy gap that vary with \(x\) and \(y\).

The simulation results (predicted by ANN-LM and targets) were described by the dispersion diagram in Fig. 8, the prediction is good because the correlation equation shows a mediocre difference (less than \(0.008\)) between the predicted data by ANN-LM and the experimental values, this confirms the results of the error indicators.

Fig. 8
figure 8

Correlation between the measured and estimated data by ANN-LM of the energy gap. The adjustment of the points is represented by the function Y

The empirical fit of the experimental curve gives the relation (11) which expresses the energy gap \(Eg(y)\) as a function of a single variable \(y\). The RMSE value in these conditions decreased strongly \(RMSE_{(EQ11)} = 0.0113\), and the coefficient of determination \(R_{{({\text{EQ}}11)}}^{2} = 99.31{\text{\% }}\) approaches \(1\), which means that the deviation is small between the data computed by Eq. (11) and the target data.

Despite this improvement which does not take the composition \(x\) into consideration, the precision of the ANN-LM model remains the strongest. In addition, it is robust thanks to its capacity to learn via the two experimental inputs (composition of \(x\) and \(y\)), without recourse to approximations and considerations that eliminates composition. It is a powerful tool that intelligently binds input parameters and output parameters (energy gap), with the existence or absence of mismatch, and it does not require a mathematical model or a gnarled calculation. Also, it is characterized by its capacity of generalization which increases with examples of enriched and numerous training which enclose various situations. The well-formed and validated ANN-LM model gives an overall response even if the impact or sensitivity of an entry is not physically significant.

3.3 Variation of the energy gap of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y} / InP\) with the temperature \(\varvec{T}\) and the composition \(\varvec{x}\) and \(\varvec{y}\)

The optimized ANN-LM structure is selected based on the R-square and RMSE error indicators, we found that the best model among several combinations possesses a very small RMSE \((RMSE_{(ANN)} = 7.4744\times10^{ - 4} ),\) and the \(R^{2}\) is very close to \(1\) (\(R_{{({\text{ANN}})}}^{2} \simeq\) 100%). The architecture of the ANN-LM model contains four layers, the first one is the input layer, it contains 3 variables: the composition of \(x\), \(y\) and the temperature \(T\). The last one is the output layer that configures the energy gap, and between these two layers, there are two hidden layers. The optimal number of neurons is \(5\) and the most suitable transfer function is \(LogSig/Purelin\)(Fig. 9).

Fig. 9
figure 9

Diagram of neural network for identification \({\text{Eg}}({\text{x}},{\text{y}},{\text{T}}).\)

Figure 10 shows the performance of the ANN-LM model, which details the correlation between the experimental training, test and validation measurements and the estimated values by ANN-LM (Fig. 10a). It should be noted that the test and the validation measures are not used in the training process. The more precise the estimated data, the closer the points are to the bisector \(y = x\), and the closer the indicator R approaches 1. \((R = 1)\) indicates that the training process is perfect. For the test, the correlation coefficient takes a maximum value \((R = 1),\) which shows a strong linear relationship between the targets and the estimated values. The mean squared error (MSE) is determined at epoch 1 by \(\left( {MSE = 8.4832 \times 10^{ - 7} } \right)\), it describes the best validation of the model ANN-LM and confirms that the proposed topology has a very good performance, it is also able to predict the responses of layer output with amazing precision (Fig. 10b).

Fig. 10
figure 10

Effectiveness of the neural network: (a) Correlation between target data \(({\text{Eg}})\) and the estimated output data by ANN-LM. (b) The procedure for training the ANN-LM through three stages: Testing, validation and train

In the case of the alloy \(In_{0.73} Ga_{0.27} As_{0.59} P_{0.41}\), the energy gap as a function of the temperature was calculated by the theoretical Eq. (14), with: \(E_{g0} = 1.041eV\), \(\alpha = 7.5 10^{ - 4} eV\), \(\beta = 611\) (Madelon and Dore 1981). The experimental values, estimated data by ANN-LM and calculated data by the theoretical Eq. (14) are shown in Fig. 11.

Fig. 11
figure 11

Comparison between the experimental values of \({\text{Eg }}({\text{T}})\) (Madelon and Dore 1981), the calculated data by the theoretical semi-empirical curve (Eq. 14) and the estimated data by ANN-LM of the quaternary \({\text{In}}_{0.73} {\text{Ga}}_{0.27} {\text{As}}_{0.59} {\text{P}}_{0.41}\)

The comparison of the error indicators calculated from the theoretical data \((RMSE_{(EQ14)} = 0.0022, R_{(EQ14)}^{2} = 99.08\% )\), and those calculated based on the data predicted by ANN-LM (\(RMSE_{(ANN)} = 7.47 10^{ - 4} , R_{(ANN)}^{2} \approx 99.9\% )\) shows the power of the ANN-LM tool.

In addition, using the theoretical equation requires a calculation to extract the values of \(\alpha\), \(\beta\) and \(E_{g0}\) at each variation of composition \(x\) or \(y\), which makes the physical process complicated, and the analytical model is not unique. This problem is solved by the model ANN-LM thanks to the weights which connects between the input parameters \((x, y, T)\) and the output parameter \(\left( {Eg} \right).\)

On the other hand, the estimate of the energy gap in the absence of the pressure \(E_{g} (T,p = 0)\) makes it possible to minimize the error of the calculation of the energy gap of the material subjected to the effect of the pressure \({\text{E}}_{\text{g}} \left( {{\text{T}},{\text{p}}} \right)\) using the relation the empirical relation (20).

3.4 Variation of the energy gap of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y} / InP\) with the temperature and the pressure

The architecture of the best structure ANN-LM found is represented in Fig. 2a, it contains four layers: an input layer with two variables (the temperature \(T\) and the pressure \(P\)). An output estimates the energy gap, and between these two layers, there are two hidden layers. The number of neurons is \(20\), the activation function used is \(TanSig / Purelin\) (Fig. 12b).

Fig. 12
figure 12

Diagram of neural network for identification of \({\text{Eg}}({\text{T}},{\text{P}})\): a describes the number of layers, and b describes the activation function, the number of neurons, the inputs and the output of the ANN-LM

We notice that the energy gap \(E_{g}^{\varGamma }\) of binary alloys \(GaAs\) at \(T = 300\;{\text{K}}\) increases with increasing pressure. this means that the energy band for Γ-point is moved upwards. On the other hand, as shows the Fig. 13, the experimental measurements, the data estimated by ANN-LM and the data calculated by the Eq. (20) in the interval \([0\text{-}90\;{\text{Kbar}}]\) are all in agreement. The \(R^{2}\) is almost equal to \(1\), and the error RMSE of the two estimates is very small \((RMSE_{{\left( {EQ20} \right)}} = 6.87 10^{ - 5} , RMSE_{{\left( {ANN} \right)}} = 9.46 10^{ - 4} )\). The ANN-LM method does not require a previous knowledge of the parameters that constitute the empirical equation, and which are the energy band gap at zero pressure \(E_{g} \left( {T,p = 0} \right)^{d,id}\), and the pressure coefficients \((b and c).\)

Fig. 13
figure 13

The energy band gap \({\text{E}}_{\text{g}}^{\varGamma }\) of binary alloys \({\text{GaAs }}\) versus pressure at room temperature: The points in pink are the values calculated by the Eq. ( 20 ), the stars in blue are the experimental data (Elabsy et al. 2010), and the marks in green are the values estimated by ANN-LM

4 Conclusion

In this work, we used the Artificial Neural Networks ANN, to study the physical properties of the quaternary \(In_{1 - x} Ga_{x} As_{y} P_{1 - y} /InP\) with \(0 \le {\text{x}} \le 0.47 \;{\text{and}}\; 0 \le {\text{y}} \le 1\). Learning was optimized by the Levenberg Maquardt backpropagation algorithm. The ANN-LM hybrid system was used to determine optimal combinations capable of predicting the lattice parameter \('a'\) and the band-gap energy at different \(x\), and \(y\), also the band-gap energy at different \(x\), \(y\), and temperature, and finally the band-gap energy at different temperatures and pressures. Performance indicators \((R^{2} )\) and RMSE have shown that the four ANN-LM networks proposed are reliable and robust. The coefficient of determination \((R^{2} )\) of the optimal recombinations exceeded \(95.84\% ,\) and the RMSE error was minimized, it does not exceed \(0.0014\). This shows that the experimental and the prediction data are in excellent agreement. The dispersion diagrams confirm the effectiveness of the models, their outputs show good compliance with experimental data. On the other hand, there is a considerable difference between experimental data of physical properties \(a(x,y), Eg (x,y,T)\) and the results of proposed empirical equations derived from interpolation. The high predictive capacities of the ANN-LM are obtained without any approximation to eliminate the variable \(\varvec{x}\) by considering the perfect match between the material and the substrate. On the other hand, the calculation of the refractive index of the alloy becomes easy to calculate using values estimated by ANN-LM. This method can be used to control and estimate the energy gap variation of the photovoltaic cell subjected to different temperatures.