1 Introduction

Mid infrared region (MIR) has recently attracted a great attention in many applications especially in the wavelength range from 2.0 to 20.0 µm (Schliesser et al. 2012) such as absorption spectroscopy, free space communication and thermal imaging (Lloyd 2013). The MIR range is called "molecular finger print" because it occurs in almost all chemical bonds with simple vibration signatures. The main challenge in the MIR region is the bulk size and high cost of the laser source. However, compact, affordable and tunable quantum cascade lasers have lately been commercially available. Silicon photonics platform in the MIR region has various advantages due to the silicon transparency for wavelength up to 8.0 μm. In addition, silicon photonics, in the MIR region, exhibits substantially superior nonlinear characteristics than in the near-infrared region (NIR). This is due to the two-photon absorption (TPA) that goes to zero beyond 2.0 μm, Therefore, in the MIR region, the TPA problem and the associated free carriers can be neglected (Swillam et al. 2018).

Silicon on insulator (SOI) (Abd-Elkader et al. 2019a; Kandeel et al. 2019) is the most popular silicon photonics platforms due to the high index contrast between the silicon and silicon dioxide. In this context, high field confinement can be achieved with compact device footprint. The silicon on sapphire (SOS) is an alternative to the conventional SOI platform where sapphire is used instead of buried silicon dioxide as an insulator layer. The sapphire has better transparency even for longer wavelengths than the conventional SOI. Thus, the SOS platform (Mueller and Robinson 1964) is introduced as a very interesting candidate for integrated silicon photonics (Soref and Larenzo 1986) in the MIR region. The SOS platform is commercially available and features a strong refractive index contrast (n = 1.7) between the core and sapphire cladding with a large transmission window of up to 5.5 μm. Furthermore, as compared to bulk silicon, the SOS platform provides low parasitic capacitance, low power consumption, good linearity, and greater isolation, and it does not require special fabrication procedures (Zou et al. 2018).

The high-index contrast between silicon core and sapphire, on the other hand, results in a considerable optical birefringence and highly polarization-dependent devices. To solve such obstacle, there is a need for polarization handling devices (Hameed and Obayya 2011, 2014) such as polarization rotators, polarization splitters and polarizers to get polarization transparent photonic integrated circuits (PICs). The use of polarization splitters and polarization rotators is highly complicated and maximizes the system size significantly. As a result, a polarizer is mandatory to stop a polarization state and allows the propagation of the other polarized wave with minimal losses. Such a simple solution can be integrated in polarization-based devices with half power loss. Consequently, wide variety of optical polarizers has been investigated. Regarding to that, optical polarizers based on all-dielectric structures, with high ERs and minimum ILs, have been reported (Wang and Ho 2010; Guan et al. 2014; Azzam et al. 2014; Azzam and Obayya 2015; El Shamy et al. 2017). However, such waveguides usually suffer from some critical drawbacks such as complex structures with long device length and narrow bandwidth with high sensitivity to fabrication errors. The mode cutoff concept has been used to design SOI TM-pass polarizer with tapered structure (Wang and Ho 2010) with a device length of 10 μm and an ER of 26 dB. Moreover, Guan et al. (2014) have fabricated SOI TM-pass polarizer with a subwavelength grating. The fabricated device has 27 dB ER, and 0.5 dB high losses at a long device length (LD) of 9.0 μm. In addition, a symmetrically etched silicon core was used to develop SOI TE/TM pass polarizers (Azzam et al. 2014) with low losses. At LD = 2.5 μm, the TM- and TE- pass polarizers have ERs of 30 dB and 26 dB, respectively, and 0.4 dB relatively high IL for the two polarizers (Azzam et al. 2014). Further, the resonance tunneling effect has been used to develop SOI TE/TM-pass polarizers (Azzam and Obayya 2015). The polarization filtering depends on the birefringence between the two polarized states of the conventional silicon wire. The TM-pass polarizer achieves 20 dB ER and 0.15 dB IL with a device length of 1.31 μm. However, the TE-pass polarizer has a device length of 1.35 μm, an IL of 0.004 dB and an ER of 18 dB. However, due to the unavoidable resonant character of such polarizers (Azzam and Obayya 2015), they can only work within a restricted bandwidth. Further, El Shamy et al., (2017) have presented wideband and easy to fabricate TE-pass and TM-pass polarizers using the SOS platform for the MIR applications. The TE-pass polarizer has an ER of 25.57 dB with high IL of 1.92 dB, while, the TM-pass polarizer has 69.77 dB ER with 1.21 dB IL. However, both structures have a long device length of 23 μm (El Shamy et al. 2017).

Recently, plasmonic materials have been used for designing highly efficient and compact polarizers. With the proper design of the plasmonic waveguide, one of the two fundamental guided modes in the silicon core can be coupled with the surface plasmon modes at the metal/dielectric interface. Therefore, only one polarized mode will propagate whereas the other polarized mode has high losses. In this respect, SOI TM-pass polarizer has been designed using plasmonic/dielectric waveguides with a device length of 10.0 μm (Alam et al. 2011). However, such hybrid design has high IL of 3.0 dB and not compatible with CMOS technology. Moreover, SOI TM-pass polarizer (Ng et al. 2012) has been introduced with a copper plasmonic material which is CMOS-compatible. Such a structure suffers from low ER of 13 dB and high IL of 1.6 dB. In addition, Bai et al. (2017) have reported a low loss and compact SOI TM-pass polarizer based on hybrid plasmonic grating. The complex polarizer (Bai et al. 2017) suffers from CMOS incompatibility with ER = 25 dB, IL = 0.088 and device length of 2.5 µm. Further, Saber et al. (2018) have used highly doped p-silicon to develop plasmonic TM-pass polarizer. This design is CMOS compatible with a long LD of 15 μm, ER = 30.11 dB and high IL = 3.08 dB at wavelength of 1.55 μm (Saber et al. 2018).

Most of the plasmonics-based polarizers have a compact device length. However, they have a drawback of high IL because of the significant Ohmic loss of the conventional plasmonic materials. Moreover, plasmonic polarizers are commonly based on gold and silver, which are neither CMOS-compatible nor chemically stable. Therefore, alternative plasmonic materials are widely used, recently, instead of the conventional plasmonic materials. Transparent conducting oxide (TCOs) and transitional metal nitrides drew the most attention to replace conventional plasmonic materials. The new plasmonic materials have great advantages over the conventional materials such as tunability, lower losses, high design flexibility, and high compatibility with CMOS fabrication technology (Naik et al. 2011). Accordingly, TE/TM pass polarizers were presented in (Azzam and Obayya 2016) using titanium nitride material. An ER = 20 dB and IL = 0.063 dB have been achieved at LD = 3.5 µm for the TE-pass polarizer (Azzam and Obayya 2016). On the other hand, the TM-pass polarizer realized ER = 20 dB and IL = 0.96 dB at LD = 2.84 µm. Additionally, ultra-compact SOI TE/TM-pass polarizers were proposed in (Abd-Elkader et al. 2019b) based on AZO. An ER of 20.6 dB has been achieved with low IL of 0.21 at a compact LD of 3.5 µm using the TE-pass polarizer. However, the TM-pass polarizer has ER = 22 dB and IL = 0.11 dB at LD = 1.0 µm (Abd-Elkader et al. 2019b). Further, CMOS compatible TE/TM-pass polarizers with bi-metallic configuration of indium tin oxide and zirconium nitride have been reported in (Kandeel et al. 2019). A large coupling occurs between one polarized core mode and the surface plasmon mode due to the bi-metallic configuration, while the other polarized mode can propagate with low losses. At λ = 1.55 μm, the TE-pass polarizer has ER = 32.7 dB with IL of 0.13 dB at a compact LD = 1.5 μm. In addition, the TM-pass polarizer, at LD = 2 μm, has an ER of 31.5 dB and IL of 0.17 dB (Kandeel et al. 2019).

In this paper, a TM-pass polarizer based on SOS platform is introduced and analyzed for MIR applications. The proposed polarizer has a bimetallic configuration of AZO/ZrN to highly suppress the TE mode in a compact length. The AZO is one of the most attractive TCO materials because it is reasonably inexpensive and abundant element with non-toxicity. The AZO has also advantages of optical tunability, low Ohmic loss, and high thermal stability (Lin et al. 2016; Naik et al. 2013). Further, the ZrN is one of metal nitrides with favorable features such as high melting temperature state, metallic properties, optical tunability and mechanical stability. Moreover, it is compatible with silicon CMOS fabrication technology. The ZrN deposition can be achieved by many techniques like chemical vapor deposition (CVD), physical vapor deposition (PVD) or atomic layer deposition (ALD) (Naik et al. 2013). The proposed TM pass polarizer achieves 20.3 dB of ER and 0.14 dB IL with a 3.0 µm device length at a wavelength of 2.0 µm. The suggested structure has advantages of simplicity, compact size and high ER with low IL. It is also easy for fabrication and compatible with the standard CMOS fabrication technology. Additionally, the suggested design is more practical compared to that reported in (El Shamy et al. 2017) with long LD of 23 µm and large ER based on SOS platform. It is worth noting also that the proposed TM-pass polarizer has an operating wavelength of 2 μm in the MIR region which is different than that reported in (Abd-Elkader et al. 2019b) which is working in the NIR region at wavelength of 1.55 μm.”

2 Design consideration

Figure 1 shows 3D and 2D schematic diagrams of the proposed TM-pass polarizer. The suggested structure is designed to support the propagation of TM mode only. A silicon core of width ws = 800 nm is used with refractive index of 3.48. The core is surrounded by two sapphire layers with identical width wd and refractive index of 1.7377. The two sapphire layers are coated by two symmetric AZO layers with width of wi and two layers of ZrN with width of wz. All layers have the same height hm of 1.0 μm. The two plasmonic materials are added in the x-direction where the surface plasmon mode is coupled with the main component of the TE mode (Ex or Hy). Initially, the sapphire layers have identical width wd of 80 nm while the widths of the AZO and ZrN layers are equal to wi = 70 nm and wz = 60 nm, respectively.

Fig. 1
figure 1

3D and 2D schematic diagrams of the proposed TM-pass polarizer

The relative permittivity of the AZO and ZrN at a wavelength of λ = 2 μm are taken as -3.8316 + 0.57253i and -92.295 + 82.32i, respectively. These values are obtained from the following Drude Lorentz model based on the spectroscopic ellipsometry measurements reported in (Naik et al. 2013) over the wavelength range 350–2000 nm without the need for electric biasing.

$$\varepsilon \left(\omega \right)= {\varepsilon }_{b}-\frac{{\omega }_{p}^{2}}{\omega (\omega +i{\gamma }_{p})}+\frac{{f}_{1}{\omega }^{2}}{{\omega }_{1}^{2}-{\omega }^{2}-i\omega {\gamma }_{1}}$$
(1)

where the AZO and ZrN parameters are given in Table 1 (Naik, et al. 2013). Figure 2 depicts the magnetic field intensities of the quasi-TM and quasi-TE modes' main components Hx and Hy, respectively. It can be observed that Hx of the quasi-TM mode has a good confinement through the silicon core. However, Hy of the quasi-TE mode has more leakage toward the plasmonic materials. Thus, high losses occur for the TE mode only.

Table 1 The Drude-Lorentz model parameters of AZO and ZrN (Naik et al. 2013)
Fig. 2
figure 2

Field plots of the a Hx and b Hy of the quasi-TM and quasi-TE modes, respectively

The TM-pass polarizer has high losses for the TE mode while the TM mode has low attenuation. As a result, our goal is to maximize the figure of merit (FOM) for the suggested polarizer, which is defined as follow (Alam et al. 2012; Kandeel et al. 2019).

$$FOM = \frac{{Propagation\;losses\;of\;TE\;mode\left( {\frac{{dB}}{{\mu m}}} \right)}}{{Propagation\;losses\;of\;TM\;mode\left( {\frac{{dB}}{{\mu m}}} \right)}}$$
(2)

3 Numerical results

In order to design highly efficient and compact TM-pass polarizer based on SOS platform, the geometrical parameters of the suggested design are studied to maximize the FOM with good polarization filtering characteristics. The modal characteristics of the proposed polarizer are studied using a full-vectorial finite element approach (Obayya et al. 2000, 2002; Rahman et al. 2008) based on COMSOL Multiphysics software package (https://www.comsol.com) with a minimum element size of 6 × 10–4 µm and a perfect matched layer boundary condition. The propagation characteristics of the proposed polarizer are performed by using 3D finite difference time domain (FDTD) simulations using Lumerical software package (https://www.lumerical.com). The FDTD simulations are employed to check the filtering characteristics and calculate the ER of the reported polarizer. The ER can be defined as (Abd-Elkader et al. 2019b).

$$ER\left( {dB} \right) = 10 \cdot \log \cdot \left( {\frac{{p_{out}^{p} }}{{p_{in}^{p} }} \cdot \frac{{p_{in}^{c} }}{{p_{out}^{c} }}} \right)$$
(3)

where (\({P}_{out}^{p}/{P}_{in}^{p}\)) is the ratio of the power at the output terminal to the power at the input of the passed mode. Also, (\({P}_{out}^{c}/{P}_{in}^{c}\)) is the ratio of the filtered mode's output power to its input power. Additionally, the insertion loss (IL) is considered as

$$IL\left( {dB} \right) = 10log_{{10}} \left( {\frac{{P_{{out}}^{p} }}{{P_{{in}}^{p} }}} \right)$$
(4)

where \({P}_{in}^{p}\) and \({P}_{out}^{p}\) refer to the power of the passed polarization state at the input and the output of the polarizer, respectively. The grid size used for the 3D FDTD is ∆x = 10 nm, ∆y = 10 nm, ∆z = 0.1 μm.

The effect of the sapphire layers width is first investigated. Figure 3 indicates the ER and FOM dependence on the width of sapphire layers (wd) at a device length of 3.0 µm and an operating wavelength of 2.0 µm. It can be noticed from Fig. 3a that the ER increases from 3.8 to 20.3 dB as the width of the sapphire layers wd increases from 60 to 100 nm. Then, the ER decreases to 3.6 dB by increasing wd to 140 nm. The behavior of the FOM is relatively matched with that of the ER as can be seen form Fig. 3b, where ER of 20.3 dB can be achieved at wd= 100 nm. Figure 4 illustrates the variation of the ER and FOM with the widths of the AZO (wi) and ZrN (wz) layers at 3.0 µm device length. The ER as a function of the AZO layers width (wi) at different widths of ZrN (wz) layers is presented in Fig. 4a, while, the variation of the FOM with the AZO layers width (wi) at different ZrN (wz) layers widths is plotted in Fig. 4b. It can be observed that the ER and FOM increase as the width of the AZO layers wiincrease from 30 to 70 nm. If wi is further increased to 110 nm the ER and FOM decrease for the different widths of ZrN (wz) layers. In can be also seen from Fig. 4a and b that wz has a slight effect on the ER and FOM. In order to investigate that effect, the dependence of field distribution │Ex│ of the filtered TE mode on the widths of ZrN (wz) layers is presented in Fig. 4c. It is evident from Fig. 4c that such small effect is due to the amount of the field that penetrants the ZrN layer is very small and is independent on the width of the ZrN layers. The ER and FOM have their maximum values of 20.3 dB and 202.8, respectively, at wi = 70 nm and wz = 60 nm.

Fig. 3
figure 3

Variations of a ER and, b FOM with the width of the sapphire layer (\({w}_{d})\) at LD = 3.0 µm and λ = 2 µm. The widths of the AZO and ZrN layers are set as wi = 70 nm and wz = 60 nm, respectively

Fig. 4
figure 4

Dependence of the a ER, and b FOM on the widths of the AZO (wi) and the ZrN (wz) layers at LD = 3.0 µm and λ = 2 µm with the width of the sapphire layers wd = 100 nm. c Dependence of the field distribution │Ex│ of the filtered TE mode on the widths of the ZrN (wz) layers at wi = 70 nm and wd = 100 nm

Figure 5 shows the ER and IL variations with the device length without and with input (i/p) and output (o/p) waveguides. It can be observed from Fig. 5 that the ER and IL rise as the device length of the polarizer increases. Figure 5a shows the effect of the device length on the ER and IL without the existence of input and output waveguides, where, at a compact LD of 3.0 µm and λ = 2.0 μm, the proposed TM-pass polarizer has ER = 20.3 dB and IL = 0.14 dB. In order to mimic the reality, where the input power is launched by an input laser source to excite the fundamental modes to propagate through the proposed TM-pass polarizer, a study is performed to investigate the effect of the device length on the ER and IL with the presence of i/p and o/p waveguides. The input and output waveguides are SOS waveguides without AZO and ZrN layers as shown in Fig. 1. In this case, the ER and IL are equal to 12.6 dB and 0.15 dB, respectively, at a polarizer length of 3.0 µm and 0.2 µm input and output waveguide lengths. The resultant decrease in the ER and the rise in the IL are owing to the coupling loss at the interfaces between the i/p and o/p waveguides and the polarizer. The ripples shown in Fig. 5 are due to the Fabry–Perot cavity effect between the two end faces of the polarizer waveguide (Abd-Elkader et al. 2019b). From the numerical analysis, the optimum dimensions of the suggested TM-pass polarizer are summarized in Table 2.

Fig. 5
figure 5

The dependence of the ER and IL on the device length at λ = 2 µm a without input and output waveguides and b with input and output waveguides. The widths of the sapphire, AZO and ZrN layers are set as wd = 100 nm, wi = 70 nm and wz = 60 nm, respectively

Table 2 Optimum geometry of the TM-pass polarizer

The light propagation of the two polarized modes through the whole structure (the polarizer with both input and output waveguides is shown in Fig. 6. It may be seen from Fig. 6a that the TM mode propagates with low losses through the reported design. However, the TE mode does not propagate and suffers from high attenuation as shown in Fig. 6b.

Fig. 6
figure 6

Field evolution through the suggested TM-pass polarizer with input and output waveguides excited by the quasi a TM and, b TE input modes. The widths of the sapphire, AZO and ZrN layers are set as wd = 100 nm, wi = 70 nm and wz = 60 nm, respectively

In order to clarify the importance of using bimetallic configuration in the suggested design, the following comparison has been made. In this study, the effect of using AZO or ZrN alone on the extinction ratio (ER) and insertion loss (IL) are investigated and compared to that obtained using the bimetallic configuration. Figure 7a shows the field distribution │Ex│ of the filtered TE mode due to the bimetallic design with width of 130 nm and compared to that obtained using AZO only with different layer widths. It can be seen from the figure that the bimetallic case (wi (AZO) = 70 nm, wz (ZrN) = 60 nm) has larger amount of field penetration to the AZO/ZrN through the Si core and the Sapphire material. Therefore, the field is highly attenuated by the plasmonic materials which is highly needed for filtering process. Whereas in the case of using AZO only, most of the field is confined in the Si core and only a small amount is passed to the plasmonic materials. therefore the filtration process is very weak by using AZO only. Further, Fig. 7b and c show that low ER is achieved using device length range from 1 to 6 μm based on AZO only with width range from 70 to 170 nm. At a device length of 3 μm, the ER is always less than 4 dB while an ER = 20.3 dB is obtained when using AZO and ZrN materials. It is also observed that, at device length of 3 μm, the IL = 0.14 dB for the bimetallic configuration while the use of AZO alone produces minimum IL of 0.07 dB.

Fig. 7
figure 7

The impact of the plasmonic layers, for (AZO + ZrN) Vs. (AZO only) on a the field distribution │Ex│ of the filtered TE mode, b ER, and c IL at different device lengths at an operating wavelength of 2 µm with the width of the sapphire layers wd = 100 nm

Figure 8 shows a comparison between the performances of the suggested design with bimetallic configuration with that obtained using ZrN only. Figure 8a explores the field distribution │Ex│ of the filtered TE mode due to the AZO/ ZrN with total thickness of 130 nm and that obtained using ZrN only with different layer widths. It can be noticed from the figure that the use of ZrN only results in very small amount of field penetration through the plasmonic material with a good confinement in the Si/sapphire layers. Therefore, the TE filter performance is weak. However, the bimetallic configuration has a good field penetration to the plasmonic materials which attenuates well the TE mode. This is evident in Fig. 8b and c where the use of ZrN only has ER < 0.1 dB with minimum IL of 0.34 dB at a device length of 3 μm.

Fig. 8
figure 8

The impact of the plasmonic layers, for (AZO + ZrN) Vs. (ZrN only) on a the field distribution │Ex│ of the filtered TE mode, b ER, and c IL at different device lengths at an operating wavelength of 2 µm with the width of the sapphire layers wd = 100 nm

The fabrication tolerance of the different geometrical parameters is then investigated. The thickness of the sapphire, AZO and ZrN layers are studied with a tolerance of ± 10 nm around the optimum values. Figure 9 illustrations the dependence of the ER and IL on the widths (wi) of AZO layer and wz of ZrN layer. It can be observed from Fig. 9a that the polarizer has a tolerable performance over the entire range of wi variations at wd = 100 nm and wz = 60 nm, where the ER is always greater than 10 dB for the wi range from 61 to 79 nm with IL < 0.15 dB. On the other hand, Fig. 9b demonstrates how the tolerance of the ZrN layer width wz while wd and wi remain constant at 100 nm and 70 nm, respectively. It can be seen that the ER is always greater than 20 dB while the IL is less than 0.142 dB.

Fig. 9
figure 9

Effect of widths variations of the a AZO and b ZrN layers on the extinction ratio and insertion loss for the TM-pass polarizer at LD = 3.0 µm and λ = 2 µm with the width of the sapphire layers wd = 100 nm

The fabrication tolerance of the sapphire layer width wd is then investigated. Figure 10 illustrates the impact of the width wd of the sapphire layer on the ER and IL. However, the other geometries are constant at wi = 70 nm, and wz = 60 nm. It can be seen that the IL is always less than 0.16 dB with ER better than 12 dB over the studied range. Therefore, the suggested TM-pass polarizer has a good tolerance to fabrication imperfections.

Fig. 10
figure 10

Effect of the width wd of the sapphire layer on the extinction ratio and insertion loss at LD = 3.0 µm and λ = 2 µm with the widths of the AZO and ZrN layers are set as wi = 70 nm and wz = 60 nm, respectively

The proposed polarizer’s performance with the operating wavelength variation is also investigated as shown in Fig. 11. It is revealed from this figure that the presented TM-pass polarizer achieves ER higher than 20.3 dB over a wavelength range of 1.94:2.06 µm with IL less than 0.16 dB.

Fig. 11
figure 11

The wavelength dependent extinction ratio and insertion loss at LD = 3.0 µm. The widths of the sapphire, AZO and ZrN layers are taken as wd = 100 nm, wi = 70 nm and wz = 60 nm, respectively

The suggested design can be fabricated using the current technology. The silicon core can be implemented using electron beam lithography. However, thermal oxidation (Gusev et al. 1995) can be used to define the sapphire layer. Further, the CVD or sputtering can be utilized to deposit the plasmonic layers (Hess and Graves 1993).

Table 3 illustrates a comparison between the proposed structure and those reported in the literature in terms of the device length, ER, IL, CMOS compatibility, used platform, wavelength range, and type of work (theoretical or experimental). The MIR operation is the most important feature of the proposed TM-pass polarizer based on the SOS platform. It may be seen that most of the polarizers in the literature are dependent on the SOI platform. Additionally, the suggested design has a shorter device length than that reported in Wang and Ho (2010), Guan et al. (2014), El Shamy et al. (2017), Alam et al. (2011), and Ng et al., (2012). It has also a higher ER than that reported in Azzam and Obayya (2015), and Azzam and Obayya, (2016). Moreover, it achieves a lower IL than that proposed in Guan et al., (2014), (Kandeel et al. 2019), Azzam et al., (2014), El Shamy et al. (2017), Alam et al. (2011), Bai et al., (2017), and Azzam and Obayya (2016).

Table 3 Comparison of the suggested design to those reported in the literature

4 Conclusion

TM-pass polarizer based on SOS platform is introduced and analyzed for MIR application. The suggested polarizer has bi-metallic configuration of AZO and ZrN materials at a wavelength of 2.0 µm. The reported design achieves high ER of 20.3 dB with low IL of 0.14 dB at a device length of 3.0 µm. The proposed polarizer is compatible with current CMOS fabrication procedures and provides acceptable performance over a wide range of bandwidths with high resistant to fabrication errors.