I. ISSUES WITH Si-MOSFET AND THE ADVANTAGE OF 2D CHANNELS

The issues with the miniaturization of Si-MOSFETs are generically called short channel effects.1 When the source and drain depletion regions become comparable in length with the channel length, as shown in Fig. 1(a), the drain bias weakens the gate bias, which leads to a drastic increase in the off-current. Based on an analysis of the distribution of the electrical potential in the channel region, it is widely known that the short channel effect can be neglected when the channel length is ∼6 times longer than the scaling length, \({{\lambda}} = \sqrt {\left( {{{{\varepsilon}}_{{\rm{ch}}}}{t_{{\rm{ch}}}}{t_{{\rm{ox}}}}} \right)/\left( {N{{{\varepsilon}}_{{\rm{ox}}}}} \right)}\),2,3 where εch, εox, tch, and tox are the dielectric constants for the channel, the gate insulator, the thickness of the channel, and the gate oxide, respectively. Figure 1(b) shows the 6λ values calculated for Si, carbon nanotubes (CNT), bilayer graphene, and MoS2, where the contribution of the tunneling effect is neglected. N is defined as the effective gate number: N = 1 for planar, N = 2 for dual gate, N = 3 for FIN-FET, and N = 4 for gate-all-around. Although the FIN structure has already been adopted for Si to reduce the short channel effects,4 it is difficult to avoid the short channel effects for channel lengths shorter than 10 nm. 2D layered channels in FET applications are attractive because of their rigidly controllable atomic thickness (tch < 1 nm) and their low dielectric constants where εch = ∼4 for a typical 2D layered channel.3,4 This results in a 6λ smaller than that of Si. Of course, Si channels of a few nanometers thick have already been constructed using the microfabrication process. However, the operation of Si-MOSFET with an atomic scale thickness is not realistic because the mobility is drastically reduced because of fabrication damage.5,6 The advantage of 2D materials is their intrinsic atomic thickness,7,8 which allows both the reduction of the short channel effect and the possible retention of high mobility. For the short channel devices in which quasiballistic transport is assumed, the on-current can be determined not by the mobility but by the effective mass. Figure 1(c) shows effective masses (m*) for 2D materials.9 Although there are many 2D channels, bilayer graphene has the lowest effective mass. Therefore, bilayer graphene with an electrostatically tunable band gap still has an advantage over the high performance device.

FIG. 1
figure 1

(a) The charge conservation model for Si-MOSFETs.1 (b) 6λ for Si, CNT, bilayer graphene and MoS2. tch = 5 nm for Si, and tch = 1 nm for all other channel materials. Although Poisson’s equation in cylindrical co-ordinates should be solved for CNT, the present expression (N = 4) was used for simplicity. (c) Electron effective mass for 2D materials.9

II. GATE STACK FORMATION FOR 2D CHANNELS

One of the most important building blocks for an FET is the gate stack formation because the device performance is mainly controlled by the carriers that flow near the channel/insulator interface. This is especially true of graphene, which has an atomic monolayer composed of a strong sp2 hybrid orbital. The defects introduced in graphene during the top gate insulator formation results in the severe degradation of the electron transport properties. Many deposition methods were tested during the initial stages of graphene gate stack research.10 The obstacles specific to each deposition method have been elucidated. Physical vapor deposition (PVD) methods using high particle energy, such as radio frequency sputtering10,11 and pulsed laser deposition,10 introduce a significant amount of defects in graphene. Most of the deposition techniques applicable to the conventional Si process are not applicable to graphene. On the other hand, atomic layer deposition (ALD)12 has difficulties with nucleation because the graphene surface is chemically inert. Therefore, as shown in Fig. 2(a), Y2O3 is deposited only on defects and grain boundaries in highly oriented pyrolytic graphite (HOPG) by ALD. These issues have been overcome by the utilization of PVD1316 with particle energies lower than that required to displace one C atom out of graphene [∼7.5 eV (Refs. 17 and 18)] and by the utilization of buffer layers such as thin oxidized metal layers,19 polymer coating,20 or other types of materials12,2123 for ALD. In this case, no defect is introduced. Interestingly, as shown in Fig. 2(b), Y2O3 was deposited directly on approximately 70% of the h-BN surface without any preferences for the grain boundaries or defects due to the physical adsorption of the Y precursor from the polarization in h-BN. This is the big difference between graphene and other compound type 2D materials. Further utilization of a buffer layer results in the full surface coverage of ALD-Y2O3 with a small surface roughness of approximately 0.2 nm, as shown in Fig. 2(c).24

FIG. 2
figure 2

AFM images for (a) Y2O3 on HOPG, (b) Y2O3 on h-BN, and (c) Y2O3 on h-BN with an oxidized Y metal buffer layer of 1.5 nm. The bottom figures show the relative height profiles along the dotted lines in the AFM images.

Using an alternative technique to ALD, we have succeeded the high insulating properties of high-k Y2O3 top-gate in graphene FETs by depositing Y metal in an O2 atmosphere and subsequently applying high-pressure O2 annealing (100 atm).25 Y metal was selected because it has the highest susceptibility to oxidation given the thermodynamic considerations. Figure 3 shows the sheet resistivity of monolayer and bilayer graphene as a function of top gate voltage (VTG) for different back gate voltages (VBG), respectively.25,26 These devices were fabricated on SiO2 (∼90 nm)/n+-Si substrates by the mechanical exfoliation of Kish graphite. For monolayer graphene, as expected from the linear dispersion without the band gap (EG), ambipolar characteristics are clearly observed. Moreover, because the electric structure of monolayer graphene is not affected by the external electrical field, the Dirac point is shifted in parallel. This is dependent on the position of the Fermi energy (EF) controlled by the capacitive coupling between the top gate and back gate insulators. On the other hand, the electric structure of bilayer graphene can be tuned and EG is opened up to ∼0.3 eV by increasing the external electrical field. In the tight binding model for bilayer graphene, the origin for the gap opening can be explained by the breaking of inversion symmetry of two layers,27,28 that is, the introduction of the potential difference between two layers. In other words, under the external perpendicular electrical field, bilayer graphene can be regarded as parallel plate capacitor with the vacuum as a dielectric. The displacement field \((\bar D)\) 29 is defined as \(\bar D = 1/2\left[ {{{{\varepsilon}}_{{\rm{BG}}}}/{d_{{\rm{BG}}}}\left( {{V_{{\rm{BG}}}} - V_{{\rm{BG}}}^0} \right) - {{{\varepsilon}}_{{\rm{TG}}}}/{d_{{\rm{TG}}}}\left( {{V_{{\rm{TG}}}} - V_{{\rm{TG}}}^0} \right)} \right]\) in this study, where εBG, εTG, dBG, and dTG are the dielectric constants, the insulator thickness for back- and top-gate insulators, respectively. \((V_{{\rm{TG}}}^0,\>V_{{\rm{BG}}}^0)\) is the charge neutrality point to give the minimum resistance in the top-gated region. Indeed, the rapid increase in the sheet resistivity at the Dirac point is clearly observed, which indicates the band gap opening. Comparing the resistivity modulation of monolayer to bilayer graphene is possible because the channel shape factor is removed in the sheet resistivity. The drastic increase in the sheet resistivity for bilayer graphene is quite evident. The maximum current on/off ratio achieved is 5.5 × 103 at 20 K, which is the best result at 20 K. The remaining task is to improve the current on/off ratio at room temperature, because it is still ∼100 at present.3032 So far, EG has been estimated from the temperature dependence of the resistivity at the Dirac point. The detailed analysis on the temperature dependence of the resistivity can be found in the Ref. 26, while in the present paper EG will be estimated from the CQ measurement in the next section.

FIG. 3
figure 3

Sheet resistivity as a function of VTG at different VBG for monolayer and bilayer graphene. An SEM image for typical Y2O3 top gate graphene FET is also shown.

From the viewpoint of miniaturization in device applications, improving the capacitance of the top gate insulator (CTG) is critical. Figure 4 shows the CTG reported so far for graphene FETs in the literature. It should be noted that these data were obtained only from monolayer, bilayer, and trilayer graphene channels except other 2D materials because the electrical quality of the gate insulator largely depends on the channel materials. The large capacitance is achieved by the direct deposition of a high-k insulator on the graphene, and the capacitance data for h-BN top gate insulators is generally low because of the small dielectric constant of ∼3. At present, all the reported data do not reach the effective oxide thickness (EOT) of 1 nm, which is a standard value for Si FETs. Although high temperature annealing (500–600 °C) is required to improve the electrical quality of high-k insulators, such a high temperature annealing introduces defects in graphene. To overcome the oxidation issue for high-k insulators, the combination of h-BN and high-k oxide is key because the oxidation barrier of h-BN is quite high (>800 °C).33,34 As shown in Fig. 2(c), high-k oxide deposition on h-BN using ALD has already been achieved.24,35 However, CTG for high-k on h-BN is still low due to the thickness of h-BN,36,37 as shown in Fig. 4. Monolayer h-BN should be used to increase the total capacitance.24

FIG. 4
figure 4

Comparison of CTG with the previously reported values for monolayer, bilayer and trilayer graphene. Closed and open circles indicate that the CTG was obtained for oxide insulators deposited via ALD and for insulators prepared using another technique, respectively. Closed and open boxes indicate the CTG obtained for h-BN and for the combination of h-BN and high-k oxide. “EOT = 1 nm” indicates CTG obtained for SiO2 with the thickness of 1 nm “EDLC” means an electric double-layer capacitor, whose capacitance value is typically ∼20 µF/cm2.

III. DOS DETERMINATION BY QUANTUM CAPACITANCE MEASUREMENT

In Sec. II, we have observed the carrier modulation in graphene using the electric field effect, although monolayer and bilayer graphene are categorized as metals from the viewpoint of the band structure. This is because the carrier density induced by the back gate (n = 1/e CBGVBG) becomes larger than that of graphene due to the small DOS near EF. The DOS—energy relation is quite useful because it can answer the following two questions: Is the linear dispersion really preserved when graphene is sandwiched by the SiO2/Si substrate and high-k oxide insulator or is in contact with the metal electrode? Is the EG formation in bilayer graphene revealed by DOS? However, it is generally difficult to extract the DOS from the IV characteristics because the contribution of the scattering factor in the Boltzmann transport equation is often unknown. On the other hand, it is possible to extract the DOS by analyzing the CQ obtained in the CV measurement.3840 Next we discuss the electric band structure of graphene embedded in the FET structure.

Figure 5(a) gives a schematic drawing to help explain the contribution of CQ to the total capacitance (CTotal). Graphene, the top gate electrode, and the top gate insulator work as a parallel plate capacitor when voltage is applied between the source and the top gate. A certain density of positive carriers are induced in the top gate electrode, and the equivalent density of negative carriers are induced in the graphene. Here, from the energy viewpoint, DOS at EF is large for the metal, so there is almost no change in EF. On the other hand, EF should be shifted upward to induce carriers in graphene because of the small DOS at EF. The energy required to induce carriers can be modeled using the additional voltage drop (Vch) in the equivalent circuit model, as shown in Fig. 5(b). Because the carriers are “accumulated” in graphene, the circuit element for this voltage drop is a capacitor, not a resistor, or inductor. This is known as the quantum capacitance.38 In this simple circuit, CTotal can be described by equation 1/CTotal = 1/Cox + 1/CQ, where Cox is the geometric capacitance and CQ = e2 DOS.38 Here, the DOS for monolayer and bilayer graphene are 2EF/π(vFħ)41 and m*/2πħ,28,41 respectively. vF is the Fermi velocity (1 × 108 cm/s) and ħ is the Planck’s constant. Here, let’s estimate CQ and CTotal for monolayer graphene when Cox = 1 µF/cm2 for the SiO2 thickness of 4.5 nm. The key is the calculation of Vch in Fig. 5(b), because CQ is a function of VTG. Vch can be expressed as Vch = \({V_{{\rm{ch}}}} = \int_0^{{V_{{\rm{TG}}}}} {{C_{{\rm{Total}}}}/{C_{{\rm{ox}}}}d{V_{{\rm{TG}}}}}\) from the equivalent circuit. EF is indeed the charging energy and is expressed as EF= eVch. As a result, CQ can be estimated since CQ is a function of EF. Figure 5(c) shows Cox, CQ, and CTotal as a function of top gate voltage. CQ increases as EF increases for monolayer graphene, while \({C_{{\rm{Si}}{{\rm{O}}_{\rm{2}}}}}\) is constant. As a result, CTotal depends on VTG. For comparison, \({C_{{\rm{Si}}{{\rm{O}}_{\rm{2}}}}}\) (∼0.0383 µF/cm2) for the SiO2 thickness of 90 nm (this is a typical thickness for back gate SiO2) is also plotted, suggesting that the contribution of CQ in CTotal can be neglected because of the small \({C_{{\rm{Si}}{{\rm{O}}_{\rm{2}}}}}\) value. Therefore, we do not generally discuss CQ for back gate graphene devices. In other words, to extract CQ, a top gate graphene device with a CTG value comparable to the CQ value is critical, which indicates the importance of the gate stack formation in Sec. II. On the other hand, from the device operation viewpoint, a large contribution of CQ in CTotal means that inducing carriers in graphene is difficult. The inset in Fig. 5(c) shows the channel voltage (Vch) calculated as a function of VTG. Vch is the voltage drop equivalent to the energy required to induce carriers in graphene. Thus, when VTG = 1 V is applied, ∼30% of VTG is spent in graphene itself and the actual voltage applied to the top gate insulator is reduced to be V = VTGVch. This is a common problem for high-mobility channels. However, consider CQ positively as it can be the analytical tool used to extract DOS by the C−V measurement.

FIG. 5
figure 5

(a) Schematic drawing of the graphene device and DOSs for metal and monolayer graphene. (b) Equivalent circuits with and without CQ. (c) Capacitance as a function of VTG. \({C_{{\rm{Si}}{{\rm{O}}_{\rm{2}}}}}\) for the SiO2 thickness of 4.5 nm, CQ for monolayer graphene and their total capacitance (CTotal) are shown. For comparison, \({C_{{\rm{Si}}{{\rm{O}}_{\rm{2}}}}}\) for the SiO2 thickness of 90 nm is also indicated. Inset: Channel voltage [Vch in (b)] as a function of VTG.

Figure 6 shows the CQ extracted from the C−V measurements for (a) monolayer graphene and (b) bilayer graphene, respectively.25,26 It should be noted that the vertical axis on the right side is converted from CQ to DOS using the relation of CQ = e2 DOS. The devices used in this analysis are the same as those in Fig. 3. For monolayer graphene, the estimated CQ value is consistent with the theoretical dotted line values for EF > ∼0.15 eV. The deviation from the theoretical value near the Dirac point is due to the residual carriers that are externally induced by the charged impurities.14,40,42 The residual carrier density (n*), as shown by the arrow in Fig. 6(a) is calculated as 3.6 × 1011 cm−2 using the relation of \({E_{\rm{F}}} = \hbar {v_{\rm{F}}}\sqrt {{{\pi}}{n^*}}\), where ħ is the reduced Planck constant and vF is the Fermi velocity. The n* values obtained from the CV and I−V measurements are almost identical. Based on these results, it has been shown that graphene sandwiched between Y2O3 and SiO2 generally preserves the linear band structure with the exception being near the Dirac point. Although the electric band structure is generally determined experimentally by an angle-resolved photoemission spectroscopy (ARPES),43 the energy resolution in DOS obtained using the CQ measurement is much higher than that by ARPES for the energy range of actual device operation.

FIG. 6
figure 6

(a) CQ as a function of EF for monolayer graphene. EF is evaluated as EF = eVch. n* is the residual carrier density. Inset: The schematic drawing shows that the spatial distribution of charged impurities results in the variation of the Dirac point. (b) CQ as a function of EF for bilayer graphene. The EF formation is clearly observed as the external electrical field is increased.

On the other hand, the EG formation (for bilayer graphene), where DOS becomes nearly zero, is clearly observed by increasing the displacement field \((\bar D)\). The equation for \(\bar D\) is provided in the previous section. EG is defined as the energy between inflection points for the conduction and valence sides in Fig. 6(b). EG is roughly ∼0.3 V at \(\bar D\) = 2.5 V/nm,26 which corresponds to the maximum value expected from the theoretical calculation.27 It should be emphasized that the DOS within the gap region almost reaches zero, which is not observed in the previous report for bilayer graphene with the h-BN top gate.44 Moreover, the van Hove Singularity is also observed near the valence band edge, as shown by arrows. In the case of the IV measurement, the temperature dependence of the resistivity at the Dirac point must be measured to determine EG. However, EG can be determined directly from the CQ measurement because the scattering factor is not included in the CV measurement.

Finally, the application of the CQ measurement to the metal/graphene contact in the graphene FET is described.45 In terms of electron device miniaturization, the electrical contacts are critically important to reduce the total resistance.46 This is more significant in the higher carrier mobility channel. Although no Schottky barrier exists at the graphene/metal interface due to the lack of a band gap, the contact resistivity is intrinsically high due to the small DOS in graphene.46 It is known that the DOS of monolayer graphene increases with contact to the metal due to the orbital hybridization, that is, π − d coupling.47 The typical chemisorption group is Ni, while the typical physisorption group is Au.48 Here, our strategy is the reduction of the contact resistivity by the π − d coupling with Ni. For this purpose, the DOS of graphene in contact with the metals should be estimated through the CQ measurement. Figure 7(a) shows the schematic drawing of the experimental setup. The thickness of the back gate SiO2 in the metal/graphene/SiO2/n+-Si contact structure is reduced to ∼3 nm to extract the CQ. Even for ∼3 nm thickness of SiO2, monolayer graphene can be identified by the optical contrast under the optical microscope, which is supported by our calculation on the visibility of graphene on SiO2. The key technique used here is the resist-free metal deposition process using the finely patterned PMMA shadow masks, which enables us to extract the “intrinsic” metal/graphene interaction. Many researchers have reported that the resist residue remains on graphene.4951 The resist residue is a serious concern in light of the fact that activated carbon, whose hydrophobic surface attracts organic materials, is composed of graphene.52 Fig. 7(b) shows the DOS extracted from the CQ measurements. For the resist-free metal/graphene contacts, graphene underneath the Au electrode maintains the linear DOS—energy relation except near the Dirac point, while the DOS of graphene underneath the Ni electrode is broken and largely enhanced around the Dirac point, resulting in only a slight modulation of the Fermi energy in the graphene. On the other hand, when Ni is deposited using the polymer resist, the traces of linear dispersion of the graphene can be observed, which is also seen in the IV curve.53 Fig. 7(c) summaries the DOS—energy relation for monolayer graphene modified with the metal contact. The contact resistivity measurements using the resist-free metal deposition technique show that the contact resistivity obtained for the Ni electrode has been reduced to ∼1500 Ω µm because of the increase in the DOS by the π − d coupling.45 Although this contact resistivity value is still high for the requirement, the present experiment clearly suggest that one of effective guidelines to reduce the contact resistivity is the increase in the DOS of graphene underneath the metal electrode.

FIG. 7
figure 7

(a) Schematic of the metal/graphene/SiO2/n+-Si device to extract CQ for metal/graphene structure. The thickness of SiO2 is reduced to 3 nm, which results in a \({C_{{\rm{Si}}{{\rm{O}}_{\rm{2}}}}}\) comparable to CQ. (b) The experimentally extracted CQ of graphene in contact with metals. Solid red circle, open red rectangular, and solid blue circle represent the resist-processed Ni, resist-free Ni, and resist-free Au devices, respectively. (c) Summary of DOS—energy relation after the metal/graphene interaction suggested from the CQ measurements.

IV. CONCLUSIONS

In this paper, we have discussed the gate stack formation in graphene FETs and the extraction of DOS through CQ measurements. The top gate formation, which appears at first glance to be quite easy, is actually very difficult because defects are easily introduced in graphene by the conventional deposition techniques for high-k oxides. The improvement of the gate stack process in graphene research enables the extraction of DOS through the CQ measurement. It is shown that the electric band structure of graphene in contact with gate insulators or metal electrode deviates from the intrinsic band gap structure. Through the extension of this technique, a further understanding of the graphene/insulator and the graphene/metal interfaces is needed to fabricate graphene FETs.