1 Introduction

Two neutral, elastically stiff and dry solid objects usually do not adhere to each other. This is a result of the surface roughness which reduces the area of real contact, where solid-solid adhesion can occur, to a very small fraction of the apparent area of contact. If a loading pressure is applied to increase this real contact area, the surface asperities are deformed storing up elastic energy at the interface, which is then given back when the loading pressure is released, resulting, in most cases, in a negligible adhesive force [1,2,3,4,5, 7,8,9,10]. For extremely smooth surfaces (i.e., root-mean-square (rms) roughness \(\lesssim 1 \ \textrm{nm}\)), however, adhesion can be observed even for elastically stiff solids.

Here, we consider the case of hydrophilic direct wafer bonding, where adhesion occurs between the two surfaces, usually silicon wafers covered by thin \(\textrm{SiO}_2\) oxide layers or silica glass surfaces (amorphous \(\textrm{SiO}_2\)). This process is initiated by capillary condensation of water vapor in the small gap between the surfaces that generates a sufficient capillary adhesion force to overcome the elastic repulsion once the loading pressure exceeds a certain threshold, leading to “interfacial collapse” where the real area of solid-solid contact suddenly increases [11, 12].

For hydrophilic interfaces in a humid atmosphere, often the main contribution to the adhesion force comes from capillary bridges or menisci that form at the interface between the solids due to capillary condensation of water vapor [6, 7, 13,14,15,16,17,18,19,20,21,22,23,24]. In contrast no water capillary bridges form between hydrophobic surfaces, and the adhesive force, in the absence of covalent and ionic bonding, is mainly from the van der Waals interaction. This van der Waals adhesive force is usually much smaller than the capillary force for hydrophilic surfaces in a humid atmosphere.

For a perfectly smooth rigid sphere (radius R) in contact with a perfectly smooth flat surface, the pull-off force is given by Bradley equation \(2\pi w_0 R\) (where \(w_0\) is the work of adhesion) [25]. For a water capillary bridge (assuming complete wetting) \(w_0=2\gamma \approx 0.17 \ \mathrm{J/m^2}\) is twice the surface tension of water. For dry silica surfaces the van der Waals interaction gives \(w_0\approx 0.02-0.03 \ \mathrm{J/m^2}\) so the pull-off force due to capillary adhesion is nearly \(\sim 10\) times larger than obtained assuming only the van der Waals interaction.

Wafer bonding is strongly influenced by surface roughness. In most studies the surface roughness is only characterized by the root-mean-square (rms) roughness amplitude \(h_{\textrm{rms}}\), which is usually determined by the most long wavelength roughness components. We show that for wafer bonding the long wavelength roughness is unimportant. This result from the the existence of a roll-off region in the surface roughness power spectrum, separating the short wavelength (\(\lesssim 60 \ \textrm{nm}\)) roughness from the longer (\(\gtrsim 30 \ \mathrm{\mu m}\)) waviness roughness. As a result, to determine (or control) the strength of wafer bonding, the roughness needs to be determined only over the length scales up to \(0.1 \ \mathrm{\mu m}\). Also, we show that a better measure than the rms-roughness of the potential strength of wafer bonding is the elasticity integral defined by (4) below.

2 Previous wafer bonding results

Wafer bonding is a topic of high practical importance and of great scientific interest, and many papers have been published about wafer bonding [26,27,28,29,30,31,32,33,34]. The strength of wafer bonding depends on the nature of the surface roughness which varies depending on the surface preparation method. Theory shows that the surface roughness on many length scales influence the work of adhesion, so it is not enough to characterize the surface with the rms roughness amplitude alone. For smooth surfaces like silicon wafers, the surface roughness on all relevant length scales can be studied using Atomic Force Microscopy (AFM). In addition, for strong bond formation the surfaces need to be cleaned and “activated”. For silicon wafers two surface activation methods have been used: exposure to oxygen plasma or wet chemical surface activation.

Wafer bonding experiments use thin silicon disks with a diameter of order \(\sim 10 \ \textrm{cm}\) and thickness \(\sim 0.5 \ \textrm{mm}\). When such disks are first brought into contact, the contact initially only occurs at asperity summits and a thin air film exists over most of the interface, which has to be squeezed out before atomic-scale contact can occur over the entire interface. If the applied pressure is uniform (e.g. from the gravity force on the upper disk), a very long time (several hours) may be needed before the solids make atomic contact. In practical applications, a squeezing force is applied at one point on the surface of the upper disk to nucleate an atomic contact region that then expands radially via interfacial collapse with a characteristic speed v that depends on the strength of the adhesive interaction. More exactly, if w is the work of adhesion (the energy per unit surface area to separate the surfaces), then a simple theory, where the resistance to squeeze-out of the air film is attributed to the viscous energy dissipation in the air film, predicts \(v\propto w^{5/4}\) (see Ref. [35]). Due to the speed at which the contact zone propagates outward, there may not be enough time for the water to form the minimum free energy configuration.Footnote 1 Thus w may differ from the work of adhesion when this crack tip speed vanishes (which we denote by \(w=w_0\), i.e., the work of adhesion at the minimum energy configuration). \(w_0\) can also be measured using the cantilever bending testing methodology [see Fig. 17a]. In this study we will consider how \(w_0\) depends on the surface roughness and the humidity for hydrophilic contact.

Fig. 1
figure 1

The work of adhesion as a function of the root-mean-square (rms) roughness amplitude for silicon wafers with silicon oxide layers. Red squares are from Ref. [26] for the relative humidity \(\textrm{RH} =0.4\) and blue squares from Ref. [27] (unknown RH). The rms-roughness amplitudes was obtained from AFM topography measurements over \(2 \ \mathrm{\mu m} \times 2 \ \mathrm{\mu m}\) (Ref. [26]) and \(10 \ \mathrm{\mu m} \times 10 \ \mathrm{\mu m}\) regions (Ref. [27])

Fig. 2
figure 2

Bonding time \(t_{\textrm{b}}\) as a function of humidity for wet chemical activated surfaces (red squares) and for plasma activated surfaces (blue squares). For silicon wafers bonding to glass surfaces. The work of adhesion is proportional to \(t_{\textrm{b}}^{-5/4}\) (see Ref. [35]). From Ref. [28]

Figure 1 shows two sets of previously published experimental results that illustrate how the work of adhesion \(w_0\) is a function of the root-mean-square (rms) roughness amplitude. The red squares are from Ref. [26] for the \(\textrm{RH} =0.4\) and was obtained from the bonding velocity assuming \(w\propto v^{5/4}\). The blue squares are from Ref. [27] (unknown RH) and was obtained using cantilever bending testing.

We note that, while the rms roughness amplitude generally depends on the size of the surface area studied, as we discuss below, the power spectra of silicon wafers typically have a roll-off region for roughness wavelength between \(0.1 \ \mathrm{\mu m}\) to a few \(\mathrm{\mu m}\) so the rms roughness amplitude does not depend sensitively on the size of the studied region as long as it is in the roll-off region. For image sizes larger than \(\sim 10 \ \mathrm{\mu m}\) the rms amplitude for wafers increases due to long wavelength roughness, usually referred to as waviness. As discussed below, this long wavelength roughness has only a small influence on the wafer bonding strength.

Figure 2 shows another set of previously published experimental results, which this time illustrate the importance of humidity to increasing the magnitude of the work of adhesion in hydrophilic wafer bonding. Figure 2 shows the bonding time \(t_{\textrm{b}}\) (the time it takes for the bonding wave to propagate throughout the whole nominal (final) contact region) as a function of humidity for silicon wafers bonding to glass surfaces [28]. The red squares are for wet chemical activated surfaces, and the blue squares for oxygen plasma activated surfaces. Assuming that the work of adhesion is proportional to \(t_{\textrm{b}}^{-5/4}\) (see Ref. [35]), we conclude that the work of adhesion then increases monotonically with increasing humidity. The different bonding times for the two cases studied cases may, in part, be due to different modifications of the surface topography by the chemical and oxygen plasma treatments.

Fig. 3
figure 3

AFM topography images of the Si(100) wafer over square areas of size a \(80 \ \mathrm{\mu m} \times 80 \ \mathrm{\mu m}\) and b \(0.2 \ \mathrm{\mu m} \times 0.2 \ \mathrm{\mu m}\)

Fig. 4
figure 4

The surface roughness power spectrum obtained from AFM topography data from a Si(100) wafer surface. The blue line was obtained from a \(80 \ \mathrm{\mu m} \times 80 \ \mathrm{\mu m}\) surface area, while the other curves was obtained from square areas with the side 0.2 (green line), 1 (yellow line), 2 (pink line) and \(5 \ \mathrm{\mu m}\) (red line). The dashed line is fitted to colored, experimental curves. The rms-roughness for the \(80 \ \mathrm{\mu m}\) surface is \(0.95 \ \textrm{nm}\) and of the other surfaces 0.12, 0.14, 0.13, and \(0.11 \ \textrm{nm}\), respectively

Fig. 5
figure 5

The surface roughness on silicon wafers consist of two length scale regions, a long-wavelength region denoted “waviness” and short wavelength region extending down to atomic dimensions. During the initial wafer bonding (due to capillary bridges and the van der Waals interaction) the waviness deforms elastically and has only a small influence on the wafer bond strength. Strong chemical bonds form between the surfaces after a heat treatment

Fig. 6
figure 6

The surface roughness rms amplitude (red line) in nm and the rms-slope (blue line) and the elastic energy per unit surface area assuming complete contact (green) in \(\mathrm{J/m^2}\), as a function of the low-wavenumber cut-off q for the wafer. In the calculation only the roughness components with wavenumber between q and the large-wavenumber cut-off \(q_1\) (here \(q_1 \approx 8\times 10^9 \ \textrm{m}^{-1}\)) is included, i.e. we use (2), (3) and (4) with \(q_0\) replaced by q

3 Surface roughness power spectra

Many surfaces in nature have self-affine fractal surface roughness [37, 38], i.e., when magnified, their roughness appears similar to the unmagnified surface, except that the amplitude of the roughness increases. The Hurst exponent H is a number between 0 and 1 which define the z-scaling factor needed in order for the magnified surface to appear similar to the unmagnified surface. \(H=1\) corresponds to a fractal-like surface where no scaling of the height coordinate is needed. Most self-affine fractal surfaces have \(H>0.7\) which can be explained using surface fragility arguments [38]. However, very smooth surfaces, such as surfaces with frozen capillary waves [5], may have \(H<0.5\).

Silicon wafers are very smooth and do not have self-affine fractal surface roughness, but have length-scale regions where the scaling properties are similar to self-affine fractal surfaces with \(H\approx 0\) and \(H\approx 1\) (Fig. 4).

For short ranged (contact) interaction theory shows that when \(H> 0.5\) the long wavelength roughness has the strongest influence on the adhesion between solids while the opposite is true for \(H<0.5\) (see Ref. [1, 3, 4]). This is related to the elastic energy needed to deform the solids so they make contact in the nominal contact area. If this deformation energy is larger than the interfacial binding energy, resulting from the force field acting between the surfaces, no adhesion will be observed, as is the case for most elastically stiff solids unless the surfaces are extremely smooth. For surfaces that are not self-affine, such as the silicon wafers studied here, only a detail study can show if long-wavelength or short-wavelength roughness reduces the adhesion most. For such studies, which rely on the Persson theory of contact mechanics, the surface roughness power spectrum is of crucial importance [2, 38,39,40,41,42,43].

The surface roughness power spectrum C(q) is defined by

$$\begin{aligned} C(q) = {1\over (2\pi )^2} \int d^2x \langle h(\textbf{x}) h(\textbf{0})\rangle e^{-i \textbf{q}\cdot \textbf{x}} \end{aligned}$$
(1)

Here \(z=h(\textbf{x})=h(x,y)\) is the surface height with the \(z=0\) plane chosen so that \(\langle h \rangle = 0\), and \(\langle .. \rangle \) stands for ensemble average, which in most cases is equivalent to averaging over the surface area. A self-affine fractal surface has a power spectrum \(C(q)\sim q^{-2(1+H)}\). The root-mean-square (rms) roughness amplitude \(h_{\textrm{rms}}\) and the rms-slope \(\xi \) are easily calculated from the power spectrum using

$$\begin{aligned} h_{\textrm{rms}}^2 =\langle h^2 \rangle = 2 \pi \int _{q_0}^{q_1} dq \ qC(q) \end{aligned}$$
(2)
$$\begin{aligned} \xi ^2 = \langle (\nabla h)^2 \rangle = 2\pi \int _{q_0}^{q_1} dq \ q^3 C(q) \end{aligned}$$
(3)

where \(q_0\) and \(q_1\) are the smallest and the largest roughness wavenumber included in the calculation.

Another important quantity is the elastic energy stored per unit surface area when two solids with the combined surface roughness power spectrum C(q) are in complete contact:

$$\begin{aligned} {U_{\textrm{el}} \over A_0} = E^* {\pi \over 2} \int _{q_0}^{q_1} dq \ q^2 C(q) \end{aligned}$$
(4)

where \(E^*\) is the effective Young’s modulus (see below).

Here we are interested in very smooth surfaces such as highly polished glass surfaces or silicon wafers. Such surfaces have similar rms-roughness, typically of order \(0.1-1 \ \textrm{nm}\) when studied over surface areas of linear size \(\sim 1 \ \mathrm{\mu m}\), and similar forms of the surface roughness power spectra. In most cases the height probability distribution is nearly Gaussian.

3.1 AFM topography measurements

In the present study we use the surface roughness power spectrum obtained from atomic force microscopy (AFM) topography measurements on a Si(100) wafer. To obtain different magnitudes of the roughness, we scale the power spectrum with different prefactors. Note that since C(q) is quadratic in \(h(\textbf{x})\) scaling the power spectrum with a factor \(s^2\) correspond to scaling the rms-roughness amplitude with a factor s.

Using AFM we have measured the height topography of Si(100) wafers with square image sizes ranging from \(0.2 \ \mathrm{\mu m} \times 0.2 \ \mathrm{\mu m}\) to \(80 \ \mathrm{\mu m} \times 80 \ \mathrm{\mu m}\), see Figs. 3 and 4. The blue line in Fig. 4 shows the surface roughness power spectrum obtained from the \(80 \ \mathrm{\mu m} \times 80 \ \mathrm{\mu m}\) surface area, while the other curves was obtained from square areas with the side 0.2, 1, 2 and \(5 \ \mathrm{\mu m}\). Figure 3 shows that the Si(100) surface has (a) long wavelength waviness with wavelengths \(>10 \ \mathrm{\mu m}\) and (b) short wavelength roughness with a wavelength \(\sim 0.05 \ \mathrm{\mu m}\). In the power spectrum these two regions are separated by a roll-off region. We will show below that the long wavelength roughness has a negligible effect on the wafer bonding (see Fig. 5). Hence studying the surface roughness over small surface areas with the lateral size of order \(\sim 1 \ \mathrm{\mu m}\) is enough for determining the strength of the capillary adhesion for wafer bonding in most cases.

Figure 6 shows the surface roughness rms-amplitude (red line) in nm, the rms-slope (blue line) and the elastic energy per unit surface area, \(U_{\textrm{el}}/A_0\), assuming complete contact (green) in \(\mathrm{J/m^2}\), as a function of the low-wavenumber cut-off q for the wafer. In the calculation only the roughness components with wavenumber between q and the large-wavenumber cut-off \(q_1\) (here \(q_1 \approx 8\times 10^9 \ \textrm{m}^{-1}\)) is included, i.e. we use (2)-(4) with \(q_0\) replaced by q. The elastic energy \(U_{\textrm{el}}\) is proportional to the effective Young’s modulus \(E^*\) [defined below after (5)], and we have used \(E^* \approx 71.6 \ \textrm{GPa}\). Note the roll-off region between \(10^6 \ \textrm{m}^{-1}< q < 10^8 \ \textrm{m}^{-1} \), where the rms-roughness amplitude is nearly constant, which is also the roll-off region of the C(q) curve. The figure also shows that the rms-slope is determined by the very shortest roughness wavelength component and that \(U_{\textrm{el}}/A_0\) is independent of the roughness in the roll-off region. The latter indicate that the roughness in the roll-off region is unimportant for the wafer bonding which is also confirmed by calculations below. This is opposite to most “normal” cases where it is the long wavelength roughness which kills adhesion [1]. This difference results from the peculiar form of the power spectrum of the wafer surface with the waviness region occurring at several decades longer wavelength than the (short wavelength) surface roughness region.

4 Theory and numerical results

In this section we first describe the theory used to study capillary adhesion. Next we present the numerical results focusing on the work of adhesion and the average surface separation, assuming that thermal (or kinetic) equilibrium with the water vapor phase occur at all stages during pull-off.

4.1 Theory

We will use the same theory for capillary adhesion as presented in Ref. [7] which we briefly review here. Consider the contact between two solids with the roughness profiles \(z=h_1(\textbf{x})\) and \(h_2(\textbf{x})\) and with the Young’s elastic modulus and Poisson ratio \((E_1,\nu _1)\) and \((E_2,\nu _2)\), respectively. For relative smooth surfaces, where the average rms surface slopes are much smaller than 1 this problem is equivalent to a perfectly flat and rigid solid in contact with an elastic solid with the surface profile \(z=h_1(\textbf{x})+h_2(\textbf{x})\) and the effective Young’s modulus E and Poisson ratio \(\nu \)

$$\begin{aligned} {1-\nu ^2 \over E} = {1-\nu _1^2 \over E_1} + {1-\nu _2^2 \over E_2} \end{aligned}$$
(5)

We define the effective modulus \(E^*=E/(1-\nu ^2)\). The elastic properties of silicon are anisotropic but here we use an average modulus \(E_1^*=144 \ \textrm{GPa}\) giving for silicon in contact with silicon \(E^* = 72 \ \textrm{GPa}\). We note that silica has a modulus roughly half of that of silicon and that when the roughness on the two surfaces are uncorrelated the power spectrum of the combined roughness \(h=h_1+h_2\) is \(C=C_1+C_2\).

We assume that the solids are in contact in a humid atmosphere (water vapor pressure \(P_{\textrm{v}}\)) and that thermal equilibrium occurs. For hydrophilic surfaces, capillary bridges form at the interface with a meniscus radius \(r_{\textrm{K}}\) given by the Kelvin equation

$$\begin{aligned} r_{\textrm{K}} = - {\gamma v_0 \over k_{\textrm{B}} T \textrm{ln}(P_{\textrm{v}}/P_{\textrm{sat}})} \end{aligned}$$
(6)

where \(v_0\) is the volume of a water molecule in the liquid state. The height of the capillary bridge or meniscus

$$\begin{aligned} d_{\textrm{K}} = r_{\textrm{K}} \left( \textrm{cos}\theta _1+\textrm{cos}\theta _2 \right) \end{aligned}$$
(7)

where \(\theta _1\) and \(\theta _2\) are the water contact angles on the two solid surfaces. In the water meniscus, a negative (Laplace) pressure acts with \(p=-p_{\textrm{K}}\) where \(p_{\textrm{K}} = \gamma /r_{\textrm{K}}\). In what follows we assume complete wetting, \(\theta _1=\theta _2=0\). In this case \(d_{\textrm{K}} = 2 r_{\textrm{K}}\) and \(p_{\textrm{K}} = 2 \gamma /d_{\textrm{K}}\). We further assume that the Kelvin equation holds down to a Kelvin radius as small as \(r_{\textrm{K}} = 0.22 \ \textrm{nm}\) which occurs for a relative humidity \(=0.1\) [8].

Assume the solids are squeezed together with an external pressure \(p_0 = F_0/A_0\). If \(\Delta A\) is the surface area occupied with water, then the attractive capillary force acting on the upper solid is \(F_{\textrm{ad}} = p_{\textrm{K}} \Delta A\). In a mean-field type of approach we replace the nonuniform adhesive pressure from the capillary bridges with a uniform pressure \(p_{\textrm{ad}} = F_{\textrm{ad}}/A_0\). To determine the contact area, and the probability distribution \(P(u) = \langle \delta [u-u(x,y)]\rangle \) of interfacial separation, we assume that the solids are squeezed together with the pressure \(p=p_0+p_{\textrm{ad}}\). Note that the applied pressure \(p_0\) is negative during pull-off.

We use the Persson contact mechanics theory [5, 39,40,41] to calculate P(u). We assume that water occupies all the interfacial non-contact surface area where the separation \(u(x,y) < d_{\textrm{K}}\). We obtain the work of adhesion by integrating \(p_0({\bar{u}})\) as a function of the (average) separation \({\bar{u}}\) from the equilibrium separation (where \(p_0=0\), and where \(p_{\textrm{ad}}\) is balanced by the repulsive pressure from the area of real contact) to infinite separation (where again \(p_0=0\)). For more details see Ref. [7].

Fig. 7
figure 7

The work of adhesion as a function of the relative humidity \(\textrm{RH}=P/P_{\textrm{sat}}\) (where P is the water partial pressure and \(P_{\textrm{sat}}\) the saturated water partial pressure). Results are shown for the contact between a perfectly flat surface and the wafer surface (blue symbols), and between two wafer surfaces (green symbols) using the fitted line power spectrum in Fig. 4. The red symbols are the results for the magnetic disk against a flat

Fig. 8
figure 8

When the Kelvin length, which equals \(2r_{\textrm{K}}\) for wetting liquids, becomes much larger than the surface roughness amplitude, the work of adhesion approach \(w=2\gamma \), where \(\gamma \) is the water surface tension

Fig. 9
figure 9

The surface roughness power spectrum of the wafer surface (green line) and of the same surface after removing the waviness (red). The blue line is the power spectrum of a magnetic disk

Fig. 10
figure 10

The work of adhesion calculated with and without long-wavelength roughness as a function of the relative humidity \(\textrm{RH}=P/P_{\textrm{sat}}\) (where P is the water partial pressure and \(P_{\textrm{sat}}\) the saturated water partial pressure). Results are shown for the contact between two wafer surfaces (green symbols) using the green line power spectra shown in Fig. 9. The red symbols are for the contact between two wafer surfaces after removing the waviness, i.e. using the red line power spectrum in Fig. 9

Fig. 11
figure 11

The work of adhesion (a) and the average surface separation at equilibrium (b), as a function of the relative humidity \(\textrm{RH}=P/P_{\textrm{sat}}\) (where P is the water partial pressure and \(P_{\textrm{sat}}\) the saturated water partial pressure). Results are shown for five different surfaces with the rms roughness amplitudes \(h_{\textrm{rms}} = 0.1\), 0.2, 0.3345, 0.5 and \(0.7 \ \textrm{nm}\). The surface roughness power spectra are obtained by scaling the roughness of the magnetic disk with appropriate scaling factors. The red data (\(h_{\textrm{rms}} =0.3345 \ \textrm{nm}\)) is assuming the power spectrum of the magnetic disk shown in Fig. 9 (red line)

Fig. 12
figure 12

a The force per unit surface area (pull-off stress) needed to separate the surfaces for the \(h_{\textrm{rms}} =0.7 \ \textrm{nm}\) surface for the relative humidity’s \(\textrm{RH}=0.65\), 0.8 and 0.9. b The relative contact area covered by water as a function of the (average) surface separation (in units of the rms surface roughness \(h_{\textrm{rms}}\)) during pull-off. It is assumed that the system is in thermal equilibrium with the water vapor, which will be the case if the pull-off occurs slowly enough

Fig. 13
figure 13

Illustration of both the elastic deformation of a rough surface pressed against a rigid smooth surface and the different wetting regimes that form at this contacting interface at different separation distances due to capillary condensation. Note that amount of elastic deformation is greatly exaggerated. The terms for the different wetting regimes (flooded, pillbox, toe-dipping) are from Refs. [9, 12]. Note that the term “pillbox” is used to describe those menisci with a shape similar to old fashion, cylindrical shaped boxes used to store pills which had a small height to diameter ratio

Fig. 14
figure 14

Boundary line in the \(\textrm{RH}-h_{\textrm{rms}}\) plane between wafers exhibiting capillary adhesion sufficient to overcome the elastic deformation force and collapse the interface and wafers with insufficient capillary adhesion to lead to direct bonding

Fig. 15
figure 15

Boundary line in the \(d_{\textrm{K}}/h_{\textrm{rms}} - E^* h_{\textrm{rms}}/\gamma \) plane between wafers exhibiting capillary adhesion sufficient to overcome the elastic deformation force and collapse the interface and wafers with insufficient capillary adhesion to lead to direct bonding

5 Numerical results

We will now study the work of adhesion assuming thermal (kinetic) equilibrium with the water vapor. We consider the contact between two wafer surfaces and also the contact between a wafer surface and a perfectly flat surface, which can also be interpreted as the contact between two wafer surfaces with the combined power spectra reduced by a factor of 2 (or the rms-roughness of both surfaces reduced by a factor of \(\surd 2\)).

Figure 7 shows the work of adhesion as a function of the relative humidity \(\textrm{RH}=P/P_{\textrm{sat}}\) (where P is the water partial pressure and \(P_{\textrm{sat}}\) the saturated water partial pressure). Results are shown for the contact between two wafer surfaces (green symbols) and for the contact between a wafer surface and a perfectly flat counter surface (blue symbols). Also shown is the contact between the magnetic disk and a flat surface (red symbols).

Note that no capillary adhesion occurs when the surfaces are fully immersed in water; but, when the relative humidity \(\textrm{RH} < 1\), capillary adhesion occurs and the capillary bridges have a finite (maximal) height. The higher the relative humidity, the higher the height of these capillary bridges, and the longer the distance needed to break from these capillary bridges. In particular, when the relative humidity is very close to 1, the Kelvin length \(d_{\textrm{K}}\) and becomes much larger than surface roughness amplitude. In this case, the surfaces need to be separated by \(\approx d_{\textrm{K}}\) before the capillary bridges are broken (see Fig. 8). Further, the work of adhesion is \(w \approx p_{\textrm{K}} d_{\textrm{K}}\), and since the Laplace pressure for a meniscus is \(p_{\textrm{K}} = \gamma /r_{\textrm{K}}\) and since \(d_{\textrm{K}} = 2 r_{\textrm{K}}\) for a wetting liquid, this gives \(w\approx (\gamma /r_{\textrm{K}}) \times (2r_{\textrm{K}}) = 2\gamma \). In other words, the work of adhesion approaches \(2\gamma \) as the water partial pressure P approaches \(P_{\textrm{sat}}\) (i.e., as \(\textrm{RH} \rightarrow 1\).)

To help show that the short wavelength roughness matters most for the capillary adhesion we have calculated the work of adhesion for the case where we remove the waviness part of the surface roughness. (The red line in Fig. 9 shows the surface roughness power spectra used in the calculation.) Fig. 10 shows the work of adhesion as a function of the relative humidity for the contact between two wafer surfaces (green symbols) using the full surface roughness power spectrum. The red symbols are for the contact between two wafer surfaces after removing the waviness.

Note that removing the waviness part of the roughness has only a small influence on the result. This is due to the roll-off region separating the long-wavelength (waviness) roughness from the short wavelength roughness. This is also reflected in the rms-slope which is \(\approx 4\times 10^{-4}\) for the (low resolution) measurement on the \(80 \ \mathrm{\mu m}\) surface area and \(\approx 0.2\) for the (high resolution) measurement on the \(0.2 \ \mathrm{\mu m}\) surface area.

To further illustrate how short wavelength roughness determines the work of adhesion, we have repeated the calculation using the roughness power spectrum of the magnetic disk from a hard disk drive. This power spectrum is shown in Fig. 9 (blue line). This roughness power spectrum was obtained from an AFM topography image collected over a \(1 \ \mathrm{\mu m} \times 1 \ \mathrm{\mu m}\) area. This magnetic disk is similar to the one described Section 2.4.3.1 of Ref. [9]. While still very smooth, it has about twice the rms roughness as the silicon surface over a \(1 \ \mathrm{\mu m} \times 1 \ \mathrm{\mu m}\) area (\(h_{\textrm{rms}}\approx 0.33 \ \textrm{nm}\) versus \(\approx 0.14 \ \textrm{nm}\)), and a higher rms slope (\(\xi \approx 0.28\) versus \(\approx 0.20\)), but much smaller long wavelength roughness (not shown). (Thus the condition \(\xi<<1\) needed for the theory to be accurate is approximately satisfied.) The work of adhesion of the magnetic disk against a flat is shown in Fig. 7. Due to the larger short wavelength roughness, this surface has much lower values of the work of adhesion.

We now study how scaling the power spectrum by different factors affects the capillary adhesion. Figure 11a shows the work of adhesion and Fig. 11b the average surface separation at equilibrium, as a function of the relative humidity \(\textrm{RH}=P/P_{\textrm{sat}}\). The average separation is defined as the volume of the non-solid portion of the interface (wet and dry) divided by the nominal area of the interface. Results are shown for five different surfaces with the power spectrum of the magnetic disk surface scaled so that the the rms roughness amplitudes are \(h_{\textrm{rms}} = 0.1\), 0.2, 0.33, 0.5 and \(0.7 \ \textrm{nm}\). Again, it is assumed that the system is in equilibrium with the water vapor.

Next we consider how the force per unit area (pull-off stress) varies as the two surfaces are pulled apart (with the surfaces parallel) after contact. Figure 12a shows the pull-off stress as a function of separation distance for the \(h_{\textrm{rms}} =0.7 \ \textrm{nm}\) surface for the relative humidities \(\textrm{RH}=0.65\), 0.8 and 0.9. Note that the areas under these curves yields the work of adhesion, and that, for the three studied humidities, the highest work of adhesion is for the highest humidity \(\textrm{RH}=0.9\), but the pull-off stress (during uniform separation) is highest for \(\textrm{RH}=0.8\). Figure 12b shows the relative contact area covered by water as a function of the average surface separation (in units of the rms surface roughness \(h_{\textrm{rms}}\)) during pull-off.

The shapes of the curves in Fig. 12 can be understood in terms of the schematic diagrams in Fig. 13, which illustrate both elastic deformation of a rough surface when pressed against a smooth, rigid surface and the different wetting menisci that form due capillary condensation of water vapor. Figure 13a shows the situation where the contact interface is flooded with the condensed water (at high enough humidity) and the capillary adhesion force \(F_{\textrm{a}} = p_{\textrm{K}} A_{\textrm{0}}\) is balanced the elastic repulsive force; this situation corresponds to the leftmost point of the curves in Fig. 12. When an increasing pull-off force \(F_{\textrm{pull}}\) is applied to separate the surfaces, the interface remains flooded (\(A_{\textrm{wet}}\)/\(A_{\textrm{0}}\) = 1 part of the curves in Fig. 12b) until parts of the interface become separated by distances greater than \(d_{\textrm{K}}\) and dry voids start to appear in the interface. This results in the kinks in the curves in Fig. 12b where \(A_{\textrm{wet}}\)/\(A_{\textrm{0}}\) becomes \(< 1\). At first the interface is wet mainly by large area, pillbox shaped menisci (Fig. 13b), but as the surfaces are further separated, these pillbox menisci shrink to toe-dipping menisci, and the capillary adhesion decreases as the wet area decreases until only a few toe-dipping menisci remain at the last contacting asperities (Fig. 13c). Figure 13 also illustrates how, as the loading force on the contacting interface is reduced, the surface roughness elastically relaxes back to its unstressed topography.

Fig. 16
figure 16

The force per unit surface area (pull-off stress) needed to separate the surfaces for the Si wafer-flat interface in the dry state assuming VdW interaction between the surfaces (red line), and due to capillary bridges (green line) (the relative humidity \(\textrm{RH}=0.5\)). For the VdW interaction we have assumed that the adhesive pressure depends on the surface separation as \(\propto u^{-3}(x,y)\) and that the work of adhesion between to ideally smooth, solid surfaces is \(w_0=0.03 \ \mathrm{J/m^2}\)

In order to achieve solid-solid contact over a large area (as shown in Fig. 13a) the right combination of conditions have to occur for the capillary adhesive force to be large enough to overcome the elastic repulsive force in order to collapse the interface, a precursor step to achieving direct bonding between two wafers. Figure 14 shows the boundary line in the \(\textrm{RH}-h_{\textrm{rms}}\) plane between wafers exhibiting capillary adhesion sufficient to collapse a hydrophilic interface and those situations where the capillary adhesion is insufficient at any separation distance to overcome the elastic repulsive deformation force from the contacting asperities. As expected, smoother surfaces with a low value of \(h_{\textrm{rms}}\) can be collapsed with a low value RH, while rougher surfaces require higher humidity. (For example, for \(h_{\textrm{rms}}=1 \ \textrm{nm}\) a relative humidity of \(\textrm{RH} > 0.75\) is needed in order for capillary adhesion to induce interfacial collapse.)

For a given shape of the surface roughness power spectrum, the capillary adhesion theory enters only through the parameters \(d_{\textrm{K}}\), \(h_{\textrm{rms}}\), \(E^*\) and the liquid surface tension \(\gamma \). From these parameters one can only construct two (independent) dimensionless parameters, namely \(d_{\textrm{K}}/h_{\textrm{rms}}\) and \(E^* h_{\textrm{rms}}/\gamma \). Hence the boundary line separating capillary adhesion from non-adhesion must depend on only these parameters. In Fig. 15 we show the boundary line in the \(d_{\textrm{K}}/h_{\textrm{rms}} - E^* h_{\textrm{rms}}/\gamma \) plane separating wafers exhibiting capillary adhesion from those situations with insufficient capillary adhesion to collapse the interface. As expected reducing the elastic modulus \(E^*\) or the root-mean-square roughness amplitude \(h_{\textrm{rms}}\), or increasing liquid surface tension \(\gamma \) result in capillary adhesion prevailing to smaller \(d_{\textrm{K}}/h_{\textrm{rms}}\) or smaller relative humidity.

Next, let us address the role of the van der Waals interaction in wafer bonding. In Fig. 16 we compare pull-off stress as a function of the (average) surface separation for the wafer-flat contact for the dry contact, with only the VdW interaction between the surfaces (red line), with the capillary case (green line) for relative humidity \(\textrm{RH}=0.5\). For the VdW interaction we have assumed that the adhesive pressure depends on the surface separation as \(\propto u^{-3}(x,y)\) and that the work of adhesion for ideally smooth surfaces is \(w_0=0.03 \ \mathrm{J/m^2}\) (see Ref. [10] for details of the model). For the dry case the work of adhesion (the area under the stress-separation curve) is \(w \approx 0.01 \ \mathrm{J/m^2}\), and for capillary adhesion is about 10 times higher. This is similar to the difference in binding strength for the perfect sphere-flat configuration (Bradley case), but in that case the result does not depend on the humidity, in contrast to wafer bonding. It is also interesting to note that for the adhesion between hard particles with big surface roughness (such as mineral particles produced by fracture, e.g. crunched stone) the capillary force is typically \(\sim 100\) times bigger than the van der Waals adhesion force [13, 14].

Fig. 17
figure 17

a The cantilever bending test methodology to determine the work of adhesion w for a thin elastic sheet bound to a substrate (black) here assumed to be rigid. For two wafers in contact the work of adhesion \(w=3 Ed^2h^3/(16 L^4)\). b The crack tip process zone for a stationary crack (thermal equilibrium). The work of adhesion is the work to separate the surfaces from the equilibrium (bonding) position (to the right of the crack tip) to the fully separated (non-bonding) state (to the left of the crack tip) where no capillary bridges occur between the surfaces. c The work of adhesion is maximal for the case of smooth surfaces where the capillary contribution to the work of adhesion equals \(2 \gamma \), where \(\gamma \) is the surface tension of water

6 Discussion

The breaking the adhesive bond between two surfaces is almost never uniform, but occurs instead by interfacial crack propagation. The onset of this crack propagation, however, is determined by the work of adhesion rather than the maximum pull-off stress during uniform separation. Since the interfacial collapse that occurs during the direct bonding between two wafers is just the reverse process of crack propagation, the work of adhesion is the most relevant parameter for the direct bonding process.

The work of adhesion depends not only on the interaction force but on the distance over which it acts. Thus strong adhesion typically results from “long bonds” rather than “strong bonds” [44]. For the case of capillary adhesion, this depends mainly on \(d_{\textrm{K}}\) and hence on the relative humidity. When the humidity increases, the distance needed to break a capillary bridge increases, and this is more important than the magnitude of the force the capillary menisci exert on the surfaces. This can be seen in Fig. 12a which shows that for the three RH used, the maximum force takes its highest value for the relative humidity \(\textrm{RH}=0.8\), but the work of adhesion increases continuously with increasing humidity and is larger for 0.9 than for 0.8 (Fig. 11).

We note that for small and elastically stiff solid objects the bond-breaking during pull-off does not occur by interfacial crack propagation but occurs uniformly in the nominal contact area [43, 45]. In this case the maximum pull-off force will occur for a relative humidity \(<1\) (which depends on the surface roughness) as typically observed in AFM pull-off experiments [47].

Figure 17a shows a typical experiment used to determine the work of effective adhesion \(w_0\) for a thin elastic sheet (e.g. a silicon wafer) bound to a substrate here assumed to be rigid. The work of adhesion \(w_0\) can be determined by minimizing the total energy which has a positive contribution from bending the sheet and a negative contribution from the binding of the sheet to the substrate, leading to the equation \(w_0=3 Ed^2h^3/(16 L^4)\) (here we have assumed two bound silica wafers, see Ref. [29]). The work of adhesion \(w_0\) depends on the complex separation processes occurring in some region close to the crack tip (the crack-tip process zone) where the capillary bridges are broken. (When determining \(w_0\), we assume stationary contact and thermal equilibrium.) Thus the work of adhesion is the work to adiabatically separate the surfaces from the equilibrium bonding position (to the right of the crack tip in Fig. 17) to the fully separated non-bonding state (to the left of the crack tip) where no capillary bridges are present between the surfaces. The work of adhesion is maximal for the case of smooth, flooded surfaces where the capillary contribution to the work of adhesion equals \(2 \gamma \), where \(\gamma \) is the surface tension of water.

Next we consider other factors that can contribute to the work of adhesion during direct wafer bonding. One can distinguish several stages in wafer bonding [30]:

Stage 1 - Prior to wafer bonding, the surfaces need to be cleaned and “activated”. For silicon wafers, two surface activation methods are used: exposure to oxygen plasma or wet chemical surface activation. This step ensures that the silicon oxide surfaces are covered with a high density Si-OH groups, that not only make the surface hydrophilic, but are also necessary for strong covalent bonds to form during the annealing stage.

Stage 2 - The interface is collapsed using the hydrophilic bonding process discussed in this paper, where the adhesion originates from the Laplace pressure in the capillary menisci from the condensation of water vapor.

Stage 3 - To form strong covalent bonds across the solid-solid interface, the surface is annealed to typically \(T\approx 800^\circ \textrm{C}\). This annealing cycle removes water from the interface and promotes covalent oxide bonds forming across the interface, strongly bonding the wafers together. During annealing, a condensation polymerization reaction occurs to produce covalent high strength bonding:

Si-OH + HO-Si \(\leftrightarrow \) Si-O-Si + \(\mathrm{H_2O}\)

This reaction is reversible up to \(400^\circ \textrm{C}\), but at higher temperatures the released water is able to diffuse into the bulk silicon, releasing hydrogen via

-Si-Si- + \(\mathrm{H_2O}\) \(\rightarrow \) -Si-O-Si + \(\mathrm{H_2}\)

(This hydrogen gas is the primary cause of annealing voids.)

This picture has some similarities to the binding of cellulose fibers in paper. To bind the fibers the surfaces must first be chemically “activated” to increase the surface energy and make the surfaces more hydrophilic. During drying of wet fibers water capillary bridges pull the fibers into such close contact that hydrogen bonds can form between hydroxyl groups on adjacent cellulose fibers. In addition, interdiffusion of fibrils between the (in the wet state) gel-like surface layers on the cellulose fibers affect the bonding strength between the fibers in the dry state [48].

We now consider the extent that plastic deformation might impact the work of adhesion. For the case of maximal height of the water film (\(d_{\textrm{K}} = 0.44 \ \textrm{nm}\) for \(\textrm{RH}=0.1\) and \(d_{\textrm{K}} = 9.59 \ \textrm{nm}\) for \(\textrm{RH}=0.9\)) the corresponding squeezing pressures from the condensed water menisci are \(p\sim 2 \gamma /d_{\textrm{K}} \approx -319 \ \textrm{MPa}\) and \(\approx -15 \ \textrm{MPa}\), respectively. However, due to the surface roughness, the area of solid-solid contact is very small, as contact only occurs at the summits of the asperities resulting in relatively high contact pressures occurring over these areas of contact. These contact pressures are of the order of \(25 \ \textrm{GPa}\) for the roughest surface studied in Fig. 11 (with \(h_{\textrm{rms}} = 0.7 \ \textrm{nm}\)). This stress is similar to the yield stress \(\sim 10 \ \textrm{GPa}\) of silica at the nanoscale [49] and is sufficient to cause plastic deformation to occur, increasing slightly the area of solid-solid contact area and its contribution to the work of adhesion. The plastic deformation may be further increased during the heating cycle, as increased temperature lowers of the yield strength. At high temperatures of \(800-1400^\circ \textrm{C}\) silicon oxide flows slightly (i.e., plastically deforms), filling interfacial voids and helping to form a “perfect” bond.

In addition to plastic flow we note that with increasing pressure silicon undergoes various phase transformations whose stress thresholds are dependent on the loading/unloading conditions [50]. In loading at a pressure of \(10-12 \ \textrm{GPa}\) the original diamond structure of silicon transforms into a denser \(\beta \)-tin (metallic) phase, accompanied by a \(22\%\) volumetric reduction. Hence the region below the \(\mathrm{SiO_2}\) film may not deform just elastically but more complex structural changes may occur.

The study presented above is valid for other elastically stiff materials (\(E^* \sim 10^{11} \ \textrm{Pa}\) or higher) with very smooth surfaces, where the water contact angle is well below \(90^\circ \), giving hydrophilic interfaces. This includes most ceramic materials like SiC, CrN and TiN which have water contact angles of order \(\sim 30^\circ \). For elastically softer materials like glassy polymers with hydrophilic interfaces (e.g. as a result of exposure to oxygen plasma) capillary adhesion will manifest itself for much larger surface roughness than for the silica surfaces studied above.

For very soft materials such as rubber, the capillary adhesion can have a major influence on the area of real contact, as predicted and observed for wiper blades on glasses surface [7, 51]. In this case, as the water is removed by wiping and evaporation, as a function of time a high friction peak, with a friction considerably higher than the dry one, may be observed. Experiments [51] have shown that in the tacky regime the attraction from water capillary bridges between the rubber and the glass substrate pull the rubber into close contact with the substrate so that the area of real contact is even larger in the tacky regime than for the perfectly dry contact. Similar effects have been observed for the human skin [52, 53].

7 Summary and conclusions

We have studied the influence of surface roughness and humidity on the work of adhesion for the case of hydrophilic wafer bonding. In agreement with experiments, we find that the surfaces with roughness greater than \(\sim 1 \ \textrm{nm}\) rms amplitude (when measured over a surface area of order \(1 \ \mathrm{\mu m} \times 1 \ \mathrm{\mu m}\)) will not bond under typical conditions unless the humidity is very close to the saturation level (i.e., relative humidity \(\textrm{RH}>0.8\)). For any given surface roughness, the force needed to separate two solids with nominal flat surfaces takes a maximum for some relative humidity between 0 and 1 assuming the solids are separated with their surfaces parallel. Also, since bond breaking occurs non-uniformly in the form of interfacial crack propagation, the pull-off force is determined by the work of adhesion, which increases continuously with the humidity and reaches the value \(2\gamma \approx 0.14 \ \mathrm{J/m^2}\) as \(\textrm{RH} \rightarrow 1\). We have shown that in the case of hydrophilic bonding of silicon wafers, the long-wavelength roughness (usually denoted waviness) has a negligible influence on the strength of wafer bonding (the work of adhesion) due to capillary bridges.