1 Introduction

Quantum-degenerate atomic gases in optical dipole traps provide the starting point of many experiments. To realize these samples, one can directly load a laser-cooled sample into an optical dipole trap (ODT) and perform evaporative cooling, which however requires very high ODT powers to provide sufficient trap volume, depth and confinement. Alternatively, one loads the atoms first in a magnetic trap and performs evaporative cooling and afterward transfers a dense and compact atomic cloud into an ODT, which now requires a much lower ODT power, at the expense of experimental complexity. Within this last category, a very elegant approach is the hybrid trap (HT), introduced in Ref. [1], consisting of a simple quadrupole magnetic trap (QMT) and a single-beam ODT. Efficient evaporation and Bose–Einstein condensation (BEC) have been demonstrated (see, e.g., [13]), using ODT powers of only a few Watts. By switching off the QMT completely, the atoms are transferred from the HT to a pure ODT.

The hybrid trap has been mostly applied to \(^{87}\hbox {Rb}\), but is assumed to be generally applicable to other magnetically trappable atomic species [1]. However, the application of HT strongly depends on the mass of atom. Most importantly, the rates of Majorana loss and heating, which determine the temperature that can be reached by evaporative cooling in a QMT, scale inversely with mass [4, 5]. This limits the transfer efficiency for light atoms, or puts constraints on the trap volume and trap depth, and therefore the power, of the ODT. Furthermore, for light atoms evaporative cooling in the HT is limited as the additional axial confinement provided by the QMT is small because of the small levitation gradient, below which the QMT has to operate in the HT. Finally, the small levitation gradient puts experimental limits on the control of the displacement of the QMT with respect to the ODT, which further limits the axial confinement.

Here we report on the production of a metastable triplet helium (\(^4\hbox {He}^*\)) BEC using a single-beam HT with a moderate power of less than 3 W, demonstrating the application of HT for a light atom. Our work provides a novel and simple method for obtaining a \(^4\hbox {He}^*\) BEC, which can be used for atom optics experiments [610] or precision spectroscopy for fundamental tests of two-electron quantum electrodynamic theory [1113]. So far, \(^4\hbox {He}^*\) BECs have been obtained in Ioffe-Pritchard or cloverleaf type of magnetic traps [1420], which has been subsequently transferred to a single-beam [21, 22] or crossed-beam ODT [11] (see Ref. [23] for a review on experimental work on ultracold \(^4\hbox {He}^*\)). Very recently, a \(^4\hbox {He}^*\) BEC has been realized in a crossed-beam ODT [24], using a total power of 26 W, in which the ODT is loaded from a QMT, following evaporative cooling to BEC in the ODT.

This paper is organized as follows. In Sect. 2, we give a brief description of HT for the particular case of metastable helium. In Sect. 3, we describe our experimental setup and initial cooling scheme to load the single-beam HT or ODT. In Sect. 4, we present our results regarding the alignment and loading of the HT, comparing evaporative cooling in the HT and ODT, and provide evidence for BEC. Finally, in Sect. 5 we conclude.

2 Single-beam hybrid trap for \(^4\hbox {He}^*\)

In the HT, a single-beam ODT is aligned slightly away from the QMT center, such that the trap minimum of the combined magnetic and optical trapping potential is at a finite magnetic field, and atoms do not suffer Majorana spin-flip losses once loaded in the HT. This also means that the atoms remain spin-polarized, which is crucial for \(^4\hbox {He}^*\) in order to avoid strong losses due to Penning ionization. After forced evaporative cooling in the QMT the magnetic field gradient of the QMT is ramped down such that the vertical gradient is equal or lower than the levitation gradient \(B^{\prime}_{\mathrm{lev}}\equiv m g /\mu \), at which the QMT alone cannot trap atoms. Here m is the mass, g is the gravitational acceleration, and \(\mu \) is the magnetic moment of the atom. Lowering the power in the ODT beam allows further evaporative cooling in the HT, in which the “hot atoms” can escape mainly downwards.

Fig. 1
figure 1

Trapping potentials and peak densities of HT (solid lines) and ODT (red dashed lines) for \(^4\hbox {He}^*\), calculated for \(B^{\prime}=B_{\mathrm{lev}}=0.35\) G/cm, \(\lambda =1557\) nm and \(w_0=40\,\upmu\hbox{m}\) (corresponding to \(y_R=3.2\) mm), using the trap potential given by Eq. 1. The displacement in case of the HT is \(z_0=0.1\) mm (black), \(z_0=1\) mm (blue) and \(z_0=10\) mm (green). ad Trapping potentials along the z-axis (vertical direction) and y-axis (axial direction), comparing ODT power \(P=3\) W and \(P=0.3\) W. e Peak densities \(n_0/N\) as function of P and T, assuming \(k_B T=(1/10)U_0\) (and therefore \(T\propto P\))

The combined potential of a QMT and single-beam ODT is given by:

$$\begin{aligned} U(x,y,z)=\mu B^{\prime} \sqrt{x^2+4y^2+(z-z_0)^2}-\frac{2PC}{\pi w(y)^2}\exp \left[ -2\frac{x^2+z^2}{w(y)^2}\right] +m g z, \end{aligned}$$
(1)

where the first term is the QMT potential, the second term is the ODT potential, and the third term is the gravitational potential. In our case, the symmetry (strong) axes of the QMT and the ODT beam are along the y-axis. Here \(B^{\prime}\) is the magnetic field gradient along the weak axis of the QMT, \(z_0\) is the vertical displacement of the QMT with respect to the ODT (\(z_0>0\) means that the ODT is placed below the QMT center), P is the power of the ODT beam, and \(C=\alpha _{\mathrm{pol}}/2\epsilon _0 c\) is a constant proportional to the polarizability \(\alpha _{\mathrm{pol}}\) depending on the atomic species and used wavelength \(\lambda \), \(w(y)=w_{0}\sqrt{1+y^2/y_{R}^2}\), where \(w_{0}\) and \(y_{R}=\pi w_{0}^2/\lambda \) are the beam waist (\(1/e^2\) radius) and the Rayleigh length, respectively. We use \(\lambda =1557\) nm at which \(C=1.88\times 10^{-36}\,\hbox {J}/(\hbox {W\,m}^{-2}\)) [25].

For HT, the magnetic force in the vertical direction should be equal or smaller than gravity, which means in our geometry that \(B^{\prime}\le B^{\prime}_{\mathrm{lev}}\). For \(B^{\prime} = B^{\prime}_{\mathrm{lev}}\), the trap depth is always given by \(U_0=2PC/(\pi w_0^2)\), whereas for \(B^{\prime} < B^{\prime}_{\mathrm{lev}}\) (or \(B^{\prime}=0\) for a pure ODT) gravity leads to a reduction in the trap depth, which also depends on P, but this starts to be significant only for \(P<100\) mW. The radial confinement is dominated by the ODT potential. For \(^4\hbox {He}^*\), the levitation gradient \(B^{\prime}_{\mathrm{lev}}=0.35\) G/cm is very small, due to the small mass and relatively large magnetic moment \(\mu =2\mu _B\), where \(\mu _B\) is the Bohr magneton. For comparison, the values for \(^{87}\hbox {Rb}\) are 15 or 30 G/cm, depending on the Zeeman state. Therefore, the additional axial confinement provided by the QMT, compared to the pure ODT, is limited in the HT, and only for low ODT power P the axial confinement is dominated by the QMT. In Fig. 1a–d, we show the trapping potentials in both the radial and axial direction, for two different ODT powers, and \(z_0=0.1\), 1 and 10 mm. Clearly, for \(P=3\) W the axial confinement of the ODT is still significant, while for \(P=0.3\) W the axial trapping potential is dominated by the QMT for sufficiently small \(z_0\).

The peak density \(n_0\) of a thermal sample is obtained from numerically solving the integral \(\int {\exp \left[ -U(\mathbf {r})/k_B T\right] {\mathrm{d}}\mathbf {r}}=N/n_0\), where \(U(\mathbf {r})\) is the trapping potential (Eq. 1). A comparison between the HT (for \(z_0=0.1\), 1 and 10 mm) and ODT is given in Fig. 1e, showing \(n_0/N\) as function of the ODT power P (and T), assuming that T is determined by the trap depth according to \(U_0=\eta k_B T\) (and therefore \(T\propto P\)), taking a typical value of \(\eta =10\). Indeed, only for low P the peak density \(n_0\) in the HT is significantly higher than that of the ODT. Also the dependence on \(z_0\) becomes more prominent for low P. For heavier atoms, like \(^{87}\hbox {Rb}\), this regime is already reached at ODT powers well above 1 W.

The small \(B^{\prime}_{\mathrm{lev}}\) limits the confinement also in an indirect way. Any stray magnetic field \(B_{\mathrm{offset}}\) will shift the center of the QMT and therefore affects the displacement with respect to the ODT center, such that \(z_0'=z_0+B_{\mathrm{offset}}^{z}/B^{\prime}\). Those stray magnetic fields can be compensated by additional bias fields; however, magnetic field fluctuations translate in a jitter of the QMT center, which limits the smallest displacement that can be chosen for which the atoms do not experience a magnetic field zero. At \(B^{\prime}=B^{\prime}_{\mathrm{lev}}\) magnetic field fluctuations on the order of 10 mG will already give a jitter of 0.3 mm in the location of the QMT. Furthermore, even in the absence of magnetic field fluctuations, the magnetic field offset at the location of the potential minimum of the HT, \(B_0=B^{\prime} z_0\), has to be sufficiently large to provide a well-defined quantization axis, in order to suppress Majorana spin-flips and Penning ionization. For the heavier atoms, this problem is much less severe. For instance, for \(^{87}\hbox {Rb}\) in the \(F=2\), \(m_F=2\) state a small displacement of about a waist, say \(z_0=50\,\upmu\hbox{m}\), already gives an offset field of 75 mG, and a magnetic field fluctuation of 10 mG in the vertical direction only gives a jitter of 6 \(\upmu\hbox{m}\) in the displacement.

3 Experimental setup

The main part of our experimental setup has been described in Ref. [26], while our HT (applied to \(^{87}\hbox {Rb}\)) has been discussed in Ref. [3]. Here we will summarize the most important features and mention changes made compared to previous work. A schematic of the HT is given in Fig. 2, showing the QMT coils, ODT beam, absorption imaging beam and micro-channel plate (MCP) detector. The axial direction of the QMT, the ODT beam and the absorption imaging beam are in the horizontal (xy) plane. Absorption imaging is used to obtain information about the atom number and temperature, as well as the position of the QMT and ODT. We use an InGaAs camera (Xenics) with 30 \(\upmu\hbox{m}\) pixel size, and our imaging setup has a magnification of 0.5.

Fig. 2
figure 2

Schematics of our experimental setup, a showing the angles between the QMT axis, ODT beam and absorption imaging beam in the xy (horizontal) plane and b the location of the MCP detector and deflection coil in the xz plane

We also record time-of-flight (TOF) spectra using a MCP detector, which is placed at a distance of 106 mm from the trap center, under an angle of \(22^\circ \) with respect to the direction of gravity, and has a diameter of 15 mm. For temperatures below 10 \(\upmu\hbox{K}\), the ballistically expanding cloud, after switching off the trap, would not hit the MCP detector during its free fall. Therefore, a short (10 ms) magnetic field gradient pulse is applied using a single “deflection” coil (see Fig. 2) to direct the atoms toward the MCP after release from the trap. The TOF spectra have a better resolution regarding temperature, and observing BEC is much easier compared to expansion measurements with our absorption imaging system. However, the deflection field affects the TOF distribution and we need to use absorption imaging to calibrate the MCP detection regarding atom number and temperature.

Our single-beam ODT has a waist \(w_0=39.8\pm 0.3\,\upmu\hbox{m}\) (corresponding to a Rayleigh length of \(y_R=3.2\) mm), and the maximum power available at the setup is about \(P=3\) W, resulting in a maximum trap depth of \(U_0=k_B\times 160\,\upmu\hbox{K}\). We control the ODT power by an acousto-optical modulator (AOM), for which the output is coupled into a single-mode fiber and sent to the experimental setup. The ODT beam enters the setup under an angle of \(11^\circ \) with respect to the QMT axis (see Fig. 2a), which leads to a reduction in the axial magnetic field gradient by a factor of \(1-\sin (11^\circ )/2\approx 0.90\), but does not affect the vertical magnetic field gradient.

Fig. 3
figure 3

Overview of our experimental scheme for loading of and evaporation in the hybrid trap, showing the QMT gradient \(B^{\prime}\), radiofrequency \(\nu _{\mathrm{RF}}\) and ODT power P

An overview of our experimental scheme, in particular the magnetic field gradient \(B^{\prime}\), RF-frequency \(\nu _{\mathrm{RF}}\) and ODT power P, is shown in Fig. 3. We use a liquid-nitrogen-cooled DC-discharge source to produce a \(^4\hbox {He}^*\) beam, which is collimated and subsequently decelerated in a 2.5-m-long Zeeman slower and loaded into a magneto-optical trap (MOT). An in vacuo shutter after the collimation section is opened only during the loading time of the MOT. The pressure in the main vacuum chamber is \(1\times 10^{-10}\) mbar. We load about \(5\times 10^8\) \(^4\hbox {He}^*\) atoms at a temperature of about 1 mK within 1.5 s in the MOT, which consists of three retroreflected 2-inch laser beams at 1083 nm with a total power of ~45 mW and a large detuning of −33 MHz (21 linewidths), and a magnetic field gradient (weak axis) of 10 G/cm. Compared to Ref. [26], we have improved our atom number by increasing the MOT beam diameter from 1 to 2 inches. Afterward, we compress the cloud (the “cMOT” stage) by ramping down the detuning to −6 MHz in 10 ms, during which the power is reduced by a factor of ten, while keeping the same magnetic field gradient. After this cMOT stage, we end up with \(3\times 10^8\) atoms at a temperature of 260(10) \(\upmu\hbox{K}\). Before loading in the QMT, we optically pump (OP) the atoms into the \(m=+1\) magnetic trappable state in 0.5 ms, during which the magnetic field gradient is switched off. Then we switch on abruptly the QMT at \(B^{\prime}=48\) G/cm and stay for 100 ms and then ramp in 1 s to \(B^{\prime}=120\,\hbox {G/cm}\). At this point, we have about \(1\times 10^8\) atoms at a temperature of about 1 mK.

Fig. 4
figure 4

Lifetime in the QMT, before RF-cooling (red squares) and after RF-cooling (blue circles). Before RF-cooling, we observe a slow, exponential decay of the number of trapped atoms, where the lifetime is limited by background collisions. After RF-cooling, we observe a fast, non-exponential decay. Lines are the result of a loss model that includes Majorana loss and heating, fitted to the data

After loading in the QMT, we apply RF-induced forced evaporative cooling, resulting in an effective trap depth \(U_0^{\mathrm{eff}}=h\nu _{\mathrm{RF}}\). We generate the RF-frequency \(\nu _{\mathrm{RF}}\) by frequency doubling the output of a tunable 80-MHz function generator. After several amplification stages, we send up to 5 W of RF power to a coil that is placed inside the vacuum chamber. We ramp down \(\nu _{\mathrm{RF}}\) from 140 to 18 MHz in 4 s, corresponding to a final trap depth of \(k_B\times 0.9\) mK. At this point, the lifetime of the trapped atoms is only a few seconds, caused by Majorana spin-flips, which has a loss rate of \(\varGamma _{\mathrm{M}}=\chi (\hbar /m)(2\mu B^{\prime}/k_B T)^2\) [5]. In Fig. 4, we show the lifetime of the trapped atoms in the QMT, before and after RF-cooling, where we have fitted the data with a loss model that includes Majorana loss and heating [5, 24]. The lifetime data after RF-cooling are consistent with a temperature of 150–220 μK for a \(\chi \) factor between 0.1 and 0.2 [5, 24]. To load the HT, we switch on the ODT light at maximum power and ramp down the QMT gradient from \(B^{\prime}=120\) G/cm to \(B^{\prime}=0.33\) G/cm (just below \(B^{\prime}_{\mathrm{lev}}\)) in 220 ms. During the ramp down, \(\nu _{\mathrm{RF}}\) is set at 9 MHz. Afterward, an additional 200 ms is used to fine-adjust the bias fields to control the displacement (see Sect. 4.2). For loading of a pure ODT, we switch off the QMT gradient during this last stage, while switching on a bias magnetic field in the axial direction to provide a quantization axis in order to keep the atoms spin-polarized.

4 Results

4.1 Loading of HT or ODT

For the alignment of the ODT beam, we use a piezo-mirror to scan in both the horizontal (x-axis) and vertical (z-axis) direction and monitor the number of atoms loaded into the HT or ODT. Typical measurements are shown in Fig. 5a, b. In both directions, one finds a minimum when the ODT is located at the center of the QMT, which is due to Majorana loss, and the width of the loss feature is on the order of the waist. We position our ODT about 60 \(\upmu\hbox{m}\) below the QMT center. We obtain up to \(1\times 10^6\) atoms at a temperature of 14 \(\upmu\hbox{K}\) for both HT and ODT. The transfer efficiency from the QMT to the HT or ODT is about 5–10 %.

Fig. 5
figure 5

Number of atoms loaded in the hybrid trap at \(P=2.8\) W as function of position of the ODT (using a piezo-mirror) with respect to the QMT center in the vertical direction (a) and horizontal direction (b). The gray data in (b) show a scan in the horizontal, while the vertical position is 60 \(\upmu\hbox{m}\) below the QMT center. c Number of atoms loaded in the hybrid trap as function of ODT power P, with vertical displacement of 60 \(\upmu\hbox{m}\)

We have measured the number of loaded atoms for different initial ODT powers in order to investigate to what extent we are limited by our maximum ODT power of 2.8 W. The results are shown in Fig. 5c. The number of loaded atoms does not fully saturate, meaning that the number of transferred atoms is limited by our ODT power. At the maximum power, we measure a 1/e trapping lifetime of more than 20 s of the HT and pure ODT, at which the calculated off-resonant photon scattering at 1557 nm is \((6\,\hbox {s})^{-1}\), but the recoil temperature of 2 \(\upmu\hbox{K}\) is much smaller than the trap depth.

4.2 Control of QMT displacement in HT

During the QMT stage, magnetic bias fields are set to minimize stray magnetic fields at the center of the trap and we use the piezo-mirror to adjust the displacement of the ODT with respect to the QMT center. However, in the HT this displacement becomes very sensitive to bias fields, which provides a much broader tuning range (up to a few cm) compared to the piezo-mirror (limited to 0.3 mm). Therefore, once the atoms are loaded in the HT, we fine-adjust these magnetic bias fields to set the displacement of the QMT center with respect to the ODT, while keeping the piezo-mirror at the optimal loading condition (see Fig. 5a, b). In scanning the bias fields in the x- and z-direction, we do observe loss resonances at which atoms are lost on the timescale of 100 ms. Again, those losses are due to Majorana spin-flips in which the atoms can leave the HT in the axial direction. In Fig. 6, we show the number of atoms in the HT as function of vertical bias field after a hold time of 300 ms, in which we jump to the field value from below and above the resonance position. The asymmetry of the loss resonances is due to the slow response of the magnetic field coils. However, by fitting Gaussian distributions to the two scans, in which the slow rising part of the data is omitted, the centers and actual widths can be determined, where the actual center is taken as the average of the two centers. We find a \(1/e^2\) width of 5 mG and the center is reproducible within 3 mG on a day-to-day basis.

While the center of the loss resonance fixes \(B_z=0\), the displacement is simply given by \(z_0=B_z/B^{\prime}\). Here \(B^{\prime}=0.33\) G/cm, which means that the width of 5 mG already corresponds to 0.15 mm. This width is much larger than observed in scanning the ODT by means of the piezo-mirror to find the optimum loading conditions (see Fig. 5), where the width is on the order of the waist. This can be explained by assuming that the atoms are transferred from the QMT to the HT already at \(B^{\prime}\gg B^{\prime}_{\mathrm{lev}}\), where the sensitivity to magnetic bias fields is much less. It also means that the displacement assigned in the measurement of Fig. 5 only holds for the loading, not the final HT. While the piezo-mirror is optimized on the loading, we use the magnetic bias fields to set the displacement of the final HT, which also covers a much broader range. In the x-direction, we set the bias field “on resonance” (\(x_0=0\)) and control the displacement with the vertical bias field. In order not be effected by Majorana spin-flips, a displacement larger than 0.3 mm has to be chosen, which limits the axial confinement and the peak density (see Fig. 1), and the displacement jitters by 0.15 mm. Again, such constraints are essentially not present for the heavier atoms, in which the displacement can be chosen to be on the order of a waist and the jitter is much smaller than the waist.

Fig. 6
figure 6

Number of atoms in the hybrid trap as function of an applied vertical bias field that controls the positions of the QMT center with respect to the ODT. After the vertical bias field is set to its final value, the total hold time is 300 ms, during which the ODT power is ramped down from \(P=2.8\) W to 2.2 W. The loss resonance occurs due to Majorana spin-flips when the center of the QMT is overlapped with the ODT. An asymmetry and broadening of the loss feature are caused by the slow response of the magnetic field coil. Two sets of data are shown, for which we ramp up from a value just below the resonance (blue squares) or ramp down from above the resonance (green circles). The solid lines are Gaussian fits taking into account only the fast slopes. The resonance position is determined by the average of the two Gaussian centers. With the magnetic field gradient of 0.33 G/cm in the vertical direction, the vertical bias field can be converted into vertical displacement using \(z_0=B_z/B^{\prime}\)

4.3 Evaporative cooling in HT and ODT

After loading the HT or ODT, we perform forced evaporative cooling to lower the temperature and increase the phase-space density by ramping down the ODT power. A sample of TOF spectra obtained by the MCP detector is shown in Fig. 7. In the regime where the kinetic energy is much smaller than the gravitational energy, i.e., \(k_B T \ll m g h\), where h is the height difference between the trap and the detector, the TOF spectrum of a thermal sample is described by a Gaussian distribution with a width that is proportional to the square root of the temperature. Indeed, for lower ODT power we observe a narrowing of the distribution, but also an increase in the signal, which is due to the finite size of the MCP detector. For a thermalized sample, the temperature is proportional to the trap depth \(U_0=2PC/(\pi w_0^2)\) via the truncation parameter \(\eta =U_0/k_B T\). Indeed, we find the width to be proportional to the square root of the ODT power P. By fitting the relation \(\sigma =a\sqrt{P}+b\) to the data, we find a small offset b of about 1 ms, which is probably caused by the magnetic field gradient pulse that we apply after release from the trap to direct the atoms toward the MCP. Before converting the width into temperature, we correct for this offset. Absolute calibration of the temperature is done by absorption imaging at \(P=2.8\) W, at which we have measured \(13.9\pm 0.2\,\upmu\hbox{K}\). The result is shown in the inset of Fig. 7, in which the data show a linear behavior for the full range of ODT powers. From the slope, one directly obtains \(\eta \), which turns out to be 11 for both HT and ODT.

Fig. 7
figure 7

Forced evaporative cooling in the pure ODT, showing TOF spectra obtained by the MCP detector for different final ODT powers. The inset shows the temperature, obtained from the width of the TOF spectra, as function of final ODT powers (dashed line is a linear fit), from which a truncation parameter of \(\eta =U_0/k_B T=11\) is obtained. Each TOF spectra is an average over five experimental runs

First we investigate the timescale of forced evaporative cooling, comparing HT (\(z_0=1.5\) mm) and ODT. We note that the initial conditions are the same in terms of temperature (14 \(\upmu\hbox{K}\)) and atom number (\(1\times 10^6\)), while the peak density and the collision rate are slightly higher for the HT (\(2.1\times 10^{12}\,\hbox {cm}^{-3}\) and 410 \(\hbox {s}^{-1}\) for HT; \(1.7\times 10^{12}\,\hbox {cm}^{-3}\) and 320 \(\hbox {s}^{-1}\) for ODT). We ramp down the ODT power from \(P=2.8\) W to 0.26 W for variable ramp times. A sample of TOF spectra is shown in Fig. 8a, for the HT and ODT (inset). Two striking observations can be made: first of all, the timescale for thermalization is much shorter for the ODT than the HT, and secondly, a shoulder on the left side of the TOF peak appears for short evaporation ramp times in the HT, corresponding to “hot atoms” that remain trapped. In Fig. 8a, we show the temperature, obtained by a fit of the main TOF peak, as function of evaporation ramp time. The initial temperature at \(P=2.8\) W is 14 \(\upmu\hbox{K}\), while the final temperature at \(P=0.26\) W is 1.3 \(\upmu\hbox{K}\). For the ODT, a temperature of 1.5 \(\upmu\hbox{K}\) is reached in 100 ms, which for the HT it takes about 3 s. We have observed similar behavior for a HT at a much larger displacement of \(z_0=15\) mm, and a HT at about half the levitation gradient (\(B^{\prime}=0.16\) G/cm, \(z_0=1.2\) mm).

We explain these observations by the dimensionality of evaporation: while in the ODT, the atoms can escape in all directions; in the HT, they can only escape downwards. Therefore, the removal of “hot atoms” takes much longer, even though the collision rate in the HT at \(P=0.26\) W is about twice that of the ODT (\(1.5\times 10^{12}\,\hbox {cm}^{-3}\) compared to \(7\times 10^{11}\,\hbox {cm}^{-3}\)). The required evaporation ramp time for the ODT compares quite well with a calculated thermalization rate at \(P=0.26\) W of \((70\,\hbox {ms})^{-1}\), while the axial trap frequency is \(2\pi \times 12\) Hz. The longest timescale of the HT is the axial trap frequency of \(2\pi \times 27\) Hz.

Fig. 8
figure 8

Forced evaporative cooling in HT (with \(z_0=1.5\) mm) and ODT, ramping down the ODT power from \(P=2.8\) W to 0.26 W. a TOF spectra for HT and ODT (inset) for different evaporation ramp times (each TOF spectrum is an average over 2 to 4 experimental runs). b Temperature as function of ramp time for HT and ODT. The initial temperature at \(P=2.8\) W is 14 \(\upmu\hbox{K}\)

After having determined the appropriate ramp times (also for lower final ODT powers), we study the efficiency of evaporation by measuring the atom number and temperature for different final ODT powers. In Fig. 9a, we show the temperature as function of atom number in the HT for two displacements, \(z_0=0.6\) mm and \(z_0=1.5\) mm, and the ODT. Here the efficiency is typically quantified as \(\alpha ={\mathrm{d}}[\log T]/{\mathrm{d}}[\log N]\), and we observe a \(\alpha \) parameter between 2 and 3. For the same final temperature, we obtain the highest atom number for ODT and the lowest one for the HT with the smallest displacement.

Fig. 9
figure 9

Evaporative cooling in the HT \(z_0=0.6\) mm and \(z_0=1.5\) mm and ODT, showing (a) the temperature and (b) phase-space density as function the atom number. The gray dashed lines indicate the efficiency in terms of \(\alpha ={\mathrm{d}}[\log T]/{\mathrm{d}}[\log N]\) and \(\gamma =-{\mathrm{d}}[\log D]/{\mathrm{d}}[\log N]\) parameters

In Fig. 9b, we present the phase-space density \(D=n_0\lambda _{\mathrm{dB}}^3\) (where \(\lambda _{\mathrm{dB}}=h/\sqrt{2\pi m k_B T}\) is the de Broglie wavelength) as function of atom number, for which the peak density \(n_0\) is calculated using numerical integration of Eq. 1 (see Sect. 2). Here the efficiency is typically quantified as \(\gamma =-\mathrm{d}[\log D]/\mathrm{d}[\log N]\), and we observe a \(\gamma \) parameter between 2 and 3. We reach the onset of BEC (\(D>1\)) in all three cases, with the most atoms of \(2\times 10^5\) for the HT with \(z_0=1.5\) mm, showing a slightly better performance than the one with \(z_0=0.6\) mm. But even for the single-beam ODT, we obtain efficient evaporation up to \(D=1\) with \(\gamma >2\), which is probably explained by the sufficiently large axial trap frequencies of at least a few Hz.

The appearance of BEC can be observed in the TOF spectra obtained by MCP detection, as it provides a distinct TOF distribution described by an inverted parabola,

$$\begin{aligned} \varPhi _{\mathrm{BEC}}(t)\propto \max \left[ 0,1-\left( \frac{t-t_0}{\sigma }\right) ^2\right] ^2, \end{aligned}$$
(2)

where the width \(\sigma \) is given by the chemical potential [14, 17, 18]. Compared to the Gaussian distribution associated with a thermal sample, a BEC gives rise to much sharper wings of the distribution. In our case, the magnetic field gradient pulse, which is required to direct the atoms toward the MCP after release from the trap, affects the narrow BEC TOF peak, and we do not obtain clear bimodal distributions in which the thermal and BEC part can easily be distinguished. Nonetheless, for the lowest final ODT powers we do observe a sharpening of the wings of the TOF distribution. In Fig. 10a, we show a series of TOF spectra of the HT with \(z_0=1.5\) mm for decreasing ODT power, for which \(D>1\). The narrowing of the TOF distributions can be explained by both a decreasing temperature and a decreasing chemical potential, due to reduction in trap frequencies. In Fig. 10b, c, we show pure thermal and BEC fits to the situation of (b) \(D\approx 1\) (c) and \(D\gg 1\). Indeed, the BEC distribution describes the TOF spectrum of panel (c) quite well, much better than the thermal fit. From this observation, we estimate that we have reached a pure BEC of \(5\times 10^4\) atoms.

Fig. 10
figure 10

a Normalized TOF spectra of the HT with \(z_0=1.5\) mm for different final ODT powers at which \(D>1\), showing a sharpening of the wings of the distributions. b, c TOF spectra for \(P=173\) mW and \(P=15\) mW, for which \(D\approx 1\) and \(D\gg 1\), respectively, together with Gaussian fits (red) and BEC fits (blue, Eq. 2). Each TOF spectrum is an average over five experimental runs. The shot-to-shot fluctuation in the signal height is less than 5 %

5 Conclusions

We have achieved Bose–Einstein condensation (BEC) of metastable triplet helium atoms via RF-induced evaporative cooling in a quadrupole magnetic trap, transferred to a single-beam hybrid trap, and subsequent evaporative cooling in both the hybrid trap and pure optical dipole trap, using only moderate ODT power of less than 3 W. While evaporation in the pure ODT is much faster than that in the HT, a larger BEC is obtained in the HT. We reach the onset of BEC with \(2\times 10^5\) atoms and a pure BEC of \(5\times 10^4\) atoms. We observe small shot-to-shot fluctuations (less than 5 %) and excellent day-to-day reproducibility. The total experimental cycle duration is between 8 and 10 s. This all could be improved by the implementation of a bright molasses [27] and gray molasses [24] before loading the QMT, which would provide better initial conditions of RF-induced forced evaporative cooling in the QMT and therefore shorten the duration of that stage, and finally lead to a larger BEC. Our present work provides the most simple scheme so far to obtain a \(^4\hbox {He}^*\) BEC, using limited experimental infrastructure. It also shows that a single-beam hybrid trap can be applied for a light atom such as helium, despite several challenges caused by its small mass and small levitation gradient.