1 Introduction

Whispering gallery mode (WGM) resonators are widely used for a number of applications, such as frequency comb generation [1, 2], laser stabilization [35] and sensing [6, 7]. The high optical quality factor (Q-factor) and relatively small mode volume of whispering gallery resonators (WGRs) render the modes very sensitive to subtle environmental changes. Until now, WGRs have been used to measure changes in a number of parameters such as refractive index [8, 9], temperature [1012], pressure [13, 14] and stress [15, 16]. Aside from parameter change detection, ultrahigh Q resonators have also been used to detect nanoparticles [17, 18] and single viruses [19, 20]. The mechanism behind ultrahigh sensitivity sensing in WGRs is based on a reactive (i.e. dispersive) frequency shift of the whispering gallery modes [19] as a result of perturbations that may be present. Alternatively, a perturbation may increase the optical linewidth of the WGM by introducing more dissipation [21, 22], or may change the back-scattering strength [23] and subsequent mode splitting if modal coupling is present [17, 20]. The optomechanical properties of WGRs can also be used for force [24] or viscosity sensing [25].

Currently, in order to retrieve the dispersive, dissipative and mode splitting information, the transmission spectrum of a WGR through an externally coupled waveguide, such as a tapered optical fibre, is usually measured. Light from a tunable laser source is coupled into the tapered fibre and the transmission is monitored. Low powers are used in order to minimise thermal and nonlinear effects on the whispering gallery modes. By sweeping the laser frequency, the transmission spectrum through the fibre can be recorded. Any changes to the frequency, mode splitting or linewidth are used to monitor perturbations induced by the physical parameter that is being sensed. During measurements, the transmission spectrum represents a steady state of the coupled system; therefore, time response of the sensor [2628] is related to the lifetime of the resonator which limits the scanning speed. For a WGR with an optical Q-factor \({>}2\times 10^7\), even when the pumping laser is shut off immediately, there is still light intensity remaining in the cavity due to the long lifetime of the high Q cavity. A transient ringing effect [29, 30] is observable even if the laser is scanned as quickly as 25 MHz/μs [31]. The ringing spectrum can be used to distinguish between the overcoupled and undercoupled cases [32]. When the scanning speed is faster than the character speed, as defined in [28], the steady-state treatment can no longer be used to describe the coupled mode system. Therefore, by recording lineshape changes in the ringing tail of an observed transmission spectrum, either by (1) a scanning probe laser or (2) a fixed laser in resonance with a high Q mode, transient sensing should be possible; this would allow us to not only transiently detect the existence of a nanoparticle but also provide information such as speed or size of the particle [28]. A proof-of-principle experiment based on the ringing phenomenon has recently been reported [33].

Another possible approach is to send light pulses, which are far detuned from a WGM resonance, through the optical coupler. The retrieved signal on the coupler’s output shows an oscillatory lineshape similar to that in [28]; this effect is termed cavity ring-up spectroscopy (CRUS) [27] and the rising edge of the light pulse leads to transient broadening. Even though the light is far detuned from the WGM, a fraction can still be coupled into the cavity if the broadening is much larger than the detuning. The system is not affected by thermal or nonlinear processes which may arise due to the ultrahigh Q of the mode. The ringing effect occurs within the lifetime of the WGM and, therefore, can be used for ultrafast sensing. The transient capability of CRUS has already been demonstrated by measuring the time response for thermo-refractive effects, Kerr nonlinearity and optomechanical vibrations [27].

To date, there has been no thorough theoretical investigation of CRUS and details, such as the influence of the pulse’s rise time on the observed spectra, are relatively unknown. In this manuscript, we use coupled mode theory to solve the related differential equations without relying on the steady-state assumption. An approximate analytical solution is obtained and compared to a precise numerical transient solution. The theoretical results fit well to experimental data that we obtained for a silica microsphere resonator. The influences of the pulse rise time, the coupling condition and the detuning on the ring-up spectrum are given. This provides a solid foundation for future applications in transient sensing.

2 Coupled mode theory

Fig. 1
figure 1

a Schematic of a taper coupled WGR system for transient sensing using CRUS. A pump laser of frequency \(\omega _{\mathrm{L}}\), far off resonance with the WGM, \(\omega _0\), is coupled through the tapered fibre with a temporal profile, \(S_{\mathrm{in}}(t)\). The pulse profile is depicted in (b); the laser pulse starts at a time \(\tau _0\) and rises up to its maximum within a time \(t_{\mathrm{r}}\). The mathematical description of the pulse is given in Eq. 3. ce Transient frequency of the laser pulse for different time intervals. c \(t=[0, \tau _0]\); d \(t=[\tau _0,\tau _0+t_{\mathrm{r}}]\); e \(t=[\tau _0+t_{\mathrm{r}},+\infty\)). At the rising edge of the pulse, the laser source is transiently broadened (d), so a fraction of the pump signal couples to the WGM and contributes to a beat signal between it and the pump source (e)

A typical CRUS setup is shown in Fig. 1a. The WGR is coupled evanescently to a tapered optical fibre through which light from a laser propagates. The light couples into the resonator and is monitored at the opposite end of the fibre. The coupling dynamics can be described using coupled mode theory. The amplitude of the intracavity electromagnetic field, a(t), changes in time according to the following [28, 34, 35]

$$\frac{{\hbox {d}}a(t)}{{\hbox {d}}t}=-j\omega _0a(t)-(\kappa _{\mathrm{e}}+\kappa _0)a(t)+\sqrt{2\kappa _{\mathrm{e}}}S_{\mathrm{in}}(t),$$
(1)

where \(j = \sqrt{-1}\), the resonant frequency of the WGM is \(\omega _0\), and \(\kappa _{\mathrm{e}}\) and \(\kappa _0\) represent the external and intrinsic coupling rates, respectively. The total damping rate of the cavity is given by \(\kappa =\kappa _{\mathrm{e}}+\kappa _0\) and \(\tau =1/\kappa\) is the intracavity lifetime. In order to arrive at the transient response of the WGR, the laser light is pulsed with a temporal profile, \(S_{\mathrm{in}}(t)\). The laser frequency, \(\omega _{\mathrm{L}}\), is far detuned, i.e. \(\omega _{\mathrm{L}}-\omega _0\gg \kappa\). The pulsed input field can be separated into a slowly varying and a fast varying term so that

$$S_{\mathrm{in}}(t)=a_{\mathrm{in}}(t){\hbox {e}}^{-j\omega _{\mathrm{L}}t}.$$
(2)

Here, the slowly varying part, \(a_{\mathrm{in}}(t)\), represents the temporal profile of the pulse and, in the following discussions, it takes the form of a Guassian function where

$$\begin{aligned} a_{\mathrm{in}}(t)=\left\{ \begin{array}{ll} 0,&{} \quad t<\tau _0 \\ \alpha _{\mathrm{in}}\exp \left( -\frac{4[t-\tau _0-t_{\mathrm{r}}]^2}{\ln 2\cdot t_{\mathrm{r}}^2}\right) , &{} \quad \tau _0<t<\tau _0+t_{\mathrm{r}}\\ \alpha _{\mathrm{in}}, &{} \quad t>\tau _0+t_{\mathrm{r}} \end{array}\right. \end{aligned}$$
(3)

The pulse is illustrated in Fig. 1b. The pulse starts at time \(\tau _0\) and follows a Gaussian profile with a rise time, \(t_{\mathrm{r}}\ll \tau\). At time \(t=\tau _0+t_{\mathrm{r}}\), the total power of the pulse reaches its maximum, \(|\alpha _{\mathrm{in}}|^2\), and, for later times, the laser can be treated as a continuous light source over the lifetime of the cavity mode.

2.1 A simple model: large spectral broadening bandwidth

The temporal profile of \(S_{\mathrm{in}}(t)\) can be obtained from Eqs. 2 and 3 and Fourier expanded as follows:

$$S_{{{\text{in}}}} (t) = \left\{ {\begin{array}{*{20}l} {0,} \hfill & {t < \tau _{0} } \hfill \\ \begin{gathered} \sqrt {\frac{{\ln 2}}{\pi }} \frac{{t_{{\text{r}}} \alpha _{{{\text{in}}}} }}{4} \hfill \\ \, \times \int_{{ - \infty }}^{{ + \infty }} {{\text{e}}^{{ - \frac{{\ln 2t_{{\text{r}}}^{2} \omega ^{2} }}{{16}}}} }, \hfill \\ {\text{e}}^{{ - j[(\omega + \omega _{{\text{L}}} )(t - \tau _{0} )]}} {\text{d}}\omega \hfill \\ \end{gathered} \hfill & {\tau _{0} < t < \tau _{0} + t_{{\text{r}}} } \hfill \\ {\alpha _{{{\text{in}}}} {\text{e}}^{{ - j\omega _{{\text{L}}} t}} ,} \hfill & {t > \tau _{0} + t_{{\text{r}}} } \hfill \\ \end{array} } \right.$$
(4)

It can be seen that, for a time interval \(t\in [\tau _0, \tau _0+t_{\mathrm{r}}]\), the rise time of the pulse induces sideband frequencies even though the laser source is monochromatic. As the Fourier transform of a Gaussian function is a Gaussian, the laser pulse is expanded transiently with a bandwidth, \(B=1/(t_{\mathrm{r}}\ln 2)\).

In our experiments, the pulse has a rise time ranging from ~1 ns to several tens of ps. The laser frequency broadening bandwidth, B, is of the order of GHz, and, for a WGM with \(Q>10^7\), \(\kappa \sim\) MHz. As \(B\gg \kappa\), we can assume that only the portion of the broadened laser source at the resonant frequency, \(\omega _0\), can be efficiently coupled to the WGM. As illustrated in Fig. 1c–e, we assume that the laser is far red-detuned from the resonant frequency, such that \(\omega _{\mathrm{L}}-\omega _0=\Delta \gg \kappa\). The WGM acts as an infinitely narrow band pass filter so that we only need to consider the frequency at \(\omega _0\), see Fig. 2c. Thus, the input signal can be simplified as:

$$S_{\mathrm{in}}(t)\approx \sqrt{\frac{\ln 2}{\pi }}\frac{t_{\mathrm{r}}\alpha _{\mathrm{in}}}{4}{\hbox {e}}^{-\frac{\ln 2t_{\mathrm{r}}^2\Delta ^2}{16}}{\hbox {e}}^{-j\omega _0t}.$$
(5)

Note that, in the above expression, only the last factor is time dependent. The single frequency pulse as defined by Eq. 5 must satisfy Eq. 1. Thence, we get the dynamics of the intracavity amplitude for the transient interval at the rise time and for later times. Equation 1 has a general solution [28]:

$$\begin{aligned} a(t)&=\sqrt{2\kappa _{\mathrm{e}}}a_{\mathrm{in}}{\hbox {e}}^{j\omega _0t-\kappa t} \left[ \frac{\tau }{1+j(\omega _{\mathrm{L}}-\omega _0)\tau } \right. \\&\quad \left. +\int _{\tau _0}^{t}{\hbox {e}}^{j\phi (t')-j\omega _0t'+\kappa t'}{\hbox {d}}t'\right] . \end{aligned}$$
(6)

The complete dynamics can be separated into two steps. First, at the rise time interval, the signal is broadened and light of frequency \(\omega _0\) is coupled into the cavity. As \(t_{\mathrm{r}}\ll \tau\), dissipation during this short time can be ignored and \(\omega _{\mathrm{L}}\) in Eq. 6 can be substituted by \(\omega _0\). If we consider the accumulated phase \(\phi (t')=\int _{\tau _0}^{t'}\omega _0(t''){\hbox {d}}t''=\omega _0t'\), then the accumulated amplitude at a time \(t=\tau _0+t_{\mathrm{r}}\) is given by

$$\begin{aligned} a(t)=\frac{\sqrt{2\kappa _{\mathrm{e}}}}{\kappa }\sqrt{\frac{\ln 2}{\pi }}\frac{t_{\mathrm{r}}\alpha _{\mathrm{in}}}{4}{\hbox {e}}^{-\frac{\ln 2t_{\mathrm{r}}^2\Delta ^2}{16}}{\hbox {e}}^{-j\omega _0t},\quad \tau _0<t<t_{\mathrm{r}}. \end{aligned}$$
(7)

Next, as already explained, the frequency returns to \(\omega _{\mathrm{L}}\) and is far detuned. Therefore, for later times, there is no light coupled into the WGM and the system follows simple decay dynamics with a decay rate, \(\kappa\), whose initial value is as in Eq. 7:

$$\begin{aligned} a(t)= \frac{\sqrt{2\kappa _{\mathrm{e}}}}{\kappa }\sqrt{\frac{\ln 2}{\pi }}\frac{t_{\mathrm{r}}\alpha _{\mathrm{in}}}{4}{\hbox {e}}^{-\frac{\ln 2t_{\mathrm{r}}^2\Delta ^2}{16}}{\hbox {e}}^{-j\omega _0t-\kappa t},\quad t>\tau _0+t_{\mathrm{r}}. \end{aligned}$$
(8)

The output signal, \(S_{\mathrm{out}}(t)\), can be calculated using the input–output relationship:

$$S_{\mathrm{out}}(t)=S_{\mathrm{in}}(t)-\sqrt{2\kappa _{\mathrm{e}}}a(t).$$
(9)

Here, we are interested in the output signal after the pulse reaches its maximum value. At this point the input signal changes to a continuous state and the broadening vanishes, yielding \(S_{\mathrm{in}}(t)=\alpha _{\mathrm{in}}{\hbox {e}}^{-j\omega _{\mathrm{L}}t}\). It can be seen from Eqs. 8 and 9 that the final signal is a superposition of the laser signal at \(\omega _0\), which is the residue of intracavity photons leaking out after the rise time, and \(\omega _{\mathrm{L}}\) of the pumping signal. Therefore, it gives transient transmission, T(t), at a time after the rise up of the pulse as follows:

$$\begin{aligned} T(t)=\left|\frac{S_{\mathrm{out}}}{S_{\mathrm{in}}}\right|^2=1+\frac{\kappa _{\mathrm{e}}}{\kappa }\sqrt{\frac{\ln 2}{\pi }}{\hbox {e}}^{-\frac{\ln 2t_{\mathrm{r}}^2\Delta ^2}{16}}{\hbox {e}}^{-\kappa t}\sin {(\Delta (t-\tau _0-t_{\mathrm{r}}))}. \end{aligned}$$
(10)

A typical calculated waveform is shown in Fig. 2a. There are three values that can be retrieved from the CRUS signal: peak height, oscillation period, and the decay rate. From Eq. 10, the oscillation frequency is the detuning, \(\Delta\) and the decay rate is the lifetime, \(\tau\). Note that this is the loaded Q-factor of the system. The peak height is more complicated as it is related to both the dissipation rate and the detuning. This will be discussed in more detail later.

Fig. 2
figure 2

a A typical CRUS signal. The curve in the grey rectangle is the beat signal deduced from Eq. 10. This is due to the frequency broadening at the rising edge of the input signal. We can treat the WGM as an ultra-narrow filter; therefore, only the on-resonance, \(\omega =\omega _0\), component in the broadened pump laser signal can be stored in the WGM as shown in (c). b The CRUS signal when there is modal coupling with coupling strength, g. The beating is modulated by a cosine envelope of period \((2g)^{-1}\). This is due to the beating between the two normal standing wave modes, with resonant frequencies of \(\omega _1\) and \(\omega _2\), as shown in (d)

2.2 Transient response during the rising edge

From the above discussion, we assume that the phase accumulation during the rise time can be ignored. This is not valid when considering a large value for the detuning. As shown in Eq. 10, the frequency of the obtained transient signal is determined by the detuning of the laser source from the resonant frequency. At large detuning, one period of oscillation is almost as long as the rise time; hence, the phase accumulated during the rise time can influence the output signal. This distorts the first few periods of the beat signal. To calculate this, we should use the full form of Eq.  7 at the rise time.

The output signal, \(S_{\mathrm{out}}(t)\), can be determined from Eq. 9, with the input amplitude defined in Eq. 3. Therefore, a beating frequency equalling \(\Delta =\omega _{\mathrm{L}}-\omega _0\) can be achieved during the rise time :

$$\begin{aligned} T(t)\approx \left[ 1+\sqrt{\frac{\ln 2}{\pi }}\frac{\kappa _{\mathrm{e}}}{2\kappa }{\hbox {e}}^{-\kappa (t-\tau _0)}\sin {(\Delta (t-\tau _0))}\right] {\hbox {e}}^{-\frac{4(t-\tau _0-t_{\mathrm{r}})^2}{\ln 2\cdot t_{\mathrm{r}}^2}}. \end{aligned}$$
(11)

The signal which arrives at the detector can be separated into two components. The first part has the same form as that in Eq. 3, i.e. the rising part of the retrieved signal maintains a Gaussian profile and the rise time is unchanged. The second part comes from the contribution of the intracavity amplitude and takes the form of a trigonometric function with a period \(2\pi /\Delta\). Since this component only occurs for a time interval, \(t_{\mathrm{r}}\), if \(\Delta\) is small it follows that \(\sin {(\Delta (t-\tau _0))}\approx 0\) and the rising edge waveform will not be disturbed.

2.3 The mode mixing case

In practice, multiple WGM resonances may fall within the transient spectral broadening bandwidth. In the following, the case in which two modes mix will be investigated. There are two different types of mode mixing that we consider. The first situation is that where two modes exist quite close to each other in the spectrum. In the following discussion, we assume that both modes lie within the transiently broadened spectrum bandwidth, B, while satisfying the narrow band pass filter approximation made in Sect. 2.1. Defining the two modes as \(\omega _1\) and \(\omega _2\), with a separation between them of \(\delta\), the coupled mode equations for each mode are given by

$$\begin{aligned} \frac{{\hbox {d}}a_i}{{\hbox {d}}t}=-j\omega _ia_i(t)-(\kappa _{0,i}+\kappa _{{\mathrm{e}},i})a_i(t)+\sqrt{2\kappa _{\mathrm{e}}}a_{{\mathrm{in}},i}(t){\hbox {e}}^{-j\omega _it}, \end{aligned}$$
(12)

where i = 1, 2 and represents the two modes. Since there is no spectral overlap of the two modes, the mode equations are decoupled and can be solved separately. This yields the same expressions as in Eq. 8 and each mode beats with \(S_{\mathrm{in}}(t)\) separately. The total beat signal, \(T_{\mathrm{b}}(t)\), is

$$\begin{aligned} T_{\mathrm{b}}(t)=\sum _{i=1,2}T_{i,{\mathrm{b}}}(t)=\sum _{i=1,2}\frac{\kappa _{{\mathrm{e}},i}}{\kappa _i}\sqrt{\frac{\ln 2}{\pi }}{\hbox {e}}^{-\frac{\ln 2t_{\mathrm{r}}^2\Delta _i^2}{16}}{\hbox {e}}^{-\kappa _it}\sin {(\Delta _it)}. \end{aligned}$$
(13)

\(T_{\mathrm{b}}(t)\) is a linear combination of the two individual beat signals. The amplitudes of the signals with different frequencies are determined from the detunings, \(\Delta _i\), of the resonances relative to the laser source. If \(\delta\) is large, then one of the beat signals will be much larger than the other since \(T_{\mathrm{b},i}\) follows the Gaussian relationship with the detuning. In this case, the weaker signal can be ignored. In contrast, if the two modes are close to each other so that \(\delta \ll \Delta _i\), then, for simplicity, we set \(\Delta _1\approx \Delta _2=\Delta\) and the coupling rates are approximately equal. From Eq. 13, the total beat signal, \(T_{\mathrm{b}}(t)\), is (\(T(t)=1+T_{\mathrm{b}}(t)\)):

$$\begin{aligned} T_{\mathrm{b}}(t)\approx \frac{\kappa _{\mathrm{e}}}{\kappa }\sqrt{\frac{\ln 2}{\pi }}{\hbox {e}}^{-\frac{\ln 2t_{\mathrm{r}}^2\Delta ^2}{16}}{\hbox {e}}^{-\kappa t}(1+\cos {(\delta t)})\sin {(\Delta t)}. \end{aligned}$$
(14)

The other possible case to consider is that where the two modes are coupled. This is often the case in travelling wave resonators, such as WGRs. Degenerate modes in the resonator represent clockwise and counter clockwise propagation. Due to scattering along the propagation path, the two modes can indirectly couple to each other; this effect is called modal coupling and leads to normal mode splitting [36, 37]. Here, we define a modal coupling strength, g, and the coupled mode equations for the two modes are given by \(a_{cw}\) and \(a_{ccw}\) representing the two opposing propagation directions. The two coupled mode equations can be written as

$$\begin{aligned} \frac{{\hbox {d}}a_{cw}}{{\hbox {d}}t}&=-j\omega _0a_{cw}(t)-(\kappa _0+\kappa _{\mathrm{e}})a_{cw}(t)\\&\quad -jga_{ccw}(t)+\sqrt{2\kappa _{\mathrm{e}}}S_{\mathrm{in}}(t);\\ \frac{{\hbox {d}}a_{ccw}}{{\hbox {d}}t}&=-j\omega _0a_{ccw}(t)-(\kappa _0+\kappa _{\mathrm{e}})a_{ccw}(t)-jga_{cw}(t). \end{aligned}$$
(15)

The two modes should have the same values of \(\kappa _0, \kappa _{\mathrm{e}}\) and detuning. By letting \(A_1(t)=a_{cw}(t)+a_{ccw}(t)\) and \(A_2(t)=a_{cw}(t)-a_{ccw}(t)\), these two equations can be transformed into two uncoupled equations [37]:

$$\begin{aligned} &\frac{{\hbox {d}}A_1(t)}{{\hbox {d}}t}=-j(\omega _0+g)A_1(t)-(\kappa _0+\kappa _{\mathrm{e}})A_1(t)+\sqrt{2\kappa _{\mathrm{e}}}S_{\mathrm{in}}(t)\\&\frac{{\hbox {d}}B(t)}{{\hbox {d}}t}=-j(\omega _0-g)A_2(t)-(\kappa _0+\kappa _{\mathrm{e}})A_2(t)+\sqrt{2\kappa _{\mathrm{e}}}S_{\mathrm{in}}(t) \end{aligned}$$
(16)

Here, the original two travelling modes generate two equivalent standing wave modes with frequency shifts, \(\pm g\), as illustrated in Fig. 2d. Usually, the modal coupling strength is in the MHz range for silica WGRs, i.e. the frequencies of the two standing waves are quite close to each other. Since \(B\gg 2g\), we can assume that the two modes have the same transient components at the rising edge time, as in the previous case. Also, in the actual experiments, we measure the light transmitted through the tapered fibre, so that \(T(t)=|(1-\sqrt{2\kappa _{\mathrm{e}}}a_{cw}(t))/S_{\mathrm{in}}(t)|^2\), where \(a_{cw}=1/2(A_1(t)+A_2(t))\). \(T_{\mathrm{b}}(t)\) is a beat signal between the initial signals with frequencies \(\Delta\) and \(\Delta \pm g\) and has the following form

$$\begin{aligned} T_{\mathrm{b}}(t)\approx \frac{\kappa _{\mathrm{e}}}{\kappa }\sqrt{\frac{\ln 2}{\pi }}{\hbox {e}}^{-\frac{\ln 2t_{\mathrm{r}}^2\Delta ^2}{16}}{\hbox {e}}^{-\kappa t}\sin {(\Delta t)}\left\{ 1+\frac{1}{2}\cos {(2gt)}\right\} . \end{aligned}$$
(17)

From the above equation, when there is mode splitting due to intrinsic scattering, the transient signal has a cosine form of frequency \(\Delta\) and is modulated by a slowly oscillating envelope with a frequency of 2g. This yields a similar waveform to that obtained in the uncoupled modes’ case. Comparing Eqs. 17 to 14, we see that there is a factor of 1/2 difference. When modal coupling is present, the two modes interfere with each other, whereas for the uncoupled case, the beating of the two modes has no coherent property.

2.4 Numerical method

For a more precise simulation of the transient system, we should solve the coupled mode equations numerically. Equation 1 can be transformed into a rotating frame of reference, with an angular frequency \(\omega _{\mathrm{L}}\) such that

$$\frac{{\hbox {d}}a(t)}{{\hbox {d}}t}=-j\Delta a(t)-\kappa a(t)+\sqrt{2\kappa _{\mathrm{e}}}a_{\mathrm{in}}(t).$$
(18)

The input–output relationship for the rotating frame is \(a_{\mathrm{out}}(t)=a_{\mathrm{in}}(t)-\sqrt{2\kappa _{\mathrm{e}}}a(t)\). For simplicity, we rescale the time by \(t_{\mathrm{r}}\) in the following discussion and we assume that the WGR is critically coupled to the external coupler, unless we explicitly mention otherwise. First, let us assume that the lifetime of a WGM is about 250 ns. Three different responses under various detunings are plotted in Fig. 3a, ranging from \(\Delta =5\kappa\) to \(\Delta =35\kappa\). The oscillation period follows the detuning, so that a larger detuning yields a higher oscillation frequency, which represents the beating between the laser and WGM frequencies, as discussed in Sect. 2(A). However, the peak height reduces when the detuning increases. In Fig. 3b we plot the peak height as a function of detuning. The data points can be fitted quite well with a Gaussian function and this confirms the presence of the Gaussian term in Eq. 10. If the detuning, \(\Delta\), is fixed while the ratio \(\kappa _{\mathrm{e}}/\kappa\) is changed, according to Eq. 10, the peak height should be proportional to this ratio. From the numerical simulations, this linear relationship is confirmed and depicted in Fig. 3c. Varying \(t_{\mathrm{r}}\) and \(\tau\) should not affect the peak height, as shown in both Eq. 10 and the numerical simulations. However, \(t_{\mathrm{r}}\) determines the bandwidth of the transient broadening; therefore, it controls the peak height relationship to detuning, as illustrated in Fig. 3d. Significant bandwidth shrinkage is visible only if the rise time increases by more than a factor of 10. This verifies the behaviour we assumed in Sect. 2. In essence, it is the broadening from the rise time of a detuned pulse that allows light to couple into the WGM and leads to the subsequent beat signal.

Fig. 3
figure 3

a The normalised transient response of a pulse detuned to the high Q WGM. The legend shows the detuning (normalised to \(\kappa\)) for different curves. b The peak heights of the transient signals for different detunings in (a). c The peak heights at different coupling conditions \(\kappa _{\mathrm{e}}/\kappa\). d Peak heights as a function of detuning for certain coupling conditions with different pulse rise-up times, \(t_{\mathrm{r}}\)

In most cases, we find that the analytical method gives the same result as the numerical method, see Fig. 4a. Here, the lifetime of the WGM is chosen to be \(\tau =100\) ns with a rise time of \(t_{\mathrm{r}}=1\) ns. In this case, \(B\gg \kappa\) is well satisfied and both methods yield the same results. However, when the lifetime decreases to \(\tau =10\) ns, the analytical result yields a peak height less than the more accurate numerical method (see Fig. 4b). In this case, \(B\sim \kappa\) and the assumption of narrow filtering by the WGM for obtaining Eq. 10 is not justified. The broader linewidth of the WGM should permit more photons to enter into the cavity at the transient broadening time (shown as the shaded area in Fig. 2c), so that, at a later time, more light can beat with the transmitted pulse.

Fig. 4
figure 4

Transient response of a WGR with a pulsed input signal. Red curve analytical result using Eqs. 10 and 11. Black dots numerical results by solving Eq. 18 directly. Two different cases were considered. a \(\tau =200 t_{\mathrm{r}}, \Delta =10\kappa\); b \(\tau =10t_{\mathrm{r}}, \Delta =2\kappa\). Both cases satisfy the under coupling condition

3 Experiment

In order to confirm the above theory, we performed an experiment using an ultrahigh Q silica microsphere. The experimental setup is depicted in Fig. 5. A 30 μW, 1550 nm laser was initially modulated using an intensity modulator with an EOM (Thorlabs model LN63S-FC, with rise-up time 50 ps). For this purpose, a pulse generator providing a pulse with a rise time of 5 ns, a width of 500 ns and a delay about 100 ns was used. The modulated light was coupled to the microsphere’s WGM using a fibre taper. The transmission through the fibre was detected with a fast photodetector (Newport model 818-BB-35F) with a typical rise time of 500 ps. The signal was retrieved on a digital storage oscilloscope (DSO) and recorded at a sampling rate of >1 GS/s. The microsphere had a diameter of 80 μm and the fibre waist was ~1.2 μm. We chose a high Q silica microsphere with a lifetime of ~500 ns. To study the detuning effects on the CRUS, the microsphere and the fibre taper were aligned to be in contact coupling; therefore, the coupling condition \(\kappa _{\mathrm{e}}/\kappa\) is fixed. The detuning of the laser with respect to the whispering gallery mode was changed so that its frequency approaches that of the WGM in finite steps. The results were normalised to get the peak heights and are plotted in Fig. 6a.

Fig. 5
figure 5

Experimental setup used for cavity ring-up spectroscopy. A 1550-nm laser is intensity modulated and coupled to the microsphere cavity and the transmitted light pulses are detected using a fast detector, with the signals recorded on a fast digital storage oscilloscope (DSO)

Fig. 6
figure 6

a The peak height measured at different laser detunings simulating a dispersive shift of the microsphere (black dots). The red curve is the theoretical results calculated from the measured parameters. b The transmission spectrum of the microsphere which has an ultrahigh Q WGM

Fig. 7
figure 7

ae The experimental cavity ring-up signals from a silica microsphere resonator. Black dots Experimental data; Red curves numerical simulations with a given rise time, \(t_{\mathrm{r}}=5\) ns. The fitting parameters are given in each case. f The peak heights defined as the maxima for plots ae satisfy a linear relationship to \(\kappa _{\mathrm{e}}/\kappa\). The outlier in the plot is due to a large dispersive shift which breaks the linear relationship. The inset shows the external coupling coefficient as an exponential function of the coupling gap

To match with the theoretical framework, we made a separate measurement of the transmission spectrum of the WGM being probed, as illustrated in Fig. 6b. From the transmission efficiency of the mode (80%), and assuming that the system is in the undercoupled regime, it can be deduced that \(\kappa _{\mathrm{e}}/\kappa =0.3\). The FWHM of the mode is 1.5 MHz using Lorentz fitting to the dip in Fig. 6b. In fact, thermal broadening was present even at a low pump laser power of 30 μW; therefore, \(\kappa\) is over estimated. In practice, it is found that \(\tau \approx\) 750 ns yields a good fit to the experimental data. By the means mentioned in Sect. 2, the theoretical peak height relationship to the frequency detuning is plotted as the red curve in Fig. 6a. The trend of the peak height to the detuning follows a Gaussian profile.

We also evaluated the peak height with different coupling gaps by varying the gap using a closed-loop, piezo nanopositioner (Smaract SLC1730s-416). The relative position of the taper from the microsphere was determined using a nanopositioner controller (Smaract MCS-3D). The experimental results for different coupling gaps were fitted with the theory to determine \(\kappa _{\mathrm{e}}/\kappa\) (see Fig. 7a–e). Since \(\kappa _{\mathrm{e}}\ll \kappa _0\), we assume that \(\kappa _{\mathrm{e}}/\kappa =\kappa _{\mathrm{e}}/\kappa _0\). Here, \(\kappa _0\) is a constant, while \(\kappa _{\mathrm{e}}\) satisfies a near exponential curve to the coupling gap [34], as shown in the inset of Fig. 7f. The corresponding peak height for different coupling conditions plotted in Fig. 7f shows a near-linear relationship.

From Fig. 7a–e, the coupling gap is increased and the period of the CRUS becomes larger. The taper introduces a dispersive red shift to the microsphere’s resonance [38]. In our experiments, the laser is blue-detuned relative to the resonance and fixed; the larger the distance between the WGR and the taper, the less the dispersion introduced; thus, the cavity mode shifts relative to the laser thereby decreasing the beat frequency. In Eq. 10, the peak height should be related to both the coupling condition and the detuning. However, since \(\Delta\) appears in the Gaussian term (and assuming that it is large), slightly changing its value will not influence the peak height significantly (see Fig. 3b). In the experiment, we deliberately chose an initial large detuning; therefore, the peak height is still linear with \(\kappa _{\mathrm{e}}/\kappa\) despite the dispersive disturbances. When the system is strongly overcoupled, the results deviate from the linear relationship, as seen when \(\kappa _{\mathrm{e}}/\kappa >0.65\). In the supplementary material of Ref. [38], it was shown that the dispersive shift rate increases exponentially when moving to a strongly overcoupled regime. This means that the dispersive influence of the taper will induce a very large frequency shift for the cavity mode and the changes in the Gaussian term in Eq. 10 cannot be neglected. As a consequence, the peak height does not vary linearly with \(\kappa _{\mathrm{e}}/\kappa\).

4 Discussion

Similar to the work in [28], the deduced formula of CRUS in this manuscript shows that it also provides redundant information if the cavity’s intrinsic Q factor is known; hence, it could be very useful for transient sensing. Instead of doing a time-consuming fast Fourier transform (FFT) of the transient response signal [27], one need only record the maximum of the transient signal to retrieve the information for sensing, assuming that one can measure all the other parameters, such as \(\kappa _{\mathrm{e}}, \kappa _0\) and \(t_{\mathrm{r}}\), from the steady-state transmission spectrum. This significantly reduces the complexity of the data processing and decreases the burden for data acquisition. In this sense, the acquisition speed can be further improved. For sensing based on reactive/dispersive interactions [19], sensitivity can be optimised by choosing the correct laser detuning. From Fig. 3b, the peak height has a Gaussian relationship to the detuning. In order to obtain the highest sensitivity, the pump laser frequency should be chosen so that it is closer to the WGM resonance. For example, in Fig. 3b, when \(\Delta \sim 10\kappa\) the sensitivity is \({\hbox {d}}H/{\hbox {d}}\Delta \sim 0.01/\kappa\) (where H is the peak height). Also, from Fig. 3d, when \(t_{\mathrm{r}}\) is longer, the Gaussian profile is steeper; this also improves the sensitivity.

For dissipative sensing [21], measuring the peak height for a fixed detuning will also yield valid results. As predicted by theory, the peak height changes linearly with \(\kappa _{\mathrm{e}}/\kappa\). If the system experiences an intrinsic dissipation change due to environmental conditions, the peak height should maintain an inverse relationship to intrinsic dissipation under a certain coupling condition, which is the gap between the taper and the microsphere in our case (in more complicated situations, the coupling is not only determined by the gap but also the dissipation of the surrounding environment [22]). In a more complicated scenario, where both dispersion and dissipation exist, a measurement of the peak height may still be sufficient. In the experiment mentioned in the previous section, it was shown that, for large detuning, the peak height always satisfies a linear relationship to \(\kappa _{\mathrm{e}}/\kappa\); therefore, it provides a dissipative sensing method immune to any changes to the laser detuning. By having different laser detuning configurations, contributions from dispersive and dissipative interactions can be well categorised. For instance, using a WDM technique, two lasers with independent configurations can be used for simultaneous dispersion and dissipation sensing.

5 Conclusion

In summary, the dynamical mechanisms behind CRUS were investigated by solving the coupled mode equations for a transient response to a Gaussian input pulse. The detailed relationship of the CRUS to laser detuning, coupling coefficient and rise time was determined using approximate analytical solutions. This is further verified by experimental measurements using an ultrahigh Q silica microsphere. Using this method, dispersive and dissipative sensing can be performed separately in the transient domain.

6 Personal recollections: Síle Nic Chormaic

I had the pleasure of working in the Laser Spectroscopy Division of the MPQ during a very exciting time, from 1999 to 2000, when the mode-locked femtosecond laser frequency comb was being developed and used for precision spectroscopy. Though I was working on a different experiment, the energy in the laboratory at the time was contagious; the weekly group meetings and the Schloss Ringberg workshop ensured that we all felt part of this exciting project. I am not sure how many of us fully realised the impact that this early work would have on the field of precision spectroscopy over the following decades. Ted’s insight and vision drove this work forward. I am extremely grateful for the exceptional opportunities he gave to me and for introducing me to Rote Grütze. Happy 75th birthday!