Abstract
Several km-scale gravitational-wave detectors have been constructed worldwide. These instruments combine a number of advanced technologies to push the limits of precision length measurement. The core devices are laser interferometers of a new kind; developed from the classical Michelson topology, these interferometers integrate additional optical elements, which significantly change the properties of the optical system. Much of the design and analysis of these laser interferometers can be performed using well-known classical optical techniques; however, the complex optical layouts provide a new challenge. In this review we give a textbook-style introduction to the optical science required for the understanding of modern gravitational-wave detectors, as well as other high-precision laser interferometers. In addition, we provide a number of examples for a freely available interferometer simulation software and encourage the reader to use these examples to gain hands-on experience with the discussed optical methods.
Introduction
The scope and style of the review
The historical development of laser interferometers for application as gravitational-wave detectors [47] has involved the combination of relatively simple optical subsystems into more and more complex assemblies. The individual elements that compose the interferometers, including mirrors, beam splitters, lasers, modulators, various polarising optics, photo detectors and so forth, are individually well described by relatively simple, mostly-classical physics. Complexity arises from the combination of multiple mirrors, beam splitters etc. into optical cavity systems that have narrow resonant features, and the consequent requirement to stabilise relative separations of the various components to sub-wavelength accuracy, and indeed in many cases to very small fractions of a wavelength.
Thus, classical physics describes the interferometer techniques and the operation of current gravitational-wave detectors. However, we note that at signal frequencies above a few hundred hertz, the sensitivity of current detectors is limited by the photon counting noise at the interferometer readout, also called shot noise. The next generation systems such as Advanced LIGO [23, 5], Advanced Virgo [4] and LCGT [36] are expected to operate in a regime where the quantum physics of both light and mirror motion couple to each other. Then, a rigorous quantum-mechanical description is certainly required. Sensitivity improvements beyond these ‘Advanced’ detectors necessitate the development of non-classical techniques. The present review, in its first version, does not consider quantum effects but reserves them for future updates.
The components employed tend to behave in a linear fashion with respect to the optical field, i.e., nonlinear optical effects need hardly be considered. Indeed, almost all aspects of the design of laser interferometers are dealt with in the linear regime. Therefore the underlying mathematics is relatively simple and many standard techniques are available, including those that naturally allow numerical solution by computer models. Such computer models are in fact necessary as the exact solutions can become quite complicated even for systems of a few components. In practice, workers in the field rarely calculate the behaviour of the optical systems from first principles, but instead rely on various well-established numerical modelling techniques. An example of software that enables modelling of either time-dependent or frequency-domain behaviour of interferometers and their component systems is Finesse [22, 19]. This was developed by one of us (AF), has been validated in a wide range of situations, and was used to prepare the examples included in the present review.
The target readership we have in mind is the student or researcher who desires to get to grips with practical issues in the design of interferometers or component parts thereof. For that reason, this review consists of sections covering the basic physics and approaches to simulation, intermixed with some practical examples. To make this as useful as possible, the examples are intended to be realistic with sensible parameters reflecting typical application in gravitational wave detectors. The examples, prepared using Finesse, are designed to illustrate the methods typically applied in designing gravitational wave detectors. We encourage the reader to obtain Finesse and to follow the examples (see Appendix A).
Overview of the goals of interferometer design
As set out in very many works, gravitational-wave detectors strive to pick out signals carried by passing gravitational waves from a background of self-generated noise. The principles of operation are set out at various points in the review, but in essence, the goal has been to prepare many photons, stored for as long as practical in the ‘arms’ of a laser interferometer (traditionally the two arms are at right angles), so that tiny phase shifts induced by the gravitational waves form as large as possible a signal, when the light leaving the appropriate ‘port’ of the interferometer is detected and the resulting signal analysed.
The evolution of gravitational-wave detectors can be seen by following their development from prototypes and early observing systems towards the Advanced detectors, which are currently in the final stages of planning or early stages of construction. Starting from the simplest Michelson interferometer [18], development proceeded by the application of techniques to increase the number of photons stored in the arms: delay lines [31], Fabry-Pérot arm cavities [16, 17] and power recycling [15]. The final step in the development of classical interferometry was the inclusion of signal recycling [41, 30], which, among other effects, allows the signal from a gravitational-wave source of approximately-known spectrum to be enhanced above the noise.
Reading out a signal from even the most basic interferometer requires minimising the coupling of local environmental effects to the detected output. Thus, the relative positions of all the components must be stabilised. This is commonly achieved by suspending the mirrors etc. as pendulums, often multi-stage pendulums in series, and then applying closed-loop control to maintain the desired operating condition. The careful engineering required to provide low-noise suspensions with the correct vibration isolation, and also low-noise actuation, is described in many works. As the interferometer optics become more complicated, the resonance conditions, i.e., the allowed combinations of inter-component path lengths required to allow the photon number in the interferometer arms to reach maximum, become more narrowly defined. It is likewise necessary to maintain angular alignment of all components, such that beams required to interfere are correctly co-aligned. Typically the beams need to be aligned within a small fraction (and sometimes a very small fraction) of the far-field diffraction angle, and the requirement can be in the low nanoradian range for km-scale detectors [44, 21]. Therefore, for each optical component there is typically one longitudinal degree of freedom (i.e., along the direction of light propagation), plus two angular degrees of freedom (pitch and yaw, i.e., rotations about the two axes perpendicular to the direction of propagation). A complex interferometer can consist of up to around seven highly sensitive components and so there can be of order 20 degrees of freedom to be measured and controlled [3, 57].
Although the light fields are linear, the coupling between the position of a mirror and the complex amplitude of the detected light field typically shows strongly nonlinear dependence on mirror positions due to the sharp resonance features exhibited by cavity systems. However, the fields do vary linearly or at least smoothly close to the desired operating point. So, while well-understood linear control theory suffices to design the control system needed to maintain the optical configuration at its operating point, bringing the system to that operating condition is often a separate and more challenging nonlinear problem. In the current version of this work we consider only the linear aspects of sensing and control.
Control systems require actuators, and those employed are typically electrical-force transducers that act on the suspended optical components, either directly or, to provide enhanced noise rejection, at upper stages of multi-stage suspensions. The transducers are normally coil-magnet actuators, with the magnets on the moving part, or, less frequently, electrostatic actuators of varying design. The actuators are frequently regarded as part of the mirror suspension subsystem and are not discussed in the current work.
Overview of the physics of the primary interferometer components
To give order to our review we consider the main physics describing the operation of the basic optical components (mirrors, beam splitters, modulators, etc.) required to construct interferometers. Although all of the relevant physics is generally well known and not new, we take it as a starting point that permits the introduction of notation and conventions. It is also true that the interferometry employed for gravitational-wave detection has a different emphasis than other interferometer applications. As a consequence, descriptions or examples of a number of crucial optical properties for gravitational-wave detectors cannot be found in the literature. The purpose of this first version of the review is especially to provide a coherent theoretical framework for describing such effects. With the basics established, it can be seen that the interferometer configurations that have been employed in gravitational-wave detection may be built up and simulated in a relatively straightforward manner.
As mentioned above, we do not address the newer physics associated with operation at or beyond the standard quantum limit. The interested reader can begin to explore this topic from the following references.
These matters are to be included in a future revision of this review.
Planewave analysis
The main optical systems of interferometric gravitational-wave detectors are designed such that all system parameters are well known and stable over time. The stability is achieved through a mixture of passive isolation systems and active feedback control. In particular, the light sources are some of the most stable, low-noise continuous-wave laser systems so that electromagnetic fields can be assumed to be essentially monochromatic. Additional frequency components can be modelled as small modulations (in amplitude or phase). The laser beams are well collimated, propagate along a well-defined optical axis and remain always very much smaller than the optical elements they interact with. Therefore, these beams can be described as paraxial and the well-known paraxial approximations can be applied.
It is useful to first derive a mathematical model based on monochromatic, scalar, plane waves. As it turns out, a more detailed model including the polarisation and the shape of the laser beam, as well as multiple frequency components, can be derived as an extension to the plane-wave model. A plane electromagnetic wave is typically described by its electric field component:
\[ \vec E(t, \vec r) = E_0\, \vec e_p \cos\left(\omega t - \vec k \vec r + \varphi\right), \]
with E_{0} as the (constant) field amplitude in V/m, \({\vec e_p}\) the unit vector in the direction of polarisation, such as, for example, \({\vec e_y}\) for S-polarised light, ω the angular oscillation frequency of the wave, and \(\vec k = {\vec e_k}\omega/c\) the wave vector pointing in the direction of propagation. The absolute phase φ only becomes meaningful when the field is superposed with other light fields.
In this document we will consider waves propagating along the optical axis given by the z-axis, so that \(\vec k\vec r = kz\). For the moment we will ignore the polarisation and use scalar waves, which can be written as
\[ E(t, z) = E_0 \cos\left(\omega t - kz + \varphi\right). \]
Further, in this document we use complex notation, i.e.,
\[ E = \Re\left\{E'\right\} \quad \text{with} \quad E' = E'_0 \exp\left(i(\omega t - kz)\right). \]
This has the advantage that the scalar amplitude and the phase φ can be given by one, now complex, amplitude E′_{0} = E_{0} exp(iφ). We will use this notation with complex numbers throughout. For clarity we will simply use the unprimed letters for the auxiliary field. In particular, we will use the letter E and also a and b to denote complex electric-field amplitudes. But remember that, for example, in E = E_{0} exp(−i kz) neither E nor E_{0} are physical quantities. Only the real part of E exists and deserves the name field amplitude.
Frequency domain analysis
In most cases we are either interested in the fields at one particular location, for example, on the surface of an optical element, or we want to know the fields at all places in the interferometer but at one particular point in time. The latter is usually true for the steady state approach: assuming that the interferometer is in a steady state, all solutions must be independent of time so that we can perform all computations at t = 0 without loss of generality. In that case, the scalar plane wave can be written as
\[ E(t = 0, z) = E_0 \exp(-i kz). \]
The frequency domain is of special interest as numerical models of gravitational-wave detectors tend to be much faster to compute in the frequency domain than in the time domain.
Optical Components: Coupling of Field Amplitudes
When an electromagnetic wave interacts with an optical system, all of its parameters can be changed as a result. Typically optical components are designed such that, ideally, they only affect one of the parameters, i.e., either the amplitude or the polarisation or the shape. Therefore, it is convenient to derive separate descriptions concerning each parameter. This section introduces the coupling of the complex field amplitude at optical components. Typically, the optical components are described in the simplest possible way, as illustrated by the use of abstract schematics such as those shown in Figure 2.
Mirrors and spaces: reflection, transmission and propagation
The core optical systems of current interferometric gravitational-wave detectors are composed of two building blocks: a) resonant optical cavities, such as Fabry-Pérot resonators, and b) beam splitters, as in a Michelson interferometer. In other words, the laser beam is either propagated through a vacuum system or interacts with a partially-reflecting optical surface.
The term optical surface generally refers to a boundary between two media with possibly different indices of refraction n, for example, the boundary between air and glass or between two types of glass. A real fused silica mirror in an interferometer features two surfaces, which interact with a reflected or transmitted laser beam. However, in some cases, one of these surfaces has been treated with an anti-reflection (AR) coating to minimise the effect on the transmitted beam.
The terms mirror and beam splitter are sometimes used to describe a (theoretical) optical surface in a model. We define real amplitude coefficients for reflection and transmission r and t, with 0 ≤ r, t ≤ 1, so that the field amplitudes can be written as
\[ a_2 = i t\, a_1 + r\, a_3, \qquad a_4 = r\, a_1 + i t\, a_3, \]
with a_{1} and a_{3} the fields impinging on the mirror from the left and from the right, and a_{2} and a_{4} the fields leaving it towards the right and towards the left, respectively. The π/2 phase shift upon transmission (here given by the factor i) refers to a phase convention explained in Section 2.4.
The free propagation over a distance D through a medium with index of refraction n can be described with the following set of equations:
\[ a_2 = a_1 \exp(-i k n D), \qquad a_4 = a_3 \exp(-i k n D). \]
In the following we use n = 1 for simplicity.
Note that we use the above relations to demonstrate various mathematical methods for the analysis of optical systems. However, refined versions of the coupling equations for optical components, including those for spaces and mirrors, are also required, see, for example, Section 2.6.
The twomirror resonator
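The linear optical resonator, also called a cavity, is formed by two partially-transparent mirrors, arranged in parallel as shown in Figure 5. This simple setup makes a very good example with which to illustrate how a mathematical model of an interferometer can be derived, using the equations introduced in Section 2.1.
The cavity is defined by a propagation length D (in vacuum), the amplitude reflectivities r_{1}, r_{2} and the amplitude transmittances t_{1}, t_{2}. The amplitude at each point in the cavity can be computed simply as the superposition of fields. With a_{0} the input field, a_{1} and a′_{1} the field travelling from the first to the second mirror (taken at the first and second mirror, respectively), a_{3} and a′_{3} the field travelling back, a_{4} the reflected and a_{2} the transmitted field, the entire set of equations can be written as
\[ \begin{aligned} a_1 &= i t_1 a_0 + r_1 a'_3, \\ a'_1 &= \exp(-i kD)\, a_1, \\ a_2 &= i t_2 a'_1, \\ a_3 &= r_2 a'_1, \\ a'_3 &= \exp(-i kD)\, a_3, \\ a_4 &= r_1 a_0 + i t_1 a'_3. \end{aligned} \tag{4} \]
The circulating field impinging on the first mirror (surface) a′_{3} can now be computed as
\[ a'_3 = \exp(-i kD)\, a_3 = \exp(-i kD)\, r_2 a'_1 = \exp(-i 2kD)\, r_2 a_1. \]
This then yields
\[ a_1 = a_0 \frac{i t_1}{1 - r_1 r_2 \exp(-i 2kD)}. \]
We can directly compute the reflected field to be
\[ a_4 = a_0 \left( r_1 - \frac{r_2 t_1^2 \exp(-i 2kD)}{1 - r_1 r_2 \exp(-i 2kD)} \right) = a_0 \frac{r_1 - r_2 (r_1^2 + t_1^2) \exp(-i 2kD)}{1 - r_1 r_2 \exp(-i 2kD)}, \tag{7} \]
while the transmitted field becomes
\[ a_2 = a_0 \frac{-t_1 t_2 \exp(-i kD)}{1 - r_1 r_2 \exp(-i 2kD)}. \tag{8} \]
The properties of two-mirror cavities will be discussed in more detail in Section 5.1.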
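The closed-form expressions for the reflected and transmitted fields lend themselves to a quick numerical check. The following Python sketch evaluates both amplitudes as a function of the one-way phase kD, using the conventions of this section (reflection r, transmission i t, propagation factor exp(−i kD)). The function name `cavity_fields` and the mirror values (two identical lossless mirrors with R = 0.9) are illustrative assumptions, not parameters of any particular detector.

```python
import cmath
import math


def cavity_fields(r1, t1, r2, t2, kD, a0=1.0):
    """Reflected (a4) and transmitted (a2) amplitudes of a two-mirror cavity.

    Convention: reflection r, transmission i*t, and a factor exp(-i k D)
    for each pass through the space between the mirrors.
    """
    round_trip = cmath.exp(-2j * kD)
    denom = 1 - r1 * r2 * round_trip
    a4 = a0 * (r1 - r2 * (r1**2 + t1**2) * round_trip) / denom
    a2 = -a0 * t1 * t2 * cmath.exp(-1j * kD) / denom
    return a4, a2


# Assumed example values: two identical lossless mirrors with R = 0.9.
r = math.sqrt(0.9)
t = math.sqrt(0.1)

for label, kD in [("resonance", math.pi), ("anti-resonance", math.pi / 2)]:
    a4, a2 = cavity_fields(r, t, r, t, kD)
    print(f"{label}: |a4|^2 = {abs(a4)**2:.4f}, |a2|^2 = {abs(a2)**2:.4f}")
```

For lossless mirrors the reflected and transmitted powers should add up to the input power at any value of kD, which is a convenient sanity check for the signs in the expressions.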
Coupling matrices
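Computations that involve sets of linear equations as shown in Section 2.2 can often be done or written efficiently with matrices. Two methods of applying matrices to coupling field amplitudes are demonstrated below, using again the example of a two-mirror cavity. First of all, we can rewrite the coupling equations in matrix form. The mirror coupling as given in Figure 3 becomes
\[ \begin{pmatrix} a_2 \\ a_4 \end{pmatrix} = \begin{pmatrix} i t & r \\ r & i t \end{pmatrix} \begin{pmatrix} a_1 \\ a_3 \end{pmatrix}, \]
and the amplitude coupling at a ‘space’, as given in Figure 4, can be written as
\[ \begin{pmatrix} a_2 \\ a_4 \end{pmatrix} = \begin{pmatrix} \exp(-i kD) & 0 \\ 0 & \exp(-i kD) \end{pmatrix} \begin{pmatrix} a_1 \\ a_3 \end{pmatrix}. \]
In these examples the matrix simply transforms the ‘known’ impinging amplitudes into the ‘unknown’ outgoing amplitudes.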
Coupling matrices for numerical computations
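An obvious application of the matrices introduced above would be to construct a large matrix for an extended optical system appropriate for computerisation. A very flexible method is to set up one equation for each field amplitude. The set of linear equations for a mirror would expand to
\[ \begin{pmatrix} 1 & 0 & 0 & 0 \\ -i t & 1 & -r & 0 \\ 0 & 0 & 1 & 0 \\ -r & 0 & -i t & 1 \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \\ a_3 \\ a_4 \end{pmatrix} = \begin{pmatrix} a_1 \\ 0 \\ a_3 \\ 0 \end{pmatrix}, \qquad \text{i.e.,} \quad M_{\rm system}\, \vec a_{\rm sol} = \vec a_{\rm input}, \]
where the input vector^{Footnote 1} \({{\vec a}_{{\rm{input}}}}\) has non-zero values for the impinging fields and \({{\vec a}_{{\rm{sol}}}}\) is the ‘solution’ vector, i.e., after solving the system of equations the amplitudes of the impinging as well as those of the outgoing fields are stored in that vector.
As an example we apply this method to the two-mirror cavity. The system matrix for the optical setup shown in Figure 5 becomes
\[ \begin{pmatrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ -i t_1 & 1 & 0 & 0 & 0 & -r_1 & 0 \\ 0 & -e^{-i kD} & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & -i t_2 & 1 & 0 & 0 & 0 \\ 0 & 0 & -r_2 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & -e^{-i kD} & 1 & 0 \\ -r_1 & 0 & 0 & 0 & 0 & -i t_1 & 1 \end{pmatrix} \begin{pmatrix} a_0 \\ a_1 \\ a'_1 \\ a_2 \\ a_3 \\ a'_3 \\ a_4 \end{pmatrix} = \begin{pmatrix} a_0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ 0 \end{pmatrix}. \]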
This is a sparse matrix. Sparse matrices are an important subclass of linear algebra problems and many efficient numerical algorithms for solving sparse matrices are freely available (see, for example, [13]). The advantage of this method of constructing a single matrix for an entire optical system is the direct access to all field amplitudes. It also stores each coupling coefficient in one or more dedicated matrix elements, so that numerical values for each parameter can be read out or changed after the matrix has been constructed and, for example, stored in computer memory. The obvious disadvantage is that the size of the matrix quickly grows with the number of optical elements (and with the degrees of freedom of the system, see, for example, Section 7).
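As a minimal sketch of this approach, the Python code below builds the system matrix of the two-mirror cavity for the field vector (a_0, a_1, a′_1, a_2, a_3, a′_3, a_4), solves it, and compares the transmitted field with the closed-form result of Section 2.2. For self-containedness it uses a small dense Gaussian elimination written in plain Python; this is an assumption made for illustration only, and a real model would use a dedicated sparse solver. All names and mirror values are hypothetical.

```python
import cmath
import math


def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [list(A[i]) + [b[i]] for i in range(n)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda row: abs(M[row][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for row in range(col + 1, n):
            factor = M[row][col] / M[col][col]
            for j in range(col, n + 1):
                M[row][j] -= factor * M[col][j]
    x = [0j] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def cavity_system(r1, t1, r2, t2, kD):
    """System matrix for the field vector (a0, a1, a1', a2, a3, a3', a4)."""
    p = cmath.exp(-1j * kD)
    A = [[1 if i == j else 0 for j in range(7)] for i in range(7)]
    A[1][0], A[1][5] = -1j * t1, -r1   # a1  = i t1 a0 + r1 a3'
    A[2][1] = -p                       # a1' = exp(-ikD) a1
    A[3][2] = -1j * t2                 # a2  = i t2 a1'
    A[4][2] = -r2                      # a3  = r2 a1'
    A[5][4] = -p                       # a3' = exp(-ikD) a3
    A[6][0], A[6][5] = -r1, -1j * t1   # a4  = r1 a0 + i t1 a3'
    return A


# Assumed example values (lossless mirrors, arbitrary one-way phase).
r1, t1 = math.sqrt(0.9), math.sqrt(0.1)
r2, t2 = math.sqrt(0.8), math.sqrt(0.2)
kD = 0.3

a_input = [1, 0, 0, 0, 0, 0, 0]          # only a0 is non-zero
a_sol = solve_linear(cavity_system(r1, t1, r2, t2, kD), a_input)

denom = 1 - r1 * r2 * cmath.exp(-2j * kD)
a2_closed = -t1 * t2 * cmath.exp(-1j * kD) / denom
print("difference to closed form:", abs(a_sol[3] - a2_closed))
```

After solving, every field amplitude, including the circulating fields inside the cavity, is available as an entry of `a_sol`, which is the direct access to all field amplitudes mentioned above.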
Coupling matrices for a compact system description
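The following method is probably most useful for analytic computations, or for optimisation aspects of a numerical computation. The idea behind the scheme, which is used for computing the characteristics of dielectric coatings [28, 40] and has been demonstrated for analysing gravitational-wave detectors [43], is to rearrange equations as in Figure 6 and Figure 7 such that the overall matrix describing a series of components can be obtained by multiplication of the component matrices. In order to achieve this, the coupling equations have to be reordered so that the input vector consists of two field amplitudes at one side of the component. For the mirror, this gives a coupling matrix of
\[ \begin{pmatrix} a_1 \\ a_4 \end{pmatrix} = \frac{i}{t} \begin{pmatrix} -1 & r \\ -r & r^2 + t^2 \end{pmatrix} \begin{pmatrix} a_2 \\ a_3 \end{pmatrix}. \]
In the special case of the lossless mirror this matrix simplifies as we have r^{2} + t^{2} = R + T = 1. The space component would be described by the following matrix:
\[ \begin{pmatrix} a_1 \\ a_4 \end{pmatrix} = \begin{pmatrix} \exp(i kD) & 0 \\ 0 & \exp(-i kD) \end{pmatrix} \begin{pmatrix} a_2 \\ a_3 \end{pmatrix}. \]
With these matrices we can very easily compute a matrix for the cavity with two lossless mirrors as
\[ M_{\rm cav} = M_{\rm mirror1} \times M_{\rm space} \times M_{\rm mirror2} = \frac{-1}{t_1 t_2} \begin{pmatrix} e^{+} - r_1 r_2 e^{-} & -r_2 e^{+} + r_1 e^{-} \\ -r_2 e^{-} + r_1 e^{+} & e^{-} - r_1 r_2 e^{+} \end{pmatrix}, \]
with e^{+} = exp(i kD) and e^{−} = exp(−ikD). The system of equations describing a cavity shown in Equation (4) can now be written more compactly as
\[ \begin{pmatrix} a_0 \\ a_4 \end{pmatrix} = \frac{-1}{t_1 t_2} \begin{pmatrix} e^{+} - r_1 r_2 e^{-} & -r_2 e^{+} + r_1 e^{-} \\ -r_2 e^{-} + r_1 e^{+} & e^{-} - r_1 r_2 e^{+} \end{pmatrix} \begin{pmatrix} a_2 \\ 0 \end{pmatrix}. \]
This allows direct computation of the amplitude of the transmitted field resulting in
\[ a_2 = a_0 \frac{-t_1 t_2 \exp(-i kD)}{1 - r_1 r_2 \exp(-i 2kD)}, \]
which is the same as Equation (8).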
The advantage of this matrix method is that it allows compact storage of any series of mirrors and propagations, and potentially other optical elements, in a single 2 × 2 matrix. The disadvantage inherent in this scheme is the lack of information about the field amplitudes inside the group of optical elements.
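The multiplication of component matrices can be sketched in a few lines of Python. The code below assumes the ordering used in this section, in which each matrix maps the two fields on the right-hand side of a component to the two fields on its left-hand side, and that no light enters the cavity from the right. The helper names (`matmul2`, `mirror_matrix`, `space_matrix`) and the mirror values are illustrative assumptions.

```python
import cmath
import math


def matmul2(A, B):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def mirror_matrix(r, t):
    """Maps the fields right of a mirror, (a2, a3), to those left of it, (a1, a4)."""
    f = 1j / t
    return [[-f, f * r], [-f * r, f * (r**2 + t**2)]]


def space_matrix(kD):
    """Same ordering for a free-space propagation with one-way phase kD."""
    return [[cmath.exp(1j * kD), 0], [0, cmath.exp(-1j * kD)]]


# Assumed example values (lossless mirrors, arbitrary one-way phase).
r1, t1 = math.sqrt(0.9), math.sqrt(0.1)
r2, t2 = math.sqrt(0.8), math.sqrt(0.2)
kD = 0.3

M_cav = matmul2(matmul2(mirror_matrix(r1, t1), space_matrix(kD)),
                mirror_matrix(r2, t2))

a0 = 1.0
a2 = a0 / M_cav[0][0]      # transmitted field, from the first row
a4 = M_cav[1][0] * a2      # reflected field, from the second row

denom = 1 - r1 * r2 * cmath.exp(-2j * kD)
a2_closed = -t1 * t2 * cmath.exp(-1j * kD) / denom
print("difference to closed form:", abs(a2 - a2_closed))
```

Only the input, reflected and transmitted fields appear in this calculation; the circulating fields are not available, which illustrates the disadvantage noted above.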
Phase relation at a mirror or beam splitter
The magnitude and phase of reflection at a single optical surface can be derived from Maxwell’s equations and the electromagnetic boundary conditions at the surface, and in particular the condition that the field amplitudes tangential to the optical surface must be continuous. The results are called Fresnel’s equations [33]. Thus, for a field impinging on an optical surface under normal incidence we can give the reflection coefficient as
\[ r = \frac{n_1 - n_2}{n_1 + n_2}, \]
with n_{1} and n_{2} the indices of refraction of the first and second medium, respectively. The transmission coefficient for a lossless surface can be computed as t^{2} = 1 − r^{2}. We note that the phase change upon reflection is either 0 or 180°, depending on whether the second medium is optically thinner or thicker than the first. It is not shown here but Fresnel’s equations can also be used to show that the phase change for the transmitted light at a lossless surface is zero. This contrasts with the definitions given in Section 2.1 (see Figure 3 ff.), where the phase shift upon any reflection is defined as zero and the transmitted light experiences a phase shift of π/2. The following section explains the motivation for the latter definition having been adopted as the common notation for the analysis of modern optical systems.
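As a small worked example, the normal-incidence reflection coefficient r = (n_1 − n_2)/(n_1 + n_2) can be evaluated for a vacuum to glass interface. The Python sketch below assumes an index of n = 1.45 as a round value for fused silica; the function name is hypothetical, and t is computed from t² = 1 − r² as stated in the text.

```python
import math


def fresnel_normal_incidence(n1, n2):
    """Amplitude reflection r and lossless t (from t^2 = 1 - r^2) at normal incidence."""
    r = (n1 - n2) / (n1 + n2)
    t = math.sqrt(1 - r**2)
    return r, t


# Assumed round value for the index of fused silica.
r_in, t_in = fresnel_normal_incidence(1.0, 1.45)    # entering the glass
r_out, t_out = fresnel_normal_incidence(1.45, 1.0)  # leaving the glass

print(f"entering glass: r = {r_in:+.4f}, R = {r_in**2:.4f}")
print(f"leaving glass:  r = {r_out:+.4f}, R = {r_out**2:.4f}")
```

The sign of r flips between the two directions, which corresponds to the 180° or 0 phase change upon reflection, while the power reflectivity R = r² is the same in both cases.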
Composite optical surfaces
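Modern mirrors and beam splitters that make use of dielectric coatings are complex optical systems, see Figure 8, whose reflectivity and transmission depend on the multiple interference inside the coating layers and thus on microscopic parameters. The phase change upon transmission or reflection depends on the details of the applied coating and is typically not known. In any case, the knowledge of an absolute value of a phase change is typically not of interest in laser interferometers because the absolute positions of the optical components are not known to sub-wavelength precision. Instead the relative phase between the incoming and outgoing beams is of importance. In the following we demonstrate how constraints on these relative phases, i.e., the phase relation between the beams, can be derived from the fundamental principle of power conservation. To do this we consider a Michelson interferometer, as shown in Figure 9, with perfectly-reflecting mirrors. The beam splitter of the Michelson interferometer is the object under test. We assume that the magnitude of the reflection r and transmission t are known. The phase changes upon transmission and reflection are unknown. Due to symmetry we can say that the phase change upon transmission, \(\varphi_t\), should be the same in both directions. However, the phase change on reflection might be different for either direction, thus, we write \(\varphi_{r1}\) for the reflection at the front and \(\varphi_{r2}\) for the reflection at the back of the beam splitter.
Then the electric fields leaving the beam splitter towards the arms can be computed as
\[ E_1 = r E_0\, e^{i \varphi_{r1}}, \qquad E_2 = t E_0\, e^{i \varphi_t}, \]
with E_{1} the field reflected into the vertical arm and E_{2} the field transmitted into the horizontal arm.
We do not know the length of the interferometer arms. Thus, we introduce two further unknown phases: Φ_{1} for the total phase accumulated by the field in the vertical arm and Φ_{2} for the total phase accumulated in the horizontal arm. The fields impinging on the beam splitter compute as
\[ E_3 = r E_0\, e^{i(\varphi_{r1} + \Phi_1)}, \qquad E_4 = t E_0\, e^{i(\varphi_t + \Phi_2)}. \]
The outgoing fields are computed as the sums of the reflected and transmitted components:
\[ \begin{aligned} E_5 &= E_0 \left( R\, e^{i(2\varphi_{r1} + \Phi_1)} + T\, e^{i(2\varphi_t + \Phi_2)} \right), \\ E_6 &= E_0\, r t \left( e^{i(\varphi_t + \varphi_{r1} + \Phi_1)} + e^{i(\varphi_t + \varphi_{r2} + \Phi_2)} \right), \end{aligned} \]
with R = r^{2} and T = t^{2}.
It will be convenient to separate the phase factors into common and differential ones. We can write
\[ E_5 = E_0\, e^{i \alpha_+} \left( R\, e^{i \alpha_-} + T\, e^{-i \alpha_-} \right), \]
with
\[ \alpha_+ = \varphi_{r1} + \varphi_t + \tfrac{1}{2}\left(\Phi_1 + \Phi_2\right), \qquad \alpha_- = \varphi_{r1} - \varphi_t + \tfrac{1}{2}\left(\Phi_1 - \Phi_2\right), \]
and similarly
\[ E_6 = E_0\, r t\, e^{i \beta_+}\, 2 \cos(\beta_-), \]
with
\[ \beta_+ = \varphi_t + \tfrac{1}{2}\left(\varphi_{r1} + \varphi_{r2} + \Phi_1 + \Phi_2\right), \qquad \beta_- = \tfrac{1}{2}\left(\varphi_{r1} - \varphi_{r2} + \Phi_1 - \Phi_2\right). \]
For simplicity we now limit the discussion to a 50:50 beam splitter with \(r = t = 1/\sqrt 2\), for which we can simplify the field expressions even further:
\[ E_5 = E_0\, e^{i \alpha_+} \cos(\alpha_-), \qquad E_6 = E_0\, e^{i \beta_+} \cos(\beta_-). \]
Conservation of energy requires that |E_{0}|^{2} = |E_{5}|^{2} + |E_{6}|^{2}, which in turn requires
\[ \cos^2(\alpha_-) + \cos^2(\beta_-) = 1, \]
which is only true if
\[ \alpha_- - \beta_- = (2N + 1) \frac{\pi}{2}, \]
with N as an integer (positive, negative or zero). This gives the following constraint on the phase factors
\[ \tfrac{1}{2}\left(\varphi_{r1} + \varphi_{r2}\right) - \varphi_t = (2N + 1) \frac{\pi}{2}. \tag{28} \]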
One can show that exactly the same condition results in the case of arbitrary (lossless) reflectivity of the beam splitter [48].
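The statement for arbitrary lossless reflectivity can be checked numerically. The Python sketch below computes the two fields leaving the beam splitter of a Michelson interferometer with perfectly reflecting end mirrors: the field returning towards the laser is either reflected twice at the front or transmitted twice, while the field at the other port is reflected once and transmitted once. It then compares the total output power for a set of phases that satisfies the constraint (φ_t = π/2, φ_r1 = φ_r2 = 0) with a set that violates it (all phases zero). The function names, the reflectivity R = 0.3 and the arm phases are assumptions chosen for illustration.

```python
import cmath
import math


def michelson_outputs(r, t, phi_t, phi_r1, phi_r2, Phi1, Phi2, E0=1.0):
    """Fields leaving the beam splitter of a Michelson with perfect end mirrors."""
    # Towards the laser: reflected twice at the front, or transmitted twice.
    E5 = E0 * (r**2 * cmath.exp(1j * (2 * phi_r1 + Phi1))
               + t**2 * cmath.exp(1j * (2 * phi_t + Phi2)))
    # Other port: front reflection then transmission, or transmission then
    # reflection at the back of the beam splitter.
    E6 = E0 * r * t * (cmath.exp(1j * (phi_t + phi_r1 + Phi1))
                       + cmath.exp(1j * (phi_t + phi_r2 + Phi2)))
    return E5, E6


def total_power(E5, E6):
    return abs(E5)**2 + abs(E6)**2


# Assumed example: lossless beam splitter with R = 0.3, T = 0.7.
r, t = math.sqrt(0.3), math.sqrt(0.7)
Phi2 = 1.0

for Phi1 in (0.0, 0.4, 2.1):
    good = total_power(*michelson_outputs(r, t, math.pi / 2, 0.0, 0.0, Phi1, Phi2))
    bad = total_power(*michelson_outputs(r, t, 0.0, 0.0, 0.0, Phi1, Phi2))
    print(f"Phi1 = {Phi1:.1f}: constraint satisfied -> {good:.6f}, violated -> {bad:.6f}")
```

With the constraint satisfied the total output power stays equal to the input power for any arm phases; with all phases set to zero the sum depends on the arm-length difference, which would violate energy conservation.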
We can test whether two known examples fulfill this condition. If the beam-splitting surface is the front of a glass plate we know that φ_{t} = 0, φ_{r1} = π, φ_{r2} = 0, which conforms with Equation (28). A second example is the two-mirror resonator, see Section 2.2. If we consider the cavity as an optical ‘black box’, it also splits any incoming beam into a reflected and transmitted component, like a mirror or beam splitter. Further we know that a symmetric resonator must give the same results for fields injected from the left or from the right. Thus, the phase factors upon reflection must be equal φ_{r} = φ_{r1} = φ_{r2}. For two identical lossless mirrors with coefficients r and t, the reflection and transmission coefficients are given by Equations (7) and (8) as
\[ r_{\rm cav} = \frac{r - r \exp(-i 2kD)}{1 - r^2 \exp(-i 2kD)} \]
and
\[ t_{\rm cav} = \frac{-t^2 \exp(-i kD)}{1 - r^2 \exp(-i 2kD)}. \]
We demonstrate a simple case by putting the cavity on anti-resonance (kD = π/2), so that exp(−i2kD) = −1 and exp(−ikD) = −i. This yields
\[ r_{\rm cav} = \frac{2r}{1 + r^2}, \qquad t_{\rm cav} = \frac{i\, t^2}{1 + r^2}, \]
with r_{cav} being purely real and t_{cav} imaginary and thus φ_{t} = π/2 and φ_{r} = 0, which also agrees with Equation (28).
In most cases we neither know nor care about the exact phase factors. Instead we can pick any set which fulfills Equation (28). For this document we have chosen to use phase factors equal to those of the cavity, i.e., φ_{t} = π/2 and φ_{r} = 0, which is why we write the reflection and transmission at a mirror or beam splitter as
\[ E_{\rm refl} = r\, E_0 \qquad \text{and} \qquad E_{\rm trans} = i\, t\, E_0. \]
In this definition r and t are positive real numbers satisfying r^{2} +t^{2} = 1 for the lossless case.
Please note that we only have the freedom to choose convenient phase factors when we do not know or do not care about the details of the optical system that performs the beam splitting. If instead the details are important, for example when computing the properties of a thin coating layer, such as anti-reflection coatings, the proper phase factors for the respective interfaces must be computed and used.
Lengths and tunings: numerical accuracy of distances
The resonance condition inside an optical cavity and the operating point of an interferometer depend on the optical path lengths modulo the laser wavelength, i.e., for light from an Nd:YAG laser, length differences of less than 1 µm are of interest, not the full magnitude of the distances between optics. On the other hand, several parameters describing the general properties of an optical system, like the finesse or free spectral range of a cavity (see Section 5.1), depend on the macroscopic distance and do not change significantly when the distance is changed on the order of a wavelength. This illustrates that the distance between optical components might not be the best parameter to use for the analysis of optical systems. Furthermore, it turns out that in numerical algorithms the distance may suffer from rounding errors. Let us use the Virgo [56] arm cavities as an example to illustrate this. The cavity length is approximately 3 km, the wavelength is on the order of 1 µm, the mirror positions are actively controlled with a precision of better than 1 pm and the detector sensitivity can be as good as 10^{−18} m, measured on ∼ 10 ms timescales (i.e., many samples of the data acquisition rate). The floating point accuracy of common, fast numerical algorithms is typically not better than 10^{−15}. If we were to store the distance between the cavity mirrors as such a floating point number, the accuracy would be limited to 3 pm, which does not even cover the accuracy of the control systems, let alone the sensitivity.
A simple and elegant solution to this problem is to split a distance D between two optical components into two parameters [29]: one is the macroscopic ‘length’ L, defined as the multiple of a constant wavelength λ_{0} yielding the smallest difference to D. The second parameter is the microscopic tuning T that is defined as the remaining difference between L and D, i.e., D = L + T. Typically, λ_{0} can be understood as the wavelength of the laser in vacuum. However, if the laser frequency changes during the experiment or multiple light fields with different frequencies are used simultaneously, a default constant wavelength must be chosen arbitrarily. Please note that usually the term λ in any equation refers to the actual wavelength at the respective location as λ = λ_{0}/n with n the index of refraction of the local medium.
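The rounding problem can be made concrete with standard double-precision numbers. The Python sketch below estimates the gap between neighbouring representable numbers near 3000 m and shows that a displacement of 10^{−18} m is lost when added to the stored arm length, but is retained when stored as a separate small quantity. The helper `float_spacing` is a hypothetical name; the 3 km and 10^{−18} m values are the ones quoted in the text. Double precision is assumed here, which is somewhat better than the 10^{−15} relative accuracy quoted above but leads to the same conclusion.

```python
import math
import sys


def float_spacing(x):
    """Gap between x and the next larger double-precision number (for x > 0)."""
    _, exponent = math.frexp(x)
    return sys.float_info.epsilon * 2.0 ** (exponent - 1)


arm_length = 3000.0      # m, approximate Virgo arm length
displacement = 1e-18     # m, displacement scale quoted in the text

print("resolution of a stored 3 km length:", float_spacing(arm_length), "m")
print("displacement lost in the sum:", arm_length + displacement == arm_length)
print("displacement kept as separate value:", 0.0 + displacement == displacement)
```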
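We have seen in Section 2.1 that distances appear in the expressions for electromagnetic waves in connection with the wave number, for example,
\[ E = E_0 \exp(-i kz). \]
Thus, the difference in phase between the field at z = z_{1} and z = z_{1} + D is given as
\[ \varphi = -kD. \tag{34} \]
We recall that k = 2π/λ = ω/c. We can define ω_{0} = 2π c/λ_{0} and k_{0} = ω_{0}/c. For any given wavelength λ we can write the corresponding frequency as a sum of the default frequency and a difference frequency ω = ω_{0} + Δω. Using these definitions, we can rewrite Equation (34) with length and tuning as
\[ -\varphi = kD = \frac{\omega_0 L}{c} + \frac{\Delta\omega L}{c} + \frac{\omega_0 T}{c} + \frac{\Delta\omega T}{c}. \]
The first term of the sum is always a multiple of 2π, which is equivalent to zero. The last term of the sum is the smallest, approximately of the order Δω · 10^{−14}. For typical values of L ≈ 1 m, T < 1 µm and Δω < 2π · 100 MHz we find that
\[ \frac{\omega_0 L}{c} = 0 \ (\text{modulo } 2\pi), \qquad \frac{\Delta\omega L}{c} \lesssim 2, \qquad \frac{\omega_0 T}{c} \lesssim 6, \qquad \frac{\Delta\omega T}{c} \lesssim 2 \cdot 10^{-6}, \tag{36} \]
which shows that the last term can often be ignored.
We can also write the tuning directly as a phase. We define as the dimensionless tuning
\[ \phi = \frac{\omega_0 T}{c}. \]
This yields
\[ \frac{\omega}{c} T = \frac{\omega_0}{c} T \frac{\omega}{\omega_0} = \phi\, \frac{\omega}{\omega_0}. \]
The tuning ϕ is given in radian with 2π referring to a microscopic distance of one wavelength^{Footnote 2} λ_{0}.
Finally, we can write the following expression for the phase difference between the light field taken at the end points of a distance D:
\[ \varphi = -kD = -\left( \frac{\Delta\omega L}{c} + \phi\, \frac{\omega}{\omega_0} \right), \]
or if we neglect the last term from Equation (36) we can approximate (ω/ω_{0} ≈ 1) to obtain
\[ \varphi \approx -\left( \frac{\Delta\omega L}{c} + \phi \right). \]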
This convention provides two parameters L and ϕ that can describe distances with a markedly improved numerical accuracy. In addition, this definition often allows simplification of the algebraic notation of interferometer signals. By convention we associate a length L with the propagation through free space, whereas the tuning will be treated as a parameter of the optical components. Effectively the tuning then represents a microscopic displacement of the respective component. If, for example, a cavity is to be resonant to the laser light, the tunings of the mirrors have to be the same whereas the length of the space in between can be arbitrary.
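The length and tuning convention can be illustrated with a short Python sketch. The function `split_distance` follows the definition given above (L is the multiple of λ_0 closest to D, and the tuning is the remainder expressed as a phase), and `propagation_phase` evaluates the approximate phase −(Δω L/c + ϕ). Both function names and the example values are assumptions for illustration. Note that splitting an already stored floating point distance is itself subject to the rounding problem described above; in a numerical model L and ϕ would be stored as separate parameters from the start.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def split_distance(D, lambda0):
    """Split D into a length L (nearest multiple of lambda0) and a tuning phi in rad."""
    L = round(D / lambda0) * lambda0
    T = D - L                           # microscopic tuning as a distance
    phi = 2 * math.pi * T / lambda0     # equals omega0 * T / c
    return L, phi


def propagation_phase(L, phi, delta_omega=0.0):
    """Approximate phase -(delta_omega * L / c + phi), assuming omega/omega0 ~ 1."""
    return -(delta_omega * L / C + phi)


# Assumed example: Nd:YAG wavelength and a distance of 1000.25 wavelengths.
lambda0 = 1064e-9
D = 1000.25 * lambda0

L, phi = split_distance(D, lambda0)
print("L / lambda0 =", L / lambda0)
print("tuning in degrees =", math.degrees(phi))
print("phase at the default frequency =", propagation_phase(L, phi))
print("phase at 100 MHz offset =", propagation_phase(L, phi, 2 * math.pi * 100e6))
```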
Revised coupling matrices for space and mirrors
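Using the definitions for length and tunings we can rewrite the coupling equations for mirrors and spaces introduced in Section 2.1 as follows. The mirror coupling becomes (using ω/ω_{0} ≈ 1)
\[ \begin{pmatrix} a_2 \\ a_4 \end{pmatrix} = \begin{pmatrix} i t & r\, e^{-i 2\phi} \\ r\, e^{i 2\phi} & i t \end{pmatrix} \begin{pmatrix} a_1 \\ a_3 \end{pmatrix} \]
(compare this to Figure 6), and the amplitude coupling for a ‘space’, formally written as in Figure 7, is now written as
\[ \begin{pmatrix} a_2 \\ a_4 \end{pmatrix} = \begin{pmatrix} \exp(-i \Delta\omega L/c) & 0 \\ 0 & \exp(-i \Delta\omega L/c) \end{pmatrix} \begin{pmatrix} a_1 \\ a_3 \end{pmatrix}. \]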
Finesse examples
Mirror reflectivity and transmittance
We use Finesse to plot the amplitudes of the light fields transmitted and reflected by a mirror (given by a single surface). Initially, the mirror has a power reflectance and transmittance of R = T = 0.5 and is, thus, lossless. For the plot in Figure 13 we tune the transmittance from 0.5 to 0. Since we do not explicitly change the reflectivity, R remains at 0.5 and the mirror loss increases instead, which is shown by the trace labelled ‘total’ corresponding to the sum of the reflected and transmitted light power. The plot also shows the phase convention of a 90° phase shift for the transmitted light.
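The behaviour described in this example can also be reproduced without Finesse. The following Python sketch is not the Finesse input file; it simply applies the mirror convention of Section 2.1 (reflected amplitude r a, transmitted amplitude i t a) while the power transmittance T is stepped from 0.5 to 0 at fixed R = 0.5. The function name and the step size are assumptions.

```python
import cmath
import math


def mirror_outputs(R, T, a_in=1.0):
    """Reflected and transmitted amplitudes for power coefficients R and T."""
    r, t = math.sqrt(R), math.sqrt(T)
    return r * a_in, 1j * t * a_in


R = 0.5
for T in (0.5, 0.4, 0.3, 0.2, 0.1, 0.0):
    refl, trans = mirror_outputs(R, T)
    total = abs(refl)**2 + abs(trans)**2      # falls below 1 as the loss grows
    phase = math.degrees(cmath.phase(trans))  # 90 degrees while T > 0
    print(f"T = {T:.1f}: |refl| = {abs(refl):.3f}, |trans| = {abs(trans):.3f}, "
          f"total power = {total:.2f}, transmitted phase = {phase:.0f} deg")
```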
Finesse input file for ‘Mirror reflectivity and transmittance’
Length and tunings
This Finesse file demonstrates the conventions for lengths and microscopic positions introduced in Section 2.5. The top trace in Figure 14 depicts the phase change of a beam reflected by a beam splitter as a function of the beam splitter tuning. By changing the tuning from 0 to 180° the beam splitter is moved forward and shortens the path length by one wavelength, which by convention increases the light phase by 360°. On the other hand, if a length of a space is changed, the phase of the transmitted light is unchanged (for the default wavelength Δk = 0), as shown in the lower trace.
Finesse input file for ‘Length and tunings’
Light with Multiple Frequency Components
So far we have considered the electromagnetic field to be monochromatic. This has allowed us to compute light-field amplitudes in a quasi-static optical setup. In this section, we introduce the frequency of the light as a new degree of freedom. In fact, we consider a field consisting of a finite and discrete number of frequency components. We write this as
with complex amplitude factors a_{j}, ω_{j} as the angular frequency of the light field and k_{j} = ω_{j}/c. In many cases the analysis compares different fields at one specific location only, in which case we can set z = 0 and write
In the following sections the concept of light modulation is introduced. As this inherently involves light fields with multiple frequency components, it makes use of this type of field description. Again we start with the two-mirror cavity to illustrate how the concept of modulation can be used to model the effect of mirror motion.
Modulation of light fields
Laser interferometers typically use three different types of light fields: the laser with a frequency of, for example, f ≈ 2.8 · 10^{14} Hz, radio frequency (RF) sidebands used for interferometer control with frequencies (offset to the laser frequency) of f ≈ 1 · 10^{6} to 150 · 10^{6} Hz, and the signal sidebands at frequencies of 1 to 10,000 Hz^{Footnote 3}. As these modulations usually have as their origin a change in optical path length, they are often phase modulations of the laser frequency. The RF sidebands are utilised for optical readout purposes, while the signal sidebands carry the signal to be measured (the gravitational-wave signal plus noise created in the interferometer).
Figure 15 shows a time domain representation of an electromagnetic wave of frequency ω_{0}, whose amplitude or phase is modulated at a frequency Ω. One can easily see some characteristics of these two types of modulation, for example, that amplitude modulation leaves the zero crossing of the wave unchanged whereas with phase modulation the maximum and minimum amplitude of the wave remains the same. In the frequency domain, in which a modulated field is expanded into several unmodulated field components, the interpretation of modulation becomes even easier: any sinusoidal modulation of amplitude or phase generates new field components, which are shifted in frequency with respect to the initial field. Basically, light power is shifted from one frequency component, the carrier, to several others, the sidebands. The relative amplitudes and phases of these sidebands differ for different types of modulation and different modulation strengths. This section demonstrates how to compute the sideband components for amplitude, phase and frequency modulation.
Phase modulation
Phase modulation can create a large number of sidebands. The number of sidebands with noticeable power depends on the modulation strength (or depth) given by the modulation index m. Assuming an input field
a sinusoidal phase modulation of the field can be described as
This equation can be expanded using the identity [27]
with Bessel functions of the first kind J_{k}(m). We can write
The field for k = 0, oscillating with the frequency of the input field ω_{0}, represents the carrier. The sidebands can be divided into upper (k > 0) and lower (k < 0) sidebands. These sidebands are light fields that have been shifted in frequency by k Ω. The upper and lower sidebands with the same absolute value of k are called a pair of sidebands of order k. Equation (46) shows that the carrier is surrounded by an infinite number of sidebands. However, for small modulation indices (m < 1) the Bessel functions rapidly decrease with increasing k (the lowest orders of the Bessel functions are shown in Figure 16). For small modulation indices we can use the approximation [2]
In which case, only a few sidebands have to be taken into account. For m ≪ 1 we can write
and with
we obtain
as the first-order approximation in m. In the above equation the carrier field remains unchanged by the modulation, therefore this approximation is not the most intuitive. It is clearer if the approximation up to the second order in m is given:
which shows that power is transferred from the carrier to the sideband fields.
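The sideband expansion can be checked numerically. The following Python sketch assumes that the phase-modulated field has the envelope exp(i m cos(Ωt)) and that the expansion takes the form Σ_k i^k J_k(m) exp(i kΩt); both forms, as well as the function names and the simple power-series evaluation of J_k, are assumptions made for this illustration.

```python
import cmath
import math


def bessel_j(k, m, terms=30):
    """Bessel function of the first kind J_k(m) for integer k (power series)."""
    if k < 0:
        return (-1) ** (-k) * bessel_j(-k, m, terms)  # J_{-k} = (-1)^k J_k
    return sum(
        (-1) ** n / (math.factorial(n) * math.factorial(n + k)) * (m / 2) ** (2 * n + k)
        for n in range(terms)
    )


def sideband_sum(m, phase, order):
    """Carrier plus sidebands up to the given order, evaluated at phase = Omega * t."""
    return sum(
        (1j ** k) * bessel_j(k, m) * cmath.exp(1j * k * phase)
        for k in range(-order, order + 1)
    )


m = 0.5
exact = cmath.exp(1j * m * math.cos(0.7))
print(abs(sideband_sum(m, 0.7, 1) - exact))   # first-order sidebands only: small residual
print(abs(sideband_sum(m, 0.7, 10) - exact))  # ten orders: residual at rounding level
print(sum(bessel_j(k, m) ** 2 for k in range(-10, 11)))  # total power stays at one
```

For small m the carrier amplitude J_{0}(m) ≈ 1 − m²/4 is slightly reduced, and the missing power appears in the sidebands with J_{±1}(m) ≈ ±m/2.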
Higher-order expansions in m can be performed simply by specifying the highest order of Bessel function, which is to be used in the sum in Equation (46), i.e.,
Frequency modulation
For small modulation indices, phase modulation and frequency modulation can be understood as different descriptions of the same effect [29]. Following the same spirit as above we would assume a modulated frequency to be given by
and then we might be tempted to write
which would be wrong. The angular frequency of a wave is actually defined as the time derivative of its phase, ω = 2πf = dφ/dt. Thus, to obtain the frequency given in Equation (53), we need to have a phase of
For consistency with the notation for phase modulation, we define the modulation index to be
with Δω as the frequency swing — how far the frequency is shifted by the modulation — and Ω the modulation frequency — how fast the frequency is shifted. Thus, a sinusoidal frequency modulation can be written as
which is exactly the same expression as Equation (44) for phase modulation. The practical difference is the typical size of the modulation index, with phase modulation having a modulation index of m < 10, while for frequency modulation, typical numbers might be m > 10^{4}. Thus, in the case of frequency modulation, the approximations for small m are not valid. The series expansion using Bessel functions, as in Equation (46), can still be performed, however, very many terms of the resulting sum need to be taken into account.
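The difference between the tempting and the consistent phase can be verified with a finite-difference derivative. The Python sketch below uses deliberately low, purely illustrative frequencies (not optical ones) so that the numbers stay well within double precision; the function names and the explicit form of the two phases are assumptions based on the argument above.

```python
import math

OMEGA0 = 2 * math.pi * 1000.0     # rad/s, carrier (illustrative)
DELTA_OMEGA = 2 * math.pi * 50.0  # rad/s, frequency swing (illustrative)
OMEGA_MOD = 2 * math.pi * 10.0    # rad/s, modulation frequency (illustrative)


def target_frequency(t):
    """Desired modulated angular frequency omega0 + delta_omega * cos(Omega t)."""
    return OMEGA0 + DELTA_OMEGA * math.cos(OMEGA_MOD * t)


def naive_phase(t):
    """Tempting but wrong: multiply the modulated frequency by t."""
    return target_frequency(t) * t


def correct_phase(t):
    """Time integral of the modulated frequency."""
    return OMEGA0 * t + (DELTA_OMEGA / OMEGA_MOD) * math.sin(OMEGA_MOD * t)


def instantaneous_frequency(phase, t, h=1e-6):
    """Central-difference estimate of d(phase)/dt."""
    return (phase(t + h) - phase(t - h)) / (2 * h)


modulation_index = DELTA_OMEGA / OMEGA_MOD  # m = delta_omega / Omega
t = 0.0375
print(modulation_index)
print(target_frequency(t))
print(instantaneous_frequency(correct_phase, t))  # agrees with the target
print(instantaneous_frequency(naive_phase, t))    # contains an extra term growing with t
```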
Amplitude modulation
In contrast to phase modulation, (sinusoidal) amplitude modulation always generates exactly two sidebands. Furthermore, a natural maximum modulation index exists: the modulation index is defined to be one (m = 1) when the amplitude is modulated between zero and the amplitude of the unmodulated field.
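That sinusoidal amplitude modulation produces exactly two sidebands can be checked with a discrete Fourier sum. The Python sketch below assumes an envelope of the form 1 − (m/2)(1 − cos(Ωt)), which swings between 1 − m and 1 and therefore matches the definition of m = 1 given above; the envelope form and the function names are assumptions made for this illustration.

```python
import cmath
import math


def am_envelope(m):
    """Assumed envelope 1 - m/2 * (1 - cos(x)), with x = Omega * t."""
    return lambda x: 1.0 - 0.5 * m * (1.0 - math.cos(x))


def fourier_coefficient(envelope, k, samples=64):
    """Amplitude of the component at k * Omega over one modulation period."""
    total = 0j
    for n in range(samples):
        x = 2 * math.pi * n / samples
        total += envelope(x) * cmath.exp(-1j * k * x)
    return total / samples


env = am_envelope(1.0)
for k in range(-3, 4):
    print(k, round(abs(fourier_coefficient(env, k)), 12))
```

Only the carrier (k = 0, amplitude 1 − m/2) and the two first-order sidebands (k = ±1, amplitude m/4) are non-zero.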
If the amplitude modulation is performed by an active element, for example by modulating the current of a laser diode, the following equation can be used to describe the output field:
However, passive amplitude modulators (like acousto-optic modulators or electro-optic modulators with polarisers) can only reduce the amplitude. In these cases, the following equation is more useful:
Sidebands as phasors in a rotating frame
A common method of visualising the behaviour of sideband fields in interferometers is to use phase diagrams in which each field amplitude is represented by an arrow in the complex plane.
We can think of the electric field amplitude E_{0} exp(i ω_{0}t) as a vector in the complex plane, rotating around the origin with angular velocity ω_{0}. To illustrate or to help visualise the addition of several light fields it can be useful to look at this problem using a rotating reference frame, defined as follows. A complex number shall be defined as z = x + iy so that the real part is plotted along the x-axis, while the y-axis is used for the imaginary part. We want to construct a new coordinate system (x′, y′) in which the field vector is at a constant position. This can be achieved by defining
or
Figure 17 illustrates how the transition into the rotating frame makes the field vector appear stationary. The angle of the field vector in a rotating frame depicts the phase offset of the field. Therefore these vectors are also called phasors and the illustrations using phasors are called phasor diagrams. Two more complex examples of how phasor diagrams can be employed are shown in Figure 18 [11].
Phasor diagrams can be especially useful to see how frequency coupling of light field amplitudes can change the type of modulation, for example, to turn phase modulation into amplitude modulation. An extensive introduction to this type of phasor diagram can be found in [39].
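The transition into the rotating frame can be sketched numerically. The Python example below assumes that the transformation amounts to multiplying the complex field by exp(−i ω_{0}t); the frequencies, amplitudes and function names are illustrative assumptions.

```python
import cmath
import math

OMEGA0 = 2 * math.pi * 5.0     # rad/s, carrier (illustrative)
OMEGA_MOD = 2 * math.pi * 0.5  # rad/s, sideband offset (illustrative)
PHASE_OFFSET = math.pi / 6     # rad, phase offset of the carrier


def carrier(t):
    return cmath.exp(1j * (OMEGA0 * t + PHASE_OFFSET))


def upper_sideband(t):
    return 0.1 * cmath.exp(1j * (OMEGA0 + OMEGA_MOD) * t)


def to_rotating_frame(z, t):
    """Assumed transformation: remove the rotation at the carrier frequency."""
    return z * cmath.exp(-1j * OMEGA0 * t)


for t in (0.0, 0.13, 0.5):
    c = to_rotating_frame(carrier(t), t)
    s = to_rotating_frame(upper_sideband(t), t)
    print(f"t={t:.2f}  carrier angle={cmath.phase(c):.4f}  sideband angle={cmath.phase(s):.4f}")
```

In the rotating frame the carrier phasor stays at the angle given by its phase offset, while the upper sideband phasor rotates slowly with the offset frequency.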
Phase modulation through a moving mirror
Several optical components can modulate transmitted or reflected light fields. In this section we discuss in detail the example of phase modulation by a moving mirror. Mirror motion does not change the transmitted light; however, the phase of the reflected light will be changed as shown in Equation (11).
We assume a sinusoidal change of the mirror’s tuning as shown in Figure 19. The position modulation is given as x_{m} = a_{s} cos(ω_{s}t + φ_{s}), and thus the reflected field at the mirror becomes (assuming a_{4} = 0)
setting m = 2k_{0}a_{s}. This can be expressed as
Coupling matrices for beams with multiple frequency components
The coupling between electromagnetic fields at optical components introduced in Section 2 referred only to the amplitude and phase of a simplified monochromatic field, ignoring all the other parameters of the electric field of the beam given in Equation (1). However, this mathematical concept can be extended to include other parameters provided that we can find a way to describe the total electric field as a sum of components, each of which is characterised by a discrete value of the related parameters. In the case of the frequency of the light field, this means we have to describe the field as a sum of monochromatic components. In the previous sections we have shown how this could be done in the special case of an initial monochromatic field that is subject to modulation: if the modulation index is small enough we can limit the number of frequency components that we need to consider. In many cases it is actually sufficient to describe a modulation only by the interaction of the carrier at ω_{0} (the unmodulated field) and two sidebands with a frequency offset of ±ω_{m} to the carrier. A beam given by the sum of three such components can be described by a complex vector:
with ω_{1} = ω_{0} − ω_{m} and ω_{2} = ω_{0} + ω_{m}. In the case of a phase modulator that applies a modulation of small modulation index m to an incoming light field \({{\vec a}_1}\), we can describe the coupling of the frequency component as follows:
which can be written in matrix form:
And similarly, we can write the complete coupling matrix for the modulator component, for example, as
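As a numerical illustration of such a frequency-coupling matrix, the following Python sketch applies a truncated modulator matrix to a purely monochromatic input. The ordering of the vector (carrier, lower sideband, upper sideband), the factors of i and the small-index approximations J_{0}(m) ≈ 1 − m²/4 and J_{1}(m) ≈ m/2 are assumptions made for this sketch and may differ from the sign and phase conventions of the equations in the text.

```python
def modulator_matrix(m):
    """3x3 coupling between (carrier, lower sideband, upper sideband).

    Assumes small m and neglects couplings of second order (sideband to sideband).
    """
    j0 = 1.0 - m ** 2 / 4.0  # approximate J_0(m)
    j1 = m / 2.0             # approximate J_1(m)
    return [
        [j0, 1j * j1, 1j * j1],
        [1j * j1, j0, 0.0],
        [1j * j1, 0.0, j0],
    ]


def apply(matrix, vector):
    """Matrix-vector product for plain Python lists."""
    return [sum(row[c] * vector[c] for c in range(len(vector))) for row in matrix]


a_in = [1.0, 0.0, 0.0]  # carrier only
a_out = apply(modulator_matrix(0.1), a_in)
power = sum(abs(a) ** 2 for a in a_out)
print(a_out)
print(power)  # close to one; the deviation is of order m^4
```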
Finesse examples
Modulation index
This file demonstrates the use of a modulator. Phase modulation (with up to five higher harmonics) is applied to a laser beam and amplitude detectors are used to measure the field at the first three harmonics. Compare this to Figure 16 as well.
Finesse input file for ‘Modulation index’
Mirror modulation
Finesse offers two different types of modulators: the ‘modulator’ component shown in the example above, and the ‘fsig’ command, which can be used to apply a signal modulation to existing optical components. The main difference is that ‘fsig’ is meant to be used for transfer function computations. Consequently Finesse discards all nonlinear terms, which means that the sideband amplitude is proportional to the signal amplitude and harmonics are not created.
Finesse input file for ‘Mirror modulation’
Optical Readout
In previous sections we have dealt with the amplitude of light fields directly and also used the amplitude detector in the Finesse examples. This is the advantage of a mathematical analysis versus experimental tests, in which only light intensity or light power can be measured directly. This section gives the mathematical details for modelling photo detectors.
The intensity of a field impinging on a photo detector is given as the magnitude of the Poynting vector, with the Poynting vector given as [58]
Inserting the electric and magnetic components of a plane wave, we obtain
with ϵ_{0} the electric permittivity of vacuum and c the speed of light.
The response of a photo detector is given by the total flux of effective radiation^{Footnote 4} during the response time of the detector. For example, in a photo diode a photon will release a charge in the n-p junction. The response time is given by the time it takes for the charge to travel through the detector (and further time may be taken up in the electronic processing of the signal). The size of the photodiode and the applied bias voltage determine the travel time of the charges with typical values of approximately 10 ns. Thus, frequency components faster than perhaps 100 MHz are not resolved by a standard photodiode. For example, a laser beam with a wavelength of λ = 1064 nm has a frequency of f = c/λ ≈ 282 · 10^{12} Hz = 282 THz. Thus, the 2ω component is much too fast for the photo detector; instead, it returns the average power
In complex notation we can write
However, for more intuitive results the light fields can be given in converted units, so that the light power can be computed as the square of the light field amplitudes. Unless otherwise noted, throughout this work the unit of light field amplitudes is \(\sqrt {{\rm{watt}}}\). Thus, the notation used in this document to describe the computation of the light power of a laser beam is
Detection of optical beats
What is usually called an optical beat or simply a beat is the sinusoidal behaviour of the intensity of two overlapping and coherent fields. For example, if we superpose two fields of slightly different frequency, we obtain
with ω_{+} = ω_{1} + ω_{2} and ω_{−} = ω_{1} − ω_{2}. In this equation the frequency ω_{−} can be very small and can then be detected with the photodiode as illustrated in Figure 22.
Using the same example photodiode as before: in order to be able to detect an optical beat, ω_{−} would need to be smaller than 100 MHz. If we take two slightly detuned Nd:YAG lasers with f = 282 THz, this means that the relative detuning of these lasers must be smaller than 100 MHz / 282 THz ≈ 3.5 · 10^{−7}, i.e., of the order of 10^{−7}.
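The beat signal and the detuning limit can be reproduced with a few lines of Python. The sketch works with complex amplitudes in the frame of the first field, so the fast sum-frequency term is already averaged out; the amplitudes, the beat frequency of 10 kHz and the function names are illustrative assumptions.

```python
import cmath
import math


def photodiode_power(a1, a2, delta_omega, t):
    """Slowly varying power |a1 + a2 exp(i delta_omega t)|^2 of two superposed fields."""
    return abs(a1 + a2 * cmath.exp(1j * delta_omega * t)) ** 2


def max_relative_detuning(bandwidth_hz, optical_hz):
    """Largest relative frequency difference whose beat the detector still resolves."""
    return bandwidth_hz / optical_hz


a1 = a2 = math.sqrt(0.5)          # amplitudes in sqrt(W)
delta_omega = 2 * math.pi * 10e3  # rad/s, beat frequency (illustrative)
period = 2 * math.pi / delta_omega
samples = [photodiode_power(a1, a2, delta_omega, n * period / 100) for n in range(100)]

print(max(samples), min(samples))          # the beat swings between (a1 + a2)^2 and (a1 - a2)^2
print(sum(samples) / len(samples))         # the mean equals |a1|^2 + |a2|^2
print(max_relative_detuning(100e6, 282e12))
```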
In general, for a field with several frequency components, the photodiode signal can be written as
For example, if the photodiode signal is filtered with a low-pass filter, such that only the DC part remains, we can compute the resulting signal by looking for all components without frequency dependence. The frequency dependence vanishes when the frequency becomes zero, i.e., in all parts of Equation (75) with ω_{i} = ω_{j}. The output is a real number, calculated like this:
Signal demodulation
A typical application of light modulation is its use in a modulation-demodulation scheme, which applies an electronic demodulation to a photodiode signal. A ‘demodulation’ of a photodiode signal at a user-defined frequency ω_{x}, performed by an electronic mixer and a low-pass filter, produces a signal, which is proportional to the amplitude of the photo current at DC and at the frequency ω_{x}. Interestingly, by using two mixers with different phase offsets one can also reconstruct the phase of the signal, or to be precise the phase difference of the light at ω_{0} ± ω_{x} with respect to the carrier light. This feature can be very powerful for generating interferometer control signals.
Mathematically, the demodulation process can be described by a multiplication of the output with a cosine, cos(ω_{x}t + φ_{x}), where φ_{x} is the demodulation phase; this cosine is also called the ‘local oscillator’. After the multiplication, only the DC part of the result is taken into account. The signal is
Multiplied with the local oscillator it becomes
With \({A_{ij}} = {a_i}a_j^{\ast}\) and \({e^{{\rm{i}}{\omega _{ij}}\,t}} = {e^{{\rm{i}}({\omega _i} - {\omega _j})\,t}}\) we can write
When looking for the DC components of S_{1} we get the following [20]:
This would be the output of a mixer and a subsequent low-pass filter. The results for φ_{x} = 0 and φ_{x} = π/2 are called in-phase and in-quadrature, respectively (or also first and second quadrature). They are given by
If only one mixer is used, the output is always real and is determined by the demodulation phase. However, with two mixers generating the in-phase and in-quadrature signals, it is possible to construct a complex number representing the signal amplitude and phase:
Often several sequential demodulations are applied in order to measure very specific phase information. For example, a double demodulation can be described as two sequential multiplications of the signal with two local oscillators and taking the DC component of the result. First looking at the whole signal, we can write:
This can be written as
and thus reduced to two single demodulations. Since we now only care for the DC component we can use the expression from above (Equation (82)). These two demodulations give two complex numbers:
The demodulation phases are applied as follows to get a real output (two sequential mixers)
In a typical setup, a user-defined demodulation phase for the first frequency (here φ_{x}) is given. If two mixers are used for the second demodulation, we can reconstruct the complex number
More demodulations can also be reduced to single demodulations as above.
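A single demodulation with two mixers can be sketched in Python as follows. The photodiode signal is modelled as a DC level plus one oscillating term; the mixer is a multiplication with the local oscillator, and the low-pass filter is idealised as an average over an integer number of periods. The signal parameters, the function names and the sign convention used to combine the two quadratures are assumptions made for this illustration.

```python
import cmath
import math


def demodulate(signal, omega_x, phase_x, periods=10, samples_per_period=200):
    """Mix signal(t) with cos(omega_x t + phase_x) and average (ideal low-pass)."""
    n_total = periods * samples_per_period
    dt = 2 * math.pi / omega_x / samples_per_period
    acc = 0.0
    for n in range(n_total):
        t = n * dt
        acc += signal(t) * math.cos(omega_x * t + phase_x)
    return acc / n_total


def iq_demodulate(signal, omega_x):
    """Combine the in-phase and in-quadrature outputs into one complex number."""
    in_phase = demodulate(signal, omega_x, 0.0)
    in_quadrature = demodulate(signal, omega_x, math.pi / 2)
    return complex(in_phase, in_quadrature)


omega_x = 2 * math.pi * 10e3  # rad/s, demodulation frequency (illustrative)


def photodiode_signal(t):
    """DC level plus a beat term with amplitude 0.4 and phase 0.3 rad (illustrative)."""
    return 1.0 + 0.4 * math.cos(omega_x * t + 0.3)


z = iq_demodulate(photodiode_signal, omega_x)
print(abs(z), cmath.phase(z))  # half the beat amplitude and the beat phase
```

The DC level of the photodiode signal does not contribute, because its product with the local oscillator averages to zero.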
Finesse examples
Optical beat
In this example two laser beams are superimposed at a 50:50 beam splitter. The beams have a slightly different frequency: the second beam has a 10 kHz offset with respect to the first (and to the default laser frequency). The plot illustrates the output of four different detectors in one of the beam splitter output ports, while the phase of the second beam is tuned from 0° to 180°. The photodiode ‘pd1’ shows the total power remaining constant at 1. The amplitude detectors ‘ad1’ and ‘ad10k’ detect the laser light at 0 Hz (default frequency) and 10 kHz respectively. Both show a constant absolute value of \(\sqrt {1/2}\) and the detector ‘ad10k’ tracks the tuning of the phase of the second laser beam. Finally, the detector ‘pd10k’ resembles a photodiode with demodulation at 10 kHz. In fact, this represents a photodiode and two mixers used to reconstruct a complex number as shown in Equation (82). One can see that the phase of the resulting electronic signal also directly follows the phase difference between the two laser beams.
Finesse input file for ‘Optical beat’
Basic Interferometers
The large interferometric gravitational-wave detectors currently in operation are based on two fundamental interferometer topologies: the Fabry-Pérot and the Michelson interferometer. The main instrument is very similar to the original interferometer concept used in the famous experiment by Michelson and Morley, published in 1887 [42]. The main difference is that modern instruments use laser light to illuminate the interferometer to achieve much higher accuracy. Already the first prototype by Forward and Weiss has thus achieved a sensitivity a million times better than Michelson’s original instrument [18]. In addition, in current gravitational-wave detectors, the Michelson interferometer has been enhanced by resonant cavities, which in turn have been derived from the original idea for a spectroscopy standard published by Fabry and Pérot in 1899 [16]. The following section will describe the fundamental properties of the Fabry-Pérot interferometer and the Michelson interferometer. A thorough understanding of these basic instruments is essential for the study of the high-precision interferometers used for gravitational-wave detection.
The two-mirror cavity: a Fabry-Pérot interferometer
We have computed the field amplitudes in a linear two-mirror cavity, also called Fabry-Pérot interferometer, in Section 2.2. In order to understand the features of this optical instrument it is of interest to have a closer look at the power circulation in the cavity. A typical optical layout is shown in Figure 24: two parallel mirrors form the Fabry-Pérot cavity. A laser beam is injected through the first mirror (at normal incidence).
The behaviour of the (ideal) cavity is determined by the length of the cavity L, the wavelength of the laser λ and the reflectivity and transmittance of the mirrors. Assuming an input power of a_{0}^{2} = 1, we obtain
with k = 2π/λ, P, T = t^{2} and R = r^{2}, as defined in Section 1.4. Similarly we could compute the transmission of the optical system as the input-output ratio of the field amplitudes. For example,
is the frequency-dependent transfer function of the cavity in transmission (the frequency dependency is hidden inside the k = 2πf/c).
Figure 25 shows a plot of the circulating light power as a function of the laser frequency. The maximum power is reached when the cosine function in the denominator becomes equal to one, i.e., at kL = Nπ with N an integer. This is called the cavity resonance. The lowest power values are reached at anti-resonance when kL = (N + 1/2)π. We can also rewrite
with FSR being the free spectral range of the cavity as shown in Figure 25. Thus, it becomes clear that resonance is reached for laser frequencies
where N is an integer.
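The resonance structure can be explored with a short Python sketch. It assumes lossless mirrors (R = 1 − T), a circulating power of the form P_{1} = T_{1}/(1 + R_{1}R_{2} − 2r_{1}r_{2} cos(2kL)) for unit input power, and a free spectral range FSR = c/(2L); the explicit formulas, the mirror parameters, the cavity length and the function names are assumptions made for this illustration.

```python
import math

C = 299_792_458.0  # m/s, speed of light


def circulating_power(round_trip_phase, T1, T2):
    """Circulating power for unit input power; round_trip_phase stands for 2kL."""
    R1, R2 = 1.0 - T1, 1.0 - T2
    r1r2 = math.sqrt(R1 * R2)
    return T1 / (1.0 + R1 * R2 - 2.0 * r1r2 * math.cos(round_trip_phase))


def free_spectral_range(length):
    """Frequency spacing between neighbouring resonances, FSR = c / (2L)."""
    return C / (2.0 * length)


def round_trip_phase(frequency_offset, length):
    """Change of 2kL when the laser is shifted away from a resonance."""
    return 2.0 * math.pi * frequency_offset / free_spectral_range(length)


length = 3000.0  # m (illustrative)
fsr = free_spectral_range(length)
for offset in (0.0, 0.5 * fsr, fsr):
    p = circulating_power(round_trip_phase(offset, length), 0.1, 0.1)
    print(f"offset = {offset:10.1f} Hz   circulating power = {p:.4f}")
```

The power is maximal at offsets of N · FSR and minimal halfway between two resonances.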
Another characteristic parameter of a cavity is its linewidth, usually given as full width at half maximum (FWHM) or its pole frequency, f_{p}. In order to compute the linewidth we have to ask at which frequency the circulating power becomes half the maximum:
This results in the following expression for the full linewidth:
The ratio of the free spectral range to the linewidth is called the finesse of a cavity:
In the case of high finesse, i.e., r_{1} and r_{2} are close to 1 we can use the fact that the argument of the arcsin function is small and make the approximation
The behaviour of a two mirror cavity depends on the length of the cavity (with respect to the frequency of the laser) and on the reflectivities of the mirrors. Regarding the mirror parameters one distinguishes three cases^{Footnote 5}:

when T_{1} < T_{2} the cavity is called undercoupled

when T_{1} = T_{2} the cavity is called impedance matched

when T_{1} > T_{2} the cavity is called overcoupled
The differences between these three cases can seem subtle mathematically but have a strong impact on the application of cavities in laser systems. One of the main differences is the phase evolution of the light fields, which is shown in Figure 26. The circulating power shows that the resonance effect is better used in overcoupled cavities; this is illustrated in Figure 27, which shows the transmitted and circulating power for the three different cases. Only in the impedance-matched case can the cavity transmit (on resonance) all the incident power. Given the same total transmission T_{1} + T_{2}, the overcoupled case allows for the largest circulating power and thus a stronger ‘resonance effect’ of the cavity, for example, when the cavity is used as a mode filter. Hence, most commonly used cavities are impedance matched or overcoupled.
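The three coupling cases and the finesse can be compared numerically. The Python sketch below assumes lossless mirrors, unit input power, on-resonance powers P_{circ} = T_{1}/(1 − r_{1}r_{2})² and P_{trans} = T_{1}T_{2}/(1 − r_{1}r_{2})², and a finesse obtained from the half-maximum condition of the circulating power; these explicit formulas, the mirror transmittances and the function names are assumptions made for this illustration.

```python
import math


def on_resonance_powers(T1, T2):
    """Circulating and transmitted power on resonance for unit input power."""
    r1r2 = math.sqrt((1.0 - T1) * (1.0 - T2))
    circulating = T1 / (1.0 - r1r2) ** 2
    transmitted = T1 * T2 / (1.0 - r1r2) ** 2
    return circulating, transmitted


def finesse_exact(T1, T2):
    """FSR / FWHM from the half-maximum condition of the circulating power."""
    r1r2 = math.sqrt((1.0 - T1) * (1.0 - T2))
    return math.pi / (2.0 * math.asin((1.0 - r1r2) / (2.0 * math.sqrt(r1r2))))


def finesse_approx(T1, T2):
    """High-finesse approximation using arcsin(x) ~ x."""
    r1r2 = math.sqrt((1.0 - T1) * (1.0 - T2))
    return math.pi * math.sqrt(r1r2) / (1.0 - r1r2)


# Same total transmission T1 + T2 = 0.1 in all three cases.
cases = {
    "undercoupled": (0.01, 0.09),
    "impedance matched": (0.05, 0.05),
    "overcoupled": (0.09, 0.01),
}
for name, (T1, T2) in cases.items():
    circ, trans = on_resonance_powers(T1, T2)
    print(f"{name:18s} circulating = {circ:6.2f}  transmitted = {trans:5.3f}  "
          f"finesse = {finesse_exact(T1, T2):6.2f}")
```

With these numbers only the impedance-matched cavity transmits all the incident power, while the overcoupled cavity reaches the largest circulating power.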
Michelson interferometer
We came across the Michelson interferometer in Section 2.4 when we discussed the phase relation at a beam splitter. The typical optical layout of the Michelson interferometer is shown again in Figure 28: a laser beam is split by a beam splitter and sent along two perpendicular interferometer arms. The four directions seen from the beam splitter are called North, East, West and South. The ends of these arms (North and East) are marked by highly reflective end mirrors, which reflect the beams back into themselves so that they can be recombined by the beam splitter. Generally, the Michelson interferometer has two outputs, namely the so far unused beam splitter port (South) and the input port (West). Both output ports can be used to obtain interferometer signals, however, most setups are designed such that the signals with high signal-to-noise ratios are detected in the South port.
The Michelson interferometer output is determined by the laser wavelength λ, the reflectivity and transmittance of the beam splitter and the end mirrors, and the length of the interferometer arms. In many cases the end mirrors are highly reflective and the beam splitter ideally a 50:50 beam splitter. In that case, we can compute the output for a monochromatic field as shown in Section 2.4. Using Equation (20) we can write the field in the South port as
We define the common arm length and the arm-length difference as
which yield \(2{L_N} = 2\bar L + \Delta L\) and \(2{L_E} = 2\bar L - \Delta L\). Thus, we can further simplify to get
The photo detector then produces a signal proportional to
This signal is depicted in Figure 29; it shows that the power in the South port changes between zero and the input power with a period of ΔL/λ = 0.5. The tuning at which the output power drops to zero is called the dark fringe. Current interferometric gravitational-wave detectors operate their Michelson interferometer at or near the dark fringe.
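The fringe pattern can be reproduced with a minimal Python sketch. It assumes a symmetric, lossless Michelson interferometer whose South-port power is P_{0} cos²(2πΔL/λ), so that ΔL = 0 corresponds to a bright fringe; this explicit form, the wavelength of 1064 nm and the function name are assumptions made for this illustration.

```python
import math

WAVELENGTH = 1064e-9  # m; assumed laser wavelength


def south_port_power(delta_l, input_power=1.0):
    """South-port power of an ideal Michelson as a function of the arm-length difference."""
    return input_power * math.cos(2.0 * math.pi * delta_l / WAVELENGTH) ** 2


for fraction in (0.0, 0.125, 0.25, 0.375, 0.5):
    power = south_port_power(fraction * WAVELENGTH)
    print(f"delta_L / lambda = {fraction:5.3f}   power = {power:.3f}")
```

The output repeats after ΔL/λ = 0.5, and the dark fringe is reached at ΔL/λ = 0.25.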
The above seems to indicate that the macroscopic arm-length difference plays no role in the Michelson output signal. However, this is only correct for a monochromatic laser beam with infinite coherence length. In real interferometers care must be taken that the arm-length difference is well below the coherence length of the light source. In gravitational-wave detectors the macroscopic arm-length difference is an important design feature; it is kept very small in order to reduce coupling of laser noise into the output but needs to retain a finite size to allow the transfer of phase modulation sidebands from the input to the output port; this is illustrated in the Finesse example below and will be covered in detail in Section 6.4.
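The role of the macroscopic arm-length difference for modulation sidebands can be estimated with a short Python sketch. It assumes an ideal Michelson interferometer whose South-port field is proportional to cos(kΔL), held on the dark fringe for the carrier; a sideband offset by the modulation frequency f_{m} then sees a slightly different k, and the transmitted power fraction becomes sin²(2πf_{m}ΔL/c). This expression, the modulation frequency of 10 MHz and the function names are assumptions made for this illustration.

```python
import math

C = 299_792_458.0  # m/s, speed of light


def sideband_power_at_dark_fringe(mod_freq_hz, delta_l):
    """Fraction of sideband power reaching the South port while the carrier is dark."""
    return math.sin(2.0 * math.pi * mod_freq_hz * delta_l / C) ** 2


def delta_l_for_full_transmission(mod_freq_hz):
    """Smallest arm-length difference that transmits the sidebands completely."""
    return C / (4.0 * mod_freq_hz)


f_mod = 10e6  # Hz (illustrative)
for delta_l in (0.0, 0.1, 1.0):
    print(f"delta_L = {delta_l:4.1f} m   sideband fraction = "
          f"{sideband_power_at_dark_fringe(f_mod, delta_l):.5f}")
print(delta_l_for_full_transmission(f_mod))
```

For a perfectly symmetric interferometer (ΔL = 0) no sideband power leaves through the South port; with ΔL = 1 m a few per cent of the 10 MHz sideband power is transmitted.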
Finesse examples
Michelson power
The power in the South port of a Michelson detector varies as the cosine squared of the microscopic arm length difference. The maximum output can be equal to the input power, but only if the Michelson interferometer is symmetric and lossless. The tuning for which the South port power is zero is referred to as the dark fringe.
Finesse input file for ‘Michelson power’
Michelson modulation
This example demonstrates how a macroscopic arm length difference can cause different ‘dark fringe’ tuning for injected fields with different frequencies. In this case, some of the 10 MHz modulation sidebands are transmitted when the interferometer is tuned to a dark fringe for the carrier light. This effect can be used to separate light fields of different frequencies. It is also the cause for transmission of laser noise (especially frequency noise) into the Michelson output port when the interferometer is not perfectly symmetric.
Finesse input file for ‘Michelson modulation’
Interferometric Length Sensing and Control
In this section we introduce interferometers as length sensing devices. In particular, we explain how the Fabry-Pérot interferometer and the Michelson interferometer can be used for high-precision measurements and that both require a careful control of the base length (which is to be measured) in order to yield their large sensitivity. In addition, we briefly introduce the general concepts of error signals and transfer functions, which are used to describe most essential features of length sensing and control.
Error signals and transfer functions
In general, we will call an error signal any measured signal suitable for stabilising a certain experimental parameter p with a servo loop. The aim is to maintain the variable p at a user-defined value, the operating point, p_{0}. Therefore, the error signal must be a function of the parameter p. In most cases it is preferable to have a bipolar signal with a zero crossing at the operating point. The slope of the error signal at the operating point is a measure of the ‘gain’ of the sensor (which in the case of interferometers is a combination of optics and electronics).
Transfer functions describe the propagation of a periodic signal through a plant and are usually given as plots of amplitude and phase over frequency. By definition a transfer function describes only the linear coupling of signals inside a system. This means a transfer function is independent of the actual signal size. For small signals or small deviations, most systems can be linearised and correctly described by transfer functions.
Experimentally, network analysers are commonly used to measure a transfer function: one connects a periodic signal (the source) to an actuator of the plant (which is to be analysed) and to an input of the analyser. A signal from a sensor that monitors a certain parameter of the plant is connected to the second analyser input. By mixing the source with the sensor signal the analyser can determine the amplitude and phase of the input signal with respect to the source (amplitude equals one and the phase equals zero when both signals are identical).
Mathematically, transfer functions can be modeled similarly: applying a sinusoidal signal sin(ω_{s}t) to the interferometer, e.g., as a position modulation of a cavity mirror, will create phase modulation sidebands with a frequency offset of ±ω_{s} to the carrier light. If such light is detected in the right way by a photodiode, it will include a signal at the frequency component ω_{s}, which can be extracted, for example, by means of demodulation (see Section 4.2).
Transfer functions are of particular interest in relation to error signals. Typically a transfer function of the error signal is required for the design of the respective electronic servo. A ‘transfer function of the error signal’ usually refers to a very specific setup: the system is held at its operating point, such that, on average, \(\bar p = {p_0}\). A signal is applied to the system in the form of a very small sinusoidal disturbance of p. The transfer function is then constructed by computing for each signal frequency the ratio of the error signal and the injected signal. Figure 32 shows an example of an error signal and its corresponding transfer function. The operating point shall be at
The optical transfer function \({T_{{\rm{opt,}}{{\rm{x}}_{\rm{d}}}}}\) with respect to this error signal is defined by
with T_{det} as the transfer function of the sensor. In the following, T_{det} is assumed to be unity. At the zero crossing the slope of the error signal represents the magnitude of the transfer function for low frequencies:
The quantity above will be called the error-signal slope in the following text. It is proportional to the optical gain \({T_{{\rm{opt,}}{{\rm{x}}_{\rm{d}}}}}\), which describes the amplification of the gravitational-wave signal by the optical instrument.
Fabry-Pérot length sensing
In Figure 25 we have plotted the circulating power in a Fabry-Pérot cavity as a function of the laser frequency. The steep features in this plot indicate that such a cavity can be used to measure changes in the laser frequency. From the equation for the circulating power (see Equation (88)),
we can see that the actual frequency dependence is given by the cos(2kL) term. Writing this term as
we can highlight the fact that the cavity is in fact a reference for the laser frequency in relation to the cavity length. If we know the cavity length very well, a cavity should be a good instrument to measure the frequency of a laser beam. However, if we know the laser frequency very accurately, we can use an optical cavity to measure a length. In the following we will detail the optical setup and behaviour of a cavity used for a length measurement. The same reasoning applies for frequency measurements. If we make use of the resonant power enhancement of the cavity to measure the cavity length, we can derive the sensitivity of the cavity from the differentiation of Equation (88), which gives the slope of the trace shown in Figure 25,
with d as defined in Equation (103). This is plotted in Figure 33 together with the cavity power as a function of the cavity tuning. From Figure 33 we can deduce a few key features of the cavity:

The cavity must be held as near as possible to the resonance for maximum sensitivity. This is the reason that active servo control systems play an important role in modern laser interferometers.

If we want to use the power directly as an error signal for the length, we cannot use the cavity directly on resonance because there the optical gain is zero. A suitable error signal (i.e., a bipolar signal) can be constructed by adding an offset to the light power signal. A control system utilising this method is often called DC-lock or offset-lock. However, we show below that more elegant alternative methods for generating error signals exist.

The differentiation of the cavity power looks like a perfect error signal for holding the cavity on resonance. A signal proportional to such differentiation can be achieved with a modulation-demodulation technique.
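The relation between the cavity power and its slope can be illustrated numerically. The following is a minimal sketch, assuming the standard lossless two-mirror cavity expression P = P_0 T_1/d with d = 1 + R_1R_2 − 2r_1r_2 cos(2kL), which is taken here to correspond to Equation (88). The function names and the mirror reflectivities are illustrative choices and do not come from the text.

```python
import numpy as np

def cavity_power(tuning, R1=0.9, R2=0.9, P0=1.0):
    """Circulating power for a cavity tuning phi = k*L (in radians).

    Assumes lossless mirrors (T1 = 1 - R1) and the standard two-mirror
    cavity formula; the default parameter values are illustrative only.
    """
    r1, r2 = np.sqrt(R1), np.sqrt(R2)
    d = 1.0 + R1 * R2 - 2.0 * r1 * r2 * np.cos(2.0 * tuning)
    return P0 * (1.0 - R1) / d

def cavity_slope(tuning, R1=0.9, R2=0.9, P0=1.0):
    """Analytic derivative dP/dphi of cavity_power (the optical gain)."""
    r1, r2 = np.sqrt(R1), np.sqrt(R2)
    d = 1.0 + R1 * R2 - 2.0 * r1 * r2 * np.cos(2.0 * tuning)
    return -4.0 * P0 * (1.0 - R1) * r1 * r2 * np.sin(2.0 * tuning) / d**2

if __name__ == "__main__":
    for phi in np.linspace(-0.2, 0.2, 9):
        print(f"phi = {phi:+.3f} rad  power = {cavity_power(phi):7.3f}  "
              f"slope = {cavity_slope(phi):+9.3f}")
```

The slope vanishes exactly on resonance and changes sign there, which is why the power itself cannot serve as an error signal at that point while its derivative can.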
The Pound-Drever-Hall length sensing scheme
This scheme for stabilising the frequency of a light field to the length of a cavity, or vice versa, is based on much older techniques for performing very similar actions with microwaves and microwave resonators. Drever and Hall have adapted such techniques for use in the optical regime [14], and today what is called the Pound-Drever-Hall technique can be found in a great number of different types of optical setups. An example layout of this scheme is shown in Figure 34, in this case for generating a length (or frequency) signal of a two-mirror cavity. The laser is passed through an electro-optical modulator, which applies a periodic phase modulation at a fixed frequency. In many cases the modulation frequency is chosen such that it resides in the radio frequency band for which low-cost, low-noise electronic components are available. The phase-modulated light is then injected into the cavity. However, from the frequency domain analysis introduced in Section 5, we know that in most cases not all the light can be injected into the cavity. Let us consider the example of an overcoupled cavity with the reflectivity of the end mirror R_{2} < 1. Such a cavity would have a frequency response as shown in the top traces of Figure 26 (recall that the origin of the frequency axis refers to an arbitrarily chosen default frequency, which for this figure has been selected to be a resonance frequency of the cavity). If the cavity is held on resonance for the unmodulated carrier field, this field enters the cavity, gets resonantly enhanced and a substantial fraction is transmitted. If the frequency offset of the modulation sidebands is chosen such that it neither coincides with nor lies near an integer multiple of the cavity’s free spectral range, the modulation sidebands are mostly reflected by the cavity and will not be influenced as much by the resonance condition of the cavity as the carrier.
The photodiode measuring the reflected light will see the optical beat between the carrier field and the modulation sidebands. This includes a component at the modulation frequency which is a measure of the phase difference between the carrier field and the sidebands (given the setup as described above). Any slight change of the cavity length would introduce a proportional change in the phase of the carrier field and no change in the sideband fields. Thus the photodiode signal can be used to measure the length changes of the cavity. One of the advantages of this method is the fact that the so-generated signal is bipolar with a zero crossing and steep slope exactly at the cavity’s resonance, see Figure 35.
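The bipolar shape of this signal can be reproduced with a short numerical sketch. It assumes a lossless two-mirror cavity with real amplitude reflectivities, a small modulation index (carrier plus first-order sidebands only) and the common textbook form of the demodulated signal, Im[r(f) r*(f + f_m) − r*(f) r(f − f_m)]. This expression, the sign conventions, the function names and all parameter values are assumptions made for illustration and are not taken from the text.

```python
import numpy as np

def cavity_reflection(f_offset, fsr, r1, r2):
    """Amplitude reflectivity of a lossless two-mirror cavity.

    f_offset is the light frequency relative to a cavity resonance, so the
    round-trip phase is 2*pi*f_offset/fsr. Sign conventions are an assumption.
    """
    round_trip = np.exp(-2j * np.pi * f_offset / fsr)
    return r1 - (1.0 - r1**2) * r2 * round_trip / (1.0 - r1 * r2 * round_trip)

def pdh_error_signal(f_offset, f_mod=0.3e6, fsr=1.0e6, r1=0.99, r2=0.999):
    """Demodulated reflection signal in arbitrary units (one demodulation phase).

    r1 < r2 makes the cavity overcoupled; f_mod is chosen away from any
    integer multiple of the free spectral range.
    """
    carrier = cavity_reflection(f_offset, fsr, r1, r2)
    upper = cavity_reflection(f_offset + f_mod, fsr, r1, r2)
    lower = cavity_reflection(f_offset - f_mod, fsr, r1, r2)
    return np.imag(carrier * np.conj(upper) - np.conj(carrier) * lower)

if __name__ == "__main__":
    for df in (-2000.0, -500.0, 0.0, 500.0, 2000.0):
        print(f"offset = {df:+8.1f} Hz  signal = {pdh_error_signal(df):+.4f}")
```

The signal is zero on resonance and antisymmetric around it, so its sign tells a control system in which direction the cavity length (or laser frequency) has drifted.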
Michelson length sensing
Similarly to the two-mirror cavity, we can start to understand the length-sensing capabilities of the Michelson interferometer by looking at the output light power as a function of a mirror movement, as shown in Figure 29. The power changes as sine squared with the maximum slope at the point when the output power (in what we call the South port) is half the input power. The slope of the output power, which is the optical gain of the instrument for detecting a differential arm-length change ΔL with a photo detector in the South port, can be written as
and is shown in Figure 36. The most notable difference of the optical gain of the Michelson interferometer with respect to the Fabry-Pérot interferometer (see Figure 33) is the wider, smoother distribution of the gain. This is due to the fact that the cavity example is based on a high-finesse cavity in which the optical resonance effect is dominant. In a basic Michelson interferometer such resonance enhancement is not present.
However, the main difference is that the measurement is made differentially by comparing two lengths. This allows one to separate a larger number of possible noise contributions, for example noise in the laser light source, such as amplitude or frequency noise. This is why the main instrument for gravitational-wave measurements is a Michelson interferometer. However, the resonant enhancement of light power can be added to the Michelson, for example, by using Fabry-Pérot cavities within the Michelson. This construction of new topologies by combining Michelson and Fabry-Pérot interferometers will be described in detail in a future version of this review.
The Michelson interferometer has two longitudinal degrees of freedom. These can be represented by the positions (along the optical axes) of the end mirrors. However, it is more efficient to use proper linear combinations of these and describe the Michelson interferometer length or position information by the common and differential arm length, as introduced in Equation (97):
\[\bar{L} = \frac{L_N + L_E}{2}, \qquad \Delta L = L_N - L_E,\]
where L_{N} and L_{E} are the lengths of the two arms. The Michelson interferometer is intrinsically insensitive to the common arm length \({\bar L}\).
The Schnupp modulation scheme
Similar to the Fabry-Pérot cavity, the Michelson interferometer is also often operated at a point where the optical gain of a direct light power detection is zero. This operating point, given by ΔL/λ = (2N + 1) · 0.25 with N a non-negative integer, is called the dark fringe. This operating point has several advantages, the most important being the low (ideally zero) light power on the diode. Highly efficient and low-noise photodiodes usually use a small detector area and thus are typically not able to detect large power levels. By using the dark fringe operating point, the Michelson interferometer can be used as a null instrument, which generally is a good method to reduce systematic errors [49].
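The dark-fringe condition and the insensitivity to the common arm length can be checked with a few lines of code. This is a sketch under stated assumptions: the South-port power is taken as P_0 cos²(2πΔL/λ) with ΔL the difference of the two arm lengths, which is consistent with the dark-fringe condition quoted above. The wavelength of 1064 nm, the arm labels and the function names are illustrative and not taken from the text.

```python
import numpy as np

WAVELENGTH = 1064e-9  # metres; an assumed, typical laser wavelength

def south_port_power(L_north, L_east, P0=1.0, wavelength=WAVELENGTH):
    """Output power of a simple Michelson; depends only on L_north - L_east."""
    delta_L = L_north - L_east
    return P0 * np.cos(2.0 * np.pi * delta_L / wavelength) ** 2

def optical_gain(delta_L, P0=1.0, wavelength=WAVELENGTH):
    """Derivative of the South-port power with respect to delta_L."""
    return -(2.0 * np.pi * P0 / wavelength) * np.sin(4.0 * np.pi * delta_L / wavelength)

if __name__ == "__main__":
    for fraction in (0.0, 0.125, 0.25):
        dL = fraction * WAVELENGTH
        print(f"dL/lambda = {fraction:5.3f}  power = {south_port_power(dL, 0.0):.3f}  "
              f"gain = {optical_gain(dL):+.3e} W/m")
```

At ΔL/λ = 0.25 both the power and the gain vanish, while the gain reaches its maximum magnitude 2πP_0/λ at half power (ΔL/λ = 0.125).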
One approach to make use of the advantages of the dark fringe operating point is to use an operating point very close to the dark fringe at which the optical gain is not yet zero. In such a scenario a careful trade-off calculation can be done by computing the signal-to-noise ratio with noises that must be suppressed, such as the laser amplitude noise. This type of operation is usually referred to as DC control or offset control and is very similar to the similarly-named mechanism used with Fabry-Pérot cavities.
Another option is to employ phase-modulated light, similar to the Pound-Drever-Hall scheme described in Section 6.3. The optical layout of such a scheme is depicted in Figure 37: an electro-optical modulator is used to apply a phase modulation at a fixed (usually RF type) frequency to the (monochromatic) laser light before it enters the interferometer. The photodiode signal from the interferometer output is then demodulated at the same frequency. This scheme allows one to operate the interferometer precisely on the dark fringe. The method originally proposed by Lise Schnupp is also sometimes referred to as frontal modulation.
The optical gain of a Michelson interferometer with Schnupp modulation is shown in Figure 39 in Section 6.6.
Finesse examples
Cavity power and slope
Figure 33 shows a plot of the analytical functions describing the power inside a cavity and its differentiation by the cavity tuning. This example recreates the plot using a numerical model in Finesse.
Finesse input file for ‘Cavity power and slope’
Michelson with Schnupp modulation
Figure 39 shows the demodulated photodiode signal of a Michelson interferometer with Schnupp modulation, as well as its differentiation, the latter being the optical gain of the system. Comparing this figure to Figure 36, it can be seen that with Schnupp modulation, the optical gain at the dark fringe operating points is maximised and a suitable error signal for these points is obtained.
Finesse input file for ‘Michelson with Schnupp modulation’
Beam Shapes: Beyond the Plane Wave Approximation
In previous sections we have introduced a notation for describing the on-axis properties of electric fields. Specifically, we have described the electric fields along an optical axis as functions of frequency (or time) and the location z. Models of optical systems may often use this approach for a basic analysis even though the respective experiments will always include fields with distinct off-axis beam shapes. A more detailed description of such optical systems needs to take the geometrical shape of the light field into account. One method of treating the transverse beam geometry is to describe the spatial properties as a sum of ‘spatial components’ or ‘spatial modes’ so that the electric field can be written as a sum of the different frequency components and of the different spatial modes. Of course, the concept of modes is directly related to the use of a sort of oscillator, in this case the optical cavity. Most of the work presented here is based on the research on laser resonators reviewed originally by Kogelnik and Li [35]. Siegman has written a very interesting historic review of the development of Gaussian optics [52, 51] and we use whenever possible the same notation as used in his textbook ‘Lasers’ [50].
This section introduces the use of Gaussian modes for describing the spatial properties along the transverse orthogonal x and y directions of an optical beam. We can write
\[E(t, x, y, z) = \sum_j \sum_{n,m} a_{jnm}\, u_{nm}(x, y, z)\, \exp\left(i(\omega_j t - k_j z)\right), \tag{107}\]
with u_{nm} as special functions describing the spatial properties of the beam and a_{jnm} as complex amplitude factors (ω_{j} is again the angular frequency and k_{j} = ω_{j}/c). For simplicity we restrict the following description to a single frequency component at one moment in time (t = 0), so
\[E(x, y, z) = \exp(-ikz) \sum_{n,m} a_{nm}\, u_{nm}(x, y, z). \tag{108}\]
In general, different types of spatial modes u_{nm} can be used in this context. Of particular interest are the Gaussian modes, which will be used throughout this document. Many lasers emit light that closely resembles a Gaussian beam: the light mainly propagates along one axis, is well collimated around that axis and the cross section of the intensity perpendicular to the optical axis shows a Gaussian distribution. The following sections provide the basic mathematical framework for using Gaussian modes for analysing optical systems.
The paraxial wave equation
Mathematically, Gaussian modes are solutions to the paraxial wave equation, a specific wave equation for electromagnetic fields. All electromagnetic waves are solutions to the general wave equation, which in vacuum can be given as:
\[\Delta \vec{E} - \frac{1}{c^2}\, \ddot{\vec{E}} = 0. \tag{109}\]
But laser light fields are special types of electromagnetic waves. For example, they are characterised by low diffraction. Hence, a laser beam will have a characteristic length w describing the ‘width’ (the dimension of the field transverse to the main propagation axis), and a characteristic length l defining some local length along the propagation over which the beam characteristics do not vary much. By definition, for what we call a beam w is typically small and l large in comparison, so that w/l can be considered small. In fact, the paraxial wave equation (and its solutions) can be derived as the first-order terms of a series expansion of Equation (109) into orders of w/l [37].
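A simpler approach to the paraxial wave equation goes as follows: A particular beam shape shall be described by a function u(x, y, z) so that we can write the electric field as
\[E(x, y, z) = u(x, y, z)\, \exp(-ikz). \tag{110}\]
Substituting this into the standard wave equation yields a differential equation for u:
\[\left(\partial_x^2 + \partial_y^2 + \partial_z^2\right) u(x, y, z) - 2ik\, \partial_z u(x, y, z) = 0. \tag{111}\]
Now we put the fact that u(x, y, z) should be slowly varying with z in mathematical terms. The variation of u(x, y, z) with z should be small compared to its variation with x or y. Also the second partial derivative in z should be small. This can be expressed as
\[\left|\partial_z^2 u\right| \ll \left|2k\, \partial_z u\right|,\ \left|\partial_x^2 u\right|,\ \left|\partial_y^2 u\right|. \tag{112}\]
With this approximation, Equation (111) can be simplified to the paraxial wave equation,
\[\left(\partial_x^2 + \partial_y^2\right) u(x, y, z) - 2ik\, \partial_z u(x, y, z) = 0. \tag{113}\]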
Any field u that solves this equation represents a paraxial beam shape when used in the form given in Equation (110).
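Whether a given beam shape solves the paraxial wave equation can be tested numerically with finite differences. The sketch below assumes the paraxial equation in the form (∂²/∂x² + ∂²/∂y²)u − 2ik ∂u/∂z = 0 and uses an unnormalised fundamental Gaussian beam, u = exp(−ik(x² + y²)/(2q))/q with q = z + i z_R, as the trial function. The length unit, waist size, step sizes and function names are arbitrary illustrative choices.

```python
import numpy as np

WAVELENGTH = 1.0                      # arbitrary length unit (assumption)
K = 2.0 * np.pi / WAVELENGTH          # wave number
W0 = 10.0                             # waist size in the same unit (assumption)
Z_R = np.pi * W0**2 / WAVELENGTH      # Rayleigh range of the trial beam

def u_gauss(x, y, z):
    """Unnormalised fundamental Gaussian beam with its waist at z = 0."""
    q = z + 1j * Z_R
    return np.exp(-1j * K * (x**2 + y**2) / (2.0 * q)) / q

def paraxial_residual(u, x, y, z, hx=0.1, hz=3.0):
    """Central-difference estimate of (d2/dx2 + d2/dy2 - 2ik d/dz) u.

    Returns the residual and the size of the 2ik du/dz term as a scale.
    """
    uxx = (u(x + hx, y, z) - 2.0 * u(x, y, z) + u(x - hx, y, z)) / hx**2
    uyy = (u(x, y + hx, z) - 2.0 * u(x, y, z) + u(x, y - hx, z)) / hx**2
    uz = (u(x, y, z + hz) - u(x, y, z - hz)) / (2.0 * hz)
    return uxx + uyy - 2j * K * uz, abs(2.0 * K * uz)

if __name__ == "__main__":
    residual, scale = paraxial_residual(u_gauss, 3.0, -2.0, 0.5 * Z_R)
    print(f"relative residual = {abs(residual) / scale:.2e}")
```

The residual is small compared with the individual terms of the equation and is limited only by the finite step sizes.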
Transverse electromagnetic modes
In general, any solution u(x, y, z) of the paraxial wave equation, Equation (113), can be employed to represent the transverse properties of a scalar electric field representing a beam-like electromagnetic wave. Especially useful in this respect are special families or sets of functions that are solutions of the paraxial wave equation. When such a set of functions is complete and countable, it is called a set of transverse electromagnetic modes (TEM). For instance, the Hermite-Gauss modes are exact solutions of the paraxial wave equation. These modes are represented by an infinite, countable and complete set of functions. The term complete means they can be understood as a base system of the function space defined by all solutions of the paraxial wave equation. In other words, we can describe any solution of the paraxial wave equation u′ by a linear superposition of Hermite-Gauss modes:
\[u'(x, y, z) = \sum_{n,m} a_{nm}\, u_{nm}(x, y, z), \tag{114}\]
which in turn allows us to describe any laser beam using a sum of these modes:
The Hermite-Gauss modes as given in this document (see Section 7.5) are orthonormal so that
\[\iint dx\, dy\; u_{nm}\, u^*_{n'm'} = \delta_{nn'}\, \delta_{mm'}. \tag{116}\]
This means that, in the function space defined by the paraxial wave equation, the HermiteGauss functions can be understood as a complete set of unitlength basis vectors. This fact can be utilised for the computation of coupling factors. Furthermore, the power of a beam, as given by Equation (108), being detected on a singleelement photodetector (provided that the area of the detector is large with respect to the beam) can be computed as
or for a beam with several frequency components (compare with Equation (76)) as
Properties of Gaussian beams
The basic or ‘lowest-order’ Hermite-Gauss mode is equivalent to what is usually called a Gaussian beam and is given by
\[u(x, y, z) = \sqrt{\frac{2}{\pi}}\, \frac{1}{w(z)}\, \exp\left(i\Psi(z)\right)\, \exp\left(-ik\frac{x^2 + y^2}{2R_C(z)} - \frac{x^2 + y^2}{w^2(z)}\right). \tag{119}\]
The parameters of this equation are explained in detail below. The shape of a Gaussian beam is quite simple: the beam has a circular cross section, and the radial intensity profile of a beam with total power P is given by
\[I(r) = \frac{2P}{\pi w^2(z)}\, \exp\left(-\frac{2r^2}{w^2(z)}\right),\]
with w the spot size, defined as the radius at which the intensity is 1/e^{2} times the maximum intensity I(0). This is a Gaussian distribution, see Figure 40, hence the name Gaussian beam.
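The definition of the spot size can be made concrete with a small numerical check. The sketch assumes the standard radial intensity profile of a Gaussian beam, I(r) = 2P/(πw²) exp(−2r²/w²), and integrates it over a circular aperture with a simple midpoint rule. The function names and the number of integration points are illustrative choices.

```python
import numpy as np

def intensity(r, w, P=1.0):
    """Radial intensity of a Gaussian beam with spot size w and total power P."""
    return 2.0 * P / (np.pi * w**2) * np.exp(-2.0 * r**2 / w**2)

def power_within(radius, w, P=1.0, points=20000):
    """Power passing a circular aperture of the given radius (midpoint rule)."""
    dr = radius / points
    r = (np.arange(points) + 0.5) * dr
    return np.sum(intensity(r, w, P) * 2.0 * np.pi * r) * dr

if __name__ == "__main__":
    w = 1.0e-3  # spot size in metres (assumed value)
    for factor in (0.5, 1.0, 1.5, 2.0):
        fraction = power_within(factor * w, w)
        print(f"aperture radius = {factor:.1f} w  ->  power fraction = {fraction:.4f}")
```

An aperture with a radius equal to the spot size passes a fraction 1 − e^{−2} of the total power, so a detector or mirror must be considerably larger than w to capture essentially all of the light.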
Figure 41 shows a different cross section through a Gaussian beam: it plots the beam size as a function of the position on the optical axis.
Such a beam profile (for a beam with a given wavelength λ) can be completely determined by two parameters: the minimum spot size w_{0} (called the beam waist) and the position z_{0} of the beam waist along the z-axis.
To characterise a Gaussian beam, some useful parameters can be derived from w_{0} and z_{0}. A Gaussian beam can be divided into two different sections along the z-axis: a near field (a region around the beam waist) and a far field (far away from the waist). The length of the near-field region is approximately given by the Rayleigh range z_{R}. The Rayleigh range and the waist size are related by
\[z_R = \frac{\pi w_0^2}{\lambda}.\]
With the Rayleigh range and the location of the beam waist, we can usefully write
\[w(z) = w_0 \sqrt{1 + \left(\frac{z - z_0}{z_R}\right)^2}.\]
This equation gives the size of the beam along the z-axis. In the far-field regime (z ≫ z_{R}, z_{0}), it can be approximated by a linear equation,
\[w(z) \approx w_0 \frac{z}{z_R} = \frac{z \lambda}{\pi w_0}.\]
The angle Θ between the z-axis and w(z) in the far field is called the diffraction angle^{Footnote 6} and is defined by
\[\Theta = \arctan\left(\frac{w_0}{z_R}\right) = \arctan\left(\frac{\lambda}{\pi w_0}\right) \approx \frac{w_0}{z_R}.\]
Another useful parameter is the radius of curvature of the wavefront at a given point z. The radius of curvature describes the curvature of the ‘phase front’ of the electromagnetic wave (a surface across the beam with equal phase) intersecting the optical axis at the position z. We obtain the radius of curvature as a function of z:
\[R_C(z) = z - z_0 + \frac{z_R^2}{z - z_0}.\]
We also find:
Astigmatic beams: the tangential and sagittal plane
If the interferometer is confined to a plane (here the x–z plane), it is convenient to use projections of the three-dimensional description into two planes [46]: the tangential plane, defined as the x–z plane and the sagittal plane as given by y and z.
The beam parameters can then be split into two respective parameters: z_{0,s}, w_{0,s} for the sagittal plane and z_{0,t} and w_{0,t} for the tangential plane so that the Hermite-Gauss modes can be written as
Beams with different beam waist parameters for the sagittal and tangential plane are astigmatic.
Remember that these Hermite-Gauss modes form a base system. This means one can use the separation into sagittal and tangential planes even if the actual optical system does not show this special type of symmetry. This separation is very useful in simplifying the mathematics. In the following, the term beam parameter generally refers to a simple case where w_{0,x} = w_{0,y} and z_{0,x} = z_{0,y} but all the results can also be applied directly to a pair of parameters.
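The beam quantities introduced above are easily evaluated numerically, either for a single beam parameter or separately for the tangential and sagittal planes. The following sketch assumes the standard Gaussian-beam relations z_R = πw_0²/λ, w(z) = w_0 [1 + ((z − z_0)/z_R)²]^{1/2}, R_C(z) = z − z_0 + z_R²/(z − z_0) and Θ = arctan(λ/(πw_0)). The function names and the example waist values are hypothetical.

```python
import numpy as np

def rayleigh_range(w0, wavelength):
    """Rayleigh range z_R for a waist size w0."""
    return np.pi * w0**2 / wavelength

def beam_size(z, w0, z0, wavelength):
    """Spot size w(z) for a waist of size w0 located at z0."""
    z_r = rayleigh_range(w0, wavelength)
    return w0 * np.sqrt(1.0 + ((z - z0) / z_r) ** 2)

def radius_of_curvature(z, w0, z0, wavelength):
    """Wavefront radius of curvature; diverges (flat wavefront) at z = z0."""
    z_r = rayleigh_range(w0, wavelength)
    dz = z - z0
    return dz + z_r**2 / dz

def diffraction_angle(w0, wavelength):
    """Far-field half angle between the optical axis and w(z)."""
    return np.arctan(wavelength / (np.pi * w0))

if __name__ == "__main__":
    lam = 1064e-9
    # Astigmatic example: different (assumed) waists in the two planes.
    planes = {"tangential": (1.0e-3, 0.0), "sagittal": (1.2e-3, 0.5)}
    for name, (w0, z0) in planes.items():
        print(f"{name:10s}: z_R = {rayleigh_range(w0, lam):.3f} m, "
              f"w(10 m) = {beam_size(10.0, w0, z0, lam) * 1e3:.3f} mm")
```

Calling the same functions once per plane is all that is needed to handle an astigmatic beam, which reflects the separation into tangential and sagittal parameters described above.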
Higher-order Hermite-Gauss modes
The complete set of Hermite-Gauss modes is given by an infinite discrete set of modes u_{nm}(x, y, z) with the indices n and m as mode numbers. The sum n + m is called the order of the mode. The term higher-order modes usually refers to modes with an order n + m > 0. The general expression for Hermite-Gauss modes can be given as [35]
with
and H_{n}(x) the Hermite polynomials of order n. The first Hermite polynomials, without normalisation, can be written
\[H_0(x) = 1, \quad H_1(x) = 2x, \quad H_2(x) = 4x^2 - 2, \quad H_3(x) = 8x^3 - 12x.\]
Further orders can be computed recursively since
\[H_{n+1}(x) = 2x\, H_n(x) - 2n\, H_{n-1}(x).\]
For both transverse directions we can also rewrite the above to
The latter form has the advantage of clearly showing the extra phase shift along the z-axis of (n + m + 1)Ψ(z), called the Gouy phase; see Section 7.8.
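The recursive computation of the Hermite polynomials and the orthonormality of the resulting modes can be sketched in code. The example assumes the physicists' convention (H_0 = 1, H_1 = 2x, H_{n+1} = 2x H_n − 2n H_{n−1}) and, for simplicity, a one-dimensional Hermite-Gauss amplitude evaluated at the beam waist, where it is real valued. The normalisation factor and all function names are assumptions for this illustration.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval

def hermite(n, x):
    """Physicists' Hermite polynomial H_n(x) from the three-term recursion."""
    x = np.asarray(x, dtype=float)
    h_prev, h = np.zeros_like(x), np.ones_like(x)  # H_{-1} := 0, H_0 = 1
    for k in range(n):
        h_prev, h = h, 2.0 * x * h - 2.0 * k * h_prev
    return h

def hg_mode_1d(n, x, w0):
    """One-dimensional Hermite-Gauss amplitude at the waist (real valued)."""
    norm = (2.0 / np.pi) ** 0.25 / math.sqrt(2.0**n * math.factorial(n) * w0)
    return norm * hermite(n, np.sqrt(2.0) * x / w0) * np.exp(-(x / w0) ** 2)

def overlap(n, m, w0=1.0, half_width=8.0, points=4001):
    """Numerical overlap integral of modes n and m on a uniform grid."""
    x = np.linspace(-half_width * w0, half_width * w0, points)
    dx = x[1] - x[0]
    return np.sum(hg_mode_1d(n, x, w0) * hg_mode_1d(m, x, w0)) * dx

if __name__ == "__main__":
    for n in range(4):
        print([round(float(overlap(n, m)), 6) for m in range(4)])
```

The printed overlap matrix approximates the identity matrix, which is the numerical counterpart of the orthonormality relation for the modes.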
The Gaussian beam parameter
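For a more compact description of the interaction of Gaussian modes with optical components we will make use of the Gaussian beam parameter q [34]. The beam parameter is a complex quantity defined as
\[\frac{1}{q(z)} = \frac{1}{R_C(z)} - i\, \frac{\lambda}{\pi w^2(z)}.\]
It can also be written as
\[q(z) = i z_R + z - z_0 = q_0 + z - z_0, \qquad q_0 = i z_R.\]
Using this parameter, Equation (119) can be rewritten as
\[u(x, y, z) = \frac{1}{q(z)}\, \exp\left(-ik\, \frac{x^2 + y^2}{2 q(z)}\right).\]
Other parameters, like the beam size and radius of curvature, can also be written in terms of the beam parameter q:
\[w^2(z) = \frac{\lambda}{\pi}\, \frac{|q|^2}{\Im\{q\}},\]
and
\[R_C(z) = \frac{|q|^2}{\Re\{q\}}.\]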
The Hermite-Gauss modes can also be written using the Gaussian beam parameter as^{Footnote 7}
Properties of higher-order Hermite-Gauss modes
Some of the properties of Hermite-Gauss modes can easily be described using cross sections of the field intensity or field amplitude. Figure 42 shows such cross sections, i.e., the intensity in the x–y plane, for a number of higher-order modes. This shows an x–y symmetry for mode indices n and m. We can also see how the size of the intensity distribution increases with the mode index, while the peak intensity decreases.
Similarly, Figure 44 shows the amplitude and phase distribution of several higher-order Hermite-Gauss modes. Some further features of Hermite-Gauss modes:

The size of the intensity profile of any sum of Hermite-Gauss modes depends on z while its shape remains constant over propagation along the optical axis.

The phase distribution of Hermite-Gauss modes shows the curvature (or radius of curvature) of the beam. The curvature depends on z but is equal for all higher-order modes.
Note that these are special features of Gaussian beams and not generally true for arbitrary beam shapes. Figure 43, for example, shows the amplitude and phase distribution of a triangular beam at the point where it is (mathematically) created and after a 10 m propagation. Neither the shape is preserved nor does it show a spherical phase distribution.
Gouy phase
The equation for Hermite-Gauss modes shows an extra longitudinal phase lag. This Gouy phase [8, 26, 25] describes the fact that, compared to a plane wave, the Hermite-Gauss modes accumulate slightly less phase along the optical axis (i.e., they have a slightly larger phase velocity), especially close to the waist. The Gouy phase can be written as
\[\Psi(z) = \arctan\left(\frac{z - z_0}{z_R}\right),\]
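or, using the Gaussian beam parameter,
\[\Psi(z) = \arctan\left(\frac{\Re\{q\}}{\Im\{q\}}\right).\]
Compared to a plane wave, the phase lag φ of a Hermite-Gauss mode is
\[\varphi = (n + m + 1)\, \Psi(z).\]
With an astigmatic beam, i.e., different beam parameters in the tangential and sagittal planes, this becomes
\[\varphi = \left(n + \tfrac{1}{2}\right) \Psi_t(z) + \left(m + \tfrac{1}{2}\right) \Psi_s(z),\]
with
\[\Psi_t(z) = \arctan\left(\frac{\Re\{q_t\}}{\Im\{q_t\}}\right)\]
as the Gouy phase in the tangential plane (and is similarly defined in the sagittal plane).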
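The Gaussian beam parameter and the Gouy phase lend themselves to a compact numerical treatment, because a single complex number carries both the beam size and the wavefront curvature. The sketch below assumes q = i z_R + z − z_0, w² = (λ/π)|q|²/Im(q), R_C = |q|²/Re(q) and Ψ = arctan(Re(q)/Im(q)), with the mode phase lag written as (n + 1/2)Ψ_t + (m + 1/2)Ψ_s. All function names are hypothetical.

```python
import numpy as np

def q_parameter(z, w0, z0, wavelength):
    """Gaussian beam parameter q = i*z_R + z - z0."""
    return 1j * np.pi * w0**2 / wavelength + (z - z0)

def beam_size_from_q(q, wavelength):
    """Spot size w recovered from the beam parameter."""
    return np.sqrt(wavelength / np.pi * abs(q) ** 2 / q.imag)

def curvature_from_q(q):
    """Radius of curvature |q|^2 / Re(q); undefined at the waist (Re(q) = 0)."""
    return abs(q) ** 2 / q.real

def gouy_phase(q):
    """Psi = arctan(Re(q)/Im(q)), equal to arctan((z - z0)/z_R)."""
    return np.arctan2(q.real, q.imag)

def phase_lag(n, m, q_t, q_s=None):
    """(n + 1/2) Psi_t + (m + 1/2) Psi_s; equals (n + m + 1) Psi if q_s is omitted."""
    if q_s is None:
        q_s = q_t
    return (n + 0.5) * gouy_phase(q_t) + (m + 0.5) * gouy_phase(q_s)

if __name__ == "__main__":
    lam, w0 = 1064e-9, 1.0e-3
    z_r = np.pi * w0**2 / lam
    for z in (0.0, z_r, 10.0 * z_r):
        q = q_parameter(z, w0, 0.0, lam)
        print(f"z = {z:7.2f} m  w = {beam_size_from_q(q, lam) * 1e3:.3f} mm  "
              f"Gouy phase = {np.degrees(gouy_phase(q)):.1f} deg")
```

One Rayleigh range away from the waist the Gouy phase of the fundamental mode is 45°, and it approaches 90° in the far field.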
Laguerre-Gauss modes
Laguerre-Gauss modes are another complete set of functions that solve the paraxial wave equation. They are defined in cylindrical coordinates and can have advantages over Hermite-Gauss modes in the presence of cylindrical symmetry. More recently, Laguerre-Gauss modes have been investigated in a different context: using a pure higher-order Laguerre-Gauss mode instead of the fundamental Gaussian beam can significantly reduce the impact of mirror thermal noise on the sensitivity of gravitational-wave detectors [54, 12]. Laguerre-Gauss modes are commonly given as [50]
with r, ϕ and z as the cylindrical coordinates around the optical axis. The letter p is the radial mode index, l the azimuthal mode index^{Footnote 8} and \(L_p^{(l)}(x)\) are the associated Laguerre polynomials:
All other parameters (w(z), q(z),…) are defined as above for the Hermite-Gauss modes.
The dependence of the Laguerre modes on ϕ as given in Equation (146) results in a spiralling phase front, while the intensity pattern will always show unbroken concentric rings; see Figure 45. These modes are also called helical Laguerre-Gauss modes because of their special phase structure.
The reader might be more familiar with a slightly different type of Laguerre modes (compare Figure 46 and Figure 47) that features dark radial lines as well as dark concentric rings. Mathematically, these can be described simply by replacing the phase factor exp(i lϕ) in Equation (146) by a sine or cosine function. For example, an alternative set of Laguerre-Gauss modes is given by [55]
This type of mode has a spherical phase front, just as the Hermite-Gauss modes. We will refer to this set as sinusoidal Laguerre-Gauss modes throughout this document.
For the purposes of simulation it can sometimes be useful to decompose Laguerre-Gauss modes into Hermite-Gauss modes. The mathematical conversion for helical modes is given as [7, 1]
with real coefficients
if N = n + m. This relates to the common definition of Laguerre modes as u_{pl} as follows: p = min(n, m) and l = n − m. The coefficients h(n, m, k) can be computed numerically by using Jacobi polynomials. Jacobi polynomials can be written in various forms:
or which leads to
Tracing a Gaussian beam through an optical system
Whenever Gauss modes are used to analyse an optical system, the Gaussian beam parameters (or equivalent waist sizes and locations) must be defined for each location at which field amplitudes are to be computed (or at which coupling equations are to be defined). In our experience the quality of a computation or simulation and the correctness of the results depend critically on the choice of these beam parameters. One might argue that the choice of a basis should not alter the result. This is correct, but there is a practical limitation: the number of modes having nonnegligible power might become very large if the beam parameters are not optimised, so that in practice a good set of beam parameters is usually required.
In general, the Gaussian beam parameter of a mode is changed at every optical surface in a well-defined way (see Section 7.11). Thus, a possible method of finding reasonable beam parameters for every location in the interferometer is to first set only some specific beam parameters at selected locations and then to derive the remaining beam parameters from these initial ones: usually it is sensible to assume that the beam at the laser source can be properly described by the (hopefully known) beam parameter of the laser’s output mode. In addition, in most stable cavities the light fields should be described by using the respective cavity eigenmodes. Then, the remaining beam parameters can be computed by tracing the beam through the optical system. ‘Trace’ in this context means that a beam starting at a location with an already-known beam parameter is propagated mathematically through the optical system. At every optical element along the path the beam parameter is transformed according to the ABCD matrix of the element (see below).
ABCD matrices
The transformation of the beam parameter can be performed by the ABCD matrix formalism [34, 50]. When a beam passes an optical element or freely propagates through space, the initial beam parameter q_{1} is transformed into q_{2}. This transformation can be described by four real coefficients as follows:
\[\frac{q_2}{n_2} = \frac{A\, \frac{q_1}{n_1} + B}{C\, \frac{q_1}{n_1} + D},\]
with the coefficient matrix
\[M = \begin{pmatrix} A & B \\ C & D \end{pmatrix},\]
n_{1} being the index of refraction at the beam segment defined by q_{1}, and n_{2} the index of refraction at the beam segment described by q_{2}. ABCD matrices for some common optical components are given below, for the sagittal and tangential plane.
Transmission through a mirror:
A mirror in this context is a single, partly-reflecting surface at normal incidence. The transmission is described by
\[M = \begin{pmatrix} 1 & 0 \\ \frac{n_2 - n_1}{R_C} & 1 \end{pmatrix},\]
with R_{C} being the radius of curvature of the spherical surface. The sign of the radius is defined such that R_{C} is negative if the centre of the sphere is located in the direction of propagation. The curvature shown above (in Figure 48), for example, is described by a positive radius. The matrix for the transmission in the opposite direction of propagation is identical.
Reflection at a mirror:
The matrix for reflection is given by
\[M = \begin{pmatrix} 1 & 0 \\ -\frac{2 n_1}{R_C} & 1 \end{pmatrix}.\]
The reflection at the back surface can be described by the same type of matrix by setting C = 2n_{2}/R_{C}.
Transmission through a beam splitter:
A beam splitter is understood as a single surface with an arbitrary angle of incidence α_{1}. The matrices for transmission and reflection are different for the sagittal and tangential planes (M_{s} and M_{t}): with α_{2} given by Snell’s law:
and Δn by
If the direction of propagation is reversed, the matrix for the sagittal plane is identical and the matrix for the tangential plane can be obtained by changing the coefficients A and D as follows:
Reflection at a beam splitter:
The reflection at the front surface of a beam splitter is given by: To describe a reflection at the back surface the matrices have to be changed as follows:
Transmission through a thin lens:
A thin lens transforms the beam parameter as follows:
\[M = \begin{pmatrix} 1 & 0 \\ -\frac{1}{f} & 1 \end{pmatrix},\]
where f is the focal length. The matrix for the opposite direction of propagation is identical. Here it is assumed that the thin lens is surrounded by ‘spaces’ with index of refraction n = 1.
Transmission through a free space:
As mentioned above, the beam in free space can be described by one base parameter q_{0}. In some cases it is convenient to use a matrix similar to that used for the other components to describe the z-dependency of q(z) = q_{0} + z. On propagation through a free space of the length L and index of refraction n, the beam parameter is transformed as follows:
\[M = \begin{pmatrix} 1 & \frac{L}{n} \\ 0 & 1 \end{pmatrix}.\]
The matrix for the opposite direction of propagation is identical.
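Tracing a beam parameter through a sequence of components amounts to multiplying the respective matrices and applying the transformation once. The sketch below assumes the transformation rule q_2/n_2 = (A q_1/n_1 + B)/(C q_1/n_1 + D) together with the standard thin-lens and free-space matrices. The function names and the numerical values (a 1 mm waist at the lens, f = 0.5 m) are illustrative assumptions.

```python
import numpy as np

def free_space(length, n=1.0):
    """ABCD matrix for propagation over `length` in a medium of index n."""
    return np.array([[1.0, length / n], [0.0, 1.0]])

def thin_lens(f):
    """ABCD matrix of a thin lens with focal length f (surrounded by n = 1)."""
    return np.array([[1.0, 0.0], [-1.0 / f, 1.0]])

def transform_q(abcd, q1, n1=1.0, n2=1.0):
    """Apply q2/n2 = (A q1/n1 + B) / (C q1/n1 + D)."""
    (a, b), (c, d) = abcd
    return n2 * (a * q1 / n1 + b) / (c * q1 / n1 + d)

if __name__ == "__main__":
    lam, w0, f = 1064e-9, 1.0e-3, 0.5
    z_r = np.pi * w0**2 / lam
    q_in = 1j * z_r                        # waist located at the lens
    q_lens = transform_q(thin_lens(f), q_in)
    distance = -q_lens.real                # Re(q) = z - z0, so the waist lies ahead
    # The beam meets the lens first, so the lens matrix is the right-hand factor.
    system = free_space(distance) @ thin_lens(f)
    q_out = transform_q(system, q_in)
    w_new = np.sqrt(lam * q_out.imag / np.pi)
    print(f"new waist {w_new * 1e6:.1f} um at {distance * 100:.2f} cm behind the lens")
```

Because the matrices are multiplied in the reverse order of the optical path, a whole chain of components collapses into a single 2×2 matrix before the beam parameter is transformed.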
Interferometer Matrix with Hermite-Gauss Modes
In the plane-wave analysis (Section 1.4), a laser beam is described in general by the sum of various frequency components of its electric field
\[E(t, z) = \sum_j a_j\, \exp\left(i(\omega_j t - k_j z)\right).\]
Here we include the geometric shape of the beam by describing each frequency component as a sum of Hermite-Gauss modes:
\[E(t, x, y, z) = \sum_j \sum_{n,m} a_{jnm}\, u_{nm}(x, y, z)\, \exp\left(i(\omega_j t - k_j z)\right).\]
The shape of such a beam does not change along the z-axis (in the paraxial approximation). More precisely, the spot size and the position of the maximum intensity with respect to the z-axis may change, but the relative intensity distribution across the beam does not change its shape. Each part of the sum may be treated as an independent field that can be described using the equation for plane waves with only two exceptions:

the propagation through free space has to include the Gouy phase shift, and

upon reflection or transmission at a mirror or beam splitter the different HermiteGauss modes may be coupled (see below).
The Gouy phase shift can be included in the simulation in several ways. For example, for reasons of flexibility the Gouy phase has been included in Finesse as a phase shift of the component space.
Coupling of Hermite-Gauss modes
Let us consider two different cavities with different sets of eigenmodes. The first set is characterised by the beam parameter q_{1} and the second by the parameter q_{2}. A beam with all power in the fundamental mode u_{00}(q_{1}) leaves the first cavity and is injected into the second. Here, two ‘misconfigurations’ are possible:

if the optical axes of the beam and the second cavity do not overlap perfectly, the setup is called misaligned,

if the beam size or shape at the second cavity does not match the beam shape and size of the (resonant) fundamental eigenmode (q_{1}(z_{cav}) ≠ q_{2}(z_{cav})), the beam is then not mode-matched to the second cavity, i.e., there is a mode mismatch.
The above misconfigurations can be used in the context of simple beam segments. We consider the case in which the beam parameter for the input light is specified. Ideally, the ABCD matrices then allow one to trace a beam through the optical system by computing the proper beam parameter for each beam segment. In this case, the basis system of Hermite-Gauss modes is transformed in the same way as the beam, so that the modes are not coupled.
For example, an input beam described by the beam parameter q_{1} is passed through several optical components, and at each component the beam parameter is transformed according to the respective ABCD matrix. Thus, the electric field in each beam segment is described by Hermite-Gauss modes based on different beam parameters, but the relative power between the Hermite-Gauss modes with different mode numbers remains constant, i.e., a beam in a u_{00} mode is described as a pure u_{00} mode throughout the entire system.
In practice, it is usually impossible to compute a proper beam parameter for each beam segment as suggested above, especially when the beam passes a certain segment more than once. A simple case that illustrates this point is reflection at a spherical mirror. Let the input beam be described by q_{1}. From Figure 49 we know that the proper beam parameter of the reflected beam is
\[q_2 = \frac{q_1}{-2 q_1/R_C + 1},\]
with R_{C} being the radius of curvature of the mirror. In general, we get q_{1} ≠ q_{2} and thus two different ‘proper’ beam parameters for the same beam segment. Only one special radius of curvature would result in matched beam parameters (q_{1} = q_{2}).
Coupling coefficients for Hermite-Gauss modes
Hermite-Gauss modes are coupled whenever a beam is not matched to a cavity or to a beam segment or if the beam and the segment are misaligned. This coupling is sometimes referred to as ‘scattering into higher-order modes’ because in most cases the laser beam is considered to be a pure TEM_{00} mode and any mode coupling would transfer power from the fundamental into higher-order modes. However, in general, every mode with non-zero power will transfer energy into other modes whenever mismatch or misalignment occur, and this effect also includes the transfer from higher orders into a low order.
To compute the amount of coupling the beam must be projected into the base system of the cavity or beam segment it is being injected into. This is always possible, provided that the paraxial approximation holds, because each set of Hermite-Gauss modes, defined by the beam parameter at a position z, forms a complete set. Such a change of the basis system results in a different distribution of light power in the new Hermite-Gauss modes and can be expressed by coupling coefficients that yield the change in the light amplitude and phase with respect to mode number.
Let us assume that a beam described by the beam parameter q_{1} is injected into a segment described by the parameter q_{2}. Let the optical axis of the beam be misaligned: the coordinate system of the beam is given by (x, y, z) and the beam travels along the z-axis. The beam segment is parallel to the z′-axis and the coordinate system (x′, y′, z′) is given by rotating the (x, y, z) system around the y-axis by the misalignment angle γ. The coupling coefficients are defined as
where u_{nm}(q_{1}) are the HermiteGauss modes used to describe the injected beam and \({u_{{n^{\prime}}\,{m^{\prime}}}}({q_2})\) are the ‘new’ modes that are used to describe the light in the beam segment. Note that including the plane wave phase propagation within the definition of coupling coefficients is very important because it results in coupling coefficients that are independent of the position on the optical axis for which the coupling coefficients are computed.
Using the fact that the Hermite-Gauss modes u_{nm} are orthonormal, we can compute the coupling coefficients by the convolution [6]
Since the Hermite-Gauss modes can be separated with respect to x and y, the coupling coefficients can also be split into \({k_{nm{n^{\prime}}{m^{\prime}}}} = {k_{n{n^{\prime}}}}{k_{m{m^{\prime}}}}\). These equations are very useful in the paraxial approximation as the coupling coefficients decrease with large mode numbers. In order to be described as paraxial, the angle γ must not be larger than the diffraction angle. In addition, to obtain correct results with a finite number of modes the beam parameters q_{1} and q_{2} must not differ too much.
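For the special case of pure mode mismatch (γ = 0) in one transverse direction, the projection can be illustrated by brute-force numerical integration. The following Python sketch is an illustration only and not the method used in Finesse: the function names, the wavelength of 1064 nm, the 1 mm waist and the 10% mismatch in Rayleigh range are assumptions, and the coefficient is taken here as the overlap integral of u_{n}(q_{1}) with the complex conjugate of u_{n′}(q_{2}), which may differ from the convention of [6] by a complex conjugation.

```python
import numpy as np
from math import factorial, pi
from numpy.polynomial.hermite import hermval

WAVELENGTH = 1064e-9  # assumed laser wavelength in metres


def hg_mode_1d(n, x, q, wavelength=WAVELENGTH):
    """Normalised 1D Hermite-Gauss mode u_n(x) for beam parameter q = z + i*z_R."""
    z, z_r = q.real, q.imag
    k = 2 * pi / wavelength
    w = np.sqrt(wavelength * (z**2 + z_r**2) / (pi * z_r))  # beam radius w(z)
    gouy = np.arctan2(z, z_r)                                # Gouy phase
    norm = (2 / pi) ** 0.25 / np.sqrt(2.0**n * factorial(n) * w)
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0  # selects the physicists' Hermite polynomial H_n
    hermite = hermval(np.sqrt(2) * x / w, coeffs)
    return norm * hermite * np.exp(1j * (n + 0.5) * gouy - 1j * k * x**2 / (2 * q))


def coupling_1d(n, n_prime, q1, q2, half_width=8e-3, points=4001):
    """Overlap integral of u_n(q1) with conj(u_n'(q2)) on a uniform grid (gamma = 0)."""
    x = np.linspace(-half_width, half_width, points)
    dx = x[1] - x[0]
    integrand = hg_mode_1d(n, x, q1) * np.conj(hg_mode_1d(n_prime, x, q2))
    return np.sum(integrand) * dx  # the integrand vanishes at the grid edges


z_r = pi * (1e-3) ** 2 / WAVELENGTH  # Rayleigh range of an assumed 1 mm waist
q1 = 1j * z_r                        # injected beam, evaluated at its waist
q2 = 1j * 1.1 * z_r                  # segment eigenmode with 10% larger z_R
row = [coupling_1d(0, n_p, q1, q2) for n_p in range(9)]
print("power staying in u_0:", abs(row[0]) ** 2)
print("power in modes 0..8: ", sum(abs(c) ** 2 for c in row))
```

With matched beam parameters the coefficients reduce to the identity matrix. With the small mismatch chosen here almost all power stays in the fundamental mode and the remainder appears in the even modes, so that a modest number of modes already captures essentially all of the power.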
The convolution given in Equation (164) can be computed directly using numerical integration. However, this is computationally very expensive. The following is based on the work of Bayer-Helms [6]. Another very good description of coupling coefficients and their derivation can be found in the work of Vinet [55]. In [6] the above projection integral is partly solved and the coupling coefficients are given by simple sums as functions of γ and the mode mismatch parameter K, which is defined by
\[K = {1 \over 2}\left({{K_0} + i{K_2}} \right),\]
where K_{0} = (z_{R} − z′_{R})/z′_{R} and K_{2} = ((z − z_{0}) − (z′ − z′_{0}))/z′_{R}. This can also be written using q = i z_{R} + z − z_{0}, as
\[K = {{i{{(q - {q^{\prime}})}^{\ast}}} \over {2\,\Im \{{q^{\prime}}\}}}.\]
The coupling coefficients for misalignment and mismatch (but no lateral displacement) can then be written as
where
The corresponding formula for \({k_{m{m^{\prime}}}}\) can be obtained by replacing the following parameters: n → m, n′ → m′, X, \(\bar X \rightarrow 0\) and E^{(x)} → 1 (see below). The notation [n/2] means
The other abbreviations used in the above definition are
In general, the Gaussian beam parameter might be different for the sagittal and tangential planes, and a misalignment can be given for both possible axes (around the y-axis and around the x-axis). In this case the coupling coefficients are given by
where \({k_{n{n^{\prime}}}}\) is given above with
and γ → γ_{y} is a rotation about the y-axis. The \({k_{m{m^{\prime}}}}\) can be obtained with the same formula, with the following substitutions:
and γ → γ_{x} is a rotation about the x-axis.
At each component a matrix of coupling coefficients has to be computed for transmission and reflection; see Figure 54.
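As a small numerical illustration of the mode-mismatch parameter that enters these coupling matrices, the following Python sketch evaluates K from two beam parameters. It assumes the combination K = (K_{0} + i K_{2})/2 of the quantities K_{0} and K_{2} defined above, following our reading of [6]; the function names and the numerical values are hypothetical.

```python
def mismatch_parameter(q, q_prime):
    """Return (K0, K2, K) for beam parameters of the form q = (z - z0) + i*z_R.

    Assumes K = (K0 + i*K2) / 2, with K0 and K2 as defined in the text.
    """
    z_r, z_r_prime = q.imag, q_prime.imag
    k0 = (z_r - z_r_prime) / z_r_prime
    k2 = (q.real - q_prime.real) / z_r_prime
    return k0, k2, 0.5 * (k0 + 1j * k2)


def mismatch_parameter_from_q(q, q_prime):
    """Equivalent compact form: K = i * conj(q - q') / (2 * Im(q'))."""
    return 1j * (q - q_prime).conjugate() / (2 * q_prime.imag)


q_beam = 0.2 + 3.0j     # illustrative: 0.2 m past the waist, z_R = 3.0 m
q_segment = 0.5 + 2.5j  # illustrative: 0.5 m past the waist, z_R = 2.5 m
k0, k2, k_mismatch = mismatch_parameter(q_beam, q_segment)
print(k0, k2, k_mismatch, mismatch_parameter_from_q(q_beam, q_segment))
```

Both forms give the same complex number, and K vanishes when the two beam parameters are identical, that is, when the beam is perfectly matched to the segment.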
Finesse examples
Beam parameter
This example illustrates a possible use of the beam parameter detector ‘bp’: the beam radius of the laser beam is plotted as a function of distance to the laser. For this simulation, the interferometer matrix does not need to be solved. ‘bp’ merely returns the results from the beam tracing algorithm of Finesse.
Finesse input file for ‘Beam parameter’
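As a cross-check that does not depend on Finesse, the beam radius reported by ‘bp’ can be compared with the standard Gaussian-beam expression w(z) = w_{0} (1 + ((z − z_{0})/z_{R})^{2})^{1/2} with z_{R} = π w_{0}^{2}/λ. The following sketch is written in Python rather than in Finesse syntax; the waist size, the waist position, the wavelength and the range of distances are illustrative assumptions and are not taken from the example file.

```python
import numpy as np


def beam_radius(z, w0, z0=0.0, wavelength=1064e-9):
    """Gaussian beam radius w(z) for waist size w0 located at position z0 (metres)."""
    z_r = np.pi * w0**2 / wavelength  # Rayleigh range
    return w0 * np.sqrt(1.0 + ((z - z0) / z_r) ** 2)


# Assumed values: a 100 micrometre waist at the laser, distances from 0.1 m to 8 m.
distances = np.linspace(0.1, 8.0, 200)
radii = beam_radius(distances, w0=100e-6)
for z, w in zip(distances[::50], radii[::50]):
    print(f"z = {z:5.2f} m   w = {w * 1e3:6.3f} mm")
```

Far from the waist the radius grows linearly with distance, with a slope given by the divergence angle λ/(π w_{0}).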
Mode cleaner
This example uses the ‘tem’ command to create a laser beam which is a sum of equal parts in u_{00} and u_{10} modes. This beam is passed through a triangular cavity, which acts as a mode cleaner. Being resonant for the u_{00} mode, the cavity transmits this mode and reflects the u_{10} mode, as can be seen in the resulting plots.
Finesse input file for ‘Mode cleaner’
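The filtering effect can also be estimated without Finesse from the transmission of a resonant cavity. The following Python sketch treats the mode cleaner as a lossless cavity with an input and an output coupler of equal power transmission; the mirror transmission and the additional round-trip phase picked up by the u_{10} mode are illustrative assumptions (in a real triangular cavity this phase follows from the cavity geometry), and the function name is hypothetical.

```python
import cmath
import math


def cavity_power_transmission(t1_power, t2_power, phase_offset):
    """Power transmission of a lossless cavity for a round-trip phase offset
    (in radians) away from resonance."""
    r1 = math.sqrt(1.0 - t1_power)  # amplitude reflectivity of the input coupler
    r2 = math.sqrt(1.0 - t2_power)  # amplitude reflectivity of the output coupler
    denominator = abs(1.0 - r1 * r2 * cmath.exp(-1j * phase_offset)) ** 2
    return t1_power * t2_power / denominator


T_MIRROR = 0.01             # assumed power transmission of both couplers
ROUND_TRIP_PHASE_U10 = 1.0  # assumed extra round-trip phase of u_10 in radians

input_power = {"u00": 0.5, "u10": 0.5}  # equal parts, as in the example above
phases = {"u00": 0.0, "u10": ROUND_TRIP_PHASE_U10}
transmitted = {
    mode: input_power[mode]
    * cavity_power_transmission(T_MIRROR, T_MIRROR, phases[mode])
    for mode in input_power
}
reflected = {mode: input_power[mode] - transmitted[mode] for mode in input_power}
print("transmitted:", transmitted)
print("reflected:  ", reflected)
```

Under these assumptions the resonant u_{00} component is transmitted essentially completely, while the u_{10} component is suppressed in transmission by roughly four orders of magnitude and is almost entirely reflected.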
LG33 mode
Finesse uses the Hermite-Gauss modes as a base system for describing the spatial properties of laser beams. However, Laguerre-Gauss modes can be created using the coefficients given in Equation (149). This example demonstrates this and the use of a ‘beam’ detector to plot amplitude and phase of a beam cross section.
Finesse input file for ‘LG33 mode’
Notes
 1.
In many implementations of numerical matrix solvers the input vector is also called the right-hand side vector.
 2.
Note that in other publications the tuning or equivalent microscopic displacements are sometimes defined via an optical path-length difference. In that case, a tuning of 2π is used to refer to the change of the optical path length of one wavelength, which, for example, if the reflection at a mirror is described, corresponds to a change of the mirror’s position of λ_{0}/2.
 3.
The signal sidebands are sometimes also called audio sidebands because of their frequency range.
 4.
The term effective refers to that amount of incident light, which is converted into photoelectrons that are then usefully extracted from the junction (i.e., do not recombine within the device). This fraction is usually referred to as quantum efficiency η of the photodiode.
 5.
Please note that in the presence of losses the coupling is defined with respect to the transmission and losses. In particular, the impedance-matched case is defined as T_{1} = T_{2} + Loss (to first order in the small transmissions and losses), so that the input power transmission exactly matches the light power lost in one round-trip.
 6.
Also known as the far-field angle or the divergence of the beam.
 7.
Please note that this formula from [50] is very compact. Since the parameter q is a complex number, the expression contains at least two complex square roots. The complex square root requires a different algebra than the standard square root for real numbers. Especially the third and fourth factors cannot be simplified in any obvious way: \({\left({{{{q_0}} \over {q(z)}}} \right)^{1/2}}{\left({{{{q_0}{q^{\ast}}(z)} \over {q_0^{\ast}q(z)}}} \right)^{n/2}} \neq {\left({{{q_0^{n + 1}{q^{\ast n}}(z)} \over {{q^{n + 1}}(z)q_0^{\ast n}}}} \right)^{1/2}}\)!
 8.
[50] states that the indices must obey the following relations: 0 ≤ l ≤ p. However, that is not the case.
References
 [1]
Abramochkin, E., and Volostnikov, V., “Beam transformations and nontransformed beams”, Opt. Commun., 83, 123–135, (1991). [ADS]. (Cited on page 64.)
 [2]
Abramowitz, M., and Stegun, I.A., eds., Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables, (Dover, New York, 1965), corr. edition. [Google Books]. (Cited on page 23.)
 [3]
Acernese, F. (Virgo Collaboration), “The Virgo automatic alignment system”, Class. Quantum Grav., 23(8), S91–S101, (2006). [DOI]. (Cited on page 6.)
 [4]
Acernese, F. (Virgo Collaboration), Advanced Virgo Baseline Design, VIR027A09, (Virgo, Cascina, 2009). Related online version (cited on 22 September 2009): http://www.virgo.infn.it/advirgo/docs.html. (Cited on page 5.)
 [5]
Advanced LIGO Reference Design, LIGO M06005608M, (LIGO, Pasadena, CA, 2007). Related online version (cited on 20 September 2009): http://www.ligo.caltech.edu/docs/M/M06005608/M06005608.pdf. (Cited on page 5.)
 [6]
Bayer-Helms, F., “Coupling coefficients of an incident wave and the modes of spherical optical resonator in the case of mismatching and misalignment”, Appl. Optics, 23, 1369–1380, (1984). [DOI]. (Cited on pages 70 and 71.)
 [7]
Beijersbergen, M.W., Allen, L., van der Veen, H.E.L.O., and Woerdman, J.P., “Astigmatic laser mode converters and transfer of orbital angular momentum”, Opt. Commun., 96, 123–132, (1993). [DOI]. (Cited on page 64.)
 [8]
Boyd, R.W., “Intuitive explanation of the phase anomaly of focused light beams”, J. Opt. Soc. Am., 70(7), 877–880, (1980). (Cited on page 62.)
 [9]
Braginsky, V.B., Gorodetsky, M.L., Khalili, F.Y., and Thorne, K.S., “Energetic quantum limit in largescale interferometers”, in Meshkov, S., ed., Gravitational Waves: Third Edoardo Amaldi Conference, Pasadena, California, 12–16 July, 1999, AIP Conference Proceedings, vol. 523, pp. 180–190, (American Institute of Physics, Melville, NY, 2000). [DOI], [ADS], [arXiv:grqc/9907057]. (Cited on page 7.)
 [10]
Caves, C.M., “Quantummechanical noise in an interferometer”, Phys. Rev. D, 23, 1693–1708, (1981). [DOI]. (Cited on page 7.)
 [11]
Chelkowski, S., Squeezed Light and Laser Interferometric Gravitational Wave Detectors, Ph.D. Thesis, (Universitat Hannover, Hannover, 2007). Related online version (cited on 28 September 2009): http://edok01.tib.unihannover.de/edoks/e01dh07/537859527.pdf. (Cited on page 27.)
 [12]
Chelkowski, S., Hild, S., and Freise, A., “Prospects of higherorder LaguerreGauss modes in future gravitational wave detectors”, Phys. Rev. D, 79, 122002, 1–11, (2009). [DOI], [arXiv:0901.4931 [grqc]]. (Cited on page 62.)
 [13]
Davis, T.A., Direct Methods for Sparse Linear Systems, Fundamentals of Algorithms, vol. 2, (SIAM, Philadelphia, 2006). Related online version (cited on 28 September 2009): http://www.cise.ufl.edu/research/sparse/CSparse/. (Cited on page 12.)
 [14]
Drever, R.W.P., Hall, J.L., Kowalski, F.V., Hough, J., Ford, G.M., Munley, A.J., and Ward, H., “Laser phase and frequency stabilization using an optical resonator”, Appl. Phys. B, 31, 97–105, (1983). [DOI]. (Cited on page 47.)
 [15]
Drever, R.W.P., Hough, J., Munley, A.J., Lee, S.A., Spero, R.E., Whitcomb, S.E., Ward, H., Ford, G.M., Hereld, M., Robertson, N.A., Kerr, I., Pugh, J.R., Newton, G.P., Meers, B.J., Brooks III, E.D., and Gürsel, Y., “Gravitational wave detectors using laser interferometers and optical cavities: Ideas, principles and prospects”, in Meystre, P., and Scully, M.O., eds., Quantum Optics, Experimental Gravity, and Measurement Theory, Proceedings of the NATO Advanced Study Institute, held August 16–29, 1981 in Bad Windsheim, Germany, NATO ASI Series B, vol. 94, pp. 503–514, (Plenum Press, New York, 1983). (Cited on page 6.)
 [16]
Fabry, C., and Pérot, A., “Theorie et applications d’une nouvelle methode de spectroscopie interferentielle”, Ann. Chim. Phys., 16, 115–144, (1899). (Cited on pages 6 and 36.)
 [17]
Fattaccioli, D., Boulharts, A., Brillet, A., and Man, C.N., “Sensitivity of multipass and FabryPérot delay lines to small misalignments”, J. Optics (Paris), 17(3), 115–127, (1986). [DOI]. (Cited on page 6.)
 [18]
Forward, R.L., “Wideband laserinterferometer gravitationalradiation experiment”, Phys. Rev. D, 17, 379–390, (1978). [DOI]. (Cited on pages 6 and 36.)
 [19]
Freise, A., “FINESSE: An Interferometer Simulation”, personal homepage, Andreas Freise. URL (cited on 16 January 2010): http://www.gwoptics.org/finesse. (Cited on pages 5 and 76.)
 [20]
Freise, A., The Next Generation of Interferometry: MultiFrequency Optical Modelling, Control Concepts and Implementation, Ph.D. Thesis, (Universität Hannover, Hannover, 2003). Related online version (cited on 28 September 2009): http://edok01.tib.unihannover.de/edoks/e01dh03/361006918.pdf. (Cited on page 33.)
 [21]
Freise, A., Bunkowski, A., and Schnabel, R., “Phase and alignment noise in grating interferometers”, New J. Phys., 9, 433, (2007). [DOI], [arXiv:0711.0291]. URL (cited on 17 January 2010): http://stacks.iop.org/13672630/9/433. (Cited on page 6.)
 [22]
Freise, A., Heinzel, G., Lück, H., Schilling, R., Willke, B., and Danzmann, K., “Frequency-domain interferometer simulation with higher-order spatial modes”, Class. Quantum Grav., 21(5), S1067–S1074, (2004). [DOI], [arXiv:gr-qc/0309012]. (Cited on pages 5 and 76.)
 [23]
Fritschel, P., “Second generation instruments for the Laser Interferometer Gravitational Wave Observatory (LIGO)”, in Cruise, M., and Saulson, P., eds., GravitationalWave Detection, Waikoloa, HI, USA, 23 August 2002, Proc. SPIE, vol. 4856, pp. 282–291, (SPIE, Bellingham, WA, 2003). [DOI], [grqc/0308090]. (Cited on page 5.)
 [24]
Giovannetti, V., Lloyd, S., and Maccone, L., “QuantumEnhanced Measurements: Beating the Standard Quantum Limit”, Science, 306, 1330–1336, (2004). [DOI], [ADS], [arXiv:quantph/0412078]. (Cited on page 7.)
 [25]
Gouy, L.G., “Sur la propagation anomale des ondes”, C. R. Acad. Sci., 111, 33, (1890). (Cited on page 62.)
 [26]
Gouy, L.G., “Sur une propriete nouvelle des ondes lumineuses”, C. R. Acad. Sci., 110, 1251, (1890). (Cited on page 62.)
 [27]
Gradshteyn, I.S., and Ryzhik, I.M., Tables of Integrals, Series, and Products, (Academic Press, San Diego; London, 1994), 5th edition. (Cited on page 23.)
 [28]
Hecht, E., Optics, (AddisonWesley, Reading, MA, 2002), 4th edition. (Cited on page 13.)
 [29]
Heinzel, G., Advanced optical techniques for laserinterferometric gravitationalwave detectors, Ph.D. Thesis, (Universitat Hannover, Hannover, 1999). Related online version (cited on 28 September 2009): http://edok01.tib.unihannover.de/edoks/e002/265099560.pdf. (Cited on pages 17 and 24.)
 [30]
Heinzel, G., Strain, K. A., Mizuno, J., Skeldon, K. D., Willke, B., Winkler, W., Schilling, R., Rüdiger, A., and Danzmann, K., “Experimental Demonstration of a Suspended Dual Recycling Interferometer for Gravitational Wave Detection”, Phys. Rev. Lett., 81, 5493–5496, (1998). [DOI]. (Cited on page 6.)
 [31]
Herriott, D., Kogelnik, H., and Kompfner, R., “OffAxis Paths in Spherical Mirror Interferometers”, Appl. Optics, 3, 523–526, (1964). [DOI], [ADS]. (Cited on page 6.)
 [32]
Jaekel, M.T., and Reynaud, S., “Quantum Limits in Interferometric Measurements”, Europhys. Lett., 13, 301–306, (1990). [DOI], [quantph/0101104]. (Cited on page 7.)
 [33]
Kenyon, I.R., The Light Fantastic: A Modern Introduction to Classical and Quantum Optics, (Oxford University Press, Oxford; New York, 2008). [Google Books]. (Cited on page 13.)
 [34]
Kogelnik, H., “On the Propagation of Gaussian Beams of Light Through Lenslike Media Including those with a Loss or Gain Variation”, Appl. Optics, 4(12), 1562–1569, (1965). [ADS]. (Cited on pages 58 and 66.)
 [35]
Kogelnik, H., and Li, T., “Laser Beams and Resonators”, Proc. IEEE, 54, 1312–1329, (1966). (Cited on pages 53 and 57.)
 [36]
Kuroda, K. (LCGT Collaboration), “The status of LCGT”, Class. Quantum Grav., 23(8), S215–S221, (2006). [DOI]. (Cited on page 5.)
 [37]
Lax, M., Louisell, W.H., and McKnight, W.B., “From Maxwell to paraxial wave optics”, Phys. Rev. A, 11, 1365–1370, (1975). [DOI]. (Cited on page 53.)
 [38]
Loudon, R., and Knight, P.L., “Squeezed Light”, J. Mod. Opt., 34, 709–759, (1987). [DOI], [ADS]. (Cited on page 7.)
 [39]
Malec, M., Commissioning of advanced, dualrecycled gravitationalwave detectors: simulations of complex optical systems guided by the phasor picture, Ph.D. Thesis, (Universität Hannover, Hannover, 2006). Related online version (cited on 28 September 2009): http://edok01.tib.unihannover.de/edoks/e01dh06/510301622.pdf. (Cited on page 27.)
 [40]
Matuschek, N., Kartner, F.X., and Keller, U., “Exact coupledmode theories for multilayer interference coatings with arbitrary strong index modulations”, IEEE J. Quantum Electron., 33, 295–302, (1997). [DOI]. (Cited on page 13.)
 [41]
Meers, B.J., “Recycling in laserinterferometric gravitationalwave detectors”, Phys. Rev. D, 38, 2317–2326, (1988). [DOI]. (Cited on page 6.)
 [42]
Michelson, A.A., and Morley, E.W., “On the Relative Motion of the Earth and the Luminiferous Ether”, Am. J. Sci., 34, 333–345, (1887). Related online version (cited on 20 September 2009): http://www.aip.org/history/gap/PDF/michelson.pdf. (Cited on page 36.)
 [43]
Mizuno, J., and Yamaguchi, I., “Method for analyzing multiplemirror coupled optical systems”, J. Opt. Soc. Am. A, 16, 1730–1739, (1999). [DOI]. (Cited on page 13.)
 [44]
Morrison, E., Meers, B.J., Robertson, D.I., and Ward, H., “Automatic alignment of optical interferometers”, Appl. Optics, 33, 5041–5049, (1994). [DOI]. (Cited on page 6.)
 [45]
Newport Catalogue, (Newport Corporation, Irvine, CA, 2008). Related online version (cited on 20 September 2009): http://www.newport.com/. (Cited on page 14.)
 [46]
Rigrod, W.W., “The optical ring resonator”, Bell Syst. Tech. J., 44, 907–916, (1965). (Cited on page 57.)
 [47]
Rowan, S., and Hough, J., “Gravitational Wave Detection by Interferometry (Ground and Space)”, Living Rev. Relativity, 3, lrr20003, (2000). URL (cited on 20 September 2009): http://www.livingreviews.org/lrr20003. (Cited on page 5.)
 [48]
Rüdiger, A., “Phase relationship at a symmetric beamsplitter”, unknown status, (1998). (Cited on page 16.)
 [49]
Saulson, P.R., Fundamentals of Interferometric Gravitational Wave Detectors, (World Scientific, Singapore; River Edge, NJ, 1994). (Cited on page 50.)
 [50]
Siegman, A.E., Lasers, (University Science Books, Sausalito, CA, 1986). [Google Books]. See also errata list at http://www.stanford.edu/%7Esiegman/AES%20LASERS%20Book/. (Cited on pages 53, 58, 62, and 66.)
 [51]
Siegman, A.E., “Laser Beams and Resonators: Beyond the 1960s”, IEEE J. Select. Topics Quantum Electron., 6, 1389–1399, (2000). [DOI]. Related online version (cited on 18 January 2010): http://www.stanford.edu/~siegman/beams_and_resonators/beams_and_resonators_2.pdf. (Cited on page 53.)
 [52]
Siegman, A.E., “Laser Beams and Resonators: The 1960s”, IEEE J. Select. Topics Quantum Electron., 6, 1380–1388, (2000). [DOI]. Related online version (cited on 18 January 2010): http://www.stanford.edu/~siegman/beams_and_resonators/beams_and_resonators_1.pdf. (Cited on page 53.)
 [53]
Vahlbruch, H., Chelkowski, S., Danzmann, K., and Schnabel, R., “Quantum engineering of squeezed states for quantum communication and metrology”, New J. Phys., 9(10), 371, (2007). [DOI], [arXiv:0707.2845]. URL (cited on 17 January 2010): http://stacks.iop.org/13672630/9/371. (Cited on page 7.)
 [54]
Vinet, J.Y., “On Special Optical Modes and Thermal Issues in Advanced Gravitational Wave Interferometric Detectors”, Living Rev. Relativity, 12, lrr20095, (2009). URL (cited on 20 September 2009): http://www.livingreviews.org/lrr20095. (Cited on page 62.)
 [55]
Vinet, J.Y. (Virgo Collaboration), The Virgo Physics Book, Vol. II: Optics and Related Topics, (Virgo, Cascina, 2001). URL (cited on 20 September 2009): http://www.virgo.infn.it/vpb/. (Cited on pages 64 and 71.)
 [56]
“Virgo”, project homepage, Virgo Collaboration. URL (cited on 19 September 2009): http://www.virgo.infn.it/. (Cited on page 17.)
 [57]
Winkler, W., Danzmann, K., Grote, H., Hewitson, M., Hild, S., Hough, J., Lück, H., Malec, M., Freise, A., Mossavi, K., Rowan, S., Rüdiger, A., Schilling, R., Smith, J.R., Strain, K.A., Ward, H., and Willke, B., “The GEO 600 core optics”, Opt. Commun., 280, 492–499, (2007). [DOI]. (Cited on page 6.)
 [58]
Yariv, A., Quantum Electronics, (J. Wiley & Sons, New York, 1989), 3rd edition. (Cited on page 31.)
Acknowledgements
We would like to thank our colleagues in the GEO 600 project for many useful discussions over the years. AF acknowledges support from the University of Birmingham. KS acknowledges support from the University of Glasgow and the Albert Einstein Institute, Hannover. Some of the illustrations have been prepared using the component library by Alexander Franzen.
The Interferometer Simulation Finesse
Throughout this document we have provided a number of text files that can be used as input files for the interferometer simulation Finesse [19, 22]. Finesse is a numerical simulation written in the C language; it is available free of charge for Linux, Windows and Macintosh computers and can be obtained online: http://www.gwoptics.org/finesse/.
Finesse provides a fast and versatile tool that has proven to be very useful during the design and commissioning of interferometric gravitational-wave detectors. However, the program has been designed to allow the analysis of arbitrary, user-defined optical setups. In addition, it is easy to install and use. Therefore Finesse is well suited to studying basic optical properties, such as the power enhancement in a resonating cavity and modulation-demodulation methods.
We encourage the reader to obtain Finesse and to learn its basic usage by running the included example files (and by making use of its extensive manual). The Finesse input files provided in this article are in most cases very simple and illustrate single concepts in interferometry. We believe that even a Finesse novice should be able to use them as starting points to play and explore freely, for example by changing parameters, or by adding further optical components. This type of ‘numerical experimentation’ can provide insights similar to real experiments, supplementing the understanding through a mathematical analysis with experience and intuitions.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Freise, A., Strain, K. Interferometer Techniques for Gravitational-Wave Detection. Living Rev. Relativ. 13, 1 (2010). https://doi.org/10.12942/lrr20101
Keywords
 Beam Splitter
 Gaussian Beam
 Light Field
 Beam Parameter
 Michelson Interferometer
Latest
Interferometer techniques for gravitational-wave detection
Published: 17 February 2017
Received: 04 December 2015
Accepted: 21 July 2016
DOI: https://doi.org/10.1007/s4111401600028
Original
Interferometer Techniques for Gravitational-Wave Detection
Published: 25 February 2010
Accepted: 15 February 2010
DOI: https://doi.org/10.12942/lrr20101