1 Introduction

One of the most prevalent and successful measurement techniques in the applied sciences is the four-point probe method [1, 2]. The van der Pauw method [3, 4] is an example of such a method used to measure the resistivity (or its inverse quantity, the conductivity) of a conducting laminate sample. Van der Pauw’s name is associated to this method because he was the first to point out that, owing to the conformal invariance of the underlying boundary value problem for the electrical voltage potential, the basic idea of the four-point method works for determining the resistivity of uniform two-dimensional samples of any shape provided the contacts are placed at the edges of the sample [3, 4]. Samples must have a flat shape of unit thickness, be isotropic of uniform resistivity, and be simply connected. That is, they must not have holes.

Figure 1 shows a set-up for the original van der Pauw measurement: four electrical contacts \((\Omega _a,\Omega _b,\Omega _z,\Omega _w)\) are placed on the perimeter of a test sample. If \(\Omega _a\) and \(\Omega _b\) represent a point source and sink of current \(J_{ab}\) respectively, then the potential difference \(V_{zw}\) between two other points \(\Omega _z\) and \(\Omega _w\) can be measured while this current is flowing. The resistance \(R_{ab}^{zw}= V_{zw}/J_{ab}\) is then a measured quantity; a second resistance \(R_{aw}^{zb}\) can be measured in exactly the same way. Van der Pauw [4] showed that for any arrangement of four electrical contacts, and given these two resistance measurements \(R_{ab}^{zw}\) and \(R_{aw}^{zb}\), the specific resistivity \(\sigma \) can be found by solving the non-linear equation

$$\begin{aligned} \exp \left( -\frac{R_{ab}^{zw}}{\lambda } \right) + \exp \left( -\frac{R_{aw}^{zb}}{\lambda }\right) = 1 \end{aligned}$$
(1)

with

$$\begin{aligned} \lambda \equiv {\sigma \over \pi d} \end{aligned}$$
(2)

and where d is the uniform thickness of the two-dimensional sample. We will refer to this as the classical van der Pauw equation and it provides the basis for the van der Pauw method, the aim of which is to determine \(\lambda \) and, hence, the sample resistivity. Because this method needs only two resistance measurements, the method is widely applied for measuring the resistivity of superconductors or Hall coefficients of materials in laboratory experiments [3, 5]. An efficient numerical method to determine \(\lambda \) is discussed in [6].

While powerful, the van der Pauw technique is limited by its applicability only to simply connected samples. In reality, a given sample may well have holes, and these will affect the voltage measurements. Consequently, in recent years the van der Pauw method for samples with a single hole, or several holes, has been studied extensively [7,8,9,10,11,12,13,14,15,16]. It has been reported that the van der Pauw equation (1) is inaccurate in samples with several holes [17]. This is not surprising given that the formula (1) takes no account of the presence of those holes.

The present paper has been inspired by several conjectures in the literature that have arisen in the aforementioned attempts to generalize the van der Pauw methodology to holey samples. Given that the original van der Pauw method rests on the cross-ratio identity, these conjectures represent an interesting challenge for function theorists. As we show here, two new mathematical tools turn out to be relevant to the van der Pauw problem as it pertains to holey samples: the theory of the so-called prime function on the Schottky double of a multiply connected planar domain [18] and the Fay trisecant identities also associated with those same compact Riemann surfaces and which involve the prime function. This article focusses on the function theory aspects. A companion article [19] explores some of the practical implications of our results.

In this paper, we use these mathematical tools—both of which are new to this application area—to prove two recent conjectures made in the applied science literature [7, 12]. These conjectures concern a sample with an isolated hole, which is the natural first case to study. We now outline the nature of these conjectures.

It is a well-known consequence of an extension of the Riemann mapping theorem due to Koebe (see [20]) that any 2D sample with a single isolated hole can be transplanted conformally into an annulus \(\rho< |\zeta | <1\) in a complex \(\zeta \) plane, say, where the radius of the inner circle of the annulus depends on the shape of sample [18, 21]. By conducting both numerical and actual experiments Szymański et al. showed [7] that the van der Pauw equation (1) does not hold for a sample with a hole but they conjectured that the data instead satisfies the inequality

$$\begin{aligned} \exp \left( -\frac{R_{ab}^{zw}}{\lambda } \right) + \exp \left( -\frac{R_{aw}^{zb}}{\lambda }\right) \le 1. ~ \end{aligned}$$
(3)

The same inequality has been mentioned in a series of papers [8, 9]. It is one of the objectives of this paper to prove that (3) does indeed hold.

Fig. 1
figure 1

The original van der Pauw set-up. The first measurement is the voltage difference between z and w with a point source of current at a and a point sink of current at b. A second measurement is of the voltage difference between z and b with a source a and a sink w. The associated resistances satisfy the van der Pauw equation (1) which can be solved for the sought-after parameter \(\lambda \)

Fig. 2
figure 2

(i) Evidence for the two “envelopes”. Numerical experiments for pairs (XY) where \(X\equiv \exp (-R_{ab}^{zw}/\lambda )\) and \(Y\equiv \exp (-R_{aw}^{zb}/\lambda )\) following [8]. The point \(z=1\) is fixed, but (abw) are picked at random with the ordering \(0=\mathrm{arg}[z]< \mathrm{arg}[w]< \mathrm{arg}[a]< \mathrm{arg}[b] < 2 \pi \). When \(\rho =0\), all pairs (XY) are on the line \(X+Y=1\) which is (1). However, when \(\rho >0\), all points (XY) lie in the gray-shaded region bounded by the straight line \(X+Y=1\) and a “lower envelope” which is curved. As shown in the center and right, the size of the gray-shaded area increases with \(\rho \). (ii) The upper envelope (red line) and the lower envelope (blue line). The envelope is defined by its set of tangents. The gray region is above the tangent line for each point on the lower envelope. The figure to the left shows what we mean by the “symmetric” choice of contact points in which the sector formed by the pair (ab) subtends the same angle \(\theta \) at the origin as that formed by the pair (zw)

In [8] Szymański et al. also find that the pair of measured resistances \((R_{ab}^{zw},R_{aw}^{zb})\) satisfies another inequality which they dubbed a “lower envelope” and they proposed a method to measure \(\lambda \) based on the existence of this envelope. To explain this, Fig. 2 shows the results of a repeat of the same numerical experiment conducted in [8]. It shows the data from 40,000 pairs (XY), where

$$\begin{aligned} X\equiv \exp (-R_{ab}^{zw}/\lambda ), \qquad Y\equiv \exp (-R_{aw}^{zb}/\lambda ). \end{aligned}$$
(4)

Three different samples are used with \(\rho =0, 0.2\) and 0.5. The contact points z, w, a, and b are chosen at random but always such that they retain the ordering \(0=\mathrm{arg}[z]<\mathrm{arg}[w]<\mathrm{arg}[a]<\mathrm{arg}[b]<2\pi \). The data is found to fall in the gray-shaded regions shown in Fig. 2. With no hole, which means \(\rho =0\), the pair satisfies \(X+Y=1\) which is just the original van der Pauw equation (1) in evidence. However, if \(\rho > 0\), this is no longer true and the data (XY) “fills in” a crescent-shaped domain shown shaded in Fig. 2. Szymański et al. [8] conjecture that the data (XY) always lies in such a domain bounded by the upper envelope \(X+Y \le 1\) and some lower envelope which depends only on the conformal modulus \(\rho \). They also conjecture, again without a rigorous mathematical proof, that the lower envelope might correspond to the pair of \((X_\theta ,Y_\theta )\), where \((X_\theta ,Y_\theta )\) are measurements with the four electrical contacts having a certain symmetry shown on the lower left of Fig. 2(ii). The angle \(\theta \) is defined as \(\theta \equiv \mathrm{arg}[w/z]=\mathrm{arg}[b/a]\). In such a symmetric configuration the sector formed by the pair (ab) subtends the same angle \(\theta \) at the origin as that formed by the pair (zw).

If these conjectures hold then the form of the upper and lower envelopes can be expressed mathematically as

$$\begin{aligned} X+Y \le 1, \qquad Y-Y_\theta \ge \frac{\partial Y_\theta /\partial \theta }{\partial X_\theta /\partial \theta } (X-X_\theta ),\qquad \mathrm{for} \ \ 0<\theta <\pi . \end{aligned}$$
(5)

Two examples of the tangent line (5) on the lower envelope are shown on the right of Fig. 2(ii). These inequalities have been put forward in several papers [7,8,9,10], but no rigorous proof of them has yet been obtained.

This paper aims to understand the envelope structure mathematically and to offer proofs of the aforementioned conjectures. This is done by introducing, for the first time, two important tools into this area of investigation: (i) use of the prime function, denoted by \(\omega (\zeta ,c)\), associated with the concentric annulus \(\rho< |\zeta | <1\); (ii) use of the Fay trisecant identity satisfied by this prime function. Using these tools we can gain insights into the two envelopes associated with the resistance measurements (4).

2 The Van der Pauw Equation and the Cross Ratio Identity

While applied scientists are familiar with the van der Pauw equation (1) mathematicians are perhaps more familiar with a similar-looking cross-ratio identity given by

$$\begin{aligned} p_0(z,w;b,a) + p_0(z,b;w,a) = 1, \end{aligned}$$
(6)

where the classical cross-ratio is defined by

$$\begin{aligned} p_0(z,w;a,b) \equiv {\omega (z,a) \omega (w,b) \over \omega (z,b) \omega (w,a)}, \end{aligned}$$
(7)

and where, in a step usually not carried out, we introduce the prime function associated with the unit disc [18] as the simple monomial function

$$\begin{aligned} \omega (\zeta ,c) = \zeta -c. \end{aligned}$$
(8)

Establishing the identity (6) is a simple exercise. In complex analysis the cross-ratio [18, 22] is most commonly encountered in a geometrical context as the Möbius mapping that provides a conformal mapping, as a function of the variable z, between 3 arbitrary complex points (awb) in the complex z plane and the canonical choice of points \((0,1,\infty )\).

Concerning the prime function (8), it is so simple in this case that it is usually not even given the designation “prime function”. However, a recent monograph [18] makes the case that recognizing it as the simplest instance of a more general notion of a prime function is important for generalizing many known results for simply connected planar geometries to multiply connected cases in a natural way. The van der Pauw problem of interest here is no exception: it will be shown later that the natural way to extend the classical van der Pauw method to multiply connected geometries is to treat the problem using the prime function—that is, the multiply connected generalization of (8)—and to make use of some important identities satisfied by those functions.

There is a connection between (1) and (6). It arises by considering the complex potential \(H_0(\Omega )\) of the complex variable \(\Omega = x+ \mathrm{i} y\), where (xy) denotes Cartesian coordinates in the physical plane, whose real part is the harmonic voltage potential V(xy) in the sample:

$$\begin{aligned} H_0(\Omega ) = V(x,y) + \mathrm{i} \chi (x,y), \end{aligned}$$
(9)

where we have introduced \(\chi (x,y)\), the harmonic conjugate of V(xy) in the physical (xy) plane. V(xy) is harmonic in the sample and its normal derivative vanishes on the sample boundary; equivalently, by the Cauchy-Riemann equations, the harmonic conjugate \(\chi (x,y)\) is constant on the boundary.

By the well-known Riemann mapping theorem, and the conformal invariance of the boundary value problem for the potential V(xy) driven by a source at \(\Omega _a\) and a compensating sink at \(\Omega _b\) [22], the complex potential \(h_0(\zeta ) \equiv H_0(\Omega )\) is given by the explicit formula

$$\begin{aligned} h_0(\zeta ) = \frac{\sigma J_{ab}}{\pi d} \log \left( {\omega (\zeta ,a) \over \omega (\zeta ,b)} \right) , \end{aligned}$$
(10)

where, once again, we notice the appearance of the prime function (8) and

$$\begin{aligned} \Omega _a = f(a), \qquad \Omega _b = f(b), \end{aligned}$$
(11)

where \(\Omega = f(\zeta )\) is the conformal mapping between the unit disc \(D_0\) in the \(\zeta \) plane and the given laminate sample in the complex \(\Omega \) plane. The potential difference \(V_{zw}\) is therefore given by

$$\begin{aligned} \begin{aligned} V_{zw} \equiv \mathrm{Re}[h_0(z)] - \mathrm{Re}[h_0(w)]&= \lambda J_{ab} \log \left| {\omega (z,a) \omega (w,b) \over \omega (z,b) \omega (w,a)} \right| \\&= \lambda J_{ab} \log \left| p_0(z,w;a,b) \right| , \end{aligned} \end{aligned}$$
(12)

where z and w are the preimages of the two measurement contact points \(\Omega _z\) and \(\Omega _w\):

$$\begin{aligned} \Omega _z=f(z), \qquad \Omega _w = f(w). \end{aligned}$$
(13)

It follows that

$$\begin{aligned} R_{ab}^{zw} \equiv {V_{zw} \over J_{ab}} = \lambda \log p_0(z,w;a,b), \end{aligned}$$
(14)

where we have removed the modulus symbols because the cross-ratio is real and positive when all the points zwa,  and b are on the unit circle in the \(\zeta \) plane, and \(0\le \mathrm{arg}[z]<\mathrm{arg}[w]<\mathrm{arg}[a]<\mathrm{arg}[b]<2\pi \). On combining (14) with (6), we arrive at the van der Pauw equation (1).

3 The Prime Function for the Annulus \(\rho< |\zeta | <1\)

Now consider the van der Pauw method for a doubly connected sample. Let D denote a bounded sample with an isolated hole. Let \(\partial D_0\) be the outer boundary of the sample and \(\partial D_1\) the boundary of the hole. Similar to the original van der Pauw method, we assume that the sample thickness is d. We assume that the hole in the sample carries no net charge. The set-up is illustrated in Fig. 3.

Suppose that 4 point contacts \((\Omega _a,\Omega _b,\Omega _z,\Omega _w)\), of infinitesimal width, are placed on \(\partial D_0\). It is known, by an extension of the Riemann mapping theorem [18, 20], that any such domain is conformally equivalent to a concentric annulus \(\rho< |\zeta | < 1\) with circular boundaries \(C_0\) and \(C_1\) and \(0 \le \rho < 1\). The circle \(C_0\) is the unit circle; \(C_1\) is the circle \(|\zeta | =\rho \). In other words, there exists an analytic function

$$\begin{aligned} \Omega = f(\zeta ) \end{aligned}$$
(15)

that transplants the annulus \(\rho< |\zeta | < 1\) to the domain D with \(C_0\) being transplanted to \(\partial D_0\) and \(C_1\) to \(\partial D_1\).

Let the required complex potential, as a function of \(\Omega = x+ \mathrm{i} y\), be

$$\begin{aligned} H(\Omega ) = V(x,y) + \mathrm{i} \chi (x,y). \end{aligned}$$
(16)

We can still exploit the conformal invariance for the problem of determining the potential V(xy) in this doubly connected domain. Crowdy [18, 23] has shown that the complex potentials for any point source/sink driven harmonic field in a multiply connected domain can be written down explicitly in terms of the prime function associated with that domain. It is important to emphasize, for possible future generalization of this work, that this fact holds for domains of any finite connectivity not just the doubly connected case of interest here.

The prime function \(\omega (\zeta ,c)\) for the annulus \(\rho< |\zeta | < 1\) can be defined explicitly in this case by the formula [18]

$$\begin{aligned} \omega (\zeta ,c) = -\dfrac{c}{{\hat{P}}(1)}P\,\left( \frac{\zeta }{c}\right) , \end{aligned}$$
(17)

where

$$\begin{aligned} P(\zeta ) \equiv (1-\zeta ){\hat{P}}(\zeta ),\qquad {\hat{P}}(\zeta ) \equiv \prod _{n=1}^{\infty }(1-\rho ^{2n}\zeta )\left( 1-\frac{\rho ^{2n}}{\zeta }\right) . ~ \end{aligned}$$
(18)

It is easy to show, directly from these definitions via an infinite product, that

$$\begin{aligned} P(\zeta ^{-1}) = - \zeta ^{-1} P(\zeta ), \qquad P(\rho ^2 \zeta ) = - \zeta ^{-1} P(\zeta ). \end{aligned}$$
(19)

These identities, which are the fundamental properties of the prime function in this case (see [18, Ch. 5] which is devoted to this function and its properties), will be useful later. For notational brevity, we write \(P(\zeta )\) even though this function also depends on the parameter \(\rho \) as is clear from its definition (18). The reader should bear in mind this additional parametric dependence.

It is important to mention that while, merely for convenience and brevity of exposition, we have here defined the prime function for the concentric annulus in terms of an infinite product (18) the monograph [18] gives a more intrinsic derivation of the prime function for multiply connected planar domains using the first-type Green’s function as the starting point. Interested readers are encouraged to refer there for a much more general mathematical perspective.

Fig. 3
figure 3

The van der Pauw set-up for a sample with an isolated hole. The first measurement is the voltage difference between z and w with a source of current a and a sink of current b. The second measurement is the difference between z and b with a source a and a sink w. Similar to the original van der Pauw method, the measured resistances are denoted by the logarithm of the Schottky–Klein prime function

We will also need the functions \(K(\zeta )\) and \(L(\zeta )\) defined as

$$\begin{aligned} K(\zeta )\equiv \zeta \dfrac{\partial }{\partial \zeta }\log P(\zeta ), \qquad L(\zeta ) \equiv \zeta {\partial K(\zeta ) \over \partial \zeta }. \end{aligned}$$
(20)

The logarithmic derivative of the prime function \(K(\zeta )\) and its derivative \(L(\zeta )\) are also important functions in the general function theory on multiply connected domains [18]. For the concentric annulus it is easily shown, on use of (19), that these functions satisfy the functional relations

$$\begin{aligned} \begin{aligned} K(\zeta ^{-1}) = 1-K(\zeta ),&\qquad K(\rho ^2 \zeta ) = K(\zeta ) -1, \\ L(\zeta ^{-1}) = L(\zeta ),&\qquad L(\rho ^2 \zeta ) = L(\zeta ). \end{aligned} \end{aligned}$$
(21)

Actually, the function \(K(\zeta )\) can be related to the Weierstrass zeta function and \(L(\zeta )\) to the Weierstrass \(\wp \)-function [18] but those theoretical connections will not be needed here and this remark is made merely as a point of interest.

Following [18, 23] the complex potential \(h(\zeta ) \equiv H(f(\zeta ))\) is given by

$$\begin{aligned} h(\zeta )&= \frac{\sigma J_{ab}}{2\pi d} \log \left( \frac{{\bar{a}}b}{|ab|}\frac{\omega (\zeta ,a)\omega (\zeta ,{\bar{a}}^{-1})}{\omega (\zeta ,b)\omega (\zeta ,{\bar{b}}^{-1}) } \right) =\lambda J_{ab} \log \left( \frac{a}{b}\frac{P(\zeta /a)}{P(\zeta /b)}\right) , \end{aligned}$$
(22)

where the first equality is derived in [18, 23] and follows from the general properties of the prime function, and where we have used (17) in the second equality. The relations (11) and (13) give the relationship between (abzw) and \((\Omega _a, \Omega _b, \Omega _z, \Omega _w)\) although it is understood that the mapping \(f(\zeta )\) is now the new mapping from the concentric annulus to the holey sample. Note that, because two electrical contacts a and b are on \(C_0\), \({\bar{a}} = a^{-1}\) and \({\bar{b}}= b^{-1}\).

The voltage difference between z and w is given by

$$\begin{aligned} V_{ab}^{zw} \equiv \mathrm{Re}[h(z)] - \mathrm{Re}[h(w)]= \lambda J_{ab} \log \frac{P(z/a)P(w/b)}{P(z/b)P(w/a)}. \end{aligned}$$
(23)

Since all contacts are located on the same edge of the annulus, then \(\mathrm{Im}[h(z)] - \mathrm{Im}[h(w)]= 0\). The measured resistance \(R_{ab}^{zw}\) is defined as

$$\begin{aligned} R_{ab}^{zw}\equiv \frac{V_{zw}}{J_{ab}} = \lambda \log \frac{P(z/a)P(w/b)}{P(z/b)P(w/a)}. \end{aligned}$$
(24)

In the same way,

$$\begin{aligned} R_{aw}^{zb}\equiv \frac{V_{zb}}{J_{aw}} = \lambda \log \frac{P(z/a)P(b/w)}{P(z/w)P(b/a)}. \end{aligned}$$
(25)

It is straightforward to check that the formulas (24) and (25) above are equivalent to those given in [7]. More precisely, it can be checked that the function \(G(\phi )\) used in [7] is related to \(P(\zeta )\)—and hence to the prime function for the concentric annulus (17)—by the formula

$$\begin{aligned} P(e^{\mathrm{i}\phi }) = \left( 1-e^{\mathrm{i}\phi }\right) \prod _{n=1}^\infty \left( 1+\rho ^{4n} - 2\rho ^{2n}\cos \phi \right) = -2\mathrm{i}e^{\mathrm{i}\phi /2} {\hat{P}}(\mathrm{i})G(\phi ), \end{aligned}$$
(26)

where the function \(G(\phi )\) introduced in [7] is defined by

$$\begin{aligned} G(\phi ) \equiv \sin \frac{\phi }{2} \prod _{n=1}^{\infty } \left( 1- \frac{\cos \phi }{\cosh {\tilde{h}}n} \right) ,\qquad {\tilde{h}} \equiv 2\log \rho . \end{aligned}$$
(27)

Although (26) shows that our new expressions (24)–(25) coincide with those of [7], which is reassuring, there is much significance in having recognized that the resistances can be written in terms of this special transcendental function known as the prime function [18] of the preimage concentric annulus.

First, the notion of a prime function extends to a planar domain of any finite connectivity [18] which means that we have already provided a route to generalizing all the ideas in this paper presented here for the annulus to any higher connected domain, that is, to a sample with more than one hole. Crowdy [23] was the first to show how the complex potentials for source/sink driven harmonic fields in multiply connected domains of arbitrary connectivity can be written explicitly in terms of the associated prime functions. His treatment uses irrotational fluid mechanics as the physical context but, mathematically, the problem is identical to the electrical conduction problems of interest here.

Second, it is known [18] that prime functions, including those associated with domains of connectivity higher than one, satisfy a so-called Fay trisecant identity. This identity can be viewed as an analogue of the cross-ratio identity (6) on a higher genus Riemann surface [24] and is the topic of the next section.

4 The Fay Trisecant Identity for the Annulus \(\rho< |\zeta | <1\)

It is useful to introduce the function

$$\begin{aligned} p(z,w;a,b) \equiv {\omega (z,a) \omega (w,b) \over \omega (z,b) \omega (w,a)}. \end{aligned}$$
(28)

This formula is identical to that defining the cross-ratio (6) but this quantity is no longer a cross-ratio because the definition of the prime function has changed. On use of (17) formula (28) can be written in terms of \(P(\zeta )\) as

$$\begin{aligned} p(z,w;a,b) =\frac{P(z/a)P(w/b)}{P(z/b)P(w/a)}. \end{aligned}$$
(29)

From (29) and (24)–(25) we see that

$$\begin{aligned} \exp (-R_{ab}^{zw}/\lambda )=p(z,w;b,a), \qquad \exp (-R_{aw}^{zb}/\lambda ) = p(z,b;w,a). \end{aligned}$$
(30)

The Fay trisecant identity associated with this prime function is

$$\begin{aligned} \frac{P(kz/w)P(ka/b)}{P(kza/wb)}p(z,w;b,a) + \frac{P(kz/b)P(ka/w)}{P(kza/wb)}p(z,b;w,a) = P(k),~ \end{aligned}$$
(31)

where k is an arbitrary complex number. This statement (31) of the genus-1 Fay trisecant identity expressed purely in terms of the prime function of the concentric annulus has been taken from Exercise 8.9 of Chapter 8 of the monograph [18] which asks the reader to prove it using the properties of so-called loxodromic functions. While (31) is a particular form of the more general Fay trisecant identity [24, 25] the authors have not found it written in the form (31) anywhere else in the literature (besides [18]). This form (31) of the genus-one Fay trisecant identity will be a crucial tool in what follows.

Although it is well established [24, 25] from more general arguments it is instructive to sketch a proof this form (31) of the genus-one Fay trisecant identity.

First we consider the left hand side of (31) as a function of z with all other quantities being treated as parameters. Let

$$\begin{aligned} J(z)\equiv \frac{P(kz/w)P(ka/b)}{P(kza/wb)} p(z,w;b,a) + \frac{P(kz/b)P(ka/w)}{P(kza/wb)} p(z,b;w,a). \end{aligned}$$
(32)

The properties (19) of the function \(P(\zeta )\) can be used to show that

$$\begin{aligned} J(\rho ^2 z) = J(z). \end{aligned}$$
(33)

Hence, since it is also meromorphic as a function of z, it is a loxodromic function of z; see [18, Ch. 8]. A loxodromic function is the name of an automorphic function on the Schottky double of the concentric annulus; it is a meromorphic function of that surface satisfying the functional identity (33). If we write

$$\begin{aligned} J(z) = \frac{N(z)}{P(kza/wb)P(z/a)P(w/b)} \end{aligned}$$
(34)

so that

$$\begin{aligned} \begin{aligned} N(z)&=P(kz/w)P(ka/b)P(z/b)P(w/a) \\&\quad -(w/b)P(kz/b)P(ka/w)P(z/w)P(b/a) \end{aligned} \end{aligned}$$
(35)

then, it can be verified, again using the properties (19) of \(P(\zeta )\), that

$$\begin{aligned} N(a) = N(wb/ka) = 0. \end{aligned}$$
(36)

Since J(z) is a loxodromic function with removable poles at \(z=a\) and \(z=wb/ka\)—and, therefore, having no poles on the surface— it must be independent of z, which means it is a constant when considered as a function of z. We are employing a Liouville-type theorem on this genus-one Schottky double: any meromorphic function on it having no poles must be constant. Such results will be used extensively throughout this paper.

We can also consider the left hand side of (31) as a function for w and write

$$\begin{aligned} {\tilde{J}}(w)\equiv \frac{P(kz/w)P(ka/b)}{P(kza/wb)} p(z,w;b,a) + \frac{P(kz/b)P(ka/w)}{P(kza/wb)} p(z,b;w,a). \end{aligned}$$
(37)

This can also be shown to be loxodromic, i.e.,

$$\begin{aligned} {\tilde{J}}(\rho ^2 w) = {\tilde{J}}(w) \end{aligned}$$
(38)

and to have removable poles at \(w=b\) and \(w=kza/b\). It is therefore independent of w.

By similar arguments, considering the left hand side of (31) successively as a function of a and b it can be shown to be independent of those variables too. Putting all these facts together, we conclude that

$$\begin{aligned} \frac{P(kz/w)P(ka/b)}{P(kza/wb)} p(z,w;b,a) + \frac{P(kz/b)P(ka/w)}{P(kza/wb)} p(z,b;w,a) = C(k), \end{aligned}$$
(39)

where C(k) is a function to be determined. It can be found by matching to the limit of the left hand side of (39) in the double limit \(w\rightarrow z\) and \(b\rightarrow a\) which yields

$$\begin{aligned} C(k) = P(k). \end{aligned}$$
(40)

Thus we have established the Fay trisecant identity (31).

On substituting (24) and (25) into (31), we obtain

$$\begin{aligned} \frac{P(kz/w)P(ka/b)}{P(k)P(kza/wb)}\exp \left( -\frac{R_{ab}^{zw}}{\lambda }\right) + \frac{P(kz/b)P(ka/w)}{P(k)P(kza/wb)}\exp \left( -\frac{R_{aw}^{zb}}{\lambda }\right) = 1 ~. \end{aligned}$$
(41)

When \(\rho \rightarrow 0\), so that there is no hole in a sample, it is straightforward to check that

$$\begin{aligned} \frac{P(kz/w)P(ka/b)}{P(k)P(kza/wb)} = \frac{P(kz/b)P(ka/w)}{P(k)P(kza/wb)} =1 \end{aligned}$$
(42)

because \(P(\zeta ) = 1-\zeta \) when \(\rho =0\). The original van der Pauw equation (1) is therefore retrieved from (41). All dependence on the new parameter k disappears in this simply connected limit.

It is clear that (41) opens up new perspectives: that it reduces, as \(\rho \rightarrow 0\), to the original van der Pauw equation (1) is tantalizing. It also makes it a natural candidate, at least from the mathematical point of view, to find ways to extend the van der Pauw method to holey samples. In contrast to the original van der Pauw equation, the coefficients of \(\exp (-R_{ab}^{zw}/\lambda )\) and \(\exp (-R_{aw}^{zb}/\lambda )\) in (41) now depend not only on the electrical contact locations z, w, a, b but also on a fifth complex parameter k. It should be emphasized that (41) holds for arbitrary choices of abzw and k even though, for present purposes, we will assume that abz and w lie on \(C_0\). The freedom in the choice of k will be exploited in the next section to gain insights into the envelope structure evident in Fig. 2.

5 Analysis of the Envelopes: The Integrated Fay Identity

In Sect. 1 we discussed the existence of two envelopes, an “upper” and a “lower” envelope with the mathematical definitions given in (5). It is now shown how the new tools introduced in the previous two sections allow us to prove the conjecture enshrined in (5).

For arbitrary zwa,  and b on \(C_0\) we can introduce new angular coordinates \(\theta , \theta _1\) and \(\theta _3\):

$$\begin{aligned} z=1,\quad w=\exp (\mathrm{i}(\theta _1+\theta )),\quad a=\exp (\mathrm{i}(\theta _1+\theta _3)),\quad b=\exp (\text {i}(\theta + \theta _3)). \end{aligned}$$
(43)

Because \(0<\mathrm{arg}[w]<\mathrm{arg}[a]<\mathrm{arg}[b]<2\pi \), the range of \(\theta _1,\ \theta ,\ \theta _3\) are given by

$$\begin{aligned} -\theta< \theta _1< \theta ,\quad \theta< \theta _3< 2\pi - \theta ,\quad 0< \theta < \pi . ~ \end{aligned}$$
(44)

It is important to mention that the symmetry case shown in Fig. 2 corresponds to the choices \(\theta _1=0\) and \(\theta _3 = \pi \). An illustration of the complex coordinates is shown in Fig. 4. The parameters \(\theta _1\) and \(\theta _3\) are seen as displacements from the symmetric points with parameter \(\theta \).

Fig. 4
figure 4

Illustration of the complex coordinates. The parameters \(\theta _1\) and \(\theta _3\) are seen as displacements from the symmetric points where z is fixed to 1

The Fay trisecant identity (41) can be written

$$\begin{aligned} A(\theta _1,\theta ,{\hat{k}}) X_{\theta _1,\theta _3,\theta } + B(\theta _3,\theta ,{\hat{k}})Y_{\theta _1,\theta _3,\theta } = 1, \end{aligned}$$
(45)

where

$$\begin{aligned} \begin{aligned} X_{\theta _1,\theta _3,\theta } \equiv \exp \left( -{R^{zw}_{ab} \over \lambda }\right) = {P(e^{-\mathrm{i}(\theta +\theta _3)}) P(e^{\mathrm{i}(\theta -\theta _3)})\over P(e^{-\mathrm{i}(\theta _1+\theta _3)}) P(e^{\mathrm{i}(\theta _1-\theta _3)}) }, \\ Y_{\theta _1,\theta _3,\theta } \equiv \exp \left( -{R^{zb}_{aw} \over \lambda }\right) ={P(e^{-\mathrm{i}(\theta +\theta _1)}) P(e^{\mathrm{i}(\theta -\theta _1)})\over P(e^{-\mathrm{i}(\theta _1+\theta _3)}) P(e^{\mathrm{i}(\theta _3-\theta _1)}) }, \end{aligned} \end{aligned}$$
(46)

and the coefficient functions are

$$\begin{aligned} \begin{aligned} A(\theta _1,\theta ,{\hat{k}}) \equiv \frac{P(kz/w)P(ka/b)}{P(k)P(kza/wb)}&= \frac{P(ke^{-\mathrm{i}(\theta +\theta _1)})P(ke^{\mathrm{i}(-\theta +\theta _1)})}{P(k)P(ke^{-2\mathrm{i}\theta })} \\&=\frac{P({\hat{k}}e^{-\mathrm{i}\theta _1})P({\hat{k}}e^{\mathrm{i}\theta _1})}{P({\hat{k}}e^{-\mathrm{i}\theta })P({\hat{k}}e^{\mathrm{i}\theta })},\\ B(\theta _3,\theta ,{\hat{k}}) \equiv \frac{P(kz/b)P(ka/w)}{P(k)P(kza/wb)}&= \frac{P(ke^{-\mathrm{i}(\theta +\theta _3)})P(ke^{\mathrm{i}(-\theta +\theta _3)})}{P(k)P(ke^{-2\mathrm{i}\theta })} \\&=\frac{P({\hat{k}}e^{-\mathrm{i}\theta _3})P({\hat{k}}e^{\mathrm{i}\theta _3})}{P({\hat{k}}e^{-\mathrm{i}\theta })P({\hat{k}}e^{\mathrm{i}\theta })},~ \end{aligned} \end{aligned}$$
(47)

and where we have set \(k = {\hat{k}} e^{\mathrm{i}\theta }\) because k is arbitrary. Notice that, with this choice, the resistance measurements have the following symmetries:

$$\begin{aligned} \begin{aligned} X_{\theta _1,2\pi -\theta _3,\theta }&=X_{-\theta _1,\theta _3,\theta }=X_{-\theta _1,2\pi -\theta _3,\theta }=X_{\theta _1,\theta _3,\theta }, \\ Y_{\theta _1,2\pi -\theta _3,\theta }&=Y_{-\theta _1,\theta _3,\theta }=Y_{-\theta _1,2\pi -\theta _3,\theta }=Y_{\theta _1,\theta _3,\theta }. \end{aligned} \end{aligned}$$
(48)

It is also important to note that \(A(\theta _1,\theta ,{\hat{k}})\) is independent of \(\theta _3\) and \(B(\theta _3,\theta ,{\hat{k}})\) is independent of \(\theta _1\).

A crucial next step is to consider an integrated form of the Fay trisecant identity (41). Consider integrals of \(A(\theta _1,\theta ,{\hat{k}})\) and \(B(\theta _3,\theta ,{\hat{k}})\) with respect to \({\hat{k}}\) around the circle \(|{\hat{k}}|=\rho \). From (45), we obtain

$$\begin{aligned} \alpha (\theta _1,\theta )X_{\theta _1,\theta _3,\theta } + \beta (\theta _3,\theta ) Y_{\theta _1,\theta _3,\theta } = 1, ~ \end{aligned}$$
(49)

where

$$\begin{aligned} \begin{aligned}&\alpha (\theta _1,\theta )\equiv \frac{1}{2\pi }\int \limits _{0}^{2\pi } A(\theta _1,\theta ,\rho e^{ \mathrm{i}\phi }) d\phi = \frac{1}{2\pi }\int \limits _{0}^{2\pi }\frac{P(\rho e^{\mathrm{i}(\phi -\theta _1)})P(\rho e^{\mathrm{i}(\phi +\theta _1)})}{P(\rho e^{\mathrm{i}(\phi -\theta )})P(\rho e^{\mathrm{i}(\phi +\theta )})}d\phi ,\\&\beta (\theta _3,\theta )\equiv \frac{1}{2\pi }\int \limits _{0}^{2\pi } B(\theta _3,\theta ,\rho e^{\mathrm{i}\phi }) d\phi = \frac{1}{2\pi }\int \limits _{0}^{2\pi }\frac{P(\rho e^{\mathrm{i}(\phi -\theta _3)})P(\rho e^{\mathrm{i}(\phi +\theta _3)})}{P(\rho e^{\mathrm{i}(\phi -\theta )})P(\rho e^{\mathrm{i}(\phi +\theta )})}d\phi . \end{aligned} \end{aligned}$$
(50)

The integrated Fay’s trisecant identity (49) turns out to be essential for proving the conjectures about the envelope structure. On taking a derivative of (49) with respect to \(\theta \),

$$\begin{aligned} \begin{aligned}&\frac{\partial \alpha (\theta _1,\theta )}{\partial \theta } X_{\theta _1,\theta _3,\theta } + \frac{\partial \beta (\theta _3,\theta )}{\partial \theta }Y_{\theta _1,\theta _3,\theta } \\&\qquad \qquad + \alpha (\theta _1,\theta ) \frac{\partial X_{\theta _1,\theta _3,\theta }}{\partial \theta } + \beta (\theta _3,\theta ) \frac{\partial Y_{\theta _1,\theta _3,\theta }}{\partial \theta } =0 . \end{aligned}\nonumber \\ \end{aligned}$$
(51)

The sum of the first two terms is zero because

$$\begin{aligned} \begin{aligned}&\frac{\partial \alpha (\theta _1,\theta )}{\partial \theta } X_{\theta _1,\theta _3,\theta } + \frac{\partial \beta (\theta _3,\theta )}{\partial \theta }Y_{\theta _1,\theta _3,\theta } \\&\quad = \frac{\mathrm{i}}{2\pi }\int \limits _{0}^{2\pi }[A(\theta _1,\theta ,\rho e^{\mathrm{i}\phi })X_{\theta _1,\theta _3,\theta }+B(\theta _3,\theta ,\rho e^{\mathrm{i}\phi })Y_{\theta _1,\theta _3,\theta }] \\&\qquad \times [K(\rho e^{ \mathrm{i}(\phi -\theta )})-K(\rho e^{ \mathrm{i}(\phi +\theta )})]d\phi \\&\quad = \frac{\mathrm{i}}{2\pi }\int \limits _{0}^{2\pi } [K(\rho e^{ \mathrm{i}(\phi -\theta )}) - K(\rho e^{\mathrm{i}(\phi +\theta )})]d\phi = 0, \end{aligned}\nonumber \\ \end{aligned}$$
(52)

where we used Fay’s identity (45) in the second equality. On use of (52) in (51), it is found that

$$\begin{aligned} \alpha (\theta _1,\theta )\frac{\partial X_{\theta _1,\theta _3,\theta }}{\partial \theta } + \beta (\theta _3,\theta )\frac{\partial Y_{\theta _1,\theta _3,\theta }}{\partial \theta } =0. \end{aligned}$$
(53)

Suppose now that we fix the two parameters \(\theta _1\) and \(\theta _3\). Then \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) depends only on the single parameter \(\theta \) and the tangent at \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) when viewed as a function of \(\theta \) is defined as the set of (XY) satisfying

$$\begin{aligned} Y-Y_{\theta _1,\theta _3,\theta } = \frac{\partial Y_{\theta _1,\theta _3,\theta }/\partial \theta }{\partial X_{\theta _1,\theta _3,\theta }/\partial \theta }(X-X_{\theta _1,\theta _3,\theta }). \end{aligned}$$
(54)

If we now make use of both (53) and (49) the tangent line (54) is equivalent to

$$\begin{aligned} \alpha (\theta _1,\theta )X+\beta (\theta _3,\theta )Y=1. \end{aligned}$$
(55)

It is helpful to visualize this: Fig. 5 shows examples of these tangent lines. The red line shows the collection of points \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) where both \(\theta _1\) and \(\theta _3\) are fixed but where \(\theta \) is varying; the blue lines in Fig. 5, given by (55), are tangent to those red lines. Because of the angular conditions (44), \(\theta \) varies between \(|\theta _1|\) and \(\pi -|\pi -\theta _3|\).

Fig. 5
figure 5

The red lines are collections of data points \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) with both \(\theta _1\) and \(\theta _3\) fixed and only parameter \(\theta \) changed. The blue lines, given by (55), are tangent to the red lines at \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\). When \(\theta _1=0\) and \(\theta _3=\pi \), the red line is a tangent to a lower envelope

The important point, however, is that because the symmetry case shown in Fig. 2 corresponds to \(\theta _1=0\) and \(\theta _3 = \pi \), the two inequalities in (5) are equivalent to the two statements

$$\begin{aligned}&X+Y \le 1, \end{aligned}$$
(56)
$$\begin{aligned}&\alpha _\theta X + \beta _\theta Y \ge 1, \qquad \alpha _\theta \equiv \alpha (0,\theta ),\qquad \beta _\theta \equiv \beta (\pi ,\theta ), ~ \end{aligned}$$
(57)

for all pairs (XY) and \(0<\theta <\pi \). The first statement is that all pairs (XY) for \(\rho \ge 0\) lie on or below the line \(X+Y=1\) relevant to the classical van der Pauw case \(\rho =0\); the second statement is that all pairs (XY) for \(\rho \ge 0\) lie above a \(\theta \)-parametrized curve defined for \(\theta _1=0\) and \(\theta _3=\pi \) where the parameter \(\theta \) corresponds geometrically to points arranged with the symmetry shown in Fig. 2.

The strategy is to prove (56) and (57) for all (XY) by considering the maximum and minimum values of the coefficient functions \(\alpha (\theta _1,\theta )\) and \(\beta (\theta _3,\theta )\) subject to the condition (44). From the integrated Fay identity (49),

$$\begin{aligned} \underset{-\theta<\theta _1<\theta }{\mathrm{min}}&(\alpha (\theta _1,\theta ))X_{\theta _1,\theta _3,\theta } + \underset{\theta<\theta _3<2\pi -\theta }{\mathrm{min}}(\beta (\theta _3,\theta ))Y_{\theta _1,\theta _3,\theta } \nonumber \\&\le \alpha (\theta _1,\theta )X_{\theta _1,\theta _3,\theta } + \beta (\theta _3,\theta )Y_{\theta _1,\theta _3,\theta }=1, \end{aligned}$$
(58)
$$\begin{aligned} \underset{-\theta<\theta _1<\theta }{\mathrm{max}}&(\alpha (\theta _1,\theta ))X_{\theta _1,\theta _3,\theta } + \underset{\theta<\theta _3<2\pi -\theta }{\mathrm{max}}(\beta (\theta _3,\theta ))Y_{\theta _1,\theta _3,\theta } \nonumber \\&\ge \alpha (\theta _1,\theta )X_{\theta _1,\theta _3,\theta } + \beta (\theta _3,\theta )Y_{\theta _1,\theta _3,\theta } = 1. \end{aligned}$$
(59)

The idea is to show that (58) is equivalent to (56), and (59) is equivalent to (57).

Fig. 6
figure 6

Visualizing the relationship between tangent lines and the function \(g_\theta (\eta )\) defined in (60). Red lines show the curves produced by changing only \(\theta \), and blue lines are tangents to the envelope at \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\). When \(\alpha (\theta _1,\theta )\) and \(\beta (\theta _3,\theta )\) are minimised, the tangent line corresponds to \(X+Y=1\). When \(\alpha (\theta _1,\theta )\) and \(\beta (\theta _3,\theta )\) are maximised, the line becomes a tangent line to the lower envelope at \((X_\theta ,Y_\theta )\), where \(X_\theta \equiv X_{0,\pi ,\theta }\) and \(Y_\theta \equiv Y_{0,\pi ,\theta }\)

To study the extrema of \(\alpha (\theta _1,\theta )\) and \(\beta (\theta _1,\theta )\), we define the function

$$\begin{aligned} g_\theta (\eta ) \equiv \frac{1}{2\pi } \int \limits _{0}^{2\pi }\frac{P(\rho e^{\mathrm{i}(\phi -\eta )})P(\rho e^{\mathrm{i}(\phi +\eta )})}{P(\rho e^{\mathrm{i}(\phi -\theta )})P(\rho e^{\mathrm{i}(\phi +\theta )})}d\phi = \frac{1}{2\pi }\int \limits _{0}^{2\pi }{\mathcal {G}}_{\theta ,\eta }(\rho e^{\mathrm{i}\phi })d\phi , \end{aligned}$$
(60)

where \(0 \le \eta \le 2 \pi \) and the integrand is defined as

$$\begin{aligned} {\mathcal {G}}_{\theta ,\eta }(\zeta ) \equiv \frac{P(\zeta /\mu )P(\zeta \mu )}{P(\zeta /\nu )P(\zeta \nu )}, \qquad \mu \equiv e^{\mathrm{i}\eta },\qquad \nu \equiv e^{\mathrm{i} \theta }. \end{aligned}$$
(61)

From the definitions (50) it is clear that the two coefficients functions in (49) can be written in terms of this single function:

$$\begin{aligned} \alpha (\theta _1,\theta ) = g_\theta (\theta _1), \qquad \beta (\theta _3,\theta ) = g_\theta (\theta _3). \end{aligned}$$
(62)

Analysis of this function \(g_\theta (\eta )\) provides the key to the proofs of the conjectures.

Before presenting the details, the strategy explained in (58)–(59) is illustrated in Fig. 6. By minimising both \(\alpha (\theta _1,\theta )\) and \(\beta (\theta _3,\theta )\), the upper envelope \(X+Y=1\) is obtained. In contrast, by maximising both \(\alpha (\theta _1,\theta )\) and \(\beta (\theta _3,\theta )\), a tangent line to the lower envelope is obtained.

It is important to mention that because of the definition of \(P(\zeta )\),

$$\begin{aligned} P(\rho e^{ \mathrm{i}\phi }) = \prod _{n=1}^{\infty }\left( 1+\rho ^{4n-2} - 2\rho ^{2n-1}\cos \phi \right) > 0, \end{aligned}$$
(63)

which means \({\mathcal {G}}_{\theta ,\eta }(\rho e^{\mathrm{i}\phi })\) is real and positive. By a log-sum and sum-log inequality, we obtain

$$\begin{aligned} \begin{aligned} \log g_\theta (\eta )&= \log \left[ \frac{1}{2\pi } \int \limits _{0}^{2\pi }\frac{P(\rho e^{\mathrm{i}(\phi -\eta )})P(\rho e^{\mathrm{i}(\phi +\eta )})}{P(\rho e^{\mathrm{i}(\phi -\theta )})P(\rho e^{\mathrm{i}(\phi +\theta )})}d\phi \right] \\&\ge \frac{1}{2\pi }\int \limits _{0}^{2\pi }\log \frac{P(\rho e^{\mathrm{i}(\phi -\eta )})P(\rho e^{\mathrm{i}(\phi +\eta )})}{P(\rho e^{\mathrm{i}(\phi -\theta )})P(\rho e^{\mathrm{i}(\phi +\theta )})}d\phi \\&=\frac{1}{2\pi }\int \limits _{0}^{2\pi }[\log (P(\rho e^{\mathrm{i}(\phi -\eta )}))+\log (P(\rho e^{\mathrm{i}(\phi +\eta )})) \\&\quad -\log (P(\rho e^{\mathrm{i}(\phi -\theta )}) -\log (P(\rho e^{\mathrm{i}(\phi +\theta )}))]d\phi = 0. \end{aligned} \nonumber \\ \end{aligned}$$
(64)

Thus, we can conclude that \(g_\theta (\eta )\ge 1\) for all \(\theta \) and \(\eta \). We will use this in the next section to analyse the behavior of the function \(g_\theta (\eta )\).

6 Analysis of the Functions \({g_{\theta (\eta )}}\) and \(\frac{\partial g_{\theta }}{\partial \eta }\)

This section studies the behavior of the function \(g_\theta (\eta )\). First, we propose an alternative expression of \(g_\theta (\eta )\) to be:

$$\begin{aligned} g_\theta (\eta ) = \frac{P(\nu /\mu )P(\mu \nu )}{{\hat{P}}(1)P(\nu ^2)}[K(\mu /\nu ) - K(\mu \nu )], ~ \end{aligned}$$
(65)

where we recall that \(\mu =e^{\mathrm{i} \eta }\) and \(\nu =e^{\mathrm{i}\theta }\). This expression is useful for the analysis of \(g_\theta (\eta )\).

To explain the derivation of (65) we note that the integrand \({\mathcal {G}}_{\theta , \eta }(\zeta )\) defined in (61) has two simple poles at \(\zeta = \nu \) and \(\zeta = 1/\nu \) and is readily confirmed, on use of (19), to be loxodromic. The function \({\mathcal {G}}_{\theta , \eta }(\zeta )\) can therefore also be written as

$$\begin{aligned} {\mathcal {G}}_{\theta , \eta }(\zeta ) =\frac{P(\zeta /\mu )P(\zeta \mu )}{P(\zeta /\nu )P(\zeta \nu )} = c_1[K(\zeta /\nu ) - K(\zeta \nu )] + c_2, \end{aligned}$$
(66)

where \(c_1, c_2 \in {\mathbb {C}}\). Since \(K(\zeta )\) is a logarithmic derivative of \(P(\zeta )\) and has functional properties (21), the right hand side of (66) is also a loxodromic function and has two simple poles at \(\zeta =\nu \) and \(\zeta =1/\nu \). The coefficients \(c_1\) and \(c_2\) are determined by considering the limits \(\zeta \rightarrow \nu \) and \(\zeta \rightarrow \mu \),

$$\begin{aligned} c_2 = -c_1[K(\mu /\nu ) - K(\mu \nu )],\qquad c_2 = \frac{P(\nu /\mu )P(\mu \nu )}{{\hat{P}}(1)P(\nu ^2)}[K(\mu /\nu ) - K(\mu \nu )]. \end{aligned}$$
(67)

A Liouville-type argument then confirms the equivalence of the two expressions for \({\mathcal {G}}_{\theta , \eta }(\zeta )\) in (66). We therefore conclude by using (60) and (66) that

$$\begin{aligned} \begin{aligned} g_\theta (\eta )&= \frac{1}{2\pi }\int \limits _{0}^{2\pi } {\mathcal {G}}_{\theta , \eta }(\rho e^{\mathrm{i}\phi })d\phi \\&= \frac{1}{2\pi }\int \limits _{0}^{2\pi } \left( c_1[K(\rho e^{\mathrm{i}(\phi -\theta )}) - K(\rho e^{\mathrm{i}(\phi +\theta )})] + c_2\right) d\phi = c_2, \end{aligned}\nonumber \\ \end{aligned}$$
(68)

which is precisely (65).

Now, we consider the behavior of the function \(g_\theta (\eta )\) by exploiting the derivative of \(g_\theta (\eta )\):

$$\begin{aligned} \frac{\partial g_\theta }{\partial \eta } = \frac{\partial \mu }{\partial \eta }\frac{\partial g_\theta }{\partial \mu } =\mathrm{i}\frac{P(\nu /\mu )P(\mu \nu )}{{\hat{P}}(1)P(\nu ^2)}(L(\mu /\nu ) - L(\mu \nu )) + \mathrm{i}g_\theta [K(\mu \nu ) - K(\nu /\mu )]. \end{aligned}$$
(69)

The function \(L(\mu /\nu ) - L(\mu \nu )\) is a loxodromic function as a function of \(\mu \), and has two second-order poles at \(\mu = \nu \) and \(\mu = 1/\nu \). Similar to the expression of \(g_\theta (\eta )\), we propose another expression of \(L(\mu /\nu )-L(\mu \nu )\) given by

$$\begin{aligned} L(\mu /\nu ) - L(\mu \nu ) = \frac{{\hat{P}}(1)^2 P(\mu ^2)P(\nu ^2)}{P(\nu /\mu )P(\mu /\nu )P(\mu \nu )^2}. \end{aligned}$$
(70)

To establish this expression, consider a new function

$$\begin{aligned} {\mathcal {L}}(\mu )\equiv L(\mu /\nu ) - L(\mu \nu ) - \frac{{\hat{P}}(1)^2 P(\mu ^2)P(\nu ^2)}{P(\nu /\mu )P(\mu /\nu )P(\mu \nu )^2}. \end{aligned}$$
(71)

The function \({\mathcal {L}}(\mu )\) is shown to be a loxodromic function by using the functional properties (19) and (21) as follows:

$$\begin{aligned} \begin{aligned} {\mathcal {L}}(\rho ^2 \mu )&= L(\rho ^2 \mu /\nu )-L(\rho ^2 \mu \nu ) - \frac{{\hat{P}}(1)^2 P(\rho ^4 \mu ^2)P(\nu ^2)}{P(\nu / \rho ^2\mu )P(\rho ^2 \mu /\nu )P(\rho ^2\mu \nu )^2}\\&=L(\mu /\nu )-L(\mu \nu )-\frac{{\hat{P}}(1)^2 (\rho ^2 \mu ^4)^{-1}P( \mu ^2)P(\nu ^2)}{(\rho ^2 \mu /\nu )^{-1} P(\nu / \mu ) (\mu /\nu )^{-1}P( \mu /\nu ) (\mu \nu )^{-2}P(\mu \nu )^2} \\&= {\mathcal {L}}(\mu ). \end{aligned} \end{aligned}$$
(72)

Furthermore, the asymptotic expansions of each term at \(\mu =\nu \) and \(\mu =1/\nu \) are

$$\begin{aligned} \begin{aligned} L(\mu /\nu ) - L(\mu \nu )&\sim -\frac{1}{(1-\mu /\nu )^2} + \frac{1}{1-\mu /\nu } + {\mathcal {O}}(1) \quad \mathrm{at}\ \ \mu = \nu ,\\ L(\mu /\nu ) - L(\mu \nu )&\sim \frac{1}{(1-\mu \nu )^2} - \frac{1}{1-\mu \nu } + {\mathcal {O}}(1) \quad \mathrm{at}\ \ \mu = 1/\nu ,\\ \frac{{\hat{P}}(1)^2 P(\mu ^2)P(\nu ^2)}{P(\nu /\mu )P(\mu /\nu )P(\mu \nu )^2}&\sim \frac{1}{(1-\nu /\mu )(1-\mu /\nu )} + {\mathcal {O}}(1)\\&=-\frac{1}{(1-\mu /\nu )^2} + \frac{1}{1-\mu /\nu } + {\mathcal {O}}(1) \quad \mathrm{at}\ \ \mu = \nu ,\\ \frac{{\hat{P}}(1)^2 P(\mu ^2)P(\nu ^2)}{P(\nu /\mu )P(\mu /\nu )P(\mu \nu )^2}&\sim \frac{1}{(1-\mu \nu )^2} - \frac{1}{1-\mu \nu } + {\mathcal {O}}(1) \quad \mathrm{at}\ \ \mu = 1/\nu . \end{aligned} \end{aligned}$$
(73)

Thus the two poles at \(\nu \) and \(1/\nu \) are removable. We conclude that \({\mathcal {L}}(\mu )\) is a constant function, i.e., it is independent of \(\mu \). In addition, we define another function

$$\begin{aligned} \tilde{{\mathcal {L}}}(\nu )\equiv L(\mu /\nu ) - L(\mu \nu ) - \frac{{\hat{P}}(1)^2 P(\mu ^2)P(\nu ^2)}{P(\nu /\mu )P(\mu /\nu )P(\mu \nu )^2}. \end{aligned}$$
(74)

By similar arguments, the function \(\tilde{{\mathcal {L}}}(\nu )\) can be seen to be a loxodromic function of \(\nu \) with removable poles \(\nu =\mu \) and \(\nu =1/\mu \). Thus, \({\mathcal {L}}(\mu )\) is independent of both \(\mu \) and \(\nu \).

On use of the functional property (21) of \(L(\zeta )\), it can be verified that \({\mathcal {L}}(\rho )=0\), which means \({\mathcal {L}}(\mu )\) vanishes everywhere leading to expression (70).

Fig. 7
figure 7

Typical behavior of the function \(g_\theta (\eta )\), \(k_\theta (\eta )\), and \(\partial g_\theta /\partial \eta \). A local maximum of \(g_\theta (\eta )\) is obtained when \(\eta =0\) or \(\eta =\pi \). A local minimum of \(g(\eta )\) is obtained when \(\eta =\theta \) or \(2\pi - \theta \)

Expression (70) can now be used to analyse the behaviour of \(g_\theta (\eta )\). By substituting (70) into (69),

$$\begin{aligned} \begin{aligned} \frac{\partial g_\theta }{\partial \eta }&= \mathrm{i}\left[ \frac{{\hat{P}}(1)P(\mu ^2)}{P(\mu /\nu )P(\mu \nu )} + g_\theta (\eta ) (K(\mu \nu ) - K(\nu /\mu ))\right] \\&=\mathrm{i}(K(\mu \nu ) - K(\nu /\mu ))\left[ g_{\theta }(\eta )-\frac{1}{K(\nu /\mu )-K(\mu \nu )} \frac{{\hat{P}}(1) P(\mu ^2)}{P(\mu /\nu )P(\mu \nu )} \right] \\&=\mathrm{i}(K(\mu \nu ) - K(\nu /\mu ))\left[ g_\theta (\eta ) - \frac{1}{g_\eta (\theta )}\right] ~. \end{aligned}\nonumber \\ \end{aligned}$$
(75)

Since, from (64), \(g_\theta (\eta ) \ge 1\) and \(g_\eta (\theta ) \ge 1\) then

$$\begin{aligned} g_\theta (\eta ) - \frac{1}{g_\eta (\theta )} \ge 0. \end{aligned}$$
(76)

The sign of the derivative (75) is therefore determined by the function

$$\begin{aligned} k_\theta (\eta )\equiv \mathrm{i}(K(\mu \nu ) - K(\nu /\mu )). \end{aligned}$$
(77)

The derivative of \(k_\theta (\eta )\) with respect to \(\eta \) is

$$\begin{aligned} \frac{\partial k_\theta }{\partial \eta } = -(L(\mu \nu ) + L(\nu /\mu )). \end{aligned}$$
(78)

But it can also be shown that \(L(e^{\mathrm{i}\phi })\) for \(0<\phi <2\pi \) is real and positive—the proof is given in Appendix A. This means that

$$\begin{aligned} \frac{\partial k_\theta }{\partial \eta }< 0, \qquad 0<\eta <2\pi . \end{aligned}$$
(79)

It is straightforward to check that \(k_\theta (0) = k_\theta (\pi ) = 0\) and consequently, using continuity arguments,

$$\begin{aligned} \lim _{\eta \rightarrow \theta -0} k_\theta (\eta )&= -\infty ,&\quad \lim _{\eta \rightarrow \theta +0} k_\theta (\eta )&= +\infty , \end{aligned}$$
(80)
$$\begin{aligned} \lim _{\eta \rightarrow 2\pi - \theta -0} k_\theta (\eta )&= -\infty ,&\lim _{\eta \rightarrow 2\pi - \theta +0} k_\theta (\eta )&= +\infty . \end{aligned}$$
(81)

With this information we can determine the maxima and the minima of \(g_\theta (\eta )\). The local minima occur when \(\eta =\theta \) or \(\eta = 2\pi - \theta \). By the definition (60),

$$\begin{aligned} g_\theta (\theta ) = g_\theta (2\pi -\theta ) = 1. \end{aligned}$$
(82)

On the contrary, local maxima occur when \(\eta =0\) and \(\eta =\pi \). Recall that this case corresponds to the symmetric choice of points with parameter \(\theta \). Figure 7 shows the typical behaviour of the function \(g_\theta (\eta )\) for a range of \(\rho \) values; recall that \(P(\zeta )\), and hence \(g_\theta (\eta )\) and \(k_{\theta }(\eta )\) depend on the parameter \(\rho \) although this dependence is hidden in our notation.

7 Proof of the Two Conjectures (56)–(57)

We will now prove the two conjectures (56) and (57) using what we have established about the behavior of \(g_\theta (\eta )\). From (58) and (82), we can conclude that

$$\begin{aligned} X_{\theta _1,\theta _3,\theta }+ Y_{\theta _1,\theta _3,\theta } \le 1 \end{aligned}$$
(83)

for all \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\). This is equivalent to the conjecture (56).

The maxima of \(\alpha (\theta _1,\theta )\) and \(\beta (\theta _3,\theta )\) occur when \(\theta _1= 0\) and \(\theta _3 = \pi \). Hence, from (59), we conclude that

$$\begin{aligned} \alpha _\theta X_{\theta _1,\theta _3,\theta } + \beta _\theta Y_{\theta _1,\theta _3,\theta } \ge 1. \end{aligned}$$
(84)

The inequality (84) is not, however, equivalent to the inequality (57). What has been proven in (84) is that for \(0<\theta <\pi \), the pair \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) is above the tangent at \((X_{0,\pi ,\theta },Y_{0,\pi ,\theta })\) on the lower envelope. Thus, for the final step, we need to prove that for another \(\theta '\ne \theta \), the pair \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) lies above the tangent at \((X_{0,\pi ,\theta '},Y_{0,\pi ,\theta '})\). The condition is equivalent to

$$\begin{aligned} \alpha _{\theta '} X_{\theta _1,\theta _3,\theta }+\beta _{\theta '}Y_{\theta _1,\theta _3,\theta }\ge 1. \end{aligned}$$
(85)

Notice that because of the choice of angular coordinates (44) the range of \(\theta _1\) and \(\theta _3\) are related to \(\theta \), but \(\theta '\) should be chosen independently of \(\theta _1\) and \(\theta _3\) since we need it to parametrize the whole of the lower envelope.

The procedure to prove (85) can be divided into two steps.

In the first step (Step 1) we find a specific \({\tilde{\theta }}\) such that \(\alpha _{\theta '}\ge \alpha (\theta _1,{\tilde{\theta }})\) and \(\beta _{\theta '}\ge \beta (\theta _3,{\tilde{\theta }})\). This step can be viewed as a movement from a point on the lower envelope (the set of 3 blue curves in Fig. 8) to a point on another curve (the red curve in Fig. 8) on which the point \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) lies.

In the next step (Step 2) we make a use of a log-sum and sum-log inequality to prove that

$$\begin{aligned} \alpha (\theta _1,{\tilde{\theta }})X_{\theta _1,\theta _3,\theta } + \beta (\theta _3,{\tilde{\theta }}) Y_{\theta _1,\theta _3,\theta } \ge 1. \end{aligned}$$
(86)
Fig. 8
figure 8

Three possible cases for the set \((\theta ',\theta _1,\theta _3)\). (i) Case 1: \(\alpha _{\theta '}\ge \alpha (\theta _1,\theta ')\) and \(\beta _{\theta '}\ge \beta (\theta _3,\theta ')\). (ii) Case 2a: Because \(|\theta _1|>\theta '\), we choose \({\tilde{\theta }}=|\theta _1|\). We can see that \(\alpha _{\theta '}\ge \alpha (\theta _1,|\theta _1|)=1\) and \(\beta _{\theta '}> \beta (\theta _3,|\theta _1|)\). (iii) Case 2b: Because \(|\pi -\theta _3|>\pi -\theta '\), we choose \({\tilde{\theta }}=\pi -|\pi -\theta _3|\). We can see that \(\alpha _{\theta '}\ge \alpha (\theta _1,\pi -|\pi -\theta _3|)\) and \(\beta _{\theta '}> \beta (\theta _3,\pi -|\pi -\theta _3|)=1\)

First let us explain Step 1. For an arbitrary value of \(\theta '\) with \(0<\theta '<\pi \), there are three possible cases for the set \((\theta ',\theta _1,\theta _3)\). The three cases are illustrated in Fig. 8: they can be understood as corresponding to the lower envelope being split into three parts: the three blue curve segments sitting on the lower envelope in Fig. 8 as shown in top, middle, and bottom of this figure. These will be called cases 1, 2a and 2b, respectively. The red solid curve on each Fig. 8 is drawn by changing \(\theta \) from \(|\theta _1|\) to \(\pi -|\pi -\theta _3|\) while \(\theta _1\) and \(\theta _3\) are fixed. The range of \(\theta \) comes from the condition of angular coordinates (44). The reason for the separation into three cases is that when \(\theta '<|\theta _1|\) or \(\pi -|\pi -\theta _3|<\theta '\), an auxiliary point \((X_{\theta _1,\theta _3,\theta '},Y_{\theta _1,\theta _3,\theta '})\) does not lie on the red curve. Based on the argument above, we therefore assign a value \({\tilde{\theta }}\) to a value of \(\theta '\) according to the following three conditions:

  • Case 1: When \(|\theta _1|\le \theta '\le \pi -|\pi -\theta _3|\), which means that the point \((X_{\theta '},Y_{\theta '})\) marked as “+” in the top right of Fig. 8 lies on the blue solid curve, an auxiliary point \((X_{\theta _1,\theta _3,\theta '},Y_{\theta _1,\theta _3,\theta '})\) also lies on the red curve marked as “x”. From the behavior of \(g_{\theta '}(\eta )\), we can see that \(\alpha _{\theta '}\ge \alpha (\theta _1,\theta ')\) and \(\beta _{\theta '}\ge \beta (\theta _3,\theta ')\). In this case, we choose \({\tilde{\theta }}=\theta '\).

  • Case 2a: When \(\theta '<|\theta _1|\) as shown in the middle of Fig. 8, an auxiliary point \((X_{\theta _1,\theta _3,\theta '},Y_{\theta _1,\theta _3,\theta '})\) does not lie on the red curve. In this case, the end point of the red curve is chosen, that is, we make the choice \({\tilde{\theta }}=|\theta _{1}|\). From the behavior of \(g_\theta (\eta )\), it is apparent that \(\alpha _{\theta '}\ge \alpha (\theta _1,|\theta _1|)=1\). Because of the angular condition (44) such that \(|\theta _1|<\theta _3<2\pi -|\theta _1|\) and because \(\beta _{\theta }\) is a monotonically decreasing function with respect to \(\theta \) for \(0<\theta <\pi \) (Appendix B), \(\beta _{\theta '}>\beta _{|\theta _1|}=\beta (\pi ,|\theta _1|)\ge \beta (\theta _3,|\theta _1|)\).

  • Case 2b: This is the same as case 2a but refers to the other end of the red curve. When \(\pi -|\pi -\theta _3|<\theta '\) as shown in the bottom of Fig. 8, a point \((X_{\theta _1,\theta _3,\theta '},Y_{\theta _1,\theta _3,\theta '})\) does not lie on the red curve (cf: Case 2a). In this case, the opposite boundary point of the red curve is now chosen, which means that we choose \({\tilde{\theta }}=\pi - |\pi - \theta _3|\). From the behavior of \(g_\theta (\eta )\), it is apparent that \(\beta _{\theta '}\ge \beta (\theta _3,\pi -|\pi -\theta _3|)=1\). Because \(\pi -|\pi -\theta _3|<\theta '\), \(|\theta _1|<\pi -|\pi -\theta _3|\), and because \(\alpha _{\theta }\) is a monotonically increasing function with respect to \(\theta \) for \(0<\theta <\pi \) (Appendix B), \(\alpha _{\theta '}> \alpha _{\pi -|\pi -\theta _3|}\ge \alpha (\theta _1,\pi -|\pi -\theta _3|)\).

In all cases, it is possible to find a specific \({\tilde{\theta }}\) such that \(\alpha _{\theta '}\ge \alpha (\theta _1,{\tilde{\theta }})\) and \(\beta _{\theta '}\ge \beta (\theta _3,{\tilde{\theta }})\). Thus, we obtain the important result that

$$\begin{aligned} \alpha _{\theta '} X_{\theta _1,\theta _3,\theta } + \beta _{\theta '} Y_{\theta _1,\theta _3,\theta } \ge \alpha (\theta _1,{\tilde{\theta }}) X_{\theta _1,\theta _3,\theta } + \beta (\theta _3,{\tilde{\theta }}) Y_{\theta _1,\theta _3,\theta }. \end{aligned}$$
(87)

This facilitates Step 2 where we use a log-sum and sum-log inequality for the right hand side of (87) as follows:

$$\begin{aligned}&\log \left[ \alpha (\theta _1,{\tilde{\theta }}) X_{\theta _1,\theta _3,\theta } + \beta (\theta _3,{\tilde{\theta }}) Y_{\theta _1,\theta _3,\theta }\right] \nonumber \\&\quad =\log \left[ \frac{1}{2\pi }\int \limits _{0}^{2\pi }(A(\theta _1,{\tilde{\theta }},\rho e^{\mathrm{i}\phi })X_{\theta _1,\theta _3,\theta } + B(\theta _3,{\tilde{\theta }},\rho e^{\mathrm{i}\phi })Y_{\theta _1,\theta _3,\theta }) d\phi \right] \nonumber \\&\quad \ge \frac{1}{2\pi }\int \limits _{0}^{2\pi }\log \left[ A(\theta _1,{\tilde{\theta }},\rho e^{\mathrm{i}\phi })X_{\theta _1,\theta _3,\theta } + B(\theta _3,{\tilde{\theta }},\rho e^{\mathrm{i}\phi })Y_{\theta _1,\theta _3,\theta } \right] d\phi . \end{aligned}$$
(88)

However the right hand side can be written

$$\begin{aligned} \begin{aligned}&\frac{1}{2\pi }\int \limits _{0}^{2\pi }\log \left[ X_{\theta _1,\theta _3,\theta } + \frac{B(\theta _3,{\tilde{\theta }},\rho e^{\mathrm{i}\phi })}{A(\theta _1,{\tilde{\theta }},\rho e^{\mathrm{i}\phi })}Y_{\theta _1,\theta _3,\theta } \right] d\phi + \frac{1}{2\pi }\int \limits _{0}^{2\pi }\log A(\theta _1,{\tilde{\theta }}, \rho e^{\mathrm{i}\phi })d\phi \\&\qquad =\frac{1}{2\pi }\int \limits _{0}^{2\pi }\log \left[ X_{\theta _1,\theta _3,\theta } + \frac{B(\theta _3,\theta ,\rho e^{\mathrm{i}\phi })}{A(\theta _1,\theta ,\rho e^{\mathrm{i}\phi })}Y_{\theta _1,\theta _3,\theta } \right] d\phi + \frac{1}{2\pi }\int \limits _{0}^{2\pi }\log A(\theta _1,\theta , \rho e^{\mathrm{i}\phi })d\phi \\&\qquad =\frac{1}{2\pi }\int \limits _{0}^{2\pi }\log \left[ A(\theta _1,\theta ,\rho e^{\mathrm{i}\phi })X_{\theta _1,\theta _3,\theta } + B(\theta _3,\theta ,\rho e^{\mathrm{i}\phi })Y_{\theta _1,\theta _3,\theta } \right] d\phi = 0, \end{aligned} \end{aligned}$$
(89)

where we used the fact that the integral of \(\log A(\theta _1,\theta ,\rho e^{\mathrm{i}\phi })\) is

$$\begin{aligned} \begin{aligned} \frac{1}{2\pi }\int \limits _{0}^{2\pi } \log A(\theta _1,\theta ,\rho e^{\mathrm{i\phi }})d\phi&= \frac{1}{2\pi }\int \limits _{0}^{2\pi } \log \left( \frac{P(\rho e^{\mathrm{i}(\phi -\theta _1)})P(\rho e^{\mathrm{i}(\phi +\theta _1)}) }{P(\rho e^{\mathrm{i}(\phi -\theta )})P(\rho e^{\mathrm{i}(\phi +\theta )})}\right) d\phi \\&= \frac{1}{2\pi } \int \limits _{0}^{2\pi } [\log P(\rho e^{\mathrm{i}(\phi -\theta _1)}) + \log P(\rho e^{\mathrm{i}(\phi +\theta _1)}) \\&~~~~~~~~~~~~~~~- \log P(\rho e^{\mathrm{i}(\phi -\theta )}) - \log P(\rho e^{\mathrm{i}(\phi +\theta )})] d\phi = 0 \end{aligned} \end{aligned}$$
(90)

and the fact that

$$\begin{aligned} {B(\theta _1,{\tilde{\theta }},\rho e^{\mathrm{i}\phi }) \over A(\theta _3,{\tilde{\theta }},\rho e^{\mathrm{i}\phi })} = \frac{B(\theta _1,\theta ,\rho e^{\mathrm{i}\phi })}{A(\theta _3,\theta ,\rho e^{\mathrm{i}\phi })} = \frac{P(\rho e^{\mathrm{i}(\phi -\theta _3)})P(\rho e^{\mathrm{i}(\phi +\theta _3)})}{P(\rho e^{\mathrm{i}(\phi -\theta _1)})P(\rho e^{\mathrm{i}(\phi +\theta _1)})} \end{aligned}$$
(91)

is independent of \(\theta \) and \({\tilde{\theta }}\). In the last line, we have used the Fay trisecant identity (45). Putting all this together it has been shown that

$$\begin{aligned} \log \left[ \alpha (\theta _1,{\tilde{\theta }}) X_{\theta _1,\theta _3,\theta } + \beta (\theta _3,{\tilde{\theta }}) Y_{\theta _1,\theta _3,\theta }\right] \ge 0. \end{aligned}$$
(92)

From (92) it follows that

$$\begin{aligned} \alpha _{\theta '}X_{\theta _1,\theta _3,\theta } + \beta _{\theta '}Y_{\theta _1,\theta _3,\theta } \ge 1, \end{aligned}$$
(93)

which means for \(0<\theta <\pi \), \(\alpha _{\theta } X + \beta _{\theta } Y \ge 1\) for all (XY). We have proved all the conjectured features of the envelope structure observed by previous authors.

We now summarize steps 1 and 2 geometrically with the aid of Fig. 8. In step 1 we found a choice of an auxiliary tangent line (the red line) to the pink curve on which \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) sits (and which corresponds to a fixed \(\theta _1\) and \(\theta _3\)) above the tangential line on the lower envelope (the blue line) for the three segments of the lower envelope into which it naturally divides for any fixed \(\theta _1\) and \(\theta _3\) (i.e., cases 1, 2a and 2b). In step 2 the point \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) is then shown to be above that red tangent line by using the log-sum and sum-log inequalities. We thus prove that \((X_{\theta _1,\theta _3,\theta },Y_{\theta _1,\theta _3,\theta })\) is above any point on the lower envelope.

8 Discussion

By introducing the prime function on the Schottky double of a conformally equivalent concentric annulus to any given doubly connected sample with a single uncharged hole, and by using an integrated form of an associated Fay trisecant identity expressed in terms of that prime function, we have shown how to prove some conjectures made in the literature on an observed envelope structure associated with pairs of 4-point resistance measurements in a van der Pauw procedure. More practical implications of our results are discussed elsewhere [19].

The van der Pauw method for a sample with two or more holes is, of course, of interest and several studies have recently appeared [9, 11]. The theory of the prime function on domains of general finite connectivity has been expounded in a recent monograph [18] and the relevance of the prime function to expressing the complex potential for source/sink flows in such domains is already known [23]. At the same time, the Fay trisecant identity also carries over to higher genus Riemann surfaces, where the Fay identity can be written down in terms of the so-called prime form on those surfaces and associated theta functions [24]. Therefore, while we have focused here on what is the simplest non-trivial case of a holey sample, we expect all the ideas to be extendible to the case of a sample with any finite number of holes. The details, however, remain to be worked out and constitute an interesting future challenge.