Being neat
When you’re folding a sheet,
That’s mathematics!
Tom Lehrer
Abstract
We fold a triangle once along a straight line and study how small the area of the folded figure can be. It can always be as small as the fraction \(2-\sqrt{2}\) of the area of the original triangle.
This is best possible: For every positive number \(\varepsilon\) there are triangles that cannot be folded better than \(2-\sqrt{2}-\varepsilon\).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 The question
Suppose we have a paper in the form of a triangle. We fold it once along a straight line and ask how small the area of the folded object can be – equivalently, how large the area of the doubly covered part can be. Clearly this latter area cannot be larger than one half of that of the triangle, and it is exactly one half if and only if the triangle is isosceles. But how large can the area of the doubly covered part be if the triangle is scalene? And how small is the area in the worst possible case?
We may obviously ask what happens if we fold polygons with four, five, six, … sides. As Bo Senje reports, problems on the folding of rectangles have been given to students. But rectangles are quite special quadrilaterals. And then we should pass to polyhedra in \(n\geq 3\) dimensions …
2 The answer
Theorem 1
Every triangle can be folded along a straight line so that the area of the doubly covered part is the fraction \(\sqrt{2}-1\approx 0.4142\) of the area of the original triangle. This is best possible in the sense that, given any positive number \(\varepsilon\) , there exist triangles that cannot be folded to yield an area of the doubly covered part larger than the fraction \(\sqrt{2}-1+\varepsilon\) .
The area of the folded figure and the area of its doubly covered part sum up to the area of the original triangle. To ask for the smallest area of the folded figure is therefore equivalent to asking for the largest area of the doubly covered part. For the calculations to be made, it is easier to consider the latter quantity, which we shall do from now on. (See, e. g., Proposition 5. In the simple but important estimate in Proposition 7, the folded object has infinite area, so we can only work with the area of the doubly covered part of the strip.)
Denote the side lengths of the triangle by \(A\), \(B\) and \(C\), with the notation chosen so that \(A\leq B\leq C\).
We shall fold a triangle in two different ways:
-
1.
If we fold along the bisector of the smallest angle, then the relative area of the doubly covered part is \(B/(B+C)\). This quantity is \(\geq\sqrt{2}-1\) if and only if
$$B\geq C/\sqrt{2}.$$(1) -
2.
If we fold along a line which is orthogonal to the longest side, then the best result to be obtained is \(C^{2}/(B^{2}-A^{2}+2C^{2})\). This quantity is \(\geq\sqrt{2}-1\) if and only if
$$B^{2}\leq A^{2}+(\sqrt{2}-1)C^{2}.$$(2)
One of (1) and (2) always holds.
So to obtain the fraction \(\sqrt{2}-1\), we need two different kinds of folding: along lines that pass through a vertex and lines that are orthogonal to a side. Since we use only special foldings for these considerations, calculations are not very complicated.
To prove that it is not possible to do anything better, we must consider all foldings, so calculations become more involved. We shall study triangles with vertices at \(a=(0,0)\), \(b=(1,0)\), and \(c=(c_{1},c_{2})\), with \(c_{1}=\frac{1}{2}\sqrt{2}\) and \(c_{2}> 0\) very small. This means that the side lengths \(A:B:C\) are almost proportional to \((1-\frac{1}{2}\sqrt{2}):\frac{1}{2}\sqrt{2}:1\). These narrow triangles will give us the examples where all foldings are bad.
When the line avoids the direction of the longest side as well as a direction orthogonal to the longest side, a simple estimate shows that the area of the doubly covered part cannot be close to \(\sqrt{2}-1\) (Sect. 9.1 and Fig. 2). The situations when the line is almost parallel or almost orthogonal to the longest side of these thin triangles remain to be studied. This is done in Sects. 9.2 and 9.3, respectively.
3 The history
Sometime during the years 1997–1999, Bo Senje at Halmstad University College asked a question: How large can the doubly covered part of a folded triangle be? He had seen some student problems on the folding of rectangles but, to his surprise, never any on the folding of triangles. (Now we may suspect that the explanation was that triangles are more difficult than rectangles.)
Georgi Mihailov Tchilikov (1967–2010), also at Halmstad University College, worked on the problem from time to time since December 2004. Some time later (according to Bo most probably in 2008) he found formula (19) for foldings along a bisector.
Bo also asked: Will the best folding always go through a vertex of the triangle?
The answer to this second question is in the negative as we shall see: for certain triangles, folding orthogonally to a side gives better results than folding along a bisector (see Remark 1). Also Georgi had found this. His results have not been published.
The questions posed by Bo reached me in April 2005. I did not know about the results proved by Georgi until February 2015.
Independently of what was known in Halmstad, I proved (in 2005 or 2006) the formula for foldings along lines through a vertex, and Martin Herschend then found a triangle where folding orthogonally to a side gives a better result than folding along a line through a vertex.
After having tried many different ways to approach the question – too many to be remembered – I could finally prove in February 2015 that \(\sqrt{2}-1\) is best possible. The proof was, however, so cumbersome that I did not dare to submit it to any journal. Only in the evening of 2016 October 13 did I find a proof that might convince more than one person. Of course I cannot exclude that there exist simpler proofs.
I thank Bo Senje for having asked these questions – a most important activity and the starting point of any kind of research. He also checked an earlier version of the manuscript and sent me comments.
I am grateful to two anonymous referees, whose comments have led to clarifications in the presentation.
For origamics in general, see [1] and [3]. More recently, [4, 5], and [2] studied paper foldings. The first-mentioned article treats mostly folding of rectangles, then of right-angled triangles, and finally of general triangles. However, the best possible result is not obtained. Oswald’s paper [5] is about results on paper foldings obtained by Adolf Hurwitz (1859–1919).
4 Calculating with triangles
4.1 Best foldings and the worst best folding
Let \(T\) be a triangle, understood as the convex hull of three non-collinear points in the plane, and let \({\mathbf{area}}(T)\) be its area. We shall denote by \(T(a,b,c)\) the triangle with vertices \(a\), \(b\), \(c\).
We denote by \(Y(\varphi,t)\) the half plane defined by
and by \(L(\varphi,t)\) its boundary, defined by
forming an angle \(\varphi\) with the positive \(x_{1}\)-axis. Obviously \(Y(\varphi+2\pi,t)=Y(\varphi,t)\) and \(L(\varphi+\pi,t)=L(\varphi,-t)\).
Let \(Y\) be a half plane and let \(T^{[\partial Y]}=T^{[L]}\) be the reflection of \(T\) in its boundary \(\partial Y=L\). We fold \(T\) along \(L\) and place the folded object in \(Y\). Then the part which is doubly covered is \(T\cap T^{[\partial Y]}\cap Y\). We denote it by \(T_{Y}\). It is in general a quadrilateral. The folded object is \((T\cup T^{[\partial Y]})\cap Y\), to be denoted by \(T^{Y}\). It is in general a heptagon. See Figs. 1 and 4, respectively.
Depending of the shape of the triangle and the position of the line, some geometric elements can coincide: the doubly covered part can be a triangle; the folded object can have less than seven sides: six, five, four, or even three sides. See Fig. 3 for an example where \(T_{Y}\) is a quadrilateral and \(T^{Y}\) has five vertices. These special cases need not be treated separately since the calculations in the general case cover them all.
We have
so that the relative areas satisfy
The problem is to maximize the area of the set \(T_{Y}\) which is doubly covered.
We define the best folding of \(T\) as the largest relative area of the doubly covered object,
where the supremum is taken over all possible half planes \(Y\). For each triangle, the supremum is attained for at least one choice of \(Y\). The quantity \(\text{BF}(T)\) satisfies
As already noticed, the second inequality in (6) is an equality if and only if \(T\) is isoceles and is realized when \(L=\partial Y\) is the bisector between two equal sides.
How bad can \(\text{BF}(T)\) be? Let us define the worst best folding WBF as
4.2 Standard coordinates
It will often be convenient use the following coordinate system.
Thus all possible triangles are represented by the point \(c\) in the set \(\Gamma\), which is bounded by the segments \([(\frac{1}{2},\frac{1}{2}\sqrt{3}),(\frac{1}{2},0)]\) and \([(\frac{1}{2},0),(1,0)]\), and the circular arc from \((1,0)\) to \((\frac{1}{2},\frac{1}{2}\sqrt{3})\) with center at the origin and of radius \(1\). The triangles with \(c_{1}=\frac{1}{2}\) as well as those with \(c\) on the circular arc are isoceles, and for them \(\text{BF}(T)=\frac{1}{2}\).
We shall prove that, in standard coordinates,
The right-hand side is larger than \(\sqrt{2}-1\) for all \(c\in\Gamma\). It follows that \(\text{BF}(T)> \sqrt{2}-1\) for all triangles \(T\), and that \(\text{WBF}\geq\sqrt{2}-1\). Thus the first part of the main theorem follows.
The second part of the main theorem, to be proved in Sect. 9, says that the infimum of \(\text{BF}(T)\) over all triangles \(T\) is equal to \(\sqrt{2}-1\). It follows that \(\text{WBF}=\sqrt{2}-1\).
The right-hand side in (9) can be extended to a continuous function of \(c\in\overline{\Gamma}=\Gamma\cup[(1/2,0),(1,0)]\), and this extension takes the value \(\sqrt{2}-1\) only at \(c=(1/\sqrt{2},0)\). Hence the bad triangles are only those with \(c\) close to \((1/\sqrt{2},0)\). So, the infimum defining WBF is not attained: there is no worst triangle, but a family \((T_{\varepsilon})_{\varepsilon> 0}\) of bad triangles such that \(\text{BF}(T_{\varepsilon})\) tends to WBF as \(\varepsilon\searrow 0\); in standard coordinates, the vertex \(c\in\Gamma\) must tend to \((1/\sqrt{2},0)\notin\Gamma\).
4.3 A calculus of points and lines
As already mentioned, the vertices of a triangle will be called \(a\), \(b\), \(c\). Its angles will be denoted by \(\alpha\), \(\beta\), \(\gamma\); its side lengths \(A=\|b-c\|_{2}\), \(B=\|c-a\|_{2}\), \(C=\|a-b\|_{2}\), where \(\|x\|_{2}=\sqrt{x_{1}^{2}+x_{2}^{2}}\) denotes the Euclidean norm of a vector \(x\). Often, but not always, we shall choose notation so that \(A\leq B\leq C\). Then also \(\alpha\leq\beta\leq\gamma\), and \(\sin\alpha\leq\sin\beta\leq\sin\gamma=\sin(\alpha+\beta)\). We have
In addition to these inequalities we have \(\alpha\leq\beta\leq\frac{1}{2}\pi-\frac{1}{2}\alpha\), so that the admissible pairs \((\alpha,\beta)\) belong to the closed triangle with vertices at \((0,0)\), \((\frac{1}{3}\pi,\frac{1}{3}\pi)\) and \((0,\frac{1}{2}\pi)\) with the segment \([(0,0),(0,\frac{1}{2}\pi)]\) removed.
We shall need some explicit calculations on lines and reflections in lines. We study half planes \(Y(\varphi,t)\) defined by (3) and lines \(L(\varphi,t)\) defined by (4).
Two straight lines \(L(\varphi_{1},t_{1})\) and \(L(\varphi_{2},t_{2})\) with \(\varphi_{1}-\varphi_{2}\notin\pi\mathbf{Z}\) meet in a unique point
Two different points \(p\) and \(q\) determine a line \(L(\theta,s)\) with
By reflection in the straight line \(L(\varphi,t)\), a point \(p\) is mapped to a point \(p^{*}\), which is given by
A line \(L(\theta_{1},s_{1})\), when reflected in a line \(L(\varphi,t)\), is mapped to the line \(L(\theta_{2},s_{2})\), where
4.4 A calculus of areas
We shall use the notation \(\rho_{T}(\varphi,t)\) for the relative area of \(T_{Y(\varphi,t)}\), thus
We can take the supremum in two steps:
where
the best that can be obtained if we fold only along lines parallel to a fixed line.
If the vertices \(a\), \(b\), \(c\) are taken in the positive direction, the triangle \(T(a,b,c)\) has the area
where we think of \(a\in\mathbf{R}^{2}\) as a column vector, \(a=(a_{1},a_{2})^{\mathrm{T}}\), and write
A quadrilateral with vertices \(a\), \(b\), \(c\), \(d\) will be denoted by \(Q(a,b,c,d)\). If the vertices are taken in the positive direction, it has the area
4.5 Special foldings
We shall also briefly discuss quantities \(\text{BF}^{\text{vertex}}(T)\), \(\text{BF}^{\perp}(T)\), \(\text{WBF}^{\text{vertex}}\), and \(\text{WBF}^{\perp}\), defined in analogy with \(\text{BF}(T)\) and WBF, but when we fold only along lines that pass through a vertex or are orthogonal to a side, respectively. Clearly
yielding \(\text{WBF}^{\text{vertex}},\text{WBF}^{\perp}\leq\text{WBF}\). Actually we have strict inequality here:
We shall find families \((T^{\text{vertex}}_{\varepsilon})_{\varepsilon> 0}\) and \((T^{\perp}_{\varepsilon})_{\varepsilon> 0}\) of bad triangles for these two special foldings.
In the next sections, we shall start by studying some special foldings. We first fold along a straight line which is parallel to a side; see Sect. 5. This kind of folding can never give a better result than \(\frac{1}{3}<\sqrt{2}-1\).
To prove that a triangle can always be folded to yield the relative area \(\sqrt{2}-1\) of the doubly convered set we shall study two special ways of folding a triangle.
We fold along a line which passes through a vertex; see Sect. 6 and Corollary 2. In this case
where \(\alpha\leq\beta\leq\gamma\) are the angles of \(T\).
If we fold along a line which is parallel to a bisector, the result is the same, i. e., the supremum of \({\mathbf{area}}(T_{Y})\) over all half planes with \(L=\partial Y\) parallel to a certain bisector is the same as the supremum of the area over all half planes \(Y\) with \(L=\partial Y\) passing through the vertex of that bisector.
The bad triangles for this kind of folding are those with vertices in \((0,0)\), \((0,1)\), \((c_{1},c_{2})\), where \(c_{1}=\textbf{Au}\) and \(c_{2}\) is a small positive number. Here Au is the Golden Ratio, equal to \(\frac{1}{2}(\sqrt{5}-1)\approx 0.6180\) (also known as the Golden Section or Golden Proportion). For them the best folding tends to \(1/(1+1/{\textbf{Au}})=\frac{3}{2}-\frac{1}{2}\sqrt{5}\approx 0.3820\) (see Corollary 2).
Then we fold along a straight line which is orthogonal to a side (see Sect. 7). It turns out that the worst best folding \(\text{WBF}^{\perp}\) for these foldings is \(\frac{1}{3}\).
It can happen that the supremum when we keep \(L\) parallel to a given direction which is not a bisector is larger than when \(L\) passes through a vertex (see Remark 1).
Combining two of the methods, viz. folding either along lines through a vertex or orthogonally to a side (a choice depending on the triangle), we find that the universal constant WBF is at least equal to \(\sqrt{2}-1\approx 0.4142\); see (9).
5 Folding along a line parallel to a side
When folding along a line parallel to a side, it matters whether the angles adjacent to the side are at most \(\frac{1}{2}\pi\) or whether one is larger then \(\frac{1}{2}\pi\). In the first case, the best result is always \(1/3\), thus smaller than \(\sqrt{2}-1\); in the second case it is even smaller than \(1/3\). Thus folding along a line parallel to a side never yields a good result.
Proposition 2
Let \(T\) be the triangle with vertices at \(a=(0,0)\) , \(b=(1,0)\) , \(c=(c_{1},c_{2})\) , assuming \(0\leq c_{1}\leq 1\) and \(c_{2}> 0\) . We fold it along the line \(L(0,t)=\partial Y(0,t)\) , thus of equation \(x_{2}=t\) , and put the folded object in the half plane \(Y(0,t)\) , defined by \(x_{2}\leq t\) . Then the relative area of
is
Its maximum is attained for
It can be observed that by performing a shear, the general case \(0\leq c_{1}\leq 1\) can be reduced to the special case when \(c_{1}=1\), which means that the line is orthogonal to the side \([a,b]\). In turn this is equivalent to the special case when \(c_{1}=1\) in Proposition 5. Anyway, the proof is very simple even if we assume only that \(0\leq c_{1}\leq 1\), and is therefore omitted.
If one of the adjacent angles is larger than \(\frac{1}{2}\pi\), the best result is smaller than \(1/3\):
Proposition 3
Let \(T\) be the triangle with vertices at \(a=(0,0)\) , \(b=(1,0)\) , \(c=(c_{1},c_{2})\) , assuming now \(c_{1}\geq 1\) and \(c_{2}> 0\) . Then the relative area of
where the half plane \(Y(0,t)\) is defined by \(x_{2}\leq t\) , is
Its maximum is attained for
Thus, in both cases,
The proof of this proposition is also easy and is omitted here.
When \(c_{1}\nearrow+\infty\), equivalently \(\beta\nearrow\pi\), the best folding becomes very bad: the relative area of the doubly covered set tends to \(0\)
Corollary 1
If no angle in a triangle is larger than \(\frac{1}{2}\pi\) , then the best result for the area of the doubly covered object when folding along a line parallel to a side is \(\frac{1}{3}\) in all three cases.
If one angle, say \(\beta\) , is larger than \(\frac{1}{2}\pi\) , then folding along a line parallel to \([a,b]\) , \([b,c]\) , and \([c,a]\) yields, respectively, the best result
6 Folding along a line parallel to a bisector
Proposition 4
Let \(T\) be the triangle with vertices at \(a=(0,0)\) , \(b=(b_{1},b_{2})\) , \(c=(c_{1},c_{2})\) , assuming \(c_{1}<0<b_{1}\) and \(0<c_{2}\leq b_{2}\) . Assume that the line \(L(-\pi/2,t)\) , i. e., the line of equation \(x_{1}=t\) , is parallel to the bisector between the sides \([a,b]\) and \([a,c]\) . It follows that \(c_{2}/c_{1}=-b_{2}/b_{1}\) . Then the relative area of \(T_{Y(-\pi/2,t)}=T\cap T^{[\partial Y(-\pi/2,t)]}\cap Y(-\pi/2,t)\) is
where
Its maximum is attained for
Proof. It is clear that, for \(c_{1}\leq t\leq 0\), the value of the doubly covered area is proportional to the the square of \(t-c_{1}\), and similarly proportional to \((b_{1}-t)^{2}\) for \(0\leq t\leq b_{1}\). To determine \(K_{1}\) and \(K_{2}\) we only have to calculate \(\rho_{T}(-\pi/2,0)=K_{1}c_{1}^{2}=K_{2}b_{1}^{2}\). \(\square\)
We see that the side \([a,b]\) reflected in the line \(L(-\pi/2,t)\) is parallel to the side \([a,c]\), and this accounts for the discontinuity in the derivative of \(\rho_{T}(-\pi/2,t)\) for \(t=0\).
Corollary 2
If we fold a triangle with side lengths \(A\leq B\leq C\) along the bisector of its smallest angle, then the relative area of the doubly covered set is \(B/(B+C)\) . Similarly, folding along the bisector of the angle opposite to \(B\) yields \(A/(A+C)\) . If we fold along the bisector of the largest angle, the result is \(A/(A+B)\) . If the lengths of the sides of a triangle are \(A\) , \(B\) and \(C\) with \(A\leq B\leq C\) and we fold along lines through a vertex, the maximum of \({\mathbf{area}}(T_{Y})/{\mathbf{area}}(T)\) is
It follows that
and that the universal constant \(\text{WBF}^{\text{vertex}}\) is equal to
Since the constant \(\text{WBF}^{\text{vertex}}\) is \(\frac{3}{2}-\frac{1}{2}\sqrt{5}\), the constant WBF is larger than or equal to this number.
Proof. There are three bisectors, and Proposition 4 shows that the best we can do is the maximum of the three quantities is \(B/(B+C)\), \(A/(A+B)\) and \(A/(A+C)\). However, the last quantity is always majorized by each of the other two. \(\square\)
The bad triangles for this kind of folding are \(T^{\text{vertex}}_{\varepsilon}=T(a,b,c)\) in standard coordinates with \(c_{1}=\textbf{Au}=\frac{1}{2}\sqrt{5}-\frac{1}{2}\approx 0.6180\) and \(c_{2}=\varepsilon\).
We see that \(\text{BF}^{\text{vertex}}(T)=B/(B+C)\) when the middle side is large, meaning that \(B\geq\sqrt{AC}\), the geometric mean of the lengths of the two other sides. To get a good result, we then fold along the bisector of the smallest angle, \(\alpha\). When the middle side is small, meaning that \(B\leq\sqrt{AC}\), then \(\text{BF}^{\text{vertex}}(T)=A/(A+B)\). Then we fold along the bisector of the largest angle, \(\gamma\). In particular, when \(A\approx 1-\frac{1}{2}\sqrt{2}\), \(B\approx\frac{1}{2}\sqrt{2}\), and \(C=1\), then \(B> \sqrt{AC}\), so the best result is obtained by folding along the bisector at \(a\), yielding \(B/(B+C)\approx\sqrt{2}-1\).
While the infimum of all \(\text{BF}^{\text{vertex}}(T)\) is \(\frac{1}{2}(3-\sqrt{5})\), we can see easily that equality cannot occur in the inequality \(\text{BF}^{\text{vertex}}(T)> \frac{1}{2}(3-\sqrt{5})\), for if so, we must have
which implies that \(C/B\geq\frac{1}{2}(\sqrt{5}+1)\), \(B/A\geq\frac{1}{2}(\sqrt{5}+1)\), which in turn implies that \(A+B\leq C\), contradicting the strict triangle inequality \(C<A+B\).
7 Folding along a line orthogonal to a side
Proposition 5
Let \(T\) be a triangle in standard coordinates ( 8 ). We fold along a line \(L(-\pi/2,t)=L(\pi/2,-t)\) , i. e., of equation \(x_{1}=t\) , orthogonal to \([a,b]\) . Then the relative area \(\rho_{T}(-\pi/2,t)\) of the triangle \(T_{Y(\pi/2,-t)}\) is
The function \(t\mapsto\rho_{T}(-\pi/2,t)\) is a spline function of degree \(2\). Its derivative at the nodes \(t=0\), \(1/2\), \(c_{1}\), \(1\) has the values \(0\), \(1/c_{1}\), \(-2\), and \(0\), respectively. Its maximum is attained for
By choosing \(c_{1}<1\) close to \(1\) we can obtain that the best relative area is just a little bit larger than \(\frac{1}{3}\), similarly to the result when we fold along a line parallel to a side; see Proposition 2. In fact, the bad triangles for this kind of folding are \(T^{\perp}_{\varepsilon}=T(a,b,c)\) with \(c_{1}=1-\varepsilon\) and \(c_{2}=\varepsilon^{2}\). Then folding along a normal to \([a,b]\) yields \(1/(2c_{1}+1)=1/(3-2\varepsilon)\searrow\frac{1}{3}\) as \(\varepsilon\searrow 0\). Foldings along normals to \([b,c]\) or \([c,a]\) give worse results since \(\gamma> \frac{1}{2}\pi\). Therefore \(\text{WBF}^{\perp}=\frac{1}{3}\).
If we choose \(c_{1}=\frac{1}{2}\sqrt{2}\), the largest relative area is \(\sqrt{2}-1\), attained for \(t=t_{\max}=2-\sqrt{2}\). The expression for \(t\in[1/2,1/\sqrt{2}]\) is
We get the following values of \(\rho_{T}(-\pi/2,t)\) for three values of \(t\):
Proof. The formulas for \(t\leq\frac{1}{2}\) and \(c_{1}\leq t\) are easily found.
For \(\frac{1}{2}<t<c_{1}\), the two segments \([(2t,0),(t,c_{2}t/c_{1})]\) and \([(c_{1},c_{2}),(1,0)]\) intersect at a point \(s=(s_{1},s_{2})\) with
We shall calculate the area of the doubly covered figure, which is the quadrilateral \(Q(p,r,s,q)\) with vertices at \(p=(t,0)\), \(r=b=(1,0)\), \(s=(s_{1},s_{2})\) just defined, and \(q=(t,c_{2}t/c_{1})\).
By (17), twice the area of \(Q(p,r,s,q)\) is given by the formula
which divided by \(c_{2}\), twice the area of the triangle, gives the second-degree polynomial in the statement of the proposition. The derivative is zero at \(t=2c_{1}/(2c_{1}+1)\) and this yields the maximum area. \(\square\)
When \(c_{1}-1/2\) is a small positive number, the second derivative of \(\rho_{T}\) with respect to \(t\) in the interval \([1/2,c_{1}]\) is a large negative number. When \(c_{1}=1/2\), we pass directly from the expression \(2t^{2}\) for \(0\leq t\leq 1/2\) to \(2(1-t)^{2}\) for \(1/2\leq t\leq 1\); there is a jump in the first derivative.
We note that, when \(c_{1}> 1/2\), the maximum is not attained when the line passes through the vertex (\(t=c_{1}\)) but at a point \(t\) to the left of \(c_{1}\): \(1/2<t=2c_{1}/(2c_{1}+1)<c_{1}\).
If we want to find only the maximum of \(\rho_{T}(-\pi/2,t)\), we can observe that, using the notation in the proof, it is attained when \(c_{2}(s_{1}-t)/c_{1}\) is half of \(q_{2}=c_{2}t/c_{1}\).
8 Combining two methods of folding
We have seen that folding through a vertex and folding orthogonally against a side sometimes yield good results, but for different shapes. We now combine the two methods.
Theorem 2
Any triangle can be folded along a straight line such that the area of the doubly covered part is at least \(\sqrt{2}-1\approx 0.4142\) . This bound can be obtained by using one of two methods: folding along the bisector of the smallest angle and folding orthogonally to the largest side. This proves the first part of the main result. In standard coordinates, the best relative area that can be obtained by these two methods is
The values of \(\Phi\) satisfy \(\sqrt{2}-1<\Phi(c)\leq\frac{1}{2}\). The infimum of \(\Phi\) is \(\sqrt{2}-1\). It is not attained, but is the limit as \(c\) approaches the boundary point \(c=(\frac{1}{2}\sqrt{2},0)\in\partial\Gamma\).
Remark 1
We can see from the expression for \(\Phi\) that folding along a line orthogonal to the longest side gives a strictly better result than folding along a bisector if and only if \(c\in\Gamma\), \(\frac{1}{2}\leq c_{1}\leq\frac{1}{2}\sqrt{2}\), and \(2c_{1}\|c\|_{2}<1\). They give the same result on a portion of the four-degree curve of equation \(4c_{1}^{2}(c_{1}^{2}+c_{2}^{2})=1\). In polar coordinates \((c_{1},c_{2})=(r\cos\varphi,r\sin\varphi)\), the equation of the curve is \(2r^{2}\cos\varphi=1\), \(0\leq\varphi\leq\frac{1}{3}\pi\). In particular, if \(B<\frac{1}{2}\sqrt{2}\), then \(c_{1}<B<\frac{1}{2}\sqrt{2}\) and folding along a normal to the side \([a,b]\) gives \(1/(2c_{1}+1)> \sqrt{2}-1\) while folding along the bisector at \(a\) yields \(B/(B+C)<\sqrt{2}-1\).
Proof. Let the notation be chosen so that \(A\leq B\leq C\). We shall see that it is enough to fold along a line through \(a\) or orthogonally against the side opposite to \(c\).
Consider the two inequalities
If we choose coordinates as in (8), these inequalities are equivalent to
The first covers the permitted values of \(c\) except those in the set with \(c_{1}^{2}+c_{2}^{2}<\frac{1}{2}\) and \(c_{2}> 0\). These points are then taken care of by the second case.
If (21) holds, we fold along the bisector through \(a\) and obtain from Corollary 2
If (21) does not hold, then (22) and (24) hold, and we can fold along a line orthogonal to the side \([a,b]\). We obtain \(1/(2c_{1}+1)\geq\sqrt{2}-1\). \(\square\)
9 The constant found is best possible
We shall now prove the second part of Theorem 1. This will be done in three steps.
First we shall prove that if the angle \(\varphi\) of the line avoids the set \(\frac{1}{2}\pi\mathbf{Z}\), then an easy estimate shows that the doubly covered set cannot have a relative area larger than \(\sqrt{2}-1\) (Sect. 9.1). Next we shall study the behavior of the best folding when the angle is in a neighborhood of zero (Sect. 9.2); finally when the angle is in a neighborhood of \(\frac{1}{2}\pi\) (Sect. 9.3).
The triangles which will provide us with examples are those which in standard coordinates have \(c=(c_{1},c_{2})\) with \(c_{1}=\frac{1}{2}\sqrt{2}\) and \(c_{2}\) positive and very small. Equivalently, \(c_{1}=\frac{1}{2}\sqrt{2}\) and the angle \(\alpha\) is very small. The smaller \(\varepsilon\) in the statement of Theorem 1 is, the smaller we have to take \(\alpha\). This has the advantage that we can approximate to simplify many expressions, for example all of the quantities \(c_{2}/c_{1}=\tan\alpha\), \(\sin\alpha\), \(\frac{1}{2}\sin 2\alpha\) can be approximated by \(\alpha+O(\alpha^{3})\) as \(\alpha\searrow 0\), which means that the error is not larger than a constant times \(\alpha^{3}\) when \(\alpha\) is positive and sufficently small; in fact for all \(\alpha\) satisfying \(0\leq\alpha\leq\pi/4\). Similarly, \(\cos\alpha=1+O(\alpha^{2})\) as \(\alpha\searrow 0\). In the sequel we shall use these approximations many times, however without repeating the expression “as \(\alpha\searrow 0\).”
When \(c_{1}=\frac{1}{2}\sqrt{2}\), we have
implying that these two special methods of folding give approximately the same result when \(\alpha\) is small. However, this is not sufficient: we now have to consider all kinds of folding.
9.1 Folding along a line with angle avoiding \(\frac{1}{2}\pi\mathbf{Z}\)
Proposition 7
Let \(T\) be a triangle in standard coordinates ( 8 ). With a half plane \(Y(\varphi,t)\) and its boundary \(L(\varphi,t)\) as defined in ( 3 ) and ( 4 ), assume that \(\varphi\notin\frac{1}{2}\pi\mathbf{Z}\).
Then the relative area of the doubly covered set when folding along \(L(\varphi,t)\) is less than \(c_{2}/|\sin 2\varphi|\) .
In the inequality for the relative area, we can either fix an angle \(\varphi\notin\frac{1}{2}\pi\mathbf{Z}\) and choose a small \(c_{2}\) to make the inequality \({\mathbf{area}}(T_{Y})/{\mathbf{area}}(T)<c_{2}/|\sin 2\varphi|\leq\sqrt{2}-1\) valid, or fix \(c_{2}\) and consider the set of angles \(\varphi\) which satisfy the inequality.
This means that when \(|\sin 2\varphi|\geq(\sqrt{2}+1)c_{2}\), the relative area is smaller than \(\sqrt{2}-1\), implying that only a certain neighborhood of \(\frac{1}{2}\pi\mathbf{Z}\) remains to be studied. This neighborhood is
where \(\eta\), which depends on \(c_{2}\), is given by the conditions \(0<\eta<\frac{1}{4}\pi\) and \(\sin 2\eta=c_{2}(\sqrt{2}+1)\). When \(\alpha\) is small, the angle \(\eta\) is approximately equal to \((\frac{1}{2}+\frac{1}{4}\sqrt{2})\alpha\); actually \(\eta/\alpha\) tends to \(\frac{1}{2}+\frac{1}{4}\sqrt{2}\approx 0.8535\) as \(\alpha\) tends to zero.
Proof. We denote by \(S\) the strip \(S=\{x\in\mathbf{R}^{2};\ 0\leq x_{2}\leq c_{2}\}\), bounded by \(L(0,0)\) and \(L(0,c_{2})\). Since \(T\subset S\), even strictly, the area of \(T_{Y(\varphi,t)}\) is less than that of \(S_{Y(\varphi,t)}\) (the area of \(S^{Y(\varphi,t)}\) is infinite). Let now \(\varphi\notin\frac{1}{2}\pi\mathbf{Z}\). It is enough to consider the case \(0<\varphi\leq\frac{1}{4}\pi\); the other cases, like \(-\frac{1}{2}\pi<\varphi\leq-\frac{1}{4}\pi\), \(-\frac{1}{4}\pi\leq\varphi<0\), and \(\frac{1}{4}\pi\leq\varphi<\frac{1}{2}\pi\) being similar. Then \(S_{Y(\varphi,t)}\) is a triangle with vertices \(p\), \(q\) and \(r\), given by
We find that \(p_{1}<q_{1}<r_{1}\) and that
where \(s=(s_{1},c_{2})\) is the reflection of \(q\) in \(L(\varphi,t)\). The area of \(T_{Y(\varphi,t)}\) is therefore less than \(\frac{1}{2}c_{2}(r_{1}-s_{1})=\frac{1}{2}c_{2}^{2}/\sin 2\varphi\) . The area of \(T\) is \(\frac{1}{2}c_{2}\), so that the relative area is less than \(c_{2}/\sin 2\varphi\) , a finite positive number. \(\square\)
9.2 Folding with an angle near zero
Consider a triangle \(T(a,b,c)\) with standard coordinates (8) and intersect it with a line \(L(\varphi,t)\). In view of Proposition 7 it is enough to study the lines \(L(\varphi,t)\) with \(\varphi\) either close to \(0\) or close to \(\frac{1}{2}\pi\), more precisely, when \(\varphi\) belongs to the intervals \([-\eta,\eta]\) and \([\pi/2-\eta,\pi/2+\eta]\), where \(\eta\) is defined after formula (25). To do so, it is enough to use the approximations that follow.
For \(\varphi\) near \(0\) and in the neighborhood \(W(c_{2})\) defined by (25), we shall study angles \(\varphi\in[-\eta,\eta]\). When \(-\alpha\leq\varphi\leq 0\), we can use the results on folding along a line parallel to the bisector of the angle at \(b\). Then the best result is
see Corollary 2. So this is much worse than the results for positive \(\varphi\) to be discussed now. We study separately the cases \(0\leq\varphi\leq\frac{1}{2}\alpha\) and \(\frac{1}{2}\alpha\leq\varphi\leq\alpha\).
Proposition 8
Let the angle \(\varphi\) satisfy \(0\leq\varphi\leq\alpha\) , and let \(\varepsilon\) be any positive number. Then, for \(\alpha\) small enough, the relative area of the doubly covered set \(T_{Y}\) is never larger than \(\sqrt{2}-1+\varepsilon\) .
Proof. We reflect the line \(L(\alpha,0)\), which contains the side \([a,c]\), in the line \(L(\varphi,t)\) to obtain the line \(L(2\varphi-\alpha,2t\cos(\varphi-\alpha))\). We reflect the line \(L(-\beta,\sin\beta)\supset[c,b]\) in the line \(L(\varphi,t)\) and obtain the line \(L(2\varphi+\beta,2t\cos(\varphi+\beta)-\sin\beta)\).
We then define four points \(p,q,r,s\) by intersecting these lines:
We have \(c_{2}=c_{1}\tan\alpha=(1-c_{1})\tan\beta\), so that
The triangles that are of interest for us now are the ones with \(c_{1}=\frac{1}{2}\sqrt{2}\) and \(c_{2}\) positive and very small. For these triangles we have good approximations,
For the line \(L(\varphi,t)\) to actually intersect the sides \([a,c]\) and \([c,b]\), we must, for \(0\leq\varphi\leq\alpha\), have \(t\) in the interval
so also \(t\) is of the order \(\alpha\) when we study lines with angle \(\varphi\) close to zero that actually cut the triangle.
The relative area, which is a polynomial in \(t\in\mathbf{R}\) with coefficients that are rational functions of sines and cosines of \(\alpha\), \(\beta\) and \(\varphi\), can therefore be approximated by a polynomial in \(t\) with coefficients that are rational functions of \(\alpha\) and \(\varphi\).
We obtain the following values for the coordinates of the points introduced above.
Twice the area of \(Q(p,r,s,q)\) is
This is a polynomial in \(t\) of order \(2\), the coefficients of which are rational functions in the trigonometric functions of the angles indicated. Our task is to prove that this expression cannot exceed \(2(\sqrt{2}-1){\mathbf{area}}(T(a,b,c))=(\sqrt{2}-1)c_{2}\).
In the calculation to follow it is convenient to use the constant \(K\),
and the variables
The points expressed in these constants and variables are:
Twice the area of the quadrilateral \(Q(p,r,s,q)\) is
while twice the area of the whole triangle is
so that the relative area is
where \(D\) and the \(M_{j}\) are given below. The zeros of \(D\) are \(1\), \(\frac{1}{2}\), \(v_{1}\), \(v_{2}\), and \(M_{2}\) is zero at \(v_{3}\), where the \(v_{j}\) are
We have
So \(D\) a polynomial in \(\lambda\) of degree \(4\), whereas the \(M_{j}\), \(j=0,1,2\), are polynomials in \(\lambda\) of degree \(3-j\). The signs are chosen so that \(D\) and the \(M_{j}\) are positive for \(0<\lambda<\frac{1}{2}\).
Taking the supremum of the relative area over all \(\tau\), we obtain
Here the numerator is a polynomial in \(\lambda\) of degree \(4\) and the denominator a polynomial of degree \(5\).
However, the numerator here is divisible by \(D\), so that the expression is equal to a constant divided by \(M_{2}\), more precisely
We write
where
clearly an increasing function on \([0,\frac{1}{2}]\), which implies that on this interval, the supremum of \(S\) is equal to the value \(S(\frac{1}{2})=\sqrt{2}-1\).
Since \(\sigma_{T}(\lambda\alpha)\) and \(S(\lambda)\) differ at most by a constant times \(\alpha\), this result implies that, for any given positive number \(\varepsilon\), the largest relative area of \(T_{Y}\) is smaller than \(\sqrt{2}-1+\varepsilon\) when \(\alpha\) is sufficiently small. \(\square\)
A similar study can be made for \(\frac{1}{2}\alpha\leq\varphi\leq\alpha\).
9.3 Folding with an angle near \(\pi/2\)
We shall now study an angle \(\varphi\in W(c_{2})\) close to \(\frac{1}{2}\pi\).
Proposition 9
Let us, in a triangle with standard coordinates (Sect. 4.2), fold along a line \(L(\varphi,-t)\) with angle \(\varphi\) satisfying \(|\varphi-\frac{1}{2}\pi|\leq\alpha\) , and let \(\varepsilon\) be any positive number. Then, for \(\alpha\) small enough, the relative area of the doubly covered set \(T_{Y}\) is never larger than \(\sqrt{2}-1+\varepsilon\) .
Proof. It turns out that we can make a direct comparison of the folding along a line \(L(\varphi,-t)\) with the folding along the line \(L(\frac{1}{2}\pi,-q_{1})=L(-\frac{1}{2}\pi,q_{1})\), where \(q=(q_{1},q_{2})\) is the point where \(L(\varphi,t)\) intersects \([a,c]\). We already know that the relative area \(\rho_{T}(-\frac{1}{2}\pi,q_{1})\) is at most equal to \(\rho_{T}(-\frac{1}{2}\pi,2-\sqrt{2})=\sqrt{2}-1\). The only values of \(q_{1}\) that are of interest are those that satisfy \(\frac{1}{2}\leq q_{1}\leq\frac{1}{2}\sqrt{2}\).
We introduce \(\psi=\varphi-\frac{1}{2}\pi\), a small angle to which \(q_{1}\) and \(t\) are related by the equation
We reflect the line \(L(0,0)\), which contains the side \([a,b]\), in the line \(L(\varphi,-t)\) to obtain the line \(L(2\varphi,-2t\cos\varphi)=L(2\psi,-2t\sin\psi)\).
We also reflect the line \(L(\alpha,0)\supset[c,a]\) in the line \(L(\varphi,-t)\) and obtain the line
We then define four points \(p,q,r,s\) by intersecting these lines:
If \(\psi<0\), then \(r_{1}<0\) and \(r\) is replaced by \(b\); see Figs. 3 and 4.
The coordinates for these points are
With \(K=1+\sqrt{2}\) and \(\lambda=\psi/\alpha\) the coordinates can be approximated as follows.
9.3.1 Angles less than \(\pi/2\)
For angles \(\theta\) satisfying \(0<\theta\leq\pi/4\) we have
We shall apply these inequalities in some simple estimates. In particular we note that, with \(c_{1}=\frac{1}{2}\sqrt{2}\), we must have \(0<\alpha\leq\frac{1}{4}\pi\).
We first consider angles \(\varphi<\frac{1}{2}\pi\) and compare the two quadrilaterals \(Q(p,b,s,q)\) and \(Q(p^{\perp},b,s^{\perp},q)\). Here \(p^{\perp}=(q_{1},0)\), and \(s^{\perp}\) is defined as the intersection of the side \([b,c]\) with the line obtained by reflecting the line \(L(0,0)\) in the line \(L(\frac{1}{2}\pi,-q_{1})\).
The differences are (see Fig. 3):
and
Since the interior of a triangle has the same area as the triangle itself, we see that
Here the second term is small: it is equal to
the angle at the vertex \(q\) being \(|\psi|\). Here
This implies that the area is at most
The area of the whole triangle satisfies
implying that the relative area can be estimated from above as
The other triangle, \(T(q,s,s^{\perp})\), has the area
since
This implies that the relative area is at least
We shall simplify equation (28) in two ways, depending on whether \(q_{1}\) is closer to \(2-\sqrt{2}\) or closer to \(\frac{1}{2}\sqrt{2}\).
If \(q_{1}\geq 1-\frac{1}{4}\sqrt{2}\) (the midpoint between of \(2-\sqrt{2}\) and \(\frac{1}{2}\sqrt{2}\)), then \(q_{1}-(2-\sqrt{2})\geq\frac{3}{4}\sqrt{2}-1\), and the relative area of \(Q(p,b,s,q)\) is, in view of (20), at most
if
If on the other hand, \(q_{1}\leq 1-\frac{1}{4}\sqrt{2}\), then \(\frac{1}{2}\sqrt{2}-q_{1}\geq\frac{3}{4}\sqrt{2}-1\), and the relative area of \(Q(p,b,s,q)\) is, in view of (29) and (30), at most
if
We see that, in both cases, the relative area of \(Q(p,b,s,q)\) is at most \(\sqrt{2}-1\) for small values of \(\alpha\).
9.3.2 Angles larger than \(\pi/2\)
When \(\psi=\varphi-\frac{1}{2}\pi> 0\), we compare two quadrilaterals, the given one \(Q(p,r,s,q)\) and the one obtained by decreasing the angle to \(\frac{1}{2}\pi\), \(Q(p^{\perp},b,s^{\perp},q)\), where as before \(p^{\perp}=(q_{1},0)\), and \(s^{\perp}\) is the intersection between the side \([b,c]\) and the line obtained as the reflection of the line \(L(0,0)\) in the line \(L(\frac{1}{2}\pi,-q_{1})\).
The differences are
and
see Fig. 4. We see that
We compare the second and the third terms. The second term is
the angle at \(q\) being \(2\psi\), and the third term is
the angle at \(p\) also being \(2\psi\). We define
Then the area of the quadrilateral \(Q(p,r,s,q)\) can be estimated from above as
We note that \(s^{\perp}_{1}\) is an affine function of \(q_{1}\) which takes the value \(1\) when \(q_{1}=\frac{1}{2}\) and the value \(\frac{1}{2}\sqrt{2}\) when \(q_{1}=\frac{1}{2}\sqrt{2}\). It follows that
We compare the four quantities constituting \(V\) with their limits as \(\alpha\) tends to zero and indicate the deviation, which is at most a constant times some power of \(\alpha\).
The limit of \(V\) as \(\alpha\) tends to zero is
since also \(\psi\) must tend to zero with \(\alpha\), and in the limit \(p_{1}\) and \(q_{1}\) tend to \(t\). The quantity \(V(0,0,t)\) is nonnegative since we only consider \(q_{1}\) in the interval \(\frac{1}{2}\leq q_{1}\leq\frac{1}{2}\sqrt{2}\). With the estimates of the errors we have
In view of (31) we can conclude that the area of \(Q(p,r,s,q)\) is at most
implying that the relative area is at most \(\sqrt{2}-1+O(\alpha)\leq\sqrt{2}-1+\varepsilon\) if \(\alpha\) is small enough. \(\square\)
References
Flachsmeyer, J.: Origami und Mathematik. Papier falten – Formen gestalten. Berliner Studienreihe zur Mathematik, vol. 20. Heldermann, Lemgo (2008). ISBN 978-3885381204. ii + 241 pp
Flachsmeyer, J.: Zu „Adolf Hurwitz faltet“. Math Semesterber 63(2), 195–200 (2016)
Haga, K.: Origamics. Mathematical explorations through paper folding. World Scientific Publishing, Hackensack (2008). xviii + 134 pp, edited and translated by Fonacier, J. C. and Isoda, M.
Jäger, J., Kroll, W., Schupp, H.: Blattfaltungen. Math Semesterber 61(1), 93–114 (2014)
Oswald, N.M.R.: Adolf Hurwitz faltet Papier. Math Semesterber 62(2), 123–130 (2015)
Open access funding provided by Uppsala University.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Kiselman, C.O. How to best fold a triangle. Math Semesterber 65, 65–89 (2018). https://doi.org/10.1007/s00591-017-0192-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00591-017-0192-5