1 Introduction

Generally, it is preferable that the loads are mainly carried by membrane and axial forces instead of bending moments. A measure to which extent this goal is reached is the percentage of the ”non-membrane” energy of the total strain energy. It is defined as \({({U-U_M})}/{U}\), where \(U_M\) denotes the membrane (stretching) energy and U stands for the total strain energy. The lower bound of this ratio is zero, and it refers to pure stretching. The upper bound is one, and it refers to pure bending.

The basic idea of this work is to find a geometric quantity such that its lower and upper bound agrees with the bounds of \({({U-U_M})}/{U}\). This is done with the help of a novel linear eigenvalue problem in the framework of the Finite Element Method (FEM). Such a quantity is the radius of the first Frenet-curvature of a curve on the surface of an N-dimensional unit hypersphere, denoted as \(\rho \). The reason why \(\rho \) is associated with such a hypersphere is that the vertex of the unit vector, describing this curve, is an eigenvector of a novel linear eigenvalue problem referring to a structure with N degrees of freedom, discretized by the FEM. One of the two coefficient matrices of this eigenvalue problem is the tangent stiffness matrix. The other one needs to be determined such that the two aforementioned limiting values of \(\rho \) can be realized for pure stretching and pure bending, respectively.

The significance of these two limiting cases of \({({U-U_M})}/{U}\) stems from a priori knowledge of the values of this ratio. Hence, there is no need to verify the hypothetically assumed correlation between \({({U-U_M})}/{U}\) and \(\rho \) by computing U and \(U_M\) separately by the FEM. If, however, an expanded form of the geometric-mechanical synergism described in this work could be shown to exist, which so far has not been the case, direct computation of \({({U-U_M})}/{U}\) would be advantageous. It is mentioned, in passing, that the deformations do not directly enter into the calculation of \(\rho \).

The first forerunner of the present work was a paper by Mang et al. [8]. The linear eigenvalue problem used for the determination of \(\rho \) was the so-called ”Consistently Linearized Eigenvalue Problem” (CLE). It was originally proposed by Helnwein [3]. The two coefficient matrices of this eigenvalue problem are the tangent stiffness matrix and its derivative with respect to a dimensionless load parameter in the framework of proportional loading. A disadvantage of this model was the explicit dependence of \(\rho \) on the relevant eigenvalue, defined as the zero eigenvalue at the stability limit. This resulted in the vanishing of \(\rho \) at the stability limit. Thus, the hypothesis \(\rho ={({U-U_M})}/{U}\) [8] incorrectly signaled pure stretching at this point instead of pure bending. Another weak spot of the CLE was the ab initio indefiniteness of the derivative of the tangent stiffness matrix, resulting in positive as well as negative eigenvalues in the prebuckling domain and in conjugate complex eigenvalues when the tangent stiffness matrix became indefinite after the stability limit [16], notwithstanding the insignificance of the primary load-displacement path after this has happened. A numerical challenge of the CLE was a sufficiently accurate finite-difference approximation of the derivative of the tangent stiffness matrix [4, 9].

Another forerunner of the present work was a paper by Mang [6], characterized by the replacement of \(\rho \) by \(k\,a\) , where a denotes the acceleration of a fictitious particle moving along a curve on the surface of an N-dimensional unit hypersphere, obtained by the CLE, and where k stands for a proportionality factor. Although good numerical results were obtained for the two aforementioned limiting cases, in retrospect, lack of the invariance of \(k\,a\) with respect to the chosen parameter is viewed as a shortcoming of that approach.

The present work is organized as follows: In Section 2, the FEM-based linear eigenvalue problem is introduced. One of its two coefficient matrices is the tangent stiffness matrix. The other one is yet to be determined. Herein, it is considered to be a constant, symmetric, positive definite matrix. Section 3 is devoted to the numerical implementation of the theoretical concept. This includes the numerical solution of the underlying eigenvalue problem and the numerical determination of \(\rho \) with the help of finite-difference approximations of the first and the second derivative of the relevant eigenvector with respect to the parameter, to be defined in Section 2. Section 4 deals with the numerical verification of the asserted geometric-mechanical synergism for the limiting cases of pure stretching and pure bending. In Section 5, current attempts to numerically prove the existence of a wider range of validity of the geometric-mechanical synergism, delineated in this work, are described.

2 Linear eigenvalue problem for the determination of \(\rho \)

The mathematical formulation of the chosen eigenvalue problem reads as

$$\begin{aligned} \left[ {\mathbf {K}}_T(\xi (\lambda ))-\chi (\xi (\lambda ))\,\mathbf {B}\right] \cdot \mathbf {r}(\xi (\lambda ))=\mathbf {0},\qquad \xi =\int _{\mathbf {q}=\mathbf {q}(0)}^{\mathbf {q}=\mathbf {q}(\lambda )}\left\| \mathrm d\mathbf {q}\right\| \,, \end{aligned}$$
(1)

where \({\mathbf {K}}_T\) is the tangent stiffness matrix in the framework of the FEM and \(\mathbf {B}\) is a constant, symmetric, positive definite matrix. \(\mathbf {B}\) must enable determination of the first eigenpair \((\chi _1,\mathbf {r}_1)\), with \(\chi _1\) denoting the smallest eigenvalue and \(\mathbf {r}_1\) standing for the corresponding eigenvector, normalized to 1, such that

$$\begin{aligned} \mathbf {r}_1(\xi (\lambda ))=\mathbf{const}. \end{aligned}$$
(2)

The parameter \(\xi \) represents an arc length in the context of the FEM. It depends on the vector of differential node displacements, following from the equilibrium relation

$$\begin{aligned} {\mathbf {K}}_T\cdot \mathrm d\mathbf {q}=\mathrm d\lambda \,\widetilde{\mathbf {P}}\,, \end{aligned}$$
(3)

with \(\widetilde{\mathbf {P}}\) standing for the vector of work-equivalent node forces. If snap-through can be ruled out, \(\xi \) is replaced by the dimensionless load parameter \(\lambda \).

Once \(\mathbf {r}_1\) is known, \(\rho _1\) can be computed from [2, 5, 14]

$$\begin{aligned} \rho _1=\frac{\left\| \dot{\mathbf {r}}_1\right\| ^3}{\sqrt{\dot{\mathbf {r}}_1^2\,{\ddot{\mathbf {r}}_1}^2-\left( \dot{\mathbf {r}}_1\cdot \ddot{\mathbf {r}}_1\right) ^2}}\,. \qquad \dot{~}:=\frac{\mathrm d}{\mathrm d\xi }\,. \end{aligned}$$
(4)

In order to check whether a specific matrix \(\mathbf {B}\) enables verification of the hypothesized correlation of \(U=U_M\) with \(\mathbf {r}_1(\xi (\lambda ))=\mathbf{const}. \), it must be shown that this matrix allows for

$$\begin{aligned} (\dot{\mathbf {r}}_1)_0=\mathbf {0},\qquad (\ddot{\mathbf {r}}_1)_0=\mathbf {0},\qquad \ldots \,, \end{aligned}$$
(5)

where the subscript 0 indicates the onset of loading, i. e. \(\lambda =0\). Since the eigenvectors represent a complete basis, \(\dot{\mathbf {r}}_1\) can be expressed in terms of \(\mathbf {r}_j\), with \(j=2,3,\ldots ,N\):

$$\begin{aligned} \dot{\mathbf {r}}_1=\sum _{j=2}^{N}\,c_{1j}\,\mathbf {r}_j\,, \end{aligned}$$
(6)

where

$$\begin{aligned} c_{1j}=-\frac{\mathbf {r}_j\cdot \dot{\mathbf {K}}_T\cdot \mathbf {r}_1}{(\chi _j-\chi _1)\,\mathbf {r}_j\cdot \mathbf {B}\cdot \mathbf {r}_j}\,. \end{aligned}$$
(7)

Appendix 1 contains the derivation of (7). It would be unfeasible to use (6) for the numerical computation of \(\dot{\mathbf {r}}_1\). Instead of doing this, \(\dot{\mathbf {r}}_1\) is approximated by a central finite-difference expression. In the following, it will be shown that setting \(\mathbf {B}\) as

$$\begin{aligned} \mathbf {B}=({\mathbf {K}}_T)_0\,, \end{aligned}$$
(8)

where \(({\mathbf {K}}_T)_0\equiv {\mathbf {K}}_T\)(\(\xi \)(\(\lambda =0\))) enables \((\dot{\mathbf {r}}_1)_0=\mathbf {0}\). Substitution of (8) into (1) gives

$$\begin{aligned} \left[ {\mathbf {K}}_T-\chi _i \,({\mathbf {K}}_T)_0\right] \cdot \mathbf {r}_i=\mathbf {0}\,. \end{aligned}$$
(9)

Specialization of (9) for \(\lambda =0\) yields

$$\begin{aligned} \left[ ({\mathbf {K}}_T)_0-\left( \chi _{i}\right) _0\,({\mathbf {K}}_T)_0\right] \cdot \left( \mathbf {r}_{i}\right) _0=\left( 1-\left( \chi _{i}\right) _0\right) ({\mathbf {K}}_T)_0\cdot \left( \mathbf {r}_{i}\right) _0=\mathbf {0}\qquad \forall i\in \{1,2,\ldots ,N\}\,. \end{aligned}$$
(10)

Since \(({\mathbf {K}}_T)_0\) is a positive definite matrix,

$$\begin{aligned} ({\mathbf {K}}_T)_0\cdot \left( \mathbf {r}_{i}\right) _0\ne \mathbf {0}\,. \end{aligned}$$
(11)

Consequently,

$$\begin{aligned} \left( \chi _{i}\right) _0=1\,. \end{aligned}$$
(12)

Thus, the initial eigenvalues are an N-fold eigenvalue, equal to 1. Specialization of (7) for \(\lambda =0\) gives

$$\begin{aligned} \left( c_{1j}\right) _0=-\frac{\left( \mathbf {r}_j\right) _0\cdot \left( \dot{\mathbf {K}}_T\right) _0\cdot \left( \mathbf {r}_1\right) _0}{\left( \chi _j-\chi _1\right) _0\,\left( \mathbf {r}_j\right) _0\cdot ({\mathbf {K}}_T)_0\cdot \left( \mathbf {r}_j\right) _0}\,. \end{aligned}$$
(13)

Differentiation of

$$\begin{aligned} \left[ {\mathbf {K}}_T-{\chi _1}\,({\mathbf {K}}_T)_0\right] \cdot \mathbf {r}_1=\mathbf {0}\, \end{aligned}$$
(14)

with respect to the chosen parameter yields

$$\begin{aligned} \left[ \dot{\mathbf {K}}_T-{{\dot{\chi }}_1}\,({\mathbf {K}}_T)_0\right] \cdot \mathbf {r}_1 +\left[ {{\mathbf {K}}_T}-{{\chi }_1}\,({\mathbf {K}}_T)_0\right] \cdot \dot{\mathbf {r}}_1 =\mathbf {0}\,. \end{aligned}$$
(15)

Specialization of (15) for \(\lambda =0\) and consideration of

$$\begin{aligned} \left[ \left( {{\mathbf {K}}_T}\right) _0-\left( {{\chi _1}}\right) _0\,({\mathbf {K}}_T)_0\right] =\left( 1-\left( \chi _1\right) _0\right) \,\left( {{\mathbf {K}}_T}\right) _0=\mathbf {0} \end{aligned}$$
(16)

results in

$$\begin{aligned} \left[ \left( \dot{\mathbf {K}}_T\right) _0-\left( {\dot{\chi }}_1\right) _0\,({\mathbf {K}}_T)_0\right] \,\left( {\mathbf {r}}_1\right) _0=\mathbf {0}\,. \end{aligned}$$
(17)

Substitution of the orthogonality relations

$$\begin{aligned} \left( {\mathbf {r}}_j\right) _0\cdot \left( \dot{\mathbf {K}}_T\right) _0\cdot \left( {\mathbf {r}}_1\right) _0=0\,, \end{aligned}$$
(18)

which follow from (17), into (13), and consideration of

$$\begin{aligned} \left( \chi _j-\chi _1\right) _0=0\, \end{aligned}$$
(19)

give

$$\begin{aligned} \left( c_{1j}\right) _0&=-\frac{\left( \mathbf {r}_j\right) _0\cdot \left( \dot{\mathbf {K}}_T\right) _0\cdot \left( \mathbf {r}_1\right) _0}{\left( \chi _j-\chi _1\right) _0\, \left( \mathbf {r}_j\right) _0\cdot ({\mathbf {K}}_T)_0\cdot \left( \mathbf {r}_j\right) _0} ={\frac{``0"}{``0"}} \nonumber \\&= -\frac{ \left( \dot{\mathbf {r}}_j\right) _0\cdot \left( \dot{\mathbf {K}}_T\right) _0\cdot \left( \mathbf {r}_1\right) _0 +\left( \mathbf {r}_j\right) _0\cdot \left( \ddot{\mathbf {K}}_T\right) _0\cdot \left( \mathbf {r}_1\right) _0 +\left( \mathbf {r}_j\right) _0\cdot \left( \dot{\mathbf {K}}_T\right) _0 \cdot \left( \dot{\mathbf {r}}_1\right) _0 }{\left( \dot{\chi _j}-\dot{\chi _1}\right) _0\,\left( \mathbf {r}_j\right) _0\cdot ({\mathbf {K}}_T)_0\cdot \left( \mathbf {r}_j\right) _0}\,, \end{aligned}$$
(20)

where L’Hôpital’s rule has been used. In general, \(\left( c_{1j}\right) _0\ne 0 \Leftrightarrow \left( \dot{\mathbf {r}}_1\right) _0\ne \mathbf{0 }\). As follows from (20), for the special case of \(\left( c_{1j}\right) _0=0 \Leftrightarrow \left( \dot{\mathbf {r}}_1\right) _0=\mathbf{0 }\), the following relation is fulfilled:

$$\begin{aligned} \left( \dot{\mathbf {r}}_j\right) _0\cdot \left( \dot{\mathbf {K}}_T\right) _0\cdot \left( \mathbf {r}_1\right) _0 +\left( \mathbf {r}_j\right) _0\cdot \left( \ddot{\mathbf {K}}_T\right) _0\cdot \left( \mathbf {r}_1\right) _0=0\,. \end{aligned}$$
(21)

Differentiation of (15) with respect to the chosen parameter and specialization of the obtained equation for \(\lambda =0\) and \((\dot{\mathbf {r}}_1)_0=\mathbf {0}\), gives

$$\begin{aligned} \left[ (\ddot{\mathbf {K}}_T)_0-(\ddot{\chi }_1)_0\,(\mathbf {K}_T)_0\right] \cdot (\mathbf {r}_1)_0=\mathbf {0}\,, \end{aligned}$$
(22)

resulting in the orthogonality relations

$$\begin{aligned} (\mathbf {r}_j)_0\cdot (\ddot{\mathbf {K}}_T)_0\cdot (\mathbf {r}_1)_0=0 \end{aligned}$$
(23)

in addition to the orthogonality conditions

$$\begin{aligned} (\mathbf {r}_j)_0\cdot (\mathbf {K}_T)_0\cdot (\mathbf {r}_1)_0=0\,, \end{aligned}$$
(24)

following from (1), and to the orthogonality conditions (18).

Computation of the second derivative of (15) with respect to the chosen parameter and specialization of the result for \(\lambda =0\), \((\dot{\mathbf {r}}_1)_0=\mathbf {0}\) and \((\ddot{\mathbf {r}}_1)_0=\mathbf {0}\) yields

$$\begin{aligned} \left[ (\dddot{\mathbf {K}}_T)_0-(\dddot{\chi }_1)_0\,(\mathbf {K})_0\right] \cdot (\mathbf {r}_1)_0=\mathbf {0}\,, \end{aligned}$$
(25)

resulting in the orthogonality relations

$$\begin{aligned} (\mathbf {r}_j)_0\cdot (\dddot{\mathbf {K}}_T)_0\cdot (\mathbf {r}_1)_0=0 \end{aligned}$$
(26)

in addition to the three aforementioned orthogonality relations. Hence, \((\dot{\mathbf {r}}_1)_0=\mathbf {0}\), \((\ddot{\mathbf {r}}_1)_0=\mathbf {0}\), ...entails

$$\begin{aligned} (\mathbf {r}_j)_0\cdot (\dot{\mathbf {K}}_T)_0\cdot (\mathbf {r}_1)_0=0\,,\quad (\mathbf {r}_j)_0\cdot (\ddot{\mathbf {K}}_T)_0\cdot (\mathbf {r}_1)_0=0\,,\quad (\mathbf {r}_j)_0\cdot (\dddot{\mathbf {K}}_T)_0\cdot (\mathbf {r}_1)_0=0\,,\quad \ldots \ . \end{aligned}$$
(27)

If \(\mathbf {B}\) had alternatively been chosen e. g. as the unit matrix \(\mathbf {I}\), (17) would have had to be replaced by

$$\begin{aligned} \left[ (\dot{\mathbf {K}}_T)_0-({\dot{\chi }}^{*}_1)_0\,\mathbf {I}\right] \cdot (\mathbf {r}^{*})_0+\left[ ({\mathbf {K}}_T)_0-({\chi }^{*}_1)\,\mathbf {I}\right] \cdot (\dot{\mathbf {r}}^{*})_0=\mathbf {0}\,, \end{aligned}$$
(28)

where the symbols marked with an asterisk have replaced the corresponding symbols without an asterisk, reserved for \(\mathbf {B}=(\mathbf {K}_T)_0\). In this case, \((\dot{\mathbf {r}}_1^*)_0=\mathbf {0}\), \((\ddot{\mathbf {r}}_1^*)_0=\mathbf {0}\), ... just entails

$$\begin{aligned} (\mathbf {r}_j^*)_0\cdot (\dot{\mathbf {K}}_T)_0\cdot (\mathbf {r}_1^*)_0=0\,,\quad {(\mathbf {r}_j^*)_0\cdot (\ddot{\mathbf {K}}_T)_0\cdot (\mathbf {r}_1^*)_0=0\,,\quad \ldots }\ . \end{aligned}$$
(29)

In Section 4, it will be shown numerically that \(\rho _1=0\) correlates with pure stretching and that \(\rho _1=1\) correlates with pure bending.

3 Convergence studies concerning spatial and ”temporal” discretizations

The linear eigenvalue problem

$$\begin{aligned} \left[ {\mathbf {K}}_T-\chi _i \,\mathbf {B}\right] \cdot \mathbf {r}_i=\mathbf {0}\,, \end{aligned}$$
(30)

with

$$\begin{aligned} \mathbf {B}=({\mathbf {K}}_T)_0\,, \end{aligned}$$
(31)

is solved with the help of six different finite beam elements. Information about these elements is given in Appendix 2. To underline the significance of the coefficient matrix \(({\mathbf {K}}_T)_0\) in (30), the analysis results are compared with the ones obtained with

$$\begin{aligned} \mathbf {B}=\mathbf {I}\,. \end{aligned}$$
(32)

The numerical analysis involves spatial discretizations in the framework of the FEM and ”temporal” discretizations resulting from finite-difference approximations of \(\dot{\mathbf {r}}_1\) and \(\ddot{\mathbf {r}}_1\) appearing in the expression for \(\rho _1\), see (5). In order to separate convergence studies concerning these discretizations from the numerical verification of the asserted geometric mechanical synergism, they have been moved forward to this Section. Table 1 refers to a convergence study concerning the dependence of \(\rho _1\) on the number of finite elements used for numerical analysis of a thrust-line arch and a beam subjected to pure bending, treated in Section 4.

Table 1 Median value of \(\rho _1\) for different numbers of finite elements, for \(\mathbf {B}=(\mathbf {K}_T)_0\)

The results of the thrust-line arch were obtained by the Abaqus finite element B32. The deviation of \(\rho _1\) from the hypothesized value 0 is less than 0.005709 in analyses with more than 20 elements. The beam subjected to pure bending was analyzed by the Abaqus finite elements B32OS and B32OSH. The results obtained by the former did converge, however, not to the hypothesized value 1. The results obtained by the latter are close to 1 up to 50 elements, with the absolute difference smaller than 0.0034. One reason why fewer degrees of freedom result in a value closer to 1 might be that the projection of a curve onto a unit-sphere in an infinite-dimensional frame onto a finite-dimensional frame leads to an increase of \(\rho _1\). In the extreme case of only two degrees of freedom, \(\rho _1\) becomes equal to the value of 1.

As follows from (4), computation of \(\rho _1\) requires knowledge of \(\dot{\mathbf {r}}_1\) and \(\ddot{\mathbf {r}}_1\). These vectors are approximated by the following central finite-difference expressions:

$$\begin{aligned} \dot{\mathbf {r}}_1(\xi )\approx&\ \frac{\mathbf {r}_1(\xi +\Delta \xi )-\mathbf {r}_1(\xi -\Delta \xi )}{2\,\Delta \xi }\,, \end{aligned}$$
(33)
$$\begin{aligned} \ddot{\mathbf {r}}_1(\xi )\approx&\ \frac{\mathbf {r}_1(\xi +\Delta \xi )-2\,\mathbf {r}_1(\xi )+\mathbf {r}_1(\xi -\Delta \xi )}{\Delta \xi ^2}\,. \end{aligned}$$
(34)

Tables 2 and 3 refer to a convergence study concerning the dependence of the first Frenet-radius \(\rho _1\) and the second Frenet-curvature \(\kappa _2\), respectively, on the size of the load step \(\Delta \lambda \). \(\kappa _2\) (\(\kappa _2\) is part of the expression for \({\dot{\rho }}_1\), see (39)) is considerably more sensitive to the size of \(\Delta \lambda \) than \(\rho _1\) because of depending also on the third derivative of \(\mathbf {r}_1\), see (41). For the thrust-line arch, analyzed with the Abaqus element B32, the median value of \(\kappa _2\) is numerically stable for \(\Delta \lambda \ge 0.005\). For pure bending, the median value of \(\kappa _2\) for the Abaqus element B32OS is numerically stable for \(\Delta \lambda \ge 0.002\).

Table 2 Median value of \(\rho _1\) for different ”temporal” discretizations, for \(\mathbf {B}=(\mathbf {K}_T)_0\)
Table 3 Median value of \(\kappa _2\) for different ”temporal” discretizations, for \(\mathbf {B}=(\mathbf {K}_T)_0\)

4 Numerical verification of the asserted geometric-mechanical synergism

4.1 Pure stretching of a two-hinged parabolic arch subjected to a uniformly distributed vertical line load

Figure 1 shows a two-hinged parabolic arch of span L and height H, subjected to a uniformly distributed vertical line load \(\lambda \,{\bar{p}}\), where \({\bar{p}}\) denotes the reference load. The values of L, H, and \({\bar{p}}\) are given as \(6\,\text {m}\), \(2.4\,\text {m}\), and \(8.33\cdot 10^{6}\,\text {N}/\text {m}\), respectively. The initially parabolic arch has a rectangular cross-section. Its height, h, is \(0.2\,\text {m}\), and its width, b, is \(0.1\,\text {m}\). The modulus of elasticity, E, and Poisson’s ratio, \(\nu \), are given as \(200\cdot 10^{9}\,\text {Pa}\) and 0.3, respectively. Although the structure is not a three-hinged arch, the bending moments produced by the given loading are small. Hence, the structure may approximately be considered as a thrust-line arch.

Fig. 1
figure 1

Two-hinged parabolic arch subjected to a uniformly distributed vertical line load

The numerical investigation of the arch was performed with six different finite beam elements. Details of these elements are given in Appendix 2. The convergence study, documented in Table 1, has shown that the discretization with 20 finite elements provides sufficiently accurate results. Therefore, further analyses were performed with 20 finite elements. Preliminary analyses, see Table 1, have shown that this discretization provides sufficiently accurate results. Loss of stability of the arch by flexural bifurcation buckling, characterized by \(\det \left( {\mathbf {K}}_T\right) =0\) and \({\dot{\lambda }}>0\), occurred at \({p=p_S\approx 2.77\cdot 10^{6}\,\text {N}/\text {m}}\). The existence of a stability limit is relevant to this work only insofar as the choice of the coefficient matrix \(\mathbf {B}\) should not have a significantly larger influence on the solution for the radius of the first Frenet-curvature at, and in the vicinity of, the point on the surface curve on the N-dimensional unit hypersphere that corresponds to the stability limit than on the remaining part of this curve.

Following the formulation given in Section 2, the eigenvalue problem (1) was solved for \(\mathbf {B}=({\mathbf {K}}_T)_0\) as well as for \(\mathbf {B}=\mathbf {I}\). According to the hypothetically asserted geometric-mechanical synergism, pure stretching should correlate with \(\rho _1=0\).

Table 4 Verification (in green and bold-face) and falsification (in red and italics), respectively, of the hypothetically asserted geometric-mechanical synergism for \(\mathbf {B}=({\mathbf {K}}_T)_0\) and \(\mathbf {B}=\mathbf {I}\)

It is seen that for \(\mathbf {B}=(\mathbf {K}_T)_0\) the hypothesis is verified for the first four out of the six finite elements considered. Conversely, for \(\mathbf {B}=\mathbf {I}\), the hypothesis is verified just for the last two of these elements. The conclusion from the given numerical results that \(\mathbf {B}=(\mathbf {K}_T)_0\) is superior to \(\mathbf {B}=\mathbf {I}\) would be premature. Nevertheless, the orthogonality of the eigenvectors with respect to \(\mathbf {B}=(\mathbf {K}_T)_0\) in addition to the one with respect to \(\mathbf {K}_T\) is viewed as an advantage because of leading to an a priori known N-fold initial eigenvalue \((\chi _i)_0=1\), representing a constraint on \(\chi _1(\xi (\lambda ))\), which is the basis for computation of \(\rho _1(\xi (\lambda ))\).

Figure 2 shows \(\rho _1-p\) diagrams, obtained with \(\mathbf {B}=({\mathbf {K}}_T)_0\) and \(\mathbf {B}=\mathbf {I}\) and with the Abaqus-element B32. Apart from small deviations of \(\rho _1\) from 0, ranging from 0.0011 to 0.17, \(\mathbf {B}=({\mathbf {K}}_T)_0\) provides the correct result. The upper bound, \(\rho _1=0.17\), refers to a load level well above the stability limit, and out of the region of p in Fig. 2a. This confirms the main theoretical finding, reported in Section 2, that \(\mathbf {B}=({\mathbf {K}}_T)_0\) enables \(\mathbf {r}_1=\mathbf{const}. \), which is the basis for the hypothesized geometric-mechanical synergism of \(\rho _1=0\) and \(U=U_M\).

Fig. 2
figure 2

\(\rho _1-p\) diagrams obtained with (a\(\mathbf {B}=({\mathbf {K}}_T)_0\) and (b\(\mathbf {B}=\mathbf {I}\), and with 20 Abaqus-elements B32

4.2 Pure bending of a simply supported beam subjected to equal bending moments at both ends

Figure 3 shows a simply supported beam of length \(L=5\,\text {m}\). The IPE 400 beam is subjected to bending moments \(\lambda \,{\bar{M}}_y\), with \({\bar{M}}_y=500\,\text {kNm}\), at both ends. The values of E, \(\nu \), and of the second moment of area, \(I_y\), are \(210\cdot 10^{9}\,\text {Pa}\), 0.3, and \(220\cdot 10^{-6}\,\text {m}^4\), respectively.

Fig. 3
figure 3

Simply supported beam subjected to equal bending moments at both ends

The numerical investigation of the beam was performed with the same finite elements that were used for the analysis of the thrust-line arch. The convergence study, documented in Table 1, has shown that the discretization with 20 finite elements provides sufficiently accurate results. Loss of stability of the beam by flexural-torsional buckling, characterized by \(\det \left( {\mathbf {K}}_T\right) =0\), occurred at \(M_y=M_{y,S}\approx 286\cdot 10^{3}\,\text {N}\,\text {m}\).

The linear eigenvalue problem (1) was solved with \(\mathbf {B}=({\mathbf {K}}_T)_0\) as well as with \(\mathbf {B}=\mathbf {I}\). According to the hypothetically asserted geometric-mechanical synergism, \(U_M=0\) should correlate with \(\rho _1=1\). Table 5 refers to verification and falsification, respectively, of this assertion for the two coefficient matrices \(\mathbf {B}\) and the six finite beam elements described in Appendix 2.

Table 5 Verification (in green and bold-face) and falsification (in red and italics), respectively, of the hypothetically asserted geometric-mechanical synergism for \(\mathbf {B}={({\mathbf {K}}_T)_0}\) and \(\mathbf {B}={\mathbf {I}}\)

It is seen that for \(\mathbf {B}=({\mathbf {K}}_T)_0\) the hypothesis is only verified for the accurate Pi&Bradford element and the Abaqus elements B32OSH and B31OSH. Since the approximate Pi&Bradford element and the Abaqus elements B32OS and B33H provided the correct results for the displacements of the beam and the von Mises stress, membrane locking was ruled out as a possible reason for the falsification of the hypothesis with these three elements. The fact that the Abaqus element B32OSH is an expansion of the Abaqus element B32OS, characterized by additional degrees of freedom to avoid volumetric locking for materials with values of Poisson’s ratio larger than 0.4999999 [1, p. 357], see Appendix 2, might be the reason for its success in the given case, although the value of Poisson’s ratio was chosen as 0.3. A counterargument to the assumed significance of avoiding volumetric locking is the failure of the hypothesis for the Abaqus element B32OSH in case of \(\mathbf {B}=\mathbf {I}\). Interestingly, for that case, both Pi&Bradford elements and the Abaqus element B32OS provide the correct result. Figure 4 shows \(\rho _1-M\) diagrams obtained with \(\mathbf {B}=({\mathbf {K}}_T)_0\) and \(\mathbf {B}=\mathbf {I}\) and with the Abaqus element B32OSH.

Fig. 4
figure 4

\(\rho _1-M_y\) diagrams obtained with (a) \(\mathbf {B}=({\mathbf {K}}_T)_0\) and (b) \(\mathbf {B}=\mathbf {I}\), and with the Abaqus element B32OSH

In order to demonstrate that the existence of a stability limit may affect the solution for \(\rho _1\) in case of an inappropriate choice of the coefficient matrix \(\mathbf {B}\), the \(\rho _1-\lambda \) diagram obtained with the CLE, i. e. with the variable, symmetric, indefinite matrix \(\mathbf {B}=-{\dot{{\mathbf {K}}}}_T\), is shown in Fig. 5.

Fig. 5
figure 5

\(\rho _1-\lambda \) diagram obtained with the CLE [8], taken from [8]; (the dashed line connects the last computed value with the incorrect theoretical value \(\rho _1(\lambda _S)=0\))

The reason why the hypothesis fails at the stability limit is shown hereafter. Analogous to the derivation of (7), see Appendix 1, the coefficient \(c_{1j}\) of

$$\begin{aligned} \dot{\mathbf {r}}_1=\sum _{j=2}^N\,c_{1j}\,\mathbf {r}_j\, \end{aligned}$$
(35)

is obtained as [8]

$$\begin{aligned} c_{1j}={\chi _1}\,\frac{\mathbf {r}_j\,\ddot{{\mathbf {K}}}_T\,\mathbf {r}_1}{(\chi _1-\chi _j)\,\mathbf {r}_j\,{\dot{{\mathbf {K}}}}_T\,\mathbf {r}_j}\,. \end{aligned}$$
(36)

Because of \(\chi _1=0\) at \(\lambda =\lambda _S\),

$$\begin{aligned} \dot{\mathbf {r}}_1(\lambda =\lambda _S)=\mathbf {0}\,, \end{aligned}$$
(37)

resulting in

$$\begin{aligned} \rho _1(\lambda =\lambda _S)=0 \end{aligned}$$
(38)

instead of 1.

5 Numerical falsification of the hypothetically extended range of validity of the geometric-mechanical synergism

The range of the geometric-mechanical synergism was hypothetically extended to a variable ratio \(({U-U_M})/U\). It has been hypothesized that the ratio \(({U-U_M})/U\), depending on the load level, is equal to a variable radius of the first Frenet-curvature, \(\rho _1\). This calls for a mathematical expression for the rate of change of \(\rho _1\), which is given as

$$\begin{aligned} {\dot{\rho }}_1=-\dot{s}_1\,\kappa _{2}\,(\mathbf {r}_1\!\cdot \!\mathbf {e}_{3})\,. \end{aligned}$$
(39)

Appendix 3 contains the derivation of (39). In this relation,

$$\begin{aligned} \dot{s}_1=\left\| \dot{\mathbf {r}}_1\right\| \end{aligned}$$
(40)

denotes the speed of a fictitious particle, moving along the curve on the surface of the N-dimensional unit hypersphere, described by the vector \(\mathbf {r}_1(\xi (\lambda ))\), \(\kappa _{2}\,\mathbf {e}_{3}\) is a vector of length \(\kappa _{2}\) in the direction of the binormal vector \(\mathbf {e}_{3}\), which is a unit vector; \(\kappa _{2}\) stands for the second Frenet-curvature [2, 5] of the surface curve; \(\kappa _{2}\,\mathbf {e}_{3}\) is given as [2, 5, 14]

$$\begin{aligned} \kappa _{2}\,\mathbf {e}_{3}=\frac{\kappa _{1}\,\mathbf {r}_1'''-\kappa _{1}'\,\mathbf {r}_1''}{\kappa ^2_{1}}+\mathbf {r}_1',\qquad ':=\frac{\mathrm d}{{\mathrm d}{{{s_1}}}}\,. \end{aligned}$$
(41)

In (41), \(\kappa _{1}=1/{\rho _1}\), with \(\kappa _{1}\) denoting the first Frenet-curvature of the surface curve, and

$$\begin{aligned} \mathbf {r}_1'=\frac{\dot{\mathbf {r}}_1}{\dot{s}_1},\qquad \mathbf {r}_1''=\frac{1}{\dot{s}^2_1}\left( \ddot{\mathbf {r}}_1-\frac{\ddot{s}_1}{\dot{s}_1}\,\dot{\mathbf {r}}_1\right) ,\qquad \mathbf {r}_1'''=\frac{1}{\dot{s}^3_1}\left[ \dddot{\mathbf {r}}_1-3\,\frac{\ddot{s}_1}{\dot{s}_1}\,\ddot{\mathbf {r}}_1+\left( 3\,\frac{\ddot{s}^2_1}{\dot{s}^2_1}-\frac{\dddot{s}_1}{\dot{s}_1}\right) \,\dot{\mathbf {r}}_1\right] \,, \end{aligned}$$
(42)

with \(\dot{s}_1\) according to (40) and

$$\begin{aligned} \ddot{s}_1=\frac{\dot{\mathbf {r}}_1\cdot \ddot{\mathbf {r}}_1}{\dot{s}_1}, \qquad \dddot{s}_1=\frac{\left( \ddot{\mathbf {r}}_1\cdot \ddot{\mathbf {r}}_1+\dot{\mathbf {r}}_1\cdot \dddot{\mathbf {r}}_1\right) \,\dot{s}_1-\left( \dot{\mathbf {r}}_1\cdot \ddot{\mathbf {r}}_1\right) \,\ddot{s}_1}{\dot{s}^2_1}\,. \end{aligned}$$
(43)

In the subsequent numerical investigation, \(\dot{\mathbf {r}}_1\) and \(\ddot{\mathbf {r}}_1\) are approximated by the central finite-difference expressions (33) and (34); \(\ddot{\mathbf {r}}_1\) is approximated by the central finite-difference expression

$$\begin{aligned} \dddot{\mathbf {r}}_1(\xi _1)\approx \frac{\mathbf {r}_1(\xi _1+2\,\Delta \xi _1)-2\,\mathbf {r}_1(\xi _1+\Delta \xi _1)+2\,\mathbf {r}_1(\xi _1-\Delta \xi _1)-\mathbf {r}_1(\xi _1-2\,\Delta \xi _1)}{2\,\Delta \xi _1^3}\,. \end{aligned}$$
(44)

The two examples presented in Section 4 are characterized by

$$\begin{aligned} \frac{U-U_M}{U}=\text {const.},\qquad \rho _1\approx \text {const.}\,.\quad \Longrightarrow \quad \left| { \dot{s}}\,\kappa _{2}\,\left( \mathbf {r}_1\!\cdot \!\mathbf {e}_{3}\right) \right| \approx 0\,. \end{aligned}$$
(45)

The condition for an extreme value of \(\rho _1\), for which \(\kappa _2\ne 0\), is given as

$$\begin{aligned} \mathbf {r}_1\!\cdot \!\mathbf {e}_{3}=0\,. \end{aligned}$$
(46)

Figure 6 shows a bar of length \(5\,\text {m}\). The IPE 400 bar is subjected to an eccentric force \(\lambda \,{\bar{P}}\), with \({{\bar{P}}=1\,\text {kN}}\) as the reference compressive force. The eccentricity e, E, and \(\nu \) were chosen as \({40.447\cdot 10^{-3}\,\text {m}}\), \({210\cdot 10^{9}\,\text {Pa}}\) and 0.3, respectively.

Fig. 6
figure 6

Bar subjected to eccentric compression

With the exception of \(({U-U_M})/U\) at \(\lambda =0\), for which \(2^{nd}\)-order theory provides the correct result, the given problem must be solved by a full nonlinear analysis, occasionally called \(3^{rd}\)-order theory [7], for which an analytic solution does not exist. The initial value of \(({U-U_M})/U\) is obtained by specializing the general solution for this ratio by means of \(2^{nd}\)-order theory for \(\lambda =0\). This yields

$$\begin{aligned} \frac{U_B}{U_M}=\frac{\frac{(N_x\,e)^2}{EI_\zeta }}{\frac{N_x^2}{EA}}=\frac{e^2\,A}{I_\zeta }\quad \Leftrightarrow \quad \frac{U-U_M}{U}=\frac{e^2\,A}{e^2\,A+I_\zeta }\,, \qquad e=\frac{M_\zeta }{N_x}\,, \end{aligned}$$
(47)

where A, given as \(8.0678\cdot 10^{-3}\,\text {m}^2\), stands for the area of the cross-section, and \(I_\zeta \), given as \({13.1985\cdot 10^{-6}}\,\text {m}^4\), denotes the second moment of area with respect to the cross-sectional axis \(\zeta \). The value of the eccentricity of the compressive force was chosen such that \(({U-U_M})/U\) at \(\lambda =0\) is equal to 0.5. This resulted in \({e=4.0447\,\text {cm}}\). Table 6 shows that the initial values of \(\rho _1\), obtained with \(\mathbf {B}=(\mathbf {K}_T)_0\) and \(\mathbf {B}=\mathbf {I}\), with four different finite elements each, do not agree with the initial value of \({(U-U_M)}/U\).

Table 6 Initial values of \(\rho _1\) obtained with \(\mathbf {B}=(\mathbf {K}_T)_0\) and \(\mathbf {B}=\mathbf {I}\), with four different finite elements each

Irrespective of the incorrect initial values of \(\rho _1\), the course of the functions \(\rho _1(P)\), where \(P=\lambda \,{\bar{P}}\), will be discussed in the following. Figure 7 shows that \(\rho (P)\) is a non-monotonic function. This correlates with the expected non-monotony of the ratio \(({U-U_M})/U\), indicating that initially the bending energy is increasing more strongly than the membrane energy. After reaching its maximum value, \(\rho (P)\) is decreasing. This correlates with the decrease of \(({U-U_M})/U\), indicating that after reaching its maximum value the membrane energy is increasing more strongly than the bending energy. The reason for termination of the \(\rho -P\) diagram relatively soon after reaching the maximum value of \(\rho \) is the irrelevance of the assumption of a linear elastic material for large deformations. Since the value of \(\rho \)(\(P=0\)) is significantly smaller than the initial value of \(({U-U_M})/U\), i. e. 0.5, the hypothetically extended range of validity of the geometric-mechanical synergism could not be verified. This result is corroborated by the supposition that the load level of \(\rho _{\text {max}}\) does not agree with the one of \((({U-U_M})/U)_{\text {max}}\).

Fig. 7
figure 7

\(\rho _1-P\) diagrams obtained with (a) \(\mathbf {B}=({\mathbf {K}}_T)_0\) and (b) \(\mathbf {B}=\mathbf {I}\), and the Abaqus element B32OS

The expression for \({\dot{\rho }}_1\), see (39), refers to a curve on the surface of an N-dimensional unit hypersphere. The (\(\mathbf {r}_1\!\cdot \!\mathbf {e}_{3}\))\(-P\) diagrams shown in Fig. 8 correspond to the \(\rho _1-P\) diagrams illustrated in Fig. 7. This proves that (46), i. e. \(\mathbf {r}_1\!\cdot \!\mathbf {e}_3=0\), is the condition for an extreme value of \(\rho _1\).

Fig. 8
figure 8

\((\mathbf {r}_1\!\cdot \!\mathbf {e}_3)-P\) diagrams corresponding to the \(\rho _1-P\)-diagrams shown in Fig. 7

One attempt to improve the original hypothesis was to replace \(\rho _1\) by \(\sqrt{1-(\mathbf {r}_1\!\cdot \!\mathbf {e}_3)^2}\). The rationale for this modification was that

$$\begin{aligned} \rho _1\le \sqrt{1-(\mathbf {r}_1\!\cdot \!\mathbf {e}_3)^2}\le 1. \end{aligned}$$
(48)

Another attempt was to explicitly consider \(\kappa _2\) in the hypothesis, albeit without abandoning the restriction of the constancy of the coefficient matrix \(\mathbf {B}\) in the linear eigenvalue problem (1). Both attempts were not successful. The second one did not rule out the possibility of values of \(\rho _1\) larger than 1, which obviously do not correlate with corresponding values of \(({U-U_M})/U\).

6 Conclusions

. .