1 Introduction

Spiral bevel gears are extensively utilized in diverse industrial applications owing to their capability to efficiently transmit high torque, operate smoothly, and offer high reliability in power transmission. Nevertheless, the deployment of spiral bevel gears in high-end applications is confronted with some design challenges, such as minimization of efficiency losses, noise, vibration and harshness (NVH).

The efficiency of spiral bevel gears may be impacted by various factors, which can be categorized into two types: load-independent and load-dependent losses. The former encompass churning and windage losses, which refer to the lubricant being pumped between the mating members and its splashing caused by inertial effects. The latter pertains to rolling and sliding frictional losses, with the former being negligible in comparison to the latter, particularly in geared transmissions. Our study specifically concentrates on sliding frictional losses, which are primarily influenced by the sliding motion between the teeth, gear mesh stiffness, and tooth modifications.

The core aspects involved in correctly modeling the mechanical losses are:

  1. 1.

    loaded tooth contact analysis (LTCA) to predict load sharing between the mating teeth and the pressure distribution over the contact zones;

  2. 2.

    calculation of the geometric and kinematic properties of the contacting zones (i.e., entraining velocity of the lubricant, sliding velocity, and surface curvatures);

  3. 3.

    estimation of the elasto-hydrodynamic lubrication (EHL) friction coefficient;

Numerous models and methods exist in the literature for evaluating friction-induced losses. The primary differences between these models and methods lie in the semi-empirical formulas used to estimate the EHL friction coefficient and/or the tools used to perform the LTCA.

Li et al. [1] proposed an EHL friction model for helical gears. In this model, the contacting bodies are sliced along the entire contact zone, and a line contact model is postulated on each slice. The authors noted a conflicting trend on tooth flank modifications between mechanical efficiency and other transmission specifications, such as maximum pressure and loaded transmission error (LTE).

Xu et al. [2] applied a similar EHL friction model to estimate efficiency losses in hypoid gears. In this model, a large set of results from EHL analyses were fitted to obtain a semi-empirical coefficient formula, while LTCA was carried out using the same simulation tool employed in this paper (Calyx, [3]). However, their EHL model did not account for mixed and boundary lubrication conditions.

Kolivand et al. [4] started from the same EHL friction model and improved it to also account for mixed and boundary lubrication, with the aim of investigating efficiency losses in hypoid gears. However, the LTCA tool was replaced by the Hypoid Analysis Program (HAP) [5], which uses a semi-analytical shell theory to compute the tooth compliance and an ease-off–based approach to perform LTCA analyses. If compared to the former, HAP trades computational efficiency for accuracy, especially at high loads.

Cao et al. [6] evaluated the lubrication performance under different possible contact paths on bevel gears. Mohammadpour et al. [7] carried out an EHL analysis on hypoid gears under relatively high loads, while Pu [8] analyzed the lubrication of hypoid gears taking into account the three-dimensional surface roughness. However, despite their accurate tribological analyses, many of the cited contributions fall short in providing reliable contact analysis results. In fact, an accurate LTCA tool has a paramount importance for the evaluation of the contact pressures and the subsequent estimation of the friction coefficient. Moreover, most of the cited references do not offer precise estimations of the lubricant’s entraining velocity, often relying instead on calculating it as the average of the absolute velocities of the mating pairs at their contact points.

Ziegltrum et al. [9] provided an accurate estimation of the local friction coefficient of spur gears under transient thermo-elasto-hydrodynamic lubrication (TEHL) conditions using FE-based multiphysics software. Their results were also validated against experimental data for different lubricants.

Paschold et al. [10] mathematically modeled the heat transfer during meshing of worm gears in order to accurately investigate their overall efficiency losses.

NVH issues in spiral bevel gears can largely be attributed to gear meshing errors, manufacturing errors, and misalignments. These problems may result in increased noise levels, vibration, and reduced reliability of the gear system. Consequently, understanding the underlying causes of NVH problems and efficiency losses in spiral bevel gears and developing strategies to alleviate them is crucial to enhance their performance and reliability. In this study, improvements in NVH are dealt with by minimizing the harmonic amplitudes of the LTE through appropriate ease-off modifications.

The relationship between LTE and micro-geometry has been extensively demonstrated in the literature. Simon [11] demonstrated how adjusting higher-order motions associated with the cradle radial setting and the modified roll could decrease LTE. Similarly, Su et al. [12] proposed a methodology to design a seventh-order transmission ratio function based on rigid TCA assumptions. A numerical test case demonstrated that this method could provide advantages in terms of transmission error under load. However, the effective values of the parameters representing the seventh-order function are assumed to be known, and no guidelines are provided for deriving them. Along the same lines, [13] proposed an optimization framework to minimize LTE. Here, higher-order motions associated with the coefficients of the modified vertical and helical motions are identified during the optimization process.

Several studies in the literature have proposed automatic optimization methods to improve the performance of spiral bevel gears during meshing. Mermoz et al. [14] used accurate finite element analyses to carry out LTCA and determine the contact pressure, which then served as the objective function minimized by the BOSS/Quattro commercial optimization software. Similarly, Astoul et al. [15] also used this methodology to minimize quasi-static transmission errors. However, both studies directly optimize a limited subset of machine settings, specifically the modified-roll coefficients, which may not be ideal when starting from a basic design that exhibits poor contact patterns.

Likewise, a recent study by Li et al. [16] uses finite element analyses to perform multi-objective optimization, wherein efficiency, contact pressure, and LTE are simultaneously minimized. Machine settings are selected as the optimization variables following a sensitivity analysis.

However, it should be noted that the presented optimization test case is somewhat questionable, as the LTCA is carried out with just 10 Nm pinion torque (a very lightly loaded tooth contact analysis).

Notably, the Particle Swarm Optimization-Gravitational Search Algorithm (PSOGSA) [17] and the Fast Elitist Nondominated Sorting Genetic Algorithm (NSGA-II) [18] have been effectively utilized for optimizing and exploring Pareto-optimal solutions in spiral bevel and hypoid gears. However, it is widely recognized that these algorithms are computationally expensive, especially in terms of number of function evaluations to convergence (and the number of iterations to convergence is often set a priori). Therefore, to obtain solutions within reasonable time frames, these solvers still need to be coupled with relatively simple and approximate contact models.

The present study involves the automated optimization of tooth flank topography using ease-off polynomial functions to explore potential trade-offs between NVH and efficiency. Initially, an EHL friction model is briefly presented, where the entraining velocity is rigorously evaluated to estimate the shear strain rate within the film. This model has already been introduced and validated against experimental data in [19, 20], specifically for a hypoid gearset with a 75W90 oil, but further improvements are described in the present work. Other losses, such as windage, churning, and rolling resistance, are excluded due to their limited dependence on the tooth flank topography. The study uses the MIL-L-23699 lubricant, which is a neopentyl polyol ester base oil commonly used in aerospace applications such as gas turbine engines and aircraft gearboxes.

Subsequently, the multi-objective optimization framework previously described in [21] is briefly reviewed. To ensure accurate LTCA analyses, the Calyx contact solver is employed, although the methodology proposed in this work may well be complemented by other contact analysis tools (i.e. Becal [22] or other general-purpose FEM software). The optimizations are carried out by a relatively simple direct-search algorithm (MATLAB’s patternsearch). An important characteristic of this innovative framework is that, despite utilizing ideal polynomial functions to steer the optimization process, a tailored identification algorithm is applied at each iteration to project the ease-off into the set of deviations that can be actually manufactured, as presented in [21]. This approach offers the benefit of driving the expensive optimization loop with a minimal number of variables, while determining, with the embedded identification algorithm, the most appropriate manufacturable flank modifications from a relatively extensive set of machine-tool settings.

To determine the boundaries of the Pareto front, single-objective optimizations are initially carried out. Then, the Pareto front is explored by solving additional multi-objective optimization problems. The contact patterns for all points sampled on the Pareto front are also shown.

2 EHL friction coefficient model and efficiency estimation

2.1 EHL friction

There are a number of factors that influence friction, such as the lubrication regime (full film, mixed, boundary), the behaviour of the lubricant with varying operating conditions (temperature, pressure, shear rate) and the surrounding environment (the boundary conditions of the lubricated contact).

An EHL friction coefficient model similar to the one described in [19] is proposed here. The model is briefly reintroduced in this section, since there are some differences compared to the previous implementation, particularly in the load sharing function and in the rheological model of the lubricant.

The friction coefficient \(f\) is determined based on a load sharing factor between boundary and full-film lubrication conditions, which is described by the function \(X(\Lambda)\). It is well known in the literature that \(f\) can be calculated using the following equation:

$$f=f_{h}(1-X(\Lambda))+f_{b}(X(\Lambda)).$$
(1)

Here \(f_{h}\) and \(f_{b}\) are the friction coefficients related to the hydrodynamic (full fluid) and the boundary lubrication conditions, respectively. \(\Lambda\) is the ratio between the film thickness \(h\) and the equivalent root-mean-square roughness of the contacting body surfaces:

$$\Lambda=\frac{h}{\sqrt{R_{q1}^{2}+R_{q2}^{2}}}$$
(2)

Usually, the central film thickness \(h_{c}\) is used, and \(R_{q1}\) and \(R_{q2}\) are the root-mean-square roughnesses of the two surfaces.

Several expressions can be found for the load sharing factor \(X(\Lambda)\) in the literature. Differently from what has been described in [19], the recent findings by Taylor and Sherrington [23] are employed in this work. Their study suggests a reverse S‑shaped curve for \(X(\Lambda)\), as represented by the following formula:

$$X(\Lambda)=\frac{1}{(1+\Lambda^{k})^{a}}$$
(3)

According to [23], and based on experimental data from several lubricant oils, the recommended values for \(k\) and \(a\) here employed are 1.453 and 1.32, respectively.

The coefficient of boundary friction \(f_{b}\) is considered constant, and it has to be experimentally derived. Based on data available in the literature [23], its value ranges from 0.1 to 0.14, depending on the employed lubricant oil. In this work, a worst-case value of \(f_{b}=0.14\) is assumed.

The mean value of \(f_{h}\) can be obtained by dividing the shear stress, \(\tau\), by the mean contact pressure \(p_{m}\):

$$f_{h}=\frac{\tau}{p_{m}}$$
(4)

In our framework, \(p_{m}\) is obtained directly from contact analysis tool for each contact zone.

The shear stress \(\tau\) is estimated according to the Bair-Winer rheological model:

$$\tau=\tau_{L}\left(1-e^{-\frac{\eta}{\tau_{L}}\dot{\gamma}}\right).$$
(5)

Here, \(\tau_{L}\) is the limiting shear stress, \(\eta\) is the dynamic viscosity, and the shear strain rate is approximated by \(\dot{\gamma}=\frac{v_{s}}{h_{c}}\), with \(\Delta u\) being the sliding velocity between the two mating teeth at their nominal contact point.

The limiting shear stress \(\tau_{L}\) is derived for specific pressure and temperature conditions according to the following exponential relation [24]:

$$\tau_{L}=\tau_{L0}e^{\alpha_{L}(p_{m}-p_{a})+\beta_{L}(\frac{1}{T}-\frac{1}{T_{0}})}$$
(6)

The dynamic viscosity \(\eta\) variation with temperature \(T\) and pressure \(p\) is described by the Roelands and modified Barus formulas:

$$\eta=\eta_{0}e^{\alpha(p_{m}-p_{a})},$$
(7)
$$\eta_{0}=6.31\cdot 10^{-5+G_{0}(1+\frac{T}{135})^{-S_{0}}},$$
(8)

The viscosity-pressure coefficient \(\alpha\) can be estimated as follows:

$$\alpha=m+n\log(\eta_{0})$$
(9)

To estimate the remaining coefficients in Eqs. (8), (7) and (9), empirical data specific to the lubricant being used is necessary. Table 1 contains the relevant data for the MIL-L-23699 lubricant.

Different expressions for the evaluation of the central film thickness can be found in the literature. Four different lubrication regimes can be observed for nonconformal contacts, depending on the elastic deformation of the bodies and the variation of viscosity with pressure. The four regimes are usually indicated as isoviscous-rigid IR, piezoviscous-rigid PR, isoviscous-elastic IE and piezoviscous-elastic PE [25]. Bassani et al. [26] provided semi-empirical formulas for the central film thickness for the more general case in which the entraining velocity is not collinear with any principal direction of the curvatures:

$$\begin{aligned}h_{c_{IR}}=128\frac{\eta_{0}^{2}u^{2}R_{e}^{2}R_{s}}{F^{2}}\left(0.131\arctan{\frac{R_{s}}{2R_{e}}}+1.683\right)^{2}\left(1+\frac{2R_{e}}{3R_{s}}\right)^{-2}\end{aligned}$$
(10)
$$\begin{aligned}h_{c_{PR}}=141\frac{\eta_{0}^{1.25}\alpha^{0.375}u^{1.25}R_{e}^{1.5}}{F^{0.875}}\left(1-e^{-0.0387\frac{R_{s}}{R_{e}}}\right)\end{aligned}$$
(11)
$$\begin{aligned}h_{c_{IE}}=11.15\frac{\eta_{0}^{0.66}u^{0.66}R_{e}^{0.766}}{F^{0.213}E^{0.447}}\left(1-0.72e^{-0.28(\frac{R_{s}}{R_{e}})^{\frac{2}{\pi}}}\right)\end{aligned}$$
(12)
$$\begin{aligned}h_{c_{PE}}=3.61\frac{\eta_{0}^{0.68}\alpha^{0.53}u^{0.68}R_{e}^{0.446}}{E^{0.087}F^{0.063}}\left(1-0.61e^{-0.73(\frac{R_{s}}{R_{e}})^{\frac{2}{\pi}}}\right)\end{aligned}$$
(13)

Here, \(E\) is the equivalent elastic modulus, \(u\) is the entraining velocity, and \(R_{e}\) and \(R_{s}\) are the equivalent radii of curvature, respectively, parallel and perpendicular to \(u\). The calculation of those quantities is detailed in the next subsection. The proper lubrication regime is determined in a practical way by taking the highest of the values given by Eqs. (10)–(13).

Thermal effects are included using a reduced value for the central film thickness. This is obtained via scaling it by a dimensionless reduction factor \(\Phi\), for which several models are available in the literature. The one described in [27] is employed in our model:

$$\Phi=\frac{1}{1+0.1(1+8.33S^{0.83})L^{0.64}},$$
(14)

where \(S=\Delta u/u\) and \(L=\beta\frac{\eta_{0}u^{2}}{k}\) is the dimensionless thermal loading parameter. Their values can be elicited from Table1.

It should be emphasized that the equations introduced within this section serve as preliminary estimations of the lubricant film thickness and coefficient of friction. Of noteworthy consideration, the thermal effects linked to high sliding speeds are currently unaccounted for in the given formulas.

Table 1 Coefficients of the MIL-L-23699 oil

2.2 Kinematic relationships at the contact point

In order to use the formulas presented in the previous subsection, it is necessary to calculate the entraining velocity \(u\) and the slide-to-roll ratio \(S\). The entraining velocity of the lubricant is the relative velocity of the surface with respect to the contact point, or vice versa. The overall entraining velocity of the lubricant is the average of the entraining velocities of the two mating surfaces.

Previous studies [20, 28] have presented formulas for the entraining velocity that presume the availability of a local orthogonal parametrization of surface coordinates [29]. This is usually too strong of an hypothesis and certainly not very convenient when dealing with spiral bevel (and hypoid) gears. Here, we set out to generalize the calculations of the entraining velocity. To this sake, we first introduce the general equations that describe the contact point on the surfaces of two mating bodies, as shown in Fig. 1. Given \(\mathbf{c}_{1}=\mathbf{c}_{1}(\boldsymbol{\alpha}_{1})\) and \(\mathbf{c}_{2}=\mathbf{c}_{2}(\boldsymbol{\alpha}_{2})\) in tangent contact, the following two equations are satisfied:

$$\begin{cases}\mathbf{d}_{1}+\mathbf{c}_{1}-\mathbf{d}_{2}-\mathbf{c}_{2}=0\\ \mathbf{n}_{1}\left(\widehat{\boldsymbol{\omega}}_{1}\mathbf{c}_{1}+\dot{\mathbf{d}}_{1}-\widehat{\boldsymbol{\omega}}_{2}\mathbf{c}_{2}-\dot{\mathbf{d}}_{2}\right)=0.\end{cases}$$
(15)

Here, \(\boldsymbol{\alpha}_{1},\,\boldsymbol{\alpha}_{2}\in\mathbb{R}^{2}\) represent the surface coordinates and \(\mathbf{d}_{1}\) and \(\mathbf{d}_{2}\) mark the position of a generic point on the axis of each mating surface. The first equation expresses coincidence of the two surfaces at the contact point, while the second is the equation of meshing. The symbol \(\mathbf{n}_{1}\) denotes the unit normal vector of surface 1 at the contact point, while \(\widehat{\boldsymbol{\omega}}_{1}\) and \(\widehat{\boldsymbol{\omega}}_{2}\) are the spatial angular velocities (with respect to the fixed frame \({S_{0}}\)) in their skew-symmetric matrix representation.

Fig. 1
figure 1

Schematic representation of two contacting bodies

To determine the contact point velocity, we differentiate the system of Eqs. (15) with respect to time, which results in the following linear system of equations in \(\dot{\boldsymbol{\alpha}}_{1}\) and \(\dot{\boldsymbol{\alpha}}_{2}\):

$$A\begin{bmatrix}\dot{\boldsymbol{\alpha}}_{1}\\ \dot{\boldsymbol{\alpha}}_{2}\end{bmatrix}=\mathbf{b},$$
(16)

where

$$\begin{aligned}A=\begin{bmatrix}\mathbf{c}_{1},_{\boldsymbol{\alpha}_{1}}&-\mathbf{c}_{2},_{\boldsymbol{\alpha}_{2}}\\ \left(\widehat{\boldsymbol{\omega}}_{1}\mathbf{c}_{1}-\widehat{\boldsymbol{\omega}}_{2}\mathbf{c}_{2}\right)\mathbf{n}_{1,{\boldsymbol{\alpha}_{1}}}+{\mathbf{n}_{1}}^{T}\widehat{\boldsymbol{\omega}}_{1}\mathbf{c}_{1},_{\boldsymbol{\alpha}_{1}}&-{\mathbf{n}_{1}}^{T}\widehat{\boldsymbol{\omega}}_{2}\mathbf{c}_{2},_{\boldsymbol{\alpha}_{2}}\end{bmatrix},\end{aligned}$$
(17)
$$\mathbf{b}=\begin{bmatrix}\widehat{\boldsymbol{\omega}}_{2}\mathbf{c}_{2}-\widehat{\boldsymbol{\omega}}_{1}\mathbf{c}_{1}\\ -\mathbf{n}_{1}^{T}\left(\widehat{\boldsymbol{\omega}}_{1}\widehat{\boldsymbol{\omega}}_{2}-\widehat{\boldsymbol{\omega}}_{2}^{2}\right)\mathbf{c}_{2}\end{bmatrix},$$
(18)

and

$$\mathbf{c}_{i,\boldsymbol{\alpha}_{i}}=\begin{bmatrix}\mathbf{c}_{i,\alpha_{i1}}&\mathbf{c}_{i,\alpha_{i2}}\end{bmatrix},\text{ }i=1,2.$$
(19)

We shall note that the terms in \(A\) and \(\mathbf{b}\) have been obtained by assuming \(\dot{\mathbf{d}}_{1}=\dot{\mathbf{d}}_{2}=\ddot{\mathbf{d}}_{1}=\ddot{\mathbf{d}}_{2}=\mathbf{0}\) (which is exact for mating gears) and \(\dot{\widehat{\boldsymbol{\omega}}}_{1}\simeq\dot{\widehat{\boldsymbol{\omega}}}_{2}\simeq 0\) (which is correct under the hypothesis of constant angular velocities). The scheme of two mating bevel gears are shown in Fig. 2. It can be observed that \(\mathbf{d}_{1}\) and \(\mathbf{d}_{2}\) have the function of incorporating possible misalignments or mounting errors into the system. These errors are represented by the typical \((E,P,G,\alpha)\) parameters. The solution of Eq. (16) allows us to obtain the velocity of the contact point as it moves over the two surfaces as follows:

$$\begin{aligned}\mathbf{u}_{\text{e}1} & =\mathbf{c}_{1},_{\boldsymbol{\alpha}_{1}}\dot{\boldsymbol{\alpha}}_{1}\end{aligned}$$
(20)
$$\begin{aligned}\mathbf{u}_{\text{e}2} & =\mathbf{c}_{2},_{\boldsymbol{\alpha}_{2}}\dot{\boldsymbol{\alpha}}_{2}.\end{aligned}$$
(21)

The overall entraining velocity then becomes:

$$\mathbf{u}_{\text{e}}=\frac{1}{2}\left(\mathbf{u}_{\text{e}1}+\mathbf{u}_{\text{e}2}\right)$$
(22)
Fig. 2
figure 2

Reference frames of two contacting bevel gears with misalignments \((E,P,G,\alpha)\). Symbol \(\Sigma\) denotes the shaft angle

The surface contact points \(\mathbf{c}_{\text{j}}\), tangential vectors \(\mathbf{c}_{\text{j},\boldsymbol{\alpha}_{\text{j}}}\), and normals \(\mathbf{n}_{\text{j}}\) (\(j=1,2\)) required in Eq. (16) can be obtained at each time step and contact zone through post-processing of the LTCA data. Additionally, the principal curvatures \(K_{x}\) and \(K_{y}\) and their corresponding directions \(\boldsymbol{\tau}_{x}\) and \(\boldsymbol{\tau}_{y}\) for the equivalent contact can also be extracted.

Finally, to obtain the radius of curvature \(R_{e}\) along the entraining direction and along the side-leakage direction \(R_{s}\), the angle \(\theta\) between \(\boldsymbol{\tau}_{x}\) and \(\mathbf{u}_{\text{e}}\) can be determined by:

$$\theta=\arctan\left(\left({\mathbf{u}_{\text{e}}}^{T}\boldsymbol{\tau}_{y}\right)/\left({\mathbf{u}_{\text{e}}}^{T}\boldsymbol{\tau}_{x}\right)\right)$$
(23)

which can then be used to transform the principal radii of curvature:

$$\begin{aligned}\displaystyle R_{e}=\left(K_{x}\cos^{2}\theta+K_{y}\sin^{2}\theta\right)^{-1}\\ \displaystyle R_{s}=\left(K_{x}\sin^{2}\theta+K_{y}\cos^{2}\theta\right)^{-1}.\end{aligned}$$
(24)

2.3 Frictional power losses estimation

The friction coefficient computed using Eq. (1) represents the mean coefficient of friction for a mating tooth pair over the instantaneous contact zone. Although for the determination of such friction coefficient the sliding and entraining velocities are evaluated at the mean contact point, power loss estimation requires that the sliding velocity field is considered over the entire contact zone. To achieve this, sliding velocities and normal loads applied to each cell of the computational grid are extracted from the simulation post-process data. The computational grid is the local surface mesh employed by the FEA solver to compute contact pressures (through the Boussinesq solution for an elastic half-space), as shown in Fig. 3. The frictional power loss at the \(i\)-th timestep and at the \(k\)-th contact zone, associated with a friction coefficient \(f_{ik}\), is then estimated as the sum of the products of the sliding velocities \(v_{s_{ikl}}\) and the normal loads \(F_{ikl}\) for all \(N_{k}\) cells within the contact zone:

$$P_{ik}=f_{ik}\sum_{l}^{N_{k}}{v_{s_{ikl}}F_{ikl}}.$$
(25)
Fig. 3
figure 3

Schematic representation of the surface mesh grid employed by the simulation tool. The lubrication properties are evaluated by computing the kinematic properties at the principal contact point of the instantaneous contact zone, whereas the power losses account for the sliding speeds and normal forces at all the individual cells of the grid

3 Optimization and Pareto-front exploration

This section introduces the framework for optimizing the micro-geometry of face-milled spiral bevel gears. The optimization pipeline for the micro-geometry is depicted in Fig. 4, outlining the steps presented in the next sections. The framework utilizes MATLAB interfaced with a FEA-based contact solver to launch an analysis and subsequently post-process the results, as initially described in [21].

Fig. 4
figure 4

Flowchart of the parallelized micro-geometry optimization framework

The main components of the pipeline are the following:

  1. 1.

    definition of the initial geometry and the optimization variables;

  2. 2.

    identification of the machine-tool settings which generate the closest flank modifications described by the polynomial ease-off;

  3. 3.

    description of the interface between the optimizer and the LTCA simulation;

  4. 4.

    post-processing of the LTCA simulation results for the calculation of objective functions and constraints.

The LTCA solver, by being finite element-based, is known to be computationally expensive, especially when employed in automatic optimizations. Furthermore, numerical noise caused by the discretization of the underlying problem would hamper its direct use in gradient-based optimizers.

To overcome both limitations at once, the (direct search) patternsearch algorithm, from the MATLAB’s Optimization Toolbox, has been employed with parallel calls to LTCA simulations. At each iteration, a complete poll is performed in parallel, exploring the variable space along all its directions, thereby significantly reducing the optimization time. It is important to note, however, that patternsearch is designed as a local minimizer. During the exploration of the Pareto-front within a multi-objective optimization scenario (presented in the numerical section), our expertise guided the choice of initial guesses to ensure that a satisfactory solution is obtained by the optimizer, particularly for the initial single-objective minimizations. Subsequently, the initial guesses were provided by averaging the solutions found during the Pareto-front exploration.

3.1 Initial gear geometry generation and optimization variables

The surface geometry generation is a well-established topic in the gear literature. Pioneered by Litvin [30], mathematical tools to simulate the envelope surface between the grinding tool and the working blank, borrowed from differential geometry, have been extensively described and applied. Nonetheless, to make this step more systematic and elegant, the Lie Group method borrowed from robotics [31] has been employed to parameterize the face-milling gear generation.

In order to clearly describe the machine-tool setting identification algorithm, it is necessary to reintroduce the fundamental mathematical concepts involved in describing the family of tool motions and the well-known equation of meshing. Fig. 5 shows the cross section profile of the grinding wheel. The tool surface points are defined by \(\mathbf{p}_{t}(\mathbf{x}_{t},\xi,\theta)\) and the associated normals by \(\mathbf{n}_{t}(\mathbf{x}_{t},\xi,\theta)\), where \((\xi,\theta)\) are the surface variables and \(\mathbf{x}_{t}\) are the tool settings that define its shape. The envelope family of tool surfaces (and its normals) during motion can be expressed as followsFootnote 1:

$$\begin{aligned}\displaystyle\mathbf{p}_{g}(\mathbf{x},\boldsymbol{\zeta})=g_{gt}(\mathbf{x}_{m},\varphi)\mathbf{p}_{t}(\mathbf{x}_{t},\xi,\theta)\\ \displaystyle\mathbf{n}_{g}(\mathbf{x},\boldsymbol{\zeta})=g_{gt}(\mathbf{x}_{m},\varphi)\mathbf{n}_{t}(\mathbf{x}_{t},\xi,\theta)\end{aligned},$$
(26)

The homogeneous transformation matrix \(g_{gt}(\mathbf{x}_{m},\varphi)\in SE(3)\) maps the tool reference frame \(\{T\}\) to the gear blank frame \(\{G\}\), according to the Gleason UMC hypoid generator shown in Fig. 6, and \(\varphi\) represents the cradle rotation. The vector \(\mathbf{x}_{m}\) includes all the machine settings.

Fig. 5
figure 5

Tool settings that define the grinding wheel section profile. Profile arcs: (I) Edge fillet, (II) Toprem Blend, (III) Spherical radius and (IV) Flankrem blend

Fig. 6
figure 6

Schematic of the Gleason UMC hypoid generator

The points of the tool family belong to the envelope surface if the equation of meshing is satisfied:

$$f(\mathbf{x},\boldsymbol{\zeta})=\mathbf{n}_{t}(\mathbf{x}_{t},\xi,\theta)^{T}\widehat{V}_{gt}^{t}(\mathbf{x}_{m},\varphi)\mathbf{p}_{t}(\mathbf{x}_{t},\xi,\theta)=0.$$
(27)

Here, \(\hat{V}^{t}_{gt}(\mathbf{x}_{m},\varphi)\in se(3)\) is the homogeneous form of the body twist that describes the rigid-body velocity of \(\{T\}\) with respect to \(\{G\}\), with components expressed in frame \(\{T\}\). With \(\boldsymbol{\zeta}=\begin{bmatrix}\xi&\theta&\varphi\end{bmatrix}^{T}\), we compactly represent the enveloping triplet.

To optimize the micro-geometry, ease-off modification surfaces are described by polynomials \(\mathbf{p}_{e}(u,v;\mathbf{c})\) expressed as:

$$\mathbf{p}_{e}(u,v;\mathbf{c})=\mathbf{S}(u,v)^{T}\mathbf{c}.$$
(28)

Specifically, shape functions \(\mathbf{S}(u,v)\) [32] are used to construct a quartic polynomial. A vector \(\mathbf{c}\) is constructed from nine coefficients, each corresponding to a node, as shown in Fig. 7. This vector is used to define the shape of the tooth flank modification, and it is employed as the vector of optimization variables. However, only eight of the nine coefficients are actually used as optimization variables: the 9‑th node is kept fixed (at zero) to avoid tooth thickness variations.

Fig. 7
figure 7

Polynomial ease-off used to drive the optimization, represented in the \(zR\) axial plane of the blank. The optimization variables \(\mathbf{c}\) represent the nodal values of the quartic element employed to describe the ease-off

After sampling the polynomial over a \(11\times 22\) grid of \(N=242\) points, the ease-off values are mapped onto the \(zR\) axial plane of the tooth flank, as in Fig. 7.

The ease-off \(h_{i}\) at the \(i\)-th point of the grid then defines a target grid point according to the following equation:

$$\mathbf{p}_{i}^{*}=\mathbf{p}_{g}(\mathbf{x},\boldsymbol{\zeta}_{i})+h_{i}\mathbf{n}_{g}(\mathbf{x},\boldsymbol{\zeta}_{i}),\;(i=1,\dots,N)$$
(29)

This target grid is then used to identify the machine-tool settings \(\mathbf{x}^{*}\) that generate the best-fit envelope surface, as described in the following section.

3.2 Machine-Tool Settings Identification

In a previous study [21], we demonstrated the benefits of using a sparse formulation variant of [33] to describe the nonlinear programming (NLP) problem of identifying the machine-tool settings required to obtain a given tooth flank modification. This approach offered benefits in terms of both robustness and computational efficiency. The computational efficiency was so remarkable that it allowed the identification step to be integrated directly into the optimization loop, immediately preceding the LTCA simulation.

We briefly introduce here the algorithm formulation. The identification is framed as the following constrained nonlinear least-squares problem (NLP):

$$\begin{aligned}\displaystyle&\displaystyle\min_{\mathbf{x},\boldsymbol{\zeta},\mathbf{h}}{\,\frac{1}{2}\mathbf{h}^{T}\mathbf{h}}\\ \displaystyle&\displaystyle\text{s.t. }\begin{cases}\dots\\ \mathbf{p}(\boldsymbol{\zeta}_{i},\mathbf{x})+\mathbf{n}(\boldsymbol{\zeta}_{i},\mathbf{x})h_{i}-\mathbf{p}_{i}^{*}=0\\ f(\boldsymbol{\zeta}_{i},\mathbf{x})=0\\ \dots\end{cases}(i=1,\dots,N)\end{aligned}$$
(30)

where:

  • \(\mathbf{x}=\begin{bmatrix}\mathbf{x}_{t}&\mathbf{x}_{m}\end{bmatrix}^{T}\) is the vector of machine-tool settings;

  • \(\boldsymbol{\zeta}_{i}=\begin{bmatrix}\xi_{i}&\theta_{i}&\varphi_{i}\end{bmatrix}^{T}\) is the triplet at the \(i\)-th point (of \(N\)) of the pinion tooth surface;

  • \(\mathbf{h}\) is the vector of the residuals (deviations) between the basic flank surface and the target surface; \(h_{i}\) defines the \(i\)-th component (positive for material removal);

  • \(\mathbf{p}(\boldsymbol{\zeta}_{i},\mathbf{x})\) and \(\mathbf{n}(\boldsymbol{\zeta}_{i},\mathbf{x})\) are the generic position and normal vectors of the family of the tool surfaces during motion;

  • \(f(\boldsymbol{\zeta}_{i},\mathbf{x})\) is the equation of meshing associated with the \(i\)-th point.

The CasADi framework [34] is used to create efficient symbolic expressions for the NLP problem and its gradients. The interior-point algorithm IPOPT [35], directly interfaced with CasADi, is employed as back-end solver.

3.3 MOO problem

The micro-geometry design is formulated as a multi-objective optimization problem according to the method described in [36]:

$$\min_{\mathbf{c}\in\mathbb{F}}{\mathbf{f}(\mathbf{x}^{*}(\mathbf{c}))},$$
(31)

where \(\mathbf{f}=\begin{bmatrix}f_{1}&f_{2}\end{bmatrix}^{T}\) is the vector of the two nonlinear objectives that need to be simultaneously minimized, and \(\mathbf{c}\) are the polynomial coefficients introduced previously. In the present paper, the trade-offs between NVH performances and efficiency losses are sought. Therefore:

  • \(f_{1}(\mathbf{x}^{*}(\mathbf{c}))=\text{RMS}\left(H_{\text{LTE}}(\mathbf{x}^{*}(\mathbf{c}))\right)\), root mean square of the first three harmonics of the loaded transmission error (LTE) (in \(\upmu\)rad);

  • \(f_{2}(\mathbf{x}^{*}(\mathbf{c}))=\text{e}_{\text{L}}(\mathbf{x}^{*}(\mathbf{c}))\), efficiency loss (in percentage points).

The following nonlinear constraint needs to be satisfied:

$$\begin{aligned}\displaystyle g_{1}(\mathbf{x}^{*}(\mathbf{c}))&\displaystyle=\max\bigl(p_{ \textrm{root}},p_{ \textrm{tip}},p_{ \textrm{heel}},p_{ \textrm{toe}}\bigr)-p_{ \textrm{edge}}\leq 0\\ \displaystyle g_{2}(\mathbf{x}^{*}(\mathbf{c}))&\displaystyle=p_{\text{max}}-p_{\text{tol}}\leq 0\end{aligned}$$
(32)

The constraints ensure that edge-loading is avoided by specifying that the contact pressure values at the tooth edges (root, tip, heel, and toe) must not exceed a certain threshold \(p_{ \textrm{edge}}\). The maximum contact pressure is not accounted for in the objective vector. However, to take account of the material limitations in terms of pitting load carrying capacity, its maximum value is restrained by the tolerance \(p_{\text{tol}}\) in the second nonlinear constraint of Eq. (32).

To guarantee manufacturability of the tooth flank modifications, the previously described identification is performed before the LTCA step. This is highlighted by the dependency of the objectives and constraints on \(\mathbf{x}^{*}(\mathbf{c})\): these are the identified machine-tool settings \(\mathbf{x}^{*}\) associated with the ease-off represented by specific \(\mathbf{c}\) values. As a matter of fact, while the optimization is guided by the polynomial coefficients \(\mathbf{c}\), only the identified machine-tool settings \(\mathbf{x}^{*}\) are fed back to the LTCA software to re-generate the geometry and to carry out the analysis, at each iteration.

The approach described in [36] is adopted to reformulate the problem as a single-objective optimization problem via a scalarizing achievement function \(s:\mathbb{R}^{3}\times\mathbb{R}^{3}\rightarrow\mathbb{R}\) and to enforce the constraints through an exact penalty formulation:

$$\min_{\boldsymbol{c}\in\mathbb{B}}{s(\boldsymbol{f}(\mathbf{x}^{*}(\mathbf{c}));\tilde{\boldsymbol{f}})+\mu\sum_{k=1}^{2}\left(\max{(g_{k}(\mathbf{x}^{*}(\mathbf{c})),0)}\right)}$$
(33)

where \(\vec{\tilde{f}}\) is the selected reference point and \(\mu\) is the penalty coefficient. The design variables are bounded to a reasonable set \(\mathbb{B}\) to better guide the optimization solver. The following non-differentiable achievement function is employed:

$$\begin{aligned}s(\vec{f}(\mathbf{x}^{*}(\mathbf{c}));\tilde{\vec{f}})=\max{\left(\frac{f_{i}(\mathbf{x}^{*}(\mathbf{c}))-\tilde{f}_{i}}{f_{i}^{ \textrm{(nad)}}-f_{i}^{ \textrm{(id)}}}\right)}+\rho_{\infty}\sum_{i=1}^{2}\left(\frac{f_{i}(\mathbf{x}^{*}(\mathbf{c}))-\tilde{f}_{i}}{f_{i}^{ \textrm{(nad)}}-f_{i}^{ \textrm{(id)}}}\right)\end{aligned}$$
(34)

In Eq. (34), \(f_{i}^{ \textrm{(id)}}\) and \(f_{i}^{ \textrm{(nad)}}\) represent the ideal and the nadir values of the objectives. They are respectively the lower and the upper bounds of the Pareto front and can be obtained by minimizing the two objectives individually. Importantly, they allow to properly normalize the objectives. A small augmentation parameter \(\rho_{\infty}\) (usually \(\approx 10^{-4}\)) is used to avoid obtaining weakly Pareto-optimal solutions.

Compared to the more traditional weighting methods used in similar optimizations, Eq. (34) allows non-convex portions of the Pareto front to be explored. For further details, the reader is referred to [36].

It is important to remark that the optimization methods proposed here revolve on simulating only the nominal operating conditions in terms of speed, torque and misalignments. Different load cases may significantly steer the corresponding misalignments which can result in sub-optimal meshing conditions. To properly account for different operating loads, further steps are necessary, as highlighted for example in [37].

4 Numerical test case

The spiral bevel gear set used in this study is described in Table 2, which provides the main data about its macro-geometry. The optimization focuses on the operating conditions of the drive side and, as such, the ease-off functions are utilized to modify the pinion’s concave flank.

Table 2 Macro-geometry data for the spiral bevel gearset under study
Table 3 Initial machine-tool settings for the conjugate pinion

Before proceeding to the optimization, the machine-tool settings are derived to define a virtual conjugate gearset, which operates under the specified misalignments, according to the methods outlined in [21]. The machine-tool settings that are associated with the concave flank of the conjugate pinion are listed in Table 3.

The optimization variables are the \(\mathbf{c}\) coefficients described previously. The following machine-tool settings are selected to identify manufacturable tooth flank modifications prior to the LTCA step:

  • all the tool settings excluding: edge radius, Toprem angle, Flankrem angle (see Fig. 5);

  • the following zero-th degree coefficients: \(R_{0}\), \(q_{0}\), \(E_{0}\), \(S_{0}\) and \(D_{0}\);

  • modified roll coefficients up to the 4‑th order: \(m\), \(C\), \(D\), \(E\);

  • modified radial motion coefficients up to the 4‑th order: \(R_{1}\), \(R_{2}\), \(R_{3}\), \(R_{4}\);

  • modified helical motion coefficients up to the 4‑th order: \(H_{1}\), \(H_{2}\), \(H_{3}\), \(H_{4}\).

Since the ease-off function used for optimization is a quartic polynomial, using machine settings associated with higher-degree motions may not be worthwhile.

The constraints are defined to enforce the maximum contact pressure to be below 1600 MPa (i.e., \(p_{\text{tol}}=1600\) MPa) and below 450 MPa at the tooth edges (i.e., \(p_{\text{edge}}=450\) MPa).

4.1 Single objective minimizations

The two single-objective minimizations are carried out first. This step is fundamental when dealing with objectives with different units of measurements and/or different scales.

The solution obtained by minimizing the efficiency loss \(\text{e}_{\text{L}}\), shown in Fig. 8, features a contact pattern (Fig. 8a) that is localized around the trace of the instantaneous axis of rotation on the tooth surface. Further localization of the contact patterns is limited by the maximum allowable contact pressure \(p_{\text{tol}}=1600\) MPa. It is worth noting that a peak value \(p_{\text{max}}=1595\) MPa was registered, which indeed complies with the introduced pressure constraint.

Fig. 8
figure 8

Optimal solution corresponding to the minimum of \(\text{e}_{\text{L}}\). a Contact pressure pattern: the sliding velocity pattern is overlayed to highlight the concentration of contacts near the instantaneous axis of rotation. b Optimal manufacturable ease-off (achieved via new machine-tool settings). c Residual error after identification of the polynomial ease-off

Fig. 8b shows a contour plot of the optimal manufacturable ease-off, defined as \(\mathbf{x}^{*}(\mathbf{c})\), at the solution. In Fig. 8c, the difference between optimal ease-off associated with \(\mathbf{c}\) and optimal manufacturable ease-off \(\mathbf{x}^{*}(\mathbf{c})\) (after machine-tool setting identification has been completed) is illustrated. The picture remarks the importance of the embedded identification step (see Fig. 4). If the identification process had been performed post-optimization as a secondary step, the discrepancy of the registered error in the range of 5 to 15 micrometers could have significantly affected the resultant contact pattern, due to the potential deviation of the identified parameters from the optimal values obtained through the optimization process.

It is worth also noting that, despite minimizing energy losses, the \(\text{e}_{\text{L}}\) solution is associated with a higher friction coefficient (see the lower \(\Lambda\) values in Fig. 11) and possibly with a higher risk of pitting caused by an increased chance of contacts among surface asperities.

On the contrary, as shown in Fig. 9a, the solution minimizing LTE is spread over the whole tooth surface. In this case, the requirement of minimizing LTE does not map distinctively into a predictable contact pattern shape. This brings about the chance that multiple contact patterns share the same performance in terms of LTE minimization with the associated drawback of multiple local minima. Fig. 9b and c illustrate again the importance of the embedded identification step to minimize the risk of potential deviations when performing machine-tool setting identification as an afterthought.

Fig. 9
figure 9

Optimal solution associated to the minimum of \(\text{RMS}\left(H_{\text{LTE}}\right)\). a The contact pressure pattern is spread over the whole tooth surface. The requirement of minimizing the LTE does not map distinctively into a predictable contact pattern shape. b Optimal manufacturable ease-off (achieved via new machine-tool settings). c Residual error after identification of the polynomial ease-off

The significant disparity in frictional losses between two single-objective solutions lies in the different averaged sliding velocities, which are calculated by taking a weighted average with respect to the local contact pressure over the instantaneous contact zones. This is highlighted in Fig. 10.

Fig. 10
figure 10

Comparison of the averaged sliding velocities of the contact zones over the meshing cycle between the two single-objective solutions.

Fig. 11
figure 11

Comparison of the lambda ratio over the meshing cycle between the two single-objective solutions

4.2 Pareto front exploration

As a last step, we demonstrate that the proposed framework allows to systematically explore distinct multi-objective optimal solutions by simply selecting different values for the reference point \(\tilde{\mathbf{f}}\). Fig. 12 displays the explored Pareto front, which includes the solutions from the previous single-objective optimizations, labeled \(P_{1}\) and \(P_{6}\), as well as four additional Pareto-optimal solutions, namely the points \(P_{2},\ldots,P_{5}\). Quantitative values of the solutions are provided in Table 4.

Table 4 Detailed properties of the found Pareto-optimal solutions

Notably, Pareto solutions \(P_{2}\) and \(P_{3}\), despite belonging to a concave region of the front, have been correctly determined by our procedures. This clearly indicates the advantage of the employed scalarized achievement function defined in Eq. (34) rather than the typical weighting method, which would have been unable to obtain such solutions. Moreover, solution \(P_{5}\) is nearly dominant over \(P_{6}\), with a significant reduction of \(\approx 0.135\%\) in efficiency losses (from \(0.890\%\) to \(0.755\%\)) and only a negligible increase in \(\text{RMS}(H_{\text{LTE}})\) (from \(0.76\,\mu\)rad to \(1.23\,\mu\)rad).

It is worth noting that simultaneous minimization of maximum contact pressure (\(p_{\text{max}}\)) and efficiency losses (\(\text{e}_{\text{L}}\)) would have resulted in a qualitatively similar Pareto front, indeed confirming their conflicting nature. Intuitively, contact pattern localization to minimize efficiency losses leads inevitably to higher maximum contact pressure. However, the same cannot be concluded when considering \(\text{RMS}(H_{\text{LTE}})\) and \(p_{\text{max}}\). Their correlation appears to be more subtle and unpredictable as the minimum of the \(\text{RMS}(H_{\text{LTE}})\) is not necessarily associated with conjugate mating surfaces, since elastic tooth contact and bending deflections play a significant role.

Fig. 12
figure 12

Scaled Pareto front for the two conflicting objectives \(\text{RMS}\left(H_{\text{LTE}}\right)\), root mean square of the first three harmonics of the loaded transmission error (LTE) (\(\upmu\)rad), [vertical axis] and \(\text{e}_{\text{L}}\), efficiency loss (\(\%\)), [horizontal axis]

If preventing gear pitting is a major concern, \(P_{6}\) could be the best solution, although it presents significant efficiency losses. Solutions \(P_{2}\) to \(P_{5}\) provide a good compromise between efficiency, LTE, and contact pressures. However, \(P_{1}\) may pose an increased risk of gear pitting, but it may be the optimal solution for gears with very good surface finish and advanced heat treatments, since it maximizes efficiency.

5 Conclusions

In this paper, an automatic methodology to explore Pareto-optimal designs of bevel gears has been presented. Concurrent minimization of two conflicting objectives, namely NVH level and frictional losses, was demonstrated. In the first part, a semi-empirical model to estimate frictional power losses under elasto-hydrodynamic lubrication was described, which has been coupled to a state-of-the-art loaded tooth contact analysis tool to obtain accurate predictions of the instantaneous load during meshing. The automatic design relied on a multi-objective optimization approach, where the design variables are represented by few coefficients of a polynomial basis employed to define the tooth flank ease-off topography. Manufacturability constraints were considered by a properly devised mechanism that allows to project the polynomial modifications onto the feasible set of the machine-tool envelopes. The frictional losses were evaluated with the aforementioned model, whereas the NVH level was measured by the loaded transmission error, directly available from the simulation tool. The contact pressures, limited by the material properties, were properly included as nonlinear constraints in the optimization problem from the outset. Application to a test case involving the design of a spiral bevel gearset has revealed that our methodology allows the designer to find multiple Pareto-optimal solutions in a systematic and automatic manner. We believe that the availability of a tool that can present a range of solutions, enabling the designer to choose the most appropriate one based on the unique requirements of the given application, represents a significant advancement in the design of robust, low-noise, and highly efficient gears.

Future research will be devoted to obtaining robust solutions which also account for multiple operating loads and corresponding misalignments.