1 Introduction

The development in the field of wind energy has been characterized by the installation of new wind turbines with increasing rated power and the repowering of old turbines and wind farms [1,2,3]. As a result, the load situation in the drivetrain components in a wind turbine (WT) is intensifying [3]. Higher towers and larger rotor diameters result in higher power as well as higher torsional loads on the WT drivetrain. The torsional load is superimposed by the aerodynamic turbulence, which is characterized by both the tower height and location dependencies [4, 5]. Thus, the load situation within the drivetrain has a time-variant and unpredictable dynamic, which has a strong influence on the utilization of individual components. This results in deviations between the design load spectra and the actual load spectra occurring in operation [5, 6]. Such deviations could cause faster damage accumulations in drivetrain components, resulting in unexpected maintenance and increased downtime [5].

Additionally, the limited availability of space for wind farms due to logistical problems or social restrictions gave rise to a trend to increase the number of WTs per wind farm [5]. As a result, investigations showed that the operating conditions of WTs within a wind farm can be strongly location dependent [5, 7]. For example, WTs of the same type within a wind farm can have uneven overall utilization and component utilization [5]. Therefore, with the development of larger WTs and more densely packed wind farms, the need for monitoring the load situation within the WT drivetrain rises in order to gain a more accurate impression on component utilization and reduce the likelihood of unplanned downtime.

One of the drivetrain components that lead to the longest downtimes and most costly repairs is the gearbox [8, 9]. Condition Monitoring Systems (CMS) are used to detect potential faults or initialized damages of gearbox components. These systems collect gearbox condition data and operating data, which are then evaluated and assessed. Although current CMSs are able to provide evaluation variables for the condition of the transmission and to detect possible damage at an early stage, currently deployed systems are not capable of measuring the input transmission mechanical loads [10]. For example, several investigations aimed to develop wind turbine CMS based on measurements of the low-speed shaft torque (hereinafter referred to as “shaft torque”) [11,12,13]. However, developed systems for measuring shaft torque are yet to be deployed in new turbines [10]. This is mainly due to the high cost and low practicality of the measurement equipment needed for direct load measurements on the low-speed shaft [10]. However, the high potential of continuously monitoring transmission input loads such as shaft torque maintain the need for an affordable measurement system for measuring such loads. Therefore, this investigation aims to develop a prototype of a low-cost virtual sensor system for measuring transmission input loads in all 6 degrees of freedom (6-DOF).

The problem of measuring parameters of interest that are too costly or unpractical to directly sense requires novel sensor solutions to cost-effectively infer such parameters from alternative sources of information. With the recent developments in the machine learning field, virtual sensing has demonstrated high potential to bring down the cost and enhance the practicality of measurement systems in a variety of applications [14,15,16,17,18]. A virtual sensor allows a measurement system to indirectly measure a parameter of interest using a set of sensor values. The motivation is typically the significant difference in the cost of implementing the virtual sensor in comparison to directly measuring the parameter(s) of interest. With the help of machine learning (ML) techniques, virtual sensor algorithms can be trained using time-series data when both the parameter of interest (target variables) and the sensor values from a set of affordable sensors (predictor variables) were recorded. Thus, measurements of the parameter of interest, which is costly to directly measure, are still required to develop the virtual sensor algorithm. Further, it is also beneficial to iteratively experiment with different sensor setups (e.g. different measured parameters, different sensor locations) to identify the optimal set of affordable sensors to feed the virtual sensor algorithm. In this investigation for example, collecting the needed data to train a 6-DOF virtual sensor of transmission input loads would entail a resource-intensive measurement campaign of an operational WT drivetrain. In addition, equipping such a turbine with the necessary measuring equipment would be a difficult task to repeat when a different sensor configuration is to be experimented with during the prototyping phase of the envisaged virtual sensor. Therefore, reducing the resources required to prototype the envisaged virtual sensor system during the early development phase is necessary.

Time-series computer simulations, specifically MBS, offer a flexible platform capable of supplying the needed data to develop an ML-based virtual sensor. This paper will demonstrate a process for prototyping a virtual sensor for estimating 6‑DOF WT transmission input loads using data obtained from a WT MBS model experiencing simulated wind fields. Simulated wind fields according to IEC 61400‑1 [4] covering a variety of nominal wind speeds and load scenarios were generated and applied to an MBS model of a Vestas V52 wind turbine following a procedure outlined in Sect. 2.1. The turbine contains a high-speed drivetrain with 4‑points main bearing arrangement, a common drivetrain configuration in the field. Foreseen in the project for comparison reasons is the design and test of a newly built three point main bearing arrangement for the V52. The simulation was used to generate time-series data of the target (output) and predictor (input) parameters for the development of the virtual sensor algorithm, in this case an ANN model. After the ANN was trained using the time-series data collected from the MBS, the developed prototype of the virtual sensor algorithm was tested by comparing the estimated 6‑DOF transmission input loads from the ANN to the simulated 6‑DOF transmission input loads from the MBS. Fig. 1 provides an overview of the prototyping process leading to a trained and tested ANN-based virtual sensor algorithm.

Fig. 1
figure 1

Flow Chart of Virtual Sensor Development using MBS and ANN models

To summarize, several investigations aimed to develop algorithms for directly or indirectly measuring WT transmission input loads [10,11,12,13, 19]. On the one hand, the main obstacles for direct measurement solutions have been the low practicality and high costs of the needed sensor equipment [10]. On the other hand, virtual systems targeting the indirect measurement path have been hindered by the critical need for WT data in the form of measurements collected over different operating conditions in order to design and calibrate the envisaged systems [10]. This investigation addresses these needs by not only developing algorithms to perform virtual sensing of transmission input loads using practical measurement equipment, but also presenting a process to collect the needed data without performing an extensive measurement campaign, typically required to collect such data [10]. Therefore, this paper presents a holistic approach to the prototyping and development of a sensor system for WT transmission input load estimation which mitigates the main obstacles faced by previous investigations in this domain and brings this technology closer to field deployment.

This paper is organised as follows. Chap. 1 discusses the methods used in this investigation. Sect. 2.1 outlines the modelling and simulation approach, and the resulting data is described in Sect. 2.2 followed by an explanation of the data pre-processing steps in Sect. 2.3. Sect. 2.4 presents the data analysis performed, and Sect. 2.5 elaborates on considerations and challenges in practical implementation of the proposed method. Chap. 3 presents the results of the investigation followed by a discussion of the results in Chap. 4. The conclusions are outlined in Chap. 5.

2 Methods

2.1 Modelling and simulation approach

In order to generate time-series simulated data for training the envisaged virtual sensor algorithm, a model of the Vestas V52 turbine was constructed in a virtual MBS environment and subjected to simulated wind fields in accordance with the IEC 61400‑1 standard [20]. Fig. 2 outlines the approach followed to perform the simulation and collect the simulated data. The main purpose of the simulation was to generate time-series simulated data covering the predictor and target parameters for the envisaged virtual sensor algorithm. Therefore, the level of detail in the simulation was tuned to ensure those parameters are accurately captured and to limit the error in the resulting data, e.g. force and torque estimations. Since the 6‑DOF transmission input loads represent the target parameters, the rotor blades were first modelled in finite elements (FE) and then modally decomposed to the MBS environment [21]. Similarly, the tower was modally decomposed via the Craig-Bampton method and implemented in the MBS [22]. As the deflections and misalignments of the torque arms in all 6‑DOF were also of interest as predictor parameters for the virtual sensor algorithm, the main shaft as well as the main frame and all pins within the gearbox and the planet carrier were modelled as flexible bodies in the MBS. In order to maintain the speed of the time-series simulation while covering the main deformation behavior of the flexible elements, the following procedure was applied. First, CAD model of elements (e.g. planet carrier) were simplified and transferred to FE environment. After that, the simplified CAD models were meshed using the FE method. The FE models were then modally decomposed to the MBS environment with the goal of covering the main eigenfrequencies and deformation behavior. The torque arms were simplified with a spring-damper force element. Similarly, the generator was connected to the main frame using spring-damper force elements. The stiffnesses of the main bearing were calculated using the BearinX® software and implemented as one dimensional characteristic curves for radial and axial directions using force elements. Additionally, micro-level flaws, e.g. material and tolerance imperfections, in WT components were not incorporated in the simulations.

Fig. 2
figure 2

Modelling and Simulation Approach

The wind fields were simulated using Turbsim [23] and design load case calculations were performed via co-simulation with MATLAB. For those calculations, the AERODYN force element and a SIMULINK PI controller was used [24]. In order to demonstrate a proof of concept of the proposed method, three nominal wind speeds covering the cut-in, rated, and cut-out wind speeds of the Vestas V52 turbine were chosen for the wind field simulations in this investigation. Design load cases 1.1 and 1.2 were used to simulate six wind fields with a normal turbulence model and power production as the design condition. The wind fields covered two random seeds for each of the following nominal wind speeds: 3, 12, and 25 m/s. Each field was simulated for 600 s.

2.2 Data description

The predictor and target parameters were collected from the simulations for use as input and output variables, respectively, for the ANN-based virtual sensor algorithm. The collected target variables are the 6‑DOF transmission input loads, and the predictor parameters are listed in Table 1. The orientations of the x, y, and z axes with respect to the WT drivetrain are as illustrated in Fig. 1. In Table 1, wind speed refers to the average wind speed at hub height for each time step during the simulations. The hub height was chosen for the reason that wind speed measurement during operation is typically performed at nacelle height. In this investigation, the three blade pitch angles were identical. However, the decision to include three input blade pitch angles for the virtual sensor algorithm was made to enable training the algorithm on scenarios where offsets between pitch angles may be present. The sampling frequency of the parameters was 200 Hz. Therefore, for each wind field, the MBS simulation generated a dataset containing 120,000 data points for 28 variables.

Table 1 Predictor Parameters Collected from MBS Wind Turbine Model

The data pre-processing steps implemented before training the ANN-based virtual sensor algorithm will be detailed in Sect. 2.3.

2.3 Data pre-processing

Before the collected data could be used to train and test the envisaged ANN model, certain preprocessing steps are necessary. Firstly, the data needed to be normalized to improve the speed of convergence when training the ANN [25]. This was performed on the data belonging to each variable in the datasets using the following formula:

$$Z=\frac{X_{\left(i\right)}-u}{\sigma }$$
(1)

Where X, u, and σ represent the sample, mean of sample, and standard deviation of sample, respectively. The available data was then distributed among 3 datasets: training, validation, and testing datasets. The training set contained 45% of the data, while the validation and testing sets contained 5 and 50%, respectively.

2.4 Data analysis

After the data was preprocessed and split into the three datasets to be used for training, validating, and testing the ANN-based virtual sensor, the data analysis could be performed to reach the objective of this investigation. The Python programming language was used to perform the data analysis in this paper [26].

Feed-forward ANN models were trained to estimate the 6‑DOF transmission input loads collected during the time-series simulations, taking the predictor parameters, specified in Table 1, as input. The pre-processed data consists of 720,000 data points from six wind field simulations. Six ANNs were developed estimating one of the 6‑DOF transmission input loads. As shown in Fig. 3, the process used to develop the ANN models started with an initial construction or architecture of the neural network which was iteratively improved through the process of hyperparameter tuning. The goal of hyperparameter tuning is to iteratively improve the accuracy of the trained model, when validated against the validation dataset, by altering the model’s architecture. Some of the most influential hyperparameters on feed-forward ANN performance are the number of hidden layers, the number of nodes per layer, the types of activation functions, the type and parameters of regularization, the type of loss function, and the parameters of the optimizer function. For the training phase, the investigator must also decide parameters such as batch size and number of epochs of training which also highly influence the performance of the resulting ANN model. The hyperparameters were individually tuned for the six ANN models developed during this investigation until accuracy ceased to improve.

Fig. 3
figure 3

Process Diagram for ANN Model Development

Table 2 lists some of the hyperparameters used in this investigation for training the top performing ANN models. The rectified linear unit (ReLu) [27] activation function was used after each hidden layer in all models due to its superior performance compared to activation functions such as the Sigmoid or the hyperbolic tangent (Tanh) activation functions [28, 29]. In addition, dropout layers were used during training as a countermeasure against overfitting. Dropout layers randomly switch off neurons, or nodes, at each update during training by setting the output value of the affected nodes to zero [30]. The adaptive moment estimation (Adam) [31] was the optimization algorithm used to update the weights of the models during training to improve model accuracy, and the learning rate used for training the models was 0.001.

Table 2 Hyperparameters of Developed ANN Models for 6‑DOF Load Estimation

During the testing phase, the developed models were tested on the previously unseen data from the test dataset. The resulting model estimations were evaluated using several metrics in order to thoroughly assess the errors of their estimations with respect to each of the 6‑DOF transmission input loads. To give an impression of the absolute error in the respective load units, Eq. 2 [32] was used to calculate the mean absolute error (MAE) for each input load.

$$MAE=\frac{\sum _{i=1}^{n}\left| y_{i}\right.-\left.\hat{y}_{i}\right| }{n}$$
(2)

Where \(y_{i}\), \(\hat{y}_{i}\), and n represent the ith expected value, the ith estimated value, and the number of values in the dataset containing the target parameter, respectively.

In order to put the error into context, it is necessary to compare the deviations between estimations and expected values to the typical behaviours, or in this case, loading scenarios present in the available data. Due to the importance of this step in interpreting the resulting model errors, a twofold approach was implemented to investigate the relative errors of the model estimations. Firstly, the coefficient of determination, R2, was calculated for each of the 6‑DOF loads by applying Eq. 3 to the target values from the test dataset and model estimations [33].

$$R^{2}\left(y,\hat{y}\right)=1-\frac{\sum _{i=1}^{n}\left(y_{i}-\hat{y}_{i}\right)^{2}}{\sum _{i=1}^{n}\left(y_{i}-\overline{y}\right)^{2}}$$
(3)

Where \(\overline{y}\) is the mean of the expected values of the target parameter, i.e. \(\overline{y}=\frac{1}{\mathrm{n}}\sum _{i=1}^{n}y_{i}\).

As can be read in Eq. 3, the error is calculated in relation to the term in the denominator which includes the expected value as well as the mean of the expected values of the target parameter from the test dataset.

A second approach to relative error was developed and implemented in order to provide more transparency and further evaluate the performance of the models. The approach begins with an extraction of relevant load statistics from the simulated loads at rated wind speed. The extracted statistics are minimum, maximum, and average values of the respective loads for each of the 6‑DOF transmission input loads. Due to the nature of wind turbine input loads, a nuanced approach was needed in order to select the relative statistic for calculating relative error. For example, forces along the y axis (Fy) typically fluctuate around an average value close to zero. Thus, comparing the calculated MAE to the average value of Fy would result in an inflated error figure. Similarly, always selecting the maximum values of the 6‑DOF loads to compare to the absolute error of the estimated loads would arguably result in an exaggerated impression of the skill of the models since it would result in relatively low errors. In addition, scaling the absolute error against the expected value for each data point as shown in Eq. 4 [34, 35] would result in extremely high error values when the expected value is zero.

$$\text{RMSRE}=\sqrt{\frac{1}{n}\cdot \sum _{i=1}^{n}\left(\frac{\hat{y}_{i}-y_{i}}{y_{i}}\right)^{2}}$$
(4)

Therefore, a dedicated approach was developed to tackle these issues and calculate relative error using Eq. 5:

$$\text{Load-Relative Error }(\mathrm{LRE})=\frac{\sum _{i=1}^{n}\left| y_{i}\right.-\left.\hat{y}_{i}\right| }{n\cdot \text{ median}\left(\left| y_{\max }\right| ,\left| y_{\min }\right| ,\left| \overline{y}\right| \right)}$$
(5)

Where \(y_{\max }\) and \(y_{\min }\) are the maximum and minimum values of the target parameter, respectively.

By scaling the MAE against the median of the three statistics (minimum, maximum, average), an error can be calculated relative to a simple statistic representing the loads at rated wind speed while addressing the aforementioned issues with comparing error to only one statistic. As far as we know, this method of calculating relative error was developed during this investigation, inspired by the nature of the 6‑DOF transmission input loads and the challenges therein. By using Eq. 6, a load-relative score (LRS) can be calculated comparable to the R2 score:

$$\text{Load-Relative Score}\,\left(\mathrm{LRS}\right)=1-\mathrm{LRE}$$
(6)

2.5 Considerations and challenges in practical implementation

In this investigation, simulation data is used to train ANN-based virtual sensor algorithms; therefore, the accuracy of the developed virtual sensor is a function of the validity of the simulation model and its resulting data. Since absolute validity of a simulation model is refuted by most experts [36,37,38,39], the validity of a simulation model is only defined within the limits of the project and its intended application [40]. In other words, validation of a simulation model attempts to answer the following question: Is the model sufficiently accurate for its intended application? Indeed, the sufficiency of model accuracy from the perspective of its intended application is often used to define model validity [37, 41,42,43]. Different approaches to analyse the validity of simulation models have been proposed in literature with varying levels of comprehensiveness and associated costs [40]. The following are some examples:

  • Micro-level examination and verification by system designers [41].

  • Comparisons to real-world performance of the modelled system [41].

  • A flow analysis of conserved quantities, such as heat and kinetic energy, through the model [44].

  • Subjective assessment by the model development team on whether a model is valid based on results obtained during model development process [36].

When implementing the methodology proposed in this paper, investigators must gain a clear understanding of what is considered sufficient model accuracy with respect to the intended application of their envisaged virtual sensor. The degree of detail of the simulation model and the comprehensiveness of the validation analyses must fit the demands of the application. Therefore, the necessary quality of the MBS model for implementing the proposed method depends heavily on the intended application. Resource limitations must also be considered and an assessment needs to be made on whether the proposed method is a viable option for the intended application given the available resources for developing and validating the simulation models. It may be more affordable to develop the envisaged virtual sensor using data from real world experiments on the actual system as opposed to developing simulation models to provide sufficiently accurate data. However, this approach may become prohibitively costly if it would be a goal to investigate different potential sensor setups (e.g. different locations or types of sensors). This is due to the high cost of instrumenting a WT with all candidate sensor setups and performing sufficient experimentation with each setup. The proposed method would be beneficial in investigations which rely on real world measurement data to train the final virtual sensor algorithms by allowing for more affordable screening of potential sensor setups prior to costly real world experimentation.

Taking this project as an example, it is planned in the future to instrument and test two drivetrain configurations of the Vestas V52 as part of a measurement campaign aimed at further developing the envisaged 6‑DOF transmission input loads virtual sensor. This presents a valuable opportunity to design and test a predefined sensor setup on actual systems. However, it also presents a challenge to find the optimal sensor setup prior to the test since it would not be possible to modify the sensor setup during the test; the drivetrain would be less accessible and pausing testing would cost valuable testing time. Therefore, the ability to prototype and screen different virtual sensor systems, albeit with less accuracy as compared to real world testing, provides a valuable opportunity to identify suitable sensor types and locations for the resource-intensive and less flexible real world test.

Lastly, the choice of the Vestas V52 for the first application of the proposed method is linked to the availability of a Vestas V52 drivetrain for the planned testing. Due to the age of the V52, design differences exist between its design and the design of more modern WTs e.g. more lightweight construction. The ANN-based virtual sensors in the proposed method use deformations and misalignments as input to sense transmission input loads. Therefore, it is expected that the lightweight construction of more recent turbines would at least maintain and more likely enhance the sensitivity of those parameters to experienced loads. This also may necessitate the modelling of some components in higher detail to reflect the additional deformability of such components during operation. As mentioned earlier, an assessment of model validity during the model development phase is vital for a successful implementation of the proposed method. A screening of different sensor setups may also lead to improvements in the accuracy of the ANN-based virtual sensor algorithms.

3 Results

As mentioned earlier, six feed-forward ANN models were trained to estimate each of the 6‑DOF transmission input loads using the predictor parameters, listed in Table 1, as input. In addition to adjusting the number of nodes per layer, using the Rectified Linear Unit (ReLu) activation function and an Adaptive Moment Estimation (Adam) optimizer proved particularly useful in increasing model accuracy as well as training speed. The use of dropout layers was an effective measure against overfitting. After testing the developed models on the test dataset, the aforementioned error metrics were evaluated with respect to the model estimations of the target parameters. The calculated errors as well as the corresponding simulated loads statistics at rated wind speed are listed in Table 3. The median value selected for the calculation of the LRS for each transmission input load is in bold to facilitate interpretation. The Tensorflow framework was utilized to implement the ANN algorithms [45].

Table 3 Error Metrics of ANN Models Estimations & Simulated Loads Statistics at Rated Wind Speed

The load relative error distributions of the estimated transmission input forces and moments are also plotted in Figs. 4 and 5, respectively. The LRE of the ith estimated value for each target parameter was calculated using Eq. 7:

$$\mathrm{LRE}_{i}=\frac{y_{i}-\hat{y}_{i}}{\text{median}\left(\left| y_{\max }\right| ,\left| y_{\min }\right| ,\left| \overline{y}\right| \right)}$$
(7)
Fig. 4
figure 4

Load Relative Error Distributions for Estimated Transmission Input Forces

Fig. 5
figure 5

Load Relative Error Distributions for Estimated Transmission Input Moments

4 Discussion

The results demonstrate high potential of the presented method for prototyping virtual sensor algorithms and sensor setups for estimating 6‑DOF wind turbine transmission input loads. The mean absolute errors of the estimated forces and moments, listed in Table 3, are below the kN and kNm range, respectively, which corresponds to R2 scores as well as load relative scores above 0.9. Moreover, the load relative error distributions of the estimated loads shown in Figs. 4 and 5 demonstrate that the majority of estimations have an LRE close to zero. Therefore, the results of the developed virtual sensor prototype demonstrate the potential capability of feed-forward ANNs to estimate transmission input loads in a WT application without relying on costly, rotating measuring equipment such as telemetry systems. In addition, the flexibility offered by the MBS modelling environment used in this investigation can potentially reduce the resources needed for prototyping a virtual sensor for similar applications. The achieved results motivate further applications of the methodology presented in this paper in order to evaluate its transferability to other WT drivetrains and other wind fields based on different design load cases and wind speeds.

The predictor parameters, listed in Table 1, needed for the trained ANN models to estimate 6‑DOF transmission input loads in a WT can be sensed using stationary sensors as well as sensors from the typical SCADA system in WTs. Displacement and angular misalignment parameters can be measured using stationary distance sensors mounted on the main frame or the gearbox housing. Despite the significant potential of enabling WT CMS to use WT drivetrain input loads such as shaft torque, such systems are so far limited to research and development environments [10]. One of the main obstacles hindering the deployment of such systems is the high cost and impracticality of the required measurement equipment. Therefore, the developed models could reduce the cost and increase the practicality of shaft load measurement, potentially enabling field-deployed CMS to use those measurements for fault identification in the future [10].

5 Conclusion

In this investigation, ANN-based virtual sensor algorithms were trained using simulated data obtained from a model of the Vestas V52 WT subjected to simulated wind fields in an MBS environment. The main conclusions are as follow:

  1. 1.

    The developed Feed-forward ANN models demonstrate promising potential for this technology in estimating 6‑DOF WT transmission input loads using indirect, stationary measurement equipment as opposed to costly solutions, e.g. telemetry systems.

  2. 2.

    It is possible to collect the training data for developing an ANN-based virtual sensor for WT 6‑DOF transmission input loads using the simulation approach outlined in this paper.

  3. 3.

    The flexibility offered by the presented methodology for virtual sensor development has the potential of dramatically reducing the resources needed for prototyping and testing virtual sensor systems compared to physical testing.

  4. 4.

    Due to the promising results achieved using the presented methodology, the authors aim to further test the methodology with different boundary conditions, e.g. different WT drivetrain configurations and different design load cases.

  5. 5.

    The load-relative score (LRS) was developed and applied during this investigation as a method of assessing the 6‑DOF virtual sensor models estimation capabilities using selective scaling of a simple statistic representing the loads at rated wind speed.