Introduction

Cardiorespiratory fitness (CRF) has been extensively studied in recent years, as the increase in cardiovascular diseases (CVD) is an expanding problem worldwide (Laxmi et al. 2014). Low levels of CRF have been identified as a potential risk factor for CVD and all-cause mortality; moreover, substantial health benefits might be gained by improvements in CRF, which can be obtained by physical activity and exercise (Rapp et al. 2018; Ross et al. 2016). Therefore, CRF is now identified as an important marker of cardiovascular health and has even been recommended as a new vital sign by the American Heart Association (Ross et al. 2016). Cardiopulmonary Exercise Testing (CPET) allows assessment of CRF. The measurement of the maximum amount of oxygen uptake during exercise, referred to as the VO2max, is in particular relevant in evaluating an individual’s aerobic fitness (Mezzani et al. 2009; Rapp et al. 2018; Takken et al. 2019) and is considered the best measure of CRF and exercise capacity (Fletcher et al. 2013).

In physiological terms, the VO2max is the maximum level of oxygen uptake that can be consumed during exhausting exercise with large muscle groups (Fletcher et al. 2013; Herdy and Uhlendorf 2011; Mezzani et al. 2009). As a result, the VO2 level reaches a plateau phase. For this to happen, achievement of truly maximal effort is essential (Mezzani et al. 2009). An important component of the value of VO2max, that is up to 50%, is being established by hereditary capacity. However, there are several other determinants influencing the level of VO2max, in particular age and sex, although body size, exercise training habits, lifestyle, and cardiovascular status play a role, as well (Almeida et al. 2014; Fletcher et al. 2013; Guazzi et al. 2012, 2018; Herdy and Caixeta 2016; Kaminsky et al. 2013; Koch et al. 2009; Mezzani et al. 2009; Ross et al. 2016; Takken et al. 2019). For instance, it has been observed that levels of VO2max reach maximum values between the age of 15 and 30, and decrease progressively after that age. Furthermore, it has been found that males have higher levels of VO2max compared to females, supposedly because of differences in muscle mass, haemoglobin levels, and cardiac stroke volume (Dubowy et al. 2008; Fletcher et al. 2013; Guazzi et al. 2012; Kaminsky et al. 2013; Koch et al. 2009; Mezzani et al. 2009; Takken et al. 2019).

Determining the level of VO2max can be beneficial in different settings. For example, it can be used to assess the response to exercise training. Besides, the VO2max is convenient for evaluating CRF in patients with for instance heart or lung diseases as well as gauging their therapeutic efficacy and has been consistently determined a prognostic marker for pre-surgical risk. Furthermore, training intensity and training targets may be established by using percentages of VO2max, which can be useful for healthy individuals and athletes, as well (Fletcher et al. 2013; Guazzi et al. 2018).

The VO2max can be directly measured by incremental exercise using respiratory gas analysis, which is considered the golden standard (Almeida et al. 2014; Fletcher et al. 2013; Rapp et al. 2018; Ross et al. 2016; Takken et al. 2019). For the interpretation of a person’s VO2max, reliable reference values are extremely important. Due to the close correlation existing between the VO2max and both age and sex, it is critical to interpret these values using age- and gender-specific reference values (Rapp et al. 2018).

Worldwide frequently used age- and gender-specific reference values are proposed by Jones et al. in 1985 and Wasserman et al. in 2005. However, these reference values were obtained years ago and were based on, respectively, Canadian and American cohorts. Nevertheless, in the Netherlands and other European countries, these reference values are still widely used, but in clinical practice, these are considered quite low, especially for young adults. Hence, up-to-date and dependable reference values based on a Dutch cohort are highly needed. Furthermore, most research in this area has focused on specific age groups, for instance children, elderly, or adults in general. Only few studies have provided reference values for a broader range of ages (Takken et al. 2019).

The aim of this study is to analyse the interaction between the VO2max per kilogram (VO2max/kg) and both age and sex and develop reference values using these two determinants, based on the Dutch population. For this purpose, we applied a large apparently healthy Dutch/Flemish cohort with a broad age range, including children from the age of 6 years to adults of 65 years.

Methods

Study design

This study was carried out using existing data from the LowLands Fitness Registry. This database contains exercise testing data from 11 centres in the Netherlands and Belgium. These centres include Diving Medical Center Den Helder, Erasmus Medical Center Rotterdam, Hospital Jan Portaels Vilvoorde, In2Motion Sports Bureau, InspanningLoont Centre Utrecht, Isala Hospital Zwolle, Maxima Medical Center Veldhoven, Ministry of Defense Testing Center Soesterberg, Radboud UMC Nijmegen, St Anna Hospital Geldrop, and University of Applied Sciences Utrecht. These centres submitted de-identified exercise testing data to the LowLands Fitness Registry. The Medical Ethics Committee of the UMC Utrecht in the Netherlands has approved the study (protocol 16/167) (Van de Poppe et al. 2019).

Subjects

For this study, we used data from 4637 subjects from the Lowlands Fitness Registry of which known athletes, smokers, and subjects with a Body Mass Index (BMI) > 30 were already excluded. From these subjects, we excluded the participants who did not perform a maximum effort, as well. A maximum effort was determined as a respiratory exchange ratio (RER) of ≥ 1.0 (Kokkinos et al. 2018) and a minimum of 85% of the predicted maximum heart rate. We implemented Tanaka’s equation for the prediction of maximum heart rate: [208 – (0.7 × age)] (Tanaka et al. 2001). Relevant subject characteristics and data distribution of the study participants are listed, respectively, in Tables 1 and 2.

Table 1 Subject characteristics of the study populationa
Table 2 Data distribution in males and femalesa

Testing protocol

All exercise tests were performed using electromagnetically braked cycle ergometers. These ergometers came from distinctive manufacturers, that is Lode BV, Groningen, the Netherlands and Ergoline, Bitz, Germany (Van de Poppe et al. 2019). The VO2max was defined as the maximum amount of oxygen uptake during maximum effort measured around sea level and was expressed in ml min−1 kg−1. This value was measured with a calibrated respiratory gas-analysis system. These systems were also from distinctive manufacturers, in particular from Cortex Metalyzer, Leipzig, Germany; Carefusion, Hoghberg, Germany; Geratherm, Bad Kissingen, Germany; Cosmed, Rome, Italy; and Medisoft, Sorrines, Belgium. The equipment was calibrated before every exercise test in accordance with the manufacturer’s instruction (Van de Poppe et al. 2019).

Statistical analysis

Statistical analysis was performed using the SPSS Statistics software, version 25 (IBM Corp. Released 2017. IBM SPSS statistics for Windows, Version 25.0. Armonk, NY: IBM Corp.). Our data were modelled to create reference values using LMS Chartmaker Pro program™, version 2.54. This program works according to the LMS method, which is a way to summarize the distribution of the data as it changes via three curves representing the Lambda (L: the Box-Cox power describing the skewness), Median (M), and the generalised coefficient of variation (S). With a penalised likelihood function, the three curves were fitted as cubic splines using non-linear regression (Vamvakas et al. 2019; Van de Poppe et al. 2019). Hence, centile curves were created, representing the percentiles of P3 (lower limit of normal), P10, P25, P50 (median), P75, P90, and P97 (upper limit of normal). With Microsoft Excel version 2016, trend lines were added to the centile curves of the lower limit of normal, median, and upper limit of normal, describing the equation, R square (R2), and standard error of the estimate (SEE) of the reference values. For cross validation, the paired samples T test was performed in SPSS, to demonstrate the mean difference between the reference values and the actual measured data. p values below 0.05 were considered as significant.

Results

Subject characteristics

Data from 4612 participants were eligible for analysis, involving 3671 males and 941 females. Descriptive characteristics of the participants, divided in both age categories of 10 years and sex, are given in Table 1. Females had, as expected, lower weight and height than males. The measured values of VO2max/kg, maximum heart rate (HRmax), and RER are listed in Table 2, by the same division in age categories and sex.

Interaction with age and sex

Univariate linear regression analysis shows a negative significant effect of age on the VO2max/kg (p < 0.001) for males as well as females. Figure 1 illustrates this decline of the VO2max/kg with increasing age by applying centile curves. The mean value of VO2max/kg was 45.39 ± 8.33 ml min−1 kg−1 for males and 37.23 ± 8.71 for females. Consequently, the overall mean difference between males and females was 18.0%, ranging from 9.6% for 5–9-year old children to 23.5% for people above 60 years old.

Fig. 1
figure 1

VO2max (ml min−1 kg−1) according to age represented as P3, P10, P25, P50, P75, P90, and P97 in a males and b females

Reference values

Reference values for the VO2max/kg for both males and females were developed using the median (P50), the lower limit of normal (P3), and upper limit of normal (P97), and are depicted in Fig. 1. Table 3 displays the prediction equations with corresponding R2 and SEE values. In Table 4 the reference values are compared with several previously published values for different ages.

Table 3 Reference equations for the VO2max per kilogram
Table 4 Reference values for the VO2max per kilogram compared with previously published values for different ages

Cross-validation

Supplementary data of 3135 subjects, who were not included in the primary analysis, were applied for performing cross validation. These data were derived from additional exercise tests performed at Diving Medical Center Den Helder. The sample consisted of 3017 males and 138 females. Paired samples t test demonstrates a mean difference of 0.03 ± 6.59 ml min−1 kg−1 with a p = 0.781, in favour of the predicted VO2max for males. For females, there is a small but significant mean difference of -1.88 ± 5.67 ml min−1 kg−1 with a p < 0.001, due to higher measured than predicted values.

Discussion

The purpose of this study was to provide up-to-date reference values for the VO2max/kg by analysing the correlation of the VO2max/kg between both age and sex based on a Dutch and Flemish cohort. Data from the LowLands Fitness Registry were analysed, in which children and adults aged 6 until 65 who performed a maximal effort during exercise testing on a cycle ergometer were included.

Age showed a negative significant effect on the VO2max/kg (p < 0.001) among both sexes. Males showed higher levels of VO2max/kg with an overall mean difference of 18%. This difference is comparable to the literature. Using the composite equation of Cooper and Storer (Cooper and Storer 2001), the mean sex difference in VO2max is 16.6% at the age of 20. Our reference values were developed using the median (p50, centile curve). The lower limits of normal (p3) and upper limits of normal (p97) reference equation were established as well. These lower and upper limits are important to established normalcy of VO2max.

Cross-validation with data not included in the primary analysis showed no statistical difference between the predicted values and the actual measured values for males (mean difference 0.03 ± 6.59 ml min−1 kg−1, p = 0.781), which means that these reference values are reliable. For females, a statistical significance was found between the measured values and our reference values (− 1.88 ± 5.67 ml min−1 kg−1, p < 0.001), which means that our reference values predict lower values than the actual outcome. The problem might be the relatively small sample size of females available for cross validation, which is a 138 female subjects. Besides, a mean difference of 1.88 ml min−1 kg−1, although statistical significant, is small and might be therefore not of clinical relevance.

When comparing the reference values of this study to previously published values based on cycle ergometer exercise testing, it is remarkable that our reference values are higher. There are several explanations for this difference. First of all, the reference values are based on different geographic cohorts, Canadian (Jones et al. 1985), American (Wasserman et al. 2005), and German (the SHIP study, Koch et al. 2009), while our study is based on a Dutch and Flemish cohort. Physical and cultural differences between these cohorts may account for differences in the VO2max/kg. For instance, the VO2max tends to expand with increasing height, and the Dutch population is proven to be the tallest in the world (NCD Risk Factor Collaboration 2017). Besides, the VO2max shows a decline with increasing body mass index (BMI) (Koch et al. 2009) and mean BMI is lower in the Netherlands, compared to Canada, Germany, and the USA (26.1 compared to, respectively, 27.3, 27.4, and 29.1 kg/m2) (World Health Organization). In addition, since the VO2max is expressed in ml min−1 kg−1, a lower body weight will lead to a higher value of VO2max/kg. From a cultural perspective, the use of cycling in daily transfers is the most prominent in the Netherlands, compared to the rest of the world (Buehler and Pucher 2012). This may have led to an advantage in performing cycle ergometry. Furthermore, 2 of the 11 centres that submitted data to the Lowlands Fitness Registry have a military background, meaning that the physical condition of their subjects is likely to be above average, which is associated with a higher VO2max (Fletcher et al. 2013), and may have led to higher reference values, as well. Of interest to note is the non-linear decline in VO2max with age for both males and females. Traditionally, regression equations for VO2max are often presented as a simple linear regression of VO2max with age (Cooper and Storer 2001). The results of the current study do not endorse this approach. We recommend to use a non-linear approach to model VO2max data.

Where the studies described above only concern adults and elderly, a comparison for children is possible with the widely used data of Cooper et al. (1984). The Cooper study showed lower values, as well, but it must be noted that the difference between the Cooper study and our study is smaller than the difference between the studies discussed above and our study for boys, in particular for 12-year-old boys. However, the difference between the VO2max values of girls between the current study and the Cooper data are as big as the difference between adult studies. A recent case study also observed that the Cooper values for VO2max in children are substantially lower compared to those previously published by our group (Waterfall et al. 2020).

Taking this comparison into consideration, it is impossible to ignore the remarkable differences between our study outcome and previously published reference values, especially for adults. This demonstrates and explains the clinical experience of the existing reference values being too low for the Dutch population. Therefore, this shows the importance of having access to population-specific reference values.

Besides providing population-specific reference values, the aim of this study was to provide up-to-date values, as well. However, it is questionable whether the reference values of, for instance, Jones et al. (1985) and Wasserman et al. (2005) are outdated. Worldwide, there is a tendency towards an increase in cardiovascular diseases, of which CRF is an important marker (Ross et al. 2016). Nevertheless, this trend does not automatically mean that the health standards need to be reduced, as well. Values that relate to a standard based on desirable health conditions are officially called criterion-referenced fitness standards (Welk et al. 2011).

For people with obesity, it is impractical to use the VO2max/kg, since dividing the absolute value of VO2max (ml min−1) by a high body weight automatically leads to a low VO2max/kg. Our recommendation for people with obesity is describing the VO2max in absolute values (ml min−1) instead of relative to body mass. Then, comparison with the values of Mylius et al. (2019) is possible, who provided absolute values for the VO2max that were based on the LowLands Fitness Registry, as well. An alternative would be calculating a person’s ideal body weight and then dividing the absolute value of VO2max by this ideal weight, after which comparison with our reference values is possible.

There are multiple strengths to this study. First of all, to our knowledge, this is the first study to provide reference values for the VO2max/kg based on a Dutch and Flemish cohort. Second, we applied a relative substantial sample size to this study (n = 4612), compared to, for instance, Jones et al. (n = 100) (Jones et al. 1985), Wasserman et al. (n = 77) (Wasserman et al. 2005), and Koch et al. (n = 1708) (Koch et al. 2009). The different centres that submitted data to the LowLands Fitness Registry are properly distributed among the Netherlands and Flanders, and hence, the reference values are broadly representative of the Netherlands and Flanders. Besides, the data were obtained through mandatory health checks among different professions instead of voluntary health checks; consequently, there is minor selection bias at which healthier people will participate in voluntary health checks (Van de Poppe et al. 2019). Subjects with a broad age range were included, so the reference values are suitable for both children and adults from 6 until 65 years of age.

There are also some limitations to this study that need to be acknowledged. The VO2max/kg is the maximum amount of oxygen uptake during exercise, which is physiologically demonstrated by a plateau phase. However, during most exercise testing, the VO2max/kg does not reach this plateau phase. This means that the VO2max/kg is measured at an estimated maximum effort instead of at the physiological plateau phase. Therefore, it is possible that the definite physiological VO2max/kg may be a fraction higher than the estimated VO2max/kg used in our study. No verification procedure for VO2max was performed (Poole and Jones 2017). Moreover, our data were obtained by mandatory health checks among different professions, for instance divers and militaries. These professions acquire good physical condition, and therefore, the cardiorespiratory fitness might be overestimated, compared to the general Dutch population. Furthermore, our database contained almost 4 times as many male subjects as female subjects, and hence, the accuracy of our reference values might be higher for males than females. Finally, there is a skew distribution of subjects among the different age categories, since the decades of 20 s through the 50 s are most represented.

To provide more reliable reference values for females, future research with more female subjects is needed. Another recommendation is including more subjects in the lowest and highest age categories, especially subjects of 4 until 10 years old and above 65 years, to provide reference values with more reliability for children and elderly, as well.

Conclusions

This study provided reference values for the VO2max/kg using cycle ergometry as mode for CPET based on both age and sex, and showed reliable results during cross-validation testing, especially for males. Comparison demonstrated remarkable higher levels of VO2max/kg using the reference values of our study than those of previous published studies. Therefore, our reference values should be able to result in more accurate interpretations of measured VO2max/kg for specifically the West-European population.