Introduction

Maximal oxygen uptake (VO2max) is a physiological indicator of an individual’s cardiorespiratory fitness and represents the maximum rate of the cardiovascular and respiratory systems in delivering oxygen to the working muscles. VO2max can be expressed in absolute (L O2/min) and relative (mL O2/min/kg) terms. Relative VO2max has been widely recognized as a measure for assessing health status, predicting cardiovascular disease risk, and evaluating athletic performance (Bassett and Howley 2000; Myers et al. 2002).

Two factors that can impact VO2max are exercise level and body mass index (BMI). Exercise level may directly impact VO2max, as regular physical activity of sufficient intensity have been shown to increase an individuals VO2max (Aadahl et al. 2007). Also, BMI may also have impact on VO2max level with a higher BMI often correlating with a lower VO2max (Mondal and Mishra 2017).

Traditionally, direct measurement of VO2max requires participants to undergo maximal exercise testing on a graded treadmill or cycle ergometer until voluntary exhaustion. While this approach provides accurate and precise results, it is often not feasible due to restrictions to laboratory conditions and expertise and health risks in mixed nonathlete populations.

To address these limitations, various methods have been developed to estimate VO2max using submaximal exercise tests (Noonan and Dean 2000). These tests often involve standardized practices where variables such as heart rate, power output, time, speed, or distance are measured. By extrapolating the submaximal data using predictive equations or nomograms, VO2max can be estimated.

Despite the availability of several norm predictions for estimating VO2max, translating these general norms into specific tests could introduce bias (Noonan and Dean 2000). Normative values derived from diverse populations may not accurately reflect the characteristics of the individuals being tested, introducing inherent error and potentially affecting the validity of the estimated VO2max values. Additionally, test-specific norms based on large, well-defined samples are needed to ensure accurate and reliable estimation of VO2max.

Two popular submaximal cycle ergometer tests are the Åstrand test (Å-test) (Astrand and Ryhming 1954; Astrand 1960) and the Ekblom-Bak test (EB-test) (Ekblom-Bak et al. 2014; Björkman et al. 2016). While the EB-test uses the change in heart rate response between two submaximal workloads, each for 4 min, the Å-test utilizes the heart rate response to cycling on a single submaximal workload for 6 min. These tests have gained popularity due to their simplicity, feasibility, and ability to estimate VO2max without requiring maximal exertion.

However, these tests may have some inherent biases against directly measured VO2max. Therefore, this study aims to provide test-, sex- and age-specific reference values for estimated VO2max using the Å-test and the EB-test, respectively, in a diverse population. A second aim was to study the influence of age, exercise level, and BMI.

Method

Data was obtained from the HPI Health Profile Institute database, which includes Health Profile Assessments (HPAs) performed in the Swedish working population since the 1980-ies. An HPA includes a self-reported lifestyle questionnaire, anthropometric measurements, resting blood pressure evaluation, a submaximal cycle test, and concludes with a session with a health coach. HPAs are voluntarily accessible and free for employees in companies offering occupational health services. Historically, the Å-test, developed in the early 1960s, was the standard in HPAs. However, the EB-test, introduced in 2014, became an alternative in recent years.

For a contemporary study population mirroring current estimated VO2max levels, we included Å-tests and/or EB-tests from Swedish working individuals aged 20–69, conducted between 2010 and 2022. Out of a total sample of 481,815 men and women, a total of 370,880 had either performed an EB-test or an Å-test. Participants reporting intake of medicine that could affect heart rate response to physical activity were excluded. The resulting samples were for the Å-test n = 263,374 and for the EB test n = 95,043. As the output variables from the EB-test (sex, age, and heart rate response on the higher workload) can be used for calculation of estimated VO2max using the Å-test nomogram, n = 85,094 of the participants performing the EB-test also contributed with an estimated VO2max by the Å-test (Astrand and Ryhming 1954). See Supplement Figs. 1–3 for flowcharts.

The Åstrand test

The Å-test is based on measuring steady-state heart rate during the last minute of six-minute submaximal cycling on constant work rate (pedal frequency 50 rpm), aiming to obtain a rating of perceived exertion (RPE) of≈13 on the Borg’s scale (Borg 1970). VO2max is then estimated from a nomogram using workload, steady state heart rate, and sex, and further age-corrected (Astrand and Ryhming 1954; Astrand 1960). In a previous validation study, the mean (95%CI) difference between measured and estimated VO2max by the Å-test was -0.07 L/min (− 0.21 to − 0.06), and the coefficient of variance was 18.1% for men and women combined. Sex-specific analyses showed that men were being underestimated, 0.41 L/min (− 0.61 to − 0.20), coefficient of variance 14.8%, and women overestimated, 0.13 L/min (− 0.02 to 0.28), coefficient of variance 17.2%.

The Ekblom-Bak test

The EB-test uses the change in heart rate response between two four-minute submaximal workloads (pedal frequency 60 rpm), where cycling on a standard rate with a resistance of 0.5 kiloponds precedes a higher, individually chosen work rate to obtain an RPE of 13–14 on the Borg’s scale (Borg 1970). Mean heart rate values are calculated by measuring heart rate every fifteen seconds during the final minute of each workload. VO2max is estimated using the sex-specific prediction EB-test equations (Björkman et al. 2016). In a cross-validation study, the EB-test showed no significant difference on group level between measured and estimated absolute VO2max, mean (95% CI) of difference 0.02 (− 0.04 to 0.08) and coefficient of variance 9.4% for men and women combined (Bjorkman et al. 2016). Men experienced a small overestimation by the EB-test, 0.11 (0.02 to 0.20) and coefficient of variance 8.3%, and women a small underestimation, − 0.09 (− 0.16 to − 0.01) and coefficient of variance 10.0%.

Other measurements

Body mass and height measurements were acquired using standard methods, with individuals wearing lightweight clothing. BMI was determined using the formula: weight in kilograms divided by square height in meters (kg/m2). Exercise level was self-reported as weekly exercise frequency to maintain or improve physical fitness, health, using the following options: ‘Never,’ ‘Sometimes,’ ‘1–2 times/week,’ ‘3–5 times/week,’ or ‘At least 6 times/week,’ with individuals specifying their exercise frequency to maintain or improve physical fitness, health, and well-being. Exclusion criteria were smoking habits, categorized as ‘At least 20 cig/day,’ ‘11–19 cig/day,’ ‘1–10 cig/day,’ ‘Occasionally,’ or ‘Never.’ Only those reporting ‘Never’ were included. Additionally, self-reported medication usage for hypertension or those affecting heart rate and high blood pressure diagnoses were recorded as ‘yes’ or ‘no’.

Statistics

The variables underwent visual normality inspection, revealing an approximation to a normal distribution. Consequently, we report the mean and standard deviation (SD). For analyzing differences between men and women, we employed independent t-tests. Additionally, to assess the effect size of these differences, we calculated Cohen’s d. Reference categories for relative estimated VO2max, segmented by 5-year age groups, were defined using percentiles: 0–10 (Very low), 11–25 (Low), 26–40 (Somewhat low), 41–60 (Average), 61–75 (Somewhat high), 76–90 (High), and 91–100 (Very high). Density plots were employed to provide smoothed probability density estimates, comparing the age-related distributions of the Å-test and EB-test in 10-year age groups. To explore associations between estimated VO2max and exercise and BMI, overall trends and percent of variance explained (R2) were used. These R2 values were derived from crude and sex- and age-adjusted generalized additive models with integrated smoothness estimation, utilizing five knots for the VO2max-BMI relationship. Additionally, crude and sex- and age-adjusted ordinary least squares regression was applied for the exercise-VO2max relationship.

The sample was divided into 10-year age groups (20–29, 30–39, 40–49, 50–59, 60–69) to assess the association with age. After that, the percentual difference between mean estimated VO2max of the current and the previous 10-year age group was calculated according to the following equation; (meanprevious decade − meancurrent decade)/meanprevious decade × 100 per decade.

All data handling, figures, and statistical analyses were performed with R version 4.2.0 Vigorous Calisthenics and the package tidyverse, and the flowcharts were made with the package dtrackr.

Results

Table 1 shows the study population’s characteristics. Estimated absolute and relative mean (SD) VO2max was 2.84 ± 0.76 L/min and 36.8 ± 10.0 mL/min/kg for the Å-test, and 3.10 ± 0.74 L/min and 39.5 ± 8.5 mL/min/kg for the EB-test. Men exhibited significantly higher absolute estimated VO2max compared to women for both the EB-test (3.58 L/min vs. 2.41 L/min, p < 0.001, Cohen’s d = 2.470) and the Å-test (3.11 L/min vs. 2.48 L/min, p < 0.001, Cohen’s d = 0.899), and also significantly higher relative VO2max according to the EB-test (42.4 mL/min/kg vs. 35.5 mL/min/kg, p < 0.001, Cohen’s d = 0.910). However, for the Å-test, while a statistically significant difference was observed (36.9 mL/min/kg for men vs. 36.6 mL/min/kg for women, p < 0.001), the effect size (Cohen’s d = 0.030) was negligible. Tables 2 and 3 present the sex- and age-specific reference categories.

Table 1 Characteristics of the study populations
Table 2 Age and sex-specific reference values (in mL/min/kg) for the Å-test
Table 3 Sex- and age-specific reference values (in ml/min/kg) for the EB test

Relative VO2max was lower in higher 10-year age group (Fig. 1). The percentual difference of relative VO2max per 10-year age group compared to the previous (30–39 vs. 20–29 years, 40–49 vs 30–39 years, etc.) was for the the Å-test; men − 6.4%, − 7.8%, − 9.2%, − 7.6%, and women − 6.9%, − 9.3%, − 9.9% and − 9.0%. For the EB-test; men − 7.3%, − 7.8%, − 9.0%, and − 8.0%, and women − 5.2%, − 6.6%, − 7.5%, and − 6.9% for women.

Fig. 1
figure 1

Density plot of estimated VO2max by the Å-test and the EB-test, respectively, in relation to 10-year age groups. Colors represent percentile groups in the order 0–10, 11–25, 26–40, 41–60, 61–75, 76–90 and 91–100

Estimated VO2max according to the Å-test as well as the EB-test were higher in participants reporting higher levels of exercise (p < 0.001 overall trend) (Fig. 2A). A model including sex, age, and self-reported exercise level explained 25% and 43% of the variance in estimated VO2max by the Å-test and the EB-test, respectively. The explained variance was 10% and 8% when including only self-reported exercise level.

Fig. 2
figure 2

A The association between estimated VO2max and self-reported exercise level in relation to 10-year age groups. The middle point represents the median. The thicker line contains 66% of the study sample while the narow line contains 95%. B The association between estimated VO2max and BMI in relation to 10-year age groups

Concerning BMI, estimated VO2max was lower with higher BMI, overall trend p = 0.001 for the Å-test and p < 0.001 for the EB-test (Fig. 2B). A model including sex, age, and BMI explained 34% and 67% of the variance in estimated VO2max by the Å-test and the EB-test, respectively, with an explained variance of 23% and 29% when only including BMI.

Discussion

This study presents one of the most extensive reference samples for estimated VO2max values using two commonly used submaximal cycle ergometer tests. The main findings were that the EB-test generally indicated a higher mean estimated VO2max for men than women, while the Å-test showed comparable relative values between sexes. Estimated VO2max was lower with higher age for both tests. While individuals reporting more frequent weekly exercise had higher estimated VO2max using both tests, the explained variance was low. Also, higher BMI was associated with lower estimated VO2max, with moderate explained variance.

Comparison of the two tests

The EB-test was developed with inspiration from the Å-test but aimed at reducing the quite large individual prediction error of the Å-test compared to direct measurements of VO2max (Ekblom-Bak et al. 2014; Bjorkman et al. 2016). Although both tests use a cycle ergometer and measure heart rate response to submaximal steady-state exercise, several differences between the tests may explain some of the variance between the reference values. Firstly, while the Å-test utilizes the heart rate response to cycling on one single submaximal workload for 6 min, the EB-test uses the change in (delta) heart rate response between two submaximal workloads, each for 4 min. This modification by the EB-test to use the change in heart rate response was one of the essential a priori principles that helped reduce the individual prediction error compared to the Å-test (Ekblom-Bak et al. 2014). While heart rate response to one single workload are influenced by both internal and external stimuli (nervousness, hot/cold temperature, stress, etc.), the delta change between two heart rate responses was shown to be more robust (Ekblom-Bak et al. 2014). Other diversities between the tests include, for example, the use of a hand-made nomogram with additional age-correction factors to estimate VO2max by the Å-test, while the EB-test equation was derived using computer-based regression modeling. Also, while the Å-test prediction model uses the assumption of linearity between heart rate and % of VO2max, the EB-test uses a logarithmic data-driven association between delta heart rate response and VO2max. Finally, the Å-test was developed based on data from a young, healthy population (men and women 18–30 years) (Astrand and Ryhming 1954), with age-correcting factors later developed to extend the use to older age groups (Astrand 1960). The EB-test was developed based on data from an already age-diverse population of men and women (20–86 years) (Björkman et al. 2016). Given the discrepancies between the two tests, tests that merely estimate rather than measure VO2max can introduce biases. Hence, references to be used for a test that estimates VO2max should be based on results from the same test.

Comparison to directly measured VO2max

Loe et al. have presented one of the largest European reference materials of directly measured VO2max in 3816 men and women from the HUNT study, reporting mean absolute VO2max of 3.83 ± 0.72 L/min for men and 2.53 ± 0.49 L/min for women and relative VO2max of 45.4 ± 8.9 mL/min/kg for men and 37.0 ± 7.5 mL/min/kg for women (Loe et al. 2013b). This is somewhat higher than the estimated values in the present study. One main reason for differences in reference values derived from different populations is the population under study. Direct VO2max measurements demand maximal effort, potentially skewing recruitment towards younger and more fit individuals. This might account for the higher VO2max values in the HUNT study. Assessment mode also varies; VO2max is directly measured in maximal tests and estimated in submaximal ones. However, the laboratory equipment used when measuring maximal VO2max is not flawless. For example, the gas exchange analyzer used in the HUNT study (MetaMax) has been validated against the gold standard Douglas bag system with 8% higher VO2max values (Steene‐Johannessen et al. 2009). Compared to another Norwegian study with directly measured VO2max, the present study’s relative VO2max values were generally lower but closer to its average (Edvardsen et al. 2013). Yet, the EB-test and Å-test displayed higher VO2max values than American (Jackson et al. 1996; Talbot et al. 2000), Japanese (Sanada et al. 2007), and Brazilian populations (Herdy and Uhlendorf 2011). Based on a validation study (Bjorkman et al. 2016), these results indicate that the Å-test and the EB-test provide reliable average values for population-based research.

Association of age, sex, exercise, and BMI to VO2max

A review including cross-sectional studies reports a 4%–12% lower relative VO2max per age-decade, with most studies displaying an approximately 10% lower relative VO2max per decade (Hawkins and Wiswell 2003; Letnes et al. 2023). The article further notes that a non-linear difference in VO2max occurs during the twenties and thirties in sedentary individuals. On the other hand, athletic individuals who reduce or stop their exercise habits experience a non-linear difference in cardiorespiratory fitness (Hawkins and Wiswell 2003). Authors of the HUNT study reported that the measured VO2max was specifically lower in the age group 40–49 years and older compared to younger age groups, similar to what was seen for the Å-test and the (Loe et al. 2013a). The Hunt study shows a relative decline per decade in VO2max (age group 20–29 years to 60–69 years) of 6.9% for men and 7.0% for women. Another extensive Norwegian study reported a per-decade lower measured VO2max of 8.3% in men and 7.2% in women. Notably, the VO2max values in the HUNT study were, on average, 9% higher than those in the other Norwegian study across all cohorts and both sexes (Edvardsen et al. 2013). This can be compared to the EB-test, where the difference in estimated VO2max by decade was relatively greater in men (− 7.1%) than women (− 5.9%). For the Å-test, the relation was opposite, − 6.9% for men and − 7.7% for women, which may be related to the fact that the Å-test test has been reported to underestimate VO2max in men and overestimate in women (Ekblom-Bak et al. 2014). These findings can be compared to other studies indicating that, in general, men tend to have higher VO2max levels than women (Sandvik et al. 1993; Hollenberg et al. 1998; Talbot et al. 2000; Fleg et al. 2005; Jackson et al. 2009; Wang et al. 2010; Herdy and Uhlendorf 2011). The higher VO2max in men than women is attributed to differences In muscle mass, hemoglobin levels, and cardiac stroke volume (Fletcher et al. 2013; Santisteban et al. 2022). Further, other studies have noted that the sex-based differences in cardiorespiratory fitness seem greater earlier in life and begin to narrow in elderly individuals (Hawkins and Wiswell 2003; Kaminsky et al. 2015).

Figure 2A illustrates that individuals who engage in higher levels of self-reported exercise tend to exhibit higher VO2max values, as observed for both the Å-test and the EB-test. This observation aligns with prior research findings that have consistently reported a positive correlation between exercise frequency and VO2max (Tager et al. 1998; Talbot et al. 2000; van Poppel et al. 2010; Loe et al. 2013a). However, the variance explained by exercise was relatively low in the present study (Å-test; R2 = 10%, EB-test; R2 = 8%), consistent with previous research findings (Tager et al. 1998; Talbot et al. 2000; van Poppel et al. 2010; Loe et al. 2013a). For instance, the HUNT study also demonstrated a limited overall fit between their Physical Activity Index and VO2max, yielding R2 values of 9% for men and 7% for women.

Both VO2max and BMI include body mass in their calculations, leading to an anticipated correlation. Our study confirmed this with a coefficient of determination showing R2 = 29% for the EB-test and R2 = 23% for the Å-test (Mondal and Mishra 2017). Additionally, our findings (Fig. 2B) for both the EB-test and Å-test indicate that higher BMI values are associated with lower relative estimated VO2max values, which has been reported by other research (Zeiher et al. 2019).

Implications

The Å-test and the EB-test are practical and cost-effective means of estimating VO2max in large-scale studies. The tests can be easily implemented, allowing researchers to assess participants’ cardiorespiratory fitness easily. The estimated VO2max values can then be utilized to investigate associations between cardiorespiratory fitness levels and various health outcomes, such as chronic diseases, mortality, or cognitive function. Moreover, the reference values obtained from this study can serve as benchmarks for future research examining fitness trends in different populations or evaluating the impact of public health interventions on aerobic capacity.

The reference values established for estimated VO2max could also have implications for public health initiatives and fitness promotion. They aid in setting fitness goals, accounting for age and gender differences. Using estimated VO2max, health professionals can promote regular physical activity to enhance health and decrease chronic disease risk.

Further information of the Å-test may be found in the publications (Astrand 1960), while the EB-test has a public website: https://www.gih.se/ekblombaktest-english.

Strengths and limitations

This study’s strengths include a vast and diverse sample covering both sexes and various ages from regions across Sweden, enabling detailed analysis of VO2max across age brackets. However, it’s not without limitations. The focus on the working population might limit its generalizability to non-working individuals. While using different test sites could present variability, standardized training provided by the HPI Health Profile Institute ensures consistent testing protocols.

Conclusion

This study presents reference values for estimated VO2max using two commonly used submaximal cycle ergometer tests, the Å-test and the EB-test. While the EB-test indicated differences in estimated VO2max between men and women, estimated VO2max from the Å-test were similar between sexes. Age, exercise level, and BMI influenced the level of estimated VO2max. These test-specific reference values may be used in screenings and clinical practice to evaluate the estimated VO2max of an individual in relation to individuals of the same submaximal test, sex, and age.