Introduction

In recent decades, there has been an exponential growth of office work in Western societies, dominated by sedentary activities [1]. Sedentary behaviour (SB) includes sitting, lying, or reclining activities with low energy expenditure [2]. Employees in high-income countries across the globe were found to sit for about 60% of their days [1]. In a study in Norway, Chau and colleagues found that compared to employees exhibiting more active jobs involving more walking and lifting, office workers have 35% increased mortality rates [3]. One of the reasons for this is that independently of leisure time exercise, SB increases the risk for cardio-metabolic diseases [4, 5]. Despite increasing numbers of interventions to reduce office workers’ SB, there is mixed evidence of their effectiveness [6].

Although interventions that involved environmental restructuring, i.e. the implementation of standing desks, or that involved personal coaching, have been found to be effective in reducing SB, they are cost-intensive [7,8,9,10]. For a large-scale implementation, low-cost interventions are needed. However, current interventions that are low-cost and focus on changing attitudes as determinants of behaviour without environmental or workplace policy changes supporting behaviour change show mixed effects [11, 12]. Therefore, we developed a low-cost alternative to personal coaching, UPcomplish, involving personal coaches supporting participants with automated content [13]. During the coaching, participants wear the VitaBit sensor [14]. The VitaBit toolkit includes the sensor measuring physical behaviour (i.e. SB and physical activity), a mobile phone monitoring application, and a computer portal, where participants can set goals and compete with others. VitaBit also provides a coaching portal, where coaches, if they are authorized, can retrieve participants’ physical behaviour data.

By employing Intervention Mapping (IM), a framework for the systematic development of behaviour change interventions [15], we used findings from behaviour change theories, the empirical literature, and our research data to systematically develop UPcomplish. The result was a logic model of the intervention, of which an excerpt is shown in Fig. 1. It depicts the causal mechanisms from the practical applications of the UPcomplish intervention to the behavioural outcome, i.e. reducing SB. For example, tailored feedback on the achievement of goals, combined with positive reinforcement, is theorized to change the psychosocial determinant attitude. One of the underlying attitudinal beliefs (~ Change objectives) being targeted by this feedback is as follows: “Indicate that the number of resources (time, skills) that will need to be invested to perform certain strategies [being suggested to reduce sitting] will be worthwhile as it will lead to positive outcomes”. The logic model assumes that changing this attitudinal belief will help to reduce SB [13].

Fig. 1
figure 1

Illustration of the logic model of the UPcomplish and VitaBit intervention. The performance objectives on the individual level are shown in the upper branch, the ones on the interpersonal level are shown in the lower branch

UPcomplish is a data-driven, tailored, and motivational intervention involving the VitaBit toolkit that allows for the self-monitoring of SB. We implemented UPcomplish among 15 workplace sites to investigate its effectiveness. For the effect evaluation, we had expected the intervention to be effective in reducing daily sitting proportion and prolonged sitting as well as in increasing quality of life (QoL; i.e. vitality, performance, and well-being). Yet, compared to the VitaBit-only baseline phases (i.e. control condition), we did not find significant improvements. Both between and within participants did the UPcomplish intervention reduce SB reduce or increase QoL [16]. Possible reasons for this may be a recruitment bias among the intervention population (e.g. only employees being motivated volunteered), but also unexpected deviations from the logic model of change underlying the intervention (Fig. 1) [13]. For example, post hoc analyses found that improvements in the psychosocial determinants were not associated with improvements in SB, and improvements in SB were not associated with improvements in QoL. It might be that either SB among office workers is less of a reasoned action than we assumed or that certain sub-groups of participants engaged more in the intervention, and profited from improvements in determinants, in SB, or in QoL. The intervention population was dominated by females (68%), and the participants reported high baseline QoL and psychosocial determinants. These and other baseline and participants’ characteristics might have been factors that moderated the effectiveness of UPcomplish.

The purpose of this study is to explore potential moderators of the effectiveness of UPcomplish. Effectiveness refers to improvements in psychosocial determinants, in SB, and in QoL, as well as performance objectives (i.e. average registering, monitoring, and engagement with coach) [15], which serve as dependent variables. As independent variables, firstly, participant characteristics such as gender, age, body mass index (BMI), or employment status are explored. Secondly, we assumed that low baseline SB, high moderate-to-vigorous physical activity (MVPA), positive baseline determinants, and high baseline QoL result in lower potential for improvement and, therefore, less effectiveness of the intervention. Thirdly, the intervention messages might not have been accepted, read, or understood. Instead of using a randomized control trial, the data were gathered using a stepped wedge design with continuous recruitment. As a result, we received annual spread data, increased statistical power, and avoided having a waiting control group (which is often associated with compliance issues) [17].

Methods

The study was pre-registered under: NL7503 (https://www.trialregister.nl/trial/7503). The protocol of the intervention, with more details about the design, has been published elsewhere [13]. Additional material, the raw data, and the R scripts are fully disclosed in the supplementary material https://osf.io/qzp9m/. This manuscript adheres to the Consolidated Standards of Reporting Trials (CONSORT) checklist of information to include when reporting a stepped wedge cluster randomized trial [18].

Study Design and Sample

We had five intervention groups each including participants from 2 to 5 different companies (i.e. sub-groups). The groups started with time lags of about 7 weeks, each worksite receiving a kick-off meeting with a minimum of 5 employees per kick-off. This was followed by the baseline, VitaBit-only week, and the 12-week UPcomplish intervention. The eligibility criteria included that participants were able to walk and stand, that they were willing to download the VitaBit smartphone application, that they were office workers, and that they were able to speak and understand German. If any of the inclusion criteria were not met, participants were excluded.

VitaBit Software provided us with 200 devices, which we could use for the evaluation study (May 2019–January 2020). Assuming five intervention groups of 40 participants each and a drop-out of 20% (32 participants per group, one group serving as both baseline and control), we conducted power calculations with an expected sample size of N = 192 and a Cohen’s d estimate of 0.5. The population effect size would very likely (95%) be somewhere between 0.21 and 0.79, which we considered being sufficiently accurate [13]. We recruited participants from German companies in multiple industries (e.g. public service, education, and automotive). Of the 193 eligible participants who communicated interest in participating, 150 participants created a VitaBit account, and 142 wore their VitaBit at baseline. The flow of the participants in the intervention is shown in Fig. 2: 45 participants wore the VitaBit device for 12 weeks or longer, whereas 38 participants collected less than 6 weeks of VitaBit data and were therefore excluded from the analyses of the current study. The number of participants that filled out the surveys is illustrated in Fig. 3. The baseline survey (T0) was filled out by 129 (91%), the mid-evaluation survey (T1) by 67 (47%), and the end evaluation survey (T2) by 62 (44%) participants.

Fig. 2
figure 2

Number of participants per intervention group per number of weeks having collected VitaBit

Fig. 3
figure 3

Number of participants that filled out the survey at baseline (T0), in the middle of the intervention (T1), and directly after the intervention (T2)

Participants could refuse their participation at all times, without giving a reason. This study and the consent procedure were approved by the Ethics Review Committee of the Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, The Netherlands (ERCPN-188_11_02_2018). The trial was pre-registered in the Netherlands Trial Register under: NL7503 (https://www.trialregister.nl/trial/7503).

Procedure

Flyers with information about the study (incl. inclusion criteria, benefits to expect, and what to do) were distributed among German companies and potential participants (i.e. employees) who, if they were interested, further forwarded the flyers. If the management agreed, the employees could participate. Emails with an invitation to the personal kick-off meeting, instructions on creating a VitaBit account, and the information sheet were sent to volunteering participants. The kick-offs, which took place in the participants’ companies, took between 35 and 60 min and included an introduction round, information about SB, the intervention, and the VitaBit toolkit. Additionally, participants were supported to pair the VitaBit devices with their smartphones. After written informed consent, participants started wearing the device. The first week was the baseline, VitaBit-only week. This was followed by the 12-week intervention. Participants who were interested, but unable to attend the kick-offs, received all information via email. At baseline, in week 6, and directly after the intervention, participants received surveys on QoL and determinants. After the intervention, everyone received an individual and a group (i.e. company) report and a VitaBit voucher as compensation. The devices were collected earliest 4 weeks after the end of the intervention.

Intervention

The intervention consists of two components: the VitaBit mobile phone application (app) and UPcomplish. The VitaBit app serves as monitoring tool, providing information about current SB, standing, and physical activity, and showing the user’s personal goals. UPcomplish serves as motivational support and includes 14 feedback messages (FBMs) that are sent to participants via their preferred channel (e.g. WhatsApp, email). In the first 6 weeks, the FBMs were sent twice, and, as of week 6, once per week. They were tailored to individuals’ physical behaviour, their goals (set during the kick-off and adapted after the first week if too easy or too difficult), and their perceived barriers. If participants did not drop out, they received (1) a FBM, (2) a reminder if they forgot to wear their device, or (3) no message in case of a holiday. In the latter two cases, the upcoming FBMs were delivered delayed. The last two FBMs were not delayed and were delivered to all participants having data at the concerning point in time. Thereby, all participants could compete with each other and receive tips on how to keep the new habits. The FBMs included support in goal setting, goal adjustment, breaking down the goal to graded sub-goals, and feedback about the goals. Additionally, they included feedback about SB patterns (e.g. “On Tuesday afternoon, your sitting periods seem to be specifically long”). After being asked about their hurdles to sit less (e.g. time constraints, kind of work), the participants received tailored advice on how to overcome these hurdles. Every 2 weeks, they received activity challenges, such as not using the toilet on the same floor. In the end, the coach gave tips on how to sustain the new habits.

Measures

Figure 4 shows the measurements that were implemented during data collection.

Fig. 4
figure 4

Overview of the measures that were conducted for the evaluation of UPcomplish. At baseline, in week 6, and in the end, surveys on determinants, moderate-to-vigorous physical activity (MVPA), and quality of life were distributed. Physical behaviours (i.e. behavioural outcomes) were continuously measured with the VitaBit. VitaBit data also provided information on performance objectives; for example, how often participants wore the device (i.e. registered behaviour) and opened the app to synchronize their data (i.e. monitored behaviour)

Continuous Measurements

The VitaBit device (3.9 × 1.4 × 0.9 cm, 4.8 g) measured SB, standing, and activity. It was magnetically attached to clothing fabric at the thigh or placed in trouser pockets. The device samples data with a rate of 33 Hz and an output rate of 30 s, which are stored on the device for at least 30 days. The data on the device are synchronized via Bluetooth with the VitaBit app, before being delivered via mobile Internet to the backend server. The data are stored in a time series database in a pseudonymized way, where they can be downloaded by authorized persons. In a validation study, the device showed a sensitivity of 85.7% and a specificity of 91.2% for sitting [14]. The raw data are in a long format csv file (i.e. each row representing 30 s of a person) and include a user identifier, a time stamp, and three columns for each physical behaviour.

The VitaBit device stores the data even if it is not immediately synchronized with the app. Therefore, information regarding behavioural registering and monitoring could be retrieved indirectly (i.e. performance objectives in Fig. 1). First, the total number of days per week with available data on the VitaBit device provided information on the registration of participants’ SB, i.e. how often the device was worn. Second, to monitor their SB, participants needed to open the VitaBit app and synchronize their data. Since VitaBit did not synchronize the data without manually opening the app, behavioural monitoring could be assessed at the weekly FBMs: The number of days with data that were missing at the time of the FBM was used as a proxy because it gave an estimate on how often the app was opened. More days with data missing would indicate lower monitoring rates.

As a proxy for the third, performance objective (engagement in the coaching, such as action planning and the discovery of barriers), we used the proportion of responses to coaching messages. Participants with more responses to the coaching messages would have higher engagement values than participants who only responded rarely to coaching messages.

Online Surveys

Online surveys at baseline (T0), in week 6 (T1), and directly after the intervention (T2) included questions on psychosocial determinants and QoL. The survey at T0 additionally included sociodemographic and job-related variables, and the survey at T2 additionally asked about intervention characteristics. The English version of the Individual Work Performance Questionnaire was translated into German using back-translation [19]. We calculated omegas (ω; > 2 items) and Pearson’s correlations (r; 2 items) to provide estimates for internal consistency [20, 21]. For all numerical variables, higher survey scores indicated higher values regarding the variable being measured (e.g. higher self-efficacy).

Gender, age, educational level, height, weight, and job-related variables were obtained when the participants created their VitaBit accounts. They could choose between 8 different educational degrees (e.g. high school degree), between 29 different job titles (e.g. sales manager, administrative), and between 17 main company industries (e.g. educational, service). Additionally, in the online survey, they were asked about the usual number of workdays per week (1 item) and their employment status (full-time/part-time; 1 item), and they received questions about sedentary job tasks (5 items). These could be phone calls, computer work, desk work, having meetings, and travelling/visiting clients, such as “How much - on average per day (in %) - do you estimate you spend on […] Phone calls?” [22].

The questions on acceptability (rated on a 5-point Likert scale; 1 = I don’t agree, 5 = I agree) encompassed program-related variables (e.g. understandability; 12 items; e.g. “How much do you agree with the following statements: […] The questions within the recommendations were clear”), questions about the coach’s advice (e.g. credibility; 7 items), and questions about behavioural maintenance (2 items) [22].

Participants were asked to indicate how much they agreed with statements on SB, which they indicated on a 5-point Likert scale. These statements covered attitude (6 items; e.g. […] walking around at work is healthy”; ω = .62), perceived social support (2 items; e.g. […] walking around at work is encouraged by my colleagues”; r = .62), perceived behavioural control (PBC; 4 items; e.g. “I am sure that I can […] walk around at work, even though I feel bad, tired, tense or depressed”; ω = .70), and intention (2 items; e.g. “Are you planning to interrupt long sitting periods at work with […] walking breaks?”; r = .43) [22]. Perceived susceptibility to prolonged sitting was assessed with 2 items (e.g. “My daily sitting time is more compared to what is recommended.”; r = .72) [23, 24].

The Individual Work Performance Questionnaire (seldom = 0 to always = 5) was used to assess task and contextual performance. Task performance (5 items; ω = .72) refers to the ability to perform the tasks that are essential, e.g. “During the last week, I was able to perform my work well with minimal time and effort”. Contextual performance (9 items; ω = .57) refers to the organizational, social, or psychological factors that are required for adequate functioning at work, e.g. “I took on extra responsibilities.” [25]. Furthermore, we used the Perceived Stress Scale (10 items; 1 = never, 5 = very often; e.g. “How often have you felt nervous and ‘stressed’?”; ω = .89) [26, 27] and the bodily pain (2 items; e.g. “How much bodily pain have you had?”; r = .85), emotional well-being (5 items; e.g. “How much of the time have you been a happy person?”; ω = .83), and vitality (4 items; e.g. “How much of the time did you have a lot of energy?”; ω = .86) sub-scales of the SF-36 [28].

The VitaBit tool measures SB, standing, and physical activity, but does not distinguish between different intensities of physical activity (i.e. light versus moderate-to-vigorous activity). Hence, we additionally assessed light and MVPA with the German version of the international physical activity questionnaire short form (max. 6 items; excluding SB; e.g. “During the last 7 days, on how many days did you do vigorous physical activities like heavy lifting, digging, heavy construction, or climbing stairs as part of your work?”) [29].

Data Analyses

To clean and analyse the data, we used R version 4.0.2. We inspected the data using descriptive univariate analyses, and we visualized them with histograms and QQ plots to check for normality. We reported normally distributed variables as means and standard deviations (SD), non-normally distributed variables as medians and inter-quartile ranges (IQR), and categorical variables as absolute numbers and percentages. SB was represented as proportion of the entire waking day (i.e. when the device was worn) by applying a compositional data approach (CoDA) (i.e. \({z1}_{sitting}=\sqrt{2/3 } \;\mathrm{ln}(Sitting\%/\sqrt{Standing\% \times Activity\%} ))\) [30] and as sum of the squared sitting bouts (SSSB) (\(SSSB=\sum_{0}^{n}{SitBout}_{i}^{2}\)) [13]. We used only those days where a participant had collected at least 8 h of physical behaviour data (i.e. the sum of minutes measured as sitting, standing, or active) [, 15, 31], and we excluded holidays from the analyses. The Mahalanobis distance method was used to detect and exclude outliers [32].

To calculate the within-subjects improvements of SB, QoL, and psychosocial determinants, we only used calendar weeks, in which baseline data (i.e. of other participants still being in their baseline week, i.e. control condition) were available. These baseline data were used to centre the outcome variables to control for seasonal trends. The within-subjects improvements of the variables (in %) that were collected with surveys were calculated as follows. If lower values were considered healthier, such as in perceived stress, survey 2 was subtracted from survey 1, else survey 1 was subtracted from survey 2. We then divided by survey 1 to retrieve the percentual improvement. If the survey was filled out 3 times, additionally, the same calculation was performed with surveys 2 and 3. We then calculated a survey-to-survey improvement by using the mean of the two results. For SB, we took the averages for calendar weeks and calculated the week-to-week improvements.

Linear regression models were used with ordinary least squares, if residuals were normally distributed, else with percentage least squares [33], to explore potential moderators of effectiveness. Thereby, participant characteristics (e.g. gender, age, company industry, BMI), baseline physical behaviours (e.g. SB, MVPA), baseline QoL (e.g. perceived stress, vitality), and intervention perception (e.g. understandability, acceptance of the intervention) were regressed on within-subjects improvements (i.e. difference scores after centring around calendar week means [34]) in psychosocial determinants (e.g. attitude, perceived social norms), on performance objectives (e.g. average registering, monitoring), on improvements in SB, and in QoL. For testing statistical significance (two-sided), we used an alpha of 0.05, which we corrected by the help of the Benjamini–Hochberg procedure [35, 36].

To gather insights into potential ceiling effects, additional post hoc analyses were done with a sub-group of participants who did not score as high in relevant determinants and quality of life variables. The seven variables that were found to be associated with the effectiveness of UPcomplish were used to create this sub-group: only participants who scored below the median in at least four (i.e. the majority) of these seven variables were included in this sub-group (n = 51). We calculated pairwise Pearson’s correlations between all variables of the 4 parts of the logic model of the intervention (i.e. psychosocial determinants, performance objectives, SB, and QoL). A positive improvement can be interpreted as a beneficial intra-individual week-to-week (as in SB) or as a measurement-to-measurement (as in QoL) development. Week-to-week SB improvement was calculated as proportional improvement in %, measurement-to-measurement improvement of the survey variables was calculated as average absolute improvement. For this analysis, we did not centre the variables around calendar week means, because of the lower number of participants and available baseline data in the concerning calendar weeks.

Results

Participant Characteristics

Table 1 presents the descriptive characteristics of the sample at baseline. Among the participants who agreed to participate, 143 (47 males) participants with a median age of 42.0 (IQR = 21.5) years and a mean BMI of 23.4 (SD = 5.2) kg/m2 created a VitaBit account. Males had a higher (p < .01) BMI than females. At baseline, most participants indicated that their work tasks encompassed mainly computer and/or desk work. The majority had a full-time position and a usual work week of 5 workdays.

Table 1 Descriptive characteristics of participants at baseline

The psychosocial determinants (range 1 to 5) regarding sitting ranged from a mean of 3.4 (SD = 0.9) for perceived social support to a median of 5.0 (IQR = 1.0) for perceived susceptibility. At baseline, the participants wore their VitaBit device on average for 823.4 (SD = 107.5) min per day, of which the device measured a median of 510.2 (IQR = 95.3) SB minutes, 199.6 (IQR = 102.8) standing minutes, and 91.7 (IQR = 45.7) activity minutes. Females collected more (p < .001) standing time than males, while males collected more (p < 0.01) activity time than females. Performance at baseline was on average 3.3 (SD = 0.6) for task and 3.6 (SD = 0.6) for contextual performance (1 to 5). On average, perceived stress (0 = no stress, 40 = high stress) was 15.0 (SD = 10.0), perceived pain (0 = much pain, 100 = no pain) was 77.5 (SD = 32.5), and vitality and emotional well-being (both 0 = low, 100 = high) were 54.4 (SD = 18.8) and 76.0 (SD = 20.0), respectively.

Variables Affecting Improvements in Psychosocial Determinants

Table 2 presents the results of the regression models exploring moderators affecting improvements in psychosocial determinants. After Benjamini–Hochberg corrections, higher baseline intentions were associated with significantly less improvement in intention during participation in the intervention. None of the other improvements in determinants was related to participant characteristics, job-related variables, baseline behaviours, or how the intervention messages were perceived.

Table 2 Linear regression models for the effects of participant characteristics, baseline variables, and intervention perception on improvements in psychosocial determinants

Variables Affecting the Performance Objectives

Table 3 presents the results of the regression models exploring factors being associated with performance objectives. None of the performance objectives was associated with participant characteristics, job-related variables, baseline behaviours, or how the intervention messages were perceived.

Table 3 Linear regression models for the effects of participant characteristics, baseline variables, and intervention perception on average performance objectives

Variables Affecting Improvements in Sedentary Behaviour

Table 4 presents the results of the regression models exploring variables being associated with SB improvement. SB improvement was not found to be associated with participant characteristics, job-related variables, baseline behaviours, or how the intervention messages were perceived.

Table 4 Linear regression models for the effects of participant characteristics, baseline variables, and intervention perception on improvements in SB

Variables Affecting Improvements in Quality of Life

After the Benjamini–Hochberg correction, most of the QoL improvements were associated with their own baseline values (see Tables 5 and 6). Higher baseline task performance was associated with fewer improvements in task performance, higher baseline stress with more improvement in perceived stress, higher baseline vitality with less improvement in vitality, and higher baseline emotional well-being with less improvement in emotional well-being. Furthermore, lower baseline stress and higher baseline emotional well-being were associated with more improvement in contextual performance. Finally, higher baseline attitude and PBC were associated with fewer improvements in emotional well-being.

Table 5 Linear regression models for the effects of participant characteristics, baseline variables, and intervention perception on improvements in quality of life
Table 6 Linear regression models for the effects of participant characteristics, baseline variables, and intervention perception on improvements in quality of life

Post HocAnalyses: Can Subjects Scoring Low on Relevant Determinants and Quality of Life Profit?

Figure 5 shows the correlations and the univariate distributions of the variables.

Fig. 5
figure 5

Pearson correlations and plots illustrating the linear and smoothed associations, respectively, between improvements in determinants (measurement-to-measurement), performance objectives, improvements in SB (week-to-week), and improvements in QoL (measurement-to-measurement). Abbreviations: PBC, perceived behavioural control; PSS, perceived social support; PercSusc, perceived susceptibility; SB CoDA, SB proportion; SSSB, summed squared sitting bouts; TP, task performance; CP, contextual performance; EWB, emotional well-being. ***r > .50; **r > .40; *r > .30

Correlations Within the Parts of the Logic Model of the Intervention

Among the sub-group, improvement in PBC was positively associated with improvement in attitude (r = .46; 95% CI = .15, .68; p < .01; pcorrected = .03) and in intention (r = .45; 95% CI = .14, .68; p < .01; pcorrected = .04), which was also found when analysing all participants. Average monitoring delay was negatively associated with average SB registering (r =  − .65; 95% CI =  −.78, −.45; p = pcorrected < .001), which was also found for the entire group. Improvement in vitality was positively associated with improvement in emotional well-being (r = .59; 95% CI = .33, .77; p < .001; pcorrected = .001), and improvement in perceived stress was positively associated with improvement in emotional well-being (r = .58; 95% CI =  −.76, − 0.30; p < .001; pcorrected < .01). All associations within improvements in QoL were also found when analysing all participants.

Correlations Between the Parts of the Logic Model of the Intervention

After the Benjamini–Hochberg correction, none of the improvements in one part of the logic model was associated with improvements in another part of the model among the sub-group.

Discussion

The purpose of this study was to explore moderators of the effectiveness of the UPcomplish intervention, which had previously been found neither to have effects on SB, on psychosocial determinants, nor on QoL [16]. Expectedly, we found that baseline psychosocial determinants and baseline QoL factors were negatively associated with improvements in determinants and QoL. Since baseline determinants and QoL were high among the participants of this study, we conducted a post hoc analysis to investigate whether participants starting lower in determinants and QoL profited from the UPcomplish intervention. Among this sub-group, improvement in PBC was associated with improvement in prolonged sitting, which itself was related to improvement in task performance (see Fig. 5). Yet, these were not significant anymore after the Benjamini–Hochberg correction.

We hypothesized that baseline characteristics of the sample such as psychosocial determinants, working tasks, or QoL would predict intra-individual improvements when receiving UPcomplish. In line with previous research, we had a selective sample majorly including female participants [37, 38], and participants with high intentions to reduce their SB [39]. Additionally, the sample of the current study had higher baseline attitude, PBC (i.e. self-efficacy), and perceived social support as opposed to previous SB intervention studies [40, 41]. Participants showed very high values in perceived susceptibility to too much sitting, which has also been found previously [38, 41]. Beyond showing the selectivity of the sample, these high baseline values might have caused ceiling effects such as baseline intention being associated with lower improvement of intention. None of the determinants was associated with improvements in SB or in the performance objectives (e.g. monitoring behaviour). Since this was found neither with the original sample nor in the post hoc analysis using participants lower in baseline determinants, the psychological determinants per se might not be enough to predict improvements in SB. This is in line with previous research that did not find psychosocial determinants to be mediators for improvements in SB [40].

UPcomplish was mainly aimed at influencing attitudinal, normative, and control beliefs. This might have resulted in too much focus on creating an intention rather than translating the intention into actual behaviour. Others already suggested that the challenge of reducing SB is rather the volitional process, which is one way to bridge the gap between the intention and the actual behaviour [42]. Volition can be promoted (1) by action planning, which includes goal setting and the anticipation of barriers of behavioural change, and (2) by PBC, which elsewhere was already found to be a moderator in reducing workplace SB [40, 42]. Similarly, in the post hoc analyses, we found that among a sub-group of participants scoring lower in baseline determinants, improvement in PBC was the only factor that was marginally related to improvement in SB. Although the UPcomplish intervention did include goal setting, the anticipation of barriers, and several tips aiming at an increase of PBC, the participants did not report an increase in PBC nor did they improve their SB [16]. However, at baseline, the participants had a median score of 4.0 out of 5.0 on PBC, which might have been one of the core reasons for the lack of effectiveness. Therefore, the UPcomplish intervention might only be effective for office workers scoring low in PBC at baseline. Additionally, SB might be less of a reasoned action and more a behaviour that is determined by automaticity and environmental conditions. To break the automaticity of SB, it might be combined with environmental changes (e.g. implementing standing desks, cue altering), and methods to change habits (e.g. reminders). Additionally, support from employers or other leaders might also help bridge the intention-behaviour gap. These could serve as role models, support standing and walking meetings, or implement policies that allow for breaks in sitting time.

Except for perceived vitality, the sample of this study indicated having good QoL, which could be due to a selectivity bias. However, there is no evidence that health affects participation in workplace health interventions [37, 39]. Hence, concerning QoL, the sample of this study might be representative of the working population in Germany. Additionally, although some aspects of QoL at baseline were associated with improvements in QoL during the intervention, they were likely be caused by ceiling effects because they did not relate to the performance objectives, or to improvements in psychosocial determinants and in SB [43]. Only in the post hoc analysis among a sub-group, we found a tendency that improvement in perceived physical pain was associated with fewer reductions of SB. However, this was not significant after the Benjamini–Hochberg correction. In another study, lower back pain at the beginning of the intervention predicted less improvement in SB, which was assumed to be caused by a limited capacity of standing due to the perceived pain [44].

Several steps of the implementation of the UPcomplish intervention might increase its effectiveness. Firstly, increasing the reach by also including employees being less motivated and self-efficacious at baseline could improve its effectiveness considering the focus of UPcomplish being on psychosocial determinants. To overcome challenges in the adoption of workplace health programs [45], a systematically developed implementation plan using Implementation Mapping might help to increase the reach of UPcomplish [46]. For example, it would be important to increase awareness of the program, self-efficacy towards participation (e.g. to overcome time constraints and tiredness), and attitudes regarding the program among all potential participants already before they potentially adopt the program [45]. Secondly, although the acceptability, understandability, and the message processing of the UPcomplish intervention were positive, more components need to be included to address other ecological levels [15, 37]. Multi-component interventions have the potential for higher adoption rates due to an increased likelihood to match with the needs of participants [37]. Additionally, a workplace SB intervention including a psychosocial intervention, but also managers serving as role models, financial incentives to increase sustained participation, and environmental (e.g. standing desks) and cultural (e.g. walking around is seen as healthy and not as time-wasting) restructuring is likelier to be effective on the long run because it tackles both automatic and controlled motivational processes [41, 47, 48]. Lastly, although the UPcomplish intervention was systematically developed using the IM framework [13], the intervention content might not have tackled all important psychosocial determinants [49]. This should be investigated within the scope of a process evaluation implementing the intervention among participants with low baseline determinants.

Strengths and Limitations

This study has several strengths. First, we had longitudinal data of diverse company industries to our disposal that were collected during 75% of an entire year, and we additionally accounted for seasonal variations by centring the variables around calendar week means. A cheap and unobtrusive measurement tool with long battery life, the VitaBit device, facilitated the continuous collection of SB data. This increases the external validity of the results. Second, we were the first to our knowledge that incorporated information on the entire logic model of a SB intervention, which provides interesting insights into the underlying mechanisms of reducing workplace SB. Third, we focused on the health effects for the target group, which was the reason to analyse SB during the entire day and not merely SB during workdays. Fourth, not the absolute time spent sitting but rather prolonged, uninterrupted SB is associated with detrimental health outcomes. Hence, we applied both a compositional data approach to account for inter-dependencies of physical behaviours and a new value (i.e. SSSB) to represent prolonged SB. Last, UPcomplish was highly accepted among participants: the participants did not only indicate that they perceived the intervention positively also did they drop out late and mostly if they had technical problems rather than if they lost their motivation [50].

One of the limitations is that, since the psychosocial determinants and QoL were measured using self-reports, participants might have provided socially desirable answers [51]. However, concerning QoL, using self-reports enabled the assessment of a large number of participants with lower timely and financial resources. Another limitation concerns the employees that did not adopt the intervention. Voluntary participation might have resulted in a selection bias, and our sample included mainly females, and participants scoring high in psychosocial determinants at baseline. However, we conducted a post hoc analysis to investigate potential effects among a sub-group scoring lower in the psychosocial determinants. Lastly, we could not cluster the data by company using multilevel models nor could we centre the variables in the post hoc analysis around calendar week means because this would have resulted in fewer data, and therefore, less statistical power and a problem of singularity.

Conclusions

Especially high baseline values in, for example, intention were negatively related to intra-individual improvement in the intention to sit less. However, this study showed that, except for PBC, the psychosocial determinants (attitude, perceived social norms, perceived susceptibility, intention) do not seem to be important when reducing workplace sitting, and it might be more determined by the organizational environment and automatic behaviours. When promoting health at the workplace, it is a challenge to reach a representative sample of employees including the ones being less interested in improving their health. Yet, this study showed, that probably especially these employees could profit most from a motivational intervention. It needs to be investigated whether UPcomplish could be effective in combination with changes in the physical and cultural environment of companies.