In the broadest sense, temporal psychology refers to the way in which thoughts about and/or feelings towards the past, present and future, influence behavior. A growing body of research continues to suggest that time perspective is significantly and meaningfully (in terms of effect size) related to both mental health and addictive behaviors (e.g., Cole et al., 2016; Kaya Lefèvre et al., 2019; Oyandel & Buela-Casal, 2014). One of the most widely-used assessment measures of time perspective is the Zimbardo Time Perspective Inventory (ZTPI; Zimbardo & Boyd, 1999) which assesses cognitive, affective, and behavioral engagement with the past, present, and future, in five domains: past positive (PP), past negative (PN), present hedonistic (PH), present fatalistic (PF), and future (F).

In their development of the ZTPI, Zimbardo and Boyd (1999) reported significant and meaningful correlations between depressive symptoms and PN (r = .59), PF (r = .37), PH (r = −.20) and F (r = −.19) time perspectives. Further, these authors reported significant and meaningful relationships between trait anxiety scores and scores on PN (r = .62), PF (r = .38), and PP (r = −.25). More recently, greater PN scores (d = 1.04), and lower PH scores (d = 0.87) have been reported by patients diagnosed with depression compared to controls (Kaya Lefèvre et al., 2019), while elsewhere, increased PF scores (positively), and increased F time perspective scores (negatively) have been shown to be correlated with allostatic load (Bourdon et al., 2020). In terms of alcohol use, a growing literature has demonstrated that time perspective is also a variable of consequence such that, broadly speaking, a greater present focus or perspective, and/or a reduced future focus or perspective, is significantly related to more problematic drinking behavior (e.g., Barnett et al., 2013; Daugherty & Brase, 2010; McKay et al., 2018).

Although they did not empirically test it, Zimbardo and Boyd suggested that a balanced time perspective is optimal in terms of human functioning. A balanced time perspective was hypothesized by Zimbardo and Boyd to be represented by relatively high scores on PP, PH, and Fe, with relatively low scores on PN and PF. Building on this hypothesis, Stolarski et al. (2011) developed an equation for the assessment of deviation from a balanced time perspective (DBTP). In order to determine ‘balanced’ these authors proposed ‘ideal’ mean scores based on Zimbardo and Boyd’s (2008) cross-cultural collective database as follows: MPP = 4.6, MPH = 3.9, MF = 4.0, MPN = 1.95, and MPF = 1.5. These scores in turn formed part of an equation: DBTP = √ (oPP – ePP)2 + (oPN – ePN)2 + (oPH – ePH)2 + (oPF – ePF)2 + (oF – ePF2), where o is the optimal or ‘ideal’ mean score, and e is the empirical, or observed mean score.

Indeed, the degree of DBTP has been shown to be significantly related to a wide range of variables, in particular wellbeing, mental health, personality, cognitive functioning, self-control, and interpersonal relations (see Stolarski et al., 2020 for a review). In particular, in their systematic review of the DBTP literature, Stolarski et al. (2020) reported on a number of studies which had reported small-to-moderate effect sizes in the relationship between DBTP and alcohol use, as well as studies wherein DBTP scores had been shown to relate meaningfully to mental health and well-being scores. Accordingly, the further the DBTP score is from ‘ideal’, the lower individuals have been found to score on measures of well-being.

More recently, Jankowski et al. (2020) suggested and empirically tested a different set of ‘ideal’ means in their development of the DBTP-r, as follows: MPP = 5.0, MPH = 3.40, MF = 5.00, MPN = 1.00, and MPF = 1.00. In many respects this makes intuitive sense, as otherwise a mean score of PN = 1.95 would be considered more adaptive than PN = 1.00, suggesting that some degree of PN is better than none. These authors suggested that DBTP scores using this new formula would better discriminate between outcomes than the one previously suggested. In their exploratory study, Jankowski et al. (2020) reported a stronger (higher correlation coefficients) relationship between DBTP-r and life satisfaction (negatively), symptoms of depression, and negative affect (negatively) than was the case for DBTP. Indeed, DBTP was more strongly related to positive affect than DBTP-r (negatively). In large part this is attributable to the fact that the DBTP-r creates a more ‘extreme’ version of a balanced time perspective. For example, where previously (in the DBTP formula) a mean score for past negative of 1.95 was considered adaptive, the DBTP-r posits that a mean score of 1 (in other words a total absence of past negative thoughts or feelings) is adaptive. The same is true for F time perspective, where previously a mean of 4 was considered adaptive, whereas in the DBTP-r, a mean of 5 (complete agreement with questions on future thinking or feelings) is considered adaptive. Given the novelty of the DBTP-r approach, no studies have yet examined how scores on that relate to alcohol use.

The present study sought to examine the degree of similarity between results using these formulae, with regard to alcohol use, and symptoms of depression and anxiety. In line with the suggestion and findings of Jankowski et al. (2020), and given the fact that the revised DBTP-r essentially assesses deviation from the most ideal possible circumstance (no PN or PF, and the best possible F and PP scores), we hypothesized to find results using the DBTP-r would be larger in magnitude than those using the DBTP. To the best of our knowledge this is the first study to do so.

Methods

Participants

Participants were 940 adults, recruited through a University undergraduate project in a University in the North West of England (aged 17–75 [Mage = 22.44, SD = 7.36]; N = 452 Males [48.09%]). The study formed part of a student project, and recruitment to the study was undertaken by University students using opportunistic and snowball sampling of peers, friends, and family members. The study was given ethical approval by the relevant university ethics committee and all participants gave written informed consent.

Measures

The Alcohol Use Disorders Identification Test (AUDIT) (Saunders et al., 1993) assesses alcohol use. Eight of the items are responded to on a 5-point scale (e.g., “How often do you have six or more drinks on one occasion?”) and two on a 3-point scale (e.g., “Have you or someone else been injured as a result of your drinking?”). The AUDIT demonstrated good sensitivity (.94) and specificity (.92) in undergraduates (Adewuya, 2005). The original guidelines recommended by Saunders et al. (1993) are used here. Accordingly, a score of eight or more (out of 40) indicates hazardous to harmful alcohol use. Reliability for AUDIT scores in the present study was α = .83.

The Hospital Anxiety and Depression Scale (HADS; Zigmond & Snaith, 1983) was used to screen for symptoms of depression (e.g., “I still enjoy the things I used to enjoy”), and anxiety (e.g., “I get sudden feelings of panic”). The HADS produces scores for anxiety (HADS-A) and depression (HADS-D) on separate subscales that range from zero to twenty-eight. Bjelland et al. (2002) suggested that a score of eight or more on each subscale indicates caseness. Scores on both subscales of the HADS have been shown to be valid and internally consistent (HADS-A: α = .83; HADS-D: α = .82), with equal levels of sensitivity (.80) and specificity (.80; see Bjelland et al., 2002). Both subscales of the HADS demonstrated good internal consistency in the present study (HADS-A: α = 0.82; HADS-D: α = 0.80).

The 56-item Zimbardo Time Perspective Inventory (ZTPI; Zimbardo & Boyd, 1999) assesses individual differences on five subscales. PN (α = .82) and PP (α = .80), measure a negative and positive view of the past, respectively. PH (α = .79), measures a risk-taking or impulsive approach to life, oriented toward present pleasure, while PF (α = .74) assesses a pessimistic view of life. Finally, F (α = .77) assesses preparation for the future and orientation to longer-term outcomes. Responses to each item were on a 5-point Likert scale: 1 = very uncharacteristic of me, through 5 = very characteristic of me. Reliabilities for ZTPI subscale scores in the present study were as follows: PN, α = .81; PP, α = .70; PH, α = .81; PF, α = .74; F, α = .75. Data were also gathered on participants’ age and sex.

Analyses

Two sets of optimal mean scores were used in order to derive the DBTP and DBTP-r values. In the first place, those formerly suggested on Zimbardo and Boyd’s database (www.thetimeparadox.com/surveys, 17-09-2012; the most recent version), were used as follows: MPP = 3.67; MPN = 2.10; MPH = 4.33; MPF = 1.67; MF = 3.69). Secondly, the optimal means from the DBTP-r equation proposed by Jankowski et al. (2020) were also used as follows: MPP = 5; MPN = 1; MPH = 3.40; MPF = 1; MF = 5). Jankowski et al. reported inconsistencies both in their results, and in the extant literature, for PH scores, such that, in some cases higher scores were significantly associated with better outcomes, while elsewhere, worse outcomes. Accordingly, a ‘neutral’ value of 3.4 was hypothesized as optimal or ideal for balance.

Hierarchical regression models (adjusted for sex and age) were computed independently to examine the relationship between DBTP and separately, DBTP-r scores, and AUDIT and HADS scores. A second series of models were then computed which also additionally adjusted for scores on the five ZTPI dimensions, additional to the DBTP or DBTP-r score. Finally, binary logistic models were computed wherein AUDIT and HADS cut-offs (value > 8 = caseness) were employed in order to examine the relationship between DBTP and DBTP-r with AUDIT and HADS categories. We employed Ferguson’s (2009) effect size recommendations as a guide to interpreting results, where β > .20 is regarded as the minimum for ‘practical significance’.

Results

Given the reliance on an equation that includes empirical or observed mean scores, we only proceeded with further analyses on the basis that Cronbach’s alpha estimates for all five ZTPI dimensions were > .70. Table 1 displays descriptive statistics, and the results of Pearson’s correlations between scores on study measures. Of note, the correlation between the two DBTP scores suggests related but discrete values. However, the two DBTP values correlated with HADS scores to a very similar degree, differing though in the way that they related to AUDIT score. There was no observed significant sex difference between scores on DBTP (MMale = 2.00 [95% CI 1.95, 2.05]; MFemale = 1.98 [95% CI = 1.94, 2.03]), nor DBTP-r (MMale = 3.58 [95% CI 3.51, 3.64]; MFemale = 3.47 [95% CI = 3.40, 3.54]). In order to test the degree to which raw DBTP and DBTP-r scores respectively related to scores on HADS-A, HADS-D, and AUDIT, we empirically tested the magnitude of these independent correlation coefficients (Preacher, 2002). Results showed that the difference in magnitude of the DBTP/DBTP-r relationship with HADS-A was not significant (p = .634), nor was the difference in magnitude of the relationships with HADS-D (p = .630). However, DBTP-r was correlated with AUDIT score to a significantly greater degree than DBTP (p < .001).

Table 1 Bivariate Pearson’s correlations (two-tailed) between study measures

Table 2 displays the results of hierarchical regression models examining the relationship between DBTP versions, and scores on criterion variables, adjusted for age and sex. The total variance explained by the DBTP-r models was a little higher in all cases than that explained by the corresponding DBTP model. DBTP score was not significantly related to AUDIT score, whereas DBTP-r score was, however, not to Ferguson’s ‘practically significant’ (β > .20) threshold. In both models, higher AUDIT score was significantly related to being younger, and being male. Higher HADS-A score was significantly related to a greater DBTP in both models (DBTP and DBTP-r), with the β values both ‘practically significant’ (Ferguson, 2009). Further, higher HADS-A score was also significantly related to being female (p < .05), but only in the DBTP-r model, and with a small effect size. Higher HADS-D score was significantly related to a greater DBTP in both models, with practically significant effect sizes. Additionally, higher HADS-D score was significantly related to being older, in both DBTP models.

Table 2 Results of hierarchical regression models examining the relationship between both DBTP and DBTP-r and criterion variables

In order to further examine these relationships, we re-ran the models adjusting for ZTPI dimension scores. The full output for these models is displayed in Supplementary S1. Results are displayed in Table 3, and show that more variance was explained in these models than that explained in those including DBTP(−r) scores only (Table 2). Adjusting for ZTPI dimension scores, both DBTP models remained significantly associated with HADS-A and HADS-D scores, and largely to a practically significant degree (beta values). Of note, AUDIT scores in both models became non-significant with the inclusion of dimension scores.

Table 3 Results of hierarchical regression models examining the relationship between both DBTP and DBTP-r and criterion variables. Results are adjusted for ZTPI dimension mean scores

Because cut-offs have been established for both the AUDIT and the HADS, we further explored the relationship between DBTP, DBTP-r, and these variables using scale categories. In both cases a score > 8 is considered to be indicative of problematic alcohol use, and problematic level of symptomatology. In all six models (detailed in Supplementary S2 and adjusted for age and sex), neither DBTP nor DBTP-r scores were significantly related to membership of AUDIT or HADS categories.

Discussion

Comparing raw correlational results, and results for both the HADS-A and HADS-D in models adjusted only for sex and age, seems to suggest that the DBTP and DBTP-r function quite similarly, with the caveat that the DBTP-r model explained slightly more overall variance. Results for the AUDIT, however, illustrate that DBTP-r scores discriminated drinking behaviour in a way that the DBTP did not, although the effect size was small. In their literature review of the DBTP, Stolarski et al. (2020) illustrated that DBTP relates to criterion variables to different degrees, and not well overall to alcohol use.

Results in models adjusted for ZTPI dimensions illustrate that for HADS-A and HADS-D, DBTP-r was better at discriminating than the DBTP, with larger standardized beta values and overall amount of variance explained. However, it is somewhat disappointing that neither the DBTP nor DBTP-r was significantly associated with more problematic alcohol use or symptomatology in models examining AUDIT and HADS scores categorically. It would have spoken more to the explanatory power of the DBTP, and in particular to the revised DBTP-r had scores been able to differentiate non-problematic from potentially problematic behaviour.

The results of the present study need to be understood and interpreted in the context of a broader, ongoing debate about the ZTPI, and the use of both the idea of a balanced time perspective and the associated DBTP. In the first instance, the psychometric validity and internal consistency of ZTPI scores continues to be questioned (Davis & Cernas-Ortiz, 2017; Worrell et al., 2018), with multiple sample-specific, shortened versions created in order to achieve acceptable reliability and fit (e.g., Kocayoruk & Simsek, 2020; Sircova et al., 2014; Temple et al., 2019). The internal consistency estimates for ZTPI domains in the present study were all satisfactory, however, McKay and Worrell (2020) have highlighted the fact that internal consistency and/or psychometric validity of ZTPI scores were problematic in many of the manuscripts retained for Stolarski et al. (2020) review.

Following on from this, the temporal psychology field needs to reconsider a previously asked question (McKay et al., 2019), namely, the degree to which both the idea of a balanced time perspective and by extension, deviation from a balanced time perspective, are mere exercises in empiricism. Previous manuscripts have drawn attention to the fact that while Zimbardo and Boyd’s balanced time perspective idea is intuitive, it rarely emerges in, for example, cluster analyses (e.g. McKay et al., 2019). Relatedly, it appears self-evident that an individual reporting no PN, and the highest possible PP score (part of the metrics used in the DBTP-r) would be highly functional. However, in reality, for whom is such a situation representative? It is unlikely that any such person would exist, therefore, if the DBTP/DBTP-r are merely measuring deviation from an unrealistic psychological reality, it is difficult to know how either advances our understanding of human (temporal) psychology greatly.

In conclusion, the DBTP-r did explain variance in drinking behaviour that DBTP scores did not. This is not altogether surprising as, the DBTP-r is essentially an extreme version of the DBTP. By this we mean that essentially optimal mean score on the functional ZTPI dimensions (PP and F) has been pushed to the extreme (mean = 5.00), with the opposite true for the dysfunctional ZTPI dimensions (PN and PF). This is the first study to compare scores using the DBTP with those using the DBTP-r, and results must be interpreted in light of that limitation. In other words, it would not be appropriate to make definitive claims on the basis of results from a single follow-up study. Further studies will be required in order to determine whether these findings are both generalizable and replicable.