Introduction

Substance use disorder (SUD) is the most prevalent form of mental illness [1, 2] and one of the major contributors to the Global Burden of Disease [3]. In this context, optimizing treatment for SUD is one of the highest priorities for translating research to benefit the individual, family, and community at large. Traditionally, SUD treatments focus directly on preventing use, in terms of quantity or frequency, to encourage patients to avoid the substance they are treated for. In recent years, a shift has been emerging towards a widening of addiction treatment objectives, to obtain a global improvement in patient functioning [4]. Among the known factors influencing such outcomes, functional inhibition appears predominant to ensure optimal treatment benefit efficiency [5, 6]. Inhibition is defined as the ability to deliberately inhibit dominant, automatic, or prepotent responses when necessary [7]. Deficits in this function have been hypothesized to be directly linked to enhanced use of the primary substance [8], through underlying cerebral alterations [9] or by invoking moderating effects of other risk factors such as craving [10]. However, most patients treated for SUD do not exhibit a single, substance use pattern and co-occuring use of other substances contributes to the morbidity and mortality of hospitalized patients [11]. This consideration highlights the importance of investigating if and how prognostic factors could protect from the use of any substance in general. As substance use [12], craving [13], and inhibition [14] fluctuate rapidly in time, the conjoint testing of their interaction requires advanced methodologies that merge neuroimaging and intensive real-time, longitudinal assessment of risk factors and executive functioning, not previously addressed.

Inhibition deficits have been depicted as a core component of SUD [15], both as consequences [16] and predispositions [17] of use. Substance misuse can produce cerebral disturbance in executive-related areas, typically frontal systems that could in turn produce an inhibition deficit, thereby leading to the vicious circle of addiction [18,19,20]. The imbalance caused by such deficits would lead to loss of control over use and the dysregulation craving [15, 21]. Conversely, substance abstinence can afford opportunities to enhance inhibitory control [22] and ultimate recovery of compromised neural substrates [23].

The challenge remains regarding how to identify the conditions under which inhibition is associated with treatment failure or success [24]. Use of a drug, legal or not, is known to increase the probability of use of other substances, in all ages and drug-using populations [25]. Indeed polysubstance use drastically interferes with the clinical progress potentially by precipitating relapse in the primary substance or acting as a substitution for the primary drug [26]. Such dynamic influence of use over later use along with their predictors can be assessed in real life settings with a smartphone using Ecological Momentary Assessment (EMA). The intensively repeated nature of such data collection allows investigation of time-lagged predictions of subsance use (i.e., “what comes first?”) and the complex interaction between factors of relapse, notably craving and inhibition [27]. Herein, we aimed to exploit the benefits of EMA coupled with neuroimaging data to uncover the potential of inhibitory control to avert misuse of any substance.

The aim of this EMA study was to investigate direct and indirect effects of inhibition on the use of primary substance and any substance use in patients treated for primary Alcohol, Tobacco and Cannabis Use Disorders. To test for specificity of brain-behavior relations in patients with SUD, we included a sample of healthy subjects to compare executive and rs-fMRI integrity between the groups.

Method

Participants

Participants were volunteers and provided their written informed consent. Financial compensation was provided with a maximum of €100 in store vouchers for the completion of both the EMA and MRI study phases. Participants with SUD were recruited at Centre Hospitalier Charles Perrens where they were beginning regular outpatient treatment for addiction. Hence the entirety of the experimental protocol was conducted after treatment initiation. All patients met DSM-5 criteria for a current primary alcohol, tobacco, or cannabis use disorder and could receive care including pharmacotherapy, individual behavioral treatments (relapse prevention and psychosocial support), or both. Patients exhibiting multiple SUD could be included in the study if one substance (Alcohol, Tobacco or Cannabis) was prioritized during treatment as defined by the patient and the psychiatrist. Substance-related data were assessed using a validated French version of the Addiction Severity Index [28, 29], modified to take into account tobacco addiction [30]. The Interviewer Severity Ratings from the drug (in this study, cannabis), alcohol and tobacco sections of the ASI were used to assess the severity of the addiction in the primary substance. Patients with a history or current bipolar or schizophrenia disorder (Mini International Neuropsychiatric Interview 5.0.0 (MINI [31]) were excluded; patients who presented a current comorbid depressive disorder were included. All participants had to be free from conditions incompatible with the use of a smartphone and MRI scanning.

This study was conducted in compliance with ethical standards depicted in the Helsinki declaration and approved by the ethical committee “Comité de Protection des Personnes de Sud-Ouest et Outre-Mer III» (N° 2014-A01668-39). Sample size for the MRI analyses was estimated using the package pwr available on R [32]. For a linear regression with 2 predictors (inhibition performance and age) with a minimal power of 0.80, an alpha level of 0.05 to detect a medium effect size (R2 = 0.20), we estimated that 41 subject would be needed.

Procedure

The feasibility and validity of the EMA methodology, including mobile cognitive testing (adaptation of Stroop test) were previously demonstrated in patients with SUD [33]. All participants were trained to operate the smartphone (Samsung Galaxy S with a 10.6 cm screen, 12-point font size) assigned for the study before the 7 consecutive days of EMA assessments. They were instructed to carry the smartphone and to respond to five electronic surveys per day randomly occurring in time intervals based on their personalized time of “beginning” and “end” of the day (resulting in approximately 3 hours between each survey). EMA assessments included questionnaires to rate craving intensity on a scale from 1 to 7 and to record use of the primary substance and any substance 5 times per day; in addition, inhibition testing occurred twice per day with the timed incongruent condition of the Stroop task. Participants had to name the ink color of 16 incongruent color names (e.g. blue written in red) as quickly as possible. Verbal responses were recorded via the EMA device in order to estimate the precise completion time in ms, with longer times indicating worse performances.

A typical day of EMA is presented in Fig. 1. A subsample of 40 patients with SUD and 34 control participants underwent neuroimaging to acquire anatomical and rs-fMRI data within the 48 h before day 1 of the EMA phase.

Fig. 1: Typical day of EMA assessments and illustration of potential direct vs indirect effect of inhibition on use.
figure 1

Usual “wake” and “sleep” times were selected by each participant to ensure that they would be able to answer each signal. Signals occurred randomly within 5-time intervals periods and could concern craving and substance use only or could be followed by a Stroop test. Craving was assessed on a scale from 1 to 7, and Substance use was coded in a dichotomous manner (“Yes” or “No”) for the primary substance and for any substance. Stroop refers to the timed ink color naming of an incongruent color-related word (i.e., blue written in yellow ink), with longer time completion indicating a poorer inhibition performance. All variables can be used to predict the same-time assessment level or next-time assessment level (i.e., lagged in time) of another measure. This prediction can test direct effect as indicated by the continuous arrow or indirect effect, such as moderation, as indicated by the dotted arrow.

MRI acquisition

Brain imaging data were collected using a 3.0 Tesla GE MRI system equipped with a 32-channel head coil. Anatomical volumes were acquired using a sagittal 3D T1-weighted sequence (Repetition Time = 8.5 ms, Echo Time = 3.2 ms, flip angle = 11°, FOV = 256 mm × 256 mm, voxel size = 1 mm3, 176 slices). The resting-state functional images were collected using a single-shot echo-planar sequence (RT = 2.2 s, ET = 27 ms, flip angle = 80°, FOV = 192 mm × 192 mm, voxel size = 3 mm × 3 mm × 3.5 mm, 42 axial slices), during which participants were instructed to keep their eyes closed, remain awake, and not think about anything in particular.

Preprocessing

Preprocessing steps included bias field correction, skull stripping, MNI normalization, segmentation, slice timing, motion correction, distortion correction, coregistration, regression of motion parameters, and band pass filtering were conducted using FMRIPREP [34]. Further details can be found in [35]. All MRI were inspected by a radiologist for anatomical anomalies that could interfere with image analysis and resulted in the exclusion of 4 controls and 2 patients with SUD; additionally, 2 patients with SUD and 1 control with movement >3 mm during the rs-fMRI acquisition were excluded from the rs analysis. First level rs analyses including network computation, ROI extraction, and individual connectome calculation were conducted using Nilearn [36] on python 3.9 [37]. Second level connectivity analyses were carried out using the CONN.16 toolbox [38] implemented in SPM12 Software (https://www .fil.ion.ucl.ac.uk/spm/software/spm12/) on MATLAB, version 12.0 (http:// www.mathworks.fr/products/matlab).

Dictionary learning assessments of functional connectivity

Dictionary Learning is a multivariate reduction method allowing decomposition of 4D rs-brain signals into a set of spatially located functional networks. In contrast to an Independent Component Analysis (ICA), decomposition is based on sparsity rather than independence by combining a sparsity-inducing penalty to a reconstruction loss, resulting in better performance, stability, and higher contrast than ICA (for details see [39]). Based on current recommendations in the literature and data driven estimation of number of components derived from MELODIC version 3.04 [40], we extracted 20 components of the signal. Resulting spatial maps were then reduced to 84 regions of interest (ROI) by keeping the more intense nonzero brain voxels across all maps with a threshold of 0.5. Individual connectomes were then computed using Pearson correlations between BOLD signals in all ROIs (for Dictionary Learning analyses, see: https://nilearn.github.io/stable/connectivity/resting_state_networks.html). The MNI coordinates of each ROI’s center were extracted and automatically labeled using label4MRI (retrieved from: https://github.com/yunshiuan/label4MRI) on R [32].

Statistical analyses

Group comparisons

Due to the non-normality of the data distribution, group comparisons for Stroop performance, substance use, and craving were performed using non-parametric tests: Mann-Whitney tests for the effect of group (patients versus controls) and one-way Kruskal–Wallis (non-parametric) ANOVA and post-hoc Dwass-Steel-Critchlow-Fligner (DSCF) pairwise comparisons for the effect of SUD sub-groups (alcohol, tobacco, cannabis).

Hierarchical modeling of craving, Inhibition functioning, and substance use

We used the lmer4 packages available on R [41] to fit the generalized linear mixed-effects model (for binomial outcomes) and Linear Mixed-Effects Models (for continuous outcomes) with random intercepts on the EMA sample (presented in supplementary Table 1). We modeled a time lag in our measures to predict an outcome at time t + 1 from any variables at time t. To avoid contamination of night-time effects, this time lag excluded all predictions of the first assessment of a new day by the last assessment of the previous days. Missing data in either the outcome or the predictor were discarded from the analysis.

Craving level was centered around the participant’s habitual level during the week. To take into account both within and between-person level variations in Stroop performances we used two centering methods [42]. Individual-centered Stroop refers to the momentary performance centered around the participant’s own level during the week, hence depicting momentary increase or decrease in inhibition performance. Group-centered Stroop refers to the momentary performance centered around the SUD group level during the week to further take into account high or low momentary performances. Use of any substance, use of primary substance, and Stroop performances at time t were successively entered as predictors of use of any substance, use of primary substance, and Stroop performances at time t + 1. For description purposes, we also assessed the impact of time defined as the number of assessment occasions on substance use. To test for a moderating effect of the Stroop on the craving/substance use link, we entered the interactions term craving * Stroop performance at time t as the predictor of substance use at time t + 1. To assess potential learning effects, we also tested the time * Stroop performances interaction impact on substance use at time t + 1. Finally, we conducted Causal Mediations Analysis using the mediation package available on R [43] to further characterize the interplay between and among all the significant main effects.

Whole brain ROI to ROI functional connectivity

To assess resting-state networks related to executive functioning in the SUD group of the MRI sample, we employed user-defined contrast in the Conn Toolbox with Stroop time as the between-subject variable and the connectivity strength in the connectome derived from Dictionary Learning as the outcome variable while controlling for age. The results were corrected for multiple comparisons using false discovery rate (FDR) with an alpha level of 0.05.

The strength of connections in the identified networks were then extracted in the two groups (SUD and control) for comparison. Finally, Spearman correlations were used to assess relations between connectivity strength and the sum of substance use occasions during the week for any substance and for the primary substances in the SUD group. These analyses were performed using JAMOVI software version 2.3 (The jamovi project (2022). Retrieved from https://www.jamovi.org). All statistical tests were two-sided and results were considered significant when p ≤ 0.05 was reached.

Results

Sample characteristics

All behavioral analyses were based on the overall EMA sample of 99 participants: 64 individuals with SUD (33 men; mean age 41.66 ± 11.81), including 32 treated for alcohol, 20 for tobacco, and 12 for cannabis use disorder. The overall prevalence of comorbid depression was 17.2%. The control group included 35 participants (18 men; mean age 34 ± 8.22). Supplementary Table 1 presents the descriptive statistics of these groups. The neuroimaging analyses included the MRI sample, consisting of 74 of the 99 EMA participants: 40 SUD patients (20 men; mean age 42.42 ± 11.34), including 20 treated for alcohol, 13 for tobacco and 7 for cannabis use disorders. The overall prevalence of comorbid depression in the SUD group was 12.5%. The control group included 34 participants (18 men; mean age 34.26 ± 8.19). Statistical results for descriptive and group comparisons are presented in Table 1 for the MRI sample.

Table 1 Descriptive statistics of the behavioral and psychological variables of interest in the MRI sample (n = 74).

Association of craving, substance use, and executive functioning in SUD

Statistical results for principal and mediations effects in the SUD group are presented in Table 2a–c. Predictions include the two operationalization of Stroop performance, group-centered and individual-centered.

Table. 2 Results of the substance use as outcomes models in the EMA sample.

Prediction of substance use at t + 1 by use at time t (Table 2a)

Use of any substance at a given time t significantly predicted later use of both any or primary substance. Use of primary substance only predicted later use of primary substances but not the use of any substances.

Prediction of use at t + 1 by Stroop at time t (Table 2a)

Increase in Individual-centered Stroop performance at time t significantly predicted a lower probability of the use of any substance at time t + 1. Better Group-centered Stroop performance at time t significantly predicted the use of the primary substance at time t + 1 but not the use any substance.

Prediction of use at t + 1 by craving at time t (Table 2a)

Increase in craving at time t significantly predicted a higher probability of the use of any substance and of the primary substance at time t + 1.

Prediction of Stroop scores at t + 1 by use at time t (Table 2b)

Decrease in Individual-centered Stroop performance at time t + 1 was significantly predicted by use of the primary substance at time t but not by use of any substance. Worst Group-centered Stroop performance at time t + 1 was significantly predicted by both the use of the treated and any substance at time t.

Interaction of Stroop * craving on substance use (Table 2a)

The Stroop * craving interaction at time t did not predict significantly the use of any or primary substance at time t + 1.

Interaction of Stroop * time on substance use (learning effect) (Table 2a)

Our results indicated no significant effect on Stroop * time interaction at time t and use of any or primary substance at time t + 1.

Mediation of Stroop between use at time t and use at time t + 1 (Table 2c)

Group-centered Stroop performance at time t significantly mediated the relationship between use of primary substance at time t and use of primary substance at time t + 1 (see Supplementary Figure 2).

Neuroimaging results

Dictionary learning networks and ROI extraction

We computed 20 Dictionary Learning components, representing resting state networks. Graphical description components and their corresponding variance explained appear in supplementary Figure 1. From these components, 84 regions of interest were extracted. Description of each ROI including their center MNI coordinates and labels according to the AAL atlas are in Supplementary Table 2.

Stroop time-related connectivity in the SUD group

The mean Stroop time across all tests correlated with the strength of connectivity within 7 pairs of areas in the SUD group while controlling for age (see Fig. 2). Longer Stroop times correlated with greater strength of connectivity between the right angular and right superior occipital area. Further, longer Stroop times correlated with weaker connectivity between the right middle occipital area and five right unilateral and one midline region: the interior parietal, middle orbital frontal, middle frontal, angular, cerebellar lobule IX, and vermis 10.

Fig. 2: Rs connections linked to mean stroop time and amount of substance use during the week.
figure 2

Cereb Cerebellum, Occ. Occipital, Par. Parietal, Mid. Middle, Sup. Superior, Inf. Inferior, Orb. Orbital, R Right. Regression analyses were conducted between the individual connectomes and mean Stroop times from each participant while controlling for age and correcting for false discovery rate (FDR) with alpha level of 0.05. Resulting connections are highlighted in the above figure and colored depending on their association with Stroop times, positively (highlighted in green) or negatively (highlighted in red). The strength of connectivity within these connections were entered in a correlation matrix along with the number of any or primary substance use occasions. Significant correlations with use are highlighted by the node colored in red (positive correlation) and blue (for negative correlation).

Associations between rs-MR functional networks and EMA variables

No significant difference was found between the SUD and the control groups in connectivity strength in the 7 above-mentioned connections (See Table 3). Moreover, significant negative correlations were found between the number of any substance use occasions during the week and connectivity strength between the right middle occipital and cerebellar vermis 10 areas and the middle frontal and right orbital frontal areas in the SUD group. Finally, a greater number of primary substance use occasions correlated with stronger connectivity between the right middle occipital and right inferior parietal areas and are presented in Table 4 and Fig. 2.

Table 3 Non parametric group comparisons of rs networks connectivity strength between the SUD and the control group.
Table 4 Correlations between number of substances use occasions within the week and rs networks connectivity in the SUD group.

Discussion

To the best of our knowledge, we characterized for the first time the time course of daily use in interaction with inhibition function in patients under treatment for primary alcohol, tobacco, and cannabis use disorders. Not only did use exhibit a different pattern regarding primary or any substance, but the contribution of inhibition in averting use also differed for both outcomes, whereby the protective role was beneficial only on the primary drug during treatment. Although this effect was not explained by an interaction with craving, it was partially sustained by rs-connectivity strength in frontal, occipital, parietal, and cerebellar areas, itself being linked to the frequency of use.

During a typical day of EMA, the use of any substance predicted increased probabilities of use of both any and the primary substance during the following hour. Good inhibition performance, indicated by the performance at the time of assessment in comparison to the group, neither interacted with this pattern nor conferred protection from the use of any substance. Conversely, the use of the primary substance predicted only an increased probability of use of this same substance during the following hours and was impacted by better momentary inhibition performance than the group that directly predicted lower probabilities of later use for the primary substance. Moreover, differences in inhibitory control mediated this pattern of use, where better inhibitory control enabled one to resist the urge to use the same substance again, and where compromised inhibitory control precipitated substance use. This study is not the first to demonstrate higher risk use of a primary substance after having used another substance [44, 45] but suggests that once the primary substance is used, patients tend to use this substance again rather than a different one.

The influence of the inhibition performance on use was also dependent of previous use. A sudden drop in inhibition performance, as indicated by the performance of this assessment compared to the patient’s own mean, predicted a lower probability of later use of any substance. Whereas poorer inhibition performance as predicting a lower probability of later use could seem surprising, it was predicted by the use of primary substance within the previous hours. This study thus replicated the already described deleterious effect of use on later inhibition performance, for alcohol [46], cannabis [47], although such effects in tobacco use are an ongoing debate [48].

Concerning how inhibition integrity protects one from relapse, our results supported certain hypotheses regarding the protective effect of brain integrity. Our brain imaging analysis revealed that better inhibition performance was sustained by specific connections that did not significantly differ between patients with SUD and controls. Better Stroop performance was linked to stronger connectivity between the occipital cortex and parietal, frontal, and cerebellar areas and to weaker connectivity between the right angular and the right occipital superior areas. Such brain regions have been linked to inhibition functions during the Stroop interference task in healthy populations with activations studies citing the involvement of the right middle occipital gyrus, right middle and orbital frontal areas, right parietal inferior areas [49], angular gyrus [50] and the cerebellum [51]. Meta-analytic studies applying conjunction analysis to executive functioning tasks emphasize the involvement of these areas in a more global, superordinate network that would support cognitive control [52]. Specific regional engagement within this network could vary, however, to adapt to task-specific demands allowing for stronger frontal connectivity with sensory or motor regions, such as the cerebellum, to support response selection and inhibition [53]. As consequences, the observed protective effect of inhibition could be mediated by cerebral integrity within those networks as previously suggested [9, 54,55,56].

Inhibition performance could also be interpreted as protecting one from substance use by moderating the effect of craving. In support of this interpretation, we previously demonstrated that executive-related brain areas were involved in the regulation of craving dynamics over time [13]. Even if both craving and inhibition independently predicted use, our current study failed to observe significant interactions between craving and inhibition performance, thus precluding endorsement of this interpretation.

Although patients with SUD performed worse at the mobile Stroop task than the control group, the highest functioning patients with SUD compared to their own group exhibited the lowest tendencies to use their primary substance, better ability to resist even after an episode of use, and the highest functional connectivity integrity in Stroop-related networks. These protective effects were closely linked to the participants’ preferred substance. These results could be interpreted in light of the Limited Resource theory of self-control that posits that the high cognitive demands imposed by self-control over impulses (such as trying to refrain from using a substance) can lead to a depletion state, namely ‘ego depletion’, of this resource [57] and thus in the case of SUD higher risk of substance use [14]. It would be possible that in a similar manner, patients direct high self-control towards the substance they are supposed to avoid, leading to lower allocations of the inhibition component toward the avoidance of other substances. In line with recent recommendations, our work emphasizes the potential of targeting inhibition in populations treated for different types of SUD [21] but raises important considerations regarding its limits as they apply to polysubstance use.

Limitation

Several limits should be considered when interpreting the results of the present study. Firstly, because our research focused on the dissociation between the primary substance during treatments versus any substances, we did not assess the availability of the primary drug in the immediate environment. It is thus possible that the use of any substance could reflect momentary preference or more simply unavailability of the primary drug. Secondly, craving assessments were not specific to the primary substance but general, which could impact the interaction between craving and other risk factors. Thirdly, should be noted that for the feasibility of EMA settings, the Stroop-like mobile test only includes the incongruent condition without the baseline control condition. While this standard in EMA settings has already been linked to accurate assessments of executive functioning in patients with SUD [33, 35], potential effects of visuo-motor speed confounds may be present in these data. Finally, whereas we focused our article on inhibition, others cognitive components can interplay during the Stroop test. Indeed, attentional components, particularly attention to the relevant stimuli, could also play an important role in our results. Further studies could be conducted to uncover the unique and joint contribution of diverse executive and attention processes in use.

Conclusion

Inhibitory control is a potentially powerful target of intervention, enabling the opportunity to break the precipitating relapse cycle that is characteristic of SUD treatment attempts. This beneficial effect, however, may be restricted to the primary drug during treatment, whereas polysubstance use loom as a critical risk factor in clinical settings.